Language Engineer - Native in Turkish

  • On-site
    • Barcelona, Catalunya [Cataluña], Spain

Job description

At M47, we spark AI!

We are looking for a Technical Language Engineer who sits at the intersection of Software Development and Linguistics. This is a highly technical role designed for someone with a Computer Science mindset who wants to apply their coding skills to Natural Language Processing (NLP).

Unlike traditional linguistic roles, you will not be translating content manually. Instead, you will use your technical skills to automate processes, script solutions for large datasets, and debug the interaction between linguistic data and AI models. You will work alongside Software Engineers and ML Researchers to ensure our models understand Turkish with native-level fluency.

About your day-to-day:

  • Scripting & Automation: Design and write Python scripts from scratch to process, clean, and validate massive text datasets (JSON, XML, CSV).

  • Algorithmic Analysis: Investigate model failures by analyzing logs and data patterns, using code to identify root causes rather than manual review.

  • Regular Expressions (Regex): Write complex Regex rules to automatically detect and fix linguistic errors across thousands of lines of data.

  • Pipeline Debugging: Troubleshoot technical issues during data ingestion and export, working with the core engineering team to resolve encoding or formatting errors.

  • Tooling Support: Provide technical feedback to improve internal annotation tools and automate repetitive tasks using shell scripts or simple bots.

  • Version Control: Manage datasets and codebase changes using Git (branching, committing, and merging).

Job requirements

This is for you if you have:

  • BS/MS in Computer Science, Software Engineering, or a related technical field.

  • Native level of Turkish (Spoken and Written) with excellent grammatical knowledge.

  • Strong proficiency in Python. You must be able to complete coding tasks independently, without heavy reliance on LLM tools.

  • Comfort working in a Unix-based environment (Bash, grep, piping commands).

  • A preference for writing a script to fix a problem once rather than fixing it manually 100 times.

  • Professional fluency in English (C1/C2).

Nice to have:

  • Practical experience with NLP libraries such as spaCy, NLTK, or Hugging Face.

  • Experience with SQL for querying datasets.

  • Familiarity with cloud environments (AWS, Azure).

  • Previous experience in Software QA or Test Automation.

What is in it for you?

💪🏽 Indefinite full-time contract

☀️ Office located at the heart of Barcelona

💸 Comprehensive compensation package, including private medical insurance coverage and flexible remuneration through Cobee, including meals, gym pass, transport, and kindergarten.

📚 Learning budget to support your career ambitions.

🏃 Access to Urban Sports (wellness app)

📄 TaxDown to cover your tax declaration

🌍 Great international, inclusive, and dynamic work environment (more than 20 nationalities!)

For this position, you must hold a valid work permit for Spain (or an EU passport) and be available to work full-time on-site.

**M47 Labs not only encourages but actively works to empower its diverse and inclusive talent. M47 Labs is committed to ensuring a non-discriminatory workplace, work life, and selection process; such decisions will not be influenced by race, color, religion, gender identity or expression, sexual orientation, disability, social or marital status, age, or other applicable characteristics. M47 Labs prohibits discrimination and harassment of any kind, and all employment is decided on the basis of qualifications, merit, and business needs.**

In accordance with the provisions of Regulation (EU) 2016/679 of 27 April (GDPR) and the Organic Law 3/2018 of 5 December (LOPDGDD), we inform you that personal data and email addresses collected from the Data Subject will be processed under the responsibility of M47 LABS & INTERNATIONAL FIDUCIA SL for a legitimate interest and for the purpose of sending communications about our products and services and will be retained for as long as none of the parties object. The data will not be communicated to third parties, unless under legal obligation. You can exercise your rights of access, rectification, portability and erasure of your data and those of restriction and objection to their processing by contacting DIPUTACIÓ, 279 3 6 - 08007 BARCELONA (Barcelona). E-mail: info@m47labs.com. If you consider that the processing does not comply with current legislation, you may file a complaint with the Spanish supervisory authority at www.aepd.es.
