Skip to main content

Data engineer

It's our mission to unleash the potential in every team. Join the OpenAIRE culture, where everyone of us has a mission to create, to build, to deliver, to advance oneself, Open Science and OpenAIRE. And have fun while doing it.
Apply Now
Pisa, Athens or remote in Europe
Working Time
Full time


We are looking for a developer with the profile of a Data Engineer to join our team. You will use various methods to transform raw data into useful data systems. You should have strong analytical skills and the ability to combine data from different sources. If you are detail-oriented, with excellent organizational skills and experience in this field, we’d like to hear from you. 

About the job

OpenAIRE AMKE is a non-profit company operating as a service provider for the European research setting for at least 10 years. OpenAIRE is seeking a Java Developer with experience in building high-performing, scalable, enterprise-grade applications.

The OpenAIRE service infrastructure sits on a big data cluster where more than 500Mi records and close to 15Mi full-texts are collected, processed, and made accessible relying on cutting-edge data wrangling technologies and methods to build a knowledge graph known as the OpenAIRE Research Graph. Data engineering challenges are to be tackled at all levels of the Graph data flow:

  • Collecting, validating, and storing metadata records and related files of different formats from thousands data sources;
  • Deduplicating and interlinking metadata records;
  • Enabling platforms to support full-text mining and deep learning techniques;
  • Building enabling technologies to support the analysis of big data graphs;
  • Scalable subscription and notification platforms to distribute Graph content to interested sources;
  • Supporting efficient and user targeted search and discovery technologies.
Job Overview
  • You will become a member of the OpenAIRE technical team and will lead/implement activities which will
    • Analyze and organize raw data 
    • Build data systems and pipelines
    • Evaluate business needs and objectives
    • Prepare data for prescriptive and predictive modeling
    • Combine raw information from different sources
    • Explore ways to enhance data quality and reliability
    • Identify opportunities for data acquisition
    • Develop analytical tools and programs
  • Work closely with the CTO and other team members on several projects
  • Potentially represent OpenAIRE in European project technical meetings.
Responsibilities will include
  • Contribute to all phases of the development lifecycle
  • Write well designed, testable, efficient code
  • Ensure designs are in compliance with specifications
  • Prepare and produce releases of software components
  • Support continuous improvement by investigating alternatives and technologies and presenting these for architectural review

Minimum requirements

  • Proven hands-on Software Development experience (at least 5 years)
  • Proven working experience in Java development
  • Hands-on experience in designing and developing applications using Java EE platforms
  • Object-Oriented Analysis and design using common design patterns.
  • Good understanding of Relational Databases, SQL and ORM technologies (JPA2, Hibernate)
  • Experience with test-driven development
  • Proven experience with management of software projects using Maven and Version Control System (e.g., Git)
  • Good knowledge of the architecture/commands of Unix-like systems
  • Good knowledge of TCP/IP network (TCP, UDP, IPv4, IPv6, DNS)
Qualifications, Skills and experience
  • Experience in the implementation of algorithms on top of the Apache Spark framework - required
  • Experience in modeling and use of databases on NoSQL systems (e.g. Mongodb, HBase), and on systems for information retrieval (Solr, ElasticSearch) - required
  • Experience in using and applying
    • Spring & SpringBoot Framework
    • IoC Paradigm (e.g. Spring) 
    • MVC paradigm and framework to support web programming (e.g. Angular.js, Bootstrap, or similar) 
    • XML database (e.g. existDB)
  • Previous experience and consolidated skills in Java and at least one other language (e.g. C / C ++ / C #, Perl, Python, Go)
    • Web programming (html, javascript, css)
    • Web services (SOAP, REST), web applications, and application containers (e.g. Tomcat, Jetty, Docker)
    • The architecture/commands of Unix-like systems
    • SQL language to create and query databases
    • Management of software projects with Maven and Version Control System (e.g., Git)
  • Fluent written and spoken English language
  • Excellent team-work skills
Favored qualifications
  • Foundation elements of distributed and parallel programming
  • Knowledge of XML / XPATH / XSLT / XQUERY
  • Experience in the Spring Framework
  • Modeling and use of databases on NoSQL systems (e.g. Mongodb), and on systems for information retrieval (Solr, ElasticSearch)
  • Big data processing solutions (Hadoop HDFS, Hive, GraphX, etc.)
  • Workflow management using 

Terms of employment

  • The position is offered for a period of two-years as an associate  (in consultancy terms). It is renewable upon satisfactory performance.
  • This will be a full-time or part-time position. 
  • Even though OpenAIRE offices are located in Athens, selected candidates will join forces with the OpenAIRE team located in Pisa, at Consiglio Nazionale delle Ricerche, or in Athens, at Athena Research Center, but can carry out their activities mainly from remote.
  • Candidates should be available for travelling to meet with the technical team in Europe, up to 6 times per year, on request of the technical coordination (paid travel). 
  • Depending on skills, experience and country of residence, gross salary for a full-time position will be in the range of 50.000 EUR to 76.000 EUR per year. The salary is intended to be gross, i.e. taxes, insurance, etc. are included, as well as expenses to travel to Pisa/Athens if working remotely.
Dates and deadlines
  • Job submission deadline:  July 25, 2022.
  • Expected starting date: August/Sept 2022

How to apply

If you're interested in applying for this position, submit your application and cover letter explaining your motivations to apply and how you meet the requirements outlined above to amke [at]
Please note that we do not issue or fund a working VISA for Europe.

About OpenAIRE

OpenAIRE is a non-profit organisation (working from 2008, but establlished as a legal entity in 2018 with 47 members from 34 countries) operating an open scholalry communication infrastructure in Europe. Our services are used by resarchers and actors in the R&I ecosystem around the world.

OpenAIRE is based in Athens but operates a virtual office of 10 people, a network of 34 Open Science experts placed in every European country, and a technical team consisting of 30+ developers and engineers in Pisa (Italy), Athens (Greece), Warsaw (Poland), Bielefeld (Germany), Geneva (Switzerland), Minho (Portugal).

An equal opportunity employer

Here at OpenAIRE we believe in equality & diversity. OpenAIRE ensures equal opportunities, treatment and access to all candidates regardless of their sex, race, colour, ethnic or social origin, genetic features, language, religion or belief, political or any other opinion, membership of a national minority, property, birth, disability, age or sexual orientation.

Ioanna Giannaki

Our office manager will acknowledge the receipt of your application, will forward any questions you might have to the responsible team member, and will ensure your application is processed within the advertised deadlines. 

Apply Now