Big Data Engineer Job

Date: Jan 20, 2018

Location: Prague, CZ

Requisition ID: ENG003044

MSD is a global health care leader with a diversified portfolio of prescription medicines, vaccines and animal health products. Today, we are building a new kind of healthcare company – one that is ready to help create a healthier future for all of us.

Create awesome digital products and enjoy a reward that technology careers don’t often bring: the satisfaction of helping to save lives. As part of the Big Data Platform team, you will contribute to the MSD Big Data Lake and cooperate with other teams to tackle the biggest opportunities at the intersection of healthcare, information and technology. You will be in charge of the development, deployment and maintenance of Big Data clusters: applying best practices, executing releases, providing infrastructure as code and monitoring Hadoop clusters.

We are looking for a full-time team member who is willing to adopt new technologies and is interested in automation and continuous delivery excellence. You will be working with an international team located in the Prague office FIVE.


  • Implementation and ongoing administration of Hadoop infrastructure
  • Cluster maintenance including creation and removal of nodes
  • Performance tuning of Hadoop clusters
  • Monitoring of Hadoop cluster connectivity and security
  • Hadoop services support and maintenance - HDFS, Hive, HBase and Kafka
  • Software patches and upgrades
  • Automation of manual tasks using Ansible
  • Collaborating with application teams to install operating system and Hadoop updates, patches and version upgrades
  • Deploying Hadoop clusters, adding and removing nodes, tracking jobs, monitoring critical parts of the cluster and configuring high availability
  • Researching and recommending technical and operational improvements to increase reliability and efficiency


  • Strong experience with UNIX/Linux-based systems and scripting (Bash or Python)
  • Knowledge of the Hadoop ecosystem - YARN, MapReduce, HDFS, HBase, ZooKeeper, Kafka, Spark, Hive
  • Experience with configuration management tools such as Ansible, Puppet, Chef or Salt
  • Knowledge of directory services such as LDAP and ADS
  • Knowledge of monitoring tools such as Nagios or Icinga2
  • Distributed systems troubleshooting skills
  • Ability to communicate in English


  • Experience with configuring security in Hadoop using Kerberos or PAM
  • Experience with cloud services such as AWS
  • Experience troubleshooting Java applications
  • Experience with agile development

Job: Engineering, Development & Integration
Other Locations:
Employee Status: Regular
Travel: Yes, 10% of the time
Number of Openings:
Shift (if applicable):
Hazardous Materials:
Company Trade Name: MSD

Job Segment: Database, Engineer, Java, Cloud, Technology, Engineering, Research