Cloud Engineer

Information Technology
in Orlando
, FL
Reference: 19-04479


Are you a MLOps / DevOps guru who is passionate for creating solutions using cutting-edge technologies? Do you excel under pressure? Can you summarize highly complex problems so that others can help you solve them? Do you have rock solid integrity and are the team members we can trust and count on? Does everyone turn to you for answers to tough questions? Do you instrument everything, collect data about data, and look for the correlation in everything? If you smile in the face of pressure, can work independently, but are also great team player, we’re looking for you!
In this role, you will be managing very large-scale, highly-available Hadoop infrastructure supporting many-terabytes of data and growing very rapidly.


  • Deploy and manage very large-scale, highly-available Hadoop infrastructure supporting many-terabytes of data and growing very rapidly
  • Handle design, architecture, instrumentation and operations of a massive data platform
  • Deploy and manage of AWS or GCP based services and infrastructure
  • Work closely with data scientists to deploy highly scalable inferencing services using Client and CV models on GPU-enabled clusters / Kubernetes
  • Develop scalable CI/CD pipelines for deploying Machine Learning and Computer Vision Models
  • Develop, deploy and manage highly scalable services for data ingest and real-time data pipelines for video feeds
  • Implement, troubleshoot, and optimize distributed solutions based on modern big data technologies like Hive, Hadoop, Spark, Elastic Search, Kafka, etc. to solve large scale processing problems in a Cloud cluster environment
  • Solve challenging operational problems across a broad range operations and services
  • Lead innovation by exploring, investigating, instrumenting, recommending, benchmarking and implementing data centric technology solutions for the platform
  • Develop processes, optimization methods, and risk management strategies
  • Day-to-day operational support

Basic Qualification:

  • Bachelor’s degree or equivalent experience
  • 8 to 12 years of progressively related experience in systems engineering and/or software development
  • 1 to 3 years of project or team lead experience
  • BS or MS in Computer Science, Information Technology, or similar field or related work experience
  • Production experience with Dev Ops Tools like: Terraform, Packer, Chef, Docker, Puppet
  • Experienced in Client Ops
  • Deploying deep learning models on GPU-enabled Kubernetes clusters
  • RHEL, Ubuntu, Centos, SUSE Linux configuration, deployment, and troubleshooting
  • Experience in creating IaaS and PaaS with data security, availability, resiliency along with strong monitoring through an operational mindset
  • Knowledge and experience troubleshooting Load Balancers, Network configuration, Firewalls to help collaborate with other Systems and Network Engineering Teams for Data Management Platforms
  • Big Data Hadoop technologies (HDFS, HBase, ElasticSearch/Solr, Spark, NIFI, Hive, Kerberos, Druid, Hue, ORC, Kafka, Redis)
  • Experience working on any public cloud provider – AWS, Azure, Google Cloud
  • Database knowledge of SQL (Oracle, SQL, MariaDB) and NoSQL (DynamoDB, HBase)
  • Understanding of REST, micro-services with Multi-cloud distributed architecture, support and planning utilizing enterprise Java, Python
  • Demonstrated interest and ability to quickly learn new technologies and put them into production
  • Natural influencer, and selfless team player, willing to help your team achieve great things
  • Excellent problem-solving skills; proven technical leadership and communication skills
  • Willingness to participate in a 24×7 on-call rotation for escalation
  • Have a quality mindset, squash bugs with a passion, while working hard to prevent them in the first place through unit testing, test-driven development, version control, continuous integration and deployment

Nice To Haves:

  • AWS Certifications, Google Certifications, Azure Certifications
  • Contribution to open-source projects