Your browser cookies must be enabled in order to apply for this job. Please contact if you need further instruction on how to do that.

Senior Data Engineer

Engineering | San Francisco, CA | Full Time

Job Description


Sr. Data Engineer

About Us:

Tapjoy is a mobile value exchange platform, driving personalized app discovery for consumers, customer acquisition and engagement for app and brand advertisers, and rich monetization for innovative developers.  The Tapjoy network spans over 20,000 apps and 800 million global consumers on iOS, Android and Windows Phone.  Tapjoy is headquartered in San Francisco with offices in 15 cities across the U.S., Asia and Europe.  Investors include InterWest Partners, North Bridge Venture Partners, D.E. Shaw Ventures, Rho Ventures and J.P. Morgan Asset Management.


Technologies We Use:

Our philosophy is to “get stuff done”, and the tools we use reflect that. We use Agile development practices, iterating quickly on features and deploying them as soon as they are ready, often multiple times per day. We leverage many open source tools and cloud-based solutions, including:

  • Ruby on Rails (Apache/Passenger)
  • Amazon Web Services
    • EC2
    • RDS (MySQL)
    • SimpleDB (NoSQL)
    • Elasticache (Memcached)
    • S3
    • ELB/Auto-scaling
    • SQS
    • Cloudfront
  • Hadoop/Hive/Hue
  • Mahout
  • Vertica
  • Git/Github
  • Haml/jQuery/jQTouch/Webkit Transitions
  • Syslog-ng


Sr. Data Engineer

Tapjoy, look for a proactive and outgoing Data/ETL software engineer to work closely with  our data science group on a plethora of interesting Big Data projects harnessing data from 600  Million+ mobile devices. We need someone who has worked for 3+ years in very high volume environments, with very large data sets in a fast paced environment. Tapjoy is Big Data driven!

Role and Responsibilities:

ETL data warehousing software developer


  • Familiar with Hadoop/Hive (don't need be an expert)
  • Scripting language: Ruby, Perl, Python or Shell Scripting
  • Proficient with SQL, write queries with performance and tuning on mind
  • Worked on ETL project before – we are processing billions of raw events daily, knowing how to scale is a big plus
  • Data instrumentation/capture/tracking experience a big plus

In addition to the above hard skills, the successful candidate will clearly demonstrate the ability to work independently as part of a team that makes use of an iterative development approach. The candidate must thrive in a rapidly changing environment and take initiative to ensure that projects succeed. This is data engineer that is driven by data processing within an ever changing landscape that involves working with scaling systems using MapReduce.