Your browser cookies must be enabled in order to apply for this job. Please contact if you need further instruction on how to do that.

Junior Data Scientist

Data Science | Rosslyn, VA | Full Time

Job Description

Junior Data Scientist

BitVoyant produces business and network intelligence, enabling companies to intimately understand their interactions and risks in real-time and at Internet scale.  We’re growing our team to create, enrich, and mine this intelligence for insights that matter to our customers.

As a Data Scientist (Big Data), you will apply cutting-edge statistical and mathematical methods to the collection, correlation and analysis of large structured and unstructured data sets from a broad, ever-evolving set of sources. The successful applicant will apply industry knowledge, contextual understanding, and innovative insights into the development of novel analytical techniques and tools in response to urgent, complex questions.  The ideal candidate will:


·       Provide valuable insights into new data sets quickly

·       Mine large amounts of data for insights with both automated and ad-hoc queries

·       Communicate complex findings and ideas in plain language that is friendly to business and operational audiences

·       Problem solve – turn requirements into solutions

·       Collaborate with team members with varying technical and non-technical backgrounds towards a shared goal 

·       Rapidly research and iterate on analysis methods by finding elegant, reliable and understandable techniques to build analytics that matter

·       Create technical assessments and develop custom approaches in response to time-critical urgent customer needs

·       Employ sophisticated analytics programs, machine learning, and statistical methods to prepare data for use in predictive and prescriptive             modeling 

·       Examine new and more convincing methods for data reporting, visualization & presentation

·       Develop new approaches to apply large-scale computing technology to solve customer problems

·       Be involved in all stages of analytic development from idea formation through prototyping, automation, and productizing

·       Maintain up-to-date knowledge of technology standards, industry trends, emerging techniques, and analytic best practices

·       Keep informed of evolution and impact of adversary tactics and strategies

·       Ensure analytical issues are quickly resolved and help implement strategies and solutions to reduce the likelihood of reoccurrence

Requirements - Our ideal candidate will meet most but not necessarily all of the following requirements:

US Citizenship or authorization to work in the US

Possess TS/SCI security clearance

Bachelor’s degree or higher in Computer Science, Physics, Mathematics, Statistics or equivalent

Possess solid understanding of Cyber Security terminology, concepts and technology

Possess solid understanding of Internet communication protocol

 Must have at least 1 to 2 years demonstrated experience with

  • Hadoop, Elasticsearch


  • Python

  • JSON data formats, Parquet files

  • Structured and Unstructured data

  • Supervised and Unsupervised Machine Learning techniques

Desired skills  

  • Big Data Platforms and tools (e.g. Cloudera, Hortonworks, MapR, Hadoop, Pig, Hive, etc.)

  • Impala, and SQL – for queries

  •  NoSQL databases (ex. HBase, MongoDB,ArangoDB, Neo4J) – for behavioral analysis

  • C/C++/C#, Scala, PySpark, Java, R, R Shiny, or other comparable programming language

  • Data formats – ex. JSON, flat files, Parquet, ORC files, Avro

  • Extract-Transform-Load (ETL) processes

  • Statistical data analysis packages (SPSS, R, etc.)

  • Data visualization tools (ex. Tableau, Qlik, IPython, etc.)

  • Comfortable with rapid prototyping to solve immediate problems

  • Super user level expertise in Linux / Unix, Mac, and Windows operating systems.

  • Additional Machine Learning techniques, ex. Auto Encoders, Deep Learning, Hierarchical Clustering, Outlier and Anomaly Detection, etc.

Ability to process, filter, and establish the utility of data through various analytics techniques

Demonstrated creativity, foresight, and mature judgment in answering complex and difficult analytical questions

Strong written and verbal communication skills