RVA's leading Data Conference Returns March 2022.

Join the technology community March 2022, for our next conference full of innovation, in-depth workshops, and incredible presentations given by a diverse line-up of expert speakers + thought leaders at rvatech/DataSummit. Get immersed in the world of Data & discover the trends, and innovations shaping the future of technology!

Register Now

About the Event

Tech is continuously evolving and innovating, making it essential to stay ahead of the curve and explore the year’s trending technology. In 2022, artificial intelligence and data science are continuing to move from the realm of university research to be a critical part of a software developers toolkit and a key differentiator for forward-leaning companies.


Schedule

9:30 AM – 5:oo PM


Student Tickets

Discount Rate – The special discounted rate for student tickets is 25% off. That’s $50 off the regular ticket price!

 

Discount code: DATA22STUDENT

 

**All student tickets must provide a student .edu email address when registering**

Conference Registration

Purchase Your Tickets


Venue

Science Museum of Virginia – Dewey Gottwald Center

 

2301 W Leigh Street Richmond, VA 23220


Common Questions


Where exactly am I going?

The Dewey Gottwald Center is located behind the museum. Parking lot entrance located off of West Leigh Street or DMV Drive.

Where do I park?

Science Museum parking is ample and free. Spaces near the front of the building are designated for guests with mobility challenges. All other guests should park in the lot located directly behind the Science Museum off West Leigh Street or DMV Drive.


You may also view a detailed parking map here.

Meet the Keynotes

Dr. Scott Penberthy

 

Director, Applied AI, Office of the CTO

at Google


Session: Thinking like Newton

Dr. Scott will share his perspective and personal experience on the journey from being a kid playing baseball and writing code in Midlothian Va, to the rise of AI and Machine Learning, its use in business and most recently the discovery of the source code to life. He’ll show how simple curiosities can lead to grand discoveries, often by making naive observations and asking what at first appear to be stupid questions. These are now leading to the vaccination of millions against terrible disease, the prevention of cancer, and soon perhaps a cure for gray hair.

Dr. Timothy Haas

 

Professor of Statistics and Wildlife Conservation

Author at University of Wisconsin


Session: How Firms can Apply Data Science to Save Species

Ecosystem loss, extinctions and climate change are ongoing challenges to life on Earth, and coming up with a plan to tackle their effects requires an accurate picture of what’s happening where, and who is involved.

Prof Tim Haas, University of Wisconsin Milwaukee, has taught and refined such models for years. In his latest paper, he lays out the case for a model unifying human behaviour, climate and ecosystem data, the computational power required to run it, and the credibility criteria any model should meet to prove its worth.

Conference Agenda

9:30 - 10 AM
Registration + Morning Networking

10 - 10:50 AM | Keynote - Dr. Timothy Haas
"How Firms can Apply Data Science to Save Species"

Speaker: Dr. Timothy Haas – Univ. of Wisconsin

 

Ecosystem loss, extinctions and climate change are ongoing challenges to life on Earth, and coming up with a plan to tackle their effects requires an accurate picture of what’s happening where, and who is involved.

 

Prof Tim Haas, University of Wisconsin Milwaukee, has taught and refined such models for years. In his latest paper, he lays out the case for a model unifying human behaviour, climate and ecosystem data, the computational power required to run it, and the credibility criteria any model should meet to prove its worth.

10:50 - 11:20 AM | Keynote - Ovetta Sampson
V.P Machine Learning Experience Design & Responsible AI at Capital One

11:25 AM - 12:05 PM | Breakout Session #1
"Machine Learning Orchestration with Airflow"
Data Ethics Panel

Session: Machine Learning Orchestration with Airflow

 

Speaker: Dr. Santona Tuli – Astronomer

 

While more popular as a ETL and data engineering tool, Airflow is also extremely powerful at orchestrating machine learning pipelines. In this talk, I show how you can write complete end-to-end pipelines starting with retrieving raw data to serving ML predictions to end-users, entirely in Airflow.

 

A plethora of tools have flooded the field of data science, many of which have overlapping functionality. As a data scientist‚ especially as one from a traditionally disenfranchised background with limited resources‚ selecting and learning the right tools in order to maximize one’s competitiveness in the job market, can be intimidating and, possibly, challenging. In this talk, I will walk through an end-to-end data science workflow using only easily accessible open-source software (OSS). Many of these OSS in fact form the basis of the enterprise products that companies purchase as tools for their data professionals to use. Knowing the underlying software can give data scientists the competitive advantage of being tool-agnostic and quick at picking up new tools in an ever-evolving landscape. We will use OSS for data featurization, machine learning model training and serving as well as validation and visualization of results.


 

Session: Panel on Data Ethics

 

Speakers: 

Abigail Byram – Data Engineer at SingleStone Consulting

Jodi Kuhn – Director, Office of Data Quality and Visualization at Commonwealth of Virginia

Zachary Knitter – Data Engineer, Office of Data Quality and Visualization at Commonwealth of Virginia

12:05 - 1 PM
Lunch Break + Networking
Live Data Privacy Podcast Session

Lunch provided by Groovin’ Gourmet 

 

Live Podcast on Data Privacy and Governance

 

Featured Guests:

Marcus Thornton, Deputy Chief Data Officer – Virginia Office of Data Governance and Analytics

Souj Narasimhacharya, Senior Vice President of Engineering – Koalafi

Ford Prior, Principal Cloud Engineer – CarMax

 

1- 1:45 PM | Breakout Session #2
"From Model to Micro-service - ML Deployments At Scale"
"Achieving Data Acumen: Improving Workforce Literacy"

Session: From Model to Micro-service – ML Deployments At Scale

 

Speaker: Ed Shee, Head of Developer Relations at Seldon

 

Until recently, the data science / machine learning field has been pretty immature in it’s adoption of DevOps tools and processes. That’s now changing rapidly as engineering teams realise that, in order to gain any value from their ML models, they need to get them into production.

In this talk, Ed will introduce the open source Seldon Core library and show how it simplifies the steps required to containerise, serve, log and monitor an ML model during deployment. Ed will demonstrate live how to build a model using popular machine learning tools, save and store the model artefact and then deploy it to Kubernetes to handle production traffic.

You will learn how to turn an ML model into a production microservice that handles REST/gRPC traffic and how to scale your deployment. You’ll also learn how to use complex model deployment techniques and how to monitor both the infrastructure and the models themselves, spotting drift and outliers as they take place.


 

Session: Achieving Data Acumen: Improving Workforce Literacy

 

Speaker: Peter Aiken – VCU & Anything Awesome

1:50- 2:25 PM | Breakout Session #3
"Sculpting Data for Machine Learning"
"Principles of Building Greenfield Data Platforms"

Session: Sculpting Data for Machine Learning

 

Speakers:

Jigyasa Grover, Machine Learning Engineer at Twitter

Rishabh Misra, Machine Learning Engineer at Twitter

 

In the contemporary world of machine learning algorithms – “data is the new oil”. For the state-of-the-art ML algorithms to work their magic it’s important to lay a strong foundation with access to relevant data. Volumes of crude data are available on the web nowadays, and all we need are the skills to identify and extract meaningful datasets. This talk aims to present the power of the most fundamental aspect of Machine Learning – Dataset Curation, which often does not get its due limelight. It will also walk the audience through the process of constructing good quality datasets as done in formal settings with a simple hands-on Pythonic example. The goal is to institute the importance of data, especially in its worthy format, and the spell it casts on fabricating smart learning algorithms.


 

Session: Principles of Building Greenfield Data Platforms

 

Speaker: Sam Portillo, Data Engineer at Ippon

2:30 - 3:15 PM | Breakout Session #4
"The Pre-cloud to Multi-Cloud journey with Cassandra on Kubernetes"
"Not-so-good-vibrations: Electric Grid Oscillations and Unearned Data Wisdom"

Session: The Pre-cloud to Multi-Cloud journey with Cassandra on Kubernetes

 

Speaker: Raghavan “Rags” Srinivas, Lead multi-cloud Developer Advocate at Datastax

 

Although some of today’s cloud properties, like elasticity, scalability, self-healing, durability, DR, etc. seemed revolutionary during the pre-cloud days, Cassandra had many of these properties already. With the ever growing popularity of Kubernetes, the k8ssandra open source project is intended to bring Cassandra’s advantages to the cloud and help simplify operations.

The biggest challenge in the Kubernetes world today is multi-cloud. Although it is easy to conceptualize, it is hard to implement. Attend this session for a quick overview of Cassandra, k8ssandra and Astra (DBaas). You will see how the worlds of Cassandra and Kubernetes on the cloud collide in a remarkably cohesive way to incorporate the best of both.

After attending this primarily demo driven session, attendees will walk away with a good understanding of the k8ssandra project and how it is evolving to support multi-region and multi-cloud. Along this journey we will look at one-off deployments on GKE, EKS including multi-cloud (GKE and EKS). On EKS, we leverage EKS Kubefed.


 

Session: Not-so-good-vibrations: Electric Grid Oscillations and Unearned Data Wisdom

 

Speaker: Kevin Jones, Manager – Electric Transmission Operations Engineering Support at Dominion Energy

 

This will be a fun and novel session that will expose participants to new flavors of data as well as electric grid use cases that address some of the more pernicious challenges of decarbonization. The talk will be grounded (no pun intended) for the practitioner audience by then exploring the perils of applying unearned wisdom on data-focused digital transformation through the lens of this unique set of use cases and data types. It will conclude by providing a (possible) alternative framework, as well as its contemporary significance and necessity, for the transfer and application of such wisdom.

3:20 - 3:55 PM | Keynote - Dr. Scott Penberthy - Director of Applied AI, Office of the CTO at Google
"Thinking like Newton"

Session: Thinking like Newton

Speaker: Dr. Scott Penberthy, Director of Applied AI, Office of the CTO at Google

 

Dr. Scott will share his perspective and personal experience on the journey from being a kid playing baseball and writing code in Midlothian Va, to the rise of AI and Machine Learning, its use in business and most recently the discovery of the source code to life. He’ll show how simple curiosities can lead to grand discoveries, often by making naive observations and asking what at first appear to be stupid questions. These are now leading to the vaccination of millions against terrible disease, the prevention of cancer, and soon perhaps a cure for gray hair.

2022 Speakers



Dr. Scott Penberthy

Director, Applied AI, Office of the CTO

Google

Dr. Timothy Haas

Professor

University of Wisconsin

Ed Shee

Head of Developer Relations

Seldon

Dr. Santona Tuli

Staff Data Scientist

Astronomer

Sam Portillo

Data Engineer

Ippon

Peter Aiken

Associate Professor of Information Systems

VCU + Anything Awesome

Abigail Byram

Data Engineer

SingleStone Consulting

Jigyasa Grover

Machine Learning Engineer

Twitter

Rishabh Misra

Machine Learning Engineer

Twitter

Raghavan Srinivas

Lead multi-cloud Developer Advocate

Datastax

Kevin Jones

Engineering Analytics & Modeling

Dominion Energy

Marcus Thornton

Deputy Chief Data Officer

Virginia Office of Data Governance and Analytics

Souj Narasimhacharya

Senior Vice President of Engineering

Koalafi

Jodi Kuhn

Director, Office of Data Quality and Visualization

at Commonwealth of Virginia

Zachary Knitter

Data Engineer, Office of Data Quality and Visualization

at Commonwealth of Virginia

Register For This Event