RVA's leading Data Conference Returns March 2022.
Join the technology community March 2022, for our next conference full of innovation, in-depth workshops, and incredible presentations given by a diverse line-up of expert speakers + thought leaders at rvatech/DataSummit. Get immersed in the world of Data & discover the trends, and innovations shaping the future of technology!
Register NowAbout the Event
Tech is continuously evolving and innovating, making it essential to stay ahead of the curve and explore the year’s trending technology. In 2022, artificial intelligence and data science are continuing to move from the realm of university research to be a critical part of a software developers toolkit and a key differentiator for forward-leaning companies.
Schedule
9:30 AM – 5:oo PM
Student Tickets
Discount Rate – The special discounted rate for student tickets is 25% off. That’s $50 off the regular ticket price!
Discount code: DATA22STUDENT
**All student tickets must provide a student .edu email address when registering**
Venue
Science Museum of Virginia – Dewey Gottwald Center
2301 W Leigh Street Richmond, VA 23220
Common Questions
Where exactly am I going?
The Dewey Gottwald Center is located behind the museum. Parking lot entrance located off of West Leigh Street or DMV Drive.
Where do I park?
Science Museum parking is ample and free. Spaces near the front of the building are designated for guests with mobility challenges. All other guests should park in the lot located directly behind the Science Museum off West Leigh Street or DMV Drive.
You may also view a detailed parking map here.
Meet the Keynotes
Dr. Scott Penberthy
Director, Applied AI, Office of the CTO
at Google
Session: Thinking like Newton
Dr. Scott will share his perspective and personal experience on the journey from being a kid playing baseball and writing code in Midlothian Va, to the rise of AI and Machine Learning, its use in business and most recently the discovery of the source code to life. He’ll show how simple curiosities can lead to grand discoveries, often by making naive observations and asking what at first appear to be stupid questions. These are now leading to the vaccination of millions against terrible disease, the prevention of cancer, and soon perhaps a cure for gray hair.
Dr. Timothy Haas
Professor of Statistics and Wildlife Conservation
Author at University of Wisconsin
Session: How Firms can Apply Data Science to Save Species
Ecosystem loss, extinctions and climate change are ongoing challenges to life on Earth, and coming up with a plan to tackle their effects requires an accurate picture of what’s happening where, and who is involved.
Prof Tim Haas, University of Wisconsin Milwaukee, has taught and refined such models for years. In his latest paper, he lays out the case for a model unifying human behaviour, climate and ecosystem data, the computational power required to run it, and the credibility criteria any model should meet to prove its worth.
Conference Agenda
9:30 - 10 AM
Registration + Morning Networking
10 - 10:50 AM | Keynote - Dr. Timothy Haas
"How Firms can Apply Data Science to Save Species"
Speaker: Dr. Timothy Haas – Univ. of Wisconsin
Ecosystem loss, extinctions and climate change are ongoing challenges to life on Earth, and coming up with a plan to tackle their effects requires an accurate picture of what’s happening where, and who is involved.
Prof Tim Haas, University of Wisconsin Milwaukee, has taught and refined such models for years. In his latest paper, he lays out the case for a model unifying human behaviour, climate and ecosystem data, the computational power required to run it, and the credibility criteria any model should meet to prove its worth.
10:50 - 11:20 AM | Keynote - Ovetta Sampson
V.P Machine Learning Experience Design & Responsible AI at Capital One
11:25 AM - 12:05 PM | Breakout Session #1
"Machine Learning Orchestration with Airflow"
Data Ethics Panel
Session: Machine Learning Orchestration with Airflow
Speaker: Dr. Santona Tuli – Astronomer
While more popular as a ETL and data engineering tool, Airflow is also extremely powerful at orchestrating machine learning pipelines. In this talk, I show how you can write complete end-to-end pipelines starting with retrieving raw data to serving ML predictions to end-users, entirely in Airflow.
A plethora of tools have flooded the field of data science, many of which have overlapping functionality. As a data scientist‚ especially as one from a traditionally disenfranchised background with limited resources‚ selecting and learning the right tools in order to maximize one’s competitiveness in the job market, can be intimidating and, possibly, challenging. In this talk, I will walk through an end-to-end data science workflow using only easily accessible open-source software (OSS). Many of these OSS in fact form the basis of the enterprise products that companies purchase as tools for their data professionals to use. Knowing the underlying software can give data scientists the competitive advantage of being tool-agnostic and quick at picking up new tools in an ever-evolving landscape. We will use OSS for data featurization, machine learning model training and serving as well as validation and visualization of results.
Session: Panel on Data Ethics
Speakers:
Abigail Byram – Data Engineer at SingleStone Consulting
Jodi Kuhn – Director, Office of Data Quality and Visualization at Commonwealth of Virginia
Zachary Knitter – Data Engineer, Office of Data Quality and Visualization at Commonwealth of Virginia
12:05 - 1 PM
Lunch Break + Networking
Live Data Privacy Podcast Session
Lunch provided by Groovin’ Gourmet
Live Podcast on Data Privacy and Governance
Featured Guests:
Marcus Thornton, Deputy Chief Data Officer – Virginia Office of Data Governance and Analytics
Souj Narasimhacharya, Senior Vice President of Engineering – Koalafi
Ford Prior, Principal Cloud Engineer – CarMax
1- 1:45 PM | Breakout Session #2
"From Model to Micro-service - ML Deployments At Scale"
"Achieving Data Acumen: Improving Workforce Literacy"
Session: From Model to Micro-service – ML Deployments At Scale
Speaker: Ed Shee, Head of Developer Relations at Seldon
Until recently, the data science / machine learning field has been pretty immature in it’s adoption of DevOps tools and processes. That’s now changing rapidly as engineering teams realise that, in order to gain any value from their ML models, they need to get them into production.
In this talk, Ed will introduce the open source Seldon Core library and show how it simplifies the steps required to containerise, serve, log and monitor an ML model during deployment. Ed will demonstrate live how to build a model using popular machine learning tools, save and store the model artefact and then deploy it to Kubernetes to handle production traffic.
You will learn how to turn an ML model into a production microservice that handles REST/gRPC traffic and how to scale your deployment. You’ll also learn how to use complex model deployment techniques and how to monitor both the infrastructure and the models themselves, spotting drift and outliers as they take place.
Session: Achieving Data Acumen: Improving Workforce Literacy
Speaker: Peter Aiken – VCU & Anything Awesome
1:50- 2:25 PM | Breakout Session #3
"Sculpting Data for Machine Learning"
"Principles of Building Greenfield Data Platforms"
Session: Sculpting Data for Machine Learning
Speakers:
Jigyasa Grover, Machine Learning Engineer at Twitter
Rishabh Misra, Machine Learning Engineer at Twitter
In the contemporary world of machine learning algorithms – “data is the new oil”. For the state-of-the-art ML algorithms to work their magic it’s important to lay a strong foundation with access to relevant data. Volumes of crude data are available on the web nowadays, and all we need are the skills to identify and extract meaningful datasets. This talk aims to present the power of the most fundamental aspect of Machine Learning – Dataset Curation, which often does not get its due limelight. It will also walk the audience through the process of constructing good quality datasets as done in formal settings with a simple hands-on Pythonic example. The goal is to institute the importance of data, especially in its worthy format, and the spell it casts on fabricating smart learning algorithms.
Session: Principles of Building Greenfield Data Platforms
Speaker: Sam Portillo, Data Engineer at Ippon
2:30 - 3:15 PM | Breakout Session #4
"The Pre-cloud to Multi-Cloud journey with Cassandra on Kubernetes"
"Not-so-good-vibrations: Electric Grid Oscillations and Unearned Data Wisdom"
Session: The Pre-cloud to Multi-Cloud journey with Cassandra on Kubernetes
Speaker: Raghavan “Rags” Srinivas, Lead multi-cloud Developer Advocate at Datastax
Although some of today’s cloud properties, like elasticity, scalability, self-healing, durability, DR, etc. seemed revolutionary during the pre-cloud days, Cassandra had many of these properties already. With the ever growing popularity of Kubernetes, the k8ssandra open source project is intended to bring Cassandra’s advantages to the cloud and help simplify operations.
The biggest challenge in the Kubernetes world today is multi-cloud. Although it is easy to conceptualize, it is hard to implement. Attend this session for a quick overview of Cassandra, k8ssandra and Astra (DBaas). You will see how the worlds of Cassandra and Kubernetes on the cloud collide in a remarkably cohesive way to incorporate the best of both.
After attending this primarily demo driven session, attendees will walk away with a good understanding of the k8ssandra project and how it is evolving to support multi-region and multi-cloud. Along this journey we will look at one-off deployments on GKE, EKS including multi-cloud (GKE and EKS). On EKS, we leverage EKS Kubefed.
Session: Not-so-good-vibrations: Electric Grid Oscillations and Unearned Data Wisdom
Speaker: Kevin Jones, Manager – Electric Transmission Operations Engineering Support at Dominion Energy
This will be a fun and novel session that will expose participants to new flavors of data as well as electric grid use cases that address some of the more pernicious challenges of decarbonization. The talk will be grounded (no pun intended) for the practitioner audience by then exploring the perils of applying unearned wisdom on data-focused digital transformation through the lens of this unique set of use cases and data types. It will conclude by providing a (possible) alternative framework, as well as its contemporary significance and necessity, for the transfer and application of such wisdom.
3:20 - 3:55 PM | Keynote - Dr. Scott Penberthy - Director of Applied AI, Office of the CTO at Google
"Thinking like Newton"
Session: Thinking like Newton
Speaker: Dr. Scott Penberthy, Director of Applied AI, Office of the CTO at Google
Dr. Scott will share his perspective and personal experience on the journey from being a kid playing baseball and writing code in Midlothian Va, to the rise of AI and Machine Learning, its use in business and most recently the discovery of the source code to life. He’ll show how simple curiosities can lead to grand discoveries, often by making naive observations and asking what at first appear to be stupid questions. These are now leading to the vaccination of millions against terrible disease, the prevention of cancer, and soon perhaps a cure for gray hair.
2022 Speakers
Dr. Scott Penberthy
Director, Applied AI, Office of the CTO
Dr. Timothy Haas
Professor
University of Wisconsin
Ed Shee
Head of Developer Relations
Seldon
Dr. Santona Tuli
Staff Data Scientist
Astronomer
Sam Portillo
Data Engineer
Ippon
Peter Aiken
Associate Professor of Information Systems
VCU + Anything Awesome
Abigail Byram
Data Engineer
SingleStone Consulting
Jigyasa Grover
Machine Learning Engineer
Rishabh Misra
Machine Learning Engineer
Raghavan Srinivas
Lead multi-cloud Developer Advocate
Datastax
Kevin Jones
Engineering Analytics & Modeling
Dominion Energy
Marcus Thornton
Deputy Chief Data Officer
Virginia Office of Data Governance and Analytics
Souj Narasimhacharya
Senior Vice President of Engineering
Koalafi
Jodi Kuhn
Director, Office of Data Quality and Visualization
at Commonwealth of Virginia
Zachary Knitter
Data Engineer, Office of Data Quality and Visualization
at Commonwealth of Virginia