061219 Update


A bi-monthly newsletter with updates on data and computing news and events for UW-Madison researchers.

In the June 12, 2019 update:

  • Institute for Foundations of Data Science
  • Upcoming Campus Events
  • Upcoming Training and Workshops
  • Campus Opportunities and Groups
  • External Opportunities

Are you…

  • unsure which campus data and computing resources you need for your research?
  • interested in making connections and starting new collaborations with data scientists and other researchers on campus?
  • looking for training in data and computing skills?

The Data Science Hub can help! Send an email to the Data Science facilitator (facilitator@datascience.wisc.edu) or come by Hub Central in the Discovery Building during office hours (Th 3:00-5:00 PM). Sarah will also be at Open Coding Lab in Steenbock Library (T 2:30-4:30 PM). Check the calendar for the latest details and updates.

Institute for Foundations of Data Science (IFDS)

As data continues to accumulate at an ever-increasing rate, so does the need for powerful and novel methods to extract information from data in a form that is useful to individuals, society, researchers, and commerce. During the past decade, researchers across campus have done important work in diverse aspects of fundamental data science, as well as in applications to numerous areas of domain science, engineering, and medicine. This group forms the core of the Institute for Foundations of Data Science (IFDS) at UW-Madison. Building on previous work, and pursuing new goals sparked at the interfaces of mathematics, statistics, and theoretical computer science, IFDS aims to produce excellent research and to epitomize the possibilities of a collaborative approach to investigating fundamental issues in data science. The IFDS is supported by a $1.5 million grant from the National Science Foundation’s Transdisciplinary Research in Principles of Data Science (TRIPODS) initiative.


Upcoming Campus Events (Calendar View)

Biostatistics & Medical Informatics (BMI):
June 17, 11:30am, HSLC 1345, Town Hall for Candidate for the Chair of Biostatistics & Medical Informatics, Eleanor Feingold, University of Pittsburgh
June 18, 1:00pm, HSLC 1345, Genetics, Epidemiology, and Statistical Challenges of Low-Dimensional Big Data. Eleanor Feingold, University of Pittsburgh

June 21, 1:00 pm, Atrium Discovery Building, SROP/Biomedical Data Science Poster Session

Upcoming Training and Workshops

Stata for Researchers

Stata is the most popular statistical software at the SSCC, as it is both very powerful and relatively easy to learn. This class will teach you the fundamentals of Stata and give you a strong foundation you can build on to become an expert Stata user. You do not need any experience with Stata to benefit from this workshop, but people who learned how to run a few Stata commands for a class or who figured out some things on their own will benefit from its broader and more rigorous approach. The material covered is also available in the SSCC Knowledge Base under Stata for Researchers. This workshop will be held June 17-20 from 10:00am – 12:00pm in 3218 Sewell Social Sciences. REGISTRATION FOR THIS TRAINING IS REQUIRED.

Geospatial Data Carpentry

8:30am – 4:30pm, July 11-12, 2019

This hands-on workshop teaches basic concepts, skills and tools for working more effectively with geospatial data. The workshop is for any researcher who has data they want to analyze, and no prior computational experience is required. Event hosted by the Data Science Hub. Registration is now open.

Campus Opportunities and Groups

PhD Level Research Assistant Positions 

Dr. Qiongshi Lu has multiple PhD-level RA openings in his research group specializing in statistical genetics. The student RAs are expected to work on methodological research motivated by human genetics research problems, and will work on large-scale genetic data for a variety of diseases and human traits. Experience in genetic analysis is NOT required, but a strong interest in genetics is highly valued. You can find more information about recent research projects on the lab website. To apply, please send Dr. Lu your CV and transcript.

Dell’s Data Protection Suite Bootcamp

Senior Engineer Adam VandeVen has created a virtual environment of Dell’s Data Protection Suite. The lab will provide a proof of concept on integrating with native toolsets and simplifying the administrative burden of protecting data through automation opportunities. This bootcamp will cover how to automate the protection of virtual workloads by creating rules that protect VMs by VMware folder or tags, protect massive datasets that are difficult to manage on NAS appliances using NDMP, protect data in popular databases such as SQL and Oracle, use RecoverPoint for Virtual Machines to protect virtual workloads within seconds of a change occurring, and understand options regarding tiering and disaster recovery, both on-premises and in the public cloud. The bootcamp will be held in Union South (check the TITU board for room) on Thursday, June 27th, from 9:00am-1:00pm. The bootcamp is free to employees but limited to 15 people, so please RSVP. Please bring your laptop. Lunch will be provided.

High Performance Networking at Google
This discussion of High Performance Networking concepts is targeted towards a technical audience. The event will start with a quick discussion of the OS implications of ever-increasing network speeds. The next section describes a typical Google datacenter, as motivation for thinking about networking challenges at scale. A brief overview of Linux’s general-purpose network stack highlights some opportunities for different tradeoffs. Finally, RDMA (Remote Direct Memory Access) is discussed in detail. Expected benefits of attending this talk are 1) insight into some of the networking problems being addressed by large providers (and, by proxy, the gains of using a solution like BigQuery) and 2) exposure to RDMA (knowing what it is, what it can do, etc.). The event will be held at Madison Central Public Library, Room 302, 201 West Mifflin St., Madison, WI.

Computational Plant Sciences (ComPS) Group
A common problem both bench and field plant scientists face is that advances in high-throughput measurement platforms have outpaced our ability to readily analyze the datasets produced. To help solve this problem, researchers in Plant Sciences formed a community of practice, a peer-to-peer mentoring network that will work across lab, departmental, and college boundaries to help plant scientists build computational and data science skillsets. Over the summer, ComPS will hold monthly meetings at 4:30pm at the Union Terrace on the first Wednesday of the month.

Research Systems Administrators Group (RSAG)
RSAG meets on the third Wednesday of every month and allows systems administrators of research systems to share expertise. Join the email list by sending an email message to join-rsag@lists.wisc.edu for updates and future meetings.

External Opportunities

Trusted CI Webinar Series

This seminar series will provide information about readily available cybersecurity services tailored to the NSF science community. The webinars are recorded and will be linked to this page for later viewing. Each webinar includes time for questions from attendees and uses Zoom. The next webinar will be on June 24th: The Trusted CI Framework: An Architecture for Cybersecurity Programs with the Trusted CI team (Register now).

Machine-learning Opportunities at the University of Wisconsin-Milwaukee

The Data Science/Machine Learning group in the Department of Physics plans to hire up to three postdoctoral associates in applications of machine learning to physics and biology. They seek outstanding candidates to pioneer the development and application of advanced machine learning techniques to key problems at the forefront of science. While familiarity with machine learning would be an advantage, exceptional candidates will receive serious consideration regardless of background. Please send applications (cover letter, resume and publications list) to Prof. Abbas Ourmazd.

Big Data Scientist Training Enhancement Program (BD-STEP)

Managing, analyzing, and leveraging data generated across the cancer continuum has the potential to advance cancer knowledge and improve patient care. The National Cancer Institute (NCI) and the Veterans Health Administration (VHA) of the Department of Veterans Affairs (VA) are recruiting postdoctoral fellows for the Big Data Scientist Training Enhancement Program (BD-STEP) to increase capacity for data-driven cancer research by supporting healthcare-focused data science training. The primary goal of BD-STEP is to help develop a next generation of researchers who will be capable of using large clinical and research data sets to efficiently and effectively advance cancer research. Additional information about the program is available at https://cssi.cancer.gov/bd-step and https://www.va.gov/oaa/specialfellows/programs/sf_bdstep.asp. Applications should be submitted online and will be accepted through July 19, 2019.

Data Driven Wisconsin 2019 Call for Proposals
We are pleased to announce the fourth annual Data-Driven Wisconsin conference (formerly Big Data Wisconsin conference), to be held in Milwaukee on Wednesday, July 31. We are seeking speakers to give talks on all aspects of data science, big data technology, and their applications in business, science, government and health. Jordan Ellenberg, NY Times best-selling author of How Not to Be Wrong: The Power of Mathematical Thinking, will be giving our opening keynote, and Cathy O’Neil, NY Times best-selling author of Weapons of Math Destruction: How Big Data Increases Inequality and Threatens Democracy, will be giving the afternoon keynote.

Check the calendar for the latest details and updates for all listed events. If you have a relevant event or group you’d like to see included in next month’s newsletter, please send us an email at newsletter@datascience.wisc.edu.