Hi. My name is Sean, and I am a data scientist. Which means I am, in effect, a data janitor who applies machine learning, natural language processing, social network analysis (sociallycompute.io), data mining and statistics to a lot of different data sets (ocdx.io) and organizations (CHAOSS).  My research these days focuses on understanding new types of organizations in open source software and health care. Both from a data focused point of view.


The Data I sift through is mostly (maybe entirely) data about people.  Who they talk to and what they say online . Data science is a set of tools and techniques I use to ask social science research questions.  And build systems that are hopefully of use to people.

Put the data science in a salad with social science: Now You may think of me as a “sociotechnical” (sociotech.net) scholar/builder/designer.  These days, that means I am concerned with the social impacts of computing, and how small groups, communities and culture are enacted through technology. Cell phones. Games. Facebook. Lasers attached to shark skulls. All of it.

Off to the left on this page, you see my current research projects. Much data. Very science. (See Doge Speak). I also have a startup company called Miner Technologies and am the Director of a new Masters in Data Science and Analytics at the University of Missouri. I am an Associate Professor at Mizzou.

You can learn more about me by checking me out on the various social networking sites:

  1. Github
  2. Google Scholar
  3. Twitter
  4. Instagram

My current “Curriculum Vitae” is Here.

Look around on this site. Drop me a note if you have questions or want to learn more. I am always recruiting Ph.D students, undergraduate programmers and industry focused collaborations.

If you are looking for more of a press release flavored bio, here ya go!:

Sean is an open source software researcher and a founding member of the Linux Foundations working group on community health analytics for open source software CHAOSS, co-lead of the CHAOSS metrics software working group and leader of the open source metrics tool AUGUR which can be forked and cloned and experimented wtih on GitHub. After a decade as a software engineer, Sean decided his calling was in research. His open source research is framed around a broader agenda of social computing research, which he pursues as an associate professor of computer science at the University of Missouri.

Sean is also the founder of their Data Science Masters program at Missouri, which he’s now passed on to better managers. Sean’s publications focus on understanding how social technologies influence organizational, small group and community dynamics, typically including analysis of electronic trace data from systems combined with the perspectives of people whose behavior is traced. Group Informatics is a methodology and ontology Sean has articulated with the aim of helping build consensus among researchers and developers for how to ethically and systematically make sense of electronic trace data. Structural fluidity, a construct Sean developed with his collaborators Peppo Valetto and Kelly Blincoe, aims to make sense of structural dynamics in virtual software organizations, and how those dynamics affect performance. Working with Josh Introne, Bryan Semaan and Ingrid Erickson, Sean is elaborating on mechanisms for identifying structural fluidity and organizational dynamics in electronic trace data using the lens of complex systems theory. His other work includes collaborations with Matt Germonprez on the Open Collaboration Data Exchange and Open Source Health metrics projects. He lives in Columbia, MO with his wife Kate, two step daughters and a dog named Huckleberry.