by Petr Keil, R you cereal? blog, January 2, 2012
Data-driven scientists (data miners) such as Rosling believe that data can tell a story, that observation equals information, that the best way towards scientific progress is to collect data, visualize them and analyze them (data miners are not specific about what analyze means exactly). When you listen to Rosling carefully he sometimes makes data equivalent to statistics: a scientist collects statistics. He also claims that “if we can uncover the patterns in the data then we can understand.” I know this attitude: there are massive initiatives to mobilize data, integrate data, there are methods for data assimilation and data mining, and there is an enormous field of scientific data visualization. … And they are all excited about big data: the larger is the number of observations (N) the better. Rosling is right that data are important and that science uses statistics to deal with the data. But he completely ignores the second component of statistics: hypothesis (here equivalent to model or theory). …
To read this article as well as the interesting debate that followed in the comments, please visit Data-driven science is a failure of imagination | R you cereal?.
- Data Scientists Will Unlock Big Data’s Promise (blogs.wsj.com)
- What is Data Science? (architects.dzone.com)
- The Human Face of Big Data, a Book Review (sys-con.com)
Open Data, Open Knowledge, Open Solutions: Possibilities and Pitfalls
Thursday, September 22, 2011; 11:30 a.m. – 1 p.m.
Watch Live from the World Bank Annual Meetings in Washington, DC! As part of the World Bank’s 2011 Annual Meetings and Civil Society Forum, The World Bank will host a discussion with leading members of the civil society, open government, open development communities to discuss a new “Open Development Agenda,” in which individuals are empowered to create better solutions for development issues. The session will begin with an overview of Open Development, its implications for development partners, and how this move toward greater openness in data and knowledge is changing the entire development paradigm. It will include a lively moderated conversation on the opportunities presented by open data, open knowledge, and open solutions and how these relate to development challenges and aid effectiveness. Topics will include: What are the potential limitations of “open”? How can we draw on knowledge, learning, and innovation from a much wider pool of “solvers” and donor resources? Participants will also have an opportunity to see new mobile apps and the updated Mapping for Results portal. The session will close with an open dialogue, where participants will have an opportunity to present their ideas and feedback on the changing roles of the private sector, civil society organizations, and governments in making development more effective.
- The Open Knowledge Foundation Comes of Age (mt-soft.com.ar)
- NYT: World Bank Is Opening Its Treasure Chest of Data (geodatapolicy.wordpress.com)
The Future of Scientific Knowledge Discovery in Open Networked Environments: A National Symposium and Workshop
The Future of Scientific Knowledge Discovery in Open Networked Environments:
A National Symposium and Workshop
Washington, DC, March 10-11, 2011
The Symposium is free and open to the public, but advance registration is requested. Please RSVP to Cheryl Levey email@example.com.
Board on Research Data and Information
in collaboration with
Computer Science and Telecommunications Board
National Academy of Sciences
Digital technologies and networks have enhanced access to and use of scientific data, information, and literature significantly, and also have great promise for accelerating the discovery and the communication of knowledge both within the scientific community and in the broader society. This is particularly the case for scientific data and information that are openly available online. Scientific knowledge discovery in open networked environments, referred to in this proposal as computer-mediated or computational scientific knowledge discovery, may be defined as a research process that is enabled by different digital computing technologies such as data mining, information retrieval and extraction, artificial intelligence, distributed grid computing, and many other automated methods. Together, these technological capabilities are supporting the emergence of computer-mediated knowledge discovery as a new paradigm in the conduct of research.
A symposium and workshop will be convened in Washington, DC to bring together key stakeholders in this area for intensive and structured discussions in order to obtain a better understanding of the state-of-the-art and future trends in the study of computational scientific knowledge discovery in the open online environment and to develop a range of options for future work in this area. Specifically, the project will be performed pursuant to the following statement of task: