Data Mining

The Lead

Robo Brain Teaches Robots Everything from the Internet

August 28, 2014 11:52 am | by Cornell University | News | Comments

Robo Brain — a large-scale computational system that learns from publicly available Internet resources — is currently downloading and processing about 1 billion images, 120,000 YouTube videos, and 100 million how-to documents and appliance manuals. The information is being translated and stored in a robot-friendly format that robots will be able to draw on when they need it.

Citizen Science: Images of Earth at Night Crowdsourced for Science

August 19, 2014 2:59 pm | by NASA | News | Comments

A wealth of images of Earth at night taken by astronauts on the International Space Station (ISS...

HPC Innovation Excellence Award: University of Wisconsin-Madison

June 23, 2014 4:33 pm | Award Winners

University of Wisconsin researchers utilized HPC resources in combination with multiple advanced...

Data Mining Software Version Histories

June 23, 2014 6:30 am | by AlphaGalileo | News | Comments

Making changes within a complex software system is often error-prone – even the smallest mistake...

Projects that allow the general public to collaborate with scientists are becoming useful sources of knowledge on a large scale. Online databases such as iNaturalist.org and DiscoverLife.org — based at UGA — rely on amateur observers to contribute photo

New Data Collection, Analysis and Sharing Tools Help Protect Threatened Species

May 29, 2014 9:41 pm | by Science Newsline | News | Comments

Athens, Ga. – New tools to collect and share information could help stem the loss of the world's threatened species, according to a paper published today in the journal Science. The study—by an international team of scientists that included John L. Gittleman, dean of the University of Georgia Odum...

Huynh Phung Huynh, Scientist and Capability Group Manager, A*STAR Institute of High Performance Computing

Huynh Phung Huynh

April 16, 2014 8:45 am | Biographies

Huynh Phung Huynh's research interests include high performance computing (HPC): compiler optimization for GPUs, many-core processors and other accelerators; parallel computing: frameworks for parallel programming or scheduling; and HPC for data mining and machine learning algorithms.

Data Mining Disaster

March 28, 2014 4:33 pm | News | Comments

Computer technology that can mine data from social media during times of natural or other disaster could provide invaluable insights for rescue workers and decision makers. Advances in information technology have had a profound impact on disaster management.

Mathematics for Safer Medicine: Calculating Uncertainties within Technical Systems

January 7, 2014 6:20 am | by Heidelberg Institute for Theoretical Studies | News | Comments

The new HITS research group “Data Mining and Uncertainty Quantification” analyzes large amounts of data and calculates uncertainties in technical systems. With Prof. Vincent Heuveline as their group leader, the group of mathematicians and computer scientists especially focuses on increasing the security of technology in operating rooms.

Text Mining: The Next Data Frontier

January 6, 2014 2:04 pm | by Mark A. Anawis | Blogs | Comments

Josiah Stamp said: “The individual source of the statistics may easily be the weakest link.” Nowhere is this more true than in the new field of text mining, given the wide variety of textual information. By some estimates, 80 percent of the information available occurs as free-form text which, prior to the development of text mining, needed to be read in its entirety in order for information to be obtained from it.

'Approximate Computing' Improves Efficiency, Saves Energy

December 18, 2013 4:03 pm | by Emil Venere, Purdue University | News | Comments

Researchers are developing computers capable of "approximate computing" to perform calculations good enough for certain tasks that don't require perfect accuracy, potentially doubling efficiency and reducing energy consumption.

Meet HPC Innovator Taghrid Samak

December 3, 2013 4:03 pm | by Jon Bashor, Berkeley Lab Computational Research Division | Articles | Comments

Everything leading up to the actual coding, figuring out how to make it work, is what Samak enjoys most. One of the problems she is working on with the Department of Energy’s Joint Genome Institute (JGI) is a data mining method to automatically identify errors in genome assembly, replacing the current approach of manually inspecting the assembly.

Harnessing Collective Wisdom from Social Networks

November 7, 2013 12:49 pm | by National Science Foundation | News | Comments

In his 1937 book, "Think and Grow Rich," author Napoleon Hill identified 13 steps to success, one of which was the power of the mastermind. "No two minds ever come together without thereby creating a third, invisible, intangible force, which may be likened to a third mind," Hill wrote.

Hardware for Big Data, Graphs and Large-scale Computation

September 9, 2013 9:58 am | by Rob Farber | Articles | Comments

Recent announcements by Intel and NVIDIA indicate that massively parallel computing with GPUs and Intel Xeon Phi will no longer require passing data via the PCIe bus. The bad news is that these standalone devices are still in the design phase and are not yet available for purchase.

IBM Narrows Big Data Skills Gap, Partnering with More than 1,000 Global Universities

August 15, 2013 10:49 am | by IBM | News | Comments

IBM announced on August 24, 2013, that it has added nine new academic collaborations to its more than 1,000 partnerships with universities across the globe, focusing on Big Data and analytics - all of which are designed to prepare students for the 4.4 million jobs that will be created worldwide to support Big Data by 2015. The company also announced more than $100,000 in awards for Big Data curricula.

HPC Architectures Begin Long-Term Shift Away from Compute Centrism

August 15, 2013 8:43 am | by Steve Conway, IDC | Articles | Comments

The HPC market is entering a kind of perfect storm. For years, HPC architectures have tilted farther and farther away from optimal balance between processor speed, memory access and I/O speed. As successive generations of HPC systems have upped peak processor performance without corresponding advances in per-core memory capacity and speed, the systems have become increasingly compute centric

StatSoft Receives Top Ratings in KDnuggets Poll

June 11, 2013 2:36 pm | by StatSoft | News | Comments

The 14th annual KDnuggets Software Poll, conducted in May 2013, attracted record participation of 1,880 internet voters, more than doubling the previous year's numbers. KDnuggets.com is a data mining portal and newsletter publisher for the data mining community with more than 12,000 subscribers.

New Algorithm Cluster Improves Health Record Data Mining

May 14, 2013 9:18 pm | by New Jersey Institute of Technology | News | Comments

The time may be fast approaching for researchers to take better advantage of the vast amount of valuable patient information available from U.S. electronic health records. Lian Duan, an NJIT computer scientist with expertise in data mining, has done just that with the recent publication of "Adverse Drug Effect Detection," IEEE Journal of Biomedical and Health Informatics (March 2013).

Pathway Studio for Web

April 5, 2013 10:38 am | Elsevier, Inc. | Product Releases | Comments

Pathway Studio, a research solution for biologists, is now available in a Web-based version. The integrated data mining and visualization software features comprehensive knowledge bases produced by applying MedScan, Elsevier’s proprietary text-mining technology, to a large corpus of biological literature.

NSF-funded Superhero Supercomputer Helps Battle Autism

March 26, 2013 7:45 pm | News | Comments

When it officially came online at the San Diego Supercomputer Center (SDSC) in early January 2012, Gordon was instantly impressive. In one demonstration, it sustained more than 35 million input/output operations per second--then, a world record.

i3D Enterprise Service

March 22, 2013 3:06 pm | Shimadzu Scientific Instruments | Product Releases | Comments

i3D Enterprise Service integrates storage, processing and data mining in an enterprise-level private cloud. Laboratory data can be automatically and securely uploaded from instruments to a private cloud and processed on the cloud, enabling workflow execution and data mining in a fraction of the time.

SampleManager 11

March 22, 2013 2:51 pm | Thermo Fisher Scientific | Product Releases | Comments

SampleManager 11 laboratory information management system (LIMS) features advanced tools that are designed to improve laboratory process mapping, management and automation. Users can build workflows to reflect their individual laboratory processes and take ownership of workflow management.

Big Data, Big Science, Big Collaboration: Delivering Connected R&D for Better Value

March 15, 2013 3:26 pm | by Yike Guo, Imperial College | Articles | Comments

Today, we are more connected than ever. We live in an always-on world whose digital economy has made data a new form of resource that fundamentally changes our lives. But has this revolution really occurred across R&D domains? At a time when global R&D investment is over $1.5 trillion, leading voices still bemoan a lack of open access to decision-making data and an innovation deficit syndrome.

Breakthrough Prize in Life Sciences Announced

February 21, 2013 5:38 am | News | Comments

Art Levinson, Sergey Brin, Anne Wojcicki, Mark Zuckerberg, Priscilla Chan and Yuri Milner announced the launch of the Breakthrough Prize in Life Sciences, recognizing excellence in research aimed at curing intractable diseases and extending human life

SDSC Invites Researchers to Apply for Access to Gordon Supercomputer

September 28, 2012 11:13 am | News | Comments

The San Diego Supercomputer Center (SDSC) at the University of California, San Diego, is seeking innovative applications for the next round of user allocations on its data-intensive Gordon supercomputer, which went into operation earlier this year

Mining Big Data for Faster Carbon Footprint Assessments

September 20, 2012 11:09 am | by Columbia Engineering | News | Comments

Researchers at Columbia Engineering have developed new software that can simultaneously calculate the carbon footprints of thousands of products faster than ever before. “Our novel approach generates standard-compliant product carbon footprints for companies with large portfolios at a fraction of previously required time and expertise,”

Introducing a Paperless Lab

August 8, 2012 8:09 am | by Ulf Fuchslueger and Andreas Schild | Articles | Comments

Laboratories working in the pharmaceutical industry in the areas of R&D and quality control find themselves increasingly having to cope with conflicting demands — tougher regulatory requirements and harsher economic realities. In order to meet these demands, new ways of dealing with process, data and system management are necessary.

SC12 Registration Now Open

July 25, 2012 11:37 am | News | Comments

SC12 will streamline conference information and move to a virtually real-time method of determining technical program thrusts

View the World through the Eyes of Wikipedia

June 28, 2012 5:47 pm | News | Comments

SGI, a leader in technical computing, has partnered with Kalev H. Leetaru of the University of Illinois to create the first-ever historical mapping and exploration of the full text contents of the English-language edition of Wikipedia, in time and space.

Exemplar Biomarker Discovery LIMS

June 5, 2012 12:10 pm | Sapio Sciences Llc | Product Releases | Comments

Exemplar Biomarker Discovery LIMS for personalized medicine is designed to address pharmaceutical and biotech companies’ needs for a single, integrated solution that addresses everything from sample management through study data management, assay data management, data mining and statistical analysis
