Mac OS X Lion clean install

Did a clean install (i.e. without installing Snow Leopard first) of Mac OS X Lion on a friend's MacBook Pro tonight.  Kind of disappointed that the installer didn't recognize the blank disk and ask to partition it.  Yes the partitioning utility is there and it was easy, but an unnecessary step was needed before the install could begin.  I guess Apple is banking on their users not actually doing an install on a blank disk.

Posted
 

Big Data Requires a Big, New Architecture - CIO Central - CIO Network - Forbes

The problem is that, in the world of big data, we don’t really know what value the data has when  it’s initially accepted from the array of sources available to us.

For instance unstructured data by itself many times takes on a whole new level of value to an organization when it is analyzed in context with transaction-based data.

Posted
 

RecordBreaker: Automatic structure for your text-formatted data | Apache Hadoop for the Enterprise | Cloudera

RecordBreaker is a project that automatically turns your text-formatted data (logs, sensor readings, etc) into structured data, without any need to write parsers or extractors. In particular, RecordBreaker targets Avro as its output format. The project’s goal is to dramatically reduce the time spent preparing data for analysis, enabling more time for the analysis itself.

Hadoop’s HDFS is often used to store large amounts of text-formatted data: log files, sensor readings, transaction histories, etc. Much of this data is “near-structured”: the data has a format that’s obvious to a human observer, but is not made explicit in the file itself.

This looks like is would have potential in identifying and cataloging data files inside Hadoop.

Posted
 

Big Data – Same Problems? by Chris Bradley

Media_httpwwwbeyenetw_ijszn

Chart of Data Quality Maturity Levels for Big Data Analytics by Chris Bradley.

Posted