July 2007

Jul
26

Feature of the Week: Automatic Detection of Scrapeable Content

Related posts:Feature of the Week: Content negotiation...Detection of Textured Areas in Images Using a Disorganization Indicator Based on Component Counts...PaperDiff: A Script Independent Automatic Method for Finding The Text Differences Between Two Document Images...Feature of the Week...Feature of the Week...Feature of the week...

Jul
24

Offline/Realtime Traffic Classification Using Semi-Supervised Learning

HPL-2007-121 Offline/Realtime Traffic Classification Using Semi-Supervised Learning - Erman, Jeffrey; Mahanti, Anirban; Arlitt, Martin; Cohen, Ira; Williamson, Carey
Keyword(s): traffic classification; semi-supervised learning; clustering
Abstract: Identifying and categorizing network traffic by application type is challenging because of the continued evolution of applications, especially of those with a desire to be undetectable. The diminished effectiveness of port-based identification and the overheads of deep packet inspection approaches m ...
Full Report Related posts:Quantifying Counts, Costs, and Trends Accurately via Machine Learning...BNS Scaling: An Improved Representation over TF.IDF for SVM Text Classification...Scaling Up Text Classification for Large File Systems...Thumbs up? Sentiment Classification using Machine Learning Techniques...Boosting Inductive Transfer for Text Classification using Wikipedia...Transfer Learning From Multiple Source Domains via Consensus Regularization...

Jul
24

YouTube Traffic Characterization: A View From the Edge

HPL-2007-119 YouTube Traffic Characterization: A View From the Edge - Gill, Phillipa, Arlitt, Martin; Li, Zongpeng; Mahanti, Anirban
Keyword(s): traffic characterization; YouTube; Web 2.0; caching
Abstract: This paper presents a traffic characterization study of the popular video sharing service, YouTube. Over a three month period we observed almost 25 million transactions between users on an edge network and YouTube, including more than 600,000 video downloads. We also monitored the globally popular v ...
Full Report Related posts:Offline/Realtime Traffic Classification Using Semi-Supervised Learning...EtherApe...Characterization of Noise in Digital Photographs for Image Processing...Tutorials...Getting to the National Press Club’s AFFIRM Event...Back to Melonville...

Jul
24

Capacity Management and Demand Prediction for Next Generation Data Centers

HPL-2007-116 Capacity Management and Demand Prediction for Next Generation Data Centers - Gmach, Daniel; Rolia, Jerry; Cherkasova, Ludmila; Kemper, Alfons
Keyword(s): capacity management; next generation data centers; performance models; measurements; workload analysis; automation; enterprise applications; shared resource pools
Abstract: Advances in server, network, and storage virtualization are enabling the creation of resource pools of servers that permit multiple application workloads to share each server in the pool. This paper proposes and evaluates aspects of a capacity management process for automating the efficient use of s ...
Full Report Related posts:Water Efficiency Management in Datacenters (Part I): Introducing a water usage metric based on available energy consumption...Capacity and Performance Overhead in Dynamic Resource Allocation to Virtual Containers...Analysis of Environmental Data in Data Centers...Fully Distributed Service Configuration Management...SmartFrog and Data Centre Automation...R-Capriccio: A Capacity Planning and Anomaly Detection Tool for Enterprise Services with Live Workloads...

Jul
24

“Merolyn the Phone”: A study of Bluetooth naming practices

HPL-2007-115 "Merolyn the Phone": A study of Bluetooth naming practices - Kindberg, Tim; Jones, Timothy
Keyword(s): bluetooth; electronic identity; naming; mobile phones
Abstract: This paper reports the results of an in-depth study of Bluetooth naming practices which took place in the UK in August 2006. There is a significant culture of giving Bluetooth names to mobile phones in the UK, and this paper's main contribution is to provide an account of those Bluetooth naming prac ...
Full Report Related posts:I, Me and My Phone: Identity and Personalization using Mobile Devices...“My iPod is my Pacifier”: An Investigation on the Everyday Practices of Mobile Video Consumption...Structure and tie strengths in Mobile Communication Networks...Naming Names...Workshop: Tinkering, Tailoring, & Mashing: The Social and Collaborative Practices of the Read-Write Web...SE3D User Study...

Jul
24

Mediascapes: Context-Aware Multimedia Experiences

HPL-2007-113 Mediascapes: Context-Aware Multimedia Experiences - Stenton, S. Philip; Wee, Susie; Hull, Richard; Goddi, Patrick M.; Reid, Josephine E.; Clayton, Ben J.C.; Melamed, Tom J.
Keyword(s): No keywords available.
Abstract: No abstract available. ...
Full Report Related posts:On Identity-Aware Devices: Putting Users in Control across Federated Services...Using GPS to Attach Real World Coordinates to Maps...A Search Engine Index for Multimedia Content...On Parametric Obligation Policies: Enabling Privacy-aware Information Lifecycle Management in Enterprises...TCO-aware provisioning of information security infrastructure...On Identity Analytics: Setting the Context...

Jul
24

Using GPS to Attach Real World Coordinates to Maps

HPL-2007-112 Using GPS to Attach Real World Coordinates to Maps - Melamed, Tom; Clayton, Ben
Keyword(s): GPS; locative media; context sensitive; mediascape; mscape; map; coordinate
Abstract: This paper discusses the requirements for map images such that they are suitable for the construction of location-based services and applications such as mediascapes. We then detail a specific class of maps that satisfy many of these requirements but may lack coordinate information. We show that fin ...
Full Report Related posts:A Real-Time Expectation Maximization Algorithm for Acquiring Multi-Planar Maps of Indoor Environments with Mobile Robots...New York Talk Exchange...CMM...Typographic Links...Paula Scher: Maps series...links for 2007-10-02...

Jul
24

Endless Documents: a Publication as a Continual Function

HPL-2007-111 Endless Documents: a Publication as a Continual Function - Lumley, John; Gimson, Roger; Rees, Owen
Keyword(s): XML; XSLT; SVG; document construction; functional programming
Abstract: Variable data documents can be considered as functions of their bindings to values. The Document Description Framework (DDF) treats documents in this manner, using XSLT semantics to describe document functionality and a variety of related mechanisms to support layout, reference and so forth. But the ...
Full Report Related posts:Endless Documents: a Publication as a Continual Function...Xebece...A Semantic Wiki for Continual Collaborative Information Management...links for 2008-02-26...Locality Sensitive Hash Function Based on Concomitant Rand Order Statistics...Visual Exploration of Topic Shifts...

Jul
24

Ingestion Pipeline for RDF

HPL-2007-110 Ingestion Pipeline for RDF - Bhatia, Nipun; Seaborne, Andy
Keyword(s): ingestion pipeline; validation of RDF; inferencing; large RDF datasets
Abstract: In this report we present the design and implementation of an ingestion pipeline for RDF Datasets. Our definition of ingestion subsumes: validation and inferencing. The design proposed performs these tasks without loading the data in-memory. There are several reasoners and Lint like validators avail ...
Full Report Related posts:Denoising Scheme for Realistic Digital Photos from Unknown Sources...Characterization of Noise in Digital Photographs for Image Processing...

Jul
24

A Feature based on Encoding the Relative Position of a Point in the Character for Online Handwritten Character Recognition

HPL-2007-109 A Feature based on Encoding the Relative Position of a Point in the Character for Online Handwritten Character Recognition - Mandalapu, Dinesh; Murali Krishna, Sridhar
Keyword(s): shape contexts; features; handwriting recognition
Abstract: Feature extraction is a very important step in the process of character recognition. The features extracted from the character should encode the local, global and the structural characteristics of the character shape. In this paper we propose a new feature for recognition of online handwritten chara ...
Full Report Related posts:A Skew-tolerant Strategy and Confidence Measure for k-NN Classification of Online Handwritten Characters...Elastic Matching of Online Handwritten Tamil and Telugu Scripts Using Local Features...Feature of the Week: Character encoding of imported files...Online Handwriting Recognition for Indic Scripts...Hidden Markov Models for Online Handwritten Tamil Word Recognition...Machine Recognition of Online Handwritten Devanagari Characters...