从
www2007上的论文,我们可以大致看出来,近两年的热点(学术和商业)在数据挖掘、搜索以及与语义网方面。关于语义网,我个人更多的理解是现在半结构化甚至无结构化的www在某种程度上阻碍着很多东西的实现,而语义网则是在寻找一种突破。
- Track: Browsers and User Interfaces
- Session: Personalization
- Homepage Live: Automatic Block Tracing for Web Personalization
- Open User Profiles for Adaptive News Systems: Help or Harm?
- Investigating Behavioral Variability in Web Search
- Session: Smarter Browsing
- CSurf: A Context-Driven Non-Visual Web-Browser
- GeoTracker: Geospatial and Temporal RSS Navigation
- Learning Information Intent via Observation
- Session: Personalization
- Track: Data Mining
- Session: Mining in Social Networks
- Wherefore Art Thou R3579X? Anonymized Social Networks Hidden Patterns and Structural Steganography
- Information Flow Modeling based on Diffusion Rate for Prediction and Ranking
- NetProbe: A Fast and Scalable System for Fraud Detection in Online Auction Networks
- Session: Mining Textual Data
- Summarizing Email Conversations with Clue Words
- Organizing and Searching the World Wide Web of Facts - Step Two: Harnessing the Wisdom of the Crowds
- Do Not Crawl in the DUST: Different URLs with Similar Text
- Session: Predictive Modeling of Web Users
- Demographic Prediction based on User’s Browsing Behavior
- Why We Search? Visualizing and Predicting User Behavior
- Topic Sentiment Mixture: Modeling Facets and Opinions in Weblogs
- Session: Identifying Structure in Web Pages
- Page-level Template Detection via Isotonic Smoothing
- Towards Domain Independent Information Extraction from Web Tables
- Web Object Retrieval
- Session: Similarity Search
- New Suffix Tree Similarity Measure for Document Clustering
- Scaling Up All-Pairs Similarity Search
- Detecting Near-Duplicates for Web Crawling
- Session: Mining in Social Networks
- Track: E* Applications
- Session: E-Communities
- The Complex Dynamics of Collaborative Tagging
- Expertise Networks in Online Communities: Structure and Algorithms
- Internet-Scale Collection of Human Reviewed Data
- Session: E-Commerce and E-Content
- DETECTIVES: DETEcting Coalition hiT Inflation attacks in adVertising Networks Streams
- Extraction and Search of Chemical Formulae in Text Documents on the Web
- A Content-Driven Reputation System for the Wikipedia
- Session: E-Communities
- Track: Industrial Practice & Experience
- Session: IPE
- Google News Personalization: Scalable Online Collaborative Filtering
- Exploring in the Weblog Space by Detecting Informative and Affective Articles
- Spam Double-Funnel: Connecting Web Spammers with Advertisers
- Session: IPE
- Track: Performance and Scalability
- Session: Scalable Systems for Dynamic Content
- GlobeTP: Template-Based Database Replication for Scalable Web Applications
- Consistency-preserving Caching of Dynamic Database Content
- Optimized Query Planning of Continuous Aggregation Queries in Dynamic Data
- Dissemination Networks
- Session: Performance Engineering of Web Applications
- A Scalable Application Placement Controller for Enterprise Data Centers
- A Unified Platform for Data Driven Web Applictions with Automatic Client-Server Partitioning
- Is High-Quality VoD Feasible using P2P Swarming?
- Session: Scalable Systems for Dynamic Content
- Track: Pervasive Web and Mobility
- Session: Pervasive Web and Mobility
- Robust Web Page Segmentation for Mobile Terminal Using Content-Distances and Page Layout Information
- PRIVE: Anonymous Location-Based Queries in Distributed Mobile Systems
- A Mobile Application Framework for the Geospatial Web
- Session: Pervasive Web and Mobility
- Track: Search
- Session: Crawlers
- The Discoverability of the Web
- Combining Classifiers to Identify Online Databases
- An Adaptive Crawler for Locating Hidden-Web Entry Points
- Session: Web Graph
- Random Web Crawls
- Extraction and classification of dense communities in the Web
- Web Projections: Learning from Contextual Subgraphs of the Web
- Session: Search Quality and Precision
- Supervised Rank Aggregation
- Navigating the intranet with high precision
- Optimizing Web Search Using Social Annotation
- Session: Knowledge Discovery
- Compare&Contrast: Using the Web to Discover Comparable Cases for News Stories
- Answering Bounded Continuous Search Queries in the World Wide Web
- Answering Relationship Queries on the Web
- Session: Advertisements and Click Estimates
- Robust Methodologies for Modeling Web Click Distributions
- Predicting Clicks: Estimating the Click-Through Rate for New Ads
- Dynamics of bid optimization in online advertisement auctions
- Session: Search Potpourri
- Navigation-Aided Retrieval
- Efficient Search Engine Measurements
- Efficient Search in Large Textual Collections with Redundancy
- Session: Personalization
- Dynamic Personalized Pagerank in Entity-Relation Graphs
- A Large-scale Evaluation and Analysis of Personalized Search Strategies
- Privacy-Enhancing Personalized Web Search
- Session: Crawlers
- Track: Security, Privacy, Reliability and Ethics
- Session: Passwords and Phishing
- CANTINA: A Content-Based Approach to Detecting Phishing Web Sites
- Learning to Detect Phishing Emails
- A Large-Scale Study of Web Password Habits
- Session: Defending Against Emerging (and Emerged) Threats
- Defeating Script Injection Attacks with Browser-Enforced Embedded Policies
- Subspace: Secure Cross-Domain Communication for Web Mashups
- Exposing Private Information by Timing Web Applications
- On Anonymizing Query Logs via Token-based Hashing
- Session: Access Control and Trust on the Web
- A Fault Model and Mutation Testing of Access Control Policies
- Analyzing Web Access Control Policies
- Compiling Cryptographic Protocols for Deployment on the Web
- Session: Passwords and Phishing
- Track: Semantic Web
- Session: Ontologies
- Yago: A Core of Semantic Knowledge - Unifying WordNet and Wikipedia
- Ontology Summarization Based on RDF Sentence Graph
- Just the Right Amount: Extracting Modules from Ontologies
- Session: Similarity and Extraction
- Measuring Semantic Similarity between Words Using Web Search Engines
- Using Google Distance to weight approximate ontology matches
- Hierarchical Perceptron-like Learning for Ontology-Based Information Extraction
- Session: Query Languages and DBs
- From SPARQL to Rules (and back)
- SPARQ2L: Towards Support For Subgraph Extraction Queries in RDF Databases
- Bridging the Gap Between OWL and Relational Databases
- ActiveRDF: Object-Oriented Semantic Web Programming
- Session: Applications
- Towards Expressive Syndication on the Web
- Exhibit: Light-weight Structured Data Publishing
- Explorations in the use of Semantic Web Technologies for Product Information Management
- Session: Semantic Web and Web 2.0
- The two cultures — position paper
- Analysis of Topological Characteristics of Huge Online Social Networking Services
- P-TAG: Large Scale Automatic Generation of Personalized Annotation TAGs for the Web
- Session: Ontologies
- Track: Technology for Developing Regions
- Session: Communication in Developing Regions
- Connecting the bottom of the pyramid: an exploratory case study of India’s rural communication environment
- Communication as Information-Seeking: The Case for Mobile Social Software for Developing Regions
- Optimal Audio-Visual Representations for Illiterate Users of Computers
- Session: Networking Issues in the Web
- Identifying and Discriminating Between Web and Peer-to-Peer Traffic in the Network Core
- Long Distance Wireless Mesh Network Planning: Problem Formulation and Solution
- MyXDNS: A Request Routing DNS Server With Decoupled Server Selection
- Session: Communication in Developing Regions
- Track: Web Engineering
- Session: Web Modeling
- Turning portlets into services: introducing the organization profile
- A Framework for Rapid Integration of Presentation Components
- Integrating Value-based Requirement Engineering Models to WebML using VIP
- Business Modeling Framework
- Session: End-Users Perspective and Measurement in Web Engineering
- Towards Effective Browsing of Large Scale Social Annotations
- Supporting End-Users in the Creation of Dependable Web Clips
- Effort Estimation: How Valuable is it for a Web company to Use a Cross-company
- Data Set Compared to Using Its Own Single-company Data Set?
- Session: Web Modeling
- Track: Web Services
- Session: Orchestration & Choreography
- Towards the Theoretical Foundation of Choreography
- Introduction and Evaluation of Martlet: A Scientific Workflow Language for
- Abstracted Parallelisation
- Semi-Automated Adaptation of Service Interactions
- Session: SLAs and QoS
- Reliable QoS Monitoring Based on Client Feedback
- Preference-based Selection of Highly Configurable Web Services
- Speeding up Adaptation of Web Service Compositions Using Expiration Times
- DIANE - An Integrated Approach to Automated Service Discovery Matchmaking and Composition
- Session: Orchestration & Choreography
- Track: XML and Web Data
- Session: Querying and Transforming XML
- Multiway SLCA-based Keyword Search in XML Data
- Visibly Pushdown Automata for Streaming XML
- Mapping-Driven XML Transformation
- Session: Parsing, Normalizing, and Storing XML
- Querying and Maintaining a Compact XML Storage
- XML Design for Relational Storage
- A High-Performance Interpretive Approach to Schema-Directed Parsing
- Session: Querying and Transforming XML