CS-GY 6313 - Fall 2025
2025-11-26
Based on materials by Enrico Bertini NYU Tandon School of Engineering
Objects, relationships, and visualization approaches
Network Data = Objects + Relationships + Values

Application Domains: Social networks, biological systems, transportation, web, infrastructure
1. Node-Link Diagrams
2. Adjacency Matrices


Physics simulation for automatic positioning
Physical Analogy:
Result: Clusters emerge, bridges visible, hubs toward center

Steps:
Complexity: O(n²) per iteration (expensive for large networks)


Node Encodings:
Edge Encodings:

Example: Size=age, Color=gender, Thickness=messages, Pattern=old/new friendship
Meaningful node positions instead of algorithmic
When to use fixed instead of force-directed:
Common patterns: Circular, linear, grid, spatial (geographic)
Circular Layout:
Edge Bundling:

Holten, D. (2006). Hierarchical edge bundles. IEEE TVCG.
Geographic Networks: Nodes at real locations (airports, cities, servers)
Problem: Edge clutter!
Solution: Edge bundling reveals corridors, hubs, regional patterns
Tools: qGIS, Gephi, D3.js


Table representation of networks
Encoding:
Advantages: ✓ All nodes visible, ✓ No crossings, ✓ Scalable to denser networks
Disadvantages: ✗ Less intuitive, ✗ Needs reordering, ✗ n² space, ✗ Hard to trace paths


“Hairball”: Dense networks as node-link diagrams = unreadable
Solution: Switch to matrix OR apply clutter reduction
Critical: Matrix ordering reveals patterns!


Directed Matrices:
Alternative: Parallel axes (bipartite-like view)


Five main techniques:

Dunne, C., & Shneiderman, B. (2013). Motif simplification. CHI.
Question: The main advantage of a fixed layout over force-directed is:
A. Less cluttered visualizations B. Node positions encode meaningful data C. Faster computation
Answer: B - Position can carry information (geography, time, category)
Specialized networks with no cycles
Tree: Network with hierarchical structure, no cycles
Properties:
Real-world: File systems, org charts, evolutionary trees, taxonomies, syntax trees

1. Node-Link

2. Space-Filling (Containment)

Top-Down 
File systems, org charts
Radial 
More space-efficient
Indented List 
Most compact
Issues: Scalability (1D growth), labeling, limited encoding channels
Dendrogram: Tree showing hierarchical clustering results
Algorithm (Agglomerative): 1. Start: Each point = own cluster 2. Find two closest clusters 3. Merge them (height = distance) 4. Repeat until one cluster
Properties: - Binary tree structure - Branch height = dissimilarity at merge - Cutting at height defines # of clusters
Used in: Gene expression, customer segmentation, document clustering


Decision Tree: Each node = decision point
Two contexts:
Why visualize: Interpretability, debugging, trust, bias detection


Space-filling approach for large hierarchies
Origin (Ben Shneiderman, 1990): “My hard disk is full - what’s using space?”
Encoding:
Key innovation: Shows BOTH hierarchy AND size


Shneiderman, B. (1992). Tree visualization with tree-maps. ACM TOG.
Problem: Slice-and-Dice creates thin rectangles (bad aspect ratios)
Solution: Squarified algorithm optimizes for square-like shapes
Trade-off: Squarified is more readable but less stable (layout changes with data updates)


Bruls, M. et al. (2000). Squarified treemaps. Springer.
File Systems 
Disk usage tools
Finance 
Stock market heat maps
Code Analysis 
Linux kernel by file type
Applications: Business dashboards, news (Newsmap), analytics, sports
Advantages:
Disadvantages:

Middle ground between node-link and treemaps:
Space efficiency: Treemap > Icicle > Sunburst Hierarchy perception: Icicle ≈ Sunburst > Treemap Familiarity: Treemap > Icicle > Sunburst


Networks (General Graphs):
Trees (Hierarchies):
Key principle: No single “best” method - depends on data, task, audience
Key dimensions to consider:
Decision process: Try multiple approaches → prototype → user test → iterate
Create variants of each technique:
Explore:
Tools: D3.js, Gephi, Cytoscape, NetworkX (Python), igraph (R)
Essential readings:
Advanced topics: Motif simplification (Dunne & Shneiderman), semantic substrates, time-varying networks, multilayer networks
Interactive examples: Observable (D3.js), Gephi tutorials, Graph visualization survey papers
Next class: Week 14 - Final Project Presentations
Questions?