P2P Search COP5711. 2 P2P Search Techniques Centralized P2P systems e.g. Napster, Decentralized ...

P2P SearchCOP5711

P2P Search Techniques Centralized P2P systems

e.g. Napster, SETI@home

Decentralized & unstructured P2P systems e.g. Gnutella

Hybrid - partially decentralized e.g., Freenet

Structured P2P systems DHT CAN

P2P Network P2P network is an overlay

network built on top of a real physical network (e.g., Internet)

In a P2P network, peers are network nodes connected by virtual or logical links

A logical link is a path through many physical links in the underlying network

Napster server(Central Catalog)

(xyz.mp3, 192.1.2.3)

192.1.2.3

Napster: Publish a File

Users upload their IP address and music titles they wish to share

Users search for peers to download desired files

xyz.mp3 ?

192.1.2.3192.1.2.3

Napster: Query for a File

Central Napster server

File transfer is P2P, using a proprietary protocol

192.1.2.3

xyz.mp3 ?

Napster: Transfer Requested File

Central Napster server

Disadvantage of Centralized Directory

Performance bottleneck

Single point of failure

Can we do it without a directory ?

Decentralized P2P - Gnutella No catalog

Pings network to locate Gnutella peers

File requests are broadcast to peers

Flooding or breadth-first research

When provider is located, the file is transferred via HTTP

Who are my neighbors ?

Gnutella: Join the Network

Peers areInternetedges

Special peer maintained by Gnutella

Pings network

to locate peers

xyz.mp3 ?

Gnutella: Broadcast Request to Peers

Gnutella: Flood the Request (Breadth-first research)

I have it.

xyz.mp3

Gnutella: Reply with the File(via HTTP)

I have it.

Gnutella - Disadvantages Network flooding - unnecessary

network traffic

Using TTL - some files might not be found

Alternatively, using ultranodes (or supernodes)using depth-first search, i.e., Freenet

Morpheus, KazaaFlooding only the Supernodes

Cluster

Center Index for its cluster

Query: “W

ho has

file X”

Reply: “Peer H

file X”

Download file X from Peer H

SupernodeLayer

Using Ultranodes Queries flood only the network of

ultranodes

Other peer nodes shielded from query traffic

Combine the benefits of centralized and decentralized search;

Take advantage of the heterogeneity in peer capabilities;

Freenet - Depth-First Search

Query: “Who has file X”

Peer D might have file X

Peer E might have file X

Reply: “I have file X”

Reply : “Peer E has file X”

Reply : “Peer E

has file X”

Download file X from Peer E

Peer C might

have file X

Freenet – File not Found

Peer D might have file X

Peer E might have file X

Peer C might

have file X F

NOT FOUND !

The requested file not found due to a poor routing decision made at peer D

In this case, query backs out of the dead-end, and tries another peer in depth-first manner

I havefile X

Using Distributed Directory Data objects are everywhere

Distribute subsets of the data directory among peers

If we can find the relevant sub-directory, we can locate the data object

DirectoryData

ObjectsSub-directory

How to Bound Search Space ?Basic Idea - Hashing

Hash key

Object “y”

Objects have hash keys

Peer “x”Peer nodes also have hash keys in the same hash space

P2P Network

y xH(y) H(x)

Join (H(x))Publish (H(y))

Place location information about an object at the peer with closest hash keys (i.e., a distributed directory)

Viewed as a Distributed Hash Table

Hash table0 2128-1

Peer nodes• Each peer node is responsible for a range of

the hash table, according to the peer hash key

• Location information about Objects are placed in the peer with the closest key (information redundancy)

How to Find an Object ?Looks for a peer /w the corresponding peer hash key

A peer knows its logical neighbors Find peer X based on multihop routing X knows who has the object

Hashtable

0 2128-1

Peernode X

Peer Y has the file

Dynamic Hash Table (DHT) in action

DHT in action

DHT in action: put()

insert(K1,V1)

Operation: Route message, “I have the file,” to node holding key K1

Want to share a

(K1,V1)

K VK V

DHT in action: put()

Operation: take key as input; route messages to node holding key

retrieve (K1)

K VK V

DHT in action: get()

Operation: Retrieve message V1 at node holding key K1

DHT in action

Retrieve file according to V1

Still Flooding

Still flood the network although intermediate nodes do not need to search

Can we avoid flooding ?

CAN – Content Addressable Network Each peer is

responsible for one zone, i.e., stores all (key, value) pairs of the zone

Each peer knows the neighbors of its zone

Random assignment of peers to zones at startup – split zone if not empty

Dimensional-ordered multihop routing

CAN: Object Publishing

node I::publish(K,V) I

(1) a = hx(K)

CAN: Object Publishingx = a

(1) a = hx(K) b = hy(K)

CAN: Object Publishingx = a

(1) a = hx(K) b = hy(K)

(2) route (K,V) -> J

(3) J stores (K,V)

(1) a = hx(K) b = hy(K)

(2) route “retrieve(K)” to J that is in charge of (a,b)

(K,V)(1) a = hx(K) b = hy(K)

node I::retrieve(K)

CAN: Object Retrieval

Maintenance

Inform neighbors that you are alive at discrete time interval t

If your neighbor does not send alive message in time t, takeover its zone

P2P Benefits Efficient use of resources

Use unused bandwidth, storage, and processing power at the edge of the network

Scalability Consumers of resources also donate resources

Reliability Replicas, geographic distribution No single point of

failure Ease of administration

Self organized nodes Built-in reliability and load balancing

Some Prototypes at UCF iSEE (Internet-scale Sensor Exploration Environement)Publishing real-time sensor data

Browsing and querying real-time sensor data

P2P Video Streaming for VoD and Live Broadcast Applications

P2P Search COP5711. 2 P2P Search Techniques Centralized P2P systems e.g. Napster, Decentralized ...

Documents

P2P Incentives

CMSC 332 Computer Networks P2P and Socketsdszajda/classes/cs332/...CMSC 332: Computer Networks P2P: Searching for Information File sharing (e.g., e-mule) • Index dynamically tracks

Peer to Peer Technologies. Outline What is P2P? P2P architectures Examples of P2P system (P2P applications) P2P data management techniques Conclusions

P2P Education - ndsu.eduxuchu/P2P...P2P-Education is an interactive teaching-learning system. Both web- and Windows-based versions of P2P-Education have been developed. The software

P2P Systems - cs.jhu.edubaruch/teaching/600.447/class-slides/P2P/P2P... · P2P Systems Keith W. Ross ... . 40 Gnutella overlay management UNew node uses bootstrap node to get IP

THUP: A P2P Network Robust to Churn and DoS Attack based ...simulations show that THUP substantially improves the stability ... P2P networks (e.g., Napster), the original Gnutella

CMSC 332 Computer Networks P2P and Socketsdszajda/classes/... · CMSC 332: Computer Networks P2P: Searching for Information File sharing (e.g., e-mule) • Index dynamically tracks

Black-box analysis of Internet P2P applications · the SopCast P2P-TV application, we explore a wider range of channels featuring different content (e.g., from football matches to

1 P2P Computing. 2 What is P2P? Server-Client model

ISP-Aided Neighbor Selection for P2P Systems · 3 P2P from an ISPs view Good: P2P applications fill a void P2P applications are easy to develop and deploy P2P applications spur broadband

SERIR P2P - deasecurity.com€¦ · SERIR P2P P2P 1.1.2 THE CONTROL UNIT Nerve centre of SERIR P2P system, the Control Unit is composed of a polyester cabinet (part number BOX-P2P)

On peer-to-peer (P2P) content delivery - microsoft.comhybrid P2P network has a central entity (server) which renders certain central functionality of the service, e.g., keeps track

Multidatabase Transaction Management COP5711. Multidatabase Transaction Management Outline Review - Transaction Processing Multidatabase Transaction Management

P2P Transformation Summitapn.today/attachments/article/1385/P2P Transformation...P2P Transformation Summit 2015 APN Annual P2P Summit The Waldorf Hotel, London 9th June 2015 Drive

P2P Systems - Polyross/tutorials/P2P... · 2004. 5. 7. · 13 P2P file sharing UAlice runs P2P client application on her notebook computer UIntermittently connects to Internet; gets

P2P Computing MIRA YUN September 16, 2005. Outline What is P2P P2P taxonomies Characteristics Different P2P systems Conclusion

Cs423-cotter1 P2P Discovering P2P (Miller) Internet

Unstructured vs. Structured P2P systems Peer-to-Peer Systemsheim.ifi.uio.no/michawe/teaching/p2p-ws08/p2p-2-6.pdf · Current P2P Content Distribution Systems • Most current P2P

Advanced Partitioning Techniques for Massively Distributed ...kienhua/classes/COP5711/Papers/CloudDB1.pdf · Advanced Partitioning Techniques for Massively Distributed Computation

PARALLEL DATABASE TECHNOLOGY - University of ...kienhua/classes/COP5711/ParallelDB.pdfPARALLEL DATABASE TECHNOLOGY Kien A. Hua School of Computer Science University of Central Florida