Recovering from Internet Routing Failures using Declarative Policies and Argumentation Protocols
Tuesday, April 13, 2010, 10:15am - Tuesday, April 13, 2010, 11:15am
Many Internet failures are caused by misconﬁgurations of the BGP routers that manage routing of trafﬁc between domains. The problems are usually due to a combination of human errors and the lack of a high-level language for specifying routing policies that can be used to generate router conﬁgurations. We describe an implemented approach that uses a declarative language for specifying network-wide routing policies to automatically conﬁgure routers and show how it can also be used by software agents to diagnose and correct some networking problems. The language is grounded in an ontology deﬁned in OWL and polices expressed in it are automatically compiled into low-level router conﬁgurations. A distributed collection of software agents use the high-level policies and a custom argumentation protocol to share and reason over information about routing failures, diagnose probable causes, and correct them by reconﬁguring routers and/or recommending actions to human operators. We have evaluated the framework in both a simulator and on a small physical network. Our results show that the framework performs well in identifying failure causes and automatically correcting them by reconﬁguring routers when permitted by the policies.