The proposed Ever Learning Censorware addresses limitations of existing systems by continuously learning and adapting to new threats. It overcomes overblocking and underblocking issues by using a Bayesian classifier to evaluate the harm level of websites based on content keywords and their associated probabilities. Key components include:
The system uses Bayes theorem to calculate the probability of a website being harmful based on the presence of certain keywords. It also stores the latest content of critical components in stable storage to ensure data safety. The censorware is implemented in Java and uses libraries for network analysis and machine learning, ensuring compatibility across different operating systems.
Ljiljana Trajkovic, John Jose, J. Jayakumari, Maurizio Palesi