I'm wondering if there's some code or library for getting all urls under a domain. I need to find all urls for a domain.
For example, if my domain is https://stackoverflow.com/ and I'd like to find all question url's like this:
- [Java lib or app to convert CSV to XML file?
- [https://stackoverflow.com/questions/456/what-can-i]
- [https://stackoverflow.com/questions/789/where-can-i]
I don't know about how many questions are under the domain, but I have to create an engine for searching all the urls and then after finding the urls I need to insert the content into my database.
I will create a small search engine for my 5 web pages.
Can anyone help please?
Thanks,