The "Upload picture from your computer:" and "Upload file from your computer:" functionality.
The exported HTML will still reference such files (images, their preview thumbnails, other files) on Rizzoma servers.
You need to parse their URLs from the exported HTML and download them yourself.
All such URLs start with: https://rizzoma.com/r/files/
Only <a> and <img> tags in the exported HTML file can contain such links.
Examples:
<a class="export-file" href="https://rizzoma.com/r/files/214946d2342dba39409d6cf70e9d29a8-1173ca60ad720e6a5e1c4b2cbad21a7e-0-0.8253310453670577"><img src="https://rizzoma.com/r/files/thumbnail/214946d2342dba39409d6cf70e9d29a8-1173ca60ad720e6a5e1c4b2cbad21a7e-0-0.8253310453670577" title="World_Time_Zones_Map.png"></a>
<a class="export-file" href="https://rizzoma.com/r/files/214946d2342dba39409d6cf70e9d29a8-1173ca60ad720e6a5e1c4b2cbad21a7e-3-0.24300116621941625">hello123.txt</a>
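A minimal sketch of the parsing step, using only Python's standard library. It collects every href of an <a> tag and every src of an <img> tag that starts with the prefix given above. The names AttachmentLinkParser and extract_attachment_urls are made up for this illustration and are not part of any Rizzoma tool.

```python
from html.parser import HTMLParser

# Prefix that all attachment URLs share, as stated above.
FILES_PREFIX = "https://rizzoma.com/r/files/"


class AttachmentLinkParser(HTMLParser):
    """Collects attachment URLs from <a href> and <img src> attributes.

    Hypothetical helper written for this example only.
    """

    def __init__(self):
        super().__init__()
        self.urls = []

    def handle_starttag(self, tag, attrs):
        # Only <a> and <img> tags can carry attachment links.
        wanted = {"a": "href", "img": "src"}.get(tag)
        if wanted is None:
            return
        for name, value in attrs:
            if name == wanted and value and value.startswith(FILES_PREFIX):
                # Keep first-seen order and skip duplicates.
                if value not in self.urls:
                    self.urls.append(value)


def extract_attachment_urls(html_text):
    """Return the attachment URLs found in one exported HTML document."""
    parser = AttachmentLinkParser()
    parser.feed(html_text)
    return parser.urls
```

Note that thumbnail URLs (https://rizzoma.com/r/files/thumbnail/...) share the same prefix, so they are returned together with the full-size files. Downloading the collected URLs will most likely require the same session cookie that the export itself uses; that is an assumption, not something verified here.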
Example scripts/programs:
- Store HTML and attachment files with the Python script
Here is a simple Python script to perform the export of topics and attachments using the requests library:
- Install requests and BeautifulSoup (if not already installed): pip install requests beautifulsoup4
- Download the file "export_topics_with_attachments.py".
- Open it in your text editor and fill in the list of TOPIC_IDS. You may also modify the code to read the list from another file.
- Make sure to replace the COOKIE_SID value with the actual value you have.
- Run the script: python3 export_topics_with_attachments.py
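If you prefer to keep the topic IDs in a separate file, as suggested in the steps above, one possible approach is sketched below. It assumes a plain-text file with one topic ID per line; the function name load_topic_ids and the file name topic_ids.txt are hypothetical and do not appear in the script itself.

```python
from pathlib import Path


def load_topic_ids(path):
    """Read topic IDs from a text file, one ID per line.

    Blank lines and lines starting with '#' are skipped.
    Hypothetical helper, not part of the original script.
    """
    ids = []
    for line in Path(path).read_text(encoding="utf-8").splitlines():
        line = line.strip()
        if line and not line.startswith("#"):
            ids.append(line)
    return ids
```

With this helper pasted into the script, the hard-coded list could be replaced by something like TOPIC_IDS = load_topic_ids("topic_ids.txt").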
Code of the file: