Skip to content

hbz/to.science.scripts

Folders and files

NameName
Last commit message
Last commit date

Latest commit

a2ed30e · Mar 25, 2025
Nov 20, 2024
Feb 6, 2025
Jun 25, 2024
Jan 23, 2024
Jan 23, 2024
Mar 25, 2025
Mar 20, 2025
Feb 6, 2025
Mar 18, 2025
Dec 19, 2019
Sep 25, 2024
Feb 6, 2025
Feb 6, 2025
Jan 23, 2024
Apr 17, 2018
Jan 23, 2024
Jan 23, 2024
Aug 25, 2023
May 11, 2023
Mar 18, 2025
Jun 25, 2024
Mar 18, 2025
Jan 23, 2024
Nov 20, 2024
Oct 30, 2023
Nov 27, 2023
Feb 6, 2025
Dec 15, 2022
Jan 23, 2024
Dec 19, 2019
Jan 26, 2018
Dec 19, 2019
Feb 6, 2025
Jul 11, 2024
Feb 6, 2025
Jan 23, 2024
Sep 25, 2024
Nov 26, 2020
Mar 18, 2025
Mar 18, 2025
Nov 29, 2021
May 17, 2023
Jan 26, 2018
Jan 26, 2018
Oct 30, 2023
Oct 30, 2023
Mar 18, 2025
Jan 23, 2024
Jan 26, 2018
Apr 17, 2018
Jan 23, 2024
Jan 26, 2018
Nov 7, 2024

Repository files navigation

Installation to.science.scripts

cd /opt/toscience  
git clone https://github.com/hbz/to.science.scripts.git  bin
cd bin

Edit variables and adjust to your own settings

Use same variables as already edited in to.science.install

ln -s ../to.science.install/conf/variables.conf

Create soft links

cd /opt/toscience/bin  
ln -s /opt/toscience/bin/cdn cdn  

Define cron jobs

Sample crontab:
# For more information see the manual pages of crontab(5) and cron(8)
#
# m h dom mon dow command
0 2 * * * /opt/toscience/bin/turnOnOaiPmhPolling.sh
0 5 * * * /opt/toscience/bin/turnOffOaiPmhPolling.sh
05 7 * * * /opt/toscience/bin/register_urn.sh control >> /opt/toscience/logs/control_urn_vergabe.log
1 1 * * * /opt/toscience/bin/register_urn.sh katalog >> /opt/toscience/logs/katalog_update.log
1 0 * * * /opt/toscience/bin/register_urn.sh register >> /opt/toscience/logs/register_urn.log
0 5 * * * /opt/toscience/bin/updateAll.sh > /dev/null
#0 23 * * * /opt/toscience/bin/loadCache.sh
0 1 * * * /opt/toscience/bin/import-logfiles.sh >/dev/null
# Start Edoweb Webgatherer Sequenz
0 20 * * * /opt/toscience/bin/runGatherer.sh >> /opt/toscience/logs/runGatherer.log
# Auswertung des letzten Webgatherer-Laufs
0 21 * * * /opt/toscience/bin/evalWebgatherer.sh >> /opt/toscience/logs/runGatherer.log
# Verschieben von Dateien aus dem Arbeitsverzeichnis von wpull ins Outputverzeichnis von wpull
0 22 * * * /opt/toscience/bin/ks.move_files_from_crawldir.sh >> /opt/toscience/logs/ks.move_files_from_crawldir.log
# Indexierung neu geharvesteter Webschnitte
0 2 * * * /opt/toscience/bin/backup-es.sh -c >> /opt/toscience/logs/backup-es.log 2>&1
30 2 * * * /opt/toscience/bin/backup-es.sh -b >> /opt/toscience/logs/backup-es.log 2>&1
0 2 * * * /opt/toscience/bin/backup-db.sh -c >> /opt/toscience/logs/backup-db.log 2>&1
30 2 * * * /opt/toscience/bin/backup-db.sh -b >> /opt/toscience/logs/backup-db.log 2>&1
0 2 * * * /opt/toscience/bin/depersonalize-apache-logs.sh
# Crawl Reports
0 22 * * * /opt/toscience/bin/crawlReport.sh >> /opt/toscience/logs/crawlReport.log