Skip to content

Architecture PARADISEC

silviapfeiffer edited this page Aug 8, 2011 · 7 revisions

The architecture of the system is given in the diagram at https://github.com/nabu-catalog/nabu/blob/master/doc/workflow.pdf .

Input into PARADISEC

  • metadata can be imported from METS (Metadata Encoding and Transmission Standard) files and http://en.wikipedia.org/wiki/METS and CSV files either per item or per collection; we will create example files which can be adapted by the user [import.php is currently used]

  • metadata will be extracted from the audio or video essence files mostly to gain encoding metadata, but possibly also ID3 tags and similar semantic metadata; this process runs on the ingested essence files [file_adder.php (for WAV) and non-audio_fileadder.php are currently used]

  • metadata for items and collections will be completed manually through the PARADISEC catalog

  • the essence files will be held in specific locations where the PARADISEC catalog can access them: for ingestion (upload in any format and transcoded to standard format), Web preview (transcoded), archiving (transcoded and sealed), and backup (copied off-site)

Export from PARADISEC

  • metadata for an item / collection can be exported to a METS or CSV file for storage with the archived audio, video and other essence files [download_csv.php and collection_xml.php are currently used]

  • specific metadata is also exported to an imp.xml file which is required for the sealing process to BWF and XMF [diverse versions of bwf.php are currently used]

Audio and video handling

  • audio and video is provided in diverse formats and converted to MJPEG2000 + MP4 by external company and WAV + MP3 by Dobbin software run internally (http://www.cube-tec.com/dobbin/fpus/encoding.html)

  • MP4 is converted/transcoded to shorter MP4 and WebM extract for Web publishing

  • MP3 is converted/transcoded to shorter MP3 and Ogg Vorbis extract for Web publishing

  • imp.xml file and WAV are used to seal audio to BWF file using internally run software, and MJPEG2000 to seal video to MXF file (special software that we run?)

  • BWF and MXF and METS files are archived (Uni Sydney) and backed up off-site (NCI)

  • Web MP4, WebM, MP3 and Ogg Vorbis files (extracts and full files) are served on Web server (Uni Sydney)

Notifications and Tracking

  • PARADISEC catalog only tracks what processes have started/finished and accepts notifications from the processes through a REST API Web service