We are in the middle of preparing for a migration from self-hosting to Managed Hosting. Rather than do a full move of the entire application and conversion from Windows/MSSSQL to Linux/Oracle, we are doing a batch export of courses, to be imported by Managed Hosting into a clean Learn installation.
Like many schools that allow course copy and have been doing so for many years, we have lots of courses where Course Files are full of multiple "importedContent" directories, sometimes recursively, so there are dozens or even hundreds of duplicate copies of files. A course archive includes all those duplicates, so archive files can end up being ridiculously large and impossible to restore.
In our testing with GUI course exports (as opposed to archives), the export process actually did cleanup of all those duplicates, so the export file got rid of the "importedContent" folders and only included files/folders which were actually linked. So, this seemed like a great way to do cleanup and generate files that could be imported into Managed Hosting.
Unfortunately, batch course export (using blackboard/apps/content-exchange/bin/batch_ImportExport) does not seem to work the same way. Instead, the generated export files seem to include all the duplicated "importedContent" folders and file, resulting in file sizes that are almost as large as the archives.
The docs for batch export/import/archive/restore for SP14 (https://en-us.help.blackboard.com/Learn/9.1_Older_Versions/9.1_SP_14/Administrator/160_Course_Management/020_Managing_Co… ) show 3 command-line flags for the batch operation:
- Option 1: Specify true or false to indicate whether to archive/export course files found under the course's home directory. If no value is specified, default is true.
- Option 2: Specify true or false to indicate whether to archive/export grade history. If no value is specified, default is false.
- Option 3: Specify true or false to indicate whether to archive/export course files found outside the course's home directory. If no value is specified, default is false.
However, setting all of those to "false" in the batch file does not seem to have any impact on the export file size, or to mimic the behavior of the GUI course export.
Has anyone else doing a batch course export (for Managed Hosting or otherwise) run across this problem and found a way around it?