describingWebArchives | Automating description for Web Archives | Continuous Backup library
kandi X-RAY | describingWebArchives Summary
kandi X-RAY | describingWebArchives Summary
Automating description for Web Archives in ArchivesSpace using the Archive-It CDX and Partner Data APIs
Support
Quality
Security
License
Reuse
Top functions reviewed by kandi - BETA
- Serialize the output to JSON .
- Pretty print output .
describingWebArchives Key Features
describingWebArchives Examples and Code Snippets
Community Discussions
Trending Discussions on Continuous Backup
QUESTION
ANSWER
Answered 2022-Feb-22 at 10:59I am not sure if you have seen this message in the portal when you created the account/also mentioned in the doc
"You will not be able to switch between the backup policies after the account has been created"
since you need to select either "Periodic" or "Continuous" at the creation of Cosmos Account, it becomes mandatory.
Update:
You will not see the above in portal anymore, you can Switch from "Periodic" to "Continous" on an existing account and that cannot be reverted. You can read more here.
QUESTION
What would be the consistency of the continuous backup of the write region if the database is using bounded staleness consistency? Will it be equivalent to strong consistent data assuming no failovers happened?
Thanks Guru
...ANSWER
Answered 2021-Nov-25 at 17:15Backups made from any secondary region will have data consistency defined by the guarantees provided by the consistency level chosen. In the case of strong consistency, all secondary region backups will have completely consistent data.
Bounded staleness will have data that may have stale or inconsistent data inside the defined staleness window (minimum 300 seconds or 100k writes). Outside of that staleness window the data will be consistent.
Data for the weaker consistency levels will have no guarantees for consistency from backups in secondary regions.
QUESTION
MongoDB has deprecated the continuous back up of data. It has recommended using CPS (Cloud provider snapshots). As far as I understood, snapshots isn't really going to be effective compared to continuous backup coz, if system breaks, then we can only be able to restore the data till the previous snapshot which isn't gonna make the database up-to-date or close to it atleast.
Am I missing something here in my understanding?
...ANSWER
Answered 2020-May-19 at 10:12Cloud provider snapshots can be combined with point in time restore to give the recovery point objective you require. With oplog based restores you can get granularity of one second.
Community Discussions, Code Snippets contain sources that include Stack Exchange Network
Vulnerabilities
No vulnerabilities reported
Install describingWebArchives
Change to the archives_tools directory and install the library (this will also install requests and configparser dependencies). cd archives_tools python setup.py install
Install Beautiful Soup 4 pip install beautifulsoup4
Clone the describing WebArchives repo git clone https://github.com/UAlbanyArchives/describingWebArchives
Change into repo directory cd .. (if still in archives_tools directory) cd describingWebArchives
All scripts require a local_settings.cfg text file that contains login credentials for both ArchivesSpace and Archive-It as well as some additional params. An example is provided in the repo. This is modeled after how I've seen a number of places store credentials for the ASpace API with the addition of an Archive-It section.
Use local_settings-example.cfg as a template
baseURL is URL of your ASpace instance with 8089 as the port to access the backend API
repository is the ASpace repository you'd like to update, default is 2
user and password are ASpace credentials with API permissions
account is your Archive-It partner ID. UAlbany's is 652
user and password are your Archive-It credentials
target_subject is the local subject that must be assigned to Web Archives Records you want to update
subject_source limits target subjects to a certain source such as "local"
extent_type is the lable for the extent that will be updated in ArchivesSpace, make sure this extent present in your ASpace controlled values list or it will fail
access_requirements this is a generic Access Restrictions note
warc_restrict_note is a separate Access Restrictions note applied for records of WARC files. This lets you apply an additional restriction warning for WARC file requests.
acqinfo_note this is a generic Acquisition Information note that will be added to web archives parent records if one is not already present.
general_internet_archive_note this is a Acquisition Information note applied to records that are in the general Internet Archive Collections, essentially designed to say why there is limited provenance information for these.
Requires a local subject denoted in local_settings.cfg as target_subject.
Subject can be assigned to an web archives record, resource or archival object.
Record must have a Physical Characteristics and Technical Requirements note with the label "URL" and the original URL of the website you are describing as a subnote.
Support
Reuse Trending Solutions
Find, review, and download reusable Libraries, Code Snippets, Cloud APIs from over 650 million Knowledge Items
Find more librariesStay Updated
Subscribe to our newsletter for trending solutions and developer bootcamps
Share this Page