TLDR The redaction hypothesis tested in a controlled environment showed Presidio redacted 97% of the target name in the dataset. The [Redact, NER, Analyze] pattern was implemented against “Moby Dick” to remove all instances of Ahab. The outcome suggests a valid testing pipeline, easily abstracted to sparknlp for exploration at scale…