Errors using ps2ascii on some files
What does FC_WEIGHT
refer to? Please advise: Although a text file was produced it is large and consists largely of numbers which makes it hard to proofread. I need relatively good confidence the output matches the input. If there is a fix please point me to it and bring joy to my dull drab existence.
entered the command
ps2ascii /Users/dwstclair/Desktop/untitled3/stmt_20181130.pdf a.txt
The result was:
DEBUG: FC_WEIGHT didn't match
On the off chance a default font was missing on my system
I added DroidSansFallback.ttf (no joy)
operating-system ghostscript
add a comment |
What does FC_WEIGHT
refer to? Please advise: Although a text file was produced it is large and consists largely of numbers which makes it hard to proofread. I need relatively good confidence the output matches the input. If there is a fix please point me to it and bring joy to my dull drab existence.
entered the command
ps2ascii /Users/dwstclair/Desktop/untitled3/stmt_20181130.pdf a.txt
The result was:
DEBUG: FC_WEIGHT didn't match
On the off chance a default font was missing on my system
I added DroidSansFallback.ttf (no joy)
operating-system ghostscript
add a comment |
What does FC_WEIGHT
refer to? Please advise: Although a text file was produced it is large and consists largely of numbers which makes it hard to proofread. I need relatively good confidence the output matches the input. If there is a fix please point me to it and bring joy to my dull drab existence.
entered the command
ps2ascii /Users/dwstclair/Desktop/untitled3/stmt_20181130.pdf a.txt
The result was:
DEBUG: FC_WEIGHT didn't match
On the off chance a default font was missing on my system
I added DroidSansFallback.ttf (no joy)
operating-system ghostscript
What does FC_WEIGHT
refer to? Please advise: Although a text file was produced it is large and consists largely of numbers which makes it hard to proofread. I need relatively good confidence the output matches the input. If there is a fix please point me to it and bring joy to my dull drab existence.
entered the command
ps2ascii /Users/dwstclair/Desktop/untitled3/stmt_20181130.pdf a.txt
The result was:
DEBUG: FC_WEIGHT didn't match
On the off chance a default font was missing on my system
I added DroidSansFallback.ttf (no joy)
operating-system ghostscript
operating-system ghostscript
edited Jan 1 at 14:07


Klaus Gütter
2,56721421
2,56721421
asked Jan 1 at 13:09
Douglas W St. ClairDouglas W St. Clair
1
1
add a comment |
add a comment |
1 Answer
1
active
oldest
votes
Basically, I wouldn't use ps2ascii. Its long been deprecated and doesn't even ship in more recent versions of Ghostscript.
Instead consider using the txtwrite device. It works with a wider range of input (in particular it can use ToUnicode CMaps in PDF files, which ps2ascii cannot) and is capable of producing output in other than ASCII, which is quite useful. Even if you aren't working with non-Latin languages, the ability to preserve ligatures (eg fi, ffi, ffl etc) is convenient.
The actual answer to your question is 'don't worry about it'.
FC_WEIGHT refers to the weight of a font (light, bold, regular, ExtraBold etc). This message can only arise when you are using FontConfig, and Ghostscript is enumerating the available fonts from font config, trying to find a match for a missing font in the input. This means that a candidate font did not match the target font's weight.
Since you aren't going to use the font, it doesn't affect you.
What would you suggest to replace for converting pdf > txt?
– Douglas W St. Clair
Jan 2 at 14:13
As I said in my answer, consider using the Ghostscript txtwrite device. The original ps2ascii only really deals with PostScript input, and is limited in the Encodings it can handle. The txtwrite device is usable with any of the languages the Ghostsceritp family supports, can use ToUnicode CMaps and has a series of fallbacks to try and figure out the real text. It also has a few different output options so you can recover the data in different ways. There are other tools which will do a similar job with PDF as well, such as MuPDF, and I'm sure there are more if you only want to deal with PDF.
– KenS
Jan 2 at 14:50
Thanks I don't know how I missed it.
– Douglas W St. Clair
Jan 4 at 1:52
add a comment |
Your Answer
StackExchange.ifUsing("editor", function () {
StackExchange.using("externalEditor", function () {
StackExchange.using("snippets", function () {
StackExchange.snippets.init();
});
});
}, "code-snippets");
StackExchange.ready(function() {
var channelOptions = {
tags: "".split(" "),
id: "1"
};
initTagRenderer("".split(" "), "".split(" "), channelOptions);
StackExchange.using("externalEditor", function() {
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled) {
StackExchange.using("snippets", function() {
createEditor();
});
}
else {
createEditor();
}
});
function createEditor() {
StackExchange.prepareEditor({
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: true,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: 10,
bindNavPrevention: true,
postfix: "",
imageUploader: {
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
},
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
});
}
});
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53995701%2ferrors-using-ps2ascii-on-some-files%23new-answer', 'question_page');
}
);
Post as a guest
Required, but never shown
1 Answer
1
active
oldest
votes
1 Answer
1
active
oldest
votes
active
oldest
votes
active
oldest
votes
Basically, I wouldn't use ps2ascii. Its long been deprecated and doesn't even ship in more recent versions of Ghostscript.
Instead consider using the txtwrite device. It works with a wider range of input (in particular it can use ToUnicode CMaps in PDF files, which ps2ascii cannot) and is capable of producing output in other than ASCII, which is quite useful. Even if you aren't working with non-Latin languages, the ability to preserve ligatures (eg fi, ffi, ffl etc) is convenient.
The actual answer to your question is 'don't worry about it'.
FC_WEIGHT refers to the weight of a font (light, bold, regular, ExtraBold etc). This message can only arise when you are using FontConfig, and Ghostscript is enumerating the available fonts from font config, trying to find a match for a missing font in the input. This means that a candidate font did not match the target font's weight.
Since you aren't going to use the font, it doesn't affect you.
What would you suggest to replace for converting pdf > txt?
– Douglas W St. Clair
Jan 2 at 14:13
As I said in my answer, consider using the Ghostscript txtwrite device. The original ps2ascii only really deals with PostScript input, and is limited in the Encodings it can handle. The txtwrite device is usable with any of the languages the Ghostsceritp family supports, can use ToUnicode CMaps and has a series of fallbacks to try and figure out the real text. It also has a few different output options so you can recover the data in different ways. There are other tools which will do a similar job with PDF as well, such as MuPDF, and I'm sure there are more if you only want to deal with PDF.
– KenS
Jan 2 at 14:50
Thanks I don't know how I missed it.
– Douglas W St. Clair
Jan 4 at 1:52
add a comment |
Basically, I wouldn't use ps2ascii. Its long been deprecated and doesn't even ship in more recent versions of Ghostscript.
Instead consider using the txtwrite device. It works with a wider range of input (in particular it can use ToUnicode CMaps in PDF files, which ps2ascii cannot) and is capable of producing output in other than ASCII, which is quite useful. Even if you aren't working with non-Latin languages, the ability to preserve ligatures (eg fi, ffi, ffl etc) is convenient.
The actual answer to your question is 'don't worry about it'.
FC_WEIGHT refers to the weight of a font (light, bold, regular, ExtraBold etc). This message can only arise when you are using FontConfig, and Ghostscript is enumerating the available fonts from font config, trying to find a match for a missing font in the input. This means that a candidate font did not match the target font's weight.
Since you aren't going to use the font, it doesn't affect you.
What would you suggest to replace for converting pdf > txt?
– Douglas W St. Clair
Jan 2 at 14:13
As I said in my answer, consider using the Ghostscript txtwrite device. The original ps2ascii only really deals with PostScript input, and is limited in the Encodings it can handle. The txtwrite device is usable with any of the languages the Ghostsceritp family supports, can use ToUnicode CMaps and has a series of fallbacks to try and figure out the real text. It also has a few different output options so you can recover the data in different ways. There are other tools which will do a similar job with PDF as well, such as MuPDF, and I'm sure there are more if you only want to deal with PDF.
– KenS
Jan 2 at 14:50
Thanks I don't know how I missed it.
– Douglas W St. Clair
Jan 4 at 1:52
add a comment |
Basically, I wouldn't use ps2ascii. Its long been deprecated and doesn't even ship in more recent versions of Ghostscript.
Instead consider using the txtwrite device. It works with a wider range of input (in particular it can use ToUnicode CMaps in PDF files, which ps2ascii cannot) and is capable of producing output in other than ASCII, which is quite useful. Even if you aren't working with non-Latin languages, the ability to preserve ligatures (eg fi, ffi, ffl etc) is convenient.
The actual answer to your question is 'don't worry about it'.
FC_WEIGHT refers to the weight of a font (light, bold, regular, ExtraBold etc). This message can only arise when you are using FontConfig, and Ghostscript is enumerating the available fonts from font config, trying to find a match for a missing font in the input. This means that a candidate font did not match the target font's weight.
Since you aren't going to use the font, it doesn't affect you.
Basically, I wouldn't use ps2ascii. Its long been deprecated and doesn't even ship in more recent versions of Ghostscript.
Instead consider using the txtwrite device. It works with a wider range of input (in particular it can use ToUnicode CMaps in PDF files, which ps2ascii cannot) and is capable of producing output in other than ASCII, which is quite useful. Even if you aren't working with non-Latin languages, the ability to preserve ligatures (eg fi, ffi, ffl etc) is convenient.
The actual answer to your question is 'don't worry about it'.
FC_WEIGHT refers to the weight of a font (light, bold, regular, ExtraBold etc). This message can only arise when you are using FontConfig, and Ghostscript is enumerating the available fonts from font config, trying to find a match for a missing font in the input. This means that a candidate font did not match the target font's weight.
Since you aren't going to use the font, it doesn't affect you.
answered Jan 1 at 15:11
KenSKenS
22.5k12132
22.5k12132
What would you suggest to replace for converting pdf > txt?
– Douglas W St. Clair
Jan 2 at 14:13
As I said in my answer, consider using the Ghostscript txtwrite device. The original ps2ascii only really deals with PostScript input, and is limited in the Encodings it can handle. The txtwrite device is usable with any of the languages the Ghostsceritp family supports, can use ToUnicode CMaps and has a series of fallbacks to try and figure out the real text. It also has a few different output options so you can recover the data in different ways. There are other tools which will do a similar job with PDF as well, such as MuPDF, and I'm sure there are more if you only want to deal with PDF.
– KenS
Jan 2 at 14:50
Thanks I don't know how I missed it.
– Douglas W St. Clair
Jan 4 at 1:52
add a comment |
What would you suggest to replace for converting pdf > txt?
– Douglas W St. Clair
Jan 2 at 14:13
As I said in my answer, consider using the Ghostscript txtwrite device. The original ps2ascii only really deals with PostScript input, and is limited in the Encodings it can handle. The txtwrite device is usable with any of the languages the Ghostsceritp family supports, can use ToUnicode CMaps and has a series of fallbacks to try and figure out the real text. It also has a few different output options so you can recover the data in different ways. There are other tools which will do a similar job with PDF as well, such as MuPDF, and I'm sure there are more if you only want to deal with PDF.
– KenS
Jan 2 at 14:50
Thanks I don't know how I missed it.
– Douglas W St. Clair
Jan 4 at 1:52
What would you suggest to replace for converting pdf > txt?
– Douglas W St. Clair
Jan 2 at 14:13
What would you suggest to replace for converting pdf > txt?
– Douglas W St. Clair
Jan 2 at 14:13
As I said in my answer, consider using the Ghostscript txtwrite device. The original ps2ascii only really deals with PostScript input, and is limited in the Encodings it can handle. The txtwrite device is usable with any of the languages the Ghostsceritp family supports, can use ToUnicode CMaps and has a series of fallbacks to try and figure out the real text. It also has a few different output options so you can recover the data in different ways. There are other tools which will do a similar job with PDF as well, such as MuPDF, and I'm sure there are more if you only want to deal with PDF.
– KenS
Jan 2 at 14:50
As I said in my answer, consider using the Ghostscript txtwrite device. The original ps2ascii only really deals with PostScript input, and is limited in the Encodings it can handle. The txtwrite device is usable with any of the languages the Ghostsceritp family supports, can use ToUnicode CMaps and has a series of fallbacks to try and figure out the real text. It also has a few different output options so you can recover the data in different ways. There are other tools which will do a similar job with PDF as well, such as MuPDF, and I'm sure there are more if you only want to deal with PDF.
– KenS
Jan 2 at 14:50
Thanks I don't know how I missed it.
– Douglas W St. Clair
Jan 4 at 1:52
Thanks I don't know how I missed it.
– Douglas W St. Clair
Jan 4 at 1:52
add a comment |
Thanks for contributing an answer to Stack Overflow!
- Please be sure to answer the question. Provide details and share your research!
But avoid …
- Asking for help, clarification, or responding to other answers.
- Making statements based on opinion; back them up with references or personal experience.
To learn more, see our tips on writing great answers.
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53995701%2ferrors-using-ps2ascii-on-some-files%23new-answer', 'question_page');
}
);
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown