gpt4 book ai didi

java - 从 PDFBox 中的字体获取颜色

转载 作者:行者123 更新时间:2023-12-01 09:42:05 25 4
gpt4 key购买 nike

我试图从 PDFBox 获取字体颜色,但似乎一直抛出异常。有人可以帮忙吗?我尝试获取颜色的方式是(页面是我获取的PDPage):

PDResources = page.getResources();
Iterable<COSName> fontNames = resources.getFontNames();
for (COSName fontName:fontNames)
System.out.println("name: " + resources.getFont(fontName).getName() +
"colour: " + resources.getColorSpace(fontName).getName());

这会打印出异常:

org.apache.pdfbox.pdmodel.MissingResourceException: Missing color space: F1

有人可以告诉我如何正确获取以这种方式获得的字体的颜色吗?

最佳答案

从源代码下载中尝试 PrintTextColors:

/**
* This is an example on how to get the colors of text. Note that this will not tell the background,
* and will only work properly if the text is not overwritten later, and only if the text rendering
* modes are 0, 1 or 2. In the PDF 32000 specification, please read 9.3.6 "Text Rendering Mode" to
* know more. Mode 0 (FILL) is the default. Mode 1 (STROKE) will make glyphs look "hollow". Mode 2
* (FILL_STROKE) will make glyphs look "fat".
*
* @author Ben Litchfield
* @author Tilman Hausherr
*/
public class PrintTextColors extends PDFTextStripper
{
/**
* Instantiate a new PDFTextStripper object.
*
* @throws IOException If there is an error loading the properties.
*/
public PrintTextColors() throws IOException
{
addOperator(new SetStrokingColorSpace());
addOperator(new SetNonStrokingColorSpace());
addOperator(new SetStrokingDeviceCMYKColor());
addOperator(new SetNonStrokingDeviceCMYKColor());
addOperator(new SetNonStrokingDeviceRGBColor());
addOperator(new SetStrokingDeviceRGBColor());
addOperator(new SetNonStrokingDeviceGrayColor());
addOperator(new SetStrokingDeviceGrayColor());
addOperator(new SetStrokingColor());
addOperator(new SetStrokingColorN());
addOperator(new SetNonStrokingColor());
addOperator(new SetNonStrokingColorN());
}

/**
* This will print the documents data.
*
* @param args The command line arguments.
*
* @throws IOException If there is an error parsing the document.
*/
public static void main(String[] args) throws IOException
{
if (args.length != 1)
{
usage();
}
else
{
PDDocument document = null;
try
{
document = PDDocument.load(new File(args[0]));

PDFTextStripper stripper = new PrintTextColors();
stripper.setSortByPosition(true);
stripper.setStartPage(0);
stripper.setEndPage(document.getNumberOfPages());

Writer dummy = new OutputStreamWriter(new ByteArrayOutputStream());
stripper.writeText(document, dummy);
}
finally
{
if (document != null)
{
document.close();
}
}
}
}

@Override
protected void processTextPosition(TextPosition text)
{
super.processTextPosition(text);

PDColor strokingColor = getGraphicsState().getStrokingColor();
PDColor nonStrokingColor = getGraphicsState().getNonStrokingColor();
String unicode = text.getUnicode();
RenderingMode renderingMode = getGraphicsState().getTextState().getRenderingMode();
System.out.println("Unicode: " + unicode);
System.out.println("Rendering mode: " + renderingMode);
System.out.println("Stroking color: " + strokingColor);
System.out.println("Non-Stroking color: " + nonStrokingColor);
System.out.println("Non-Stroking color: " + nonStrokingColor);
System.out.println();

// See the PrintTextLocations for more attributes
}

/**
* This will print the usage for this document.
*/
private static void usage()
{
System.err.println("Usage: java " + PrintTextColors.class.getName() + " <input-pdf>");
}
}

关于java - 从 PDFBox 中的字体获取颜色,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/38375884/

25 4 0
Copyright 2021 - 2024 cfsdn All Rights Reserved 蜀ICP备2022000587号
广告合作:1813099741@qq.com 6ren.com