<html>
<head>
<meta http-equiv=Content-Type content="text/html; charset=windows-1253">
<meta name=Generator content="Microsoft Word 12 (filtered)">
<style>
<!--
 /* Font Definitions */
 @font-face {font-family:"Cambria Math"; panose-1:2 4 5 3 5 4 6 3 2 4;}
 @font-face {font-family:Tahoma; panose-1:2 11 6 4 3 5 4 4 2 4;}
 /* Style Definitions */
 p.MsoNormal, li.MsoNormal, div.MsoNormal {margin:0cm; margin-bottom:.0001pt; font-size:12.0pt; font-family:"Times New Roman","serif"; color:black;}
 .MsoChpDefault {font-size:10.0pt;}
 @page Section1 {size:595.3pt 841.9pt; margin:72.0pt 90.0pt 72.0pt 90.0pt;}
 div.Section1 {page:Section1;}
-->
</style>
</head>
<body bgcolor=white lang=EL>
<div class=Section1>
<p class=MsoNormal><span lang=EN-US style='font-family:"Tahoma","sans-serif"'>Abcd</span><span lang=EN-US style='font-family:"Tahoma","sans-serif"'> </span><span style='font-family:"Tahoma","sans-serif"'>αβγδ άᾶὰἂ </span></p>
</div>
</body>
</html>
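(OK, the actual file is different, but the problem is the same.)
If you save the file above as temp.htm and load it in Internet Explorer, you will see 4 Latin characters, 4 Greek characters without accents, and 4 Greek characters with accents (variants of Alpha, the first letter of the Greek alphabet). Something like this: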
Abcd αβγδ άᾶὰἂ
So far, so good.
If we load the above file into the Body property of a TIdMessage and send it by email, it is displayed like this:
Abcd ???? ?ᾶὰἂ
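As you can see, the Greek letters from the monotonic alphabet are replaced with ???? ? (tested with Mozilla Thunderbird 3 on WinXP).
The properties of the TIdMessage component are as follows:
I tried setting CharSet to windows-1253, but with no luck.
Any ideas on how to make this work?
Update:
To answer your questions:
The raw source of the message after it was received is as follows (email addresses redacted):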
From - Thu Sep 15 11:11:06 2011
X-Account-Key: account3
X-UIDL: 00007715
X-Mozilla-Status: 0001
X-Mozilla-Status2: 00400000
X-Mozilla-Keys:
Return-Path: [redacted]
X-Envelope-To: [redacted]
X-Spam-Status: No, hits=0.0 required=5.0
    tests=AWL: 0.194,BAYES_20: -0.73,HTML_MESSAGE: 0.001,
    MIME_HEADER_CTYPE_ONLY: 0.56,MIME_HTML_ONLY: 0.001,MISSING_MID: 0.001,
    CUSTOM_RULE_FROM: ALLOW,TOTAL_SCORE: 0.027,autolearn=no
X-Spam-Level:
Received: from localhost ([127.0.0.1])
    by [redacted]
    for [redacted];
    Thu, 15 Sep 2011 11:10:59 +0300
From: [redacted]
Subject: Test msg
To: [redacted]
Content-Type: text/html; charset=us-ascii
Sender: [redacted]
Reply-To: [redacted]
Disposition-Notification-To: [redacted]
Return-Receipt-To: [redacted]
Date: Thu, 15 Sep 2011 11:10:59 +0300

<html>
<head>
<meta http-equiv=Content-Type content="text/html; charset=windows-1253">
<meta name=Generator content="Microsoft Word 12 (filtered)">
<style>
<!--
 /* Font Definitions */
 @font-face {font-family:"Cambria Math"; panose-1:2 4 5 3 5 4 6 3 2 4;}
 @font-face {font-family:Tahoma; panose-1:2 11 6 4 3 5 4 4 2 4;}
 /* Style Definitions */
 p.MsoNormal, li.MsoNormal, div.MsoNormal {margin:0cm; margin-bottom:.0001pt; font-size:12.0pt; font-family:"Times New Roman","serif"; color:black;}
 .MsoChpDefault {font-size:10.0pt;}
 @page Section1 {size:595.3pt 841.9pt; margin:72.0pt 90.0pt 72.0pt 90.0pt;}
 div.Section1 {page:Section1;}
-->
</style>
</head>
<body bgcolor=white lang=EL>
<div class=Section1>
<p class=MsoNormal><span lang=EN-US style='font-family:"Tahoma","sans-serif"'>Abcd</span><span lang=EN-US style='font-family:"Tahoma","sans-serif"'> </span><span style='font-family:"Tahoma","sans-serif"'>???? ?ᾶὰἂ </span></p>
</div>
</body>
</html>
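Mozilla Thunderbird also reports the message encoding as Western (ISO-8859-1). I tried setting different encodings on the IdMessage component, such as windows-1253 (Greek) or UTF-8, and the result is the same. I also converted the htm file to UTF-8 (using Notepad) and manually changed the charset in the HTML meta information; the file looks the same in the browser. I then sent the mail again. Result: Abcd ??? 2?3 ?? ??ᾶὰἂ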
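If you look at your own screenshot, you will see that both the TIdMessage and the transmitted email are set to use US-ASCII as the CharSet. That is why your data is being altered. If you load the HTML into the TIdMessage.Body or TIdText.Body property, you have to decode the data to UTF-16 (because that is what the Body property uses in XE), and then set the TIdMessage.CharSet or TIdText.CharSet property to windows-1253 so that the UTF-16 data is re-encoded correctly when the email is sent, for example: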
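Enc := CharsetToEncoding('windows-1253');
try
  // Decode the file's windows-1253 bytes into the UTF-16 Body
  IdMessage.Body.LoadFromFile('file.htm', Enc);
  IdMessage.ContentType := 'text/html';
  // Re-encode the Body as windows-1253 when the message is sent
  IdMessage.CharSet := 'windows-1253';
finally
  Enc.Free;
end;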
Or:
Enc := CharsetToEncoding('windows-1253');
try
  // Same idea, but the HTML goes into its own text part of the message
  with TIdText.Create(IdMessage.MessageParts, nil) do
  begin
    Body.LoadFromFile('file.htm', Enc);
    ContentType := 'text/html';
    CharSet := 'windows-1253';
  end;
finally
  Enc.Free;
end;
If you load the HTML into a TIdAttachment object instead, then you do not have to decode or encode anything manually, because attachment data is sent as-is:
with TIdAttachmentFile.Create(IdMessage.MessageParts, 'file.htm') do
begin
  ContentType := 'text/html';
end;
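To see why the declared charset matters independently of Indy, here is a minimal sketch in Python rather than Delphi, chosen only so the effect can be reproduced without any mail components. It uses nothing but Python's built-in codecs; the helper name `encode_as` and the sample strings are made up for this illustration. The last step rests on an assumption about the question's file that the source does not confirm: that Word wrote the polytonic letters as numeric character references, which would explain why they survived the US-ASCII conversion.

```python
# Illustration only: plain Python codecs, not Indy.
# The sample text mirrors the question: Latin, monotonic Greek, polytonic Greek.
latin = "Abcd"
monotonic = "\u03b1\u03b2\u03b3\u03b4 \u03ac"  # alpha beta gamma delta, alpha with tonos
polytonic = "\u1fb6\u1f70\u1f02"               # three polytonic alpha variants


def encode_as(text: str, charset: str) -> bytes:
    """Encode text, replacing every character the charset lacks with '?'."""
    return text.encode(charset, errors="replace")


# Declared as US-ASCII: every Greek letter is outside the charset.
ascii_bytes = encode_as(f"{latin} {monotonic}", "us-ascii")
print(ascii_bytes)  # b'Abcd ???? ?'

# Declared as windows-1253: monotonic Greek survives a round trip.
greek_bytes = encode_as(f"{latin} {monotonic}", "windows-1253")
print(greek_bytes.decode("windows-1253") == f"{latin} {monotonic}")  # True

# Polytonic letters are not in windows-1253 either. Written as numeric
# character references (assumed here, not confirmed by the source), they
# are plain ASCII and pass through any of these charsets unchanged.
refs = polytonic.encode("windows-1253", errors="xmlcharrefreplace")
print(refs)  # b'&#8118;&#8048;&#7938;'
```

The first result matches the `???? ?` seen in Thunderbird, and the round trip shows the behaviour expected once CharSet is windows-1253 and the Body has been decoded correctly.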