UTF8⇔UTF16

16bit-Unicode(UCS2)とUTF8との相互変換。
必要に迫られて書いてました。コード表片手にゴリゴリと。

#include <string>

// UTF16BE→UTF8
std::string utf16be_to_utf8(const std::wstring& utf16) {
std::string utf8;
utf8.reserve(utf16.size());
for ( size_t i = 0; i < utf16.size(); ++i ) {
    unsigned short wch = static_cast<unsigned short>(utf16.at(i));
    if ( wch <= 0x007f ) {
      utf8 += ((wch & 0x007f)      );
    } else
    if ( wch <= 0x07ff ) {
      utf8 += ((wch & 0x07c0) >> 6) | 0xc0;
      utf8 += ((wch & 0x003f)      ) | 0x80;
    } else {
      utf8 += ((wch & 0xf000) >> 12) | 0xe0;
      utf8 += ((wch & 0x0fc0) >> 6) | 0x80;
      utf8 += ((wch & 0x003f)      ) | 0x80;
    }
}
return utf8;
}

// UTF8→UTF16BE
std::wstring utf8_to_utf16be(const std::string& utf8) {
std::wstring utf16;
utf16.reserve(utf8.size());
for ( size_t i = 0; i < utf8.size(); ++i ) {
    unsigned char ch0 = utf8.at(i);
    if ( (ch0 & 0x80) == 0x00 ) {
      utf16 += ((ch0 & 0x7f)     );
    } else
    if ( (ch0 & 0xe0) == 0xc0 ) {
      unsigned char ch1 = utf8.at(++i);
      utf16 += ((ch0 & 0x3f) << 6)
             | ((ch1 & 0x3f)     );
    } else {
      unsigned char ch1 = utf8.at(++i);
      unsigned char ch2 = utf8.at(++i);
      utf16 += ((ch0 & 0x0f) << 12)
             | ((ch1 & 0x3f) << 6)
             | ((ch2 & 0x3f)      );
    }
}
return utf16;
}

間違えちゃいないと思うけど...

投稿日時 : 2007年12月1日 2:04

コメントを追加

# re: UTF8⇔UTF16 2007/12/01 12:09 凪瀬

if ( (ch0 & 0xe0) == 0xc0 ) {
ってことはUFT-8の先頭が2進数で0xxの場合と110の場合だけ？
1110から始まる日本語とか変換できないんじゃ…

サロゲートペアとかは考慮外なのだと思うけど。

# re: UTF8⇔UTF16 2007/12/01 12:50 通りすがる

WideCharToMultiByte/MultiByteToWideChar + CP_UTF8
とか。

# re: UTF8⇔UTF16 2007/12/01 13:11 επιστημη

ぁぃ、サロゲートペア考えてましぇん ^^;
2byte-UNICODEに納まらない"日本文字"ってあるんだっけ?

> WideCharToMultiByte/MultiByteToWideChar + CP_UTF8

使いたくなかったの。Windowsに縛られちゃうだから。

# re: UTF8⇔UTF16 2007/12/01 13:27 凪瀬

日本語はUTF-8で3byteになるわけですが、
utf8_to_utf16beってUTF-8で2byteまでしか考慮していないように思えるのですが。

>2byte-UNICODEに納まらない"日本文字"ってあるんだっけ?
UTF-16でサロゲートペアになるって意味であれば吉野家の吉（本来は土に口とかく）とかが有名ですかね。
UTF-8で2byteという意味であれば日本語はみんな納まりません。
ギリシア文字とかは2byteだから、「επιστημη」は変換できると思います。

# re: UTF8⇔UTF16 2007/12/01 15:08 επιστημη

2byte-UNICODE(UCS2)でしか考えてなかったかんなー orz
サロゲートペア使って4byte-UNICODE(UCS4)だとUTF8換算で6byteになるんか。
めんどっちーなーだれか書いてくれんかなー

.NET Frameworkの文字コード変換ってUCS4まできちんとやってくれるんだっけ?

# XimjzDaaDVknqRMoAV 2014/08/28 0:10 http://crorkz.com/

emBZ6i You made some first rate points there. I appeared on the internet for the difficulty and found most individuals will go together with along with your website.

# ydACOGgZVMXqGootpDP 2014/09/10 20:51 http://www.canadanobis.com

Well I definitely liked reading it. This article provided by you is very helpful for proper planning.

# jvDLxGgiFTaCTTOYBz 2014/09/18 16:53 http://akadmin.info/story/98353

5xy3sK Muchos Gracias for your article.Thanks Again. Awesome.

# YhLfRfGmFzaWSRhqNj 2015/01/12 7:35 varlog

deNp1H http://www.FyLitCl7Pf7kjQdDUOLQOuaxTXbj5iNG.com

# XrUTfZJrGdIE 2015/01/26 22:57 Josue

I read a lot http://www.consensusortho.com/index.php/patients/ dormicum 15 mg injection International Council of Shopping Centers Chief Economist Michael Niemira said consumers had started their back-to-school shopping later this year than in 2012. That may mean a lot of goods remain unsold at the end of the season, he added.

# ClVgnZBBzUzOQVC 2015/01/28 4:38 Jessica

I like watching TV http://www.hollandpompgroep.nl/atex zopiclone online Already trailing, 2-1, in the third, Sabathia found himself in trouble thanks to a walk and a pair of errors that helped the Angels load the bases with one out. The lefty walked Chris Nelson to make it 3-1, but Sabathia came back and fanned Josh Hamilton before retiring Erick Aybar on a groundout to leave the bases loaded.

# PIIqbpRQgv 2015/01/28 4:38 Lawrence

We're at university together http://kyoorius.com/publications/ lexotanil online China's implied oil demand rose by nearly 10 percent in Juneover a weak base a year earlier to 9.94 million barrels per day(bpd), Reuters calculations based on preliminary government datashowed. Crude runs rose 10.8 percent to 9.636 million bpd, thehighest daily output since February, as refineries boostedproduction after maintenance.

# GibVwCzbTCVKPpqgm 2015/01/29 10:12 Jerald

I'll text you later http://www.floridacollegeaccess.org/the-network/ buy hydrocodone 30 mg Russian officials said Monday that they had not yet received an applicationfrom Snowden, and Putin did not say outright whether he would grant a requestfrom him. But the president clearly signaled that it remained a possibility,with a reference to his prior statements that Snowden could apply for asylum inRussia provided that he stopped publishing classified material harmful to theUnited States.

# SrhtobPsOtrAtZvqKc 2015/01/29 10:12 Hipolito

Thanks funny site http://www.video-to-flash.com/video_to_flv/ clonazepam 2.5 mg/ml Fonterra said some of its potentially contaminated whey protein was purchased by Coca-Cola and Australian health food company Vitaco but the manufacturing process used by those companies, including ultra-high temperature treatment, meant their products posed no risk. It said the same situation applied to Chinese beverage maker Wahaha.

# GHGDmrIefAUWpnyCe 2015/02/04 21:09 Boyce

I'm in a band http://www.jrdneng.com/careers.htm Diamox Online Baracoa: a tropical paradise situated on the eastern tip of Cuba. The only way we could get there was by crossing the daunting La Farola, a steep and narrow 80-mile mountain road that climbed to 600m at its peak. The town comprises a colonial cluster of bars, restaurants and casa de particulares offset by the long stretch of clay-coloured coastline and the salty azure seas.

# RYFubpsVwq 2015/02/06 2:18 Gregg

The United States http://www.blue-lemons.com/about buy levofloxacin canada Facing a sentence of 60 years in prison for spitting on a police officer, and another 28 years behind bars for driving drunk with a minor passenger in the car, Textor told Zimmerman to accept the prosecutionâ?s plea deal of 45 years for harassing a public servant and 20 years for DWI.

# fmuMFlzcbUluJKNT 2015/02/06 13:14 Leigh

I never went to university http://www.grasmerehotel.com/conferences/ make money with no website Egypt's political upheaval has put the Obama administration in an awkward diplomatic position. The White House strongly supported the pro-democracy protests that forced out longtime autocrat Hosni Mubarak in 2011, but it has refused to condemn the military's removal of Morsi, who was subsequently elected in the country's first democratic balloting.

# azcThmavHXNxg 2015/02/06 13:14 Nickolas

Punk not dead http://artist-how-to.com/studio/portraits/ rapid refund tax service Interested parties include a combination of private equityfirms KKR and New Mountain Capital, as well as sports andentertainment company Creative Artists Agency. Europeaninvestment group CVC Capital Partners is another likely bidder.

# NRIzsmohutNXStnRQ 2015/02/07 6:51 Razer22

Get a job http://www.wonderbra.ca/about-us/ tenormin tablets Hail the surgeons who perform the needed operations. Theymust be possessed of steady hands and be precise; they must dotheir work and check again to make sure that the damaged organsare completely removed. â?Istisal,â? surgical removal, is theword of the day among erstwhile decent men and women, whoexpress their fondness for the removal of tumors.

# QxfRFkNFBSrZYVST 2015/02/07 21:30 Camila

I'd like to send this to http://www.professorpotts.com/links/ online loan lender Carney said it was never the Bank's intention that none of the four characters on its notes should be a woman, but it would invite feedback from the public on whether it could to do more to comply with its commitment to equality. The conclusions of the review will be announced by the end of the year.

# TphNCbatuinzxsvZKme 2015/02/08 10:49 Rigoberto

A pension scheme http://www.green-events.co.uk/about.html Vytorin Generic Name Last year, 1.7 million prescriptions were written forNasacort AQ and its generics, Sanofi said. Prior to theintroduction of generics, Nasacort AQ generated peak annualsales of $375 million. Last year sales were less than $100million.

# scEfOwnbbZkNeaeDVS 2015/02/08 10:50 Lonny

I study here http://www.sullivans.com.au/free-inclusions/ where can i purchase azithromycin online “We’ve got to knock the myths on the head. In eastern Asia there is great respect and reverence for elderly people, but the reality is with one-child families the children are just often not there because they’ve moved to a city. It’s not practical for families to depend on their children.”

# dOOLCgdPwbEqET 2015/02/09 6:49 Dario

this is be cool 8) http://atecuccod.com/index.php/nyomtatas bicycle financing "The Mambo Kings Play Songs of Love," his second novel, told the story of a pair of Cuban-born brothers, both musicians, who emigrate to New York City in the 1950s and achieve short-lived fame after appearing on the "I Love Lucy" show.

# gAzaFnrolSoyKEbpQP 2015/02/09 6:49 Britt

I'm a member of a gym http://www.sporttaplalkozas.com/sporttaplalkozas/fogyokura-program small quick loan The administration says the healthcare law, which was acentral point of debate in last year's presidential election,which Obama won, will insure about 30 million people throughsubsidized private insurance or government-provided Medicaid.

# rScjnTarBDE 2015/02/09 6:49 Everett

I like it a lot http://atecuccod.com/index.php/ajandektargyak winning cash online “Fundamentally, we all want an assessment and examinations system that we can believe in and it is clear confidence is being lost in the current system,” he said. “It is no longer fit for purpose; it’s hugely expensive and results are unreliable because it’s too dependant on individuals sitting down and marking.”

# TnbPdLuqGb 2015/02/10 19:34 Marcelo

I'd like to take the job http://www.ryan-browne.co.uk/about/ Purchase Tadalafil Online Amazon plays the long game, but for now, its investors may find themselves discouraged. The online retail giant posted an earnings miss on Thursday. Though its second-quarter reported earnings of $15.7 billion came in roughly on par with analysts' estimates, its earnings-per-share loss of $0.02 did not. Wall Street had expected growth of $0.05, according to Yahoo! Finance.

# zUNVaAPIhE 2015/02/10 19:35 Terrance

magic story very thanks http://www.milliput.com/about.html Aciclovir Tablets 200mg MUMBAI, Oct 14 (Reuters) - India's Reliance Industries Ltd said on Monday further investment at its key gas fieldto reverse falling output rests on a rise in domestic gasprices, after subdued global demand for fuel narrowed itsrefining margins in the second quarter.

# FfZeVHFYkJCrc 2015/02/11 23:07 Bruce

No, I'm not particularly sporty https://josbinder.at/index.php?nav=37 pay day loans utah Zurich Insurance is looking into the suicide of its chief financial officer Pierre Wauthier and investigations are ongoing into the death of Bank of America Merrill Lynch intern Moritz Erhardt who was found dead in his London lodgings having worked through the night several days running.

# oHzBkqdHCo 2015/02/24 18:07 Sherwood

I've been made redundant http://www.horsdoeuvres.fr/contact/ glucophage xr On Thursday, Ketchum scored another public-relations coup:It helped place a Putin commentary in opinion pages of The NewYork Times, just as representatives from Russia and the UnitedStates were beginning to meet in Geneva to negotiate a plan forSyria to give up its chemical weapons.

# feoiCPtdTgOCLW 2015/02/24 18:07 Andrea

I've been made redundant http://www.streamsweden.com/service/ inderalici 20mg Another subsidiary, PTT Exploration and Production Pcl, was involved in Australia's worst offshore drillingaccident in 2009, when thousands of gallons of crude oil spewedinto the sea after a damaged oil well blew up.

# uAgFNKFpYOJAjqp 2015/02/25 21:06 Friend35

A staff restaurant http://spid.it/gestione-rischio-clinico/ promethazine online Saints Row 4′s writing is excellent. After Saints the Third, stakes were high for this game in terms of sheer lunacy and Volition answers them admirably. If gamers are concerned about how this game could top some of the moments from the last game, they should not worry because Saints Row 4 does that and more. In many ways it shows the evolution of the studio from the nascent beginnings as a simple GTA clone to gaining its own voice to defying expectations of what the series is capable of to the heights of madness that SR4 climbs. The series has always had a great sense of humor, a task often difficult to pull of in video games, but the series has always done the job adeptly. That is no different here, Saints Row 4 features some hilarious jokes and moments. The plot moves well and even the crazier moments work within the new virtual world of bizarro Steelport and the Zin.

# HBzAtuMGSSupkQ 2015/02/27 0:34 Chong

I can't get a dialling tone http://www.nude-webdesign.com/ongoing-support/ abilify 10 Russia, which has veto power in the Security Council, couldcite such doubts about proof of culpability in opposing futureefforts by the United States, Britain and France to punish Syriafor any violations of a deal to abandon chemical weapons.

# XSHlUGmNbM 2015/04/08 2:17 Amado

I'd like to take the job http://www.vaimnemaailm.ee/index.php/tegevused endep 50 Look, Iâ?ve had a hard time believing that A-Rod would walk away from the game if he could still play, mostly because Iâ?ve had enough conversations with him over the years, both on and off the record, to convince me he really was something of a gym rat when it came to his sport.

# cUfqAuiSvfumAB 2015/04/08 2:17 Bobbie

Who would I report to? http://www.europanova.eu/partenaires/ price latisse canada Last month, the UN Human Rights Council's Independent International Commission of Inquiry said there were reasonable grounds to believe that "limited quantities of toxic chemicals" had been used at Khan al-Assal, as well as in three other attacks.

# PLS173欧美游戏币 2016/01/08 17:06 boiiqm@pls173.com

www.pls173.com??美服金?(BNS)客??价：??付款完成，BNS金?不到10分?就交易完成，速度！！！！

# re: UTF8?UTF16 2021/07/09 19:02 hydroxychloroquine use

heart rate watch walmart https://chloroquineorigin.com/# hydroxy chloriquine

# re: UTF8?UTF16 2021/07/25 18:49 hydrochoriquine

malaria drug chloroquine https://chloroquineorigin.com/# how safe is hydroxychloroquine

# iqkztqnamhbm 2021/11/28 9:15 dwedayrmke

hydroxychloroquine ingredients https://hydroaaralen.com/

# gvrnpbixekzv 2021/12/03 15:06 dwedayvonx

https://aralenphosphates.com/ chloroquine phosphate

タイトル		タイトルを入力してください
名前		名前を入力してください
URL
コメントコメントを入力してください
名前をブラウザに記憶する

東方算程譚

目次

Blog 利用状況

ニュース

記事カテゴリ

書庫

日記カテゴリ