Die TYPO3 Mailingliste - nicht fragen: lesen!
This is a discussion on [TYPO3] typo3 database utf-8 conversion within the typo3-english@lists.netfielders.de forums, part of the TYPO3-Mailinglists: ENGLISH category; Hi, i would like to know about concepts of migration of a typo3 latin1 database to utf-8. i helped ...
|
|||||||
| Registrieren | Hilfe | Benutzerliste | Kalender | Suchen | Heutige Beiträge | Alle Foren als gelesen markieren |
|
#1
|
|||
|
|||
|
Hi,
i would like to know about concepts of migration of a typo3 latin1 database to utf-8. i helped myself by taking a dump and converting it with iconv but that can not solve problems e.g. caused by serialized strings, where stringlength might differ after conversion. is there something known like a standard-procedure to manage such a migration in typo3 context? cheers tom _______________________________________________ TYPO3-english mailing list TYPO3-english (AT) lists (DOT) netfielders.de http://lists.netfielders.de/cgi-bin/.../typo3-english |
|
#2
|
|||
|
|||
|
Thomas Oppelt wrote:
> i would like to know about concepts of migration of a typo3 latin1 > database to utf-8. > i helped myself by taking a dump and converting it with iconv but that > can not solve problems e.g. caused by serialized strings, where > stringlength might differ after conversion. > > is there something known like a standard-procedure to manage such a > migration in typo3 context? Google is your friend: http://www.google.ca/search?q=typo3+utf-8 First hit! Then: http://wiki.typo3.org/index.php/UFT-...abase_to_UTF-8 ![]() Patrick _______________________________________________ TYPO3-english mailing list TYPO3-english (AT) lists (DOT) netfielders.de http://lists.netfielders.de/cgi-bin/.../typo3-english |
|
#3
|
|||
|
|||
|
On Jan 11, 2008, at 3:10 PM, Patrick Gaumond wrote:
> Thomas Oppelt wrote: > >> i would like to know about concepts of migration of a typo3 latin1 >> database to utf-8. >> i helped myself by taking a dump and converting it with iconv but >> that >> can not solve problems e.g. caused by serialized strings, where >> stringlength might differ after conversion. >> >> is there something known like a standard-procedure to manage such a >> migration in typo3 context? > > Google is your friend: > > http://www.google.ca/search?q=typo3+utf-8 > > First hit! > > Then: > http://wiki.typo3.org/index.php/UFT-...abase_to_UTF-8 > > ![]() > > Patrick one warning and is shown wrong on many links, when you use a resent version of mysqldump, then you don't have to encode your dump with iconv or other tools since the dump is already in utf-8. Ries _______________________________________________ TYPO3-english mailing list TYPO3-english (AT) lists (DOT) netfielders.de http://lists.netfielders.de/cgi-bin/.../typo3-english |
|
#4
|
|||
|
|||
|
Hi Patrick,
thanks vor hints, but: is "recode" able to handle serialized strings conversion rather then iconv? Sorry i am not that familiar with both ways. Same question for me according to the wiki example where the dump is not converted at all as i see, just inserted back with --default-character-set=utf8. Is this enough and fits all potential problems? I can try this myself, but maybe you are so familiar with charset conversion stuff that you can give me a simple "yes, that's all no problem". ![]() thx & regards tom Patrick Gaumond schrieb: > Thomas Oppelt wrote: > >> i would like to know about concepts of migration of a typo3 latin1 >> database to utf-8. >> i helped myself by taking a dump and converting it with iconv but that >> can not solve problems e.g. caused by serialized strings, where >> stringlength might differ after conversion. >> >> is there something known like a standard-procedure to manage such a >> migration in typo3 context? > > Google is your friend: > > http://www.google.ca/search?q=typo3+utf-8 > > First hit! > > Then: > http://wiki.typo3.org/index.php/UFT-...abase_to_UTF-8 > > > ![]() > > Patrick _______________________________________________ TYPO3-english mailing list TYPO3-english (AT) lists (DOT) netfielders.de http://lists.netfielders.de/cgi-bin/.../typo3-english |
|
#5
|
|||
|
|||
|
ries van Twisk schrieb:
> > On Jan 11, 2008, at 3:10 PM, Patrick Gaumond wrote: > >> Thomas Oppelt wrote: >> >>> i would like to know about concepts of migration of a typo3 latin1 >>> database to utf-8. >>> i helped myself by taking a dump and converting it with iconv but that >>> can not solve problems e.g. caused by serialized strings, where >>> stringlength might differ after conversion. >>> >>> is there something known like a standard-procedure to manage such a >>> migration in typo3 context? >> >> Google is your friend: >> >> http://www.google.ca/search?q=typo3+utf-8 >> >> First hit! >> >> Then: >> http://wiki.typo3.org/index.php/UFT-...abase_to_UTF-8 >> >> >> ![]() >> >> Patrick > > > one warning and is shown wrong on many links, > when you use a resent version of mysqldump, then you don't have to > encode your > dump with iconv or other tools since the dump is already in utf-8. > > Ries But does this prevent me from these cases where a s:4:"öäüß"; in latin1 is s:8:"öäüß"; in utf-8. So if you try to unserialize such a string after conversion u get just false, or not? _______________________________________________ TYPO3-english mailing list TYPO3-english (AT) lists (DOT) netfielders.de http://lists.netfielders.de/cgi-bin/.../typo3-english |
|
#6
|
|||
|
|||
|
Thomas Oppelt wrote: on 12.01.2008 11:43:
>>> http://www.google.ca/search?q=typo3+utf-8 >>> >>> First hit! >>> >>> Then: >>> http://wiki.typo3.org/index.php/UFT-...abase_to_UTF-8 >> one warning and is shown wrong on many links, >> when you use a resent version of mysqldump, then you don't have to >> encode your >> dump with iconv or other tools since the dump is already in utf-8. >> >> Ries > > But does this prevent me from these cases where a s:4:"öäüß"; in latin1 > is s:8:"öäüß"; in utf-8. > So if you try to unserialize such a string after conversion u get just > false, or not? I guess you will have to "script" that converter on your own, because no "automatic" converter will work with PHP-serialized array. I haven't really found the need to do that, in most cases the serialized arrays are either some "minor" configuration that can be adapted by hand after converting everything else, or some cached stuff that we can clean up afterwards anyway. Isn't that the case? Cheers, Ernesto _______________________________________________ TYPO3-english mailing list TYPO3-english (AT) lists (DOT) netfielders.de http://lists.netfielders.de/cgi-bin/.../typo3-english |
|
#7
|
|||
|
|||
|
Ernesto Baschny [cron IT] schrieb:
> Thomas Oppelt wrote: on 12.01.2008 11:43: > >>>> http://www.google.ca/search?q=typo3+utf-8 >>>> >>>> First hit! >>>> >>>> Then: >>>> http://wiki.typo3.org/index.php/UFT-...abase_to_UTF-8 > > >>> one warning and is shown wrong on many links, >>> when you use a resent version of mysqldump, then you don't have to >>> encode your >>> dump with iconv or other tools since the dump is already in utf-8. >>> >>> Ries >> >> But does this prevent me from these cases where a s:4:"öäüß"; in >> latin1 is s:8:"öäüß"; in utf-8. >> So if you try to unserialize such a string after conversion u get just >> false, or not? > > I guess you will have to "script" that converter on your own, because no > "automatic" converter will work with PHP-serialized array. I haven't > really found the need to do that, in most cases the serialized arrays > are either some "minor" configuration that can be adapted by hand after > converting everything else, or some cached stuff that we can clean up > afterwards anyway. Isn't that the case? > > Cheers, > Ernesto i HOPE this is the case, i don't know if typo3 just stores temp data like caching data as serialized array/strings...what be nice if someone could say "no prob u can be unconcerned about that". _______________________________________________ TYPO3-english mailing list TYPO3-english (AT) lists (DOT) netfielders.de http://lists.netfielders.de/cgi-bin/.../typo3-english |
|
#8
|
|||
|
|||
|
_______________________________________________
TYPO3-english mailing list TYPO3-english (AT) lists (DOT) netfielders.de http://lists.netfielders.de/cgi-bin/.../typo3-english |
|
#9
|
|||
|
|||
|
On Jan 15, 2008, at 11:02 AM, Steffen Kamper wrote:
> >>> <-....snip> >> >> I guess you will have to "script" that converter on your own, >> because no >> "automatic" converter will work with PHP-serialized array. I haven't >> really found the need to do that, in most cases the serialized >> arrays are >> either some "minor" configuration that can be adapted by hand after >> converting everything else, or some cached stuff that we can clean up >> afterwards anyway. Isn't that the case? >> >> Cheers, >> Ernesto > > i don't think that is needed. As a dump is a normal textfile there > is no > difference between text and serialized array, conversion will be > done for > complete text. > > vg Steffen > > Hey Steffen the problem is the serialized arrays. What happens is that if an character get's changed to a two byte notation instead of a one byte notation the length parameter of the serialized array is incorrect. The length parameters counts bytes, not characters. If that happens then unserialize will fail and return false (or was it NULL? ) Ries -- Ries van Twisk Freelance TYPO3 Developer email: ries (AT) vantwisk (DOT) nl web: http://www.rvantwisk.nl/ skype: callto://r.vantwisk Phone: + 1 810-476-4193 _______________________________________________ TYPO3-english mailing list TYPO3-english (AT) lists (DOT) netfielders.de http://lists.netfielders.de/cgi-bin/.../typo3-english |
|
#10
|
|||
|
|||
|
"ries van Twisk" <typo3 (AT) rvt (DOT) dds.nl> schrieb im Newsbeitrag
news:mailman.3823.1200430939.5304.typo3-english (AT) lists (DOT) netfielders.de... > > On Jan 15, 2008, at 11:02 AM, Steffen Kamper wrote: > >> >>>> <-....snip> >>> >>> I guess you will have to "script" that converter on your own, because >>> no >>> "automatic" converter will work with PHP-serialized array. I haven't >>> really found the need to do that, in most cases the serialized arrays >>> are >>> either some "minor" configuration that can be adapted by hand after >>> converting everything else, or some cached stuff that we can clean up >>> afterwards anyway. Isn't that the case? >>> >>> Cheers, >>> Ernesto >> >> i don't think that is needed. As a dump is a normal textfile there is no >> difference between text and serialized array, conversion will be done >> for >> complete text. >> >> vg Steffen >> >> > > Hey Steffen > > the problem is the serialized arrays. > What happens is that if an character get's changed to a two byte notation > instead > of a one byte notation the length parameter of the serialized array is > incorrect. > > The length parameters counts bytes, not characters. > > If that happens then unserialize will fail and return false (or was it > NULL? ) > > Ries Hi Ries, indeed i didn't thought about that. But i remember that i took a dump file, changed collation and charset manually and restored db, and result was ok. What's about the extension convert2utf8 - does it works? maybe if not it could be improved. Download count is nearly 3000, so it could help. vg Steffen _______________________________________________ TYPO3-english mailing list TYPO3-english (AT) lists (DOT) netfielders.de http://lists.netfielders.de/cgi-bin/.../typo3-english |
| Themen-Optionen | Thema durchsuchen |
| Ansicht | Thema bewerten |
|
|
|
||||
| Thema | Autor | Forum | Antworten | Letzter Beitrag |
| [TYPO3-german] Database Analyser: "Write to database" ohne Wirkung | Christoph Pönighaus | typo3-german@lists.netfielders.de | 2 | 01.08.2007 07:09 |
| [TYPO3] Database problems | Heiner Lamprecht | typo3-english@lists.netfielders.de | 1 | 01.05.2007 10:03 |
| Re: [TYPO3] Mailform to database | James Corell | typo3-english@lists.netfielders.de | 0 | 20.04.2007 16:55 |
| [TYPO3] Database selction | Joop van Dijk | typo3-english@lists.netfielders.de | 1 | 19.12.2006 14:00 |
| [TYPO3] database.sql | Peter Marino | typo3-english@lists.netfielders.de | 3 | 15.12.2006 01:41 |