Develop candidate 02 24 2019 by mgage · Pull Request #927 · openwebwork/webwork2

mgage · 2019-02-25T02:00:18Z

This should replace PR #908. It changes fewer files, since I have already committed the bulk of the changes to the copyright symbol.

On the other hand, if the file contains a character (such as @) which is not valid UTF-8, then issue a warning to the HTML page. (Since course.conf often has an incorrect @ instead of ©, this results in an error for each course until those files are fixed.)
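The kind of check described above can be sketched as follows. This is a minimal Python illustration, not the Perl code from the commit. The function name `find_invalid_utf8` and the sample line are hypothetical. It assumes the stray character is a © saved in a single-byte encoding such as Latin-1, where it is the lone byte 0xA9, which is not a valid UTF-8 sequence on its own.

```python
def find_invalid_utf8(data: bytes):
    """Return None if data is valid UTF-8, else (byte offset, offending byte)."""
    try:
        data.decode("utf-8")
    except UnicodeDecodeError as err:
        # err.start is the offset of the first byte that could not be decoded.
        return err.start, data[err.start]
    return None


# Hypothetical sample line; the same text saved in two different encodings.
sample = "# Copyright \u00a9 (sample line)"
latin1_bytes = sample.encode("latin-1")  # © becomes the single byte 0xA9
utf8_bytes = sample.encode("utf-8")      # © becomes the two bytes 0xC2 0xA9

print(find_invalid_utf8(latin1_bytes))  # (12, 169)
print(find_invalid_utf8(utf8_bytes))    # None
```

A caller could turn a non-None result into the page warning, reporting the file name and byte offset so the file can be re-saved as UTF-8.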

using &copy; instead of © (the latter doesn't always work with utf8)

- mysql_enable_utf8 ==> mysql_enable_utf8mb4
- "SET NAMES 'utf8'" ==> "SET NAMES 'utf8mb4'"
- modify the length of several VARCHAR columns so key length < 1000, as utf8mb4 reserves 4 bytes per character.

These changes sufficed to allow (in a Docker test system):

- creating and using the admin course in a new MariaDB 10.0 database which was set to use utf8mb4 as the default charset,
- running OPL_update,
- creating a test course and creating a homework set with the question setMAAtutorial/hello.pg in it,
- creating a user whose name needs UTF8 characters, and
- submitting a UTF8 encoded string as an answer to setMAAtutorial/hello.pg and storing the UTF8 answer in the past_answer table.

Note: The Docker test was done after the docker-compose.yml was modified, which is in the following commit.
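The column-length arithmetic behind that change can be checked with a short sketch. This is an illustration in Python, not part of the commit. The helper names `max_key_chars` and `fits` are hypothetical, and the 255-character column is used only as a typical VARCHAR length.

```python
BYTES_PER_CHAR = 4  # utf8mb4 reserves up to 4 bytes per character


def max_key_chars(limit_bytes, bytes_per_char=BYTES_PER_CHAR):
    """Largest number of characters whose worst-case size fits in the key limit."""
    return limit_bytes // bytes_per_char


def fits(length_chars, limit_bytes, bytes_per_char=BYTES_PER_CHAR):
    """True if a key over length_chars characters stays within limit_bytes."""
    return length_chars * bytes_per_char <= limit_bytes


print(max_key_chars(1000))  # 250
print(fits(255, 1000))      # False: 255 * 4 = 1020 bytes
print(fits(245, 1000))      # True:  245 * 4 = 980 bytes
```

With a 3-byte charset the same 255-character key would need only 765 bytes, which is why the column lengths only became a problem after the switch to utf8mb4.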

database to use utf8mb4. This needs to be done BEFORE the Docker database storage volume is created. A special version called docker-compose.yml.specialUTF8MB4-db-storage can be used to replace docker-compose.yml; it changes the name of the database container and of the database's named storage volume, and makes related settings. Replacing docker-compose.yml with that file forces Docker to run the containers in a manner which will not interfere with the non-utf8mb4 database used by the prior configuration. That approach allows one to switch between Docker containers using a utf8mb4 database and ones using an old-style database, just by using the "correct" settings in docker-compose.yml for each case.

To allow the 1000 byte limit for keys, the OPL_local_statistics table was changed to use ENGINE=MyISAM. That change allowed reducing the source_file column from varchar(255) to varchar(245), while the 767 byte key limit of ENGINE=InnoDB, as was used by this table and by OPL_global_statistics.sql (at present), would have limited this to about 191 characters.

The OPL_problem_user table had 3 keys changed from using 100 characters each to using only 80 characters each. The same changes were already made to lib/WeBWorK/DB/Record/PastAnswer.pm for the same reason. Only the first 245 characters of source_file are used as a key. Finally, the CHARACTER SET of OPL_problem_user was explicitly changed from ascii to utf8mb4.

The saved version of OPL_global_statistics.sql needs to be hand modified as follows, until the GitHub master branch has these changes made. Only after those changes can the file be loaded into a utf8mb4 database.

- utf8 => utf8mb4
- 255 => 245
- ENGINE=InnoDB => ENGINE=MyISAM

=================================

    10c10
    < /*!40101 SET NAMES utf8mb4 */;
    ---
    > /*!40101 SET NAMES utf8 */;
    24c24
    < /*!40101 SET character_set_client = utf8mb4 */;
    ---
    > /*!40101 SET character_set_client = utf8 */;
    26c26
    < `source_file` varchar(245) NOT NULL,
    ---
    > `source_file` varchar(255) NOT NULL,
    31c31
    < ) ENGINE=MyISAM DEFAULT CHARSET=utf8mb4;
    ---
    > ) ENGINE=InnoDB DEFAULT CHARSET=utf8;
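The hand modification could also be scripted. The following is a rough Python sketch, not something shipped with this PR. The function name `convert_dump` is hypothetical, and it assumes that the strings in the diff are the only places in the dump where these patterns need to change.

```python
import re

# (pattern, replacement) pairs mirroring the three substitutions listed above.
REPLACEMENTS = [
    # Whole-word match, so an existing "utf8mb4" is left alone.
    (re.compile(r"\butf8\b"), "utf8mb4"),
    # Only shorten the source_file column, as in the diff.
    (re.compile(r"(`source_file`\s+varchar)\(255\)"), r"\1(245)"),
    (re.compile(r"ENGINE=InnoDB"), "ENGINE=MyISAM"),
]


def convert_dump(sql: str) -> str:
    """Apply the utf8mb4-related substitutions to the text of an SQL dump."""
    for pattern, replacement in REPLACEMENTS:
        sql = pattern.sub(replacement, sql)
    return sql


# The four ">" lines from the diff, used here as sample input.
old_dump = "\n".join([
    "/*!40101 SET NAMES utf8 */;",
    "/*!40101 SET character_set_client = utf8 */;",
    "  `source_file` varchar(255) NOT NULL,",
    ") ENGINE=InnoDB DEFAULT CHARSET=utf8;",
])

print(convert_dump(old_dump))
```

Because the `utf8` pattern matches whole words only, running the conversion twice gives the same result as running it once.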

taniwallach · 2019-03-06T16:22:44Z

I made a PR to @mgage's tree to get things to work (at least in Docker) when a utf8mb4 database is used.
See mgage#24

A special docker-compose.yml.specialUTF8MB4-db-storage is included. It uses a different container/volume name for the utf8mb4 testing, which avoids the need to delete a current named Docker volume of SQL data.

With those changes (and the hacks to bypass the missing XML::Simple module in the current Docker image), I could do the following in a Docker test system:

- create and use the admin course in a new MariaDB 10.1 database which was set to use utf8mb4 as the default charset,
- run OPL_update
  - Additional hacks were needed to get the global statistics to work.
  - See Tani develop candidate 02 24 2019 utf8mb4 mgage/webwork2#24 for details
- create a test course and create a homework set with the question setMAAtutorial/hello.pg in it (selected as it takes a string answer),
- create a user whose name needs UTF8 characters, and
- submit a UTF8 encoded string as an answer to setMAAtutorial/hello.pg and later see the stored UTF8 answer in the course's past_answer table.
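As background for why the charset matters in such a test: the legacy `utf8` charset in MySQL and MariaDB stores at most 3 bytes per character, while UTF-8 needs 4 bytes for characters outside the Basic Multilingual Plane. A small Python sketch of that distinction follows. The helper names and sample strings are illustrative only and are not the ones used in the test above.

```python
def utf8_width(ch: str) -> int:
    """Number of bytes a single character occupies when encoded as UTF-8."""
    return len(ch.encode("utf-8"))


def needs_utf8mb4(text: str) -> bool:
    """True if any character in text needs 4 bytes, so 3-byte utf8 cannot hold it."""
    return any(utf8_width(ch) == 4 for ch in text)


# Illustrative samples: ASCII, a 2-byte symbol, Hebrew letters, an emoji.
for sample in ["hello", "\u00a9", "\u05e9\u05dc\u05d5\u05dd", "\U0001F600"]:
    print([utf8_width(ch) for ch in sample], needs_utf8mb4(sample))
```

Only the last sample contains a 4-byte character, so a name or answer in most alphabets would already fit in the older charset. The 4-byte case is the one that specifically exercises utf8mb4.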

The testing was done when PG was using the branch from openwebwork/pg#390

Overall, this seems a pretty good indication that things basically work with this set of UTF8 support for both webwork2 and pg.

I have been running a development server using the older (early 2019) branches with good success.

In my opinion, we won't find many more of the issues with the UTF8 support until more people are testing the code on servers which get a reasonable amount of usage.

Note: When I tried to include XML::Simple near the start of the first "cpanm install" line, there was an error:

    Building and testing XMLRPC-Lite-0.717 ...
    ! Installing XMLRPC::Lite failed. See /root/.cpanm/work/1551887935.125/build.log for details. Retry with --force to force install it.

So it was put into a second "cpanm install" line.

mgage · 2019-03-12T15:04:10Z

This is replaced by PR #930

mgage · 2019-03-12T15:10:55Z

replaced by pr #930

mgage · 2019-03-12T15:12:09Z

this is replaced by PR #930

goehle and others added 30 commits June 20, 2016 14:13

Tweaking how UTF8 encoding is done.

c7830d1

Adding more UTF8 support.

7f6a18a

Cleanup

5c20361

Added support for utf8 on xml

a54c760

Encoded results from freeze so they don't become long utf8 characters…

5c44802

… and mess up the database. Note: We do not need decode from thaw because as sequences of bytes nothing changes. (I think.)

Fix some broken css

fa1f954

small change.

ad37eb1

Added hardcopy support (assumin gyou have the fonts installed and ena…

0073969

…bled.)

Tracking down more open commands.

45f8bc1

corrected file encodings

7998565

Merge branch 'convert_to_utf8_encoding' of https://github.com/heideri…

eeb63a3

…ch/webwork2 into locbug Conflicts: courses.dist/modelCourse/course.conf

added [qw(Encode::Encoding)] to ${pg}{modules}) in defaults.config as…

e8b9947

… suggested by goehle

Merge pull request #14 from heiderich/Encode-error

ceb43be

added [qw(Encode::Encoding)] to ${pg}{modules}) in defaults.config as…

Merge branch 'add_maketext_calls_to_achievements' of https://github.c…

e1b5553

…om/heiderich/webwork2 into locbug

Merge branch 'locbug' of https://github.com/goehle/webwork2 into locbug

70ae45a

Merge branch 'develop' of https://github.com/openwebwork/webwork2 int…

80b663e

…o locbug

Merge branch 'develop' of https://github.com/openwebwork/webwork2 int…

f2b1564

…o locbug

Freeze/thaw to base64 because mysql fields are varchar

b0bfe73

update localization files.

200c5b8

Support for transition to freeze_base64

ea7147f

Polishing error handling

3d3795c

Misspelled method.

eaf0f2f

Merge branch 'locbug' of https://github.com/goehle/webwork2 into locbug

239790f

Used utf8::valid when I should have used utf8::is_utf8

1b083a0

Whoops.

6e2fbfd

Wrong maketext

e85eb00

Merge branch 'locbug' of https://github.com/goehle/webwork2 into deve…

054666f

…lop_uft8_ver2 # Conflicts: # lib/WeBWorK/ContentGenerator/Instructor/SendMail.pm # lib/WeBWorK/Utils.pm

local experimental changes

7541f5d

add utf-8 support to OPL-update (needs testing)

20c3237

add use utf8; to WWSafe.pm

4bee27e

mgage and others added 11 commits January 1, 2019 21:40

Merge branch 'develop_laptop' into develop_w_develop_candidate+develo…

3da5e96

…p_candidate

Fix error in opening taxonomy files. They should be opened for reading.

a57dddf

update usage comment

595f7d8

switching hosted2 references back to demo.

3f2e055

this change was undone in some some places during recent merges.

two more occurrences of hosted2

a7fed53

Include Encode as a module which is shared with the Safe compartment

3f83a1b

Add link to WeBWorK/webwork2 library

2eb2a2d

also have a "home directory" printout for debugging.

Merge pull request #23 from taniwallach/tani_forbid_whitespace_and_em…

01cba1e

…pty_passwords_during_checkPassword PR 911+910+904 hotfixes to mgage new_develop_candidate_01_01_2019

Reconcile copyright dates

23ddd22

mgage mentioned this pull request Feb 25, 2019

New develop candidate 01 01 2019 #908

Closed

taniwallach added 4 commits March 6, 2019 14:43

Add a special config file for use in Docker, to have the MariaDB data…

d804537

…base use the utf8mb4 charset. The new file will be added to the running Docker MariabDB 10.1 container using changes to be made to docker-compose.yml in a following commit.

taniwallach self-requested a review March 6, 2019 16:13

This was referenced Mar 6, 2019

New develop candidate multilingual openwebwork/pg#390

Closed

Time-zone name in set definition files can be invalid "short time-zone names" and prevent importing the file #928

Closed

mgage added 2 commits March 8, 2019 22:32

Merge pull request #24 from taniwallach/tani_develop_candidate_02_24_…

d271ec5

…2019_utf8mb4 Tani develop candidate 02 24 2019 utf8mb4

Merge branch 'develop' into develop_candidate_02_24_2019

fef2d56

mgage mentioned this pull request Mar 12, 2019

Develop multi ling 03 10 2019 no mb4 #930

Closed

mgage closed this Mar 12, 2019

taniwallach mentioned this pull request Mar 18, 2019

LTI authentication fails when the LTI consumer provides data which contains UTF8 multi-byte characters #915

Closed

mgage deleted the develop_candidate_02_24_2019 branch April 15, 2019 00:48

mgage wants to merge 99 commits into openwebwork:develop from mgage:develop_candidate_02_24_2019

5 participants
