CuGenDBv2

Moc04g35260 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc04g35260
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein
Genome location: chr4:26527984..26536488
RNA-Seq Expression: Moc04g35260
Synteny: Moc04g35260
Gene Ontology terms:
GO:0009664 - plant-type cell wall organization (biological process)
GO:0000139 - Golgi membrane (cellular component)
GO:0016757 - transferase activity, transferring glycosyl groups (molecular function)
InterPro domains:
IPR003406 - Glycosyl transferase, family 14
IPR044174 - Glycosyltransferase BC10-like
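
The genome location above can be read programmatically. The sketch below is only an illustration: the helper name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re

def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end, span).

    Assumption: coordinates are 1-based and inclusive, so the span is
    end - start + 1. The source record does not state its convention.
    """
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return chrom, start, end, end - start + 1

chrom, start, end, span = parse_location("chr4:26527984..26536488")
print(chrom, start, end, span)
```

Under that assumption the locus spans 8505 bp on chr4.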


Homology
GenBank top hits: e-value, % identity, alignment
KAG6602423.1 Glycosyltransferase BC10, partial [Cucurbita argyrosperma subsp. sororia] | e-value: 3.3e-191 | identity: 91.34%
Query:  MKQKVAQRKALFKWKRKLAIVLLLVFCFGSFVLMQTRYGRVMMLASLHPQSVQEPKIAFLFIARNRLPLDILWDAFFLEGENKFSIFVHSRPGFLFNKAT
        MKQKV QRKALFKWKRKLA  LLLVFCFGS V MQ++YGRVMMLASL PQSVQEPKIAFLFIARNRLPLDI+WD FF EGENKFSIFVHSRPGFLFNKAT
Subjt:  MKQKVAQRKALFKWKRKLAIVLLLVFCFGSFVLMQTRYGRVMMLASLHPQSVQEPKIAFLFIARNRLPLDILWDAFFLEGENKFSIFVHSRPGFLFNKAT

Query:  TRSIYFLNRQVNDSIQVDWGEASMIEAERILLRHALTDTLNERFIFLSDSCIPLYNFSYTYDYVMSTSTSFVDSFADTKEGRYNPKMDPLIPVYNWRKGS
        TRSIYFLNRQVNDSIQVDWGEASMIEAERILLRHALTDT N+RFIFLSDSCIPLYNFSYTYDYVMSTSTSFVDSFADTKEGRYNPKMDP+IPV NWRKGS
Subjt:  TRSIYFLNRQVNDSIQVDWGEASMIEAERILLRHALTDTLNERFIFLSDSCIPLYNFSYTYDYVMSTSTSFVDSFADTKEGRYNPKMDPLIPVYNWRKGS

Query:  QWVVLTRKHANVVVKDSTVFPMFQQHCKRKSLPEFWRDRPFPNDAWKEHNCIPDEHYVQTLLAQEGLEGEVTRRSLSYSAWDLSFSKDNERRNWHPVTYK
        QWVVLTRKHA VVVKD+TVFPMFQQHCKRKSLPEFWRD PFP+D  KEHNCIPDEHYVQTLLAQEGLE E+TRRSL+YSAWDLS S+D+ERRNWHPVTYK
Subjt:  QWVVLTRKHANVVVKDSTVFPMFQQHCKRKSLPEFWRDRPFPNDAWKEHNCIPDEHYVQTLLAQEGLEGEVTRRSLSYSAWDLSFSKDNERRNWHPVTYK

Query:  FSDATLDLIQSIKSIDNIYYETEYRREWCTSKGRPSICFLFARKFTRPAALRLLNMIV
        FSDATLDLIQSIK IDNIYYETEYRREWCTSKG+PS CFLFARKFTRPAALRLLNM+V
Subjt:  FSDATLDLIQSIKSIDNIYYETEYRREWCTSKGRPSICFLFARKFTRPAALRLLNMIV

XP_022134469.1 uncharacterized protein LOC111006707 isoform X1 [Momordica charantia] | e-value: 3.9e-208 | identity: 98.35%
Query:  MKQKVAQRKALFKWKRKLAIVLLLVFCFGSFVLMQTRYGRVMMLASLHPQSVQEPKIAFLFIARNRLPLDILWDAFFLEGENKFSIFVHSRPGFLFNKAT
        MKQKVAQRKALFKWKRKLAIVLLLVFCFGSFVLMQTRYGRVMMLASLHPQSVQEPKIAFLFIARNRLPLDILWDAFFLEGENKFSIFVHSRPGFLFNKAT
Subjt:  MKQKVAQRKALFKWKRKLAIVLLLVFCFGSFVLMQTRYGRVMMLASLHPQSVQEPKIAFLFIARNRLPLDILWDAFFLEGENKFSIFVHSRPGFLFNKAT

Query:  TRSIYFLNRQVNDSIQVDWGEASMIEAERILLRHALTDTLNERFIFLSDSCIPLYNFSYTYDYVMSTSTSFVDSFADTKEGRYNPKMDPLIPVYNWRKGS
        TRSIYFLNRQVNDSIQVDWGEASMIEAERILLRHALTDTLNERFIFLSDSCIPLYNFSYTYDYVMSTSTSFVDSFADTKEGRYNPKMDPLIPVYNWRKGS
Subjt:  TRSIYFLNRQVNDSIQVDWGEASMIEAERILLRHALTDTLNERFIFLSDSCIPLYNFSYTYDYVMSTSTSFVDSFADTKEGRYNPKMDPLIPVYNWRKGS

Query:  QWVVLTRKHANVVVKDSTVFPMFQQHCKRKSLPEFWRDRPFPNDAWKEHNCIPDEHYVQTLLAQEGLEGEVTRRSLSYSAWDLSFSKDNERRNWHPVTYK
        QWVVLTRKHANVVVKDSTVFPMFQQHCKRKSLPEFWRDRPFPNDAWKEHNCIPDEHYVQTLLAQEGLEGEVTRRSLSYSAWDLSFSKDNERRNWHPVTYK
Subjt:  QWVVLTRKHANVVVKDSTVFPMFQQHCKRKSLPEFWRDRPFPNDAWKEHNCIPDEHYVQTLLAQEGLEGEVTRRSLSYSAWDLSFSKDNERRNWHPVTYK

Query:  FSDATLDLIQSIKSIDNIYYETEYRREWCTSKGRPSICFLFARKFTRPAALRLLNMIVLACLEQ
        FSDATLDLIQSIKSIDNIYYETEYRREWCTSKGRPSICFLFARKFTRPAALRLLNM  L    Q
Subjt:  FSDATLDLIQSIKSIDNIYYETEYRREWCTSKGRPSICFLFARKFTRPAALRLLNMIVLACLEQ

XP_022963443.1 uncharacterized protein LOC111463645 isoform X2 [Cucurbita moschata] | e-value: 3.3e-191 | identity: 88.47%
Query:  MKQKVAQRKALFKWKRKLAIVLLLVFCFGSFVLMQTRYGRVMMLASLHPQSVQEPKIAFLFIARNRLPLDILWDAFFLEGENKFSIFVHSRPGFLFNKAT
        MKQKV QRKALFKWKRKLA  LLLVFCFGS V MQ++YGRVMMLASL PQSVQEPKIAFLFIARNRLPLDI+WD FF EGENKFSIFVHSRPGFLFNKAT
Subjt:  MKQKVAQRKALFKWKRKLAIVLLLVFCFGSFVLMQTRYGRVMMLASLHPQSVQEPKIAFLFIARNRLPLDILWDAFFLEGENKFSIFVHSRPGFLFNKAT

Query:  TRSIYFLNRQVNDSIQVDWGEASMIEAERILLRHALTDTLNERFIFLSDSCIPLYNFSYTYDYVMSTSTSFVDSFADTKEGRYNPKMDPLIPVYNWRKGS
        TRSIYFLNRQVNDSIQVDWGEASMIEAERILLRHALTDT N+RFIFLSDSCIPLYNFSYTYDYVMSTSTSFVDSFAD+KEGRYNPKMDP+IPV NWRKGS
Subjt:  TRSIYFLNRQVNDSIQVDWGEASMIEAERILLRHALTDTLNERFIFLSDSCIPLYNFSYTYDYVMSTSTSFVDSFADTKEGRYNPKMDPLIPVYNWRKGS

Query:  QWVVLTRKHANVVVKDSTVFPMFQQHCKRKSLPEFWRDRPFPNDAWKEHNCIPDEHYVQTLLAQEGLEGEVTRRSLSYSAWDLSFSKDNERRNWHPVTYK
        QWVVLTRKHA VVVKD+TVFPMFQQHCKRKSLPEFWRD PFP+D  KEHNCIPDEHYVQTLLAQEGLE E+TRRSL+YSAWDLS S+D+ERRNWHPVTYK
Subjt:  QWVVLTRKHANVVVKDSTVFPMFQQHCKRKSLPEFWRDRPFPNDAWKEHNCIPDEHYVQTLLAQEGLEGEVTRRSLSYSAWDLSFSKDNERRNWHPVTYK

Query:  FSDATLDLIQSIKSIDNIYYETEYRREWCTSKGRPSICFLFARKFTRPAALRLLNMIVLACLEQEIHAEMEMK
        FSDATLDLIQSIK IDNIYYETEYRREWCTSKG+PS CFLFARKFTRPAALRLLNM+ +A L    H  +E+K
Subjt:  FSDATLDLIQSIKSIDNIYYETEYRREWCTSKGRPSICFLFARKFTRPAALRLLNMIVLACLEQEIHAEMEMK

XP_022989938.1 uncharacterized protein LOC111486977 isoform X2 [Cucurbita maxima] | e-value: 5.6e-191 | identity: 88.2%
Query:  MKQKVAQRKALFKWKRKLAIVLLLVFCFGSFVLMQTRYGRVMMLASLHPQSVQEPKIAFLFIARNRLPLDILWDAFFLEGENKFSIFVHSRPGFLFNKAT
        MKQKV QRKALFKWKRKLA+ LL VFCFGS V+MQ++YGRVMMLASL PQSVQEPKIAFLFIARNRLPLDI+WD FF EGENKFSIFVHSRPGFLFNKAT
Subjt:  MKQKVAQRKALFKWKRKLAIVLLLVFCFGSFVLMQTRYGRVMMLASLHPQSVQEPKIAFLFIARNRLPLDILWDAFFLEGENKFSIFVHSRPGFLFNKAT

Query:  TRSIYFLNRQVNDSIQVDWGEASMIEAERILLRHALTDTLNERFIFLSDSCIPLYNFSYTYDYVMSTSTSFVDSFADTKEGRYNPKMDPLIPVYNWRKGS
        TRSIYFLNRQVN SIQVDWGEASMIEAERILLRHALTDT N+RFIFLSDSCIPLYNFSYTYDYVMSTSTSFVDSFADTKEGRYNPKMDP+IPV NWRKGS
Subjt:  TRSIYFLNRQVNDSIQVDWGEASMIEAERILLRHALTDTLNERFIFLSDSCIPLYNFSYTYDYVMSTSTSFVDSFADTKEGRYNPKMDPLIPVYNWRKGS

Query:  QWVVLTRKHANVVVKDSTVFPMFQQHCKRKSLPEFWRDRPFPNDAWKEHNCIPDEHYVQTLLAQEGLEGEVTRRSLSYSAWDLSFSKDNERRNWHPVTYK
        QWVVLTRKHA VVVKD+TVFPMFQQHCKRKSLPEFWRD PFP+D  KEHNCIPDEHYVQTLLAQEGLE E+TRRSL+YSAWDLS S+D+ERRNWHPVTYK
Subjt:  QWVVLTRKHANVVVKDSTVFPMFQQHCKRKSLPEFWRDRPFPNDAWKEHNCIPDEHYVQTLLAQEGLEGEVTRRSLSYSAWDLSFSKDNERRNWHPVTYK

Query:  FSDATLDLIQSIKSIDNIYYETEYRREWCTSKGRPSICFLFARKFTRPAALRLLNMIVLACLEQEIHAEMEMK
        FSDATLDLIQSIK IDNIYYETEYRREWCTSKG+PS CFLFARKFTRPAALRLLNM+  A L    H  +E+K
Subjt:  FSDATLDLIQSIKSIDNIYYETEYRREWCTSKGRPSICFLFARKFTRPAALRLLNMIVLACLEQEIHAEMEMK

XP_038889176.1 glycosyltransferase BC10 [Benincasa hispida] | e-value: 5.6e-191 | identity: 91.09%
Query:  MKQKVAQRKALFKWKRKLAIVLLLVFCFGSFVLMQTRYGRVMMLASLHPQSVQEPKIAFLFIARNRLPLDILWDAFFLEGENKFSIFVHSRPGFLFNKAT
        MKQKVAQRKA+FKWKRKLAI LL+VFCFGS V+MQ+RYGRVMMLASLHPQS   PKIAFLFIARNRLPLDI+WD FF EGENKFSIFVHSRPGFLFNKAT
Subjt:  MKQKVAQRKALFKWKRKLAIVLLLVFCFGSFVLMQTRYGRVMMLASLHPQSVQEPKIAFLFIARNRLPLDILWDAFFLEGENKFSIFVHSRPGFLFNKAT

Query:  TRSIYFLNRQVNDSIQVDWGEASMIEAERILLRHALTDTLNERFIFLSDSCIPLYNFSYTYDYVMSTSTSFVDSFADTKEGRYNPKMDPLIPVYNWRKGS
        TRS YFLNRQVNDSIQVDWGEASMIEAERILLRHALTDT N+RFIFLSDSCIPLYNFSYTYDYVMSTS SFVDSFADTKEGRYNPKMDP+IPV+NWRKGS
Subjt:  TRSIYFLNRQVNDSIQVDWGEASMIEAERILLRHALTDTLNERFIFLSDSCIPLYNFSYTYDYVMSTSTSFVDSFADTKEGRYNPKMDPLIPVYNWRKGS

Query:  QWVVLTRKHANVVVKDSTVFPMFQQHCKRKSLPEFWRDRPFPNDAWKEHNCIPDEHYVQTLLAQEGLEGEVTRRSLSYSAWDLSFSKDNERRNWHPVTYK
        QWVVLTRKHA VVV D TVFPMFQQHCKRKSLPEFWRDRPFPNDA KEHNCIPDEHYVQTLLAQEGLE E+TRRSL+YSAWDLS+SKD+ERRNWHPVTYK
Subjt:  QWVVLTRKHANVVVKDSTVFPMFQQHCKRKSLPEFWRDRPFPNDAWKEHNCIPDEHYVQTLLAQEGLEGEVTRRSLSYSAWDLSFSKDNERRNWHPVTYK

Query:  FSDATLDLIQSIKSIDNIYYETEYRREWCTSKGRPSICFLFARKFTRPAALRLLNMIVL
        FSDATLDLIQSIK IDNIYYETEYRREWCTSKG+PS CFLFARKFTRPAALRLLNM  L
Subjt:  FSDATLDLIQSIKSIDNIYYETEYRREWCTSKGRPSICFLFARKFTRPAALRLLNMIVL

TrEMBL top hits: e-value, % identity, alignment
A0A6J1BYT7 uncharacterized protein LOC111006707 isoform X1 | e-value: 1.9e-208 | identity: 98.35%
Query:  MKQKVAQRKALFKWKRKLAIVLLLVFCFGSFVLMQTRYGRVMMLASLHPQSVQEPKIAFLFIARNRLPLDILWDAFFLEGENKFSIFVHSRPGFLFNKAT
        MKQKVAQRKALFKWKRKLAIVLLLVFCFGSFVLMQTRYGRVMMLASLHPQSVQEPKIAFLFIARNRLPLDILWDAFFLEGENKFSIFVHSRPGFLFNKAT
Subjt:  MKQKVAQRKALFKWKRKLAIVLLLVFCFGSFVLMQTRYGRVMMLASLHPQSVQEPKIAFLFIARNRLPLDILWDAFFLEGENKFSIFVHSRPGFLFNKAT

Query:  TRSIYFLNRQVNDSIQVDWGEASMIEAERILLRHALTDTLNERFIFLSDSCIPLYNFSYTYDYVMSTSTSFVDSFADTKEGRYNPKMDPLIPVYNWRKGS
        TRSIYFLNRQVNDSIQVDWGEASMIEAERILLRHALTDTLNERFIFLSDSCIPLYNFSYTYDYVMSTSTSFVDSFADTKEGRYNPKMDPLIPVYNWRKGS
Subjt:  TRSIYFLNRQVNDSIQVDWGEASMIEAERILLRHALTDTLNERFIFLSDSCIPLYNFSYTYDYVMSTSTSFVDSFADTKEGRYNPKMDPLIPVYNWRKGS

Query:  QWVVLTRKHANVVVKDSTVFPMFQQHCKRKSLPEFWRDRPFPNDAWKEHNCIPDEHYVQTLLAQEGLEGEVTRRSLSYSAWDLSFSKDNERRNWHPVTYK
        QWVVLTRKHANVVVKDSTVFPMFQQHCKRKSLPEFWRDRPFPNDAWKEHNCIPDEHYVQTLLAQEGLEGEVTRRSLSYSAWDLSFSKDNERRNWHPVTYK
Subjt:  QWVVLTRKHANVVVKDSTVFPMFQQHCKRKSLPEFWRDRPFPNDAWKEHNCIPDEHYVQTLLAQEGLEGEVTRRSLSYSAWDLSFSKDNERRNWHPVTYK

Query:  FSDATLDLIQSIKSIDNIYYETEYRREWCTSKGRPSICFLFARKFTRPAALRLLNMIVLACLEQ
        FSDATLDLIQSIKSIDNIYYETEYRREWCTSKGRPSICFLFARKFTRPAALRLLNM  L    Q
Subjt:  FSDATLDLIQSIKSIDNIYYETEYRREWCTSKGRPSICFLFARKFTRPAALRLLNMIVLACLEQ

A0A6J1HG52 uncharacterized protein LOC111463645 isoform X1 | e-value: 3.9e-190 | identity: 88.24%
Query:  MKQKVAQRKALFKWKRKLAIVLLLVFCFGSFVLMQTRYGRVMMLASLHPQSVQEPKIAFLFIARNRLPLDILWDAFF-LEGENKFSIFVHSRPGFLFNKA
        MKQKV QRKALFKWKRKLA  LLLVFCFGS V MQ++YGRVMMLASL PQSVQEPKIAFLFIARNRLPLDI+WD FF  EGENKFSIFVHSRPGFLFNKA
Subjt:  MKQKVAQRKALFKWKRKLAIVLLLVFCFGSFVLMQTRYGRVMMLASLHPQSVQEPKIAFLFIARNRLPLDILWDAFF-LEGENKFSIFVHSRPGFLFNKA

Query:  TTRSIYFLNRQVNDSIQVDWGEASMIEAERILLRHALTDTLNERFIFLSDSCIPLYNFSYTYDYVMSTSTSFVDSFADTKEGRYNPKMDPLIPVYNWRKG
        TTRSIYFLNRQVNDSIQVDWGEASMIEAERILLRHALTDT N+RFIFLSDSCIPLYNFSYTYDYVMSTSTSFVDSFAD+KEGRYNPKMDP+IPV NWRKG
Subjt:  TTRSIYFLNRQVNDSIQVDWGEASMIEAERILLRHALTDTLNERFIFLSDSCIPLYNFSYTYDYVMSTSTSFVDSFADTKEGRYNPKMDPLIPVYNWRKG

Query:  SQWVVLTRKHANVVVKDSTVFPMFQQHCKRKSLPEFWRDRPFPNDAWKEHNCIPDEHYVQTLLAQEGLEGEVTRRSLSYSAWDLSFSKDNERRNWHPVTY
        SQWVVLTRKHA VVVKD+TVFPMFQQHCKRKSLPEFWRD PFP+D  KEHNCIPDEHYVQTLLAQEGLE E+TRRSL+YSAWDLS S+D+ERRNWHPVTY
Subjt:  SQWVVLTRKHANVVVKDSTVFPMFQQHCKRKSLPEFWRDRPFPNDAWKEHNCIPDEHYVQTLLAQEGLEGEVTRRSLSYSAWDLSFSKDNERRNWHPVTY

Query:  KFSDATLDLIQSIKSIDNIYYETEYRREWCTSKGRPSICFLFARKFTRPAALRLLNMIVLACLEQEIHAEMEMK
        KFSDATLDLIQSIK IDNIYYETEYRREWCTSKG+PS CFLFARKFTRPAALRLLNM+ +A L    H  +E+K
Subjt:  KFSDATLDLIQSIKSIDNIYYETEYRREWCTSKGRPSICFLFARKFTRPAALRLLNMIVLACLEQEIHAEMEMK

A0A6J1HHT9 uncharacterized protein LOC111463645 isoform X2 | e-value: 1.6e-191 | identity: 88.47%
Query:  MKQKVAQRKALFKWKRKLAIVLLLVFCFGSFVLMQTRYGRVMMLASLHPQSVQEPKIAFLFIARNRLPLDILWDAFFLEGENKFSIFVHSRPGFLFNKAT
        MKQKV QRKALFKWKRKLA  LLLVFCFGS V MQ++YGRVMMLASL PQSVQEPKIAFLFIARNRLPLDI+WD FF EGENKFSIFVHSRPGFLFNKAT
Subjt:  MKQKVAQRKALFKWKRKLAIVLLLVFCFGSFVLMQTRYGRVMMLASLHPQSVQEPKIAFLFIARNRLPLDILWDAFFLEGENKFSIFVHSRPGFLFNKAT

Query:  TRSIYFLNRQVNDSIQVDWGEASMIEAERILLRHALTDTLNERFIFLSDSCIPLYNFSYTYDYVMSTSTSFVDSFADTKEGRYNPKMDPLIPVYNWRKGS
        TRSIYFLNRQVNDSIQVDWGEASMIEAERILLRHALTDT N+RFIFLSDSCIPLYNFSYTYDYVMSTSTSFVDSFAD+KEGRYNPKMDP+IPV NWRKGS
Subjt:  TRSIYFLNRQVNDSIQVDWGEASMIEAERILLRHALTDTLNERFIFLSDSCIPLYNFSYTYDYVMSTSTSFVDSFADTKEGRYNPKMDPLIPVYNWRKGS

Query:  QWVVLTRKHANVVVKDSTVFPMFQQHCKRKSLPEFWRDRPFPNDAWKEHNCIPDEHYVQTLLAQEGLEGEVTRRSLSYSAWDLSFSKDNERRNWHPVTYK
        QWVVLTRKHA VVVKD+TVFPMFQQHCKRKSLPEFWRD PFP+D  KEHNCIPDEHYVQTLLAQEGLE E+TRRSL+YSAWDLS S+D+ERRNWHPVTYK
Subjt:  QWVVLTRKHANVVVKDSTVFPMFQQHCKRKSLPEFWRDRPFPNDAWKEHNCIPDEHYVQTLLAQEGLEGEVTRRSLSYSAWDLSFSKDNERRNWHPVTYK

Query:  FSDATLDLIQSIKSIDNIYYETEYRREWCTSKGRPSICFLFARKFTRPAALRLLNMIVLACLEQEIHAEMEMK
        FSDATLDLIQSIK IDNIYYETEYRREWCTSKG+PS CFLFARKFTRPAALRLLNM+ +A L    H  +E+K
Subjt:  FSDATLDLIQSIKSIDNIYYETEYRREWCTSKGRPSICFLFARKFTRPAALRLLNMIVLACLEQEIHAEMEMK

A0A6J1JH71 uncharacterized protein LOC111486977 isoform X1 | e-value: 6.7e-190 | identity: 87.97%
Query:  MKQKVAQRKALFKWKRKLAIVLLLVFCFGSFVLMQTRYGRVMMLASLHPQSVQEPKIAFLFIARNRLPLDILWDAFF-LEGENKFSIFVHSRPGFLFNKA
        MKQKV QRKALFKWKRKLA+ LL VFCFGS V+MQ++YGRVMMLASL PQSVQEPKIAFLFIARNRLPLDI+WD FF  EGENKFSIFVHSRPGFLFNKA
Subjt:  MKQKVAQRKALFKWKRKLAIVLLLVFCFGSFVLMQTRYGRVMMLASLHPQSVQEPKIAFLFIARNRLPLDILWDAFF-LEGENKFSIFVHSRPGFLFNKA

Query:  TTRSIYFLNRQVNDSIQVDWGEASMIEAERILLRHALTDTLNERFIFLSDSCIPLYNFSYTYDYVMSTSTSFVDSFADTKEGRYNPKMDPLIPVYNWRKG
        TTRSIYFLNRQVN SIQVDWGEASMIEAERILLRHALTDT N+RFIFLSDSCIPLYNFSYTYDYVMSTSTSFVDSFADTKEGRYNPKMDP+IPV NWRKG
Subjt:  TTRSIYFLNRQVNDSIQVDWGEASMIEAERILLRHALTDTLNERFIFLSDSCIPLYNFSYTYDYVMSTSTSFVDSFADTKEGRYNPKMDPLIPVYNWRKG

Query:  SQWVVLTRKHANVVVKDSTVFPMFQQHCKRKSLPEFWRDRPFPNDAWKEHNCIPDEHYVQTLLAQEGLEGEVTRRSLSYSAWDLSFSKDNERRNWHPVTY
        SQWVVLTRKHA VVVKD+TVFPMFQQHCKRKSLPEFWRD PFP+D  KEHNCIPDEHYVQTLLAQEGLE E+TRRSL+YSAWDLS S+D+ERRNWHPVTY
Subjt:  SQWVVLTRKHANVVVKDSTVFPMFQQHCKRKSLPEFWRDRPFPNDAWKEHNCIPDEHYVQTLLAQEGLEGEVTRRSLSYSAWDLSFSKDNERRNWHPVTY

Query:  KFSDATLDLIQSIKSIDNIYYETEYRREWCTSKGRPSICFLFARKFTRPAALRLLNMIVLACLEQEIHAEMEMK
        KFSDATLDLIQSIK IDNIYYETEYRREWCTSKG+PS CFLFARKFTRPAALRLLNM+  A L    H  +E+K
Subjt:  KFSDATLDLIQSIKSIDNIYYETEYRREWCTSKGRPSICFLFARKFTRPAALRLLNMIVLACLEQEIHAEMEMK

A0A6J1JRS9 uncharacterized protein LOC111486977 isoform X2 | e-value: 2.7e-191 | identity: 88.2%
Query:  MKQKVAQRKALFKWKRKLAIVLLLVFCFGSFVLMQTRYGRVMMLASLHPQSVQEPKIAFLFIARNRLPLDILWDAFFLEGENKFSIFVHSRPGFLFNKAT
        MKQKV QRKALFKWKRKLA+ LL VFCFGS V+MQ++YGRVMMLASL PQSVQEPKIAFLFIARNRLPLDI+WD FF EGENKFSIFVHSRPGFLFNKAT
Subjt:  MKQKVAQRKALFKWKRKLAIVLLLVFCFGSFVLMQTRYGRVMMLASLHPQSVQEPKIAFLFIARNRLPLDILWDAFFLEGENKFSIFVHSRPGFLFNKAT

Query:  TRSIYFLNRQVNDSIQVDWGEASMIEAERILLRHALTDTLNERFIFLSDSCIPLYNFSYTYDYVMSTSTSFVDSFADTKEGRYNPKMDPLIPVYNWRKGS
        TRSIYFLNRQVN SIQVDWGEASMIEAERILLRHALTDT N+RFIFLSDSCIPLYNFSYTYDYVMSTSTSFVDSFADTKEGRYNPKMDP+IPV NWRKGS
Subjt:  TRSIYFLNRQVNDSIQVDWGEASMIEAERILLRHALTDTLNERFIFLSDSCIPLYNFSYTYDYVMSTSTSFVDSFADTKEGRYNPKMDPLIPVYNWRKGS

Query:  QWVVLTRKHANVVVKDSTVFPMFQQHCKRKSLPEFWRDRPFPNDAWKEHNCIPDEHYVQTLLAQEGLEGEVTRRSLSYSAWDLSFSKDNERRNWHPVTYK
        QWVVLTRKHA VVVKD+TVFPMFQQHCKRKSLPEFWRD PFP+D  KEHNCIPDEHYVQTLLAQEGLE E+TRRSL+YSAWDLS S+D+ERRNWHPVTYK
Subjt:  QWVVLTRKHANVVVKDSTVFPMFQQHCKRKSLPEFWRDRPFPNDAWKEHNCIPDEHYVQTLLAQEGLEGEVTRRSLSYSAWDLSFSKDNERRNWHPVTYK

Query:  FSDATLDLIQSIKSIDNIYYETEYRREWCTSKGRPSICFLFARKFTRPAALRLLNMIVLACLEQEIHAEMEMK
        FSDATLDLIQSIK IDNIYYETEYRREWCTSKG+PS CFLFARKFTRPAALRLLNM+  A L    H  +E+K
Subjt:  FSDATLDLIQSIKSIDNIYYETEYRREWCTSKGRPSICFLFARKFTRPAALRLLNMIVLACLEQEIHAEMEMK

SwissProt top hits: e-value, % identity, alignment
Q65XS5 Glycosyltransferase BC10 | e-value: 1.5e-138 | identity: 74.35%
Query:  KIAFLFIARNRLPLDILWDAFFL-EGENKFSIFVHSRPGFLFNKATTRSIYFLNRQVNDSIQVDWGEASMIEAERILLRHALTDTLNERFIFLSDSCIPL
        ++AFLFIARNRLPLD++WDAFF  + E +FSIFVHSRPGF+  +ATTRS +F NRQVN+S+QVDWGEASMIEAER+LL HAL D LNERF+F+SDSC+PL
Subjt:  KIAFLFIARNRLPLDILWDAFFL-EGENKFSIFVHSRPGFLFNKATTRSIYFLNRQVNDSIQVDWGEASMIEAERILLRHALTDTLNERFIFLSDSCIPL

Query:  YNFSYTYDYVMSTSTSFVDSFADTKEGRYNPKMDPLIPVYNWRKGSQWVVLTRKHANVVVKDSTVFPMFQQHCKRKSLPEFWR--DRPFPNDAWKEHNCI
        YNF+YTYDY+MS+STSFVDSFADTK GRYNP+MDP+IPV NWRKGSQW VLTRKHA VVV+D  V P FQ+HC+R+ LPEFWR  DRP P +AWK HNCI
Subjt:  YNFSYTYDYVMSTSTSFVDSFADTKEGRYNPKMDPLIPVYNWRKGSQWVVLTRKHANVVVKDSTVFPMFQQHCKRKSLPEFWR--DRPFPNDAWKEHNCI

Query:  PDEHYVQTLLAQEGLEGEVTRRSLSYSAWDLSFSKDNERRNWHPVTYKFSDATLDLIQSIKSIDNIYYETEYRREWCTSKGRPSICFLFARKFTRPAALR
        PDEHYVQTLLAQ GLE E+TRRS+++SAWDLS SKD ERR WHPVTYK SDAT  L++SIK IDNIYYETE R+EWCTS G+P+ CFLFARKFTR A L+
Subjt:  PDEHYVQTLLAQEGLEGEVTRRSLSYSAWDLSFSKDNERRNWHPVTYKFSDATLDLIQSIKSIDNIYYETEYRREWCTSKGRPSICFLFARKFTRPAALR

Query:  LLNMIVLA
        LL++ ++A
Subjt:  LLNMIVLA

Arabidopsis top hits: e-value, % identity, alignment
AT1G11940.1 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein | e-value: 5.6e-96 | identity: 49.58%
Query:  KLAIVLLLVFCFGSFVLMQTRYGRVMMLA----------SLHPQSVQEPKIAFLFIARNRLPLDILWDAFFLEGEN-KFSIFVHSRPGFLFNKATTRSIY
        KL I   +  C  + + +Q +Y     L+           LH  S   PK+AFLF+AR  LPLD +WD FF   ++  FSI++HS PGF+FN+ TTRS Y
Subjt:  KLAIVLLLVFCFGSFVLMQTRYGRVMMLA----------SLHPQSVQEPKIAFLFIARNRLPLDILWDAFFLEGEN-KFSIFVHSRPGFLFNKATTRSIY

Query:  FLNRQVNDSIQVDWGEASMIEAERILLRHALTDTLNERFIFLSDSCIPLYNFSYTYDYVMSTSTSFVDSFADTKEGRYNPKMDPLIPVYNWRKGSQWVVL
        F NRQ+N+SI+V WGE+SMIEAER+LL  AL D  N+RF+ LSD C PLY+F Y Y Y++S+  SFVDSF  TKE RY+ KM P+IP   WRKGSQW+ L
Subjt:  FLNRQVNDSIQVDWGEASMIEAERILLRHALTDTLNERFIFLSDSCIPLYNFSYTYDYVMSTSTSFVDSFADTKEGRYNPKMDPLIPVYNWRKGSQWVVL

Query:  TRKHANVVVKDSTVFPMFQQHCKRKSLPEFWRDRPFPNDAW-----KEHNCIPDEHYVQTLLAQEGLEGEVTRRSLSYSAWDLSFSKDNERRNWHPVTYK
         R HA V+V D  VFP+F++ CKR   P         N+AW     K  NCIPDEHYVQTLL  +GLE E+ RR+++Y+ W++S +K  E ++WHPVT+ 
Subjt:  TRKHANVVVKDSTVFPMFQQHCKRKSLPEFWRDRPFPNDAW-----KEHNCIPDEHYVQTLLAQEGLEGEVTRRSLSYSAWDLSFSKDNERRNWHPVTYK

Query:  FSDATLDLIQSIKSIDNIYYETEYRREWCTSKGRPSICFLFARKFTRPAALRLLN
          ++  + I+ IK ID++YYE+E R EWC +  +P  CFLFARKFT  AA+R+++
Subjt:  FSDATLDLIQSIKSIDNIYYETEYRREWCTSKGRPSICFLFARKFTRPAALRLLN

AT1G62305.1 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein | e-value: 1.2e-93 | identity: 49.86%
Query:  FKWKRKL----AIVLLLVFCF----GSFVLMQTRYGRVMMLASLHP---QSVQEPKIAFLFIARNRLPLDILWDAFFLEGENK-FSIFVHSRPGFLFNKA
        F+WK  +    A+ +L +FC      S     T    + +  S  P    S   PK+AFLF+AR  LPLD LWD FF   + + FSI+VHS PGF+F+++
Subjt:  FKWKRKL----AIVLLLVFCF----GSFVLMQTRYGRVMMLASLHP---QSVQEPKIAFLFIARNRLPLDILWDAFFLEGENK-FSIFVHSRPGFLFNKA

Query:  TTRSIYFLNRQVNDSIQVDWGEASMIEAERILLRHALTDTLNERFIFLSDSCIPLYNFSYTYDYVMSTSTSFVDSFADTKEGRYNPKMDPLIPVYNWRKG
        +TRS +F NRQ+ +SI+V WGE+SMI AER+LL  AL D  N+RF+ LSDSC+PLY+F Y Y Y++S+  SFVDSF D K+ RY  KM P+I    WRKG
Subjt:  TTRSIYFLNRQVNDSIQVDWGEASMIEAERILLRHALTDTLNERFIFLSDSCIPLYNFSYTYDYVMSTSTSFVDSFADTKEGRYNPKMDPLIPVYNWRKG

Query:  SQWVVLTRKHANVVVKDSTVFPMFQQHCKRKSLPEFWRDRPFPNDAW-----KEHNCIPDEHYVQTLLAQEGLEGEVTRRSLSYSAWDLSFSKDNERRNW
        SQW+ L R HA V+V D TVFP+FQ+ CKR SLP        P   W     + HNCIPDEHYVQTLL   GLE E+ RR+++Y+ W+LS +K  E ++W
Subjt:  SQWVVLTRKHANVVVKDSTVFPMFQQHCKRKSLPEFWRDRPFPNDAW-----KEHNCIPDEHYVQTLLAQEGLEGEVTRRSLSYSAWDLSFSKDNERRNW

Query:  HPVTYKFSDATLDLIQSIKSIDNIYYETEYRREWCTSKGRPSICFLFARKFTRPAALRLLN
        HP+T+   +   + I+ IK I+++YYE+EYR EWC +  +P  CFLFARKFTR AA+RLL+
Subjt:  HPVTYKFSDATLDLIQSIKSIDNIYYETEYRREWCTSKGRPSICFLFARKFTRPAALRLLN

AT1G62305.2 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein | e-value: 1.1e-80 | identity: 45.98%
Query:  FKWKRKL----AIVLLLVFCF----GSFVLMQTRYGRVMMLASLHP---QSVQEPKIAFLFIARNRLPLDILWDAFFLEGENK-FSIFVHSRPGFLFNKA
        F+WK  +    A+ +L +FC      S     T    + +  S  P    S   PK+AFLF+AR  LPLD LWD FF   + + FSI+VHS PGF+F+++
Subjt:  FKWKRKL----AIVLLLVFCF----GSFVLMQTRYGRVMMLASLHP---QSVQEPKIAFLFIARNRLPLDILWDAFFLEGENK-FSIFVHSRPGFLFNKA

Query:  TTRSIYFLNRQVNDSIQVDWGEASMIEAERILLRHALTDTLNERFIFLSDSCIPLYNFSYTYDYVMSTSTSFVDSFADTKEGRYNPKMDPLIPVYNWRKG
        +TRS +F NRQ+ +SI+V WGE+SMI AER+LL  AL D  N+RF+ LSD                    SF+D     K+ RY  KM P+I    WRKG
Subjt:  TTRSIYFLNRQVNDSIQVDWGEASMIEAERILLRHALTDTLNERFIFLSDSCIPLYNFSYTYDYVMSTSTSFVDSFADTKEGRYNPKMDPLIPVYNWRKG

Query:  SQWVVLTRKHANVVVKDSTVFPMFQQHCKRKSLPEFWRDRPFPNDAW-----KEHNCIPDEHYVQTLLAQEGLEGEVTRRSLSYSAWDLSFSKDNERRNW
        SQW+ L R HA V+V D TVFP+FQ+ CKR SLP        P   W     + HNCIPDEHYVQTLL   GLE E+ RR+++Y+ W+LS +K  E ++W
Subjt:  SQWVVLTRKHANVVVKDSTVFPMFQQHCKRKSLPEFWRDRPFPNDAW-----KEHNCIPDEHYVQTLLAQEGLEGEVTRRSLSYSAWDLSFSKDNERRNW

Query:  HPVTYKFSDATLDLIQSIKSIDNIYYETEYRREWCTSKGRPSICFLFARKFTRPAALRLLN
        HP+T+   +   + I+ IK I+++YYE+EYR EWC +  +P  CFLFARKFTR AA+RLL+
Subjt:  HPVTYKFSDATLDLIQSIKSIDNIYYETEYRREWCTSKGRPSICFLFARKFTRPAALRLLN

AT5G14550.1 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein | e-value: 1.6e-159 | identity: 71.47%
Query:  MKQKVAQRKALFKWKRKLAIVLLLVFCFGSFVLMQTRYGRVM-----MLASL----HPQSVQEPKIAFLFIARNRLPLDILWDAFFLEGENKFSIFVHSR
        MK+KV+Q+K L++WKRK+   L+  FCFG+FV +Q R+  +      + ASL     P+  Q P+IAFLFIARNRLPL+ +WDAFF   + KFSI+VHSR
Subjt:  MKQKVAQRKALFKWKRKLAIVLLLVFCFGSFVLMQTRYGRVM-----MLASL----HPQSVQEPKIAFLFIARNRLPLDILWDAFFLEGENKFSIFVHSR

Query:  PGFLFNKATTRSIYFLNRQVNDSIQVDWGEASMIEAERILLRHALTDTLNERFIFLSDSCIPLYNFSYTYDYVMSTSTSFVDSFADTKEGRYNPKMDPLI
        PGF+ N+ATTRS YFL+RQ+NDSIQVDWGE++MIEAER+LLRHAL D+ N RF+FLSDSCIPLY+FSYTY+Y+MST TSFVDSFADTK+ RYNP+M+P+I
Subjt:  PGFLFNKATTRSIYFLNRQVNDSIQVDWGEASMIEAERILLRHALTDTLNERFIFLSDSCIPLYNFSYTYDYVMSTSTSFVDSFADTKEGRYNPKMDPLI

Query:  PVYNWRKGSQWVVLTRKHANVVVKDSTVFPMFQQHCKRKSLPEFWRDRPFPNDAWKEHNCIPDEHYVQTLLAQEGLEGEVTRRSLSYSAWDLSFSKDNER
        PV NWRKGSQWVVL RKHA +VV D++VFPMFQQHC+RKSLPEFWRDRP P + WKEHNCIPDEHYVQTLL+Q+G++ E+TRRSL++SAWDLS SK NER
Subjt:  PVYNWRKGSQWVVLTRKHANVVVKDSTVFPMFQQHCKRKSLPEFWRDRPFPNDAWKEHNCIPDEHYVQTLLAQEGLEGEVTRRSLSYSAWDLSFSKDNER

Query:  RNWHPVTYKFSDATLDLIQSIKSIDNIYYETEYRREWCTSKGRPSICFLFARKFTRPAALRLLNMIVL
        R WHP+TYKFSDAT DLIQSIK IDNI YETEYRREWC+SKG+PS CFLFARKFTRPAALRLL   +L
Subjt:  RNWHPVTYKFSDATLDLIQSIKSIDNIYYETEYRREWCTSKGRPSICFLFARKFTRPAALRLLNMIVL

AT5G14550.2 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein | e-value: 6.7e-126 | identity: 66.46%
Query:  MKQKVAQRKALFKWKRKLAIVLLLVFCFGSFVLMQTRYGRVM-----MLASL----HPQSVQEPKIAFLFIARNRLPLDILWDAFFLEGENKFSIFVHSR
        MK+KV+Q+K L++WKRK+   L+  FCFG+FV +Q R+  +      + ASL     P+  Q P+IAFLFIARNRLPL+ +WDAFF   + KFSI+VHSR
Subjt:  MKQKVAQRKALFKWKRKLAIVLLLVFCFGSFVLMQTRYGRVM-----MLASL----HPQSVQEPKIAFLFIARNRLPLDILWDAFFLEGENKFSIFVHSR

Query:  PGFLFNKATTRSIYFLNRQVNDSIQVDWGEASMIEAERILLRHALTDTLNERFIFLSDSCIPLYNFSYTYDYVMSTSTSFVDSFADTKEGRYNPKMDPLI
        PGF+ N+ATTRS YFL+RQ+NDSIQVDWGE++MIEAER+LLRHAL D+ N RF+FLSDSCIPLY+FSYTY+Y+MST TSFVDSFADTK+ RYNP+M+P+I
Subjt:  PGFLFNKATTRSIYFLNRQVNDSIQVDWGEASMIEAERILLRHALTDTLNERFIFLSDSCIPLYNFSYTYDYVMSTSTSFVDSFADTKEGRYNPKMDPLI

Query:  PVYNWRKGSQWVVLTRKHANVVVKDSTVFPMFQQHCKRKSLPEFWRDRPFPNDAWKEHNCIPDEHYVQTLLAQEGLEGEVTRRSLSYSAWDLSFSKDNER
        PV NWRKGSQWVVL RKHA +VV D++VFPMFQQHC+             P + WKEHNCIPDEHYVQTLL+Q+G++ E+TRRSL++SAWDLS SK NER
Subjt:  PVYNWRKGSQWVVLTRKHANVVVKDSTVFPMFQQHCKRKSLPEFWRDRPFPNDAWKEHNCIPDEHYVQTLLAQEGLEGEVTRRSLSYSAWDLSFSKDNER

Query:  RNWHPVTYKFSDATLDLIQSIK
        R WHP+TYKFSDAT DLIQSIK
Subjt:  RNWHPVTYKFSDATLDLIQSIK
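
In the alignments above, the line between Query and Subjt follows the usual BLAST midline convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. As a rough sketch, identity and positive counts can be tallied from a midline as shown below. The function name `midline_stats` is hypothetical, and the example uses only the final eight-column block of the Q65XS5 alignment, so its figures describe that fragment and not the whole hit.

```python
def midline_stats(midline):
    """Count identities and positives in a BLAST-style midline string.

    Letters are identical residues, '+' is a conservative substitution,
    and spaces are mismatches or gap columns. The midline must keep its
    original width, including leading and trailing spaces.
    """
    length = len(midline)
    identical = sum(ch.isalpha() for ch in midline)
    positive = identical + midline.count("+")
    return identical, positive, length

# Final block of the Q65XS5 alignment: query "LLNMIVLA", midline "LL++ ++A".
identical, positive, length = midline_stats("LL++ ++A")
print(f"identities {identical}/{length}, positives {positive}/{length}")
```

To approximate a reported % identity, the midlines of all blocks of one hit would need to be concatenated first, with any whitespace lost during extraction restored.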


Sequences
CDS sequence
ATGAAGCAGAAGGTTGCGCAGCGGAAAGCGCTCTTCAAATGGAAGAGGAAGCTTGCGATTGTTCTATTGCTTGTATTTTGCTTCGGGAGCTTCGTCTTGATGCAG
ACTCGGTACGGTCGGGTTATGATGCTGGCGTCGTTGCATCCTCAATCAGTTCAGGAACCGAAAATCGCGTTTCTGTTCATAGCTCGAAACCGGCTTCCGTTGGAC
ATACTTTGGGATGCGTTTTTTCTGGAAGGGGAGAATAAGTTTTCAATCTTTGTTCACTCGAGGCCAGGGTTTTTATTTAACAAGGCGACGACAAGATCGATCTAT
TTTTTGAATCGTCAAGTTAATGACAGTATACAGGTAGACTGGGGTGAAGCAAGCATGATTGAGGCAGAACGTATATTGCTTAGACATGCCCTTACTGATACTTTG
AATGAGCGATTTATTTTTCTTTCTGACAGCTGCATACCTTTATACAACTTCAGCTACACATATGACTATGTCATGTCAACTTCAACTAGTTTTGTGGACAGTTTT
GCTGATACAAAGGAAGGGCGATACAATCCTAAAATGGATCCTCTAATTCCTGTTTACAACTGGAGGAAAGGATCTCAGTGGGTTGTATTGACAAGAAAGCATGCA
AATGTTGTGGTGAAGGACAGTACAGTCTTTCCTATGTTTCAACAGCATTGTAAGAGGAAGTCACTACCAGAGTTTTGGCGGGATCGTCCATTTCCTAATGATGCA
TGGAAGGAACACAATTGCATACCTGATGAACATTATGTTCAGACTTTATTGGCTCAAGAAGGGCTTGAAGGAGAAGTCACACGAAGATCACTTTCATATTCAGCA
TGGGATCTCTCATTCTCCAAAGACAATGAGCGTCGTAATTGGCATCCTGTAACATACAAATTTTCAGATGCTACTCTTGATCTAATACAATCTATAAAGAGTATT
GATAATATATATTATGAAACTGAATATCGAAGAGAATGGTGTACCAGTAAGGGAAGACCATCCATATGTTTTCTTTTTGCAAGGAAGTTCACCCGTCCAGCTGCC
CTCCGCCTTCTTAATATGATAGTCCTTGCCTGTCTTGAACAGGAGATCCATGCTGAAATGGAGATGAAATTGGATTATCCAGTCAGTTCTTCGTTCTGTTTTCGG
CATGGACAGAGCCAAAGAAGAAGCATAGGAAGAAGGGTAATGGTTTCAATGGAAAGTGGAAGATCCGCCATTAAAGAGCCCAAGAAAGCTGTGGAACAGAGCTTG
AGGAACGCGCCGCCCAGTGGAGCAAATCCAATCCAGAATAAGTAG
mRNA sequence
ATGAAGCAGAAGGTTGCGCAGCGGAAAGCGCTCTTCAAATGGAAGAGGAAGCTTGCGATTGTTCTATTGCTTGTATTTTGCTTCGGGAGCTTCGTCTTGATGCAG
ACTCGGTACGGTCGGGTTATGATGCTGGCGTCGTTGCATCCTCAATCAGTTCAGGAACCGAAAATCGCGTTTCTGTTCATAGCTCGAAACCGGCTTCCGTTGGAC
ATACTTTGGGATGCGTTTTTTCTGGAAGGGGAGAATAAGTTTTCAATCTTTGTTCACTCGAGGCCAGGGTTTTTATTTAACAAGGCGACGACAAGATCGATCTAT
TTTTTGAATCGTCAAGTTAATGACAGTATACAGGTAGACTGGGGTGAAGCAAGCATGATTGAGGCAGAACGTATATTGCTTAGACATGCCCTTACTGATACTTTG
AATGAGCGATTTATTTTTCTTTCTGACAGCTGCATACCTTTATACAACTTCAGCTACACATATGACTATGTCATGTCAACTTCAACTAGTTTTGTGGACAGTTTT
GCTGATACAAAGGAAGGGCGATACAATCCTAAAATGGATCCTCTAATTCCTGTTTACAACTGGAGGAAAGGATCTCAGTGGGTTGTATTGACAAGAAAGCATGCA
AATGTTGTGGTGAAGGACAGTACAGTCTTTCCTATGTTTCAACAGCATTGTAAGAGGAAGTCACTACCAGAGTTTTGGCGGGATCGTCCATTTCCTAATGATGCA
TGGAAGGAACACAATTGCATACCTGATGAACATTATGTTCAGACTTTATTGGCTCAAGAAGGGCTTGAAGGAGAAGTCACACGAAGATCACTTTCATATTCAGCA
TGGGATCTCTCATTCTCCAAAGACAATGAGCGTCGTAATTGGCATCCTGTAACATACAAATTTTCAGATGCTACTCTTGATCTAATACAATCTATAAAGAGTATT
GATAATATATATTATGAAACTGAATATCGAAGAGAATGGTGTACCAGTAAGGGAAGACCATCCATATGTTTTCTTTTTGCAAGGAAGTTCACCCGTCCAGCTGCC
CTCCGCCTTCTTAATATGATAGTCCTTGCCTGTCTTGAACAGGAGATCCATGCTGAAATGGAGATGAAATTGGATTATCCAGTCAGTTCTTCGTTCTGTTTTCGG
CATGGACAGAGCCAAAGAAGAAGCATAGGAAGAAGGGTAATGGTTTCAATGGAAAGTGGAAGATCCGCCATTAAAGAGCCCAAGAAAGCTGTGGAACAGAGCTTG
AGGAACGCGCCGCCCAGTGGAGCAAATCCAATCCAGAATAAGTAG
Protein sequence
MKQKVAQRKALFKWKRKLAIVLLLVFCFGSFVLMQTRYGRVMMLASLHPQSVQEPKIAFLFIARNRLPLDILWDAFFLEGENKFSIFVHSRPGFLFNKATTRSIY
FLNRQVNDSIQVDWGEASMIEAERILLRHALTDTLNERFIFLSDSCIPLYNFSYTYDYVMSTSTSFVDSFADTKEGRYNPKMDPLIPVYNWRKGSQWVVLTRKHA
NVVVKDSTVFPMFQQHCKRKSLPEFWRDRPFPNDAWKEHNCIPDEHYVQTLLAQEGLEGEVTRRSLSYSAWDLSFSKDNERRNWHPVTYKFSDATLDLIQSIKSI
DNIYYETEYRREWCTSKGRPSICFLFARKFTRPAALRLLNMIVLACLEQEIHAEMEMKLDYPVSSSFCFRHGQSQRRSIGRRVMVSMESGRSAIKEPKKAVEQSL
RNAPPSGANPIQNK
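
The protein sequence corresponds to a translation of the CDS. A minimal sketch of that check follows, assuming the standard genetic code (the record does not name a translation table) and using a hypothetical helper `translate`. Only the first line of the CDS is used here to keep the example short.

```python
# Standard genetic code, built in TCAG order (assumed; the record does not
# state which translation table was used).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((a, b, c) for a in BASES for b in BASES for c in BASES),
        AMINO_ACIDS,
    )
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)

# First line of the CDS sequence from this record (105 nt, 35 codons).
cds_start = (
    "ATGAAGCAGAAGGTTGCGCAGCGGAAAGCGCTCTTCAAATGGAAGAGGAAGCTTGCGATTGTT"
    "CTATTGCTTGTATTTTGCTTCGGGAGCTTCGTCTTGATGCAG"
)
print(translate(cds_start))
```

The fragment translates to the first 35 residues of the protein sequence listed above. Joining all CDS lines into one string before calling `translate` would extend the same check to the full sequence.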