CuGenDBv2

Sgr004371 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr004371
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: E3 ubiquitin-protein ligase RNF4-like
Genome location: tig00002854:56557..72289
RNA-Seq Expression: Sgr004371
Synteny: Sgr004371
Gene Ontology terms: GO:0046872 - metal ion binding (molecular function)
InterPro domains:
IPR001841 - Zinc finger, RING-type
IPR008700 - RIN4, pathogenic type III effector avirulence factor Avr cleavage site
IPR013083 - Zinc finger, RING/FYVE/PHD-type
IPR017907 - Zinc finger, RING-type, conserved site
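
The genome location field uses a contig:start..end notation. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not say so), the field can be parsed and the span length computed as below. `Location` and `parse_location` are hypothetical helpers for illustration, not part of CuGenDB.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    contig: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: 1-based, inclusive coordinates.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'contig:start..end' string into its three parts."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return Location(match.group(1), int(match.group(2)), int(match.group(3)))


loc = parse_location("tig00002854:56557..72289")
print(loc.contig, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the gene spans 15,733 bp of contig tig00002854.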


Homology
GenBank top hits (e-value, % identity, alignment)
XP_022158096.1 E3 ubiquitin-protein ligase RNF4 isoform X2 [Momordica charantia] | e-value: 2.1e-83 | identity: 89.19%
Query:  MSTQAMRGPTARGYHRRKSLELDLNSMPPSENRDQGETSQLGFQEAQVNQQQSVQPAMIDIEAIDDDVIESSARAFAEAKNKSRRNARKNVVDVDADERS
        MSTQAMRG TARGYHRRKSL+LDLNSMPPSENRDQGE SQLG QEA  NQQQ +QP MIDIEAIDDDVIESSARAFAEAKNKSRRNARKN+VDVDA+ERS
Subjt:  MSTQAMRGPTARGYHRRKSLELDLNSMPPSENRDQGETSQLGFQEAQVNQQQSVQPAMIDIEAIDDDVIESSARAFAEAKNKSRRNARKNVVDVDADERS

Query:  RVSNNNRSKRRKEVSNQRTVNVDLLINLEGSNSSMREKPVPPKEPKFNCPICMGPLVEETSTRCGHIFCKACIKAAIAVQNKCPT
        RVSNN RSKRR+EVSNQRTVN +LLINLE SNSS++EKPVPPKEPKF+CPICMGPLVEETSTRCGHIFCKACIKAAIAVQ+KCPT
Subjt:  RVSNNNRSKRRKEVSNQRTVNVDLLINLEGSNSSMREKPVPPKEPKFNCPICMGPLVEETSTRCGHIFCKACIKAAIAVQNKCPT

XP_022963653.1 E3 ubiquitin-protein ligase RNF4-like [Cucurbita moschata] | e-value: 3.2e-84 | identity: 89.19%
Query:  MSTQAMRGPTARGYHRRKSLELDLNSMPPSENRDQGETSQLGFQEAQVNQQQSVQPAMIDIEAIDDDVIESSARAFAEAKNKSRRNARKNVVDVDADERS
        MSTQAMRG TARGYHRRKSL+LDLNSMPP+ENRDQGETSQ GF EAQVNQQQSVQPAMIDIEAIDDDVIESSARAFAEAKNKSRRNARKN+VDVDA+ERS
Subjt:  MSTQAMRGPTARGYHRRKSLELDLNSMPPSENRDQGETSQLGFQEAQVNQQQSVQPAMIDIEAIDDDVIESSARAFAEAKNKSRRNARKNVVDVDADERS

Query:  RVSNNNRSKRRKEVSNQRTVNVDLLINLEGSNSSMREKPVPPKEPKFNCPICMGPLVEETSTRCGHIFCKACIKAAIAVQNKCPT
        R SNNNRSKRR++V NQ+TVN DLLINLEGSNSSM+EK  PPKEP F+CPICMGPLVEETSTRCGHIFCKACIK AIAVQ+KCPT
Subjt:  RVSNNNRSKRRKEVSNQRTVNVDLLINLEGSNSSMREKPVPPKEPKFNCPICMGPLVEETSTRCGHIFCKACIKAAIAVQNKCPT

XP_022967334.1 E3 ubiquitin-protein ligase RNF4-like [Cucurbita maxima] | e-value: 9.3e-84 | identity: 88.65%
Query:  MSTQAMRGPTARGYHRRKSLELDLNSMPPSENRDQGETSQLGFQEAQVNQQQSVQPAMIDIEAIDDDVIESSARAFAEAKNKSRRNARKNVVDVDADERS
        MST+AMRG TARGYHRRKSL+LDLNSMPP+ENRDQGETSQ GFQEAQVNQQQSVQPAMIDIEAIDDDVIESSARAFAEAKNKSRRNARKN+VDVDA+ERS
Subjt:  MSTQAMRGPTARGYHRRKSLELDLNSMPPSENRDQGETSQLGFQEAQVNQQQSVQPAMIDIEAIDDDVIESSARAFAEAKNKSRRNARKNVVDVDADERS

Query:  RVSNNNRSKRRKEVSNQRTVNVDLLINLEGSNSSMREKPVPPKEPKFNCPICMGPLVEETSTRCGHIFCKACIKAAIAVQNKCPT
        R SNNNRSKRR++V NQ+TVN DLLINLE SNSSM+EK  PPKEP F+CPICMGPLVEETSTRCGHIFCKACIK AIAVQ+KCPT
Subjt:  RVSNNNRSKRRKEVSNQRTVNVDLLINLEGSNSSMREKPVPPKEPKFNCPICMGPLVEETSTRCGHIFCKACIKAAIAVQNKCPT

XP_023554040.1 E3 ubiquitin-protein ligase RNF4-like [Cucurbita pepo subsp. pepo] | e-value: 5.4e-84 | identity: 89.19%
Query:  MSTQAMRGPTARGYHRRKSLELDLNSMPPSENRDQGETSQLGFQEAQVNQQQSVQPAMIDIEAIDDDVIESSARAFAEAKNKSRRNARKNVVDVDADERS
        MSTQAMRG TARGYHRRKSL+LDLNSMPP+ENRD GETSQ GFQEAQVNQQQSVQPAMIDIEAIDDDVIESSARAFAEAKNKSRRNARKN+VDVDA+ERS
Subjt:  MSTQAMRGPTARGYHRRKSLELDLNSMPPSENRDQGETSQLGFQEAQVNQQQSVQPAMIDIEAIDDDVIESSARAFAEAKNKSRRNARKNVVDVDADERS

Query:  RVSNNNRSKRRKEVSNQRTVNVDLLINLEGSNSSMREKPVPPKEPKFNCPICMGPLVEETSTRCGHIFCKACIKAAIAVQNKCPT
        R SNNNRSKRR++V NQ+TVN DLLINLEGSNSSM+EK  PPKEP F+CPICMGPLVEETSTRCGHIFCKACIK AIAVQ+KCPT
Subjt:  RVSNNNRSKRRKEVSNQRTVNVDLLINLEGSNSSMREKPVPPKEPKFNCPICMGPLVEETSTRCGHIFCKACIKAAIAVQNKCPT

XP_038888129.1 E3 ubiquitin-protein ligase RNF4-like [Benincasa hispida] | e-value: 2.7e-83 | identity: 88.65%
Query:  MSTQAMRGPTARGYHRRKSLELDLNSMPPSENRDQGETSQLGFQEAQVNQQQSVQPAMIDIEAIDDDVIESSARAFAEAKNKSRRNARKNVVDVDADERS
        MSTQAMRG TARGYHRRKSL+LDLNSMPP+ENRDQGETSQL FQEAQVNQQQ VQPAMIDIEAIDDDVIESSARAFAEAKNKSRRNARKN+VDVDA+E +
Subjt:  MSTQAMRGPTARGYHRRKSLELDLNSMPPSENRDQGETSQLGFQEAQVNQQQSVQPAMIDIEAIDDDVIESSARAFAEAKNKSRRNARKNVVDVDADERS

Query:  RVSNNNRSKRRKEVSNQRTVNVDLLINLEGSNSSMREKPVPPKEPKFNCPICMGPLVEETSTRCGHIFCKACIKAAIAVQNKCPT
        +VSNNNRSKRR++VSNQ+ VN DLLINLE SNSSM+ KP PPKEPKF+CPICMGPLVEETSTRCGHIFCKACI+AAIAVQNKCPT
Subjt:  RVSNNNRSKRRKEVSNQRTVNVDLLINLEGSNSSMREKPVPPKEPKFNCPICMGPLVEETSTRCGHIFCKACIKAAIAVQNKCPT
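
The % identity values in this table can be recomputed from the middle line of each alignment. The sketch below assumes the usual BLAST-style convention (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch or gap) and assumes that % identity is identities divided by alignment columns. `percent_identity` is a hypothetical helper, and the two strings are the midlines of the first GenBank hit (XP_022158096.1) with the leading indentation removed.

```python
def percent_identity(midlines):
    """Return (identities, columns, percent) for BLAST-style midlines.

    Each midline must keep its full alignment width, including any
    internal or trailing spaces, or the column count will be wrong.
    """
    midline = "".join(midlines)
    if not midline:
        raise ValueError("empty alignment")
    # Letters are identical positions; '+' and ' ' are not identities.
    identities = sum(ch.isalpha() for ch in midline)
    return identities, len(midline), 100.0 * identities / len(midline)


# Midlines of the alignment against XP_022158096.1 (Momordica charantia).
MIDLINES = [
    "MSTQAMRG TARGYHRRKSL+LDLNSMPPSENRDQGE SQLG QEA  NQQQ +QP MIDIEAIDDDVIESSARAFAEAKNKSRRNARKN+VDVDA+ERS",
    "RVSNN RSKRR+EVSNQRTVN +LLINLE SNSS++EKPVPPKEPKF+CPICMGPLVEETSTRCGHIFCKACIKAAIAVQ+KCPT",
]

identities, columns, pct = percent_identity(MIDLINES)
print(f"{identities}/{columns} identical = {pct:.2f}%")
```

For this hit the count is 165 identical positions over 185 columns, which rounds to the 89.19% listed in the table.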

TrEMBL top hits (e-value, % identity, alignment)
A0A1S3CCB4 E3 ubiquitin-protein ligase RNF4-like isoform X1 | e-value: 2.7e-81 | identity: 86.49%
Query:  MSTQAMRGPTARGYHRRKSLELDLNSMPPSENRDQGETSQLGFQEAQVNQQQSVQPAMIDIEAIDDDVIESSARAFAEAKNKSRRNARKNVVDVDADERS
        MSTQAMRG   RGYHRRKSL+LDLNSMPP+ENRDQGETSQL  Q+AQVNQQQ VQPAMIDIEAIDDDVIESSARAFAEAKNKSRRNARKNVVDVDA+ERS
Subjt:  MSTQAMRGPTARGYHRRKSLELDLNSMPPSENRDQGETSQLGFQEAQVNQQQSVQPAMIDIEAIDDDVIESSARAFAEAKNKSRRNARKNVVDVDADERS

Query:  RVSNNNRSKRRKEVSNQRTVNVDLLINLEGSNSSMREKPVPPKEPKFNCPICMGPLVEETSTRCGHIFCKACIKAAIAVQNKCPT
        RVSNNNRSKRR++VSN+ +VN DLLINLE SNSSM+ KP PPKEPKF+CPICMGPLVEETST+CGHIFCKACI+AAI VQ+KCPT
Subjt:  RVSNNNRSKRRKEVSNQRTVNVDLLINLEGSNSSMREKPVPPKEPKFNCPICMGPLVEETSTRCGHIFCKACIKAAIAVQNKCPT

A0A6J1DZZ9 E3 ubiquitin-protein ligase RNF4 isoform X1 | e-value: 3.2e-82 | identity: 88.24%
Query:  MSTQAMRGPTARGYHRRKSLELDLNSMPPSENRDQGETSQLGFQEAQVNQQQSVQPAMIDIEAIDDDVIESSARAFAEAKNKSRRNARKNVVDVDADERS
        MSTQAMRG TARGYHRRKSL+LDLNSMPPSENRDQGE SQLG QEA  NQQQ +QP MIDIEAIDDDVIESSARAFAEAKNKSRRNARKN+VDVDA+ERS
Subjt:  MSTQAMRGPTARGYHRRKSLELDLNSMPPSENRDQGETSQLGFQEAQVNQQQSVQPAMIDIEAIDDDVIESSARAFAEAKNKSRRNARKNVVDVDADERS

Query:  RVSNNNRSKRRKEVSNQRTVNVDLLINLEGSNSSM--REKPVPPKEPKFNCPICMGPLVEETSTRCGHIFCKACIKAAIAVQNKCPT
        RVSNN RSKRR+EVSNQRTVN +LLINLE SNSS+  +EKPVPPKEPKF+CPICMGPLVEETSTRCGHIFCKACIKAAIAVQ+KCPT
Subjt:  RVSNNNRSKRRKEVSNQRTVNVDLLINLEGSNSSM--REKPVPPKEPKFNCPICMGPLVEETSTRCGHIFCKACIKAAIAVQNKCPT

A0A6J1E010 E3 ubiquitin-protein ligase RNF4 isoform X2 | e-value: 1.0e-83 | identity: 89.19%
Query:  MSTQAMRGPTARGYHRRKSLELDLNSMPPSENRDQGETSQLGFQEAQVNQQQSVQPAMIDIEAIDDDVIESSARAFAEAKNKSRRNARKNVVDVDADERS
        MSTQAMRG TARGYHRRKSL+LDLNSMPPSENRDQGE SQLG QEA  NQQQ +QP MIDIEAIDDDVIESSARAFAEAKNKSRRNARKN+VDVDA+ERS
Subjt:  MSTQAMRGPTARGYHRRKSLELDLNSMPPSENRDQGETSQLGFQEAQVNQQQSVQPAMIDIEAIDDDVIESSARAFAEAKNKSRRNARKNVVDVDADERS

Query:  RVSNNNRSKRRKEVSNQRTVNVDLLINLEGSNSSMREKPVPPKEPKFNCPICMGPLVEETSTRCGHIFCKACIKAAIAVQNKCPT
        RVSNN RSKRR+EVSNQRTVN +LLINLE SNSS++EKPVPPKEPKF+CPICMGPLVEETSTRCGHIFCKACIKAAIAVQ+KCPT
Subjt:  RVSNNNRSKRRKEVSNQRTVNVDLLINLEGSNSSMREKPVPPKEPKFNCPICMGPLVEETSTRCGHIFCKACIKAAIAVQNKCPT

A0A6J1HFU4 E3 ubiquitin-protein ligase RNF4-like | e-value: 1.5e-84 | identity: 89.19%
Query:  MSTQAMRGPTARGYHRRKSLELDLNSMPPSENRDQGETSQLGFQEAQVNQQQSVQPAMIDIEAIDDDVIESSARAFAEAKNKSRRNARKNVVDVDADERS
        MSTQAMRG TARGYHRRKSL+LDLNSMPP+ENRDQGETSQ GF EAQVNQQQSVQPAMIDIEAIDDDVIESSARAFAEAKNKSRRNARKN+VDVDA+ERS
Subjt:  MSTQAMRGPTARGYHRRKSLELDLNSMPPSENRDQGETSQLGFQEAQVNQQQSVQPAMIDIEAIDDDVIESSARAFAEAKNKSRRNARKNVVDVDADERS

Query:  RVSNNNRSKRRKEVSNQRTVNVDLLINLEGSNSSMREKPVPPKEPKFNCPICMGPLVEETSTRCGHIFCKACIKAAIAVQNKCPT
        R SNNNRSKRR++V NQ+TVN DLLINLEGSNSSM+EK  PPKEP F+CPICMGPLVEETSTRCGHIFCKACIK AIAVQ+KCPT
Subjt:  RVSNNNRSKRRKEVSNQRTVNVDLLINLEGSNSSMREKPVPPKEPKFNCPICMGPLVEETSTRCGHIFCKACIKAAIAVQNKCPT

A0A6J1HQI7 E3 ubiquitin-protein ligase RNF4-like | e-value: 4.5e-84 | identity: 88.65%
Query:  MSTQAMRGPTARGYHRRKSLELDLNSMPPSENRDQGETSQLGFQEAQVNQQQSVQPAMIDIEAIDDDVIESSARAFAEAKNKSRRNARKNVVDVDADERS
        MST+AMRG TARGYHRRKSL+LDLNSMPP+ENRDQGETSQ GFQEAQVNQQQSVQPAMIDIEAIDDDVIESSARAFAEAKNKSRRNARKN+VDVDA+ERS
Subjt:  MSTQAMRGPTARGYHRRKSLELDLNSMPPSENRDQGETSQLGFQEAQVNQQQSVQPAMIDIEAIDDDVIESSARAFAEAKNKSRRNARKNVVDVDADERS

Query:  RVSNNNRSKRRKEVSNQRTVNVDLLINLEGSNSSMREKPVPPKEPKFNCPICMGPLVEETSTRCGHIFCKACIKAAIAVQNKCPT
        R SNNNRSKRR++V NQ+TVN DLLINLE SNSSM+EK  PPKEP F+CPICMGPLVEETSTRCGHIFCKACIK AIAVQ+KCPT
Subjt:  RVSNNNRSKRRKEVSNQRTVNVDLLINLEGSNSSMREKPVPPKEPKFNCPICMGPLVEETSTRCGHIFCKACIKAAIAVQNKCPT

SwissProt top hits (e-value, % identity, alignment)
B5DF45 TNF receptor-associated factor 6 | e-value: 1.2e-06 | identity: 43.28%
Query:  PPKEPKFNCPICMGPLVEETSTRCGHIFCKACIKAAIA-VQNKCPT-WEVLVDSIISGNLFGLRRLL
        PP E K+ CPIC+  L E   T CGH FCKACI  +I    +KCP   E+L+++ +  + F  R +L
Subjt:  PPKEPKFNCPICMGPLVEETSTRCGHIFCKACIKAAIA-VQNKCPT-WEVLVDSIISGNLFGLRRLL

B6CJY4 TNF receptor-associated factor 6 | e-value: 2.1e-06 | identity: 43.28%
Query:  PPKEPKFNCPICMGPLVEETSTRCGHIFCKAC-IKAAIAVQNKCPT-WEVLVDSIISGNLFGLRRLL
        PP E K+ CPIC+  L E   T CGH FCKAC IK+     +KCP   E+L+++ +  + F  R +L
Subjt:  PPKEPKFNCPICMGPLVEETSTRCGHIFCKAC-IKAAIAVQNKCPT-WEVLVDSIISGNLFGLRRLL

B6CJY5 TNF receptor-associated factor 6 | e-value: 2.1e-06 | identity: 43.28%
Query:  PPKEPKFNCPICMGPLVEETSTRCGHIFCKAC-IKAAIAVQNKCPT-WEVLVDSIISGNLFGLRRLL
        PP E K+ CPIC+  L E   T CGH FCKAC IK+     +KCP   E+L+++ +  + F  R +L
Subjt:  PPKEPKFNCPICMGPLVEETSTRCGHIFCKAC-IKAAIAVQNKCPT-WEVLVDSIISGNLFGLRRLL

Q3ZCC3 TNF receptor-associated factor 6 | e-value: 2.1e-06 | identity: 43.28%
Query:  PPKEPKFNCPICMGPLVEETSTRCGHIFCKAC-IKAAIAVQNKCPT-WEVLVDSIISGNLFGLRRLL
        PP E K+ CPIC+  L E   T CGH FCKAC IK+     +KCP   E+L+++ +  + F  R +L
Subjt:  PPKEPKFNCPICMGPLVEETSTRCGHIFCKAC-IKAAIAVQNKCPT-WEVLVDSIISGNLFGLRRLL

Q9Y4K3 TNF receptor-associated factor 6 | e-value: 2.1e-06 | identity: 43.28%
Query:  PPKEPKFNCPICMGPLVEETSTRCGHIFCKAC-IKAAIAVQNKCPT-WEVLVDSIISGNLFGLRRLL
        PP E K+ CPIC+  L E   T CGH FCKAC IK+     +KCP   E+L+++ +  + F  R +L
Subjt:  PPKEPKFNCPICMGPLVEETSTRCGHIFCKAC-IKAAIAVQNKCPT-WEVLVDSIISGNLFGLRRLL

Arabidopsis top hits (e-value, % identity, alignment)
AT3G07200.1 RING/U-box superfamily protein | e-value: 3.1e-21 | identity: 44.65%
Query:  SENRDQGETSQLGFQEAQVNQQQSVQPAMIDIEAI--DDDVIESSARAFAEAKNKSRRNAR-KNVVDVDADERSRVSNNNRSKRRKEVSNQRTVNVDLLI
        ++N  QG+  Q    +A    Q    P  I++ AI  DDDV+ES+A AFA+AKNKSR   R   VVDV++      +  NRS RR+  S+Q +V+    +
Subjt:  SENRDQGETSQLGFQEAQVNQQQSVQPAMIDIEAI--DDDVIESSARAFAEAKNKSRRNAR-KNVVDVDADERSRVSNNNRSKRRKEVSNQRTVNVDLLI

Query:  NLEGSNSSMREKPVPPKEPKFNCPICMGPLVEETSTRCGHIFCKACIKAAIAVQNKCPT
         L     S    P P +EPKF+CPIC+ P  +E ST+CGHIFCK CIK A+++Q KCPT
Subjt:  NLEGSNSSMREKPVPPKEPKFNCPICMGPLVEETSTRCGHIFCKACIKAAIAVQNKCPT

AT3G07200.2 RING/U-box superfamily protein | e-value: 3.1e-21 | identity: 44.65%
Query:  SENRDQGETSQLGFQEAQVNQQQSVQPAMIDIEAI--DDDVIESSARAFAEAKNKSRRNAR-KNVVDVDADERSRVSNNNRSKRRKEVSNQRTVNVDLLI
        ++N  QG+  Q    +A    Q    P  I++ AI  DDDV+ES+A AFA+AKNKSR   R   VVDV++      +  NRS RR+  S+Q +V+    +
Subjt:  SENRDQGETSQLGFQEAQVNQQQSVQPAMIDIEAI--DDDVIESSARAFAEAKNKSRRNAR-KNVVDVDADERSRVSNNNRSKRRKEVSNQRTVNVDLLI

Query:  NLEGSNSSMREKPVPPKEPKFNCPICMGPLVEETSTRCGHIFCKACIKAAIAVQNKCPT
         L     S    P P +EPKF+CPIC+ P  +E ST+CGHIFCK CIK A+++Q KCPT
Subjt:  NLEGSNSSMREKPVPPKEPKFNCPICMGPLVEETSTRCGHIFCKACIKAAIAVQNKCPT

AT5G48655.1 RING/U-box superfamily protein | e-value: 1.3e-30 | identity: 44.79%
Query:  MSTQAMRGPTARGYHRRKSLELDLNSMPPSENRDQGETSQLGFQEAQVNQQQSVQPAMIDIEAIDDDVIESSARAFAEAKNKSRRNARKN--VVDVDADE
        M+T  +R P  RG  RRK++ +DLN++P  +   +G ++ +      +   Q   P MID++AI+DDVIESSA AFAEAK+KS RNAR+   +VDV++  
Subjt:  MSTQAMRGPTARGYHRRKSLELDLNSMPPSENRDQGETSQLGFQEAQVNQQQSVQPAMIDIEAIDDDVIESSARAFAEAKNKSRRNARKN--VVDVDADE

Query:  RSRVSNNNRSKRRKEVSNQRTV-----NVDLLINLEGSNSSMREKPVPPKEPKFNCPICMGPLVEETSTRCGHIFCKACIKAAIAVQNKCPT
         +R   N  +KRR+  S++  +     +V+  +N+    S  +    PP+EPKF CPICM P  EE ST+CGHIFCK CIK AI+ Q KCPT
Subjt:  RSRVSNNNRSKRRKEVSNQRTV-----NVDLLINLEGSNSSMREKPVPPKEPKFNCPICMGPLVEETSTRCGHIFCKACIKAAIAVQNKCPT

AT5G48655.2 RING/U-box superfamily protein | e-value: 1.3e-30 | identity: 44.79%
Query:  MSTQAMRGPTARGYHRRKSLELDLNSMPPSENRDQGETSQLGFQEAQVNQQQSVQPAMIDIEAIDDDVIESSARAFAEAKNKSRRNARKN--VVDVDADE
        M+T  +R P  RG  RRK++ +DLN++P  +   +G ++ +      +   Q   P MID++AI+DDVIESSA AFAEAK+KS RNAR+   +VDV++  
Subjt:  MSTQAMRGPTARGYHRRKSLELDLNSMPPSENRDQGETSQLGFQEAQVNQQQSVQPAMIDIEAIDDDVIESSARAFAEAKNKSRRNARKN--VVDVDADE

Query:  RSRVSNNNRSKRRKEVSNQRTV-----NVDLLINLEGSNSSMREKPVPPKEPKFNCPICMGPLVEETSTRCGHIFCKACIKAAIAVQNKCPT
         +R   N  +KRR+  S++  +     +V+  +N+    S  +    PP+EPKF CPICM P  EE ST+CGHIFCK CIK AI+ Q KCPT
Subjt:  RSRVSNNNRSKRRKEVSNQRTV-----NVDLLINLEGSNSSMREKPVPPKEPKFNCPICMGPLVEETSTRCGHIFCKACIKAAIAVQNKCPT

AT5G48655.3 RING/U-box superfamily protein | e-value: 1.3e-30 | identity: 44.79%
Query:  MSTQAMRGPTARGYHRRKSLELDLNSMPPSENRDQGETSQLGFQEAQVNQQQSVQPAMIDIEAIDDDVIESSARAFAEAKNKSRRNARKN--VVDVDADE
        M+T  +R P  RG  RRK++ +DLN++P  +   +G ++ +      +   Q   P MID++AI+DDVIESSA AFAEAK+KS RNAR+   +VDV++  
Subjt:  MSTQAMRGPTARGYHRRKSLELDLNSMPPSENRDQGETSQLGFQEAQVNQQQSVQPAMIDIEAIDDDVIESSARAFAEAKNKSRRNARKN--VVDVDADE

Query:  RSRVSNNNRSKRRKEVSNQRTV-----NVDLLINLEGSNSSMREKPVPPKEPKFNCPICMGPLVEETSTRCGHIFCKACIKAAIAVQNKCPT
         +R   N  +KRR+  S++  +     +V+  +N+    S  +    PP+EPKF CPICM P  EE ST+CGHIFCK CIK AI+ Q KCPT
Subjt:  RSRVSNNNRSKRRKEVSNQRTV-----NVDLLINLEGSNSSMREKPVPPKEPKFNCPICMGPLVEETSTRCGHIFCKACIKAAIAVQNKCPT


Sequences
CDS sequence
GATAAAGAAGCTGCGGTTCCAAAATTTGGCGAATGGAACGACAGGAACGCCACGACACCGGACAACTACACGGCTATTTTCAACAGAGTGCGGGAAGAGAGGCAGGAGGC
AGCAAGAGCACCGACGCTGCCACGGTCGAATGTTAACGGCACAAGGAACCAAGATCATGAACACAATCAAAAGATGAGTACTCAGGCAATGAGGGGACCTACTGCAAGAG
GGTATCATCGCAGGAAATCGTTGGAGTTAGACCTAAACAGCATGCCGCCAAGTGAGAACCGAGATCAAGGGGAGACATCTCAATTGGGGTTTCAGGAAGCACAAGTTAAT
CAACAACAGTCAGTACAACCTGCAATGATCGACATCGAGGCCATTGATGATGATGTTATTGAATCATCTGCGAGGGCATTTGCTGAAGCTAAAAATAAATCCAGACGGAA
TGCTAGGAAGAATGTAGTTGATGTGGATGCAGATGAGCGGAGCAGGGTTTCAAATAATAACCGGAGCAAGCGTAGGAAAGAAGTGTCCAATCAACGAACTGTAAATGTTG
ATTTATTGATCAATTTGGAAGGCAGCAACAGCTCTATGAGAGAGAAGCCTGTACCACCGAAGGAGCCCAAATTCAATTGTCCAATATGCATGGGCCCATTAGTAGAAGAA
ACATCAACAAGGTGTGGTCATATTTTCTGCAAGGCATGCATCAAGGCTGCAATAGCCGTCCAGAATAAATGCCCCACCTGGGAAGTTCTGGTGGACTCCATAATTTCCGG
CAACCTCTTTGGACTCCGGCGCCTGTTGCTTAGAGGAGTCCCAATTGTTTTGATGATCATCATCACGATCGCCGTTGAGCAAGCCGTCGCAGTAGCGAGCCATGTGTTTG
AGGTCGCTGACGGTGCCCAAGAGATGCTGAACGGTGGCGGCGAACGTCGCCATGGAGATCGGCATGGCTCTGATTCTCAACCGCTTCAGAAGGTAAGCGATGTCGTAGAA
GCCATGGAAGAACGAATCCAACGAAGAAACCCCTTCCATATCACTAATCTGATCAAAATCTCTGAAATTAAACTCCCACGTCATGGCTACGTTTCCTTTGTCGTCAGCGA
GTGTGAGACCGAGTTGGAGTATGTTCAATCGGTTCACATTGAATCGGAGACAATCAAGCTAATCGATTCTAAACAACAGTTCGCTGAATCTGAGCAACCCAATATATACC
TGGAAACTCTGTATCAATGGCGACAACAGGAAACTCTGAAAGATACTCCTTCAGCACTCCAACTCTTCCTGGAGATTGTGAGACCAAACCTCCCGATTCAGACGGAAGGA
TCTCGGAGTCGGAACTATAGATGGGAAAATGATGCATTACCCGAAGAGGAGTTGGGCGGCTGGCGGCGGCGGAGAGTGGCCGGCGGACCGGAGGTGGGCGGCGATAGAGG
GATTGGGACCTGGGAAGGGGAGGACGCGCCGCTGGAGTGGTGGGTATTTAAGCTGAGAGAGAAAGTGCAAGCAAGATTCTGTATTCAGCTGGGCGGGCTGAAATGTTGGG
CTTAG
mRNA sequence
GATAAAGAAGCTGCGGTTCCAAAATTTGGCGAATGGAACGACAGGAACGCCACGACACCGGACAACTACACGGCTATTTTCAACAGAGTGCGGGAAGAGAGGCAGGAGGC
AGCAAGAGCACCGACGCTGCCACGGTCGAATGTTAACGGCACAAGGAACCAAGATCATGAACACAATCAAAAGATGAGTACTCAGGCAATGAGGGGACCTACTGCAAGAG
GGTATCATCGCAGGAAATCGTTGGAGTTAGACCTAAACAGCATGCCGCCAAGTGAGAACCGAGATCAAGGGGAGACATCTCAATTGGGGTTTCAGGAAGCACAAGTTAAT
CAACAACAGTCAGTACAACCTGCAATGATCGACATCGAGGCCATTGATGATGATGTTATTGAATCATCTGCGAGGGCATTTGCTGAAGCTAAAAATAAATCCAGACGGAA
TGCTAGGAAGAATGTAGTTGATGTGGATGCAGATGAGCGGAGCAGGGTTTCAAATAATAACCGGAGCAAGCGTAGGAAAGAAGTGTCCAATCAACGAACTGTAAATGTTG
ATTTATTGATCAATTTGGAAGGCAGCAACAGCTCTATGAGAGAGAAGCCTGTACCACCGAAGGAGCCCAAATTCAATTGTCCAATATGCATGGGCCCATTAGTAGAAGAA
ACATCAACAAGGTGTGGTCATATTTTCTGCAAGGCATGCATCAAGGCTGCAATAGCCGTCCAGAATAAATGCCCCACCTGGGAAGTTCTGGTGGACTCCATAATTTCCGG
CAACCTCTTTGGACTCCGGCGCCTGTTGCTTAGAGGAGTCCCAATTGTTTTGATGATCATCATCACGATCGCCGTTGAGCAAGCCGTCGCAGTAGCGAGCCATGTGTTTG
AGGTCGCTGACGGTGCCCAAGAGATGCTGAACGGTGGCGGCGAACGTCGCCATGGAGATCGGCATGGCTCTGATTCTCAACCGCTTCAGAAGGTAAGCGATGTCGTAGAA
GCCATGGAAGAACGAATCCAACGAAGAAACCCCTTCCATATCACTAATCTGATCAAAATCTCTGAAATTAAACTCCCACGTCATGGCTACGTTTCCTTTGTCGTCAGCGA
GTGTGAGACCGAGTTGGAGTATGTTCAATCGGTTCACATTGAATCGGAGACAATCAAGCTAATCGATTCTAAACAACAGTTCGCTGAATCTGAGCAACCCAATATATACC
TGGAAACTCTGTATCAATGGCGACAACAGGAAACTCTGAAAGATACTCCTTCAGCACTCCAACTCTTCCTGGAGATTGTGAGACCAAACCTCCCGATTCAGACGGAAGGA
TCTCGGAGTCGGAACTATAGATGGGAAAATGATGCATTACCCGAAGAGGAGTTGGGCGGCTGGCGGCGGCGGAGAGTGGCCGGCGGACCGGAGGTGGGCGGCGATAGAGG
GATTGGGACCTGGGAAGGGGAGGACGCGCCGCTGGAGTGGTGGGTATTTAAGCTGAGAGAGAAAGTGCAAGCAAGATTCTGTATTCAGCTGGGCGGGCTGAAATGTTGGG
CTTAG
Protein sequence
DKEAAVPKFGEWNDRNATTPDNYTAIFNRVREERQEAARAPTLPRSNVNGTRNQDHEHNQKMSTQAMRGPTARGYHRRKSLELDLNSMPPSENRDQGETSQLGFQEAQVN
QQQSVQPAMIDIEAIDDDVIESSARAFAEAKNKSRRNARKNVVDVDADERSRVSNNNRSKRRKEVSNQRTVNVDLLINLEGSNSSMREKPVPPKEPKFNCPICMGPLVEE
TSTRCGHIFCKACIKAAIAVQNKCPTWEVLVDSIISGNLFGLRRLLLRGVPIVLMIIITIAVEQAVAVASHVFEVADGAQEMLNGGGERRHGDRHGSDSQPLQKVSDVVE
AMEERIQRRNPFHITNLIKISEIKLPRHGYVSFVVSECETELEYVQSVHIESETIKLIDSKQQFAESEQPNIYLETLYQWRQQETLKDTPSALQLFLEIVRPNLPIQTEG
SRSRNYRWENDALPEEELGGWRRRRVAGGPEVGGDRGIGTWEGEDAPLEWWVFKLREKVQARFCIQLGGLKCWA
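
The protein sequence above can be checked against the CDS by translating it codon by codon. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) and reading frame 1. `translate` is a hypothetical helper written for illustration, and the two short inputs are the first and last codons of the CDS listed in this record.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate a DNA coding sequence in frame 1.

    Codons containing characters other than T, C, A, G become 'X'.
    With to_stop=True, translation ends at the first stop codon,
    which is not included in the result.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First 14 codons and last 4 codons of the CDS in this record.
print(translate("GATAAAGAAGCTGCGGTTCCAAAATTTGGCGAATGGAACGAC"))
print(translate("TGTTGGGCTTAG"))
```

The two fragments translate to DKEAAVPKFGEWND and CWA, matching the start and end of the listed protein. Note that the listed CDS begins with GAT (Asp) rather than an ATG start codon, so the protein sequence does not start with methionine.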