; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc04G12950 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc04G12950
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionUPF0426 protein At1g28150, chloroplastic
Genome locationClcChr04:26238807..26245605
RNA-Seq ExpressionClc04G12950
SyntenyClc04G12950
Gene Ontology termsGO:0043086 - negative regulation of catalytic activity (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0030599 - pectinesterase activity (molecular function)
GO:0046910 - pectinesterase inhibitor activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0036066.1 Ankyrin repeat domain-containing protein 50 isoform 1 [Cucumis melo var. makuwa]5.4e-5759.26Show/hide
Query:  MAGKEVREYTNLSDPKDKKWGKGKDKIDDEDITFQRMVAKFIELRFLRIGLSILKMQEVAGERGGYLHGRGGNPFFLCLCMSLCVHPLTILNISFGFLTL
        MAGKEVREYTNLSDPKDKKWGKGKDKIDDEDITFQRMVA               KMQEVAGERGGYLHGRG                      S   L L
Subjt:  MAGKEVREYTNLSDPKDKKWGKGKDKIDDEDITFQRMVAKFIELRFLRIGLSILKMQEVAGERGGYLHGRGGNPFFLCLCMSLCVHPLTILNISFGFLTL

Query:  TRVWSLQPWTVMICSILKSRWKQRRMQNASLDALRNEPLQHSRYPFPASLADSAPASVPLALRVEPKPKSGIRQQDLLKRVVEVKPKRPRVSNQADLSNP
              +     +          RR +  +  A +     +  +     LADSAPASVPLALRVEPKPKSGIRQQDLLKRVVEVKPKRPRVSN +DLSNP
Subjt:  TRVWSLQPWTVMICSILKSRWKQRRMQNASLDALRNEPLQHSRYPFPASLADSAPASVPLALRVEPKPKSGIRQQDLLKRVVEVKPKRPRVSNQADLSNP

Query:  KHDIDNDKVKENPLLKTNKTEGETEAENPVKSLLETAPLASSE
        KHD +NDKVKENPL+KTNKTEGETEAENPVKSLL  A  +S +
Subjt:  KHDIDNDKVKENPLLKTNKTEGETEAENPVKSLLETAPLASSE

TYJ98873.1 Ankyrin repeat domain-containing protein 50 isoform 1 [Cucumis melo var. makuwa]1.0e-7165.88Show/hide
Query:  MAGKEVREYTNLSDPKDKKWGKGKDKIDDEDITFQRMVAK-------FIELRFLRIGLSILKMQEVAGERGGYLHGRGGNPFFLCLCMSLCVHPLTILNI
        MAGKEVREYTNLSDPKDKKWGKGKDKIDDEDITFQRMVAK       F +L+ L +G S                       +     S C      + +
Subjt:  MAGKEVREYTNLSDPKDKKWGKGKDKIDDEDITFQRMVAK-------FIELRFLRIGLSILKMQEVAGERGGYLHGRGGNPFFLCLCMSLCVHPLTILNI

Query:  SFGFLTLTRVWSLQPWTVMICSILKSRWKQRRMQNASLDALRNEPLQHSRYPF-----PASLADSAPASVPLALRVEPKPKSGIRQQDLLKRVVEVKPKR
        +F            PWTVMICSIL+SRWK+RRMQ+ASLDALRNEPLQHSRYPF      ASLADSAPASVPLALRVEPKPKSGIRQQDLLKRVVEVKPKR
Subjt:  SFGFLTLTRVWSLQPWTVMICSILKSRWKQRRMQNASLDALRNEPLQHSRYPF-----PASLADSAPASVPLALRVEPKPKSGIRQQDLLKRVVEVKPKR

Query:  PRVSNQADLSNPKHDIDNDKVKENPLLKTNKTEGETEAENPVKSLLETAPLASSE
        PRVSN +DLSNPKHD +NDKVKENPL+KTNKTEGETEAENPVKSLL  A  +S +
Subjt:  PRVSNQADLSNPKHDIDNDKVKENPLLKTNKTEGETEAENPVKSLLETAPLASSE

XP_004137399.1 uncharacterized protein LOC101214943 [Cucumis sativus]5.8e-5959.67Show/hide
Query:  MAGKEVREYTNLSDPKDKKWGKGKDKIDDEDITFQRMVAKFIELRFLRIGLSILKMQEVAGERGGYLHGRGGNPFFLCLCMSLCVHPLTILNISFGFLTL
        MAGKEVREYTNLSDPKDKKWGKGKDKIDDEDITFQRMVA               KMQEVAGERGGYLHGRG                             
Subjt:  MAGKEVREYTNLSDPKDKKWGKGKDKIDDEDITFQRMVAKFIELRFLRIGLSILKMQEVAGERGGYLHGRGGNPFFLCLCMSLCVHPLTILNISFGFLTL

Query:  TRVWSLQPWTVMICSILKSRWKQRRMQNASLDALRNEPLQHSRYPFPASLADSAPASVPLALRVEPKPKSGIRQQDLLKRVVEVKPKRPRVSNQADLSNP
                  +    +L  + +    ++A     R E    + +   ASLADSAPASVPLALRVEPKPKSGIRQQDLLKRVVEVKPKRPRVSNQADLSNP
Subjt:  TRVWSLQPWTVMICSILKSRWKQRRMQNASLDALRNEPLQHSRYPFPASLADSAPASVPLALRVEPKPKSGIRQQDLLKRVVEVKPKRPRVSNQADLSNP

Query:  KHDIDNDKVKENPLLKTNKTEGETEAENPVKSLLETAPLASSE
        KHD DN+KVKE PL+KTNKTEGETEAENPVKSLL  A  +S +
Subjt:  KHDIDNDKVKENPLLKTNKTEGETEAENPVKSLLETAPLASSE

XP_008439948.1 PREDICTED: uncharacterized protein LOC103484557 [Cucumis melo]2.2e-5859.26Show/hide
Query:  MAGKEVREYTNLSDPKDKKWGKGKDKIDDEDITFQRMVAKFIELRFLRIGLSILKMQEVAGERGGYLHGRGGNPFFLCLCMSLCVHPLTILNISFGFLTL
        MAGKEVREYTNLSDPKDKKWGKGKDKIDDEDITFQRMVA               KMQEVAGERGGYLHGRG                             
Subjt:  MAGKEVREYTNLSDPKDKKWGKGKDKIDDEDITFQRMVAKFIELRFLRIGLSILKMQEVAGERGGYLHGRGGNPFFLCLCMSLCVHPLTILNISFGFLTL

Query:  TRVWSLQPWTVMICSILKSRWKQRRMQNASLDALRNEPLQHSRYPFPASLADSAPASVPLALRVEPKPKSGIRQQDLLKRVVEVKPKRPRVSNQADLSNP
                  +    +L  + +    ++A     R E    + +   ASLADSAPASVPLALRVEPKPKSGIRQQDLLKRVVEVKPKRPRVSN +DLSNP
Subjt:  TRVWSLQPWTVMICSILKSRWKQRRMQNASLDALRNEPLQHSRYPFPASLADSAPASVPLALRVEPKPKSGIRQQDLLKRVVEVKPKRPRVSNQADLSNP

Query:  KHDIDNDKVKENPLLKTNKTEGETEAENPVKSLLETAPLASSE
        KHD +NDKVKENPL+KTNKTEGETEAENPVKSLL  A  +S +
Subjt:  KHDIDNDKVKENPLLKTNKTEGETEAENPVKSLLETAPLASSE

XP_038895509.1 uncharacterized protein LOC120083727 [Benincasa hispida]1.3e-5859.26Show/hide
Query:  MAGKEVREYTNLSDPKDKKWGKGKDKIDDEDITFQRMVAKFIELRFLRIGLSILKMQEVAGERGGYLHGRGGNPFFLCLCMSLCVHPLTILNISFGFLTL
        MAGKEVREYTNLSDPKDKKWGKGKDKIDDEDITFQRMVA               KMQEVAGERGGYLHGRG                             
Subjt:  MAGKEVREYTNLSDPKDKKWGKGKDKIDDEDITFQRMVAKFIELRFLRIGLSILKMQEVAGERGGYLHGRGGNPFFLCLCMSLCVHPLTILNISFGFLTL

Query:  TRVWSLQPWTVMICSILKSRWKQRRMQNASLDALRNEPLQHSRYPFPASLADSAPASVPLALRVEPKPKSGIRQQDLLKRVVEVKPKRPRVSNQADLSNP
                  +    +L  + +    ++A     R E    + +   ASLADSAPASVPLALRVEPKPKSGIRQQDLLKRVVEVKPKRPRVSNQADLSNP
Subjt:  TRVWSLQPWTVMICSILKSRWKQRRMQNASLDALRNEPLQHSRYPFPASLADSAPASVPLALRVEPKPKSGIRQQDLLKRVVEVKPKRPRVSNQADLSNP

Query:  KHDIDNDKVKENPLLKTNKTEGETEAENPVKSLLETAPLASSE
        KHDID++KVKE+P +KTNKTEGETEAENPVKSLL  A  +S +
Subjt:  KHDIDNDKVKENPLLKTNKTEGETEAENPVKSLLETAPLASSE

TrEMBL top hitse value%identityAlignment
A0A0A0LQC3 Uncharacterized protein2.8e-5959.67Show/hide
Query:  MAGKEVREYTNLSDPKDKKWGKGKDKIDDEDITFQRMVAKFIELRFLRIGLSILKMQEVAGERGGYLHGRGGNPFFLCLCMSLCVHPLTILNISFGFLTL
        MAGKEVREYTNLSDPKDKKWGKGKDKIDDEDITFQRMVA               KMQEVAGERGGYLHGRG                             
Subjt:  MAGKEVREYTNLSDPKDKKWGKGKDKIDDEDITFQRMVAKFIELRFLRIGLSILKMQEVAGERGGYLHGRGGNPFFLCLCMSLCVHPLTILNISFGFLTL

Query:  TRVWSLQPWTVMICSILKSRWKQRRMQNASLDALRNEPLQHSRYPFPASLADSAPASVPLALRVEPKPKSGIRQQDLLKRVVEVKPKRPRVSNQADLSNP
                  +    +L  + +    ++A     R E    + +   ASLADSAPASVPLALRVEPKPKSGIRQQDLLKRVVEVKPKRPRVSNQADLSNP
Subjt:  TRVWSLQPWTVMICSILKSRWKQRRMQNASLDALRNEPLQHSRYPFPASLADSAPASVPLALRVEPKPKSGIRQQDLLKRVVEVKPKRPRVSNQADLSNP

Query:  KHDIDNDKVKENPLLKTNKTEGETEAENPVKSLLETAPLASSE
        KHD DN+KVKE PL+KTNKTEGETEAENPVKSLL  A  +S +
Subjt:  KHDIDNDKVKENPLLKTNKTEGETEAENPVKSLLETAPLASSE

A0A1S3B0L6 uncharacterized protein LOC1034845571.1e-5859.26Show/hide
Query:  MAGKEVREYTNLSDPKDKKWGKGKDKIDDEDITFQRMVAKFIELRFLRIGLSILKMQEVAGERGGYLHGRGGNPFFLCLCMSLCVHPLTILNISFGFLTL
        MAGKEVREYTNLSDPKDKKWGKGKDKIDDEDITFQRMVA               KMQEVAGERGGYLHGRG                             
Subjt:  MAGKEVREYTNLSDPKDKKWGKGKDKIDDEDITFQRMVAKFIELRFLRIGLSILKMQEVAGERGGYLHGRGGNPFFLCLCMSLCVHPLTILNISFGFLTL

Query:  TRVWSLQPWTVMICSILKSRWKQRRMQNASLDALRNEPLQHSRYPFPASLADSAPASVPLALRVEPKPKSGIRQQDLLKRVVEVKPKRPRVSNQADLSNP
                  +    +L  + +    ++A     R E    + +   ASLADSAPASVPLALRVEPKPKSGIRQQDLLKRVVEVKPKRPRVSN +DLSNP
Subjt:  TRVWSLQPWTVMICSILKSRWKQRRMQNASLDALRNEPLQHSRYPFPASLADSAPASVPLALRVEPKPKSGIRQQDLLKRVVEVKPKRPRVSNQADLSNP

Query:  KHDIDNDKVKENPLLKTNKTEGETEAENPVKSLLETAPLASSE
        KHD +NDKVKENPL+KTNKTEGETEAENPVKSLL  A  +S +
Subjt:  KHDIDNDKVKENPLLKTNKTEGETEAENPVKSLLETAPLASSE

A0A5A7T1G4 Ankyrin repeat domain-containing protein 50 isoform 12.6e-5759.26Show/hide
Query:  MAGKEVREYTNLSDPKDKKWGKGKDKIDDEDITFQRMVAKFIELRFLRIGLSILKMQEVAGERGGYLHGRGGNPFFLCLCMSLCVHPLTILNISFGFLTL
        MAGKEVREYTNLSDPKDKKWGKGKDKIDDEDITFQRMVA               KMQEVAGERGGYLHGRG                      S   L L
Subjt:  MAGKEVREYTNLSDPKDKKWGKGKDKIDDEDITFQRMVAKFIELRFLRIGLSILKMQEVAGERGGYLHGRGGNPFFLCLCMSLCVHPLTILNISFGFLTL

Query:  TRVWSLQPWTVMICSILKSRWKQRRMQNASLDALRNEPLQHSRYPFPASLADSAPASVPLALRVEPKPKSGIRQQDLLKRVVEVKPKRPRVSNQADLSNP
              +     +          RR +  +  A +     +  +     LADSAPASVPLALRVEPKPKSGIRQQDLLKRVVEVKPKRPRVSN +DLSNP
Subjt:  TRVWSLQPWTVMICSILKSRWKQRRMQNASLDALRNEPLQHSRYPFPASLADSAPASVPLALRVEPKPKSGIRQQDLLKRVVEVKPKRPRVSNQADLSNP

Query:  KHDIDNDKVKENPLLKTNKTEGETEAENPVKSLLETAPLASSE
        KHD +NDKVKENPL+KTNKTEGETEAENPVKSLL  A  +S +
Subjt:  KHDIDNDKVKENPLLKTNKTEGETEAENPVKSLLETAPLASSE

A0A5D3BGH6 UPF0426 protein7.1e-5588.28Show/hide
Query:  MAALILHSSSATSVLFGEMGKLKAQNHHSILSSNKPMRGNYLRIGCNKSTNGISAFFFNPVEDPVIKEALKEPVAFAGGLFAGLLRLDLNEDPLKEWVKK
        MAAL+LHSSS  SVLFGEMGKLKA++HHSILSS KPMR ++LR+G N STNGISAFFFNPVEDPV+KEALKEPVAFAGGLFAGLLRLDLNEDPLKEWVKK
Subjt:  MAALILHSSSATSVLFGEMGKLKAQNHHSILSSNKPMRGNYLRIGCNKSTNGISAFFFNPVEDPVIKEALKEPVAFAGGLFAGLLRLDLNEDPLKEWVKK

Query:  TMESAGLTEEVDTQESVQEDGPTEIEIE
        T+ESAGLTEEVDTQ SVQEDGPTEIEIE
Subjt:  TMESAGLTEEVDTQESVQEDGPTEIEIE

A0A5D3BHW1 Ankyrin repeat domain-containing protein 50 isoform 14.9e-7265.88Show/hide
Query:  MAGKEVREYTNLSDPKDKKWGKGKDKIDDEDITFQRMVAK-------FIELRFLRIGLSILKMQEVAGERGGYLHGRGGNPFFLCLCMSLCVHPLTILNI
        MAGKEVREYTNLSDPKDKKWGKGKDKIDDEDITFQRMVAK       F +L+ L +G S                       +     S C      + +
Subjt:  MAGKEVREYTNLSDPKDKKWGKGKDKIDDEDITFQRMVAK-------FIELRFLRIGLSILKMQEVAGERGGYLHGRGGNPFFLCLCMSLCVHPLTILNI

Query:  SFGFLTLTRVWSLQPWTVMICSILKSRWKQRRMQNASLDALRNEPLQHSRYPF-----PASLADSAPASVPLALRVEPKPKSGIRQQDLLKRVVEVKPKR
        +F            PWTVMICSIL+SRWK+RRMQ+ASLDALRNEPLQHSRYPF      ASLADSAPASVPLALRVEPKPKSGIRQQDLLKRVVEVKPKR
Subjt:  SFGFLTLTRVWSLQPWTVMICSILKSRWKQRRMQNASLDALRNEPLQHSRYPF-----PASLADSAPASVPLALRVEPKPKSGIRQQDLLKRVVEVKPKR

Query:  PRVSNQADLSNPKHDIDNDKVKENPLLKTNKTEGETEAENPVKSLLETAPLASSE
        PRVSN +DLSNPKHD +NDKVKENPL+KTNKTEGETEAENPVKSLL  A  +S +
Subjt:  PRVSNQADLSNPKHDIDNDKVKENPLLKTNKTEGETEAENPVKSLLETAPLASSE

SwissProt top hitse value%identityAlignment
P74786 UPF0426 protein ssl02943.4e-0640.3Show/hide
Query:  PVIKEALKEPVAFAGGLFAGLLRLDLNEDPLKEWVKKTMESAGLTE--EVDTQESVQEDGPTEIEIE
        PV +E  ++P+AFAGG  +G+LRL L +DPLK+W++K     G+T+    D  ++ Q++ P  I+I+
Subjt:  PVIKEALKEPVAFAGGLFAGLLRLDLNEDPLKEWVKKTMESAGLTE--EVDTQESVQEDGPTEIEIE

Q9FZ89 UPF0426 protein At1g28150, chloroplastic8.7e-1846.43Show/hide
Query:  GKLKAQNHHSILSSNKPMRGNYLRIGCNKSTNGISAFFFNPVEDPVIKEALKEPVAFAGGLFAGLLRLDLNEDPLKEWVKKTMESAGLTEE---VDTQES
        G + +Q HH   S+   ++ N L     + T    +  FN  ++P++ EALKEP+AF GG+FAGLLRLDLNE+PLK+WV +T+E++G+TEE    D   S
Subjt:  GKLKAQNHHSILSSNKPMRGNYLRIGCNKSTNGISAFFFNPVEDPVIKEALKEPVAFAGGLFAGLLRLDLNEDPLKEWVKKTMESAGLTEE---VDTQES

Query:  VQEDGPTEIEIE
          ED P +IEIE
Subjt:  VQEDGPTEIEIE

Arabidopsis top hitse value%identityAlignment
AT1G28150.1 unknown protein6.2e-1946.43Show/hide
Query:  GKLKAQNHHSILSSNKPMRGNYLRIGCNKSTNGISAFFFNPVEDPVIKEALKEPVAFAGGLFAGLLRLDLNEDPLKEWVKKTMESAGLTEE---VDTQES
        G + +Q HH   S+   ++ N L     + T    +  FN  ++P++ EALKEP+AF GG+FAGLLRLDLNE+PLK+WV +T+E++G+TEE    D   S
Subjt:  GKLKAQNHHSILSSNKPMRGNYLRIGCNKSTNGISAFFFNPVEDPVIKEALKEPVAFAGGLFAGLLRLDLNEDPLKEWVKKTMESAGLTEE---VDTQES

Query:  VQEDGPTEIEIE
          ED P +IEIE
Subjt:  VQEDGPTEIEIE

AT1G73350.1 unknown protein8.9e-3441.06Show/hide
Query:  MAGKEVREYTNLSDPKDKKWGKGKDKIDDEDITFQRMVAKFIELRFLRIGLSILKMQEVAGERGGYLHGRGGNPFFLCLCMSLCVHPLTILNISFGFLTL
        MAG+EV+EYTNLSDPKDKK GKG  KIDDED+TFQRMVA               KMQEVAGERGGYLHGRG +   L L                     
Subjt:  MAGKEVREYTNLSDPKDKKWGKGKDKIDDEDITFQRMVAKFIELRFLRIGLSILKMQEVAGERGGYLHGRGGNPFFLCLCMSLCVHPLTILNISFGFLTL

Query:  TRVWSLQPWTVMICSILKSRWKQRRMQNASLDALRNEPLQHSRYPFPASLADSAPASVPLALRVEPKPKSGIRQQDLLKRVVEVKPKRPRVSN-------
                           + +    ++A     R E    + +   A+ ADS+PAS+PL LRVEPKPKSGIRQQDLL++VVEVKPKRP++S        
Subjt:  TRVWSLQPWTVMICSILKSRWKQRRMQNASLDALRNEPLQHSRYPFPASLADSAPASVPLALRVEPKPKSGIRQQDLLKRVVEVKPKRPRVSN-------

Query:  ---QADLSNPKHDIDNDKVKEN--PLLKT--------NKTEGETEAENPVKSLLETAPLASSE
           ++D    +  +  DK KE   P+LK           TE   + +N  K LL  A  +S E
Subjt:  ---QADLSNPKHDIDNDKVKEN--PLLKT--------NKTEGETEAENPVKSLLETAPLASSE

AT1G73350.2 unknown protein4.4e-3340.68Show/hide
Query:  MAGKEVREYTNLSDPKDKKWGKGKDKIDDEDITFQRMVAKFIELRFLRIGLSILKMQEVAGERGGYLHGRGGNPFFLCLCMSLCVHPLTILNISFGFLTL
        MAG+EV+EYTNLSDPKDKK GKG  KIDDED+TFQRMVA               KMQEVAGERGGYLHGRG                             
Subjt:  MAGKEVREYTNLSDPKDKKWGKGKDKIDDEDITFQRMVAKFIELRFLRIGLSILKMQEVAGERGGYLHGRGGNPFFLCLCMSLCVHPLTILNISFGFLTL

Query:  TRVWSLQPWTVMICSILKSRWKQRRMQNASLDALRNEPLQHSRYPFPASLADSAPASVPLALRVEPKPKSGIRQQDLLKRVVEVKPKRPRVSN-------
                  +    +L  + +    ++A     R E    + +   A+ ADS+PAS+PL LRVEPKPKSGIRQQDLL++VVEVKPKRP++S        
Subjt:  TRVWSLQPWTVMICSILKSRWKQRRMQNASLDALRNEPLQHSRYPFPASLADSAPASVPLALRVEPKPKSGIRQQDLLKRVVEVKPKRPRVSN-------

Query:  ---QADLSNPKHDIDNDKVKEN--PLLKT--------NKTEGETEAENPVKSLLETAPLASSE
           ++D    +  +  DK KE   P+LK           TE   + +N  K LL  A  +S E
Subjt:  ---QADLSNPKHDIDNDKVKEN--PLLKT--------NKTEGETEAENPVKSLLETAPLASSE

AT1G73350.3 unknown protein7.6e-1735.1Show/hide
Query:  MQEVAGERGGYLHGRGGNPFFLCLCMSLCVHPLTILNISFGFLTLTRVWSLQPWTVMICSILKSRWKQRRMQNASLDALRNEPLQHSRYPFPASLADSAP
        MQEVAGERGGYLHGRG                                       +    +L  + +    ++A     R E    + +   A+ ADS+P
Subjt:  MQEVAGERGGYLHGRGGNPFFLCLCMSLCVHPLTILNISFGFLTLTRVWSLQPWTVMICSILKSRWKQRRMQNASLDALRNEPLQHSRYPFPASLADSAP

Query:  ASVPLALRVEPKPKSGIRQQDLLKRVVEVKPKRPRVSN----------QADLSNPKHDIDNDKVKEN--PLLKT--------NKTEGETEAENPVKSLLE
        AS+PL LRVEPKPKSGIRQQDLL++VVEVKPKRP++S           ++D    +  +  DK KE   P+LK           TE   + +N  K LL 
Subjt:  ASVPLALRVEPKPKSGIRQQDLLKRVVEVKPKRPRVSN----------QADLSNPKHDIDNDKVKEN--PLLKT--------NKTEGETEAENPVKSLLE

Query:  TAPLASSE
         A  +S E
Subjt:  TAPLASSE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTGGGAAAGAAGTTCGCGAATACACCAATCTCTCCGACCCAAAAGACAAGAAGTGGGGTAAAGGGAAGGATAAGATAGACGATGAAGATATTACGTTTCAACGGAT
GGTTGCGAAGTTTATTGAGTTAAGATTTCTCCGAATTGGACTGAGCATTCTCAAGATGCAGGAGGTTGCAGGAGAACGTGGAGGTTACCTTCATGGACGAGGCGGTAACC
CTTTCTTTCTCTGTCTCTGCATGTCTTTGTGTGTGCATCCCTTAACAATTCTTAATATTAGTTTTGGATTCTTAACGTTGACTCGAGTTTGGAGTTTGCAGCCTTGGACA
GTGATGATTTGCTCTATCTTAAAGAGCAGATGGAAGCAGAGGAGGATGCAGAACGCCTCCTTAGACGCACTGAGAAACGAGCCTTTGCAGCATTCAAGATATCCTTTTCC
TGCTAGTTTGGCTGATTCTGCACCTGCATCTGTTCCCTTGGCGCTTCGTGTTGAGCCAAAGCCAAAGAGTGGAATCAGACAGCAAGATCTATTGAAAAGAGTAGTGGAAG
TCAAACCCAAAAGGCCCAGAGTTTCAAATCAAGCAGATTTATCAAATCCCAAGCATGACATTGACAATGACAAGGTGAAAGAGAACCCCTTGCTGAAAACAAACAAAACT
GAAGGCGAAACAGAAGCTGAAAATCCTGTGAAGAGTTTACTAGAAACAGCCCCATTAGCGTCATCAGAGAGATTACCCATGGCCGCTCTTATTCTTCACTCCTCTTCGGC
TACGAGCGTTCTGTTTGGAGAAATGGGGAAATTGAAGGCACAGAATCATCATTCGATTTTATCATCCAACAAACCTATGCGCGGTAATTATCTTCGTATCGGCTGCAATA
AATCTACAAATGGCATCTCCGCATTTTTCTTCAATCCTGTTGAGGATCCCGTTATCAAAGAAGCTCTTAAGGAGCCAGTTGCCTTCGCCGGTGGGCTATTTGCTGGACTT
CTAAGGCTTGATTTGAACGAAGATCCATTAAAGGAATGGGTTAAGAAGACAATGGAATCAGCTGGACTCACGGAAGAGGTTGATACTCAAGAATCTGTACAAGAGGATGG
TCCAACAGAGATCGAGATTGAGTGA
mRNA sequenceShow/hide mRNA sequence
CAAACGCAATCGCGTGATTTTTGCTTTCTGCTTGTTGGTTATCGCCGGTCAATCTCTCTAGGTGTGAAGGAAGAGGGAGAGAGGGAGCCTGGCAAAGGGGTTTCCTCTGG
ATTTCTAGTTCGAAGGGAAGCAATCAAGGTTCATAGCAATGGCTGGGAAAGAAGTTCGCGAATACACCAATCTCTCCGACCCAAAAGACAAGAAGTGGGGTAAAGGGAAG
GATAAGATAGACGATGAAGATATTACGTTTCAACGGATGGTTGCGAAGTTTATTGAGTTAAGATTTCTCCGAATTGGACTGAGCATTCTCAAGATGCAGGAGGTTGCAGG
AGAACGTGGAGGTTACCTTCATGGACGAGGCGGTAACCCTTTCTTTCTCTGTCTCTGCATGTCTTTGTGTGTGCATCCCTTAACAATTCTTAATATTAGTTTTGGATTCT
TAACGTTGACTCGAGTTTGGAGTTTGCAGCCTTGGACAGTGATGATTTGCTCTATCTTAAAGAGCAGATGGAAGCAGAGGAGGATGCAGAACGCCTCCTTAGACGCACTG
AGAAACGAGCCTTTGCAGCATTCAAGATATCCTTTTCCTGCTAGTTTGGCTGATTCTGCACCTGCATCTGTTCCCTTGGCGCTTCGTGTTGAGCCAAAGCCAAAGAGTGG
AATCAGACAGCAAGATCTATTGAAAAGAGTAGTGGAAGTCAAACCCAAAAGGCCCAGAGTTTCAAATCAAGCAGATTTATCAAATCCCAAGCATGACATTGACAATGACA
AGGTGAAAGAGAACCCCTTGCTGAAAACAAACAAAACTGAAGGCGAAACAGAAGCTGAAAATCCTGTGAAGAGTTTACTAGAAACAGCCCCATTAGCGTCATCAGAGAGA
TTACCCATGGCCGCTCTTATTCTTCACTCCTCTTCGGCTACGAGCGTTCTGTTTGGAGAAATGGGGAAATTGAAGGCACAGAATCATCATTCGATTTTATCATCCAACAA
ACCTATGCGCGGTAATTATCTTCGTATCGGCTGCAATAAATCTACAAATGGCATCTCCGCATTTTTCTTCAATCCTGTTGAGGATCCCGTTATCAAAGAAGCTCTTAAGG
AGCCAGTTGCCTTCGCCGGTGGGCTATTTGCTGGACTTCTAAGGCTTGATTTGAACGAAGATCCATTAAAGGAATGGGTTAAGAAGACAATGGAATCAGCTGGACTCACG
GAAGAGGTTGATACTCAAGAATCTGTACAAGAGGATGGTCCAACAGAGATCGAGATTGAGTGATCCCATGAATGGACCTAACCTCGTATAGTTCTTATTTATTACAATAA
TATATAAACTGGAAAAGTGGAAATAGAATGATATTTGATTTGATTTAATATTACTACTGCTCCCTTGATTTATAACAAGAAGTGAATCCGAATTAGGAGTTAACCATTTT
TCATAATCCAGATACTTCTACTTCTTTTTTTTTTCCTTCTTTTCCCTCTTGATTTAGGAGGAAACACTTCATTAGTGTTTTGAATTAATTAATTGTTCCAGGGTGAATTA
ATATCGTTTTATGATAATTTATTTAATTAGTTAGTTCCATACTAAATTTAATATTGATTTATATCCAG
Protein sequenceShow/hide protein sequence
MAGKEVREYTNLSDPKDKKWGKGKDKIDDEDITFQRMVAKFIELRFLRIGLSILKMQEVAGERGGYLHGRGGNPFFLCLCMSLCVHPLTILNISFGFLTLTRVWSLQPWT
VMICSILKSRWKQRRMQNASLDALRNEPLQHSRYPFPASLADSAPASVPLALRVEPKPKSGIRQQDLLKRVVEVKPKRPRVSNQADLSNPKHDIDNDKVKENPLLKTNKT
EGETEAENPVKSLLETAPLASSERLPMAALILHSSSATSVLFGEMGKLKAQNHHSILSSNKPMRGNYLRIGCNKSTNGISAFFFNPVEDPVIKEALKEPVAFAGGLFAGL
LRLDLNEDPLKEWVKKTMESAGLTEEVDTQESVQEDGPTEIEIE