; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0005087 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0005087
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionGag/pol protein
Genome locationchr04:19871012..19873674
RNA-Seq ExpressionPI0005087
SyntenyPI0005087
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0041264.1 hypothetical protein E6C27_scaffold128G002490 [Cucumis melo var. makuwa]1.3e-3444.16Show/hide
Query:  IRLGVVEKFYTAKLNAEEFSVQISGKTVSFNAEAINALYDLPNNVETQGQLYVDNPTKKMAREALEVIAWPGAAWEVTPTGKYQLYPHQLTTEASVWLFF
        IR  VV  FY A +N EE   ++  K V F  +AINALY L NN    G L  +NP  +  ++ALE I WPG  W+  PT KYQL+P+ L TE SVWL F
Subjt:  IRLGVVEKFYTAKLNAEEFSVQISGKTVSFNAEAINALYDLPNNVETQGQLYVDNPTKKMAREALEVIAWPGAAWEVTPTGKYQLYPHQLTTEASVWLFF

Query:  IKKKIFPKRHDSTINLESAMLLYCILAKKRVNLGELIATSILAWMQAPKGVMPFPSTVEALCLKAVPFLSAIQTISIPGGLCNQMALNRMITFHGHK
        IKK I P RHDSTI++E  MLLY      + N  E+    ++AW++ P G  PF    + L +KA P L   Q + +  G+C    L+R IT H +K
Subjt:  IKKKIFPKRHDSTINLESAMLLYCILAKKRVNLGELIATSILAWMQAPKGVMPFPSTVEALCLKAVPFLSAIQTISIPGGLCNQMALNRMITFHGHK

KAA0048500.1 protein MNN4-like [Cucumis melo var. makuwa]7.8e-4337Show/hide
Query:  IRSPLNEVAELHKKISDKLAQVSFAKTRKTIEAVKLPAKVRALEPERNLEAIAEEFEEELEAMSPLDD---GPSPRKSREVAGPS----RGRKKLGRSGP
        + S   +VA   ++ ++KL +    K +   + VK  A+ +    E+  +  ++EFE+ELE +SPL+D      P+K R + G        + K  +   
Subjt:  IRSPLNEVAELHKKISDKLAQVSFAKTRKTIEAVKLPAKVRALEPERNLEAIAEEFEEELEAMSPLDD---GPSPRKSREVAGPS----RGRKKLGRSGP

Query:  EERLSGSDTISKTPSINSLIKVEKGLFSFNGQLPDFLYAPIQAFGWKSFFKGHTKIRLGVVEKFYTAKLNAEEFSVQISGKTVSFNAEAINALYDLPNNV
        E + S  + +     +     +EKG+F F GQLP FL +PI+A  WK FF+G T IR  V+  FY   +N E     + GK V+F  + +N LY L    
Subjt:  EERLSGSDTISKTPSINSLIKVEKGLFSFNGQLPDFLYAPIQAFGWKSFFKGHTKIRLGVVEKFYTAKLNAEEFSVQISGKTVSFNAEAINALYDLPNNV

Query:  ETQGQLYVDNPTKKMAREALEVIAWPGAAWEVTPTGKYQLYPHQLTTEASVWLFFIKKKIFPKRHDSTINLESAMLLYCILAKKRVNLGELIATSILAWM
         T        P+    + ALE +AWPG  W++TP  KYQL+PH L T ASVWL FIKK + P RHD+TI+LE  MLLYCI+ +  +N+ E+I   I AW+
Subjt:  ETQGQLYVDNPTKKMAREALEVIAWPGAAWEVTPTGKYQLYPHQLTTEASVWLFFIKKKIFPKRHDSTINLESAMLLYCILAKKRVNLGELIATSILAWM

KAA0049609.1 transposase [Cucumis melo var. makuwa]1.8e-3148.47Show/hide
Query:  ISGKTVSFNAEAINALYDLPNNVET-QGQLYVDNPTKKMAREALEVIAWPGAAWEVTPTGKYQLYPHQLTTEASVWLFFIKKKIFPKRHDSTINLESAML
        +  + V F  E IN LYDLPN++    GQ  + +  +  A++ +++I WP A    TPT + QL+PHQLT EA+VWLFFIKKKIFP  HDSTI  E A++
Subjt:  ISGKTVSFNAEAINALYDLPNNVET-QGQLYVDNPTKKMAREALEVIAWPGAAWEVTPTGKYQLYPHQLTTEASVWLFFIKKKIFPKRHDSTINLESAML

Query:  LYCILAKKRVNLGELIATSILAWMQAPKGVMPFPSTVEALCLKAV-PFLSAIQTISIPGGLCN
        LYCI AKK  NLG ++  + L+WM+ PK   PFP+TV+ LCLK +      I  I + GG CN
Subjt:  LYCILAKKRVNLGELIATSILAWMQAPKGVMPFPSTVEALCLKAV-PFLSAIQTISIPGGLCN

KAA0049609.1 transposase [Cucumis melo var. makuwa]8.2e+0027.15Show/hide
Query:  VCKPMMEQFEHILANQGDQATQLHNLQTRVDQLQRPNAENLV-TPASQEEINVLRCEIRDLSANNAQISTTVSNLFTSVSNFTSLVMGQSEMMRQIAARH
        V K      E  +  +G+    L+N+ T  + +     E L+   A++E+I  L  +++ L A  + +++TVS L ++++N T L++   + +  I  RH
Subjt:  VCKPMMEQFEHILANQGDQATQLHNLQTRVDQLQRPNAENLV-TPASQEEINVLRCEIRDLSANNAQISTTVSNLFTSVSNFTSLVMGQSEMMRQIAARH

Query:  DRQFRTQMEYTYAAIVQ----CVPAPIIPPDLEAPFPPITRPGDPDPRQDN
         R+F  +MEY +A  VQ    C  A +           +T P +PDP  +N
Subjt:  DRQFRTQMEYTYAAIVQ----CVPAPIIPPDLEAPFPPITRPGDPDPRQDN

KAA0054837.1 hypothetical protein E6C27_scaffold406G00150 [Cucumis melo var. makuwa]1.0e-3433.33Show/hide
Query:  KTRKTIEAVKLPAKVRALEPERNLEAIAEEFEEELEAMSPLDDGPSPRKSREVAGPSRGRKKLGRSGPEERLSGSDTISKTPSINSL-----IKVEKGLF
        KT +  E    P        +  +    E  EE+   +SPL++    R+ R+      G+  + R   E+     ++      + S        VEKG F
Subjt:  KTRKTIEAVKLPAKVRALEPERNLEAIAEEFEEELEAMSPLDDGPSPRKSREVAGPSRGRKKLGRSGPEERLSGSDTISKTPSINSL-----IKVEKGLF

Query:  SFNGQLPDFLYAPIQAFGWKSFFKGHTKIRLGVVEKFYTAKLNAEEFSVQISGKTVSFNAEAINALYDLPNNVETQGQLYVDNPTKKMAREALEVIAWPG
         F  QL  FL  PI+A GW+ F +G   IR GVV+ FY  K++ E+    +  +                             P+    +EALE +AW  
Subjt:  SFNGQLPDFLYAPIQAFGWKSFFKGHTKIRLGVVEKFYTAKLNAEEFSVQISGKTVSFNAEAINALYDLPNNVETQGQLYVDNPTKKMAREALEVIAWPG

Query:  AAWEVTPTGKYQLYPHQLTTEASVWLFFIKKKIFPKRHDSTINLESAMLLYCILAKKRVNLGELIATSILAWMQAPKGVMPFPSTVEALCLKAVPFL-SA
          W+VT   KY+L+ H LTTEASVWL FIKKK+ P RHD+TI+ E  MLLYCI+ +  V++ E+I   I AW+Q P+G  PFP  +E LCL++   L  +
Subjt:  AAWEVTPTGKYQLYPHQLTTEASVWLFFIKKKIFPKRHDSTINLESAMLLYCILAKKRVNLGELIATSILAWMQAPKGVMPFPSTVEALCLKAVPFL-SA

Query:  IQTISIPGGLCN
         Q   +  G+CN
Subjt:  IQTISIPGGLCN

KAA0062900.1 gag/pol protein [Cucumis melo var. makuwa]7.3e-4150.56Show/hide
Query:  KIRLGVVEKFYTAKLNAEEFSVQISGKTVSFNAEAINALYDLPNNVETQGQLYVDNPTKKMAREALEVIAWPGAAWEVTPTG-KYQLYPHQLTTEASVWL
        KIR+ VV KFY  K N  +  + I  +   FN E IN LY+ PN+ E  GQ  V   TK +A+EAL+V+AWPG   EV P   +YQLYPH LTT+A+VW+
Subjt:  KIRLGVVEKFYTAKLNAEEFSVQISGKTVSFNAEAINALYDLPNNVETQGQLYVDNPTKKMAREALEVIAWPGAAWEVTPTG-KYQLYPHQLTTEASVWL

Query:  FFIKKKIFPKRHDSTINLESAMLLYCILAKKRVNLGELIATSILAWMQAPKGVMPFPSTVEALCLKAVPFLSAIQTISIP
        FF K KIFP  +DSTI+++  ++LYCI+ KK +NL E+I  +IL WM+ PK  MPFPS +E LCLK +P L      +IP
Subjt:  FFIKKKIFPKRHDSTINLESAMLLYCILAKKRVNLGELIATSILAWMQAPKGVMPFPSTVEALCLKAVPFLSAIQTISIP

TrEMBL top hitse value%identityAlignment
A0A5A7TZE0 Protein MNN4-like3.8e-4337Show/hide
Query:  IRSPLNEVAELHKKISDKLAQVSFAKTRKTIEAVKLPAKVRALEPERNLEAIAEEFEEELEAMSPLDD---GPSPRKSREVAGPS----RGRKKLGRSGP
        + S   +VA   ++ ++KL +    K +   + VK  A+ +    E+  +  ++EFE+ELE +SPL+D      P+K R + G        + K  +   
Subjt:  IRSPLNEVAELHKKISDKLAQVSFAKTRKTIEAVKLPAKVRALEPERNLEAIAEEFEEELEAMSPLDD---GPSPRKSREVAGPS----RGRKKLGRSGP

Query:  EERLSGSDTISKTPSINSLIKVEKGLFSFNGQLPDFLYAPIQAFGWKSFFKGHTKIRLGVVEKFYTAKLNAEEFSVQISGKTVSFNAEAINALYDLPNNV
        E + S  + +     +     +EKG+F F GQLP FL +PI+A  WK FF+G T IR  V+  FY   +N E     + GK V+F  + +N LY L    
Subjt:  EERLSGSDTISKTPSINSLIKVEKGLFSFNGQLPDFLYAPIQAFGWKSFFKGHTKIRLGVVEKFYTAKLNAEEFSVQISGKTVSFNAEAINALYDLPNNV

Query:  ETQGQLYVDNPTKKMAREALEVIAWPGAAWEVTPTGKYQLYPHQLTTEASVWLFFIKKKIFPKRHDSTINLESAMLLYCILAKKRVNLGELIATSILAWM
         T        P+    + ALE +AWPG  W++TP  KYQL+PH L T ASVWL FIKK + P RHD+TI+LE  MLLYCI+ +  +N+ E+I   I AW+
Subjt:  ETQGQLYVDNPTKKMAREALEVIAWPGAAWEVTPTGKYQLYPHQLTTEASVWLFFIKKKIFPKRHDSTINLESAMLLYCILAKKRVNLGELIATSILAWM

A0A5A7U806 Transposase8.8e-3248.47Show/hide
Query:  ISGKTVSFNAEAINALYDLPNNVET-QGQLYVDNPTKKMAREALEVIAWPGAAWEVTPTGKYQLYPHQLTTEASVWLFFIKKKIFPKRHDSTINLESAML
        +  + V F  E IN LYDLPN++    GQ  + +  +  A++ +++I WP A    TPT + QL+PHQLT EA+VWLFFIKKKIFP  HDSTI  E A++
Subjt:  ISGKTVSFNAEAINALYDLPNNVET-QGQLYVDNPTKKMAREALEVIAWPGAAWEVTPTGKYQLYPHQLTTEASVWLFFIKKKIFPKRHDSTINLESAML

Query:  LYCILAKKRVNLGELIATSILAWMQAPKGVMPFPSTVEALCLKAV-PFLSAIQTISIPGGLCN
        LYCI AKK  NLG ++  + L+WM+ PK   PFP+TV+ LCLK +      I  I + GG CN
Subjt:  LYCILAKKRVNLGELIATSILAWMQAPKGVMPFPSTVEALCLKAV-PFLSAIQTISIPGGLCN

A0A5A7U806 Transposase4.0e+0027.15Show/hide
Query:  VCKPMMEQFEHILANQGDQATQLHNLQTRVDQLQRPNAENLV-TPASQEEINVLRCEIRDLSANNAQISTTVSNLFTSVSNFTSLVMGQSEMMRQIAARH
        V K      E  +  +G+    L+N+ T  + +     E L+   A++E+I  L  +++ L A  + +++TVS L ++++N T L++   + +  I  RH
Subjt:  VCKPMMEQFEHILANQGDQATQLHNLQTRVDQLQRPNAENLV-TPASQEEINVLRCEIRDLSANNAQISTTVSNLFTSVSNFTSLVMGQSEMMRQIAARH

Query:  DRQFRTQMEYTYAAIVQ----CVPAPIIPPDLEAPFPPITRPGDPDPRQDN
         R+F  +MEY +A  VQ    C  A +           +T P +PDP  +N
Subjt:  DRQFRTQMEYTYAAIVQ----CVPAPIIPPDLEAPFPPITRPGDPDPRQDN

A0A5A7V6M5 Gag/pol protein3.5e-4150.56Show/hide
Query:  KIRLGVVEKFYTAKLNAEEFSVQISGKTVSFNAEAINALYDLPNNVETQGQLYVDNPTKKMAREALEVIAWPGAAWEVTPTG-KYQLYPHQLTTEASVWL
        KIR+ VV KFY  K N  +  + I  +   FN E IN LY+ PN+ E  GQ  V   TK +A+EAL+V+AWPG   EV P   +YQLYPH LTT+A+VW+
Subjt:  KIRLGVVEKFYTAKLNAEEFSVQISGKTVSFNAEAINALYDLPNNVETQGQLYVDNPTKKMAREALEVIAWPGAAWEVTPTG-KYQLYPHQLTTEASVWL

Query:  FFIKKKIFPKRHDSTINLESAMLLYCILAKKRVNLGELIATSILAWMQAPKGVMPFPSTVEALCLKAVPFLSAIQTISIP
        FF K KIFP  +DSTI+++  ++LYCI+ KK +NL E+I  +IL WM+ PK  MPFPS +E LCLK +P L      +IP
Subjt:  FFIKKKIFPKRHDSTINLESAMLLYCILAKKRVNLGELIATSILAWMQAPKGVMPFPSTVEALCLKAVPFLSAIQTISIP

A0A5D3CW17 Uncharacterized protein6.5e-3544.16Show/hide
Query:  IRLGVVEKFYTAKLNAEEFSVQISGKTVSFNAEAINALYDLPNNVETQGQLYVDNPTKKMAREALEVIAWPGAAWEVTPTGKYQLYPHQLTTEASVWLFF
        IR  VV  FY A +N EE   ++  K V F  +AINALY L NN    G L  +NP  +  ++ALE I WPG  W+  PT KYQL+P+ L TE SVWL F
Subjt:  IRLGVVEKFYTAKLNAEEFSVQISGKTVSFNAEAINALYDLPNNVETQGQLYVDNPTKKMAREALEVIAWPGAAWEVTPTGKYQLYPHQLTTEASVWLFF

Query:  IKKKIFPKRHDSTINLESAMLLYCILAKKRVNLGELIATSILAWMQAPKGVMPFPSTVEALCLKAVPFLSAIQTISIPGGLCNQMALNRMITFHGHK
        IKK I P RHDSTI++E  MLLY      + N  E+    ++AW++ P G  PF    + L +KA P L   Q + +  G+C    L+R IT H +K
Subjt:  IKKKIFPKRHDSTINLESAMLLYCILAKKRVNLGELIATSILAWMQAPKGVMPFPSTVEALCLKAVPFLSAIQTISIPGGLCNQMALNRMITFHGHK

A0A5D3DVQ6 Uncharacterized protein5.0e-3533.33Show/hide
Query:  KTRKTIEAVKLPAKVRALEPERNLEAIAEEFEEELEAMSPLDDGPSPRKSREVAGPSRGRKKLGRSGPEERLSGSDTISKTPSINSL-----IKVEKGLF
        KT +  E    P        +  +    E  EE+   +SPL++    R+ R+      G+  + R   E+     ++      + S        VEKG F
Subjt:  KTRKTIEAVKLPAKVRALEPERNLEAIAEEFEEELEAMSPLDDGPSPRKSREVAGPSRGRKKLGRSGPEERLSGSDTISKTPSINSL-----IKVEKGLF

Query:  SFNGQLPDFLYAPIQAFGWKSFFKGHTKIRLGVVEKFYTAKLNAEEFSVQISGKTVSFNAEAINALYDLPNNVETQGQLYVDNPTKKMAREALEVIAWPG
         F  QL  FL  PI+A GW+ F +G   IR GVV+ FY  K++ E+    +  +                             P+    +EALE +AW  
Subjt:  SFNGQLPDFLYAPIQAFGWKSFFKGHTKIRLGVVEKFYTAKLNAEEFSVQISGKTVSFNAEAINALYDLPNNVETQGQLYVDNPTKKMAREALEVIAWPG

Query:  AAWEVTPTGKYQLYPHQLTTEASVWLFFIKKKIFPKRHDSTINLESAMLLYCILAKKRVNLGELIATSILAWMQAPKGVMPFPSTVEALCLKAVPFL-SA
          W+VT   KY+L+ H LTTEASVWL FIKKK+ P RHD+TI+ E  MLLYCI+ +  V++ E+I   I AW+Q P+G  PFP  +E LCL++   L  +
Subjt:  AAWEVTPTGKYQLYPHQLTTEASVWLFFIKKKIFPKRHDSTINLESAMLLYCILAKKRVNLGELIATSILAWMQAPKGVMPFPSTVEALCLKAVPFL-SA

Query:  IQTISIPGGLCN
         Q   +  G+CN
Subjt:  IQTISIPGGLCN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G44191.1 ECA1 gametogenesis related family protein6.9e-0532.43Show/hide
Query:  STPSPKAKKTKVLATKQLPLKFPHSSSRPIQRAPPPVQNSSNSNPPRSSSPIPITLSSPTISPRHSPLPHIRSPTNIPHLSPRPPTPPPTKSTSP-----
        S PSPK         K  P   P SS  PI +  PP    S+  P    SP P   SSP  SP+ SP P   SP+      P+P TPPPT   SP     
Subjt:  STPSPKAKKTKVLATKQLPLKFPHSSSRPIQRAPPPVQNSSNSNPPRSSSPIPITLSSPTISPRHSPLPHIRSPTNIPHLSPRPPTPPPTKSTSP-----

Query:  LHSKSPSPRRAEPLSPFLLSPIMDLTVLRHDQPATNTAVVEVSSPITHPTNRPLQPSPIL------LISEEGTPPTNQPSQPSP-----PPPIMAAAEKV
          S  PSP+++ P      SP          +P+T     + S P   P+  P +PSP              TPP+ +PS P P     PPP    +   
Subjt:  LHSKSPSPRRAEPLSPFLLSPIMDLTVLRHDQPATNTAVVEVSSPITHPTNRPLQPSPIL------LISEEGTPPTNQPSQPSP-----PPPIMAAAEKV

Query:  DDPHVKDKNHILNEVGETASSA
         DP V    H  N + E   SA
Subjt:  DDPHVKDKNHILNEVGETASSA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGGATCTTCAAGTCAAGATCAGATTAGAGTACGTTCTGATGGACCTGAATCAGAACGCCCCACATCGCGGGGAAGTGACGAACACAAAACGCCACAATACATCCA
CTTGAAGCACGGCAAGGGGAAGGTTTTACTAAAACCTCCGCCGGAAGCAGCTGACAATTTCTTGGAAGTTGAAGTTGATAATCAGGATACAGAGGCGATTCTCGAATTTC
TTAATGAACGAGCAAAGAAGAGGAATGAAGCCCACATCAAAAGGACTAAGGAAGCTCATTGCCGAAAGGACGAGCGTAAGCGAACGAGAGTTGACACAATTCGAAAGGCG
AAGAGCACACTAGAAATACGCTCACCGCTTAACGAGGTTGCAGAGCTTCACAAGAAGATCTCTGACAAGCTCGCACAAGTCTCGTTCGCTAAAACGAGGAAAACAATCGA
GGCAGTTAAGCTCCCTGCGAAAGTAAGAGCATTGGAGCCGGAAAGAAACCTCGAAGCAATCGCTGAAGAATTCGAGGAAGAGCTGGAGGCGATGAGCCCACTTGATGACG
GGCCATCGCCAAGAAAATCAAGGGAGGTCGCAGGACCATCAAGAGGAAGAAAGAAACTTGGACGTTCTGGACCTGAAGAACGCCTATCAGGCAGCGACACCATAAGCAAG
ACACCCTCTATCAACTCTCTCATCAAAGTTGAAAAGGGGTTGTTTTCGTTCAATGGTCAACTTCCTGACTTCCTCTACGCGCCAATTCAGGCGTTCGGATGGAAGTCATT
TTTCAAAGGGCACACCAAGATACGATTAGGAGTGGTAGAAAAATTTTATACGGCTAAACTCAACGCTGAAGAGTTTAGCGTACAAATAAGTGGAAAGACAGTGAGTTTCA
ACGCGGAGGCCATCAATGCGTTGTATGATTTGCCCAACAATGTTGAAACCCAAGGGCAATTATACGTAGATAATCCTACGAAGAAGATGGCCCGTGAAGCGTTGGAAGTC
ATCGCATGGCCTGGGGCCGCATGGGAAGTAACGCCAACAGGGAAGTATCAGTTGTATCCACACCAACTAACCACTGAAGCAAGTGTGTGGTTGTTCTTTATCAAGAAGAA
GATCTTCCCAAAACGCCATGATAGCACCATCAATTTGGAGTCAGCAATGCTACTTTATTGCATCCTAGCAAAGAAGCGTGTTAACCTTGGCGAACTTATAGCCACATCCA
TTCTGGCATGGATGCAAGCTCCCAAAGGCGTGATGCCCTTCCCTTCAACCGTTGAGGCCCTTTGCCTTAAAGCTGTGCCATTCTTATCCGCCATCCAAACCATCTCAATA
CCTGGCGGACTGTGTAATCAAATGGCCTTAAACCGCATGATTACTTTCCATGGACACAAGGAAATGGAAAGGCGGGCAAAGACATTAGGCGACACGCCTGAAGGAATGGC
CCTAGCAGAAAGAAAAAGAAAGGCCCTAGTCGTCGCATCAACCCCCTCACCTAAAGCCAAAAAAACAAAGGTTCTTGCGACGAAGCAACTTCCACTGAAATTTCCCCACT
CCTCATCTCGCCCAATACAGCGAGCTCCACCACCAGTCCAAAATTCCAGCAACTCCAATCCTCCCCGCTCTTCTTCGCCCATTCCAATCACCCTATCATCCCCAACTATC
TCTCCCCGCCATTCACCTCTCCCCCACATTCGTTCCCCCACCAACATCCCTCACCTTTCCCCACGACCGCCTACACCTCCGCCCACAAAATCCACTTCCCCCCTTCACTC
CAAATCACCCTCACCAAGGCGAGCTGAACCCCTTTCGCCTTTTCTTCTTTCACCCATCATGGACCTGACCGTTCTTCGCCATGACCAACCCGCGACCAACACTGCAGTCG
TTGAGGTTTCTTCGCCCATCACCCACCCAACCAACCGTCCTCTGCAACCTTCCCCCATTCTTCTAATCTCAGAAGAGGGCACACCTCCCACCAACCAACCATCCCAACCA
TCACCACCACCGCCTATTATGGCCGCAGCGGAAAAAGTTGATGACCCACACGTTAAGGACAAAAACCACATCCTTAATGAAGTTGGCGAGACTGCCTCCTCTGCGCATAC
CCCCATTGCTCAACCATTCACCGCACCAGGAGACGATGAAGATTTCGCCGAAATGCTGGGTTCCCTTGTGTGTAAGCCAATGATGGAGCAATTCGAACATATTTTGGCTA
ACCAAGGGGATCAGGCGACGCAACTCCACAATTTGCAAACTCGAGTTGATCAGTTGCAGCGTCCCAACGCCGAAAATCTTGTAACGCCCGCATCTCAAGAAGAAATCAAT
GTTTTGCGATGCGAAATAAGGGACCTTTCGGCGAACAACGCCCAGATCTCCACCACAGTCTCCAACCTTTTCACCTCCGTCTCCAACTTCACCTCCTTGGTCATGGGCCA
ATCAGAAATGATGCGGCAAATAGCTGCGCGGCATGATAGGCAGTTTCGCACACAGATGGAATACACATATGCAGCAATTGTGCAATGCGTGCCTGCCCCAATCATACCGC
CAGATCTCGAAGCACCCTTTCCACCAATCACGCGTCCTGGCGATCCTGACCCCCGCCAAGACAATTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCCGGATCTTCAAGTCAAGATCAGATTAGAGTACGTTCTGATGGACCTGAATCAGAACGCCCCACATCGCGGGGAAGTGACGAACACAAAACGCCACAATACATCCA
CTTGAAGCACGGCAAGGGGAAGGTTTTACTAAAACCTCCGCCGGAAGCAGCTGACAATTTCTTGGAAGTTGAAGTTGATAATCAGGATACAGAGGCGATTCTCGAATTTC
TTAATGAACGAGCAAAGAAGAGGAATGAAGCCCACATCAAAAGGACTAAGGAAGCTCATTGCCGAAAGGACGAGCGTAAGCGAACGAGAGTTGACACAATTCGAAAGGCG
AAGAGCACACTAGAAATACGCTCACCGCTTAACGAGGTTGCAGAGCTTCACAAGAAGATCTCTGACAAGCTCGCACAAGTCTCGTTCGCTAAAACGAGGAAAACAATCGA
GGCAGTTAAGCTCCCTGCGAAAGTAAGAGCATTGGAGCCGGAAAGAAACCTCGAAGCAATCGCTGAAGAATTCGAGGAAGAGCTGGAGGCGATGAGCCCACTTGATGACG
GGCCATCGCCAAGAAAATCAAGGGAGGTCGCAGGACCATCAAGAGGAAGAAAGAAACTTGGACGTTCTGGACCTGAAGAACGCCTATCAGGCAGCGACACCATAAGCAAG
ACACCCTCTATCAACTCTCTCATCAAAGTTGAAAAGGGGTTGTTTTCGTTCAATGGTCAACTTCCTGACTTCCTCTACGCGCCAATTCAGGCGTTCGGATGGAAGTCATT
TTTCAAAGGGCACACCAAGATACGATTAGGAGTGGTAGAAAAATTTTATACGGCTAAACTCAACGCTGAAGAGTTTAGCGTACAAATAAGTGGAAAGACAGTGAGTTTCA
ACGCGGAGGCCATCAATGCGTTGTATGATTTGCCCAACAATGTTGAAACCCAAGGGCAATTATACGTAGATAATCCTACGAAGAAGATGGCCCGTGAAGCGTTGGAAGTC
ATCGCATGGCCTGGGGCCGCATGGGAAGTAACGCCAACAGGGAAGTATCAGTTGTATCCACACCAACTAACCACTGAAGCAAGTGTGTGGTTGTTCTTTATCAAGAAGAA
GATCTTCCCAAAACGCCATGATAGCACCATCAATTTGGAGTCAGCAATGCTACTTTATTGCATCCTAGCAAAGAAGCGTGTTAACCTTGGCGAACTTATAGCCACATCCA
TTCTGGCATGGATGCAAGCTCCCAAAGGCGTGATGCCCTTCCCTTCAACCGTTGAGGCCCTTTGCCTTAAAGCTGTGCCATTCTTATCCGCCATCCAAACCATCTCAATA
CCTGGCGGACTGTGTAATCAAATGGCCTTAAACCGCATGATTACTTTCCATGGACACAAGGAAATGGAAAGGCGGGCAAAGACATTAGGCGACACGCCTGAAGGAATGGC
CCTAGCAGAAAGAAAAAGAAAGGCCCTAGTCGTCGCATCAACCCCCTCACCTAAAGCCAAAAAAACAAAGGTTCTTGCGACGAAGCAACTTCCACTGAAATTTCCCCACT
CCTCATCTCGCCCAATACAGCGAGCTCCACCACCAGTCCAAAATTCCAGCAACTCCAATCCTCCCCGCTCTTCTTCGCCCATTCCAATCACCCTATCATCCCCAACTATC
TCTCCCCGCCATTCACCTCTCCCCCACATTCGTTCCCCCACCAACATCCCTCACCTTTCCCCACGACCGCCTACACCTCCGCCCACAAAATCCACTTCCCCCCTTCACTC
CAAATCACCCTCACCAAGGCGAGCTGAACCCCTTTCGCCTTTTCTTCTTTCACCCATCATGGACCTGACCGTTCTTCGCCATGACCAACCCGCGACCAACACTGCAGTCG
TTGAGGTTTCTTCGCCCATCACCCACCCAACCAACCGTCCTCTGCAACCTTCCCCCATTCTTCTAATCTCAGAAGAGGGCACACCTCCCACCAACCAACCATCCCAACCA
TCACCACCACCGCCTATTATGGCCGCAGCGGAAAAAGTTGATGACCCACACGTTAAGGACAAAAACCACATCCTTAATGAAGTTGGCGAGACTGCCTCCTCTGCGCATAC
CCCCATTGCTCAACCATTCACCGCACCAGGAGACGATGAAGATTTCGCCGAAATGCTGGGTTCCCTTGTGTGTAAGCCAATGATGGAGCAATTCGAACATATTTTGGCTA
ACCAAGGGGATCAGGCGACGCAACTCCACAATTTGCAAACTCGAGTTGATCAGTTGCAGCGTCCCAACGCCGAAAATCTTGTAACGCCCGCATCTCAAGAAGAAATCAAT
GTTTTGCGATGCGAAATAAGGGACCTTTCGGCGAACAACGCCCAGATCTCCACCACAGTCTCCAACCTTTTCACCTCCGTCTCCAACTTCACCTCCTTGGTCATGGGCCA
ATCAGAAATGATGCGGCAAATAGCTGCGCGGCATGATAGGCAGTTTCGCACACAGATGGAATACACATATGCAGCAATTGTGCAATGCGTGCCTGCCCCAATCATACCGC
CAGATCTCGAAGCACCCTTTCCACCAATCACGCGTCCTGGCGATCCTGACCCCCGCCAAGACAATTAA
Protein sequenceShow/hide protein sequence
MAGSSSQDQIRVRSDGPESERPTSRGSDEHKTPQYIHLKHGKGKVLLKPPPEAADNFLEVEVDNQDTEAILEFLNERAKKRNEAHIKRTKEAHCRKDERKRTRVDTIRKA
KSTLEIRSPLNEVAELHKKISDKLAQVSFAKTRKTIEAVKLPAKVRALEPERNLEAIAEEFEEELEAMSPLDDGPSPRKSREVAGPSRGRKKLGRSGPEERLSGSDTISK
TPSINSLIKVEKGLFSFNGQLPDFLYAPIQAFGWKSFFKGHTKIRLGVVEKFYTAKLNAEEFSVQISGKTVSFNAEAINALYDLPNNVETQGQLYVDNPTKKMAREALEV
IAWPGAAWEVTPTGKYQLYPHQLTTEASVWLFFIKKKIFPKRHDSTINLESAMLLYCILAKKRVNLGELIATSILAWMQAPKGVMPFPSTVEALCLKAVPFLSAIQTISI
PGGLCNQMALNRMITFHGHKEMERRAKTLGDTPEGMALAERKRKALVVASTPSPKAKKTKVLATKQLPLKFPHSSSRPIQRAPPPVQNSSNSNPPRSSSPIPITLSSPTI
SPRHSPLPHIRSPTNIPHLSPRPPTPPPTKSTSPLHSKSPSPRRAEPLSPFLLSPIMDLTVLRHDQPATNTAVVEVSSPITHPTNRPLQPSPILLISEEGTPPTNQPSQP
SPPPPIMAAAEKVDDPHVKDKNHILNEVGETASSAHTPIAQPFTAPGDDEDFAEMLGSLVCKPMMEQFEHILANQGDQATQLHNLQTRVDQLQRPNAENLVTPASQEEIN
VLRCEIRDLSANNAQISTTVSNLFTSVSNFTSLVMGQSEMMRQIAARHDRQFRTQMEYTYAAIVQCVPAPIIPPDLEAPFPPITRPGDPDPRQDN