CuGenDBv2

PI0007699 (gene) of Melon (PI 482460) v1 genome

Gene ID: PI0007699
Organism: Cucumis metuliferus PI 482460 (Melon (PI 482460) v1)
Description: Retroelement pol polyprotein-like
Genome location: chr02:8084489..8091395
RNA-Seq Expression: PI0007699
Synteny: PI0007699
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits (e-value, % identity, alignment)
KAA0048713.1 hypothetical protein E6C27_scaffold43G00050 [Cucumis melo var. makuwa] (e-value: 1.5e-57; % identity: 44.97)
Query:  MSFQQKDRENLHDAWSRFKQMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVFVDGMLKSTYNQIKTTLDTMASNNEEWDEDDFGSH------RGGRV
        M+F+Q+DRENL D W RFK+M+K CPH+ IP+C+LME FYFGL+K T Q+A+ VF  GML+S+YNQIK  LDTMASN++EW ++ FGS       +G R 
Subjt:  MSFQQKDRENLHDAWSRFKQMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVFVDGMLKSTYNQIKTTLDTMASNNEEWDEDDFGSH------RGGRV

Query:  KGDDGMDKSVVVALQGQMTAMNNLLKSMAISQVNAVGSSVHAANQIDDMGCVGCGGHHNTDACPLNTETVVFVRNDPFSNIYNPGWRNHLNFGWGGSGQQ
        + +DG+D S++VALQGQ+  M N+L+SMA+ QVN V SSV    Q+++MGCVGC   HNT+ACPLNTE V +++NDP            ++  WGG   Q
Subjt:  KGDDGMDKSVVVALQGQMTAMNNLLKSMAISQVNAVGSSVHAANQIDDMGCVGCGGHHNTDACPLNTETVVFVRNDPFSNIYNPGWRNHLNFGWGGSGQQ

Query:  QGRHGGQGDHCGEASGSHARYQNNRPQHSHHQHQPTTTASSTSPMESLLREYMQKNDALLQSQASSIRNLEVQLGQLASDFSERQQGSLPSNTETPNQ
                                                                   + SQASSI+N+E+QLGQL SDFS R + S PSNTETPNQ
Subjt:  QGRHGGQGDHCGEASGSHARYQNNRPQHSHHQHQPTTTASSTSPMESLLREYMQKNDALLQSQASSIRNLEVQLGQLASDFSERQQGSLPSNTETPNQ

XP_030497803.1 uncharacterized protein LOC115713460 [Cannabis sativa] (e-value: 7.5e-54; % identity: 43.57)
Query:  MSFQQKDRENLHDAWSRFKQMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVFVDGMLKSTYNQIKTTLDTMASNNEEWDEDDFGSHRGGRVKGDDGM
        MSFQQ + E   DAW RFK++++ CPH+GIP CI +E FY GLN A++   DA     +L  +YN+    L+ +ASNN +W  +   +H   +V G   +
Subjt:  MSFQQKDRENLHDAWSRFKQMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVFVDGMLKSTYNQIKTTLDTMASNNEEWDEDDFGSHRGGRVKGDDGM

Query:  DKSVVVALQGQMTAMNNLLKSMAISQVNAVGSSVH--AANQIDDMGCVGCGGHHNTDACPLNTETVVFV-------RNDPFSNIYNPGWRNHLNFGWGGS
        +   + AL  QM +M N+LK+M       +G SV   AA Q     CV CG  H  + CP N  +V +V        N+P+SN YNP W++H NF WGG 
Subjt:  DKSVVVALQGQMTAMNNLLKSMAISQVNAVGSSVH--AANQIDDMGCVGCGGHHNTDACPLNTETVVFV-------RNDPFSNIYNPGWRNHLNFGWGGS

Query:  GQQQGRHGGQGDHCGEASGSHARYQNNRPQHSHHQHQPTTTASSTSPMESLLREYMQKNDALLQSQASSIRNLEVQLGQLASDFSERQQGSLPSNTETPN
        G+Q    G                Q  RPQ     HQP    S TS +ESL+R+YM KND ++QSQA+S+RNLEVQLGQLA+D   R QG+LPS+TE P 
Subjt:  GQQQGRHGGQGDHCGEASGSHARYQNNRPQHSHHQHQPTTTASSTSPMESLLREYMQKNDALLQSQASSIRNLEVQLGQLASDFSERQQGSLPSNTETPN

Query:  QAGGSGKEKCHAVTLRSGR
        +    GKE C AVTLRSG+
Subjt:  QAGGSGKEKCHAVTLRSGR

XP_030498047.1 uncharacterized protein LOC115713707 [Cannabis sativa] (e-value: 1.2e-54; % identity: 44.24)
Query:  MSFQQKDRENLHDAWSRFKQMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVFVDGMLKSTYNQIKTTLDTMASNNEEWDEDDFGSHRGGRVKGDDGM
        MSFQQ D E   DAW RFK++++ CPH+GIP CI +E FY GLN A++   DA     +   +YN+    ++ +ASNN +W  +   + R  +V G   +
Subjt:  MSFQQKDRENLHDAWSRFKQMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVFVDGMLKSTYNQIKTTLDTMASNNEEWDEDDFGSHRGGRVKGDDGM

Query:  DKSVVVALQGQMTAMNNLLKSMAISQVNAVGSSVH--AANQIDDMGCVGCGGHHNTDACPLNTETVVFV-------RNDPFSNIYNPGWRNHLNFGWGGS
        +   + AL  QM +M N+LK+M       +G SV   AA Q  ++ CV CG  H  + CP N  +V +V        N+P+SN YNP W++H NF WGG 
Subjt:  DKSVVVALQGQMTAMNNLLKSMAISQVNAVGSSVH--AANQIDDMGCVGCGGHHNTDACPLNTETVVFV-------RNDPFSNIYNPGWRNHLNFGWGGS

Query:  GQQQGRHGGQGDHCGEASGSHARYQNNRPQHSHHQHQPTTTASSTSPMESLLREYMQKNDALLQSQASSIRNLEVQLGQLASDFSERQQGSLPSNTETPN
        G      G QG   G+ S      Q  RPQ S   HQP    S TS +ESL+R+YM KNDA++QSQA+S+RNLEVQLGQLA++   R QG+LPS+TE P 
Subjt:  GQQQGRHGGQGDHCGEASGSHARYQNNRPQHSHHQHQPTTTASSTSPMESLLREYMQKNDALLQSQASSIRNLEVQLGQLASDFSERQQGSLPSNTETPN

Query:  QAGGSGKEKCHAVTLRSGRNL
        +    GKE C A+TLRSG+ L
Subjt:  QAGGSGKEKCHAVTLRSGRNL

XP_030509259.1 uncharacterized protein LOC115723937 [Cannabis sativa] (e-value: 2.6e-54; % identity: 44.2)
Query:  MSFQQKDRENLHDAWSRFKQMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVFVDGMLKSTYNQIKTTLDTMASNNEEWDEDDFGSHRGGRVKGDDGM
        MSFQQ + E   DAW RFK++++ CPH+GIP CI +E FY GLN A++   DA     +L  +YN+    L+ +ASNN +W  +   + R  +V G   +
Subjt:  MSFQQKDRENLHDAWSRFKQMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVFVDGMLKSTYNQIKTTLDTMASNNEEWDEDDFGSHRGGRVKGDDGM

Query:  DKSVVVALQGQMTAMNNLLKSMAISQVNAVGSSVH--AANQIDDMGCVGCGGHHNTDACPLNTETVVFV-------RNDPFSNIYNPGWRNHLNFGWGGS
        +   + AL  QM +M N+LK+M       +G SV   AA Q  ++ CV CG  H  + CP N  +V +V        N+P+SN YNP W++H NF WGG 
Subjt:  DKSVVVALQGQMTAMNNLLKSMAISQVNAVGSSVH--AANQIDDMGCVGCGGHHNTDACPLNTETVVFV-------RNDPFSNIYNPGWRNHLNFGWGGS

Query:  GQQQGRHGGQGDHCGEASGSHARYQNNRPQHSHHQHQPTTTASSTSPMESLLREYMQKNDALLQSQASSIRNLEVQLGQLASDFSERQQGSLPSNTETPN
        G       G     G+ S      Q  RPQ     HQP    S TS +ESL+R+YM KNDA++QSQA+S+RNLEVQLGQLA+D   R QG+LPS+TE P 
Subjt:  GQQQGRHGGQGDHCGEASGSHARYQNNRPQHSHHQHQPTTTASSTSPMESLLREYMQKNDALLQSQASSIRNLEVQLGQLASDFSERQQGSLPSNTETPN

Query:  QAGGSGKEKCHAVTLRSGR
        +    GKE C AVTLRSG+
Subjt:  QAGGSGKEKCHAVTLRSGR

XP_030510138.1 uncharacterized protein LOC115724905 [Cannabis sativa] (e-value: 4.4e-54; % identity: 43.57)
Query:  MSFQQKDRENLHDAWSRFKQMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVFVDGMLKSTYNQIKTTLDTMASNNEEWDEDDFGSHRGGRVKGDDGM
        MSFQQ + E   DAW RFK++++ CPH+GIP CI +E FY GLN A++   DA     +L  +YN+    L+ +ASNN +W  +   + R  +V G   +
Subjt:  MSFQQKDRENLHDAWSRFKQMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVFVDGMLKSTYNQIKTTLDTMASNNEEWDEDDFGSHRGGRVKGDDGM

Query:  DKSVVVALQGQMTAMNNLLKSMAISQVNAVGSSVH--AANQIDDMGCVGCGGHHNTDACPLNTETVVFV-------RNDPFSNIYNPGWRNHLNFGWGGS
        +   + AL  QM +M N+LK+M       +G SV   AA Q  ++ CV CG  H  + CP N  +V +V        N+P+SN YNP W++H NF WGG 
Subjt:  DKSVVVALQGQMTAMNNLLKSMAISQVNAVGSSVH--AANQIDDMGCVGCGGHHNTDACPLNTETVVFV-------RNDPFSNIYNPGWRNHLNFGWGGS

Query:  GQQQGRHGGQGDHCGEASGSHARYQNNRPQHSHHQHQPTTTASSTSPMESLLREYMQKNDALLQSQASSIRNLEVQLGQLASDFSERQQGSLPSNTETPN
        G       GQG             Q+  P  S   HQP    S TS +ESL+R+YM KNDA++QSQA+S+RNLEVQLGQLA+D   R QG+LPS+TE P 
Subjt:  GQQQGRHGGQGDHCGEASGSHARYQNNRPQHSHHQHQPTTTASSTSPMESLLREYMQKNDALLQSQASSIRNLEVQLGQLASDFSERQQGSLPSNTETPN

Query:  QAGGSGKEKCHAVTLRSGR
        +     KE C AVTLRSG+
Subjt:  QAGGSGKEKCHAVTLRSGR

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0K6F9 Uncharacterized protein (e-value: 3.2e-42; % identity: 60.99)
Query:  MSFQQKDRENLHDAWSRFKQMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVFVDGMLKSTYNQIKTTLDTMASNNEEWDEDDFGSHRGGRVKGDDGM
        M F+Q+D+EN+HD WSRFK++VKACP +GIP+C+ MEVFYFGL+K T Q  + +FV GML+S+YNQIK TLD+M++N++EWD+  FGS   GR K  +G+
Subjt:  MSFQQKDRENLHDAWSRFKQMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVFVDGMLKSTYNQIKTTLDTMASNNEEWDEDDFGSHRGGRVKGDDGM

Query:  DKSVVVALQGQMTAMNNLLKSMAISQVNAVGSSVHAANQID
        DKSVVV LQGQM AMNNLL+SM +SQVNA  + +HA  Q++
Subjt:  DKSVVVALQGQMTAMNNLLKSMAISQVNAVGSSVHAANQID

A0A5B6VWJ0 Retroelement pol polyprotein-like (e-value: 1.7e-43; % identity: 34.1)
Query:  SFQQKDRENLHDAWSRFKQMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVFVDGMLKSTYNQIKTTLDTMASNNEEWDEDDFGSHRGGRVKGDDGMD
        +F   D E+L++AW RFK++++ CPH+GIP CI +E FY GL   T+   DA     +L  +YN+    ++ +ASNN +W      S  G RV G   +D
Subjt:  SFQQKDRENLHDAWSRFKQMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVFVDGMLKSTYNQIKTTLDTMASNNEEWDEDDFGSHRGGRVKGDDGMD

Query:  KSVVVALQGQMTAMNNLLKSMAISQVNAVGSSVHAANQIDDMGCVGCGGHHNTDACPLNTETVVFVRNDP--------FSNIYNPGWRNHLNFGWGGSGQ
           + +L  Q+++++++ K++  +  N+   +    NQ +++  V CG  H  + CP N E+V ++ N           SN YN  WRNHL+F W   G 
Subjt:  KSVVVALQGQMTAMNNLLKSMAISQVNAVGSSVHAANQIDDMGCVGCGGHHNTDACPLNTETVVFVRNDP--------FSNIYNPGWRNHLNFGWGGSGQ

Query:  QQGRHGGQGDHCGEASGSHARYQNNRP---QHSHHQHQPTTTASSTSPMESLLREYMQKNDALLQSQASSIRNLEVQLGQLASDFSERQQGSLPSNTETP
                       +G+   Y   RP    +   Q Q    A +++ +ESLL+ YM KNDAL+QSQA++++NLE Q+GQLA++   R QG+LPS+TE P
Subjt:  QQGRHGGQGDHCGEASGSHARYQNNRP---QHSHHQHQPTTTASSTSPMESLLREYMQKNDALLQSQASSIRNLEVQLGQLASDFSERQQGSLPSNTETP

Query:  NQAGGSGKEKCHAVTLRSGRNLTIRDLDAECSYPNSNSTAEIGSTSKIP
              GKE C A+TLRS + +    ++ E    N+    E+  + + P
Subjt:  NQAGGSGKEKCHAVTLRSGRNLTIRDLDAECSYPNSNSTAEIGSTSKIP

A0A5D3CC26 Uncharacterized protein (e-value: 7.0e-58; % identity: 44.97)
Query:  MSFQQKDRENLHDAWSRFKQMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVFVDGMLKSTYNQIKTTLDTMASNNEEWDEDDFGSH------RGGRV
        M+F+Q+DRENL D W RFK+M+K CPH+ IP+C+LME FYFGL+K T Q+A+ VF  GML+S+YNQIK  LDTMASN++EW ++ FGS       +G R 
Subjt:  MSFQQKDRENLHDAWSRFKQMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVFVDGMLKSTYNQIKTTLDTMASNNEEWDEDDFGSH------RGGRV

Query:  KGDDGMDKSVVVALQGQMTAMNNLLKSMAISQVNAVGSSVHAANQIDDMGCVGCGGHHNTDACPLNTETVVFVRNDPFSNIYNPGWRNHLNFGWGGSGQQ
        + +DG+D S++VALQGQ+  M N+L+SMA+ QVN V SSV    Q+++MGCVGC   HNT+ACPLNTE V +++NDP            ++  WGG   Q
Subjt:  KGDDGMDKSVVVALQGQMTAMNNLLKSMAISQVNAVGSSVHAANQIDDMGCVGCGGHHNTDACPLNTETVVFVRNDPFSNIYNPGWRNHLNFGWGGSGQQ

Query:  QGRHGGQGDHCGEASGSHARYQNNRPQHSHHQHQPTTTASSTSPMESLLREYMQKNDALLQSQASSIRNLEVQLGQLASDFSERQQGSLPSNTETPNQ
                                                                   + SQASSI+N+E+QLGQL SDFS R + S PSNTETPNQ
Subjt:  QGRHGGQGDHCGEASGSHARYQNNRPQHSHHQHQPTTTASSTSPMESLLREYMQKNDALLQSQASSIRNLEVQLGQLASDFSERQQGSLPSNTETPNQ

A0A5D3D2S0 Uncharacterized protein (e-value: 1.3e-43; % identity: 47.95)
Query:  MLKSTYNQIKTTLDTMASNNEEWDEDDFGSHRGGRVKGDDGMDKSVVVALQGQMTAMNNLLKSMAISQVNAVGSSVHAANQIDDMGCVGCGGHHNTDACP
        ML+S+Y QIKTTLD + +N++E  +DD       R + D GMD++V+VALQGQ+T M  LL+SMA+SQV+AVG+ V A  Q+D+M  VGCG  H TDAC 
Subjt:  MLKSTYNQIKTTLDTMASNNEEWDEDDFGSHRGGRVKGDDGMDKSVVVALQGQMTAMNNLLKSMAISQVNAVGSSVHAANQIDDMGCVGCGGHHNTDACP

Query:  LNTETVVFVRNDPFSNIYNPGWRNHLNFGWGGSGQQQGRHGGQGDHCGEASGSHARYQNNRPQHSHHQHQ--PTTTASSTSPMESLLREYMQKNDALLQS
        LN E   +V++ P+SN YN                        G++ GEA   H +   +RP +S  QHQ   T T SS S M +LLR+YMQ+ DA +QS
Subjt:  LNTETVVFVRNDPFSNIYNPGWRNHLNFGWGGSGQQQGRHGGQGDHCGEASGSHARYQNNRPQHSHHQHQ--PTTTASSTSPMESLLREYMQKNDALLQS

Query:  QASSIRNLEVQLGQLASDFSERQQGSLPSNTETPNQAGGSGKEK
        Q +SI NLE+ LGQLA DFS R  GSLPSN E PN   G  K K
Subjt:  QASSIRNLEVQLGQLASDFSERQQGSLPSNTETPNQAGGSGKEK

A0A6J1G7Q6 uncharacterized protein LOC111451598 (e-value: 4.6e-41; % identity: 36.91)
Query:  MSFQQKDRENLHDAWSRFKQMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVFVDGMLKSTYNQIKTTLDTMASNNEEWDEDDFGSHRGGRVKGDDGM
        ++FQ+ + E L +AW RFK+ ++ CPH+G+P CI +E FY GLN AT+Q  DA     +L  TYN+    L+ +ASNN +W   D  S+ G + +  + +
Subjt:  MSFQQKDRENLHDAWSRFKQMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVFVDGMLKSTYNQIKTTLDTMASNNEEWDEDDFGSHRGGRVKGDDGM

Query:  DKSVVVALQGQMTAMNNLLKSMAISQVNAVGSSVHAAN---QIDDMGCVGCGGHHNTDACPLNTETVVFVRN---------DPFSNIYNPGWRNHLNF--
        +   + ++  Q+ +M N+L+++A  Q + + +  H A    Q     CV CG  H  D CP N  ++ +V N         +P SN YNPGWRNH NF  
Subjt:  DKSVVVALQGQMTAMNNLLKSMAISQVNAVGSSVHAAN---QIDDMGCVGCGGHHNTDACPLNTETVVFVRN---------DPFSNIYNPGWRNHLNF--

Query:  -GWGGSGQQQGRHGGQGDHCGEASGSHARYQNNRPQHSHHQHQPTTTASSTSP--------MESLLREYMQKNDALLQSQASSIRNLEVQLGQLASDFSE
         G G   QQ           G         QN   Q ++   Q TT    TS         +ESL++EYM +NDA++QSQ  S+RNLEVQ+GQLA++   
Subjt:  -GWGGSGQQQGRHGGQGDHCGEASGSHARYQNNRPQHSHHQHQPTTTASSTSP--------MESLLREYMQKNDALLQSQASSIRNLEVQLGQLASDFSE

Query:  RQQGSLPSNTETPNQAG
        R  G LP++TE P + G
Subjt:  RQQGSLPSNTETPNQAG

SwissProt top hits
No hits found

Arabidopsis top hits
No hits found

Sequences
CDS sequence
ATGAGCTTCCAGCAGAAGGACAGAGAAAACCTACATGACGCGTGGAGTAGGTTCAAACAGATGGTCAAAGCATGCCCCCACAATGGCATTCCTAAATGCATATTGATGGA
GGTTTTCTATTTTGGACTAAACAAGGCTACACAGCAAACTGCTGATGCTGTGTTTGTAGACGGTATGCTGAAAAGTACATACAATCAGATTAAGACGACGCTGGACACGA
TGGCCAGCAACAATGAAGAATGGGATGAAGATGATTTCGGCAGTCACCGAGGAGGACGAGTAAAAGGTGATGATGGCATGGATAAAAGCGTCGTGGTGGCATTACAGGGA
CAAATGACTGCGATGAACAATTTACTCAAATCAATGGCAATATCGCAAGTCAACGCCGTAGGAAGCTCTGTGCACGCGGCTAACCAAATTGATGACATGGGATGCGTGGG
ATGCGGCGGTCATCATAACACTGACGCATGCCCCCTTAATACTGAAACCGTCGTGTTCGTAAGGAACGACCCCTTCTCCAATATTTACAACCCTGGCTGGAGGAACCATC
TCAACTTTGGATGGGGAGGATCGGGTCAACAACAAGGACGACATGGTGGTCAAGGTGACCATTGCGGGGAAGCATCTGGCTCCCACGCGAGGTACCAAAACAATAGACCC
CAACACTCCCATCATCAACATCAGCCCACCACCACCGCCTCGTCCACCTCTCCCATGGAAAGTCTCCTCCGCGAATACATGCAGAAAAATGATGCTCTTCTGCAAAGCCA
AGCTTCATCAATTCGTAATCTGGAGGTACAGTTAGGTCAGCTCGCTAGTGATTTCTCCGAAAGACAGCAAGGTTCCCTCCCAAGCAATACAGAAACGCCAAATCAGGCGG
GAGGATCTGGTAAAGAGAAGTGTCACGCGGTGACACTACGCAGCGGAAGAAATTTAACCATCCGCGACCTTGATGCTGAATGTAGCTATCCCAATTCTAACTCTACTGCC
GAGATTGGCAGTACAAGTAAAATTCCTAATCTTGTGAATTTCCCTTCAACTGATAATGTTTCTTCCTTGCAGAATAATGGCGCACCAAGCAAGGGTTTCGAGAGCAAGGG
GCAACGGAATCCACAAAACGAAACAACGCCCCACATCACGGGAAGTGGCGAACAAAGAACGCCACAATACACCTACTTAAAACACGGTAAGGGAAAGGTGTTACTGAAAC
CCCCGCTGGAAGCGTTTGACAATTTTTTGGAGGTAGAGGTTGATATCCAGGATACAGAGGTGGTGCTCAAATTTCTTAACGAACGAGCAAAGAAGAGGAAAGACGCTCAC
ATCAAAAGAACCAAAGAAGCTCGTCGCCAAAAGGACGAGCGTGAGCGAACAAGAGTTGATGCAATTCGAAAGGCAAAGAACACATTAGAAATACGCTCACCGCCTAACGA
GGTTGCAGAGCTTCACAAGAAGATCTCTGACAAGCTTGCACAAGTCTTGTTCGCTAAAACGAGGAAAACAATTGAGGTAGTTAAGGCTGCCTTAAAAAGGAAAGAAGAGA
AGAAAAAGATGTTCGCAGAACTGAGCGAGCAAGTGGCAGAGCTCCCCACGAAAGTAAGGACATTGGAGCTAGAGAAAAACCTCGAAGCAATCGCAGAAGAATTCGAGGAT
GAGCTGGAGGCGATGAGTCCACTTGATGACGGGCCACCGCCAAGAAAACCAAGGGAGGTCGTAGGACCATCAAAAGGAAGGAAGAAAGCTGGGTGTTCTGGACCTGAAGA
GCGCCCATCAGGCGACGACACCAGAAGGCACACCAAAATAAGATTAGGAGTGGTGGAAAAGTTTTACGCGGCTAAGCTCAACGCGGCAGAGTTTAGCATACAAATAAGTG
GAAAGACAATGAGTTTCAGCGCGGAGGCCATCAACGCGTTGTATGATTTGCCCAATGAAGTTGAAACCCCAAGGCAAATATACGTAGACAGTCCTACGAAGAGGATGGCC
CGTGAAGTGCTGGAAGTCATCGCATGGCCTGGGGCCGCATGGGAAGTAACGCCAACAGGGAAGTATCAGTTGTATCCACACCAGCTAACCACTGAAGCAAGCATGTGGTT
GTTCTTTATCAAGAAGAAGATCTTCCCAACGCACCATGATAGCACCATCAATTTAGAGTCAGCGATGCTACTCTATTGTATCCTGGCGAAGAAGCGTGTTAACCTTGGCG
AACTTATAGCCACATCCATTCTGTCATGGATGCGAGCTCCCAAAGGCGCGATGCCCTTCCTTTCAACCATTGAGGCCCATTGCCTTAAAGCTGTGTCATTCTTATCCGCC
ATCCAAACCATCTCAATACCAGGTGGGCTGTGTAATCAAATGGCCCTGAACCGCATGATTACTTTCCATGGACACAAGGAAATGGAAAGGCGGGCAAAAACATTAGGCGA
CACACTTAAAGGAATGGCCCAAGTAGAAAGAAAAAGGAAATCCCCAATCGTCGCATCAACCCCCCCACCTAAAGCCAAAAAAACAAAGGTTCTTGCGACGAAGCAGCCTC
CACTGAAATTTCTCCACTCCTCATCTCGCCCAATACAGTGA
mRNA sequence
ATGAGCTTCCAGCAGAAGGACAGAGAAAACCTACATGACGCGTGGAGTAGGTTCAAACAGATGGTCAAAGCATGCCCCCACAATGGCATTCCTAAATGCATATTGATGGA
GGTTTTCTATTTTGGACTAAACAAGGCTACACAGCAAACTGCTGATGCTGTGTTTGTAGACGGTATGCTGAAAAGTACATACAATCAGATTAAGACGACGCTGGACACGA
TGGCCAGCAACAATGAAGAATGGGATGAAGATGATTTCGGCAGTCACCGAGGAGGACGAGTAAAAGGTGATGATGGCATGGATAAAAGCGTCGTGGTGGCATTACAGGGA
CAAATGACTGCGATGAACAATTTACTCAAATCAATGGCAATATCGCAAGTCAACGCCGTAGGAAGCTCTGTGCACGCGGCTAACCAAATTGATGACATGGGATGCGTGGG
ATGCGGCGGTCATCATAACACTGACGCATGCCCCCTTAATACTGAAACCGTCGTGTTCGTAAGGAACGACCCCTTCTCCAATATTTACAACCCTGGCTGGAGGAACCATC
TCAACTTTGGATGGGGAGGATCGGGTCAACAACAAGGACGACATGGTGGTCAAGGTGACCATTGCGGGGAAGCATCTGGCTCCCACGCGAGGTACCAAAACAATAGACCC
CAACACTCCCATCATCAACATCAGCCCACCACCACCGCCTCGTCCACCTCTCCCATGGAAAGTCTCCTCCGCGAATACATGCAGAAAAATGATGCTCTTCTGCAAAGCCA
AGCTTCATCAATTCGTAATCTGGAGGTACAGTTAGGTCAGCTCGCTAGTGATTTCTCCGAAAGACAGCAAGGTTCCCTCCCAAGCAATACAGAAACGCCAAATCAGGCGG
GAGGATCTGGTAAAGAGAAGTGTCACGCGGTGACACTACGCAGCGGAAGAAATTTAACCATCCGCGACCTTGATGCTGAATGTAGCTATCCCAATTCTAACTCTACTGCC
GAGATTGGCAGTACAAGTAAAATTCCTAATCTTGTGAATTTCCCTTCAACTGATAATGTTTCTTCCTTGCAGAATAATGGCGCACCAAGCAAGGGTTTCGAGAGCAAGGG
GCAACGGAATCCACAAAACGAAACAACGCCCCACATCACGGGAAGTGGCGAACAAAGAACGCCACAATACACCTACTTAAAACACGGTAAGGGAAAGGTGTTACTGAAAC
CCCCGCTGGAAGCGTTTGACAATTTTTTGGAGGTAGAGGTTGATATCCAGGATACAGAGGTGGTGCTCAAATTTCTTAACGAACGAGCAAAGAAGAGGAAAGACGCTCAC
ATCAAAAGAACCAAAGAAGCTCGTCGCCAAAAGGACGAGCGTGAGCGAACAAGAGTTGATGCAATTCGAAAGGCAAAGAACACATTAGAAATACGCTCACCGCCTAACGA
GGTTGCAGAGCTTCACAAGAAGATCTCTGACAAGCTTGCACAAGTCTTGTTCGCTAAAACGAGGAAAACAATTGAGGTAGTTAAGGCTGCCTTAAAAAGGAAAGAAGAGA
AGAAAAAGATGTTCGCAGAACTGAGCGAGCAAGTGGCAGAGCTCCCCACGAAAGTAAGGACATTGGAGCTAGAGAAAAACCTCGAAGCAATCGCAGAAGAATTCGAGGAT
GAGCTGGAGGCGATGAGTCCACTTGATGACGGGCCACCGCCAAGAAAACCAAGGGAGGTCGTAGGACCATCAAAAGGAAGGAAGAAAGCTGGGTGTTCTGGACCTGAAGA
GCGCCCATCAGGCGACGACACCAGAAGGCACACCAAAATAAGATTAGGAGTGGTGGAAAAGTTTTACGCGGCTAAGCTCAACGCGGCAGAGTTTAGCATACAAATAAGTG
GAAAGACAATGAGTTTCAGCGCGGAGGCCATCAACGCGTTGTATGATTTGCCCAATGAAGTTGAAACCCCAAGGCAAATATACGTAGACAGTCCTACGAAGAGGATGGCC
CGTGAAGTGCTGGAAGTCATCGCATGGCCTGGGGCCGCATGGGAAGTAACGCCAACAGGGAAGTATCAGTTGTATCCACACCAGCTAACCACTGAAGCAAGCATGTGGTT
GTTCTTTATCAAGAAGAAGATCTTCCCAACGCACCATGATAGCACCATCAATTTAGAGTCAGCGATGCTACTCTATTGTATCCTGGCGAAGAAGCGTGTTAACCTTGGCG
AACTTATAGCCACATCCATTCTGTCATGGATGCGAGCTCCCAAAGGCGCGATGCCCTTCCTTTCAACCATTGAGGCCCATTGCCTTAAAGCTGTGTCATTCTTATCCGCC
ATCCAAACCATCTCAATACCAGGTGGGCTGTGTAATCAAATGGCCCTGAACCGCATGATTACTTTCCATGGACACAAGGAAATGGAAAGGCGGGCAAAAACATTAGGCGA
CACACTTAAAGGAATGGCCCAAGTAGAAAGAAAAAGGAAATCCCCAATCGTCGCATCAACCCCCCCACCTAAAGCCAAAAAAACAAAGGTTCTTGCGACGAAGCAGCCTC
CACTGAAATTTCTCCACTCCTCATCTCGCCCAATACAGTGA
Protein sequence
MSFQQKDRENLHDAWSRFKQMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVFVDGMLKSTYNQIKTTLDTMASNNEEWDEDDFGSHRGGRVKGDDGMDKSVVVALQG
QMTAMNNLLKSMAISQVNAVGSSVHAANQIDDMGCVGCGGHHNTDACPLNTETVVFVRNDPFSNIYNPGWRNHLNFGWGGSGQQQGRHGGQGDHCGEASGSHARYQNNRP
QHSHHQHQPTTTASSTSPMESLLREYMQKNDALLQSQASSIRNLEVQLGQLASDFSERQQGSLPSNTETPNQAGGSGKEKCHAVTLRSGRNLTIRDLDAECSYPNSNSTA
EIGSTSKIPNLVNFPSTDNVSSLQNNGAPSKGFESKGQRNPQNETTPHITGSGEQRTPQYTYLKHGKGKVLLKPPLEAFDNFLEVEVDIQDTEVVLKFLNERAKKRKDAH
IKRTKEARRQKDERERTRVDAIRKAKNTLEIRSPPNEVAELHKKISDKLAQVLFAKTRKTIEVVKAALKRKEEKKKMFAELSEQVAELPTKVRTLELEKNLEAIAEEFED
ELEAMSPLDDGPPPRKPREVVGPSKGRKKAGCSGPEERPSGDDTRRHTKIRLGVVEKFYAAKLNAAEFSIQISGKTMSFSAEAINALYDLPNEVETPRQIYVDSPTKRMA
REVLEVIAWPGAAWEVTPTGKYQLYPHQLTTEASMWLFFIKKKIFPTHHDSTINLESAMLLYCILAKKRVNLGELIATSILSWMRAPKGAMPFLSTIEAHCLKAVSFLSA
IQTISIPGGLCNQMALNRMITFHGHKEMERRAKTLGDTLKGMAQVERKRKSPIVASTPPPKAKKTKVLATKQPPLKFLHSSSRPIQ
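
The CDS and protein sequences listed in this record can be cross-checked against each other by translating the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code applies to this gene. The names `translate` and `CODON_TABLE` are hypothetical helpers introduced for illustration and are not part of CuGenDB.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (assumption: the standard nuclear code applies to this gene).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS string into a protein string.

    Whitespace and line breaks are removed first, so the wrapped
    sequence can be pasted in as-is. A single trailing stop codon
    is dropped. Codons with characters other than A, C, G, T raise
    KeyError.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = "".join(
        CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3)
    )
    return protein[:-1] if protein.endswith("*") else protein


# First 30 nucleotides of the CDS listed in this record.
print(translate("ATGAGCTTCCAGCAGAAGGACAGAGAAAAC"))  # MSFQQKDREN
```

Passing the full CDS text to `translate` and comparing the result with the protein sequence, after removing line breaks from both, would flag any mismatch between the two fields.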