; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0023840 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0023840
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr07:12619138..12620252
RNA-Seq ExpressionPI0023840
SyntenyPI0023840
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_030497803.1 uncharacterized protein LOC115713460 [Cannabis sativa]4.1e-5943.28Show/hide
Query:  MKKFFPPHENARRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNGIPKCIMIEVFYFGLNKATQQTADAVFVDGMLKSTYNQIKTMLDTMASNNEEWDE
        ++K+FPP  NA+ R E+MSFQQ + E   DAW RFK +++ CPH+GIP CI +E FY GLN A++   DA     +L  +YN+   +L+ +ASNN +W  
Subjt:  MKKFFPPHENARRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNGIPKCIMIEVFYFGLNKATQQTADAVFVDGMLKSTYNQIKTMLDTMASNNEEWDE

Query:  DDFGNRRGGRAKNDG-MDKNAVVALQGQMTAMNNLLKSIAISQVNVAGS-SVLAANQIDDMGCVGCGGHHNTDACPLNTETVAFV-------RNDPFSNT
            NR     K  G ++ +A+ AL  QM +M N+LK+     +N+ GS    AA Q     CV CG  H  + CP N  +V +V        N+P+SN+
Subjt:  DDFGNRRGGRAKNDG-MDKNAVVALQGQMTAMNNLLKSIAISQVNVAGS-SVLAANQIDDMGCVGCGGHHNTDACPLNTETVAFV-------RNDPFSNT

Query:  YNPGWRNHPNFGWGGSGQQQGRHGGQGDHRGEAPGSHARYQNNRPQQSHHQQHPTTTASSTSPMENLLHEYMQKNDALLQSQASSIRNLEVQLGQLASDF
        YNP W++HPNF WGG G+Q    G                Q  RPQQ H  Q      S TS +E+L+ +YM KND ++QSQA+S+RNLEVQLGQLA+D 
Subjt:  YNPGWRNHPNFGWGGSGQQQGRHGGQGDHRGEAPGSHARYQNNRPQQSHHQQHPTTTASSTSPMENLLHEYMQKNDALLQSQASSIRNLEVQLGQLASDF

Query:  SERQQGSLWSNIETPNQAGGSGKEKCHAVTLRSGR
          R QG+L S+ E P +    GKE C AVTLRSG+
Subjt:  SERQQGSLWSNIETPNQAGGSGKEKCHAVTLRSGR

XP_030507648.1 uncharacterized protein LOC115722545 [Cannabis sativa]5.1e-5742.6Show/hide
Query:  MKKFFPPHENARRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNGIPKCIMIEVFYFGLNKATQQTADAVFVDGMLKSTYNQIKTMLDTMASNNEEWDE
        ++K+FPP  NA+ R E+MSFQQ + E   DAW RFK +++ CPH+GIP CI +E FY GLN  T+   DA     +L  +YN+   +L+ +ASNN +W  
Subjt:  MKKFFPPHENARRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNGIPKCIMIEVFYFGLNKATQQTADAVFVDGMLKSTYNQIKTMLDTMASNNEEWDE

Query:  DDFGNRRGGRAKNDG-MDKNAVVALQGQMTAMNNLLKSIAISQVNVAGS-SVLAANQIDDMGCVGCGGHHNTDACPLNTETVAFV-------RNDPFSNT
            NR     K  G ++ +A+ AL  QM +M N+LK+     +N+ GS    AA Q  ++ CV CG  H  + CP N  +V +V        N+P+SN+
Subjt:  DDFGNRRGGRAKNDG-MDKNAVVALQGQMTAMNNLLKSIAISQVNVAGS-SVLAANQIDDMGCVGCGGHHNTDACPLNTETVAFV-------RNDPFSNT

Query:  YNPGWRNHPNFGWGGSG-QQQGRHGGQGDHRGEAPGSHARYQNNRPQQSHHQQHPTTTASSTSPMENLLHEYMQKNDALLQSQASSIRNLEVQLGQLASD
        YNP W++HPNF WGG G    G    QG  +   PG   + ++++PQ            S TS +E+L+ +YM KNDA++QSQA+S+RNLEVQLGQLA+D
Subjt:  YNPGWRNHPNFGWGGSG-QQQGRHGGQGDHRGEAPGSHARYQNNRPQQSHHQQHPTTTASSTSPMENLLHEYMQKNDALLQSQASSIRNLEVQLGQLASD

Query:  FSERQQGSLWSNIETPNQAGGSGKEKCHAVTLRSGRNL
           R QG+L S+ E P +    GKE C A+TLRSG+ L
Subjt:  FSERQQGSLWSNIETPNQAGGSGKEKCHAVTLRSGRNL

XP_030509259.1 uncharacterized protein LOC115723937 [Cannabis sativa]1.1e-5944.35Show/hide
Query:  MKKFFPPHENARRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNGIPKCIMIEVFYFGLNKATQQTADAVFVDGMLKSTYNQIKTMLDTMASNNEEWDE
        ++K+FPP  NA+ R E+MSFQQ + E   DAW RFK +++ CPH+GIP CI +E FY GLN A++   DA     +L  +YN+   +L+ +ASNN +W  
Subjt:  MKKFFPPHENARRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNGIPKCIMIEVFYFGLNKATQQTADAVFVDGMLKSTYNQIKTMLDTMASNNEEWDE

Query:  DDFGNRRGGRAKNDG-MDKNAVVALQGQMTAMNNLLKSIAISQVNVAGS-SVLAANQIDDMGCVGCGGHHNTDACPLNTETVAFV-------RNDPFSNT
            NR     K  G ++ +A+ AL  QM +M N+LK+     +N+ GS    AA Q  ++ CV CG  H  + CP N  +V +V        N+P+SN+
Subjt:  DDFGNRRGGRAKNDG-MDKNAVVALQGQMTAMNNLLKSIAISQVNVAGS-SVLAANQIDDMGCVGCGGHHNTDACPLNTETVAFV-------RNDPFSNT

Query:  YNPGWRNHPNFGWGGSG-QQQGRHGGQGDHRGEAPGSHARYQNNRPQQSHHQQHPTTTASSTSPMENLLHEYMQKNDALLQSQASSIRNLEVQLGQLASD
        YNP W++HPNF WGG G    G    QG  +   PG     Q  RPQQ H  Q      S TS +E+L+ +YM KNDA++QSQA+S+RNLEVQLGQLA+D
Subjt:  YNPGWRNHPNFGWGGSG-QQQGRHGGQGDHRGEAPGSHARYQNNRPQQSHHQQHPTTTASSTSPMENLLHEYMQKNDALLQSQASSIRNLEVQLGQLASD

Query:  FSERQQGSLWSNIETPNQAGGSGKEKCHAVTLRSGR
           R QG+L S+ E P +    GKE C AVTLRSG+
Subjt:  FSERQQGSLWSNIETPNQAGGSGKEKCHAVTLRSGR

XP_030510138.1 uncharacterized protein LOC115724905 [Cannabis sativa]6.0e-5842.99Show/hide
Query:  MKKFFPPHENARRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNGIPKCIMIEVFYFGLNKATQQTADAVFVDGMLKSTYNQIKTMLDTMASNNEEWDE
        ++K+FPP  NA+ R E+MSFQQ + E   DAW RFK +++ CPH+GIP CI +E FY GLN A++   DA     +L  +YN+   +L+ +ASNN +W  
Subjt:  MKKFFPPHENARRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNGIPKCIMIEVFYFGLNKATQQTADAVFVDGMLKSTYNQIKTMLDTMASNNEEWDE

Query:  DDFGNRRGGRAKNDG-MDKNAVVALQGQMTAMNNLLKSIAISQVNVAGS-SVLAANQIDDMGCVGCGGHHNTDACPLNTETVAFV-------RNDPFSNT
            NR     K  G ++ +A+ AL  QM +M N+LK+     +N+ GS    AA Q  ++ CV CG  H  + CP N  +V +V        N+P+SN+
Subjt:  DDFGNRRGGRAKNDG-MDKNAVVALQGQMTAMNNLLKSIAISQVNVAGS-SVLAANQIDDMGCVGCGGHHNTDACPLNTETVAFV-------RNDPFSNT

Query:  YNPGWRNHPNFGWGGSGQQQGRHGGQGDHRGEAPGSHARYQNNRPQQSHHQQHPTTTASSTSPMENLLHEYMQKNDALLQSQASSIRNLEVQLGQLASDF
        YNP W++HPNF WGG G      G QG  +   P   +       QQ H  Q      S TS +E+L+ +YM KNDA++QSQA+S+RNLEVQLGQLA+D 
Subjt:  YNPGWRNHPNFGWGGSGQQQGRHGGQGDHRGEAPGSHARYQNNRPQQSHHQQHPTTTASSTSPMENLLHEYMQKNDALLQSQASSIRNLEVQLGQLASDF

Query:  SERQQGSLWSNIETPNQAGGSGKEKCHAVTLRSGR
          R QG+L S+ E P +     KE C AVTLRSG+
Subjt:  SERQQGSLWSNIETPNQAGGSGKEKCHAVTLRSGR

XP_038902511.1 uncharacterized protein LOC120089170 [Benincasa hispida]5.6e-5639.76Show/hide
Query:  MKKFFPPHENARRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNGIPKCIMIEVFYFGLNKATQQTADAVFVDGMLKSTYNQIKTMLDTMASNNEEWDE
        MKKFFPP  NARR+ ++++F+  + E L  AW RF+R+VK CPH  I  C+++E FY GL ++ Q  A+A   +G +   Y + K +LD +  N ++W +
Subjt:  MKKFFPPHENARRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNGIPKCIMIEVFYFGLNKATQQTADAVFVDGMLKSTYNQIKTMLDTMASNNEEWDE

Query:  DDFGNRRGGRAKNDG--MDKNAVVALQGQMTAMNNLLKSIAIS----QVNVAGSSVLAANQIDDMGCVGCGGHHNTDACPLNTETVAFVRNDPFSNTYNP
        + +G R   R K +   +  + +  L  QM  + +LL+ +AI+        A  + LA  Q+  + C  CG  H+ + CP N + V  ++N+P++NTYNP
Subjt:  DDFGNRRGGRAKNDG--MDKNAVVALQGQMTAMNNLLKSIAIS----QVNVAGSSVLAANQIDDMGCVGCGGHHNTDACPLNTETVAFVRNDPFSNTYNP

Query:  GWRNHPNFGWGGSGQQQGRHGGQ--GDHRGEAPGSH--ARYQNNRPQQSHHQQHPTTTASSTSPMENLLHEYMQKNDALLQSQASSIRNLEVQLGQLASD
        GWRNHPNF WGG+  Q G+   Q   ++RG  P  H      +++ +QSH+Q   +  ++++S +E LL +Y++KNDA++QSQASSIRNLEVQ+GQLA++
Subjt:  GWRNHPNFGWGGSGQQQGRHGGQ--GDHRGEAPGSH--ARYQNNRPQQSHHQQHPTTTASSTSPMENLLHEYMQKNDALLQSQASSIRNLEVQLGQLASD

Query:  FSERQQGSLWSNIETPNQAGGSGKEKC
           R  G + SN E P   G +GKE+C
Subjt:  FSERQQGSLWSNIETPNQAGGSGKEKC

TrEMBL top hitse value%identityAlignment
A0A5B6VWJ0 Retroelement pol polyprotein-like2.0e-4333.61Show/hide
Query:  KFFPPHENARRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNGIPKCIMIEVFYFGLNKATQQTADAVFVDGMLKSTYNQIKTMLDTMASNNEEWDEDD
        K+F P +NA+ R E+ +F   D E+L++AW RFK +++ CPH+GIP CI +E FY GL   T+   DA     +L  +YN+   +++ +ASNN +W    
Subjt:  KFFPPHENARRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNGIPKCIMIEVFYFGLNKATQQTADAVFVDGMLKSTYNQIKTMLDTMASNNEEWDEDD

Query:  FGNRRGGRAKNDGMDKNAVVALQGQMTAMNNLLKSIAISQVNVAGSSVLAA---NQIDDMGCVGCGGHHNTDACPLNTETVAFVRNDP--------FSNT
              GR      + +A+ +L  Q+++++++ K++  +     GS+  AA   NQ +++  V CG  H  + CP N E+V ++ N           SN 
Subjt:  FGNRRGGRAKNDGMDKNAVVALQGQMTAMNNLLKSIAISQVNVAGSSVLAA---NQIDDMGCVGCGGHHNTDACPLNTETVAFVRNDP--------FSNT

Query:  YNPGWRNHPNFGWGGSGQQQGRHGGQGDHRGEAPGSHARYQNNRPQQSHH---QQHPTTTASSTSPMENLLHEYMQKNDALLQSQASSIRNLEVQLGQLA
        YN  WRNH +F W   G                 G+   Y   RP Q  +   Q      A +++ +E+LL  YM KNDAL+QSQA++++NLE Q+GQLA
Subjt:  YNPGWRNHPNFGWGGSGQQQGRHGGQGDHRGEAPGSHARYQNNRPQQSHH---QQHPTTTASSTSPMENLLHEYMQKNDALLQSQASSIRNLEVQLGQLA

Query:  SDFSERQQGSLWSNIETPNQAGGSGKEKCHAVTLRSGRNLTIHDPDAERSYPNSNSTAEIGSS
        ++   R QG+L S+ E P      GKE C A+TLRS + +  +  + E+   N+    E+  S
Subjt:  SDFSERQQGSLWSNIETPNQAGGSGKEKCHAVTLRSGRNLTIHDPDAERSYPNSNSTAEIGSS

A0A5D3CC26 Uncharacterized protein1.5e-5443.96Show/hide
Query:  MSFQQKDRENLHDAWSRFKRMVKACPHNGIPKCIMIEVFYFGLNKATQQTADAVFVDGMLKSTYNQIKTMLDTMASNNEEWDEDDFGNR------RGGRA
        M+F+Q+DRENL D W RFKRM+K CPH+ IP+C+++E FYFGL+K T Q+A+ VF  GML+S+YNQIK MLDTMASN++EW ++ FG+R      +G R 
Subjt:  MSFQQKDRENLHDAWSRFKRMVKACPHNGIPKCIMIEVFYFGLNKATQQTADAVFVDGMLKSTYNQIKTMLDTMASNNEEWDEDDFGNR------RGGRA

Query:  K-NDGMDKNAVVALQGQMTAMNNLLKSIAISQVNVAGSSVLAANQIDDMGCVGCGGHHNTDACPLNTETVAFVRNDPFSNTYNPGWRNHPNFGWGGSGQQ
        +  DG+D + +VALQGQ+  M N+L+S+A+ QVNV  SSV    Q+++MGCVGC   HNT+ACPLNTE VA+++NDP             +  WGG   Q
Subjt:  K-NDGMDKNAVVALQGQMTAMNNLLKSIAISQVNVAGSSVLAANQIDDMGCVGCGGHHNTDACPLNTETVAFVRNDPFSNTYNPGWRNHPNFGWGGSGQQ

Query:  QGRHGGQGDHRGEAPGSHARYQNNRPQQSHHQQHPTTTASSTSPMENLLHEYMQKNDALLQSQASSIRNLEVQLGQLASDFSERQQGSLWSNIETPNQ
                                                                   + SQASSI+N+E+QLGQL SDFS R + S  SN ETPNQ
Subjt:  QGRHGGQGDHRGEAPGSHARYQNNRPQQSHHQQHPTTTASSTSPMENLLHEYMQKNDALLQSQASSIRNLEVQLGQLASDFSERQQGSLWSNIETPNQ

A0A6J0ZX64 LOW QUALITY PROTEIN: uncharacterized protein LOC1104129459.0e-4436.23Show/hide
Query:  MKKFFPPHENARRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNGIPKCIMIEVFYFGLNKATQQTADAVFVDGMLKSTYNQIKTMLDTMASNNEEWDE
        + KFFPP + A+ R ++ SF Q D E+L++AW RFK +++ CPH+GIP  + ++ FY GL  + +   DA     ++         +L+ MASNN +W  
Subjt:  MKKFFPPHENARRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNGIPKCIMIEVFYFGLNKATQQTADAVFVDGMLKSTYNQIKTMLDTMASNNEEWDE

Query:  DDFGNRRGGRAKNDGMDKNAVVALQGQMTAMNNLLKSIAISQVNVAGSSVLAANQIDDMGCVGCGGHHNTDACPLNTETVAFV------RNDPFSNTYNP
        +    R G R      + +A+  L  Q+ A++  L ++ +  V           Q   + C  CG  H+ D CP N+E+V FV      +N+P+SNTYNP
Subjt:  DDFGNRRGGRAKNDGMDKNAVVALQGQMTAMNNLLKSIAISQVNVAGSSVLAANQIDDMGCVGCGGHHNTDACPLNTETVAFV------RNDPFSNTYNP

Query:  GWRNHPNFGWGGSGQQQGRHGGQGDHRGEAPGSHARYQNNRPQQSHHQQHPTTTASSTSPMENLLHEYMQKNDALLQSQASSIRNLEVQLGQLASDFSER
        GWRNHPNF W  +       G         PG     Q  RPQ               S +E LL +Y+ K DA++QSQ +S+RNLE Q+GQLA+  + R
Subjt:  GWRNHPNFGWGGSGQQQGRHGGQGDHRGEAPGSHARYQNNRPQQSHHQQHPTTTASSTSPMENLLHEYMQKNDALLQSQASSIRNLEVQLGQLASDFSER

Query:  QQGSLWSNIETPNQAGGSGKEKCHAVTLRSGRNL
         QGSL S+     Q    GKE+C A+TLRSG+ +
Subjt:  QQGSLWSNIETPNQAGGSGKEKCHAVTLRSGRNL

A0A6J1EQ90 uncharacterized protein LOC1114364118.2e-4540.33Show/hide
Query:  KFFPPHENARRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNGIPKCIMIEVFYFGLNKATQQTADAVFVDGMLKSTYNQIKTMLDTMASNNEEWDEDD
        K+FPP  NAR + E+++FQQ + E L +A  RFK M++ CPH+G+P CI +E FY GLN  T+Q  DA     +L  TYN+   +L+ +ASNN +W +  
Subjt:  KFFPPHENARRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNGIPKCIMIEVFYFGLNKATQQTADAVFVDGMLKSTYNQIKTMLDTMASNNEEWDEDD

Query:  FGNRRGGRAKNDGMDKNAVVALQGQMTAMNNLLKSIAISQVNVAGSSV---LAANQIDDMGCVGCGGHHNTDACPLNTETVAFV---------RNDPFSN
              GR     ++ +A+ ++  Q+ ++ N+L+++A+ Q ++  + V    A NQ     CV CG  H  D CP N  ++ +V         +N+PFSN
Subjt:  FGNRRGGRAKNDGMDKNAVVALQGQMTAMNNLLKSIAISQVNVAGSSV---LAANQIDDMGCVGCGGHHNTDACPLNTETVAFV---------RNDPFSN

Query:  TYNPGWRNHPNFGWGGSGQQQGRHGGQGDHRGEAPGSHARYQNN---RPQQSHHQQHPTTTASSTS--PMENLLHEYMQKNDALLQSQASSIRNLEVQLG
        TYNPGWRNHPNF W G    Q  +  Q   +   P S  R QN      QQ + Q   TT A  TS   +E+L+ EYM KNDA++QSQ +S+RNLEVQ+G
Subjt:  TYNPGWRNHPNFGWGGSGQQQGRHGGQGDHRGEAPGSHARYQNN---RPQQSHHQQHPTTTASSTS--PMENLLHEYMQKNDALLQSQASSIRNLEVQLG

A0A6J1G7Q6 uncharacterized protein LOC1114515986.3e-4537.2Show/hide
Query:  KFFPPHENARRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNGIPKCIMIEVFYFGLNKATQQTADAVFVDGMLKSTYNQIKTMLDTMASNNEEWDEDD
        K+FPP  +AR R E+++FQ+ + E L +AW RFK  ++ CPH+G+P CI IE FY GLN AT+Q  DA     +L  TYN+   +L+ +ASNN +W +  
Subjt:  KFFPPHENARRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNGIPKCIMIEVFYFGLNKATQQTADAVFVDGMLKSTYNQIKTMLDTMASNNEEWDEDD

Query:  FGNRRGGRAKNDGMDKNAVVALQGQMTAMNNLLKSIAISQ---VNVAGSSVLAANQIDDMGCVGCGGHHNTDACPLNTETVAFVRN---------DPFSN
              G+   + ++ +A+ ++  Q+ +M N+L+++A  Q   +     +     Q     CV CG  H  D CP N  ++ +V N         +P SN
Subjt:  FGNRRGGRAKNDGMDKNAVVALQGQMTAMNNLLKSIAISQ---VNVAGSSVLAANQIDDMGCVGCGGHHNTDACPLNTETVAFVRN---------DPFSN

Query:  TYNPGWRNHPNFGWGGSGQQQGRHGGQGDHRGEAPGSHARYQNNRPQQSHHQQHPTTTASSTSP--------MENLLHEYMQKNDALLQSQASSIRNLEV
        TYNPGWRNHPNF   G    QG +  Q   +   P      QN   Q ++  Q  TT    TS         +E+L+ EYM +NDA++QSQ  S+RNLEV
Subjt:  TYNPGWRNHPNFGWGGSGQQQGRHGGQGDHRGEAPGSHARYQNNRPQQSHHQQHPTTTASSTSP--------MENLLHEYMQKNDALLQSQASSIRNLEV

Query:  QLGQLASDFSERQQGSLWSNIETPNQAG
        Q+GQLA++   R  G L ++ E P + G
Subjt:  QLGQLASDFSERQQGSLWSNIETPNQAG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGAAATTTTTCCCACCTCACGAAAATGCTAGAAGGAGGAAGGAACTTATGAGCTTCCAGCAGAAGGATAGAGAAAACCTACATGACGCGTGGAGTAGGTTCAAACG
GATGGTCAAAGCATGCCCCCACAATGGCATTCCTAAATGCATAATGATAGAGGTTTTCTACTTTGGACTAAACAAGGCCACACAGCAGACTGCTGATGCTGTGTTTGTAG
ACGGTATGCTGAAAAGTACATACAACCAGATTAAGACGATGCTGGACACGATGGCCAGCAATAATGAAGAGTGGGATGAAGATGATTTCGGCAATCGCCGAGGAGGACGA
GCAAAAAATGATGGCATGGATAAGAACGCCGTGGTGGCATTGCAGGGACAAATGACTGCGATGAACAATTTACTTAAATCAATAGCAATATCGCAAGTCAATGTCGCAGG
AAGCTCTGTGCTCGCGGCTAACCAAATTGATGACATGGGATGTGTGGGATGCGGCGGTCATCATAACACTGACGCATGCCCACTCAATACTGAAACCGTCGCGTTCGTAA
GGAATGACCCCTTCTCCAATACCTACAACCCTGGTTGGAGGAACCATCCCAACTTTGGATGGGGAGGATCGGGTCAACAACAAGGGCGACATGGTGGTCAAGGTGACCAT
CGCGGGGAAGCACCTGGCTCCCACGCGAGGTACCAAAACAATAGACCCCAACAATCCCATCATCAACAGCATCCCACCACCACCGCCTCGTCCACCTCTCCCATGGAAAA
CCTCCTCCACGAATACATGCAGAAAAATGATGCTCTTCTGCAAAGCCAAGCTTCATCAATTCGTAATCTGGAGGTACAGTTAGGTCAGCTCGCTAGTGATTTCTCCGAAA
GACAGCAAGGATCCCTCTGGAGCAATATAGAAACGCCAAATCAGGCGGGAGGATCTGGTAAAGAGAAGTGTCACGCGGTGACACTACGCAGTGGAAGAAATTTAACCATC
CACGACCCTGATGCTGAACGTAGCTACCCCAATTCTAACTCTACTGCCGAGATTGGCAGTTCAAAATAA
mRNA sequenceShow/hide mRNA sequence
ATGAAGAAATTTTTCCCACCTCACGAAAATGCTAGAAGGAGGAAGGAACTTATGAGCTTCCAGCAGAAGGATAGAGAAAACCTACATGACGCGTGGAGTAGGTTCAAACG
GATGGTCAAAGCATGCCCCCACAATGGCATTCCTAAATGCATAATGATAGAGGTTTTCTACTTTGGACTAAACAAGGCCACACAGCAGACTGCTGATGCTGTGTTTGTAG
ACGGTATGCTGAAAAGTACATACAACCAGATTAAGACGATGCTGGACACGATGGCCAGCAATAATGAAGAGTGGGATGAAGATGATTTCGGCAATCGCCGAGGAGGACGA
GCAAAAAATGATGGCATGGATAAGAACGCCGTGGTGGCATTGCAGGGACAAATGACTGCGATGAACAATTTACTTAAATCAATAGCAATATCGCAAGTCAATGTCGCAGG
AAGCTCTGTGCTCGCGGCTAACCAAATTGATGACATGGGATGTGTGGGATGCGGCGGTCATCATAACACTGACGCATGCCCACTCAATACTGAAACCGTCGCGTTCGTAA
GGAATGACCCCTTCTCCAATACCTACAACCCTGGTTGGAGGAACCATCCCAACTTTGGATGGGGAGGATCGGGTCAACAACAAGGGCGACATGGTGGTCAAGGTGACCAT
CGCGGGGAAGCACCTGGCTCCCACGCGAGGTACCAAAACAATAGACCCCAACAATCCCATCATCAACAGCATCCCACCACCACCGCCTCGTCCACCTCTCCCATGGAAAA
CCTCCTCCACGAATACATGCAGAAAAATGATGCTCTTCTGCAAAGCCAAGCTTCATCAATTCGTAATCTGGAGGTACAGTTAGGTCAGCTCGCTAGTGATTTCTCCGAAA
GACAGCAAGGATCCCTCTGGAGCAATATAGAAACGCCAAATCAGGCGGGAGGATCTGGTAAAGAGAAGTGTCACGCGGTGACACTACGCAGTGGAAGAAATTTAACCATC
CACGACCCTGATGCTGAACGTAGCTACCCCAATTCTAACTCTACTGCCGAGATTGGCAGTTCAAAATAA
Protein sequenceShow/hide protein sequence
MKKFFPPHENARRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNGIPKCIMIEVFYFGLNKATQQTADAVFVDGMLKSTYNQIKTMLDTMASNNEEWDEDDFGNRRGGR
AKNDGMDKNAVVALQGQMTAMNNLLKSIAISQVNVAGSSVLAANQIDDMGCVGCGGHHNTDACPLNTETVAFVRNDPFSNTYNPGWRNHPNFGWGGSGQQQGRHGGQGDH
RGEAPGSHARYQNNRPQQSHHQQHPTTTASSTSPMENLLHEYMQKNDALLQSQASSIRNLEVQLGQLASDFSERQQGSLWSNIETPNQAGGSGKEKCHAVTLRSGRNLTI
HDPDAERSYPNSNSTAEIGSSK