CuGenDBv2

PI0001962 (gene) of Melon (PI 482460) v1 genome

Gene ID: PI0001962
Organism: Cucumis metuliferus PI 482460 (Melon (PI 482460) v1)
Description: Retrotrans_gag domain-containing protein
Genome location: chr03:11023733..11024824
RNA-Seq Expression: PI0001962
Synteny: PI0001962
Gene Ontology terms: NA
InterPro domains: IPR005162 - Retrotransposon gag domain
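The genome location field can be split into chromosome, start, and end for downstream use. The following is a minimal sketch, assuming the `chrom:start..end` layout seen above and 1-based inclusive coordinates (the page does not state the coordinate convention); the function names `parse_location` and `span_length` are hypothetical and not part of any CuGenDB tool.

```python
import re


def parse_location(location: str) -> tuple:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(location: str) -> int:
    """Span in bp, ASSUMING 1-based inclusive coordinates."""
    _, start, end = parse_location(location)
    return abs(end - start) + 1


print(parse_location("chr03:11023733..11024824"))
print(span_length("chr03:11023733..11024824"))  # 1092 under the inclusive assumption
```

If the coordinates were instead 0-based or half-open, the span would differ by one base, so the convention is worth confirming before relying on the result.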


Homology
GenBank top hits (e value, %identity, alignment)
XP_017216983.1 PREDICTED: uncharacterized protein LOC108194534 [Daucus carota subsp. sativus] (e value: 2.7e-61; %identity: 40.66)
Query:  PNAAYIAHDLDRPIR-YAAPNLYNFNLGITYPVFGENIRFEIKPVMLQMIQNVGQFGGHLGEDPHEHIRSFYSICASFHMPGISPEELRFALFRLTLRDE
        P  A+I  D DR IR YAAP     N GI  P   +  +FE+KPVM QM+Q +GQF G   EDPH H+R F  I  SF   G++ + LR  LF   +RD 
Subjt:  PNAAYIAHDLDRPIR-YAAPNLYNFNLGITYPVFGENIRFEIKPVMLQMIQNVGQFGGHLGEDPHEHIRSFYSICASFHMPGISPEELRFALFRLTLRDE

Query:  VKRWVNALEDGEVGTWDQLIEKFMKKFFPPHENARRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNCIPKCILMEVFYFGLNKAT-------------
         + W+N+L  G V  W+ L EKF+ K+FPP+ NA+ R E+ SFQQ+D E+L+DAW RFK +++ CPH+ I  CI ME FY GLN  T             
Subjt:  VKRWVNALEDGEVGTWDQLIEKFMKKFFPPHENARRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNCIPKCILMEVFYFGLNKAT-------------

Query:  -----QIKTTLDTMVSNNEEWDEDDFGSRRGGRAKGDDGM-DRSVVVALQGQMTAMNNLLKSMAISQVNAVGSSVHA-ANQIDDMGCVGCGGHHNTDACP
             Q    L+T+ +NN +W      S R    K   G+ D   + +++ Q+ +M ++LK++++    +   S+++  NQ  ++ CV CG  H  D+CP
Subjt:  -----QIKTTLDTMVSNNEEWDEDDFGSRRGGRAKGDDGM-DRSVVVALQGQMTAMNNLLKSMAISQVNAVGSSVHA-ANQIDDMGCVGCGGHHNTDACP

Query:  LNTETVAFVRN----DPFSNTYNPGWRNHPNF
         N E V ++RN    DP+SNTYN  WR HPNF
Subjt:  LNTETVAFVRN----DPFSNTYNPGWRNHPNF

XP_017233063.1 PREDICTED: uncharacterized protein LOC108207110 [Daucus carota subsp. sativus] (e value: 4.6e-61; %identity: 38.95)
Query:  MENNNRNAPPPQADPEPNAAYIAHDLDRPIR-YAAPNLYNFNLGITYPVFGENIRFEIKPVMLQMIQNVGQFGGHLGEDPHEHIRSFYSICASFHMPGIS
        M++N  N   P     P  A+I  D DR IR YAAP     N GI  P   +  +FE+KPVM QM+Q +GQF G   EDPH H+R F  I  SF   G+ 
Subjt:  MENNNRNAPPPQADPEPNAAYIAHDLDRPIR-YAAPNLYNFNLGITYPVFGENIRFEIKPVMLQMIQNVGQFGGHLGEDPHEHIRSFYSICASFHMPGIS

Query:  PEELRFALFRLTLRDEVKRWVNALEDGEVGTWDQLIEKFMKKFFPPHENARRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNCIPKCILMEVFYFGLN
         + LR  LF  ++RD  + W+N+L  G V TW+ L EKF+ K+FPP+ NA+ R E+ SFQQ+D E+L+DAW RFK +++ CPH+ I  CI ME FY GLN
Subjt:  PEELRFALFRLTLRDEVKRWVNALEDGEVGTWDQLIEKFMKKFFPPHENARRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNCIPKCILMEVFYFGLN

Query:  KAT------------------QIKTTLDTMVSNNEEWDEDDFGSRRGGRAKGDDGM-DRSVVVALQGQMTAMNNLLKSMAISQVNAVGSSVHA-ANQIDD
          T                  Q    L+T+ + N +W      S R    K   G+ D   + +++ Q+ +M ++LK++++    +   S+ +  NQ  +
Subjt:  KAT------------------QIKTTLDTMVSNNEEWDEDDFGSRRGGRAKGDDGM-DRSVVVALQGQMTAMNNLLKSMAISQVNAVGSSVHA-ANQIDD

Query:  MGCVGCGGHHNTDACPLNTETVAFVRND----PFSNTYNPGWRNHPNFGWGGSGQQQGRHGG
        + CV CG  H  D+CP N E+V ++ N     P+SNTYN  WR HPNF W   G   G   G
Subjt:  MGCVGCGGHHNTDACPLNTETVAFVRND----PFSNTYNPGWRNHPNFGWGGSGQQQGRHGG

XP_030497803.1 uncharacterized protein LOC115713460 [Cannabis sativa] (e value: 1.9e-62; %identity: 41.5)
Query:  ADPEPNAAYIAHDLDRPIR-YAAPNLYNFNLGITYPVFGENIRFEIKPVMLQMIQNVGQFGGHLGEDPHEHIRSFYSICASFHMPGISPEELRFALFRLT
        A  E N   +A D  R IR YAAP     N GI  P   +   FE+KPVM QM+Q VGQFGG   EDPH HIRSF  +  SF + G+S E LR  LF  +
Subjt:  ADPEPNAAYIAHDLDRPIR-YAAPNLYNFNLGITYPVFGENIRFEIKPVMLQMIQNVGQFGGHLGEDPHEHIRSFYSICASFHMPGISPEELRFALFRLT

Query:  LRDEVKRWVNALEDGEVGTWDQLIEKFMKKFFPPHENARRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNCIPKCILMEVFYFGLNKATQIK------
        LRD  + W+N L    V  W+ L EKF++K+FPP  NA+ R E+MSFQQ + E   DAW RFK +++ CPH+ IP CI +E FY GLN A+++       
Subjt:  LRDEVKRWVNALEDGEVGTWDQLIEKFMKKFFPPHENARRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNCIPKCILMEVFYFGLNKATQIK------

Query:  ------------TTLDTMVSNNEEWDEDDFGSRRGGRAKGDDGMDRSVVVALQGQMTAMNNLLKSMAISQVNAVGSSVH--AANQIDDMGCVGCGGHHNT
                      L+ + SNN +W  +   + R    K    ++   + AL  QM +M N+LK+M       +G SV   AA Q     CV CG  H  
Subjt:  ------------TTLDTMVSNNEEWDEDDFGSRRGGRAKGDDGMDRSVVVALQGQMTAMNNLLKSMAISQVNAVGSSVH--AANQIDDMGCVGCGGHHNT

Query:  DACPLNTETVAFV-------RNDPFSNTYNPGWRNHPNFGWGGSGQQ
        + CP N  +V +V        N+P+SN+YNP W++HPNF WGG G+Q
Subjt:  DACPLNTETVAFV-------RNDPFSNTYNPGWRNHPNFGWGGSGQQ

XP_038880527.1 uncharacterized protein LOC120072192 [Benincasa hispida] (e value: 3.9e-68; %identity: 42.23)
Query:  IAHDLDRPIR-YAAPNLYNFNLGITYPVFGENIRFEIKPVMLQMIQNVGQFGGHLGEDPHEHIRSFYSICASFHMPGISPEELRFALFRLTLRDEVKRWV
        +A++  RP+R YA+P LY+F+ GI YP+  +  RFE+K VMLQM+Q   QFGG  GEDPH H++ F   C  F +P I+PE++R +LF  +LRD+ K+WV
Subjt:  IAHDLDRPIR-YAAPNLYNFNLGITYPVFGENIRFEIKPVMLQMIQNVGQFGGHLGEDPHEHIRSFYSICASFHMPGISPEELRFALFRLTLRDEVKRWV

Query:  NALEDGEVGTWDQLIEKFMKKFFPPHENARRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNCIPKCILMEVFYFGLNKATQI----------------
        ++LE  E+ TW++L+EKFM+K+FPP  NARRR+E+M+F+Q+D E L  A  RF  +VK CP++ +   I ME FY GLN+A+QI                
Subjt:  NALEDGEVGTWDQLIEKFMKKFFPPHENARRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNCIPKCILMEVFYFGLNKATQI----------------

Query:  --KTTLDTMVSNNEEWDEDDFGSR--RGGRAKGDDGMDRSVVVALQGQMTAMNNLLKSMAISQVNAVGSSVHAANQIDDMG-----CVGCGGHHNTDACP
          K  L  +  +N EW +D +  R  R  R++  + +D + +  L  Q+  M +LL+++ +        +    NQ+   G     CVGCG  H+   CP
Subjt:  --KTTLDTMVSNNEEWDEDDFGSR--RGGRAKGDDGMDRSVVVALQGQMTAMNNLLKSMAISQVNAVGSSVHAANQIDDMG-----CVGCGGHHNTDACP

Query:  LNTETVAFVRNDPFSNTYNPGWRNHPNFGWGGSGQQQGRHG
         N ++V F++N+PFSNTYNPGW NHPNF W G  QQ+   G
Subjt:  LNTETVAFVRNDPFSNTYNPGWRNHPNFGWGGSGQQQGRHG

XP_038887458.1 uncharacterized protein LOC120077591 [Benincasa hispida] (e value: 6.4e-71; %identity: 43.14)
Query:  ENNNRNAPPPQAD---PEPNAAYIAHDLDRPIR-YAAPNLYNFNLGITYPVFGENIRFEIKPVMLQMIQNVGQFGGHLGEDPHEHIRSFYSICASFHMPG
        +NNN  AP        P  +  ++A D + PIR YAAPNLY+F+ GI+ P+  EN RFEIKPVM+QMIQN+ QF     E+PH H+  F  +C++F +PG
Subjt:  ENNNRNAPPPQAD---PEPNAAYIAHDLDRPIR-YAAPNLYNFNLGITYPVFGENIRFEIKPVMLQMIQNVGQFGGHLGEDPHEHIRSFYSICASFHMPG

Query:  ISPEELRFALFRLTLRDEVKRWVNALEDGEVGTWDQLIEKFMKKFFPPHENARRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNCIPKCILMEVFYFG
        I+P  +R  LF  TLRD+ KRW ++LE  E+ + DQL+E FMKKFFPP  N RRRK +++F++ D E L  AW RF+R+VK CPH  I  C+LME+FY G
Subjt:  ISPEELRFALFRLTLRDEVKRWVNALEDGEVGTWDQLIEKFMKKFFPPHENARRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNCIPKCILMEVFYFG

Query:  LNKATQ------------------IKTTLDTMVSNNEEWDEDDFGSRRGGRAKGDDG-MDRSVVVALQGQMTAMNNLLKSMAISQ-VNAVGSS-VHAANQ
        LN++TQ                   K  LD +  N ++W +D +  R   R + D+  +  + +  L  QM  + +LL+ MA++Q V++ GS+  +A  Q
Subjt:  LNKATQ------------------IKTTLDTMVSNNEEWDEDDFGSRRGGRAKGDDG-MDRSVVVALQGQMTAMNNLLKSMAISQ-VNAVGSS-VHAANQ

Query:  IDDMGCVGCGGHHNTDACPLNTETVAFVRNDPFSNTYNPGWRNHPNFGWG
        +  +  +  G  H  + CP N + V  ++N+P++NTYNP WRNHPNFGWG
Subjt:  IDDMGCVGCGGHHNTDACPLNTETVAFVRNDPFSNTYNPGWRNHPNFGWG

TrEMBL top hits (e value, %identity, alignment)
A0A6J1EEI2 uncharacterized protein LOC111433394 (e value: 6.4e-61; %identity: 38.78)
Query:  NAAYIAHDLDRPIR-YAAPNLYNFNLGITYPVFGENIRFEIKPVMLQMIQNVGQFGGHLGEDPHEHIRSFYSICASFHMPGISPEELRFALFRLTLRDEV
        NA ++A D +R IR YA P +   N  I  P   +   FE+KPVM QM+Q +GQF G   EDPH H++SF  +  SF    +  + +R +LF  +LRD  
Subjt:  NAAYIAHDLDRPIR-YAAPNLYNFNLGITYPVFGENIRFEIKPVMLQMIQNVGQFGGHLGEDPHEHIRSFYSICASFHMPGISPEELRFALFRLTLRDEV

Query:  KRWVNALEDGEVGTWDQLIEKFMKKFFPPHENARRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNCIPKCILMEVFYFGLNKAT--------------
        K W+N L  G + +W+ L+EKF+ K+FPP  NAR R E++ FQQ + + L +AW RFK M++ CPH+ +P CI ME FY GLN AT              
Subjt:  KRWVNALEDGEVGTWDQLIEKFMKKFFPPHENARRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNCIPKCILMEVFYFGLNKAT--------------

Query:  ----QIKTTLDTMVSNNEEWDEDDFGSRRGGRAKGDDGMDRSVVVALQGQMTAMNNLLKSMAISQVNAVGSSVHAA---NQIDDMGCVGCGGHHNTDACP
            +    L+ + SNN +W   D  S  G + +G   ++   + ++  Q+ ++ N+L+++A+ Q + + + VH     NQ     CV CG  H  D CP
Subjt:  ----QIKTTLDTMVSNNEEWDEDDFGSRRGGRAKGDDGMDRSVVVALQGQMTAMNNLLKSMAISQVNAVGSSVHAA---NQIDDMGCVGCGGHHNTDACP

Query:  LNTETVAFV---------RNDPFSNTYNPGWRNHPNFGWGGSG
         N  ++ +V         +N+PFSNTYNPGWRNHPNF W G G
Subjt:  LNTETVAFV---------RNDPFSNTYNPGWRNHPNFGWGGSG

A0A6J1EQ90 uncharacterized protein LOC111436411 (e value: 2.1e-56; %identity: 37.36)
Query:  NAAYIAHDLDRPIR-YAAPNLYNFNLGITYPVFGENIRFEIKPVMLQMIQNVGQFGGHLGEDPHEHIRSFYSI-------CASFHMPGISPEELRFALFR
        N  ++A D +R IR YA P +   N  I  P   +   FE+KPVM QM+Q +GQF G   EDPH H++SF  +         SF   G+  + +R +LF 
Subjt:  NAAYIAHDLDRPIR-YAAPNLYNFNLGITYPVFGENIRFEIKPVMLQMIQNVGQFGGHLGEDPHEHIRSFYSI-------CASFHMPGISPEELRFALFR

Query:  LTLRDEVKRWVNALEDGEVGTWDQLIEKFMKKFFPPHENARRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNCIPKCILMEVFYFGLNKAT-------
          LRD  K W+N L  G + +W+ L E F+ K+FPP  NAR + E+++FQQ + E L +A  RFK M++ CPH+ +P CI ME FY GLN  T       
Subjt:  LTLRDEVKRWVNALEDGEVGTWDQLIEKFMKKFFPPHENARRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNCIPKCILMEVFYFGLNKAT-------

Query:  -----------QIKTTLDTMVSNNEEWDEDDFGSRRGGRAKGDDGMDRSVVVALQGQMTAMNNLLKSMAISQVNAVGSSVHAA---NQIDDMGCVGCGGH
                   +    L+ + SNN +W   D  S  G + +G   ++   + ++  Q+ ++ N+L+++A+ Q + + + VH A   NQ     CV CG  
Subjt:  -----------QIKTTLDTMVSNNEEWDEDDFGSRRGGRAKGDDGMDRSVVVALQGQMTAMNNLLKSMAISQVNAVGSSVHAA---NQIDDMGCVGCGGH

Query:  HNTDACPLNTETVAFV---------RNDPFSNTYNPGWRNHPNFGWGG
        H  D CP N  ++ +V         +N+PFSNTYNPGWRNHPNF W G
Subjt:  HNTDACPLNTETVAFV---------RNDPFSNTYNPGWRNHPNFGWGG

A0A6J1G7Q6 uncharacterized protein LOC111451598 (e value: 5.3e-55; %identity: 36.73)
Query:  NAAYIAHDLDRPIR-YAAPNLYNFNLGITYPVFGENIRFEIKPVMLQMIQNVGQFGGHLGEDPHEHIRSFYSICASFHMPGISPEELRFALFRLTLRDEV
        NA ++A D +R IR YA P +   N  I  P   +   FE+KPVM QM+Q +GQF G   +DPH H++SF  +  SF   G+  + +R + F  +LRD  
Subjt:  NAAYIAHDLDRPIR-YAAPNLYNFNLGITYPVFGENIRFEIKPVMLQMIQNVGQFGGHLGEDPHEHIRSFYSICASFHMPGISPEELRFALFRLTLRDEV

Query:  KRWVNALEDGEVGTWDQLIEKFMKKFFPPHENARRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNCIPKCILMEVFYFGLNKAT--------------
        K W+N L  G + +W+ L EKF+ K+FPP  +AR R E+++FQ+ + E L +AW RFK  ++ CPH+ +P CI +E FY GLN AT              
Subjt:  KRWVNALEDGEVGTWDQLIEKFMKKFFPPHENARRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNCIPKCILMEVFYFGLNKAT--------------

Query:  ----QIKTTLDTMVSNNEEWDEDDFGSRRGGRAKGDDGMDRSVVVALQGQMTAMNNLLKSMAISQVNAVGSSVHAAN---QIDDMGCVGCGGHHNTDACP
            +    L+ + SNN +W   D  S  G + +  + ++   + ++  Q+ +M N+L+++A  Q + + +  H A    Q     CV CG  H  D CP
Subjt:  ----QIKTTLDTMVSNNEEWDEDDFGSRRGGRAKGDDGMDRSVVVALQGQMTAMNNLLKSMAISQVNAVGSSVHAAN---QIDDMGCVGCGGHHNTDACP

Query:  LNTETVAFVRN---------DPFSNTYNPGWRNHPNFGWGGSG
         N  ++ +V N         +P SNTYNPGWRNHPNF   G G
Subjt:  LNTETVAFVRN---------DPFSNTYNPGWRNHPNFGWGGSG

A0A6J1H7E4 uncharacterized protein LOC111461168 (e value: 1.2e-59; %identity: 38.78)
Query:  NAAYIAHDLDRPIR-YAAPNLYNFNLGITYPVFGENIRFEIKPVMLQMIQNVGQFGGHLGEDPHEHIRSFYSICASFHMPGISPEELRFALFRLTLRDEV
        NA  +A D +R IR YA P +   N  I  P   +   FE+KPVM QM+Q +GQF G   EDPH H++SF  +  SF   G+  + +R +LF  +LRD  
Subjt:  NAAYIAHDLDRPIR-YAAPNLYNFNLGITYPVFGENIRFEIKPVMLQMIQNVGQFGGHLGEDPHEHIRSFYSICASFHMPGISPEELRFALFRLTLRDEV

Query:  KRWVNALEDGEVGTWDQLIEKFMKKFFPPHENARRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNCIPKCILMEVFYFGLNKAT--------------
        K W+N L    + +W+ L EKF+ K+FPP  NAR R E+++FQQ + E L +AW RFK M++ CPH+ +P CI ME FY GLN AT              
Subjt:  KRWVNALEDGEVGTWDQLIEKFMKKFFPPHENARRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNCIPKCILMEVFYFGLNKAT--------------

Query:  ----QIKTTLDTMVSNNEEWDEDDFGSRRGGRAKGDDGMDRSVVVALQGQMTAMNNLLKSMAISQVNAVGSSVHAA---NQIDDMGCVGCGGHHNTDACP
            +    L+ + SNN +W   D  S  G + +G   ++   + ++  Q+ ++ N+L+++A  Q   + +  H A    Q     CV CG  H  D CP
Subjt:  ----QIKTTLDTMVSNNEEWDEDDFGSRRGGRAKGDDGMDRSVVVALQGQMTAMNNLLKSMAISQVNAVGSSVHAA---NQIDDMGCVGCGGHHNTDACP

Query:  LNTETVAFVR---------NDPFSNTYNPGWRNHPNFGWGGSG
         N  ++ +VR         N+P SNTYNPGWRNHPNF W G G
Subjt:  LNTETVAFVR---------NDPFSNTYNPGWRNHPNFGWGGSG

U5CUI2 Retrotrans_gag domain-containing protein (e value: 2.4e-55; %identity: 40.76)
Query:  NAAYIAHDLDRPIR-YAAPNLYNFNLGITYPVFGENIRFEIKPVMLQMIQNVGQFGGHLGEDPHEHIRSFYSICASFHMPGISPEELRFALFRLTLRDEV
        N   +A D  R IR YAAP     N GI  P   +  +FE+KPVM QM+Q VGQF G   EDPH H+RSF  +  SF + G+S E LR  LF  +LRD  
Subjt:  NAAYIAHDLDRPIR-YAAPNLYNFNLGITYPVFGENIRFEIKPVMLQMIQNVGQFGGHLGEDPHEHIRSFYSICASFHMPGISPEELRFALFRLTLRDEV

Query:  KRWVNALEDGEVGTWDQLIEKFMKKFFPPHENARRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNCIPKCILMEVFYFGLNKATQIK-----------
        + W+N L    V  W+ L EKF++K+FPP  NA+ R E+MSFQQ + E+  DAW RFK +++ CPH+ IP CI ME FY GLN A+++            
Subjt:  KRWVNALEDGEVGTWDQLIEKFMKKFFPPHENARRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNCIPKCILMEVFYFGLNKATQIK-----------

Query:  -------TTLDTMVSNNEEWDEDDFGSRRGGRAKGDDGMDRSVVVALQGQMTAMNNLLKSMAISQVNAVGSSVHAANQIDDMGCVGCGGHHNTDACPLNT
                 L+T+ SNN +W      +R     K    ++   + AL  QM +M N+LK+++I   NA      AA Q DD+ CV CG  H  + CP N 
Subjt:  -------TTLDTMVSNNEEWDEDDFGSRRGGRAKGDDGMDRSVVVALQGQMTAMNNLLKSMAISQVNAVGSSVHAANQIDDMGCVGCGGHHNTDACPLNT

Query:  ETVAFVRNDPFSNT
        E+V ++ N   + T
Subjt:  ETVAFVRNDPFSNT

SwissProt top hits (e value, %identity, alignment)
No hits found
Arabidopsis top hits (e value, %identity, alignment)
No hits found

Sequences
CDS sequence
ATGGAAAATAACAACAGAAATGCCCCTCCGCCGCAAGCTGACCCAGAACCAAATGCTGCCTACATAGCACACGACTTGGATAGGCCAATTAGATATGCGGCACCCAACCT
CTACAACTTCAACCTAGGAATCACCTACCCTGTATTCGGCGAGAACATCAGGTTTGAAATCAAACCTGTTATGCTTCAAATGATTCAGAACGTCGGACAATTCGGCGGAC
ATCTTGGGGAAGATCCACACGAACATATAAGGAGTTTCTACTCCATCTGCGCTTCCTTCCATATGCCGGGCATCTCACCTGAAGAATTAAGATTCGCTCTCTTCCGGTTA
ACTCTGAGGGATGAGGTGAAGAGGTGGGTAAATGCTCTGGAAGATGGCGAGGTGGGAACATGGGATCAACTAATAGAGAAATTTATGAAGAAATTTTTTCCACCTCACGA
AAATGCTAGAAGAAGGAAGGAGCTTATGAGCTTCCAACAGAAGGACAGAGAAAACCTACATGACGCGTGGAGTAGGTTCAAACGGATGGTCAAAGCATGCCCCCACAATT
GTATTCCTAAATGCATATTGATGGAGGTTTTCTATTTTGGACTAAACAAGGCTACACAGATTAAGACGACGCTGGACACGATGGTCAGCAACAATGAAGAATGGGATGAA
GATGATTTCGGTAGTCGCCGAGGAGGACGAGCAAAAGGTGATGATGGCATGGATAGGAGCGTCGTGGTGGCATTACAGGGACAAATGACTGCGATGAACAATTTACTCAA
ATCAATGGCAATATCGCAAGTCAACGCCGTAGGAAGCTCTGTGCACGCGGCTAACCAAATTGATGACATGGGATGTGTGGGATGCGGCGGTCATCATAACACTGACGCAT
GCCCACTCAATACTGAAACCGTCGCATTCGTAAGGAACGATCCCTTCTCCAATACTTACAACCCTGGTTGGAGGAACCATCCCAACTTTGGATGGGGAGGATCGGGTCAA
CAACAAGGGCGACATGGTGGTCAAGGTGACCATCGCAGGGAAGCATCTGGCTCCCACGCGAGGTACCAAAACAATAGACCCCAACACTCCCATCATCAATAG
mRNA sequence
ATGGAAAATAACAACAGAAATGCCCCTCCGCCGCAAGCTGACCCAGAACCAAATGCTGCCTACATAGCACACGACTTGGATAGGCCAATTAGATATGCGGCACCCAACCT
CTACAACTTCAACCTAGGAATCACCTACCCTGTATTCGGCGAGAACATCAGGTTTGAAATCAAACCTGTTATGCTTCAAATGATTCAGAACGTCGGACAATTCGGCGGAC
ATCTTGGGGAAGATCCACACGAACATATAAGGAGTTTCTACTCCATCTGCGCTTCCTTCCATATGCCGGGCATCTCACCTGAAGAATTAAGATTCGCTCTCTTCCGGTTA
ACTCTGAGGGATGAGGTGAAGAGGTGGGTAAATGCTCTGGAAGATGGCGAGGTGGGAACATGGGATCAACTAATAGAGAAATTTATGAAGAAATTTTTTCCACCTCACGA
AAATGCTAGAAGAAGGAAGGAGCTTATGAGCTTCCAACAGAAGGACAGAGAAAACCTACATGACGCGTGGAGTAGGTTCAAACGGATGGTCAAAGCATGCCCCCACAATT
GTATTCCTAAATGCATATTGATGGAGGTTTTCTATTTTGGACTAAACAAGGCTACACAGATTAAGACGACGCTGGACACGATGGTCAGCAACAATGAAGAATGGGATGAA
GATGATTTCGGTAGTCGCCGAGGAGGACGAGCAAAAGGTGATGATGGCATGGATAGGAGCGTCGTGGTGGCATTACAGGGACAAATGACTGCGATGAACAATTTACTCAA
ATCAATGGCAATATCGCAAGTCAACGCCGTAGGAAGCTCTGTGCACGCGGCTAACCAAATTGATGACATGGGATGTGTGGGATGCGGCGGTCATCATAACACTGACGCAT
GCCCACTCAATACTGAAACCGTCGCATTCGTAAGGAACGATCCCTTCTCCAATACTTACAACCCTGGTTGGAGGAACCATCCCAACTTTGGATGGGGAGGATCGGGTCAA
CAACAAGGGCGACATGGTGGTCAAGGTGACCATCGCAGGGAAGCATCTGGCTCCCACGCGAGGTACCAAAACAATAGACCCCAACACTCCCATCATCAATAG
Protein sequence
MENNNRNAPPPQADPEPNAAYIAHDLDRPIRYAAPNLYNFNLGITYPVFGENIRFEIKPVMLQMIQNVGQFGGHLGEDPHEHIRSFYSICASFHMPGISPEELRFALFRL
TLRDEVKRWVNALEDGEVGTWDQLIEKFMKKFFPPHENARRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNCIPKCILMEVFYFGLNKATQIKTTLDTMVSNNEEWDE
DDFGSRRGGRAKGDDGMDRSVVVALQGQMTAMNNLLKSMAISQVNAVGSSVHAANQIDDMGCVGCGGHHNTDACPLNTETVAFVRNDPFSNTYNPGWRNHPNFGWGGSGQ
QQGRHGGQGDHRREASGSHARYQNNRPQHSHHQ
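
The relationship between the CDS and protein sequences above can be checked by translating the CDS codon by codon. The following is a minimal sketch using only the Python standard library and assuming the standard genetic code applies; the `translate` helper and the table construction are illustrative and are not the method used by the database.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (third base varies fastest); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from wrapped input
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the CDS listed above.
print(translate("ATGGAAAATAACAACAGAAAT"))  # MENNNRN
```

Pasting the full wrapped CDS into `translate` should work without manual joining, since whitespace is removed before the codons are read.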