; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0008571 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0008571
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr03:18652758..18653813
RNA-Seq ExpressionPI0008571
SyntenyPI0008571
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_017233063.1 PREDICTED: uncharacterized protein LOC108207110 [Daucus carota subsp. sativus]2.9e-6839.61Show/hide
Query:  MENNNRNAPPPQADPKPNAAYIAHDLDRSIRSYAAPNLYYFNPGIAYPIFGENARFEIKPVMLQMIQNVGQFDGHPGEDPHEHIRSFYSICASFHMPGIS
        M++N  N   P     P  A+I  D DR+IR YAAP     N GI  P   +  +FE+KPVM QM+Q +GQF G P EDPH H+R F  I  SF   G+ 
Subjt:  MENNNRNAPPPQADPKPNAAYIAHDLDRSIRSYAAPNLYYFNPGIAYPIFGENARFEIKPVMLQMIQNVGQFDGHPGEDPHEHIRSFYSICASFHMPGIS

Query:  PEELRFALFLLTLRDEAKRWANALEDGEVGIWDQLIEKFMKKFFPLHENARRRKELMSFQQKDRENLHNAWSRFKRMVKACPHNGIPKCILMEVFYFGLN
         + LR  LF  ++RD A+ W N+L  G V  W+ L EKF+ K+FP + NA+ R E+ SFQQ+D E+L++AW RFK +++ CPH+GI  CI ME FY GLN
Subjt:  PEELRFALFLLTLRDEAKRWANALEDGEVGIWDQLIEKFMKKFFPLHENARRRKELMSFQQKDRENLHNAWSRFKRMVKACPHNGIPKCILMEVFYFGLN

Query:  KATQQTADAVFVDGMLKSTYNQIKTTLDTMASNNEEWDEDDFENRREGRAKEDDMDKSAVVALQGQMTAMNNLLKSMAISQVNAAGSSVLA-ANQIDEME
          T+   DA     +L  +YNQ    L+T+A+ N +W        + G+      D  ++ +++ Q+ +M ++LK++++    +   S+ +  NQ   + 
Subjt:  KATQQTADAVFVDGMLKSTYNQIKTTLDTMASNNEEWDEDDFENRREGRAKEDDMDKSAVVALQGQMTAMNNLLKSMAISQVNAAGSSVLA-ANQIDEME

Query:  CMGCDGHHNTDACPLNTETVAFVRND----PFSNTYNPGWRNHPNFGWGGSGQQQG
        C+ C   H  D+CP N E+V ++ N     P+SNTYN  WR HPNF W   G   G
Subjt:  CMGCDGHHNTDACPLNTETVAFVRND----PFSNTYNPGWRNHPNFGWGGSGQQQG

XP_022926214.1 uncharacterized protein LOC111433394 [Cucurbita moschata]2.0e-6941.23Show/hide
Query:  NAAYIAHDLDRSIRSYAAPNLYYFNPGIAYPIFGENARFEIKPVMLQMIQNVGQFDGHPGEDPHEHIRSFYSICASFHMPGISPEELRFALFLLTLRDEA
        NA ++A D +R+IR+YA P +   NP I  P   +   FE+KPVM QM+Q +GQF G P EDPH H++SF  +  SF    +  + +R +LF  +LRD A
Subjt:  NAAYIAHDLDRSIRSYAAPNLYYFNPGIAYPIFGENARFEIKPVMLQMIQNVGQFDGHPGEDPHEHIRSFYSICASFHMPGISPEELRFALFLLTLRDEA

Query:  KRWANALEDGEVGIWDQLIEKFMKKFFPLHENARRRKELMSFQQKDRENLHNAWSRFKRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVFVDGMLK
        K W N L  G +  W+ L+EKF+ K+FP   NAR R E++ FQQ + + L  AW RFK M++ CPH+G+P CI ME FY GLN AT+Q  DA     +L 
Subjt:  KRWANALEDGEVGIWDQLIEKFMKKFFPLHENARRRKELMSFQQKDRENLHNAWSRFKRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVFVDGMLK

Query:  STYNQIKTTLDTMASNNEEWDEDDFENRREGRAKEDDMDKSAVVALQGQMTAMNNLLKSMAISQ---VNAAGSSVLAANQIDEMECMGCDGHHNTDACPL
         TYN+    L+ +ASNN +W +        GR     ++  A+ ++  Q+ ++ N+L+++A+ Q   + A   +V   NQ     C+ C   H  D CP 
Subjt:  STYNQIKTTLDTMASNNEEWDEDDFENRREGRAKEDDMDKSAVVALQGQMTAMNNLLKSMAISQ---VNAAGSSVLAANQIDEMECMGCDGHHNTDACPL

Query:  NTETVAFV---------RNDPFSNTYNPGWRNHPNFGWGGSG
        N  ++ +V         +N+PFSNTYNPGWRNHPNF W G G
Subjt:  NTETVAFV---------RNDPFSNTYNPGWRNHPNFGWGGSG

XP_030497803.1 uncharacterized protein LOC115713460 [Cannabis sativa]1.5e-6942.9Show/hide
Query:  ADPKPNAAYIAHDLDRSIRSYAAPNLYYFNPGIAYPIFGENARFEIKPVMLQMIQNVGQFDGHPGEDPHEHIRSFYSICASFHMPGISPEELRFALFLLT
        A  + N   +A D  R+IR YAAP     NPGI  P   +   FE+KPVM QM+Q VGQF G P EDPH HIRSF  +  SF + G+S E LR  LF  +
Subjt:  ADPKPNAAYIAHDLDRSIRSYAAPNLYYFNPGIAYPIFGENARFEIKPVMLQMIQNVGQFDGHPGEDPHEHIRSFYSICASFHMPGISPEELRFALFLLT

Query:  LRDEAKRWANALEDGEVGIWDQLIEKFMKKFFPLHENARRRKELMSFQQKDRENLHNAWSRFKRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVFV
        LRD A+ W N L    V  W+ L EKF++K+FP   NA+ R E+MSFQQ + E   +AW RFK +++ CPH+GIP CI +E FY GLN A++   DA   
Subjt:  LRDEAKRWANALEDGEVGIWDQLIEKFMKKFFPLHENARRRKELMSFQQKDRENLHNAWSRFKRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVFV

Query:  DGMLKSTYNQIKTTLDTMASNNEEWDEDDFENRREGRAKEDDMDKSAVVALQGQMTAMNNLLKSMAISQVNAAGS-SVLAANQIDEMECMGCDGHHNTDA
          +L  +YN+    L+ +ASNN +W  +        R     ++  A+ AL  QM +M N+LK+M     N  GS    AA Q  +  C+ C   H  + 
Subjt:  DGMLKSTYNQIKTTLDTMASNNEEWDEDDFENRREGRAKEDDMDKSAVVALQGQMTAMNNLLKSMAISQVNAAGS-SVLAANQIDEMECMGCDGHHNTDA

Query:  CPLNTETVAFV-------RNDPFSNTYNPGWRNHPNFGWGGSGQQ
        CP N  +V +V        N+P+SN+YNP W++HPNF WGG G+Q
Subjt:  CPLNTETVAFV-------RNDPFSNTYNPGWRNHPNFGWGGSGQQ

XP_038880527.1 uncharacterized protein LOC120072192 [Benincasa hispida]1.9e-7242.73Show/hide
Query:  IAHDLDRSIRSYAAPNLYYFNPGIAYPIFGENARFEIKPVMLQMIQNVGQFDGHPGEDPHEHIRSFYSICASFHMPGISPEELRFALFLLTLRDEAKRWA
        +A++  R +R YA+P LY F+PGI YP+  +  RFE+K VMLQM+Q   QF G  GEDPH H++ F   C  F +P I+PE++R +LF  +LRD+AK+W 
Subjt:  IAHDLDRSIRSYAAPNLYYFNPGIAYPIFGENARFEIKPVMLQMIQNVGQFDGHPGEDPHEHIRSFYSICASFHMPGISPEELRFALFLLTLRDEAKRWA

Query:  NALEDGEVGIWDQLIEKFMKKFFPLHENARRRKELMSFQQKDRENLHNAWSRFKRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVFVDGMLKSTYN
        ++LE  E+  W++L+EKFM+K+FP   NARRR+E+M+F+Q+D E L  A  RF  +VK CP++ +   I ME FY GLN+A+Q  ADA   +G++  +Y 
Subjt:  NALEDGEVGIWDQLIEKFMKKFFPLHENARRRKELMSFQQKDRENLHNAWSRFKRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVFVDGMLKSTYN

Query:  QIKTTLDTMASNNEEWDEDDFENRREGRAKE---DDMDKSAVVALQGQMTAMNNLLKSMAISQVNAAGSSVLAANQI-----DEMECMGCDGHHNTDACP
        + K  L  +A +N EW +D ++ R + R +    + +D +A+  L  Q+  M +LL+++ +   NA        NQ+       + C+GC   H+   CP
Subjt:  QIKTTLDTMASNNEEWDEDDFENRREGRAKE---DDMDKSAVVALQGQMTAMNNLLKSMAISQVNAAGSSVLAANQI-----DEMECMGCDGHHNTDACP

Query:  LNTETVAFVRNDPFSNTYNPGWRNHPNFGWGGSGQQQ
         N ++V F++N+PFSNTYNPGW NHPNF W G  QQ+
Subjt:  LNTETVAFVRNDPFSNTYNPGWRNHPNFGWGGSGQQQ

XP_038887458.1 uncharacterized protein LOC120077591 [Benincasa hispida]3.6e-7945.71Show/hide
Query:  ENNNRNAPPPQAD---PKPNAAYIAHDLDRSIRSYAAPNLYYFNPGIAYPIFGENARFEIKPVMLQMIQNVGQFDGHPGEDPHEHIRSFYSICASFHMPG
        +NNN  AP        P  +  ++A D +  IR+YAAPNLY F+PGI+ PI  ENARFEIKPVM+QMIQN+ QF+    E+PH H+  F  +C++F +PG
Subjt:  ENNNRNAPPPQAD---PKPNAAYIAHDLDRSIRSYAAPNLYYFNPGIAYPIFGENARFEIKPVMLQMIQNVGQFDGHPGEDPHEHIRSFYSICASFHMPG

Query:  ISPEELRFALFLLTLRDEAKRWANALEDGEVGIWDQLIEKFMKKFFPLHENARRRKELMSFQQKDRENLHNAWSRFKRMVKACPHNGIPKCILMEVFYFG
        I+P  +R  LF  TLRD+AKRWA++LE  E+   DQL+E FMKKFFP   N RRRK +++F++ D E L  AW RF+R+VK CPH GI  C+LME+FY G
Subjt:  ISPEELRFALFLLTLRDEAKRWANALEDGEVGIWDQLIEKFMKKFFPLHENARRRKELMSFQQKDRENLHNAWSRFKRMVKACPHNGIPKCILMEVFYFG

Query:  LNKATQQTADAVFVDGMLKSTYNQIKTTLDTMASNNEEWDEDDFENRREGRAKEDD--MDKSAVVALQGQMTAMNNLLKSMAISQ-VNAAGSS-VLAANQ
        LN++TQ  ADA  V+  +  TY + K  LD ++ N ++W +D +  R   R + D+  +  + +  L  QM  + +LL+ MA++Q V++ GS+   A  Q
Subjt:  LNKATQQTADAVFVDGMLKSTYNQIKTTLDTMASNNEEWDEDDFENRREGRAKEDD--MDKSAVVALQGQMTAMNNLLKSMAISQ-VNAAGSS-VLAANQ

Query:  IDEMECMGCDGHHNTDACPLNTETVAFVRNDPFSNTYNPGWRNHPNFGWG
        +  +  +     H  + CP N + V  ++N+P++NTYNP WRNHPNFGWG
Subjt:  IDEMECMGCDGHHNTDACPLNTETVAFVRNDPFSNTYNPGWRNHPNFGWG

TrEMBL top hitse value%identityAlignment
A0A6J1EEI2 uncharacterized protein LOC1114333949.6e-7041.23Show/hide
Query:  NAAYIAHDLDRSIRSYAAPNLYYFNPGIAYPIFGENARFEIKPVMLQMIQNVGQFDGHPGEDPHEHIRSFYSICASFHMPGISPEELRFALFLLTLRDEA
        NA ++A D +R+IR+YA P +   NP I  P   +   FE+KPVM QM+Q +GQF G P EDPH H++SF  +  SF    +  + +R +LF  +LRD A
Subjt:  NAAYIAHDLDRSIRSYAAPNLYYFNPGIAYPIFGENARFEIKPVMLQMIQNVGQFDGHPGEDPHEHIRSFYSICASFHMPGISPEELRFALFLLTLRDEA

Query:  KRWANALEDGEVGIWDQLIEKFMKKFFPLHENARRRKELMSFQQKDRENLHNAWSRFKRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVFVDGMLK
        K W N L  G +  W+ L+EKF+ K+FP   NAR R E++ FQQ + + L  AW RFK M++ CPH+G+P CI ME FY GLN AT+Q  DA     +L 
Subjt:  KRWANALEDGEVGIWDQLIEKFMKKFFPLHENARRRKELMSFQQKDRENLHNAWSRFKRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVFVDGMLK

Query:  STYNQIKTTLDTMASNNEEWDEDDFENRREGRAKEDDMDKSAVVALQGQMTAMNNLLKSMAISQ---VNAAGSSVLAANQIDEMECMGCDGHHNTDACPL
         TYN+    L+ +ASNN +W +        GR     ++  A+ ++  Q+ ++ N+L+++A+ Q   + A   +V   NQ     C+ C   H  D CP 
Subjt:  STYNQIKTTLDTMASNNEEWDEDDFENRREGRAKEDDMDKSAVVALQGQMTAMNNLLKSMAISQ---VNAAGSSVLAANQIDEMECMGCDGHHNTDACPL

Query:  NTETVAFV---------RNDPFSNTYNPGWRNHPNFGWGGSG
        N  ++ +V         +N+PFSNTYNPGWRNHPNF W G G
Subjt:  NTETVAFV---------RNDPFSNTYNPGWRNHPNFGWGGSG

A0A6J1EQ90 uncharacterized protein LOC1114364119.3e-6539.48Show/hide
Query:  NAAYIAHDLDRSIRSYAAPNLYYFNPGIAYPIFGENARFEIKPVMLQMIQNVGQFDGHPGEDPHEHIRSFYSI-------CASFHMPGISPEELRFALFL
        N  ++A D +R+IR+YA P +   NP I  P   +   FE+KPVM QM+Q +GQF G P EDPH H++SF  +         SF   G+  + +R +LF 
Subjt:  NAAYIAHDLDRSIRSYAAPNLYYFNPGIAYPIFGENARFEIKPVMLQMIQNVGQFDGHPGEDPHEHIRSFYSI-------CASFHMPGISPEELRFALFL

Query:  LTLRDEAKRWANALEDGEVGIWDQLIEKFMKKFFPLHENARRRKELMSFQQKDRENLHNAWSRFKRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAV
          LRD AK W N L  G +  W+ L E F+ K+FP   NAR + E+++FQQ + E L  A  RFK M++ CPH+G+P CI ME FY GLN  T+Q  DA 
Subjt:  LTLRDEAKRWANALEDGEVGIWDQLIEKFMKKFFPLHENARRRKELMSFQQKDRENLHNAWSRFKRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAV

Query:  FVDGMLKSTYNQIKTTLDTMASNNEEWDEDDFENRREGRAKEDDMDKSAVVALQGQMTAMNNLLKSMAISQ---VNAAGSSVLAANQIDEMECMGCDGHH
            +L  TYN+    L+ +ASNN +W +        GR     ++  A+ ++  Q+ ++ N+L+++A+ Q   + A   +  A NQ     C+ C   H
Subjt:  FVDGMLKSTYNQIKTTLDTMASNNEEWDEDDFENRREGRAKEDDMDKSAVVALQGQMTAMNNLLKSMAISQ---VNAAGSSVLAANQIDEMECMGCDGHH

Query:  NTDACPLNTETVAFV---------RNDPFSNTYNPGWRNHPNFGWGG
          D CP N  ++ +V         +N+PFSNTYNPGWRNHPNF W G
Subjt:  NTDACPLNTETVAFV---------RNDPFSNTYNPGWRNHPNFGWGG

A0A6J1G7Q6 uncharacterized protein LOC1114515981.5e-6238.6Show/hide
Query:  NAAYIAHDLDRSIRSYAAPNLYYFNPGIAYPIFGENARFEIKPVMLQMIQNVGQFDGHPGEDPHEHIRSFYSICASFHMPGISPEELRFALFLLTLRDEA
        NA ++A D +R+IR+YA P +   NP I  P   +   FE+KPVM QM+Q +GQF G   +DPH H++SF  +  SF   G+  + +R + F  +LRD A
Subjt:  NAAYIAHDLDRSIRSYAAPNLYYFNPGIAYPIFGENARFEIKPVMLQMIQNVGQFDGHPGEDPHEHIRSFYSICASFHMPGISPEELRFALFLLTLRDEA

Query:  KRWANALEDGEVGIWDQLIEKFMKKFFPLHENARRRKELMSFQQKDRENLHNAWSRFKRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVFVDGMLK
        K W N L  G +  W+ L EKF+ K+FP   +AR R E+++FQ+ + E L  AW RFK  ++ CPH+G+P CI +E FY GLN AT+Q  DA     +L 
Subjt:  KRWANALEDGEVGIWDQLIEKFMKKFFPLHENARRRKELMSFQQKDRENLHNAWSRFKRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVFVDGMLK

Query:  STYNQIKTTLDTMASNNEEWDEDDFENRREGRAKEDDMDKSAVVALQGQMTAMNNLLKSMAISQ---VNAAGSSVLAANQIDEMECMGCDGHHNTDACPL
         TYN+    L+ +ASNN +W +        G+   + ++  A+ ++  Q+ +M N+L+++A  Q   + A   +     Q     C+ C   H  D CP 
Subjt:  STYNQIKTTLDTMASNNEEWDEDDFENRREGRAKEDDMDKSAVVALQGQMTAMNNLLKSMAISQ---VNAAGSSVLAANQIDEMECMGCDGHHNTDACPL

Query:  NTETVAFVRN---------DPFSNTYNPGWRNHPNFGWGGSG
        N  ++ +V N         +P SNTYNPGWRNHPNF   G G
Subjt:  NTETVAFVRN---------DPFSNTYNPGWRNHPNFGWGGSG

A0A6J1H7E4 uncharacterized protein LOC1114611681.8e-6840.94Show/hide
Query:  NAAYIAHDLDRSIRSYAAPNLYYFNPGIAYPIFGENARFEIKPVMLQMIQNVGQFDGHPGEDPHEHIRSFYSICASFHMPGISPEELRFALFLLTLRDEA
        NA  +A D +R+IR+YA P +   NP I  P   +   FE+KPVM QM+Q +GQF G P EDPH H++SF  +  SF   G+  + +R +LF  +LRD A
Subjt:  NAAYIAHDLDRSIRSYAAPNLYYFNPGIAYPIFGENARFEIKPVMLQMIQNVGQFDGHPGEDPHEHIRSFYSICASFHMPGISPEELRFALFLLTLRDEA

Query:  KRWANALEDGEVGIWDQLIEKFMKKFFPLHENARRRKELMSFQQKDRENLHNAWSRFKRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVFVDGMLK
        K W N L    +  W+ L EKF+ K+FP   NAR R E+++FQQ + E L  AW RFK M++ CPH+G+P CI ME FY GLN AT+Q  DA     ML 
Subjt:  KRWANALEDGEVGIWDQLIEKFMKKFFPLHENARRRKELMSFQQKDRENLHNAWSRFKRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVFVDGMLK

Query:  STYNQIKTTLDTMASNNEEWDEDDFENRREGRAKEDDMDKSAVVALQGQMTAMNNLLKSMAISQ---VNAAGSSVLAANQIDEMECMGCDGHHNTDACPL
         TYN+    L+ +ASNN +W +        G+     ++  A+ ++  Q+ ++ N+L+++A  Q   + A   +     Q     C+ C   H  D CP 
Subjt:  STYNQIKTTLDTMASNNEEWDEDDFENRREGRAKEDDMDKSAVVALQGQMTAMNNLLKSMAISQ---VNAAGSSVLAANQIDEMECMGCDGHHNTDACPL

Query:  NTETVAFVR---------NDPFSNTYNPGWRNHPNFGWGGSG
        N  ++ +VR         N+P SNTYNPGWRNHPNF W G G
Subjt:  NTETVAFVR---------NDPFSNTYNPGWRNHPNFGWGGSG

U5CUI2 Retrotrans_gag domain-containing protein8.7e-6342.49Show/hide
Query:  NAAYIAHDLDRSIRSYAAPNLYYFNPGIAYPIFGENARFEIKPVMLQMIQNVGQFDGHPGEDPHEHIRSFYSICASFHMPGISPEELRFALFLLTLRDEA
        N   +A D  R+IR YAAP     NPGI  P   +  +FE+KPVM QM+Q VGQF G P EDPH H+RSF  +  SF + G+S E LR  LF  +LRD A
Subjt:  NAAYIAHDLDRSIRSYAAPNLYYFNPGIAYPIFGENARFEIKPVMLQMIQNVGQFDGHPGEDPHEHIRSFYSICASFHMPGISPEELRFALFLLTLRDEA

Query:  KRWANALEDGEVGIWDQLIEKFMKKFFPLHENARRRKELMSFQQKDRENLHNAWSRFKRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVFVDGMLK
        + W N L    V  W+ L EKF++K+FP   NA+ R E+MSFQQ + E+  +AW RFK +++ CPH+GIP CI ME FY GLN A++   DA     +L 
Subjt:  KRWANALEDGEVGIWDQLIEKFMKKFFPLHENARRRKELMSFQQKDRENLHNAWSRFKRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVFVDGMLK

Query:  STYNQIKTTLDTMASNNEEWDEDDFENRREGRAKEDDMDKSAVVALQGQMTAMNNLLKSMAISQVNAAGSSVLAANQIDEMECMGCDGHHNTDACPLNTE
         +YN+    L+T+ASNN +W        R+       ++  A+ AL  QM +M N+LK+++I   NA      AA Q D++ C+ C   H  + CP N E
Subjt:  STYNQIKTTLDTMASNNEEWDEDDFENRREGRAKEDDMDKSAVVALQGQMTAMNNLLKSMAISQVNAAGSSVLAANQIDEMECMGCDGHHNTDACPLNTE

Query:  TVAFVRNDPFSNT
        +V ++ N   + T
Subjt:  TVAFVRNDPFSNT

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAAATAACAACAGAAATGCCCCTCCGCCGCAAGCTGACCCAAAACCAAACGCCGCCTATATAGCACATGACTTGGATAGGTCGATTAGATCTTATGCGGCGCCCAA
CCTCTATTACTTCAACCCAGGAATCGCCTACCCTATATTCGGCGAGAACGCCAGGTTTGAAATCAAACCTGTTATGCTTCAAATGATTCAGAATGTCGGACAATTCGACG
GACATCCTGGGGAAGATCCACACGAACATATAAGGAGTTTCTACTCCATCTGCGCTTCCTTCCATATGCCAGGCATCTCACCTGAAGAATTAAGATTCGCCCTCTTCCTG
TTAACTCTGAGGGATGAGGCGAAGAGATGGGCGAATGCCCTGGAAGATGGCGAGGTGGGAATATGGGATCAATTAATAGAAAAATTTATGAAGAAATTTTTCCCACTTCA
CGAAAATGCTAGAAGAAGGAAGGAGCTTATGAGCTTCCAGCAGAAGGATAGAGAAAACCTACATAACGCGTGGAGTAGGTTCAAACGGATGGTCAAAGCATGCCCCCACA
ATGGCATTCCTAAATGCATATTGATGGAGGTTTTCTATTTTGGACTAAACAAGGCTACACAGCAGACTGCTGATGCTGTGTTTGTAGATGGTATGCTGAAAAGTACATAC
AACCAGATTAAGACGACGCTGGACACGATGGCCAGCAACAATGAAGAGTGGGATGAAGATGATTTCGAAAATCGCCGAGAAGGACGAGCAAAAGAGGATGACATGGATAA
GAGCGCCGTGGTTGCATTGCAGGGACAAATGACTGCGATGAACAATTTACTTAAATCAATGGCAATATCGCAAGTCAACGCCGCAGGAAGCTCTGTGCTCGCGGCTAACC
AAATTGATGAAATGGAATGCATGGGATGCGACGGTCATCATAACACTGACGCATGCCCACTCAATACTGAAACCGTCGCGTTCGTAAGGAACGACCCCTTCTCCAATACT
TACAACCCTGGTTGGAGGAACCATCCCAACTTTGGATGGGGAGGATCGGGTCAACAACAAGGATGA
mRNA sequenceShow/hide mRNA sequence
ATGGAAAATAACAACAGAAATGCCCCTCCGCCGCAAGCTGACCCAAAACCAAACGCCGCCTATATAGCACATGACTTGGATAGGTCGATTAGATCTTATGCGGCGCCCAA
CCTCTATTACTTCAACCCAGGAATCGCCTACCCTATATTCGGCGAGAACGCCAGGTTTGAAATCAAACCTGTTATGCTTCAAATGATTCAGAATGTCGGACAATTCGACG
GACATCCTGGGGAAGATCCACACGAACATATAAGGAGTTTCTACTCCATCTGCGCTTCCTTCCATATGCCAGGCATCTCACCTGAAGAATTAAGATTCGCCCTCTTCCTG
TTAACTCTGAGGGATGAGGCGAAGAGATGGGCGAATGCCCTGGAAGATGGCGAGGTGGGAATATGGGATCAATTAATAGAAAAATTTATGAAGAAATTTTTCCCACTTCA
CGAAAATGCTAGAAGAAGGAAGGAGCTTATGAGCTTCCAGCAGAAGGATAGAGAAAACCTACATAACGCGTGGAGTAGGTTCAAACGGATGGTCAAAGCATGCCCCCACA
ATGGCATTCCTAAATGCATATTGATGGAGGTTTTCTATTTTGGACTAAACAAGGCTACACAGCAGACTGCTGATGCTGTGTTTGTAGATGGTATGCTGAAAAGTACATAC
AACCAGATTAAGACGACGCTGGACACGATGGCCAGCAACAATGAAGAGTGGGATGAAGATGATTTCGAAAATCGCCGAGAAGGACGAGCAAAAGAGGATGACATGGATAA
GAGCGCCGTGGTTGCATTGCAGGGACAAATGACTGCGATGAACAATTTACTTAAATCAATGGCAATATCGCAAGTCAACGCCGCAGGAAGCTCTGTGCTCGCGGCTAACC
AAATTGATGAAATGGAATGCATGGGATGCGACGGTCATCATAACACTGACGCATGCCCACTCAATACTGAAACCGTCGCGTTCGTAAGGAACGACCCCTTCTCCAATACT
TACAACCCTGGTTGGAGGAACCATCCCAACTTTGGATGGGGAGGATCGGGTCAACAACAAGGATGA
Protein sequenceShow/hide protein sequence
MENNNRNAPPPQADPKPNAAYIAHDLDRSIRSYAAPNLYYFNPGIAYPIFGENARFEIKPVMLQMIQNVGQFDGHPGEDPHEHIRSFYSICASFHMPGISPEELRFALFL
LTLRDEAKRWANALEDGEVGIWDQLIEKFMKKFFPLHENARRRKELMSFQQKDRENLHNAWSRFKRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVFVDGMLKSTY
NQIKTTLDTMASNNEEWDEDDFENRREGRAKEDDMDKSAVVALQGQMTAMNNLLKSMAISQVNAAGSSVLAANQIDEMECMGCDGHHNTDACPLNTETVAFVRNDPFSNT
YNPGWRNHPNFGWGGSGQQQG