; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0009866 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0009866
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionABC transporter F family member 4-like
Genome locationchr02:25103576..25104874
RNA-Seq ExpressionPI0009866
SyntenyPI0009866
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0057746.1 ABC transporter F family member 4-like [Cucumis melo var. makuwa]6.2e-12483.51Show/hide
Query:  MGCGNSKLNPEGEMIPPRIRPLHVRNKFLELRKRKNGTHLRDGALSKKVLLKDGESEEENPMLVNNRHACGSTKCLASQQHTNNNATKDEHNSASTIPPS
        MGCGNSKLNPEGE++PPRIRPL VRNKFLELRKRKNGTHLRDGALSKKVLLK+GESEEEN M V+NRH CGSTKCLASQQHT NNATKDEHNSASTIPPS
Subjt:  MGCGNSKLNPEGEMIPPRIRPLHVRNKFLELRKRKNGTHLRDGALSKKVLLKDGESEEENPMLVNNRHACGSTKCLASQQHTNNNATKDEHNSASTIPPS

Query:  NNNANATKNCEQSNH----------------------------KTMNEHKCIQEGDENNKKE-EDGRPDNEENRGSFIYPGSPSFRFYFVEETLDNKEKV
        NN+ANATKN EQSN                             KTMNEHKCIQEGDENNKKE EDGRPDNEENRGS I PGSPSFRFYFVEET D+KEKV
Subjt:  NNNANATKNCEQSNH----------------------------KTMNEHKCIQEGDENNKKE-EDGRPDNEENRGSFIYPGSPSFRFYFVEETLDNKEKV

Query:  EMKDAGGVGDVSHKKSPSHDSVESTTSAKSGEGQENKVIKKGKKGTTFNRVISKKRPVSVGVKNLLNIKSCYHLSCSGNDRANLLARKAEA
        EMKDAGG+GDVSHKKSPSHDSVESTTSAKSGEGQENKVIKKGKKGTTFNRVISK+RPVSVGVKNLLN+KSCYHLSCSGNDRANLLARKAEA
Subjt:  EMKDAGGVGDVSHKKSPSHDSVESTTSAKSGEGQENKVIKKGKKGTTFNRVISKKRPVSVGVKNLLNIKSCYHLSCSGNDRANLLARKAEA

KAE8652373.1 hypothetical protein Csa_013810 [Cucumis sativus]2.6e-9073Show/hide
Query:  MGCGNSKLNPEGEMIPPRIRPLHVRNKFLELRKRKNGTHLRDGALSKKVLLKDGESEEENPMLVNNRHACGSTKCLASQQHT-NNNATKDEHNSASTIPP
        MGCGNSKLNP GE++PPRIRPLHVRNK LELRKRKNGTHLRDGALSKKVLLKDGESEEEN M V+NRH CGSTKCLASQQHT NNNATKDEHNSASTIPP
Subjt:  MGCGNSKLNPEGEMIPPRIRPLHVRNKFLELRKRKNGTHLRDGALSKKVLLKDGESEEENPMLVNNRHACGSTKCLASQQHT-NNNATKDEHNSASTIPP

Query:  SNNNANATKNCEQSNHKTMNEHKCIQEGDENNKKEEDGRPDNEENRGSFIYPGSPSFRFYFVEETLDNKEKVEMKDAGGVGDVSHKKSPSHDSVESTTSA
        SNNNA+ATK  EQSNH        ++E  + + + +   PD+           +P       ++   N+ K EMKDA G+GDVSHKKSPS DSVESTTSA
Subjt:  SNNNANATKNCEQSNHKTMNEHKCIQEGDENNKKEEDGRPDNEENRGSFIYPGSPSFRFYFVEETLDNKEKVEMKDAGGVGDVSHKKSPSHDSVESTTSA

Query:  KSGEGQENKVIKKGKKGTTFNRVISKKRPVSVGVKNLLNIKSCYHLSCSGNDRANLLARKAEA
        K  EGQENK IKKGKK TTFNRV+SKKRPVSVGVKNLLN+KSCYHLSCSGNDRANLLARKAEA
Subjt:  KSGEGQENKVIKKGKKGTTFNRVISKKRPVSVGVKNLLNIKSCYHLSCSGNDRANLLARKAEA

XP_008464387.1 PREDICTED: uncharacterized protein LOC103502290 [Cucumis melo]3.5e-12783.84Show/hide
Query:  MEEERQMGCGNSKLNPEGEMIPPRIRPLHVRNKFLELRKRKNGTHLRDGALSKKVLLKDGESEEENPMLVNNRHACGSTKCLASQQHTNNNATKDEHNSA
        MEEERQMGCGNSKLNPEGE++PPRIRPL VRNKFLELRKRKNGTHLRDGALSKKVLLK+GESEEEN M V+NRH CGSTKCLASQQHT NNATKDEHNSA
Subjt:  MEEERQMGCGNSKLNPEGEMIPPRIRPLHVRNKFLELRKRKNGTHLRDGALSKKVLLKDGESEEENPMLVNNRHACGSTKCLASQQHTNNNATKDEHNSA

Query:  STIPPSNNNANATKNCEQSNH----------------------------KTMNEHKCIQEGDENNKKE-EDGRPDNEENRGSFIYPGSPSFRFYFVEETL
        STIPPSNN+ANATKN EQSN                             KTMNEHKCIQEGDENNKKE EDGRPDNEENRGS I PGSPSFRFYFVEET 
Subjt:  STIPPSNNNANATKNCEQSNH----------------------------KTMNEHKCIQEGDENNKKE-EDGRPDNEENRGSFIYPGSPSFRFYFVEETL

Query:  DNKEKVEMKDAGGVGDVSHKKSPSHDSVESTTSAKSGEGQENKVIKKGKKGTTFNRVISKKRPVSVGVKNLLNIKSCYHLSCSGNDRANLLARKAEA
        D+KEKVEMKDAGG+GDVSHKKSPSHDSVESTTSAKSGEGQENKVIKKGKKGTTFNRVISK+RPVSVGVKNLLN+KSCYHLSCSGNDRANLLARKAEA
Subjt:  DNKEKVEMKDAGGVGDVSHKKSPSHDSVESTTSAKSGEGQENKVIKKGKKGTTFNRVISKKRPVSVGVKNLLNIKSCYHLSCSGNDRANLLARKAEA

XP_011649820.1 probable DNA-directed RNA polymerase I subunit RPA43 isoform X1 [Cucumis sativus]1.6e-11981.51Show/hide
Query:  MGCGNSKLNPEGEMIPPRIRPLHVRNKFLELRKRKNGTHLRDGALSKKVLLKDGESEEENPMLVNNRHACGSTKCLASQQHT-NNNATKDEHNSASTIPP
        MGCGNSKLNP GE++PPRIRPLHVRNK LELRKRKNGTHLRDGALSKKVLLKDGESEEEN M V+NRH CGSTKCLASQQHT NNNATKDEHNSASTIPP
Subjt:  MGCGNSKLNPEGEMIPPRIRPLHVRNKFLELRKRKNGTHLRDGALSKKVLLKDGESEEENPMLVNNRHACGSTKCLASQQHT-NNNATKDEHNSASTIPP

Query:  SNNNANATKNCEQSNH----------------------------KTMNEHKCIQEGDENNKKE-EDGRPDNEENRGSFIYPGSPSFRFYFVEETLDNKEK
        SNNNA+ATK  EQSNH                            KTMNEHKCIQEGDENNKKE EDGRPDNEENRGSFI PGSPSFR YFVEET D+KEK
Subjt:  SNNNANATKNCEQSNH----------------------------KTMNEHKCIQEGDENNKKE-EDGRPDNEENRGSFIYPGSPSFRFYFVEETLDNKEK

Query:  VEMKDAGGVGDVSHKKSPSHDSVESTTSAKSGEGQENKVIKKGKKGTTFNRVISKKRPVSVGVKNLLNIKSCYHLSCSGNDRANLLARKAEA
        VEMKDA G+GDVSHKKSPS DSVESTTSAK  EGQENK IKKGKK TTFNRV+SKKRPVSVGVKNLLN+KSCYHLSCSGNDRANLLARKAEA
Subjt:  VEMKDAGGVGDVSHKKSPSHDSVESTTSAKSGEGQENKVIKKGKKGTTFNRVISKKRPVSVGVKNLLNIKSCYHLSCSGNDRANLLARKAEA

XP_022921428.1 uncharacterized protein LOC111429712 [Cucurbita moschata]2.0e-6659.34Show/hide
Query:  MGCGNSKLNPEGEMIPPRIRPLHVRNKFLELRKRKNGTHLRDGALSKKVLLKDGESEEENPML-VNNRHAC-----GSTKCLASQQHTNNNATKDEHNSA
        MGCGNSKL PEGE I P IRPL  R KF E RKRKNGTHLR+ ALSKKVLLK+GE EEEN +L V+NR++      G T CL    HT      DEH+S 
Subjt:  MGCGNSKLNPEGEMIPPRIRPLHVRNKFLELRKRKNGTHLRDGALSKKVLLKDGESEEENPML-VNNRHAC-----GSTKCLASQQHTNNNATKDEHNSA

Query:  ST---IPPSNNNANATKNCEQSNHKTMNEHK-CIQEGDENNKKE-EDGRPDNEENRGSFIYPGSPSFRFYFVEETLDNKEKVEMKDAGGVGDVSHKKSPS
        +    +     N N T+  E   +KTMN  +  ++EGD+ NK+E E+GRPDNE+NR   I PGSPSFR YFVE+T ++K+ VEM D G + D S KKSPS
Subjt:  ST---IPPSNNNANATKNCEQSNHKTMNEHK-CIQEGDENNKKE-EDGRPDNEENRGSFIYPGSPSFRFYFVEETLDNKEKVEMKDAGGVGDVSHKKSPS

Query:  HDSVESTTSAKSGEGQENKVIKKGKKGTTFNRVISKKRPVSVGVKNLLNIKS-CYHLSCSGNDRANLLARKAE
         DSVES++S KS EGQENK IKKGKKGTT NR  S++RPV VG+K+LLN+ + CYHLSC+GNDR N LARKAE
Subjt:  HDSVESTTSAKSGEGQENKVIKKGKKGTTFNRVISKKRPVSVGVKNLLNIKS-CYHLSCSGNDRANLLARKAE

TrEMBL top hitse value%identityAlignment
A0A0A0LNT2 Uncharacterized protein7.7e-12081.51Show/hide
Query:  MGCGNSKLNPEGEMIPPRIRPLHVRNKFLELRKRKNGTHLRDGALSKKVLLKDGESEEENPMLVNNRHACGSTKCLASQQHT-NNNATKDEHNSASTIPP
        MGCGNSKLNP GE++PPRIRPLHVRNK LELRKRKNGTHLRDGALSKKVLLKDGESEEEN M V+NRH CGSTKCLASQQHT NNNATKDEHNSASTIPP
Subjt:  MGCGNSKLNPEGEMIPPRIRPLHVRNKFLELRKRKNGTHLRDGALSKKVLLKDGESEEENPMLVNNRHACGSTKCLASQQHT-NNNATKDEHNSASTIPP

Query:  SNNNANATKNCEQSNH----------------------------KTMNEHKCIQEGDENNKKE-EDGRPDNEENRGSFIYPGSPSFRFYFVEETLDNKEK
        SNNNA+ATK  EQSNH                            KTMNEHKCIQEGDENNKKE EDGRPDNEENRGSFI PGSPSFR YFVEET D+KEK
Subjt:  SNNNANATKNCEQSNH----------------------------KTMNEHKCIQEGDENNKKE-EDGRPDNEENRGSFIYPGSPSFRFYFVEETLDNKEK

Query:  VEMKDAGGVGDVSHKKSPSHDSVESTTSAKSGEGQENKVIKKGKKGTTFNRVISKKRPVSVGVKNLLNIKSCYHLSCSGNDRANLLARKAEA
        VEMKDA G+GDVSHKKSPS DSVESTTSAK  EGQENK IKKGKK TTFNRV+SKKRPVSVGVKNLLN+KSCYHLSCSGNDRANLLARKAEA
Subjt:  VEMKDAGGVGDVSHKKSPSHDSVESTTSAKSGEGQENKVIKKGKKGTTFNRVISKKRPVSVGVKNLLNIKSCYHLSCSGNDRANLLARKAEA

A0A1S3CLT6 uncharacterized protein LOC1035022901.7e-12783.84Show/hide
Query:  MEEERQMGCGNSKLNPEGEMIPPRIRPLHVRNKFLELRKRKNGTHLRDGALSKKVLLKDGESEEENPMLVNNRHACGSTKCLASQQHTNNNATKDEHNSA
        MEEERQMGCGNSKLNPEGE++PPRIRPL VRNKFLELRKRKNGTHLRDGALSKKVLLK+GESEEEN M V+NRH CGSTKCLASQQHT NNATKDEHNSA
Subjt:  MEEERQMGCGNSKLNPEGEMIPPRIRPLHVRNKFLELRKRKNGTHLRDGALSKKVLLKDGESEEENPMLVNNRHACGSTKCLASQQHTNNNATKDEHNSA

Query:  STIPPSNNNANATKNCEQSNH----------------------------KTMNEHKCIQEGDENNKKE-EDGRPDNEENRGSFIYPGSPSFRFYFVEETL
        STIPPSNN+ANATKN EQSN                             KTMNEHKCIQEGDENNKKE EDGRPDNEENRGS I PGSPSFRFYFVEET 
Subjt:  STIPPSNNNANATKNCEQSNH----------------------------KTMNEHKCIQEGDENNKKE-EDGRPDNEENRGSFIYPGSPSFRFYFVEETL

Query:  DNKEKVEMKDAGGVGDVSHKKSPSHDSVESTTSAKSGEGQENKVIKKGKKGTTFNRVISKKRPVSVGVKNLLNIKSCYHLSCSGNDRANLLARKAEA
        D+KEKVEMKDAGG+GDVSHKKSPSHDSVESTTSAKSGEGQENKVIKKGKKGTTFNRVISK+RPVSVGVKNLLN+KSCYHLSCSGNDRANLLARKAEA
Subjt:  DNKEKVEMKDAGGVGDVSHKKSPSHDSVESTTSAKSGEGQENKVIKKGKKGTTFNRVISKKRPVSVGVKNLLNIKSCYHLSCSGNDRANLLARKAEA

A0A5D3BHA4 ABC transporter F family member 4-like3.0e-12483.51Show/hide
Query:  MGCGNSKLNPEGEMIPPRIRPLHVRNKFLELRKRKNGTHLRDGALSKKVLLKDGESEEENPMLVNNRHACGSTKCLASQQHTNNNATKDEHNSASTIPPS
        MGCGNSKLNPEGE++PPRIRPL VRNKFLELRKRKNGTHLRDGALSKKVLLK+GESEEEN M V+NRH CGSTKCLASQQHT NNATKDEHNSASTIPPS
Subjt:  MGCGNSKLNPEGEMIPPRIRPLHVRNKFLELRKRKNGTHLRDGALSKKVLLKDGESEEENPMLVNNRHACGSTKCLASQQHTNNNATKDEHNSASTIPPS

Query:  NNNANATKNCEQSNH----------------------------KTMNEHKCIQEGDENNKKE-EDGRPDNEENRGSFIYPGSPSFRFYFVEETLDNKEKV
        NN+ANATKN EQSN                             KTMNEHKCIQEGDENNKKE EDGRPDNEENRGS I PGSPSFRFYFVEET D+KEKV
Subjt:  NNNANATKNCEQSNH----------------------------KTMNEHKCIQEGDENNKKE-EDGRPDNEENRGSFIYPGSPSFRFYFVEETLDNKEKV

Query:  EMKDAGGVGDVSHKKSPSHDSVESTTSAKSGEGQENKVIKKGKKGTTFNRVISKKRPVSVGVKNLLNIKSCYHLSCSGNDRANLLARKAEA
        EMKDAGG+GDVSHKKSPSHDSVESTTSAKSGEGQENKVIKKGKKGTTFNRVISK+RPVSVGVKNLLN+KSCYHLSCSGNDRANLLARKAEA
Subjt:  EMKDAGGVGDVSHKKSPSHDSVESTTSAKSGEGQENKVIKKGKKGTTFNRVISKKRPVSVGVKNLLNIKSCYHLSCSGNDRANLLARKAEA

A0A6J1E3W4 uncharacterized protein LOC1114297129.9e-6759.34Show/hide
Query:  MGCGNSKLNPEGEMIPPRIRPLHVRNKFLELRKRKNGTHLRDGALSKKVLLKDGESEEENPML-VNNRHAC-----GSTKCLASQQHTNNNATKDEHNSA
        MGCGNSKL PEGE I P IRPL  R KF E RKRKNGTHLR+ ALSKKVLLK+GE EEEN +L V+NR++      G T CL    HT      DEH+S 
Subjt:  MGCGNSKLNPEGEMIPPRIRPLHVRNKFLELRKRKNGTHLRDGALSKKVLLKDGESEEENPML-VNNRHAC-----GSTKCLASQQHTNNNATKDEHNSA

Query:  ST---IPPSNNNANATKNCEQSNHKTMNEHK-CIQEGDENNKKE-EDGRPDNEENRGSFIYPGSPSFRFYFVEETLDNKEKVEMKDAGGVGDVSHKKSPS
        +    +     N N T+  E   +KTMN  +  ++EGD+ NK+E E+GRPDNE+NR   I PGSPSFR YFVE+T ++K+ VEM D G + D S KKSPS
Subjt:  ST---IPPSNNNANATKNCEQSNHKTMNEHK-CIQEGDENNKKE-EDGRPDNEENRGSFIYPGSPSFRFYFVEETLDNKEKVEMKDAGGVGDVSHKKSPS

Query:  HDSVESTTSAKSGEGQENKVIKKGKKGTTFNRVISKKRPVSVGVKNLLNIKS-CYHLSCSGNDRANLLARKAE
         DSVES++S KS EGQENK IKKGKKGTT NR  S++RPV VG+K+LLN+ + CYHLSC+GNDR N LARKAE
Subjt:  HDSVESTTSAKSGEGQENKVIKKGKKGTTFNRVISKKRPVSVGVKNLLNIKS-CYHLSCSGNDRANLLARKAE

A0A6J1JK15 uncharacterized protein LOC111485806 isoform X29.2e-6558.24Show/hide
Query:  MGCGNSKLNPEGEMIPPRIRPLHVRNKFLELRKRKNGTHLRDGALSKKVLLKDGESEEENPML-VNNRHAC-----GSTKCLASQQHTNNNATKDEHNSA
        MGCGNSKL PEGE I P IRPL  R KF E RKRKNGTHLRD ALSKKVLL +GE EEEN +L V+NR+       G T CL    HT     KDEH+S 
Subjt:  MGCGNSKLNPEGEMIPPRIRPLHVRNKFLELRKRKNGTHLRDGALSKKVLLKDGESEEENPML-VNNRHAC-----GSTKCLASQQHTNNNATKDEHNSA

Query:  STIPPSNN---NANATKNCEQSNHKTMNEHKCIQEGDENNKKE-EDGRPDNEENRGSFIYPGSPSFRFYFVEETLDNKEKVEMKDAGGVGDVSHKKSPSH
        +   P      N N T+   + +    N+HK ++EGD+ +K E E+GRPDNE+NR   I PGSPSFR YFVE+T + K+ VEM D G + D S KKSPS 
Subjt:  STIPPSNN---NANATKNCEQSNHKTMNEHKCIQEGDENNKKE-EDGRPDNEENRGSFIYPGSPSFRFYFVEETLDNKEKVEMKDAGGVGDVSHKKSPSH

Query:  DSVESTTSAKSGEGQENKVIKKGKKGTTFNRVISKKRPVSVGVKNLLNIKS-CYHLSCSGNDRANLLARKAEA
        DSVEST+S KS E QE K IKKGKKGTT NR  S+KRPV VG+K+LLN+ + CYHLSC+GNDR N L  KAE+
Subjt:  DSVESTTSAKSGEGQENKVIKKGKKGTTFNRVISKKRPVSVGVKNLLNIKS-CYHLSCSGNDRANLLARKAEA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G50830.1 unknown protein9.0e-0427.3Show/hide
Query:  MGCGNSKLN-------PEGEMI--PPRIRPLHVRNKFLELRKRKNGTHLRDG-ALSKKVLLKDGESEEENPMLVNNRHACGSTKCLASQQH---------
        MGCG S+L         EG ++  P  IRPL +R +  E++KR +   L+    LSKK LL+   SE+      N+     S K   +  H         
Subjt:  MGCGNSKLN-------PEGEMI--PPRIRPLHVRNKFLELRKRKNGTHLRDG-ALSKKVLLKDGESEEENPMLVNNRHACGSTKCLASQQH---------

Query:  -----------------TNNNATKDEHNSASTIPPSNNNANATKNCEQSNHK----TMNEHKCIQEGDE-NNKKEEDGRPDNEENRGSFIYPGSPSFRFY
                           N   +D H+          N    K  E+ NH      +N  K   EGD+  N   ++G  +N + R   I PGSPSFR Y
Subjt:  -----------------TNNNATKDEHNSASTIPPSNNNANATKNCEQSNHK----TMNEHKCIQEGDE-NNKKEEDGRPDNEENRGSFIYPGSPSFRFY

Query:  FVE-ETLDNKEKVEMKDAGGVGDVSHKKSPSHDSVESTTSAKSGEGQENKVIKKGKKGTTFNRVISKKRPVSVGVKNLLNIKS-CY-HLSCSGNDRANLL
         V+  + D+ E+ +++DA        +KS   +SV  TT  K    ++  ++KK KK         K+  +++  K L N+ + CY    C GN  + L+
Subjt:  FVE-ETLDNKEKVEMKDAGGVGDVSHKKSPSHDSVESTTSAKSGEGQENKVIKKGKKGTTFNRVISKKRPVSVGVKNLLNIKS-CY-HLSCSGNDRANLL

Query:  ARKA
          K+
Subjt:  ARKA

AT5G50830.2 unknown protein1.4e-0427.39Show/hide
Query:  MGCGNSKLN-------PEGEMI--PPRIRPLHVRNKFLELRKRKNGTHLRDG-ALSKKVLLKDGESEEENPMLVNNRHACGSTKCLASQQH---------
        MGCG S+L         EG ++  P  IRPL +R +  E++KR +   L+    LSKK LL+   SE+      N+     S K   +  H         
Subjt:  MGCGNSKLN-------PEGEMI--PPRIRPLHVRNKFLELRKRKNGTHLRDG-ALSKKVLLKDGESEEENPMLVNNRHACGSTKCLASQQH---------

Query:  -----------------TNNNATKDEHNSASTIPPSNNNANATKNCEQSNHK----TMNEHKCIQEGDE-NNKKEEDGRPDNEENRGSFIYPGSPSFRFY
                           N   +D H+          N    K  E+ NH      +N  K   EGD+  N   ++G  +N + R   I PGSPSFR Y
Subjt:  -----------------TNNNATKDEHNSASTIPPSNNNANATKNCEQSNHK----TMNEHKCIQEGDE-NNKKEEDGRPDNEENRGSFIYPGSPSFRFY

Query:  FVE-ETLDNKEKVEMKDAGGVGDVSHKKSPSHDSVESTTSAKSGEGQENKVIKKGKKGTTFNRVISKKRPVSVGVKNLLNIKSCY-HLSCSGNDRANLLA
         V+  + D+ E+ +++DA        +KS   +SV  TT  K  +G   K  KK ++G  F   + +K   +V          CY    C GN  + L+ 
Subjt:  FVE-ETLDNKEKVEMKDAGGVGDVSHKKSPSHDSVESTTSAKSGEGQENKVIKKGKKGTTFNRVISKKRPVSVGVKNLLNIKSCY-HLSCSGNDRANLLA

Query:  RKA
         K+
Subjt:  RKA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGAAGAGAGGCAAATGGGGTGTGGAAACTCCAAGCTTAATCCAGAAGGAGAGATGATTCCTCCAAGGATTCGCCCACTTCATGTACGAAATAAATTTTTGGAGTT
GAGGAAACGTAAGAATGGAACCCATCTTAGAGATGGAGCTTTGTCAAAGAAAGTGCTTCTGAAAGATGGAGAATCAGAAGAAGAGAACCCTATGCTTGTCAATAACAGAC
ATGCGTGTGGCAGCACAAAATGTTTGGCCTCACAACAACATACCAACAATAATGCAACCAAAGATGAACATAATTCAGCTTCAACTATTCCCCCATCCAACAACAATGCT
AATGCAACAAAAAATTGTGAACAAAGCAACCACAAAACCATGAATGAACATAAATGTATTCAAGAAGGAGATGAAAACAACAAGAAAGAAGAAGATGGGAGGCCTGACAA
CGAAGAGAATCGAGGAAGCTTCATTTATCCTGGATCTCCCAGTTTCAGATTTTATTTTGTTGAAGAAACACTAGACAACAAAGAAAAAGTTGAAATGAAAGATGCAGGTG
GTGTGGGAGATGTCTCACACAAGAAGTCGCCAAGTCATGACAGCGTTGAGAGCACAACTAGTGCAAAATCTGGCGAGGGCCAGGAGAACAAGGTAATAAAGAAAGGGAAA
AAAGGAACGACTTTCAATAGAGTCATCAGTAAAAAAAGACCAGTTAGCGTTGGTGTCAAGAATTTGTTGAATATTAAATCTTGCTATCATTTGAGTTGTTCCGGCAATGA
CAGAGCCAATCTTCTAGCTAGAAAAGCAGAAGCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAAGAAGAGAGGCAAATGGGGTGTGGAAACTCCAAGCTTAATCCAGAAGGAGAGATGATTCCTCCAAGGATTCGCCCACTTCATGTACGAAATAAATTTTTGGAGTT
GAGGAAACGTAAGAATGGAACCCATCTTAGAGATGGAGCTTTGTCAAAGAAAGTGCTTCTGAAAGATGGAGAATCAGAAGAAGAGAACCCTATGCTTGTCAATAACAGAC
ATGCGTGTGGCAGCACAAAATGTTTGGCCTCACAACAACATACCAACAATAATGCAACCAAAGATGAACATAATTCAGCTTCAACTATTCCCCCATCCAACAACAATGCT
AATGCAACAAAAAATTGTGAACAAAGCAACCACAAAACCATGAATGAACATAAATGTATTCAAGAAGGAGATGAAAACAACAAGAAAGAAGAAGATGGGAGGCCTGACAA
CGAAGAGAATCGAGGAAGCTTCATTTATCCTGGATCTCCCAGTTTCAGATTTTATTTTGTTGAAGAAACACTAGACAACAAAGAAAAAGTTGAAATGAAAGATGCAGGTG
GTGTGGGAGATGTCTCACACAAGAAGTCGCCAAGTCATGACAGCGTTGAGAGCACAACTAGTGCAAAATCTGGCGAGGGCCAGGAGAACAAGGTAATAAAGAAAGGGAAA
AAAGGAACGACTTTCAATAGAGTCATCAGTAAAAAAAGACCAGTTAGCGTTGGTGTCAAGAATTTGTTGAATATTAAATCTTGCTATCATTTGAGTTGTTCCGGCAATGA
CAGAGCCAATCTTCTAGCTAGAAAAGCAGAAGCTTAA
Protein sequenceShow/hide protein sequence
MEEERQMGCGNSKLNPEGEMIPPRIRPLHVRNKFLELRKRKNGTHLRDGALSKKVLLKDGESEEENPMLVNNRHACGSTKCLASQQHTNNNATKDEHNSASTIPPSNNNA
NATKNCEQSNHKTMNEHKCIQEGDENNKKEEDGRPDNEENRGSFIYPGSPSFRFYFVEETLDNKEKVEMKDAGGVGDVSHKKSPSHDSVESTTSAKSGEGQENKVIKKGK
KGTTFNRVISKKRPVSVGVKNLLNIKSCYHLSCSGNDRANLLARKAEA