CuGenDBv2

HG10021933 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10021933
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Transmembrane protein
Genome location: Chr05:18570304..18571113
RNA-Seq Expression: HG10021933
Synteny: HG10021933
Gene Ontology terms:
  GO:0006914 - autophagy (biological process)
  GO:0042594 - response to starvation (biological process)
  GO:0005737 - cytoplasm (cellular component)
  GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA
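
The genome location above can be turned into a span length with a small helper. This is only a sketch: the function names `parse_location` and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'Chr:start..end' string into (chromosome, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("Chr05:18570304..18571113")
print(chrom, start, end, span_length(start, end))
# prints: Chr05 18570304 18571113 810
```

Under that assumption the gene spans 810 bp on Chr05.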


Homology
GenBank top hits (e value, % identity, alignment)
KGN66119.2 hypothetical protein Csa_019878 [Cucumis sativus] | e value: 9.1e-99 | % identity: 83.13
Query:  MASTSWLYIGLGIILGFLFLGILAELYYLLWQKKKRINNTPEVED---NHYTKQPSLHLIPSETTNPSENHHGFDLELGSGEDLLLKTTEAEEEEDEEET
        MAS+SWLYIGLGIILGFLFLGIL ELYYLL +K KRIN+T EVE+   +H++KQPSL LIPSET NP +N  GFD+E GSGE  LLKTT  EEE++ EET
Subjt:  MASTSWLYIGLGIILGFLFLGILAELYYLLWQKKKRINNTPEVED---NHYTKQPSLHLIPSETTNPSENHHGFDLELGSGEDLLLKTTEAEEEEDEEET

Query:  EEDLELQGIYNLAGQPRFLFTINEETKEDLESEDGKSRKGSRNRSLSDIITAIETPFFTPMASPPLKA-SPMG-SLESYNYKLHGFNPLFESSEELEQNL
        E+DLELQGIYNLAGQPRFLFTINEETKEDLESEDGKSRKGSRNRSLSDIITAIETPFFTPMASPPLKA SP+  +LE+YNYK+HGFNPLFESSEE+EQNL
Subjt:  EEDLELQGIYNLAGQPRFLFTINEETKEDLESEDGKSRKGSRNRSLSDIITAIETPFFTPMASPPLKA-SPMG-SLESYNYKLHGFNPLFESSEELEQNL

Query:  KRLRSSPPPKFKFLRDAEEKLYRRLMEEA-QRKTEVTKNEQRK
        KRLRSSPPPKFKFLRDAEEKLYRRLMEEA QRKTE+ KNEQRK
Subjt:  KRLRSSPPPKFKFLRDAEEKLYRRLMEEA-QRKTEVTKNEQRK

XP_004150184.1 uncharacterized protein LOC101218094 [Cucumis sativus] | e value: 5.7e-109 | % identity: 82.66
Query:  MASTSWLYIGLGIILGFLFLGILAELYYLLWQKKKRINNTPEVED---NHYTKQPSLHLIPSETTNPSENHHGFDLELGSGEDLLLKTTEAEEEEDEEET
        MAS+SWLYIGLGIILGFLFLGIL ELYYLL +K KRIN+T EVE+   +H++KQPSL LIPSET NP +N  GFD+E GSGE  LLKTT  EEE++ EET
Subjt:  MASTSWLYIGLGIILGFLFLGILAELYYLLWQKKKRINNTPEVED---NHYTKQPSLHLIPSETTNPSENHHGFDLELGSGEDLLLKTTEAEEEEDEEET

Query:  EEDLELQGIYNLAGQPRFLFTINEETKEDLESEDGKSRKGSRNRSLSDIITAIETPFFTPMASPPLKA-SPMG-SLESYNYKLHGFNPLFESSEELEQNL
        E+DLELQGIYNLAGQPRFLFTINEETKEDLESEDGKSRKGSRNRSLSDIITAIETPFFTPMASPPLKA SP+  +LE+YNYK+HGFNPLFESSEE+EQNL
Subjt:  EEDLELQGIYNLAGQPRFLFTINEETKEDLESEDGKSRKGSRNRSLSDIITAIETPFFTPMASPPLKA-SPMG-SLESYNYKLHGFNPLFESSEELEQNL

Query:  KRLRSSPPPKFKFLRDAEEKLYRRLMEEA-QRKTEVTKNEQRKAINNNNNNQRFHYSSSSSQVLPLISSPP
        KRLRSSPPPKFKFLRDAEEKLYRRLMEEA QRKTE+ KNEQRK I   NNNQ+F YSSSSSQVLPLISSPP
Subjt:  KRLRSSPPPKFKFLRDAEEKLYRRLMEEA-QRKTEVTKNEQRKAINNNNNNQRFHYSSSSSQVLPLISSPP

XP_008457488.1 PREDICTED: uncharacterized protein LOC103497168 [Cucumis melo] | e value: 2.0e-114 | % identity: 83.7
Query:  MASTSWLYIGLGIILGFLFLGILAELYYLLWQKKKRINNTPEVED---NHYTKQPSLHLIPSETTNPSENHHGFDLELGSGEDLLLKTTEAEEEEDEEET
        MAS+SWLYIGLGI+LGFLFLGIL ELYYLL +K KRIN+T EVE+   +H+TKQPS  LIP+ET NP EN  GFD ELGSGE  LLKTT  EEE+++EET
Subjt:  MASTSWLYIGLGIILGFLFLGILAELYYLLWQKKKRINNTPEVED---NHYTKQPSLHLIPSETTNPSENHHGFDLELGSGEDLLLKTTEAEEEEDEEET

Query:  EEDLELQGIYNLAGQPRFLFTINEETKEDLESEDGKSRKGSRNRSLSDIITAIETPFFTPMASPPLKA-SPMG-SLESYNYKLHGFNPLFESSEELEQNL
        E+DLELQGIYNLAGQPRFLFTINEETKEDLESEDGKSRKGSRNRSLSDII AIETPFFTPMASPPLKA SP+  +LESYNYK+HGFNPLFESSEELEQNL
Subjt:  EEDLELQGIYNLAGQPRFLFTINEETKEDLESEDGKSRKGSRNRSLSDIITAIETPFFTPMASPPLKA-SPMG-SLESYNYKLHGFNPLFESSEELEQNL

Query:  KRLRSSPPPKFKFLRDAEEKLYRRLMEEAQRKTEVTKNEQRKAI--NNNNNNQRFHYSSSSSQVLPLISSPPEAFS
        KRLRSSPPPKFKFLRDAEEKLYRRLMEEAQRKTE+ KNEQRK I  NNNNNNQRF YSSSSSQVLPLISSPP  FS
Subjt:  KRLRSSPPPKFKFLRDAEEKLYRRLMEEAQRKTEVTKNEQRKAI--NNNNNNQRFHYSSSSSQVLPLISSPPEAFS

XP_023523159.1 uncharacterized protein LOC111787432 [Cucurbita pepo subsp. pepo] | e value: 7.0e-91 | % identity: 71.79
Query:  MASTSWLYIGLGIILGFLFLGILAELYYLLWQKKKRINNTPEVEDNHYTKQPSLHLIPSETT-------NPSENHHGFDLELGSGEDLLLKTTEAEEEED
        MAS +WLYIGLG+ LGFLFLGI+AELYYLLW+  KRIN T EV+ + Y K  SL LIPSET        NP EN HG DLE GSGEDLLLKTT AE++E 
Subjt:  MASTSWLYIGLGIILGFLFLGILAELYYLLWQKKKRINNTPEVEDNHYTKQPSLHLIPSETT-------NPSENHHGFDLELGSGEDLLLKTTEAEEEED

Query:  EEETEEDLELQGIYNLAGQPRFLFTINEETKEDLESEDGK-----SRKGSRNRSLSDIITAIETPFFTPMASPPLKASPMGSLESYNYKLHGFNPLFESS
        +E  EED+EL  IYNLAGQPRFLFTI EET+EDLESED K     SRKGSRNRSLSDIITAIETPFFTP+ASPPLKASP+ SL+S  YKLH FNPLFESS
Subjt:  EEETEEDLELQGIYNLAGQPRFLFTINEETKEDLESEDGK-----SRKGSRNRSLSDIITAIETPFFTPMASPPLKASPMGSLESYNYKLHGFNPLFESS

Query:  EELEQNLKRLRSSPPPKFKFLRDAEEKLYRRLMEEAQRKTEV--TKNEQRKAINNNNNNQRFHYSSSSSQVLPLISSPPE
         ELEQNL R+RSSPPPKFKFLRDAEEKLYRRLMEEAQ++ E+  TKNEQRK              SSSSQVLPL+SSPPE
Subjt:  EELEQNLKRLRSSPPPKFKFLRDAEEKLYRRLMEEAQRKTEV--TKNEQRKAINNNNNNQRFHYSSSSSQVLPLISSPPE

XP_038893882.1 uncharacterized protein LOC120082683 [Benincasa hispida] | e value: 1.3e-121 | % identity: 90.66
Query:  MASTSWLYIGLGIILGFLFLGILAELYYLLWQKKKRINNTPEVEDNHYTKQPSLHLIPSETTNPSENHHGFDLELGSGEDLLLKTTEAEEEEDEEETEED
        MAS+SWLYIGLGIILGFLFLGILAELYYLLW+KKKRINN PEVEDNH+TKQPSLHLIPSET NP EN  GFDLELGSGED LLKTT  EEE+DEEETE+D
Subjt:  MASTSWLYIGLGIILGFLFLGILAELYYLLWQKKKRINNTPEVEDNHYTKQPSLHLIPSETTNPSENHHGFDLELGSGEDLLLKTTEAEEEEDEEETEED

Query:  LELQGIYNLAGQPRFLFTINEETKEDLESEDGKSRKGSRNRSLSDIITAIETPFFTPMASPPLKASPMGSLESYNYKLHGFNPLFESSEELEQNLKRLRS
        LELQGIYNLAGQPRFLFTINEETKEDLESEDGKSRKGSRNRSLSDII AI+TPFFTPMASPPLKASP+ SLESYNYKLHGFNPLFESS+ELEQNLKRLRS
Subjt:  LELQGIYNLAGQPRFLFTINEETKEDLESEDGKSRKGSRNRSLSDIITAIETPFFTPMASPPLKASPMGSLESYNYKLHGFNPLFESSEELEQNLKRLRS

Query:  SPPPKFKFLRDAEEKLYRRLMEEAQRKTEVTKNEQRKAINNNNNNQRFHYSSSSSQV
        SPPPKFKFLRDAEEKLYRRLMEEAQ+KTEV+KNEQRK I  NNNNQRF YSSSSSQV
Subjt:  SPPPKFKFLRDAEEKLYRRLMEEAQRKTEVTKNEQRKAINNNNNNQRFHYSSSSSQV

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LXD2 Uncharacterized protein | e value: 2.8e-109 | % identity: 82.66
Query:  MASTSWLYIGLGIILGFLFLGILAELYYLLWQKKKRINNTPEVED---NHYTKQPSLHLIPSETTNPSENHHGFDLELGSGEDLLLKTTEAEEEEDEEET
        MAS+SWLYIGLGIILGFLFLGIL ELYYLL +K KRIN+T EVE+   +H++KQPSL LIPSET NP +N  GFD+E GSGE  LLKTT  EEE++ EET
Subjt:  MASTSWLYIGLGIILGFLFLGILAELYYLLWQKKKRINNTPEVED---NHYTKQPSLHLIPSETTNPSENHHGFDLELGSGEDLLLKTTEAEEEEDEEET

Query:  EEDLELQGIYNLAGQPRFLFTINEETKEDLESEDGKSRKGSRNRSLSDIITAIETPFFTPMASPPLKA-SPMG-SLESYNYKLHGFNPLFESSEELEQNL
        E+DLELQGIYNLAGQPRFLFTINEETKEDLESEDGKSRKGSRNRSLSDIITAIETPFFTPMASPPLKA SP+  +LE+YNYK+HGFNPLFESSEE+EQNL
Subjt:  EEDLELQGIYNLAGQPRFLFTINEETKEDLESEDGKSRKGSRNRSLSDIITAIETPFFTPMASPPLKA-SPMG-SLESYNYKLHGFNPLFESSEELEQNL

Query:  KRLRSSPPPKFKFLRDAEEKLYRRLMEEA-QRKTEVTKNEQRKAINNNNNNQRFHYSSSSSQVLPLISSPP
        KRLRSSPPPKFKFLRDAEEKLYRRLMEEA QRKTE+ KNEQRK I   NNNQ+F YSSSSSQVLPLISSPP
Subjt:  KRLRSSPPPKFKFLRDAEEKLYRRLMEEA-QRKTEVTKNEQRKAINNNNNNQRFHYSSSSSQVLPLISSPP

A0A1S3C577 uncharacterized protein LOC103497168 | e value: 9.8e-115 | % identity: 83.7
Query:  MASTSWLYIGLGIILGFLFLGILAELYYLLWQKKKRINNTPEVED---NHYTKQPSLHLIPSETTNPSENHHGFDLELGSGEDLLLKTTEAEEEEDEEET
        MAS+SWLYIGLGI+LGFLFLGIL ELYYLL +K KRIN+T EVE+   +H+TKQPS  LIP+ET NP EN  GFD ELGSGE  LLKTT  EEE+++EET
Subjt:  MASTSWLYIGLGIILGFLFLGILAELYYLLWQKKKRINNTPEVED---NHYTKQPSLHLIPSETTNPSENHHGFDLELGSGEDLLLKTTEAEEEEDEEET

Query:  EEDLELQGIYNLAGQPRFLFTINEETKEDLESEDGKSRKGSRNRSLSDIITAIETPFFTPMASPPLKA-SPMG-SLESYNYKLHGFNPLFESSEELEQNL
        E+DLELQGIYNLAGQPRFLFTINEETKEDLESEDGKSRKGSRNRSLSDII AIETPFFTPMASPPLKA SP+  +LESYNYK+HGFNPLFESSEELEQNL
Subjt:  EEDLELQGIYNLAGQPRFLFTINEETKEDLESEDGKSRKGSRNRSLSDIITAIETPFFTPMASPPLKA-SPMG-SLESYNYKLHGFNPLFESSEELEQNL

Query:  KRLRSSPPPKFKFLRDAEEKLYRRLMEEAQRKTEVTKNEQRKAI--NNNNNNQRFHYSSSSSQVLPLISSPPEAFS
        KRLRSSPPPKFKFLRDAEEKLYRRLMEEAQRKTE+ KNEQRK I  NNNNNNQRF YSSSSSQVLPLISSPP  FS
Subjt:  KRLRSSPPPKFKFLRDAEEKLYRRLMEEAQRKTEVTKNEQRKAI--NNNNNNQRFHYSSSSSQVLPLISSPPEAFS

A0A5D3DEB1 Uncharacterized protein | e value: 9.8e-115 | % identity: 83.7
Query:  MASTSWLYIGLGIILGFLFLGILAELYYLLWQKKKRINNTPEVED---NHYTKQPSLHLIPSETTNPSENHHGFDLELGSGEDLLLKTTEAEEEEDEEET
        MAS+SWLYIGLGI+LGFLFLGIL ELYYLL +K KRIN+T EVE+   +H+TKQPS  LIP+ET NP EN  GFD ELGSGE  LLKTT  EEE+++EET
Subjt:  MASTSWLYIGLGIILGFLFLGILAELYYLLWQKKKRINNTPEVED---NHYTKQPSLHLIPSETTNPSENHHGFDLELGSGEDLLLKTTEAEEEEDEEET

Query:  EEDLELQGIYNLAGQPRFLFTINEETKEDLESEDGKSRKGSRNRSLSDIITAIETPFFTPMASPPLKA-SPMG-SLESYNYKLHGFNPLFESSEELEQNL
        E+DLELQGIYNLAGQPRFLFTINEETKEDLESEDGKSRKGSRNRSLSDII AIETPFFTPMASPPLKA SP+  +LESYNYK+HGFNPLFESSEELEQNL
Subjt:  EEDLELQGIYNLAGQPRFLFTINEETKEDLESEDGKSRKGSRNRSLSDIITAIETPFFTPMASPPLKA-SPMG-SLESYNYKLHGFNPLFESSEELEQNL

Query:  KRLRSSPPPKFKFLRDAEEKLYRRLMEEAQRKTEVTKNEQRKAI--NNNNNNQRFHYSSSSSQVLPLISSPPEAFS
        KRLRSSPPPKFKFLRDAEEKLYRRLMEEAQRKTE+ KNEQRK I  NNNNNNQRF YSSSSSQVLPLISSPP  FS
Subjt:  KRLRSSPPPKFKFLRDAEEKLYRRLMEEAQRKTEVTKNEQRKAI--NNNNNNQRFHYSSSSSQVLPLISSPPEAFS

A0A6J1G9H1 uncharacterized protein LOC111452187 | e value: 2.4e-89 | % identity: 70.71
Query:  MASTSWLYIGLGIILGFLFLGILAELYYLLWQKKKRINNTPEVEDNHYTKQPSLHLIPSETT-------NPSENHHGFDLELGSGEDLLLKTTEAEEEED
        MAS +WLYIGLG+ LGFLFLGI+AELYYLLW+  KRIN T  V+ + Y K   L LIPSET        NP EN HG DLE GSGEDLLLKT  AE++E 
Subjt:  MASTSWLYIGLGIILGFLFLGILAELYYLLWQKKKRINNTPEVEDNHYTKQPSLHLIPSETT-------NPSENHHGFDLELGSGEDLLLKTTEAEEEED

Query:  EEETEEDLELQGIYNLAGQPRFLFTINEETKEDLESEDGK-----SRKGSRNRSLSDIITAIETPFFTPMASPPLKASPMGSLESYNYKLHGFNPLFESS
        +E  EED+EL  IYNLAGQPRFLFTI EET+EDLESED K     SRKGSRNRSLSDIITAIETPFFTP+ASPPLKASP+ SL+S  YKLH FNPLFESS
Subjt:  EEETEEDLELQGIYNLAGQPRFLFTINEETKEDLESEDGK-----SRKGSRNRSLSDIITAIETPFFTPMASPPLKASPMGSLESYNYKLHGFNPLFESS

Query:  EELEQNLKRLRSSPPPKFKFLRDAEEKLYRRLMEEAQRKTEV--TKNEQRKAINNNNNNQRFHYSSSSSQVLPLISSPPE
         ELEQNL R+RSSPPPKFKFLRDAEEKLYRRLMEEAQ++ E+  TKNEQRK              SSSSQVLPL+SSPPE
Subjt:  EELEQNLKRLRSSPPPKFKFLRDAEEKLYRRLMEEAQRKTEV--TKNEQRKAINNNNNNQRFHYSSSSSQVLPLISSPPE

A0A6J1KBC0 uncharacterized protein LOC111492746 | e value: 1.3e-87 | % identity: 69.64
Query:  MASTSWLYIGLGIILGFLFLGILAELYYLLWQKKKRINNTPEVEDNHYTKQPSLHLIPSETT-------NPSENHHGFDLELGSGEDLLLKTTEAEEEED
        M S  WLYIGLG+ LGFLFLGI+AELYYLLW+  +RIN T E+E + Y K      IPSET        NP EN HG DLE GSGEDLLLKTT AE+ E 
Subjt:  MASTSWLYIGLGIILGFLFLGILAELYYLLWQKKKRINNTPEVEDNHYTKQPSLHLIPSETT-------NPSENHHGFDLELGSGEDLLLKTTEAEEEED

Query:  EEETEEDLELQGIYNLAGQPRFLFTINEETKEDLESEDGK-----SRKGSRNRSLSDIITAIETPFFTPMASPPLKASPMGSLESYNYKLHGFNPLFESS
        +E  +ED+EL  IYNLAGQPRFLFTI EET+EDLESED K     SRKGSRNRSLSDIITAIETPFFTP+ASPPLKASP+ SL+S  YKLH FNPLFESS
Subjt:  EEETEEDLELQGIYNLAGQPRFLFTINEETKEDLESEDGK-----SRKGSRNRSLSDIITAIETPFFTPMASPPLKASPMGSLESYNYKLHGFNPLFESS

Query:  EELEQNLKRLRSSPPPKFKFLRDAEEKLYRRLMEEAQRKTEV--TKNEQRKAINNNNNNQRFHYSSSSSQVLPLISSPPE
         +LEQNL R+RSSPPPKFKFLRDAEEKLYRRLMEEAQ++ EV  TKNEQRK              SSSSQVLPL+SSPPE
Subjt:  EELEQNLKRLRSSPPPKFKFLRDAEEKLYRRLMEEAQRKTEV--TKNEQRKAINNNNNNQRFHYSSSSSQVLPLISSPPE

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT2G39560.1 Putative membrane lipoprotein | e value: 2.1e-21 | % identity: 36.44
Query:  MASTSWLYIGLGIILGFLFLGILAELYYLLWQKKKRINNTPEVEDNHY---TKQPSLHLIPSETTNPSENHHGFDLELGSGEDLLLKTTEAEEEEDEEET
        M S S + + L I+ G L L +LAELYYLLW KK+     P+  +++    T++       S +TNPS +                 ++ +       +T
Subjt:  MASTSWLYIGLGIILGFLFLGILAELYYLLWQKKKRINNTPEVEDNHY---TKQPSLHLIPSETTNPSENHHGFDLELGSGEDLLLKTTEAEEEEDEEET

Query:  EEDLEL-QGIYNLAGQ---PRFLFTINEETKEDLESEDGKSRKGSRNRSLSDIITAIET-----PFFTPMASPPLKASPMGSL--ESYNYKLHGFNPLFE
        ++   L  G  N+ G    PRFLFTI EET E++ESED  S KG   +SL+D+   +E+     P+ TP ASP L   P+  L  ES N +    + LFE
Subjt:  EEDLEL-QGIYNLAGQ---PRFLFTINEETKEDLESEDGKSRKGSRNRSLSDIITAIET-----PFFTPMASPPLKASPMGSL--ESYNYKLHGFNPLFE

Query:  SSEELEQN-LKRL---------RSSPPPKFKFLRDAEEKLYRRLMEE
        SS + E N L R           SSP  +FKFLRDAEEKLY++ + E
Subjt:  SSEELEQN-LKRL---------RSSPPPKFKFLRDAEEKLYRRLMEE

AT5G59350.1 unknown protein | e value: 3.0e-31 | % identity: 43.51
Query:  MASTSWLYIGLGIILGFLFLGILAELYYLLWQK--KKRI-------NNTPEVEDNHYTKQPSLHLIPSETTNPSENHHGFDLELGSGEDLLLKTTEAEEE
        M + S L IGL ++ GFL L ++ E+YYLL  K  KKR+           E + N Y K+        +  +  +N+ G + E+   ED      + E  
Subjt:  MASTSWLYIGLGIILGFLFLGILAELYYLLWQK--KKRI-------NNTPEVEDNHYTKQPSLHLIPSETTNPSENHHGFDLELGSGEDLLLKTTEAEEE

Query:  EDEEETEEDLELQGIYNLAGQPRFLFTINEETKEDLESED-GKSRKGSRN--RSLSDIITAIETPFFTPMASPPLKASPMGSLESYNYKLHGFNPLFESS
          +     DL  +         RFLFTI EETK DLESED GKSR GSR+  RSLSD+     TP FTP+ASP   +SP   LESY +  HGFNPLFES 
Subjt:  EDEEETEEDLELQGIYNLAGQPRFLFTINEETKEDLESED-GKSRKGSRN--RSLSDIITAIETPFFTPMASPPLKASPMGSLESYNYKLHGFNPLFESS

Query:  EELEQN-LKRLRSSPPPKFKFLRDAEEKLYRRLMEEAQRK--TEVTKNEQRKAINNNNNNQR
         ELE N   R  SSPPPKFKF+RDAEEKL +RL+EEA+R+  + VT+    K +N   N ++
Subjt:  EELEQN-LKRLRSSPPPKFKFLRDAEEKLYRRLMEEAQRK--TEVTKNEQRKAINNNNNNQR


Sequences
CDS sequence
ATGGCTTCTACCAGTTGGTTGTATATTGGGTTAGGCATAATTTTAGGGTTCCTCTTCTTAGGCATACTTGCTGAGCTTTACTATCTTCTATGGCAGAAGAAAAAAAGAAT
CAACAACACTCCAGAGGTTGAAGACAACCACTATACTAAACAACCTTCCCTTCATCTAATTCCCTCTGAAACCACAAACCCATCTGAGAATCACCATGGATTCGACCTGG
AGCTCGGCTCCGGTGAGGATTTGTTGCTGAAGACGACGGAGGCAGAGGAAGAAGAAGATGAAGAAGAAACAGAGGAAGATTTAGAGTTGCAGGGGATTTACAATCTTGCA
GGGCAACCGAGATTTTTATTCACAATCAATGAGGAAACGAAGGAAGATTTGGAATCAGAAGATGGGAAATCGAGAAAAGGGTCAAGAAATAGAAGTTTAAGTGATATAAT
TACAGCAATTGAAACTCCTTTTTTCACTCCCATGGCTTCTCCGCCATTGAAGGCTTCTCCTATGGGTTCTTTAGAGTCTTACAATTACAAACTCCATGGATTCAATCCTT
TGTTTGAATCATCAGAAGAATTGGAGCAGAATTTGAAGAGGTTGAGATCTTCACCTCCTCCAAAGTTCAAATTTCTTAGAGATGCAGAGGAGAAACTGTACAGAAGGCTA
ATGGAAGAAGCTCAAAGAAAAACAGAGGTTACTAAAAATGAACAGAGAAAGGCAATTAATAATAATAATAATAATCAAAGGTTTCATTATTCCTCAAGCTCTTCTCAGGT
TCTTCCTCTCATTTCTTCTCCTCCTGAGGCCTTCTCTTAA
mRNA sequence
ATGGCTTCTACCAGTTGGTTGTATATTGGGTTAGGCATAATTTTAGGGTTCCTCTTCTTAGGCATACTTGCTGAGCTTTACTATCTTCTATGGCAGAAGAAAAAAAGAAT
CAACAACACTCCAGAGGTTGAAGACAACCACTATACTAAACAACCTTCCCTTCATCTAATTCCCTCTGAAACCACAAACCCATCTGAGAATCACCATGGATTCGACCTGG
AGCTCGGCTCCGGTGAGGATTTGTTGCTGAAGACGACGGAGGCAGAGGAAGAAGAAGATGAAGAAGAAACAGAGGAAGATTTAGAGTTGCAGGGGATTTACAATCTTGCA
GGGCAACCGAGATTTTTATTCACAATCAATGAGGAAACGAAGGAAGATTTGGAATCAGAAGATGGGAAATCGAGAAAAGGGTCAAGAAATAGAAGTTTAAGTGATATAAT
TACAGCAATTGAAACTCCTTTTTTCACTCCCATGGCTTCTCCGCCATTGAAGGCTTCTCCTATGGGTTCTTTAGAGTCTTACAATTACAAACTCCATGGATTCAATCCTT
TGTTTGAATCATCAGAAGAATTGGAGCAGAATTTGAAGAGGTTGAGATCTTCACCTCCTCCAAAGTTCAAATTTCTTAGAGATGCAGAGGAGAAACTGTACAGAAGGCTA
ATGGAAGAAGCTCAAAGAAAAACAGAGGTTACTAAAAATGAACAGAGAAAGGCAATTAATAATAATAATAATAATCAAAGGTTTCATTATTCCTCAAGCTCTTCTCAGGT
TCTTCCTCTCATTTCTTCTCCTCCTGAGGCCTTCTCTTAA
Protein sequence
MASTSWLYIGLGIILGFLFLGILAELYYLLWQKKKRINNTPEVEDNHYTKQPSLHLIPSETTNPSENHHGFDLELGSGEDLLLKTTEAEEEEDEEETEEDLELQGIYNLA
GQPRFLFTINEETKEDLESEDGKSRKGSRNRSLSDIITAIETPFFTPMASPPLKASPMGSLESYNYKLHGFNPLFESSEELEQNLKRLRSSPPPKFKFLRDAEEKLYRRL
MEEAQRKTEVTKNEQRKAINNNNNNQRFHYSSSSSQVLPLISSPPEAFS
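
To check that a CDS and a protein sequence like those listed above agree, one can translate the nucleotides codon by codon. The sketch below assumes the standard genetic code (NCBI translation table 1); the `translate` helper is hypothetical and not part of CuGenDB, and only the first and last few codons of the CDS are used here for brevity.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1; '*' marks a stop codon."""
    cds = cds.upper().replace("\n", "").replace(" ", "")
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First six codons and last five codons of the CDS listed above.
print(translate("ATGGCTTCTACCAGTTGG"))  # prints: MASTSW
print(translate("GAGGCCTTCTCTTAA"))     # prints: EAFS*
```

Both fragments match the start (MASTSW) and end (EAFS) of the protein sequence; passing the full CDS with its line breaks to `translate` would allow the same comparison over the whole record.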