; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0012986 (gene) of Chayote v1 genome

Gene IDSed0012986
OrganismSechium edule (Chayote v1)
DescriptionProtein of unknown function, DUF599
Genome locationLG04:20260910..20264274
RNA-Seq ExpressionSed0012986
SyntenySed0012986
Gene Ontology termsGO:0016020 - membrane (cellular component)
InterPro domainsIPR006747 - Protein of unknown function DUF599


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7017182.1 hypothetical protein SDJN02_19044, partial [Cucurbita argyrosperma subsp. argyrosperma]5.2e-6066.67Show/hide
Query:  MEEFCLDAIFISLSVLLVMGYHAYLWLRWKKKPEKTTMGVQWEHRRTWLEKTL-LGHDSMQVVQTLRNNLMIIILRASISIAVSTSVAALTNNAYKAANK
        MEE  +D+  +SLS+LLV+GYHA+LW   KKKPEKTT G+Q E RR WLE  L L   SMQVVQ LRNNLMIIILRASISIAVS+SVAALTNNAYK    
Subjt:  MEEFCLDAIFISLSVLLVMGYHAYLWLRWKKKPEKTTMGVQWEHRRTWLEKTL-LGHDSMQVVQTLRNNLMIIILRASISIAVSTSVAALTNNAYKAANK

Query:  YQGLLLGTQSSTGM--LFVVKFAVAFAVSMSSFICSSFGVGFLIDACVLVSSAASSSGHVQRLADTGFTLAFIGNRLMWLSFVVLLWSLGPIVVVLCSLA
        ++    GTQSS     LF VK+A AF VS+SSF+ SSFGVGFLID C+LVS+A +S+ H+QRL DTGF LAFIGNRLMWLSF +LLWSLGPI V LCS A
Subjt:  YQGLLLGTQSSTGM--LFVVKFAVAFAVSMSSFICSSFGVGFLIDACVLVSSAASSSGHVQRLADTGFTLAFIGNRLMWLSFVVLLWSLGPIVVVLCSLA

Query:  LVCGFSTMDFCSQISK
         V GFS +DF  +  K
Subjt:  LVCGFSTMDFCSQISK

XP_016899668.1 PREDICTED: uncharacterized protein LOC107990610 [Cucumis melo]5.4e-5759.82Show/hide
Query:  MEEFCLDAIFISLSVLLVMGYHAYLWLRWKKKPEKTTMGVQWEHRRTWLEKTL-LGHDSMQVVQTLRNNLMIIILRASISIAVSTSVAALTNNAYKAANK
        MEE  +D   +SLSVLLV+GYH +LW   KKKPEKT+ G+QWE RR W+E+ L +   SMQVVQ+LRNNLMIIILRASISI +S+SVAALTNNAYK+   
Subjt:  MEEFCLDAIFISLSVLLVMGYHAYLWLRWKKKPEKTTMGVQWEHRRTWLEKTL-LGHDSMQVVQTLRNNLMIIILRASISIAVSTSVAALTNNAYKAANK

Query:  YQGLLLGTQSSTGMLFVVKFAVAFAVSMSSFICSSFGVGFLIDACVLVSSAASSSGHVQRLADTGFTLAFIGNRLMWLSFVVLLWSLGPIVVVLCSLALV
        + G     QS    LF VK+A AF VS+SSF+CSSFGVGFL+D C+L+++   ++ H+ RL DTGF  AF+GNRLMW SFV+LLWSLGPI V L S ALV
Subjt:  YQGLLLGTQSSTGMLFVVKFAVAFAVSMSSFICSSFGVGFLIDACVLVSSAASSSGHVQRLADTGFTLAFIGNRLMWLSFVVLLWSLGPIVVVLCSLALV

Query:  CGFSTMDFCSQISKINNSH
         GFS +DF ++ +  + S+
Subjt:  CGFSTMDFCSQISKINNSH

XP_022934642.1 uncharacterized protein LOC111441778 [Cucurbita moschata]7.0e-5768.32Show/hide
Query:  MEEFCLDAIFISLSVLLVMGYHAYLWLRWKKKPEKTTMGVQWEHRRTWLEKTL-LGHDSMQVVQTLRNNLMIIILRASISIAVSTSVAALTNNAYKAANK
        MEE  +D+  +SLS+LLV+GYHA+LW   KKKPEKTT G+Q E RR WLE  L L   SMQVVQ LRNNLMIIILRASISIAVS+SVAALTNNAYK    
Subjt:  MEEFCLDAIFISLSVLLVMGYHAYLWLRWKKKPEKTTMGVQWEHRRTWLEKTL-LGHDSMQVVQTLRNNLMIIILRASISIAVSTSVAALTNNAYKAANK

Query:  YQGLLLGTQSSTGM--LFVVKFAVAFAVSMSSFICSSFGVGFLIDACVLVSSAASSSGHVQRLADTGFTLAFIGNRLMWLSFVVLLWSLGPIVVVLCSLA
        ++    GTQSS     LF VK+A AF VS+SSF+ SSFGVGFLID C+LVS+A +S+ H+QRL DTGF LAFIGNRLMWLSF +LLWSLGPI V LCS A
Subjt:  YQGLLLGTQSSTGM--LFVVKFAVAFAVSMSSFICSSFGVGFLIDACVLVSSAASSSGHVQRLADTGFTLAFIGNRLMWLSFVVLLWSLGPIVVVLCSLA

Query:  LV
         V
Subjt:  LV

XP_022982646.1 uncharacterized protein LOC111481460 [Cucurbita maxima]3.0e-6066.67Show/hide
Query:  MEEFCLDAIFISLSVLLVMGYHAYLWLRWKKKPEKTTMGVQWEHRRTWLEKTL-LGHDSMQVVQTLRNNLMIIILRASISIAVSTSVAALTNNAYKAANK
        MEE  +D+  +SLS+LLV+GYHA+LW   KKKPEKTT G+Q E RR WLE TL L   SMQVVQ LRNNLMIIILRASISIAVS+SVAALTNNAYK    
Subjt:  MEEFCLDAIFISLSVLLVMGYHAYLWLRWKKKPEKTTMGVQWEHRRTWLEKTL-LGHDSMQVVQTLRNNLMIIILRASISIAVSTSVAALTNNAYKAANK

Query:  YQGLLLGTQSSTGM--LFVVKFAVAFAVSMSSFICSSFGVGFLIDACVLVSSAASSSGHVQRLADTGFTLAFIGNRLMWLSFVVLLWSLGPIVVVLCSLA
        ++    GTQSS     LF VK+A AF VS+SSF+ SSFGVGFLID C+LVS+A +S+ H+QRL DTGF LAFIGNRLMWLSF +LLWSLGPI V LCS A
Subjt:  YQGLLLGTQSSTGM--LFVVKFAVAFAVSMSSFICSSFGVGFLIDACVLVSSAASSSGHVQRLADTGFTLAFIGNRLMWLSFVVLLWSLGPIVVVLCSLA

Query:  LVCGFSTMDFCSQISK
         + GFS +DF  +  K
Subjt:  LVCGFSTMDFCSQISK

XP_023528701.1 uncharacterized protein LOC111791548 [Cucurbita pepo subsp. pepo]5.2e-6066.67Show/hide
Query:  MEEFCLDAIFISLSVLLVMGYHAYLWLRWKKKPEKTTMGVQWEHRRTWLEKTL-LGHDSMQVVQTLRNNLMIIILRASISIAVSTSVAALTNNAYKAANK
        MEE  +D+  +SLS+LLV+GYHA+LW   KKKPEKTT G+Q E RR WLE  L L   SMQVVQ LRNNLMIIILRASISIAVS+SVAALTNNAYK    
Subjt:  MEEFCLDAIFISLSVLLVMGYHAYLWLRWKKKPEKTTMGVQWEHRRTWLEKTL-LGHDSMQVVQTLRNNLMIIILRASISIAVSTSVAALTNNAYKAANK

Query:  YQGLLLGTQSSTGM--LFVVKFAVAFAVSMSSFICSSFGVGFLIDACVLVSSAASSSGHVQRLADTGFTLAFIGNRLMWLSFVVLLWSLGPIVVVLCSLA
        ++    GTQSS     LF VK+A AF VS+SSF+ SSFGVGFLID C+LVS+A +S+ H+QRL DTGF LAFIGNRLMWLSF +LLWSLGPI V LCS A
Subjt:  YQGLLLGTQSSTGM--LFVVKFAVAFAVSMSSFICSSFGVGFLIDACVLVSSAASSSGHVQRLADTGFTLAFIGNRLMWLSFVVLLWSLGPIVVVLCSLA

Query:  LVCGFSTMDFCSQISK
         V GFS +DF  +  K
Subjt:  LVCGFSTMDFCSQISK

TrEMBL top hitse value%identityAlignment
A0A1S4DUM5 uncharacterized protein LOC1079906102.6e-5759.82Show/hide
Query:  MEEFCLDAIFISLSVLLVMGYHAYLWLRWKKKPEKTTMGVQWEHRRTWLEKTL-LGHDSMQVVQTLRNNLMIIILRASISIAVSTSVAALTNNAYKAANK
        MEE  +D   +SLSVLLV+GYH +LW   KKKPEKT+ G+QWE RR W+E+ L +   SMQVVQ+LRNNLMIIILRASISI +S+SVAALTNNAYK+   
Subjt:  MEEFCLDAIFISLSVLLVMGYHAYLWLRWKKKPEKTTMGVQWEHRRTWLEKTL-LGHDSMQVVQTLRNNLMIIILRASISIAVSTSVAALTNNAYKAANK

Query:  YQGLLLGTQSSTGMLFVVKFAVAFAVSMSSFICSSFGVGFLIDACVLVSSAASSSGHVQRLADTGFTLAFIGNRLMWLSFVVLLWSLGPIVVVLCSLALV
        + G     QS    LF VK+A AF VS+SSF+CSSFGVGFL+D C+L+++   ++ H+ RL DTGF  AF+GNRLMW SFV+LLWSLGPI V L S ALV
Subjt:  YQGLLLGTQSSTGMLFVVKFAVAFAVSMSSFICSSFGVGFLIDACVLVSSAASSSGHVQRLADTGFTLAFIGNRLMWLSFVVLLWSLGPIVVVLCSLALV

Query:  CGFSTMDFCSQISKINNSH
         GFS +DF ++ +  + S+
Subjt:  CGFSTMDFCSQISKINNSH

A0A5A7TL41 DUF599 domain-containing protein3.7e-5658.9Show/hide
Query:  MEEFCLDAIFISLSVLLVMGYHAYLWLRWKKKPEKTTMGVQWEHRRTWLEKTL-LGHDSMQVVQTLRNNLMIIILRASISIAVSTSVAALTNNAYKAANK
        MEE  +D   +SLSVLLV+GYH +LW   KKKPEKT+ G+QWE RR W+E+ L +   SMQVVQ+LRNNLMIIILRASISI +S+SVAALTNNAYK+   
Subjt:  MEEFCLDAIFISLSVLLVMGYHAYLWLRWKKKPEKTTMGVQWEHRRTWLEKTL-LGHDSMQVVQTLRNNLMIIILRASISIAVSTSVAALTNNAYKAANK

Query:  YQGLLLGTQSSTGMLFVVKFAVAFAVSMSSFICSSFGVGFLIDACVLVSSAASSSGHVQRLADTGFTLAFIGNRLMWLSFVVLLWSLGPIVVVLCSLALV
        + G     QS    LF VK+  AF VS+SSF+CSSFGVGFL+D C+L+++   ++ H+ RL D GF  AF+GNRLMW SFV+LLWSLGPI V L S ALV
Subjt:  YQGLLLGTQSSTGMLFVVKFAVAFAVSMSSFICSSFGVGFLIDACVLVSSAASSSGHVQRLADTGFTLAFIGNRLMWLSFVVLLWSLGPIVVVLCSLALV

Query:  CGFSTMDFCSQISKINNSH
         GFS +DF ++ +  + S+
Subjt:  CGFSTMDFCSQISKINNSH

A0A5D3DMR3 DUF599 domain-containing protein2.6e-5759.82Show/hide
Query:  MEEFCLDAIFISLSVLLVMGYHAYLWLRWKKKPEKTTMGVQWEHRRTWLEKTL-LGHDSMQVVQTLRNNLMIIILRASISIAVSTSVAALTNNAYKAANK
        MEE  +D   +SLSVLLV+GYH +LW   KKKPEKT+ G+QWE RR W+E+ L +   SMQVVQ+LRNNLMIIILRASISI +S+SVAALTNNAYK+   
Subjt:  MEEFCLDAIFISLSVLLVMGYHAYLWLRWKKKPEKTTMGVQWEHRRTWLEKTL-LGHDSMQVVQTLRNNLMIIILRASISIAVSTSVAALTNNAYKAANK

Query:  YQGLLLGTQSSTGMLFVVKFAVAFAVSMSSFICSSFGVGFLIDACVLVSSAASSSGHVQRLADTGFTLAFIGNRLMWLSFVVLLWSLGPIVVVLCSLALV
        + G     QS    LF VK+A AF VS+SSF+CSSFGVGFL+D C+L+++   ++ H+ RL DTGF  AF+GNRLMW SFV+LLWSLGPI V L S ALV
Subjt:  YQGLLLGTQSSTGMLFVVKFAVAFAVSMSSFICSSFGVGFLIDACVLVSSAASSSGHVQRLADTGFTLAFIGNRLMWLSFVVLLWSLGPIVVVLCSLALV

Query:  CGFSTMDFCSQISKINNSH
         GFS +DF ++ +  + S+
Subjt:  CGFSTMDFCSQISKINNSH

A0A6J1F3D7 uncharacterized protein LOC1114417783.4e-5768.32Show/hide
Query:  MEEFCLDAIFISLSVLLVMGYHAYLWLRWKKKPEKTTMGVQWEHRRTWLEKTL-LGHDSMQVVQTLRNNLMIIILRASISIAVSTSVAALTNNAYKAANK
        MEE  +D+  +SLS+LLV+GYHA+LW   KKKPEKTT G+Q E RR WLE  L L   SMQVVQ LRNNLMIIILRASISIAVS+SVAALTNNAYK    
Subjt:  MEEFCLDAIFISLSVLLVMGYHAYLWLRWKKKPEKTTMGVQWEHRRTWLEKTL-LGHDSMQVVQTLRNNLMIIILRASISIAVSTSVAALTNNAYKAANK

Query:  YQGLLLGTQSSTGM--LFVVKFAVAFAVSMSSFICSSFGVGFLIDACVLVSSAASSSGHVQRLADTGFTLAFIGNRLMWLSFVVLLWSLGPIVVVLCSLA
        ++    GTQSS     LF VK+A AF VS+SSF+ SSFGVGFLID C+LVS+A +S+ H+QRL DTGF LAFIGNRLMWLSF +LLWSLGPI V LCS A
Subjt:  YQGLLLGTQSSTGM--LFVVKFAVAFAVSMSSFICSSFGVGFLIDACVLVSSAASSSGHVQRLADTGFTLAFIGNRLMWLSFVVLLWSLGPIVVVLCSLA

Query:  LV
         V
Subjt:  LV

A0A6J1IZX2 uncharacterized protein LOC1114814601.5e-6066.67Show/hide
Query:  MEEFCLDAIFISLSVLLVMGYHAYLWLRWKKKPEKTTMGVQWEHRRTWLEKTL-LGHDSMQVVQTLRNNLMIIILRASISIAVSTSVAALTNNAYKAANK
        MEE  +D+  +SLS+LLV+GYHA+LW   KKKPEKTT G+Q E RR WLE TL L   SMQVVQ LRNNLMIIILRASISIAVS+SVAALTNNAYK    
Subjt:  MEEFCLDAIFISLSVLLVMGYHAYLWLRWKKKPEKTTMGVQWEHRRTWLEKTL-LGHDSMQVVQTLRNNLMIIILRASISIAVSTSVAALTNNAYKAANK

Query:  YQGLLLGTQSSTGM--LFVVKFAVAFAVSMSSFICSSFGVGFLIDACVLVSSAASSSGHVQRLADTGFTLAFIGNRLMWLSFVVLLWSLGPIVVVLCSLA
        ++    GTQSS     LF VK+A AF VS+SSF+ SSFGVGFLID C+LVS+A +S+ H+QRL DTGF LAFIGNRLMWLSF +LLWSLGPI V LCS A
Subjt:  YQGLLLGTQSSTGM--LFVVKFAVAFAVSMSSFICSSFGVGFLIDACVLVSSAASSSGHVQRLADTGFTLAFIGNRLMWLSFVVLLWSLGPIVVVLCSLA

Query:  LVCGFSTMDFCSQISK
         + GFS +DF  +  K
Subjt:  LVCGFSTMDFCSQISK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G31330.1 Protein of unknown function, DUF5992.2e-1626.61Show/hide
Query:  EFCLDAIFISLSVLLVMGYHAYLWLRWKKKPEKTTMGVQWEHRRTWLEKTLLGHDSMQV--VQTLRNNLMIIILRASISIAVSTSVAALTNNAYKAANKY
        E  LD I + L +++   YH YLW + + +P  T +G     RR W+   +  +D   +  VQTLRN +M   L A+ SI +   +AA+ ++ Y      
Subjt:  EFCLDAIFISLSVLLVMGYHAYLWLRWKKKPEKTTMGVQWEHRRTWLEKTLLGHDSMQV--VQTLRNNLMIIILRASISIAVSTSVAALTNNAYKAANKY

Query:  QGLLLGTQSSTGMLFVVKFAVAFAVSMSSFICSSFGVGFLIDACVLVSS-----------AASSSGHVQRLADTGFTLAFIGNRLMWLSFVVLLWSLGPI
           + G +     +  +K+     + + SF   S  + F+    +L+++             ++  +V  L + GF L  +GNRL + +  ++LW  GP+
Subjt:  QGLLLGTQSSTGMLFVVKFAVAFAVSMSSFICSSFGVGFLIDACVLVSS-----------AASSSGHVQRLADTGFTLAFIGNRLMWLSFVVLLWSLGPI

Query:  VVVLCSLALVCGFSTMDF
        +V LCS+ +V     +DF
Subjt:  VVVLCSLALVCGFSTMDF

AT5G10580.1 Protein of unknown function, DUF5991.0e-1323.89Show/hide
Query:  EEFCLDAIFISLSVLLVMGYHAYLWLRWKKKPEKTTMGVQWEHRRTWLEKTLLGHDSMQV--VQTLRNNLMIIILRASISIAVSTSVAALTNNAYKAANK
        E++ LDA+ +  ++L++ GYH YLW + +  P  T +G     RR+W+   +  ++   +  VQTLRN +M   L A+  I +   +AA+ ++ Y     
Subjt:  EEFCLDAIFISLSVLLVMGYHAYLWLRWKKKPEKTTMGVQWEHRRTWLEKTLLGHDSMQV--VQTLRNNLMIIILRASISIAVSTSVAALTNNAYKAANK

Query:  YQGLLLGTQSSTGMLFVVKFAVAFAVSMSSFICSSFGVGFLIDACVLVSSAAS-------------SSGHVQRLADTGFTLAFIGNRLMWLSFVVLLWSL
            + G          +K+     + + +F   S  + F+    +L+++                +  +V  L +  F L  +GNRL ++   ++LW  
Subjt:  YQGLLLGTQSSTGMLFVVKFAVAFAVSMSSFICSSFGVGFLIDACVLVSSAAS-------------SSGHVQRLADTGFTLAFIGNRLMWLSFVVLLWSL

Query:  GPIVVVLCSLALVCGFSTMDFCSQIS
        GP++V L S  ++     +DF   +S
Subjt:  GPIVVVLCSLALVCGFSTMDFCSQIS

AT5G10580.2 Protein of unknown function, DUF5991.1e-0724Show/hide
Query:  EEFCLDAIFISLSVLLVMGYHAYLWLRWKKKPEKTTMGVQWEHRRTWLEKTLLGHDSMQV--VQTLRNNLMIIILRASISIAVSTSVAALTNNAYKAANK
        E++ LDA+ +  ++L++ GYH YLW + +  P  T +G     RR+W+   +  ++   +  VQTLRN +M   L A+  I +   +AA+ ++ Y     
Subjt:  EEFCLDAIFISLSVLLVMGYHAYLWLRWKKKPEKTTMGVQWEHRRTWLEKTLLGHDSMQV--VQTLRNNLMIIILRASISIAVSTSVAALTNNAYKAANK

Query:  YQGLLLGTQSSTGMLFVVKFAVAFAVSMSSFICSSFGVGFLIDACVLVSS
            + G          +K+     + + +F   S  + F+    +L+++
Subjt:  YQGLLLGTQSSTGMLFVVKFAVAFAVSMSSFICSSFGVGFLIDACVLVSS

AT5G24790.1 Protein of unknown function, DUF5992.7e-1426.24Show/hide
Query:  EEFCLDAIFISLSVLLVMGYHAYLWLRWKKKPEKTTMGVQWEHRRTWLEKTLLGHDSMQV--VQTLRNNLMIIILRASISIAVSTSVAALTNNAYKAANK
        +++ LDAI + L++++++ YH YL    +  P  T +G+    RR W+   +  +    +  VQTLRN +M   L A+  + +   +AA+ ++ Y     
Subjt:  EEFCLDAIFISLSVLLVMGYHAYLWLRWKKKPEKTTMGVQWEHRRTWLEKTLLGHDSMQV--VQTLRNNLMIIILRASISIAVSTSVAALTNNAYKAANK

Query:  YQGLLLGTQSSTGMLFVVKFAVAFAVSMSSFICSSFGVGFLIDACVLVS----------SAASSSGHVQRLADTGFTLAFIGNRLMWLSFVVLLWSLGPI
            + G          +K+     + + SF   S  + FL    +LV+              +S HV  + + G  L  +GNRL +  F ++LW  GPI
Subjt:  YQGLLLGTQSSTGMLFVVKFAVAFAVSMSSFICSSFGVGFLIDACVLVS----------SAASSSGHVQRLADTGFTLAFIGNRLMWLSFVVLLWSLGPI

Query:  VVVLCSLALVCGFSTMDFCSQ
        +V    L +V   S +DF S+
Subjt:  VVVLCSLALVCGFSTMDFCSQ

AT5G43180.1 Protein of unknown function, DUF5993.4e-2534.22Show/hide
Query:  DAIFISLSVLLVMGYHAYLWLRWKKKPEKTTMGVQWEHRRTWLEKTLLGHD--SMQVVQTLRNNLMIIILRASISIAVSTSVAALTNNAYKAAN---KYQ
        D+I + LS+L+ +GYH +LW  +K  P +T++G+    R++W      G D   M  VQ+LRN  M+ IL A+I+I +  S+AA+TNNA+KA++      
Subjt:  DAIFISLSVLLVMGYHAYLWLRWKKKPEKTTMGVQWEHRRTWLEKTLLGHD--SMQVVQTLRNNLMIIILRASISIAVSTSVAALTNNAYKAAN---KYQ

Query:  GLLLGTQSSTGMLFVVKFAVAFAVSMSSFICSSFGVGFLIDACVLVSSAA-----------------SSSGHVQRLADTGFTLAFIGNRLMWLSFVVLLW
         +  G+Q++   +FV+K+A A  +  +SF  SS  + +L+DA  L+++ A                 S   + + + + GF +A +GNR+M +S  +LLW
Subjt:  GLLLGTQSSTGMLFVVKFAVAFAVSMSSFICSSFGVGFLIDACVLVSSAA-----------------SSSGHVQRLADTGFTLAFIGNRLMWLSFVVLLW

Query:  SLGPIVVVLCSLALVCGFSTMDFCS
          GP+ V+  SL LV      DF S
Subjt:  SLGPIVVVLCSLALVCGFSTMDFCS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGAGTTTTGCTTAGATGCCATATTCATCAGCCTCAGTGTGTTGCTTGTGATGGGATATCACGCCTATCTATGGCTACGCTGGAAGAAGAAACCAGAGAAGACAAC
CATGGGAGTTCAATGGGAGCATCGGCGAACGTGGCTCGAGAAGACGCTGCTGGGCCATGACAGCATGCAGGTAGTACAGACCTTAAGAAACAATCTCATGATCATAATTC
TGAGAGCTTCAATATCAATCGCGGTATCCACTTCTGTAGCAGCCCTCACAAACAACGCATACAAAGCTGCAAATAAATATCAAGGACTGTTACTTGGAACTCAATCTAGC
ACTGGGATGTTGTTTGTTGTGAAATTTGCTGTTGCATTTGCAGTGTCGATGTCGAGCTTCATCTGTAGCTCGTTTGGGGTTGGGTTTCTGATCGACGCCTGCGTGTTGGT
CAGCAGTGCGGCAAGCAGTAGCGGCCATGTTCAGAGGCTGGCAGACACAGGATTCACCTTGGCTTTTATAGGGAACCGCCTGATGTGGCTCAGTTTTGTTGTGTTGTTAT
GGTCACTTGGTCCTATTGTTGTGGTCCTCTGTTCCTTGGCTCTAGTTTGTGGGTTTTCTACCATGGACTTTTGCAGCCAAATCAGTAAGATTAATAATTCACACTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAAGAGTTTTGCTTAGATGCCATATTCATCAGCCTCAGTGTGTTGCTTGTGATGGGATATCACGCCTATCTATGGCTACGCTGGAAGAAGAAACCAGAGAAGACAAC
CATGGGAGTTCAATGGGAGCATCGGCGAACGTGGCTCGAGAAGACGCTGCTGGGCCATGACAGCATGCAGGTAGTACAGACCTTAAGAAACAATCTCATGATCATAATTC
TGAGAGCTTCAATATCAATCGCGGTATCCACTTCTGTAGCAGCCCTCACAAACAACGCATACAAAGCTGCAAATAAATATCAAGGACTGTTACTTGGAACTCAATCTAGC
ACTGGGATGTTGTTTGTTGTGAAATTTGCTGTTGCATTTGCAGTGTCGATGTCGAGCTTCATCTGTAGCTCGTTTGGGGTTGGGTTTCTGATCGACGCCTGCGTGTTGGT
CAGCAGTGCGGCAAGCAGTAGCGGCCATGTTCAGAGGCTGGCAGACACAGGATTCACCTTGGCTTTTATAGGGAACCGCCTGATGTGGCTCAGTTTTGTTGTGTTGTTAT
GGTCACTTGGTCCTATTGTTGTGGTCCTCTGTTCCTTGGCTCTAGTTTGTGGGTTTTCTACCATGGACTTTTGCAGCCAAATCAGTAAGATTAATAATTCACACTGA
Protein sequenceShow/hide protein sequence
MEEFCLDAIFISLSVLLVMGYHAYLWLRWKKKPEKTTMGVQWEHRRTWLEKTLLGHDSMQVVQTLRNNLMIIILRASISIAVSTSVAALTNNAYKAANKYQGLLLGTQSS
TGMLFVVKFAVAFAVSMSSFICSSFGVGFLIDACVLVSSAASSSGHVQRLADTGFTLAFIGNRLMWLSFVVLLWSLGPIVVVLCSLALVCGFSTMDFCSQISKINNSH