CuGenDBv2

Sed0018874 (gene) of Chayote v1 genome

Gene ID: Sed0018874
Organism: Sechium edule (Chayote v1)
Description: Integral membrane HPP family protein
Genome location: LG01:13508863..13513108
RNA-Seq Expression: Sed0018874
Synteny: Sed0018874
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR007065 - HPP
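The genome location field follows a `seqid:start..end` pattern. As a minimal sketch, the field could be parsed as below. The names `Location` and `parse_location` are hypothetical, and treating the coordinates as 1-based and inclusive is an assumption that this page does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumes 1-based, inclusive coordinates (not confirmed by the page).
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string such as 'LG01:13508863..13513108'."""
    match = re.fullmatch(r"\s*([^:\s]+):(\d+)\.\.(\d+)\s*", text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start must not exceed end")
    return Location(seqid, start, end)


loc = parse_location("LG01:13508863..13513108")
```

Under the inclusive-coordinate assumption, `loc.length` gives the span of the gene model on LG01 in base pairs.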


Homology
GenBank top hits (e value, % identity, alignment)
XP_008437643.1 PREDICTED: uncharacterized protein LOC103482985 [Cucumis melo] (e value: 1.1e-107; identity: 79.32%)
Query:  MSLQLKPTSHHLHHRSRRHCLHRQQYRPSSNVRLRSPSASLLPNHSFVSLLPNCHSLNAKRGISADGSSSWRLFGDRRRRSSSRISHRSIVASGIAGVPI
        MSLQLKP  HHLHH   RHC   + Y+PS   ++++PSA +L NHS VSLLP CH LN KRGI       +  +  RR R S  I HRSIVAS IAG P+
Subjt:  MSLQLKPTSHHLHHRSRRHCLHRQQYRPSSNVRLRSPSASLLPNHSFVSLLPNCHSLNAKRGISADGSSSWRLFGDRRRRSSSRISHRSIVASGIAGVPI

Query:  SDGSKPEKGFISPPLSDILWPSAGAFAAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSAPAARKYNIFIAQIGCAAIGVLAFTLLGPGWLARSSA
        SDGSKPEKGF+SPPLSDILWPSAGAFAAMA+LGKMDQILAPKGLSMTIAPLGAVCAVLFATPSAPAARKYNIF+AQIGCAAIGVLAFTLLGPGWLARSSA
Subjt:  SDGSKPEKGFISPPLSDILWPSAGAFAAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSAPAARKYNIFIAQIGCAAIGVLAFTLLGPGWLARSSA

Query:  LAASMAFMIYTGSTHPPAASLPILFIDGAKLHHLNFWYALFPGAAGCILLCLIQEIVVYLKEKFKF
        LAASMAFMIYTGSTHPPAASLPILFIDGAK+  LNFWYALFPGAAGCILLCLIQE+VV LKEK KF
Subjt:  LAASMAFMIYTGSTHPPAASLPILFIDGAKLHHLNFWYALFPGAAGCILLCLIQEIVVYLKEKFKF

XP_011651222.2 uncharacterized protein LOC105434855 [Cucumis sativus] (e value: 1.3e-110; identity: 80.45%)
Query:  MSLQLKPTSHHLHHRSRRHCLHRQQYRPSSNVRLRSPSASLLPNHSFVSLLPNCHSLNAKRGISADGSSSWRLFGDRRRRSSSRISHRSIVASGIAGVPI
        MSLQLKP  HHLHH   RHC + + Y+PS    ++ PS  +L NHSFVSLLP+CH LN KRGISA     +  +  RR R S RI HRSIVAS IAG P+
Subjt:  MSLQLKPTSHHLHHRSRRHCLHRQQYRPSSNVRLRSPSASLLPNHSFVSLLPNCHSLNAKRGISADGSSSWRLFGDRRRRSSSRISHRSIVASGIAGVPI

Query:  SDGSKPEKGFISPPLSDILWPSAGAFAAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSAPAARKYNIFIAQIGCAAIGVLAFTLLGPGWLARSSA
        SDGSKPEKGF+SPPLSDILWPSAGAFAAMA+LGKMDQILAPKGLSMTIAPLGAVCAVLFATPSAPAARKYNIF+AQIGCAAIGVLAFTLLGPGWLARSSA
Subjt:  SDGSKPEKGFISPPLSDILWPSAGAFAAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSAPAARKYNIFIAQIGCAAIGVLAFTLLGPGWLARSSA

Query:  LAASMAFMIYTGSTHPPAASLPILFIDGAKLHHLNFWYALFPGAAGCILLCLIQEIVVYLKEKFKF
        LAASMAFMIYTGSTHPPAASLPILFIDGAK+  LNFWYALFPGAAGCILLCLIQE+VV LKEK KF
Subjt:  LAASMAFMIYTGSTHPPAASLPILFIDGAKLHHLNFWYALFPGAAGCILLCLIQEIVVYLKEKFKF

XP_022137446.1 uncharacterized protein LOC111008888 [Momordica charantia] (e value: 6.9e-107; identity: 81.27%)
Query:  MSLQLKPTSHHLHHRSRRHCLHRQQY-RPSSNVRLRSPSASLLPNHSFVSLLPNCHSLNAKRGISADGSSSWRLFGDRRRRSSSRISHRSIVASGIAGVP
        MSLQLKP  HHL HR RRH  H+QQY + SSNVRL++ SAS  PN SFVSLLPN H  N  RG+        RLFGDRRRRS     HR I ASGI G  
Subjt:  MSLQLKPTSHHLHHRSRRHCLHRQQY-RPSSNVRLRSPSASLLPNHSFVSLLPNCHSLNAKRGISADGSSSWRLFGDRRRRSSSRISHRSIVASGIAGVP

Query:  ISDGSKPEKGFISPPLSDILWPSAGAFAAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSAPAARKYNIFIAQIGCAAIGVLAFTLLGPGWLARSS
        +SDG+KPEKG  SP LSDILWPSAGAFAAMAMLGKMDQILA KGLSMTIAPLGAVCAVLFATPSAPAARKYNIF+AQIGCAAIGV AFTLLGPGWLARSS
Subjt:  ISDGSKPEKGFISPPLSDILWPSAGAFAAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSAPAARKYNIFIAQIGCAAIGVLAFTLLGPGWLARSS

Query:  ALAASMAFMIYTGSTHPPAASLPILFIDGAKLHHLNFWYALFPGAAGCILLCLIQEIVVYLKEKFKF
        ALAASMAFMI TGSTHPPAASLPILFIDGAKL HLNFWYALFPGAAGCILLCLIQEIVVYLKEKFKF
Subjt:  ALAASMAFMIYTGSTHPPAASLPILFIDGAKLHHLNFWYALFPGAAGCILLCLIQEIVVYLKEKFKF

XP_023519271.1 uncharacterized protein LOC111782708 isoform X1 [Cucurbita pepo subsp. pepo] (e value: 4.6e-103; identity: 76.1%)
Query:  MSLQLKPTSHHLHHRSRRHCLHRQQYRPSSNVRLRSPSASLLPNHSFVSLLPNCHSLNAKRGISADGS--SSWRLFGDRRRR----SSSRISHRSIVASG
        MSLQLKP    +HHR       +Q Y+PS  V           NHSF+SLLPNCH LN KRG+S DGS      L  DRRRR        +S+RSIVASG
Subjt:  MSLQLKPTSHHLHHRSRRHCLHRQQYRPSSNVRLRSPSASLLPNHSFVSLLPNCHSLNAKRGISADGS--SSWRLFGDRRRR----SSSRISHRSIVASG

Query:  IAGVPISDGSKPEKGFISPPLSDILWPSAGAFAAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSAPAARKYNIFIAQIGCAAIGVLAFTLLGPGW
        IAG PISDGSKP+KGF+SPPLSDILWPSAGAFAAMAMLGKMDQILAPKGLSMTIAPLGAVCA+LFA PS+PAARKYN+F+AQIGCAAIGVLAFTLLGPGW
Subjt:  IAGVPISDGSKPEKGFISPPLSDILWPSAGAFAAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSAPAARKYNIFIAQIGCAAIGVLAFTLLGPGW

Query:  LARSSALAASMAFMIYTGSTHPPAASLPILFIDGAKLHHLNFWYALFPGAAGCILLCLIQEIVVYLKEKFKF
        LARSSALAASMAFMIYTGSTHPPAASLP++FIDGAK+ HLNFWYALFPGAAGC+LLC IQEIVVYLKEKFKF
Subjt:  LARSSALAASMAFMIYTGSTHPPAASLPILFIDGAKLHHLNFWYALFPGAAGCILLCLIQEIVVYLKEKFKF

XP_038894638.1 uncharacterized protein LOC120083135 [Benincasa hispida] (e value: 3.8e-113; identity: 80.67%)
Query:  MSLQLKPTSHHLHHRSRRHCLHRQQYRPSSNVRLRSPSASLLPNHSFVSLLPNCHSLNAKRGISADGSSSWRLFGDRRRR---SSSRISHRSIVASGIAG
        MSLQLKP  HHLHH  RRHC  ++ Y+PS  V++++PSASL  NHSFVSLLPNCH LN  RG+S        LF +RR+R      RI HR IVASGIAG
Subjt:  MSLQLKPTSHHLHHRSRRHCLHRQQYRPSSNVRLRSPSASLLPNHSFVSLLPNCHSLNAKRGISADGSSSWRLFGDRRRR---SSSRISHRSIVASGIAG

Query:  VPISDGSKPEKGFISPPLSDILWPSAGAFAAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSAPAARKYNIFIAQIGCAAIGVLAFTLLGPGWLAR
         PISDGSK EKGF+SPPLSDILWPSAGAFAAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPS+PAARKYN+FIAQIGCAAIGVLAFTLLGPGWLAR
Subjt:  VPISDGSKPEKGFISPPLSDILWPSAGAFAAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSAPAARKYNIFIAQIGCAAIGVLAFTLLGPGWLAR

Query:  SSALAASMAFMIYTGSTHPPAASLPILFIDGAKLHHLNFWYALFPGAAGCILLCLIQEIVVYLKEKFKF
        SSALAASMAFMIYTGSTHPPAASLPILFIDGAK+  LNFWYALFPGAAGCILLCLIQE+V++LKEKFKF
Subjt:  SSALAASMAFMIYTGSTHPPAASLPILFIDGAKLHHLNFWYALFPGAAGCILLCLIQEIVVYLKEKFKF

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LR37 Uncharacterized protein (e value: 9.4e-110; identity: 80.08%)
Query:  MSLQLKPTSHHLHHRSRRHCLHRQQYRPSSNVRLRSPSASLLPNHSFVSLLPNCHSLNAKRGISADGSSSWRLFGDRRRRSSSRISHRSIVASGIAGVPI
        MSLQLKP  HHLHH   R C + + Y+PS    ++ PS  +L NHSFVSLLP+CH LN KRGISA     +  +  RR R S RI HRSIVAS IAG P+
Subjt:  MSLQLKPTSHHLHHRSRRHCLHRQQYRPSSNVRLRSPSASLLPNHSFVSLLPNCHSLNAKRGISADGSSSWRLFGDRRRRSSSRISHRSIVASGIAGVPI

Query:  SDGSKPEKGFISPPLSDILWPSAGAFAAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSAPAARKYNIFIAQIGCAAIGVLAFTLLGPGWLARSSA
        SDGSKPEKGF+SPPLSDILWPSAGAFAAMA+LGKMDQILAPKGLSMTIAPLGAVCAVLFATPSAPAARKYNIF+AQIGCAAIGVLAFTLLGPGWLARSSA
Subjt:  SDGSKPEKGFISPPLSDILWPSAGAFAAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSAPAARKYNIFIAQIGCAAIGVLAFTLLGPGWLARSSA

Query:  LAASMAFMIYTGSTHPPAASLPILFIDGAKLHHLNFWYALFPGAAGCILLCLIQEIVVYLKEKFKF
        LAASMAFMIYTGSTHPPAASLPILFIDGAK+  LNFWYALFPGAAGCILLCLIQE+VV LKEK KF
Subjt:  LAASMAFMIYTGSTHPPAASLPILFIDGAKLHHLNFWYALFPGAAGCILLCLIQEIVVYLKEKFKF

A0A1S3AUM8 uncharacterized protein LOC103482985 (e value: 5.1e-108; identity: 79.32%)
Query:  MSLQLKPTSHHLHHRSRRHCLHRQQYRPSSNVRLRSPSASLLPNHSFVSLLPNCHSLNAKRGISADGSSSWRLFGDRRRRSSSRISHRSIVASGIAGVPI
        MSLQLKP  HHLHH   RHC   + Y+PS   ++++PSA +L NHS VSLLP CH LN KRGI       +  +  RR R S  I HRSIVAS IAG P+
Subjt:  MSLQLKPTSHHLHHRSRRHCLHRQQYRPSSNVRLRSPSASLLPNHSFVSLLPNCHSLNAKRGISADGSSSWRLFGDRRRRSSSRISHRSIVASGIAGVPI

Query:  SDGSKPEKGFISPPLSDILWPSAGAFAAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSAPAARKYNIFIAQIGCAAIGVLAFTLLGPGWLARSSA
        SDGSKPEKGF+SPPLSDILWPSAGAFAAMA+LGKMDQILAPKGLSMTIAPLGAVCAVLFATPSAPAARKYNIF+AQIGCAAIGVLAFTLLGPGWLARSSA
Subjt:  SDGSKPEKGFISPPLSDILWPSAGAFAAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSAPAARKYNIFIAQIGCAAIGVLAFTLLGPGWLARSSA

Query:  LAASMAFMIYTGSTHPPAASLPILFIDGAKLHHLNFWYALFPGAAGCILLCLIQEIVVYLKEKFKF
        LAASMAFMIYTGSTHPPAASLPILFIDGAK+  LNFWYALFPGAAGCILLCLIQE+VV LKEK KF
Subjt:  LAASMAFMIYTGSTHPPAASLPILFIDGAKLHHLNFWYALFPGAAGCILLCLIQEIVVYLKEKFKF

A0A6J1C6M6 uncharacterized protein LOC111008888 (e value: 3.3e-107; identity: 81.27%)
Query:  MSLQLKPTSHHLHHRSRRHCLHRQQY-RPSSNVRLRSPSASLLPNHSFVSLLPNCHSLNAKRGISADGSSSWRLFGDRRRRSSSRISHRSIVASGIAGVP
        MSLQLKP  HHL HR RRH  H+QQY + SSNVRL++ SAS  PN SFVSLLPN H  N  RG+        RLFGDRRRRS     HR I ASGI G  
Subjt:  MSLQLKPTSHHLHHRSRRHCLHRQQY-RPSSNVRLRSPSASLLPNHSFVSLLPNCHSLNAKRGISADGSSSWRLFGDRRRRSSSRISHRSIVASGIAGVP

Query:  ISDGSKPEKGFISPPLSDILWPSAGAFAAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSAPAARKYNIFIAQIGCAAIGVLAFTLLGPGWLARSS
        +SDG+KPEKG  SP LSDILWPSAGAFAAMAMLGKMDQILA KGLSMTIAPLGAVCAVLFATPSAPAARKYNIF+AQIGCAAIGV AFTLLGPGWLARSS
Subjt:  ISDGSKPEKGFISPPLSDILWPSAGAFAAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSAPAARKYNIFIAQIGCAAIGVLAFTLLGPGWLARSS

Query:  ALAASMAFMIYTGSTHPPAASLPILFIDGAKLHHLNFWYALFPGAAGCILLCLIQEIVVYLKEKFKF
        ALAASMAFMI TGSTHPPAASLPILFIDGAKL HLNFWYALFPGAAGCILLCLIQEIVVYLKEKFKF
Subjt:  ALAASMAFMIYTGSTHPPAASLPILFIDGAKLHHLNFWYALFPGAAGCILLCLIQEIVVYLKEKFKF

A0A6J1E7R0 uncharacterized protein LOC111431576 isoform X1 (e value: 1.2e-101; identity: 75.37%)
Query:  MSLQLKPTSHHLHHRSRRHCLHRQQYRPSSNVRLRSPSASLLPNHSFVSLLPNCHSLNAKRGISADGS--SSWRLFGDRRRR----SSSRISHRSIVASG
        MSLQLKP    +HHR       +Q Y+PS  V           NHSF+SLLPNCH LN KRG+S DGS      L  DRRRR          +RSIVASG
Subjt:  MSLQLKPTSHHLHHRSRRHCLHRQQYRPSSNVRLRSPSASLLPNHSFVSLLPNCHSLNAKRGISADGS--SSWRLFGDRRRR----SSSRISHRSIVASG

Query:  IAGVPISDGSKPEKGFISPPLSDILWPSAGAFAAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSAPAARKYNIFIAQIGCAAIGVLAFTLLGPGW
        IA  PISDGSKP+KGF+SPPLSDILWPSAGAFAAMAMLGKMDQILAPKGLSMTIAPLGAVCA+LFA PS+PAARKYN+F+AQIGCAAIGVLAFTLLGPGW
Subjt:  IAGVPISDGSKPEKGFISPPLSDILWPSAGAFAAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSAPAARKYNIFIAQIGCAAIGVLAFTLLGPGW

Query:  LARSSALAASMAFMIYTGSTHPPAASLPILFIDGAKLHHLNFWYALFPGAAGCILLCLIQEIVVYLKEKFKF
        LARSSALAASMAFMIYTGSTHPPAASLP++FIDGAK+ HLNFWYALFPGAAGC+LLC IQEIVVYLKEKFKF
Subjt:  LARSSALAASMAFMIYTGSTHPPAASLPILFIDGAKLHHLNFWYALFPGAAGCILLCLIQEIVVYLKEKFKF

A0A6J1KLD7 uncharacterized protein LOC111495629 (e value: 2.1e-101; identity: 74.73%)
Query:  MSLQLKPTSHHLHHRSRRHCLHRQQYRPSSNVRLRSPSASLLPNHSFVSLLPNCHSLNAKRGISADGS--SSWRLFGDRRRR-----SSSRISHRSIVAS
        M+LQLKP    +HHR       +Q Y+PS  V           NHSF+SLLPNCH LN  RG S DGS      L  DRRRR         I +RSIVAS
Subjt:  MSLQLKPTSHHLHHRSRRHCLHRQQYRPSSNVRLRSPSASLLPNHSFVSLLPNCHSLNAKRGISADGS--SSWRLFGDRRRR-----SSSRISHRSIVAS

Query:  GIAGVPISDGSKPEKGFISPPLSDILWPSAGAFAAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSAPAARKYNIFIAQIGCAAIGVLAFTLLGPG
        GIAG PISDGSKP+KGF+SPPLSDILWPSAGAFAAMAMLGKMDQ+LAPKGLSMTIAPLGAVCA+LFA PS+PAARKYN+F+AQIGCAAIGVLAFTLLGPG
Subjt:  GIAGVPISDGSKPEKGFISPPLSDILWPSAGAFAAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSAPAARKYNIFIAQIGCAAIGVLAFTLLGPG

Query:  WLARSSALAASMAFMIYTGSTHPPAASLPILFIDGAKLHHLNFWYALFPGAAGCILLCLIQEIVVYLKEKFKF
        WLARSSALAASMAFMIYTGSTHPPAASLP++FIDGAK+ HLNFWYALFPGAAGC+LLC IQEIVVYLKEKFKF
Subjt:  WLARSSALAASMAFMIYTGSTHPPAASLPILFIDGAKLHHLNFWYALFPGAAGCILLCLIQEIVVYLKEKFKF

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT3G47980.1 Integral membrane HPP family protein (e value: 3.7e-66; identity: 56.38%)
Query:  PSSNVRLRSPSASLLPNHSFVSLLPNCHSLNAKRGISADGS----SSWRLFGDRRRRSSSRISHRSIVASGIAGVPISDGSKPEKGFISPPLSDILWPSA
        PS +++L S +  + P+   V    +     +  G+  D S     S R   +RRR S S      + +S        +  KPEK  ++P LSD++WP+A
Subjt:  PSSNVRLRSPSASLLPNHSFVSLLPNCHSLNAKRGISADGS----SSWRLFGDRRRRSSSRISHRSIVASGIAGVPISDGSKPEKGFISPPLSDILWPSA

Query:  GAFAAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSAPAARKYNIFIAQIGCAAIGVLAFTLLGPGWLARSSALAASMAFMIYTGSTHPPAASLPI
        GAFAAMA++G++DQ+L PKG+SM++APLGAV A+LF TPSAPAARKYN+F AQIGCAAIGVLAF+  GP WLARS+ALAAS+AFM+ T + HPPAASLP+
Subjt:  GAFAAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSAPAARKYNIFIAQIGCAAIGVLAFTLLGPGWLARSSALAASMAFMIYTGSTHPPAASLPI

Query:  LFIDGAKLHHLNFWYALFPGAAGCILLCLIQEIVVYLKEKFKF
        LFIDGAKLH LNFWYALFPGAA CILLC +Q IV YLKE  KF
Subjt:  LFIDGAKLHHLNFWYALFPGAAGCILLCLIQEIVVYLKEKFKF

AT5G62720.1 Integral membrane HPP family protein (e value: 7.4e-67; identity: 64.43%)
Query:  RSSSRISHRSIVASGIAG-----VPISDGSKPEKGFISPP--LSDILWPSAGAFAAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSAPAARKYNI
        ++ + ++HR  V++ +A       P  D  KP+K   +    LSD++WP+AGAFAAMA+LG+MDQ+L+PKG+SM++APLGAV A+LF TPSAPAARKYNI
Subjt:  RSSSRISHRSIVASGIAG-----VPISDGSKPEKGFISPP--LSDILWPSAGAFAAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSAPAARKYNI

Query:  FIAQIGCAAIGVLAFTLLGPGWLARSSALAASMAFMIYTGSTHPPAASLPILFIDGAKLHHLNFWYALFPGAAGCILLCLIQEIVVYLKEKFKF
        F+AQIGCAAIGV+AF++ GPGWLARS ALAAS+AFM+ T + HPPAASLP++FIDGAK HHLNFWYALFPGAA C++LCL+Q IV YLKE  KF
Subjt:  FIAQIGCAAIGVLAFTLLGPGWLARSSALAASMAFMIYTGSTHPPAASLPILFIDGAKLHHLNFWYALFPGAAGCILLCLIQEIVVYLKEKFKF

AT5G62720.2 Integral membrane HPP family protein (e value: 1.4e-41; identity: 59.06%)
Query:  RSSSRISHRSIVASGIAG-----VPISDGSKPEKGFISPP--LSDILWPSAGAFAAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSAPAARKYNI
        ++ + ++HR  V++ +A       P  D  KP+K   +    LSD++WP+AGAFAAMA+LG+MDQ+L+PKG+SM++APLGAV A+LF TPSAPAARKYNI
Subjt:  RSSSRISHRSIVASGIAG-----VPISDGSKPEKGFISPP--LSDILWPSAGAFAAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSAPAARKYNI

Query:  FIAQIGCAAIGVLAFTLLGPGWLARSSALAASMAFMIYTGSTHPPAASL
        F+AQIGCAAIGV+AF++ GPGWLARS ALAAS+AFM+ T + HPP   L
Subjt:  FIAQIGCAAIGVLAFTLLGPGWLARSSALAASMAFMIYTGSTHPPAASL


Sequences
CDS sequence
ATGAGCCTGCAACTAAAGCCAACTAGCCACCATCTCCATCACCGTAGCCGCCGCCATTGCCTTCATCGGCAGCAGTATCGACCCAGTTCTAATGTACGATTACGATCTCC
GTCCGCTTCTCTGCTCCCGAATCATTCATTTGTTTCTTTGTTGCCGAATTGCCATTCATTGAATGCAAAACGAGGGATTTCGGCGGACGGGTCTAGTTCGTGGAGATTAT
TCGGTGACCGGAGAAGACGAAGCAGCAGCAGAATCAGTCACCGGAGTATCGTGGCATCCGGCATTGCTGGTGTGCCGATTTCAGATGGGTCAAAACCAGAAAAGGGCTTT
ATTTCTCCTCCTCTCAGTGACATCCTTTGGCCTTCTGCAGGGGCATTTGCAGCAATGGCAATGCTGGGGAAAATGGATCAAATCCTAGCACCCAAGGGGCTTTCTATGAC
AATTGCGCCACTAGGAGCCGTGTGTGCGGTCCTGTTCGCAACGCCGTCGGCCCCTGCAGCTCGAAAGTACAATATCTTCATAGCACAGATTGGTTGTGCGGCAATCGGGG
TATTGGCGTTTACTTTGCTGGGGCCGGGATGGCTGGCAAGAAGCTCTGCTCTGGCTGCATCCATGGCGTTTATGATCTACACTGGTTCAACTCATCCTCCAGCTGCGAGC
TTGCCGATTCTGTTCATCGATGGAGCTAAATTGCATCATCTCAATTTCTGGTATGCTCTGTTTCCAGGAGCCGCTGGATGTATTCTTCTATGTTTGATACAAGAGATAGT
GGTGTACTTGAAGGAGAAGTTCAAATTTTAA
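The CDS above can be checked against the protein sequence listed for this gene by translating it codon by codon. The following is a minimal sketch, assuming the standard nuclear genetic code; the names `CODON_TABLE` and `translate` are hypothetical and are not part of CuGenDB.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks from wrapped input
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the CDS above.
prefix = translate("ATGAGCCTGCAACTAAAGCCA")
```

Passing the full wrapped CDS text to `translate` should reproduce the protein sequence if the record is internally consistent; the `split`/`join` step removes the line breaks.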
mRNA sequence
AGGTACCTATATCTATTCCTAGAGTAGGCACCGGGAAACATTGAAGAAATGGAGATACGGGAGTTGTCTACACGTTAATGTCATAGTGGGTCTTGAACCTATGACCTTGA
GGAGGTATACATTCAAAACTCAAGTCTTCACCAATGCGCCACTCCTTAGAGATAATTGGAAGGGCGGCTTAGGGTAAATAGTGAAAGGGGGAGAATGACAAATGATTATG
AAATAATTGTTGGTGATTTAAACATGAGTGAGGGATCTAATGATCATTCTCATTCTTCTTGACTCAAACACCTTTAAGACAAGAAGAATGAGAATTATTACGGAGAGGAT
CTTAGAAAATAATGATTTTGAAGAGAATGAATATTGTTTTGGGAATGAAGTGATTGATTATTAGGGAAGTGTACAGGGTAAAAAGTTATTACAAGTACGAATAAAAAAAT
GTTAGGAAGAAATGAGATTGTGAACTGATTGATTCTTGGGGGTGGAATGATCCTCTTTTCCATTATCCTCATTTGACTTATTTTCCATATTTAGACTTTGCTGTGACAAT
TAAATGATGGGCTTTTCTTAGCTGCTCTCTTCAATAAAAGTCAAAACTGTAACAGAGGATCACTTCAAATGGCAACTGAAATTTAATCCCATTTCACTGATTCGATGCTG
ATGAGTTTTTAAGTATATCTCGAAATAGACAAGCACGATGGCAGAGAGGGTGCTATTATTTATTGAATTTTCCACTATTATTATTATTATTATTATTTATGATTTAGCTT
TTGAGTTTTTAATGTTCGAGAAGTGGCAACTATACAGAAATGGAGTTGTCTCTTTCTGTGAATGGCCTTTGGAGCTCTTCAAAGGCCAACTCTGCAACACAGCTTCGAGT
TTTTTTTTCTTCTTTTTCTTCAGGCCTCTGTTTTCAAGCAGAAACTTTTCCCAAATCCCATTTCCTTACAAGATGGTGTCTTTGTCAGATTCACGTCTGAATTTACAATT
GCATGCTATTAATTTATTCACTATTTTCTGTTTCTTATCTTTTTTATTTATCACAAACTCTTTTTGCGGCTTTTCCCATACGCTCTCTCTGTTTCTCTGTATCTTTTTCT
TCTTCTTCTTCTTCTTCCCCTGTTTTGTCCACAAATATCTTAAGATAGAACAATAGAAGCTTTGAGGTGGGTTGAAGGTAAAATCCAGTATGAGCCTGCAACTAAAGCCA
ACTAGCCACCATCTCCATCACCGTAGCCGCCGCCATTGCCTTCATCGGCAGCAGTATCGACCCAGTTCTAATGTACGATTACGATCTCCGTCCGCTTCTCTGCTCCCGAA
TCATTCATTTGTTTCTTTGTTGCCGAATTGCCATTCATTGAATGCAAAACGAGGGATTTCGGCGGACGGGTCTAGTTCGTGGAGATTATTCGGTGACCGGAGAAGACGAA
GCAGCAGCAGAATCAGTCACCGGAGTATCGTGGCATCCGGCATTGCTGGTGTGCCGATTTCAGATGGGTCAAAACCAGAAAAGGGCTTTATTTCTCCTCCTCTCAGTGAC
ATCCTTTGGCCTTCTGCAGGGGCATTTGCAGCAATGGCAATGCTGGGGAAAATGGATCAAATCCTAGCACCCAAGGGGCTTTCTATGACAATTGCGCCACTAGGAGCCGT
GTGTGCGGTCCTGTTCGCAACGCCGTCGGCCCCTGCAGCTCGAAAGTACAATATCTTCATAGCACAGATTGGTTGTGCGGCAATCGGGGTATTGGCGTTTACTTTGCTGG
GGCCGGGATGGCTGGCAAGAAGCTCTGCTCTGGCTGCATCCATGGCGTTTATGATCTACACTGGTTCAACTCATCCTCCAGCTGCGAGCTTGCCGATTCTGTTCATCGAT
GGAGCTAAATTGCATCATCTCAATTTCTGGTATGCTCTGTTTCCAGGAGCCGCTGGATGTATTCTTCTATGTTTGATACAAGAGATAGTGGTGTACTTGAAGGAGAAGTT
CAAATTTTAAGCTCTCAATGAAACAGTGTGACACATTGAACCACCCATTATTTGAAAGCCAAATATTGATGAAAATAATTCATTGTCATTCCCCAACAAATATTTTTTGT
CCCATAATTGTTCTTGTTCAACCCCCACATCACTCCCAATTGCCTCTTCATTCTCAAATTAAGTTGTAAATTTATCTTCCCTACTTTGTGCATTTGATCTTTGTGAATCT
TCATCTATATATGAATGGTTAAATTATAAGTTTAGTTCTTCATCTTTCATGAATTTTAGAGAGATACCTAATAAATTCTTACTTTGATCTTGTGTTCAGTAAATTCTTGA
A
Protein sequence
MSLQLKPTSHHLHHRSRRHCLHRQQYRPSSNVRLRSPSASLLPNHSFVSLLPNCHSLNAKRGISADGSSSWRLFGDRRRRSSSRISHRSIVASGIAGVPISDGSKPEKGF
ISPPLSDILWPSAGAFAAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSAPAARKYNIFIAQIGCAAIGVLAFTLLGPGWLARSSALAASMAFMIYTGSTHPPAAS
LPILFIDGAKLHHLNFWYALFPGAAGCILLCLIQEIVVYLKEKFKF