; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0020047 (gene) of Chayote v1 genome

Gene IDSed0020047
OrganismSechium edule (Chayote v1)
DescriptionIntegral membrane HPP family protein
Genome locationLG08:38225404..38228246
RNA-Seq ExpressionSed0020047
SyntenySed0020047
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR007065 - HPP


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008437643.1 PREDICTED: uncharacterized protein LOC103482985 [Cucumis melo]4.8e-10278.79Show/hide
Query:  MSLQLKPIHHYF-------CHQQFQPSYRVELQAPSASLLPNHSFVSLLPNCHFLNGIQGISADGL------LRRRRRGV-GIGRWSIVASGIAGVPISD
        MSLQLKPIHH+        CH+ +QPSYR ++QAPSA +L NHS VSLLP CH LNG +GI    L       RRR RG  GIG  SIVAS IAG P+SD
Subjt:  MSLQLKPIHHYF-------CHQQFQPSYRVELQAPSASLLPNHSFVSLLPNCHFLNGIQGISADGL------LRRRRRGV-GIGRWSIVASGIAGVPISD

Query:  GSKPEKVSVSPPLSDILWPSAGAFAAMAVLGKMDQILAAKGLSMTIAPLGAVCAVLFATPSSPAARKYNIFMAQIGCAAIGVLALALLGPGWLARSSALA
        GSKPEK  VSPPLSDILWPSAGAFAAMA+LGKMDQILA KGLSMTIAPLGAVCAVLFATPS+PAARKYNIFMAQIGCAAIGVLA  LLGPGWLARSSALA
Subjt:  GSKPEKVSVSPPLSDILWPSAGAFAAMAVLGKMDQILAAKGLSMTIAPLGAVCAVLFATPSSPAARKYNIFMAQIGCAAIGVLALALLGPGWLARSSALA

Query:  ASMAFMIYTGSLHPPAASLPILFIDGAKLQHLNFWFALFPGAAGCILLCFIQELVVYLKEKFKF
        ASMAFMIYTGS HPPAASLPILFIDGAK+Q LNFW+ALFPGAAGCILLC IQELVV LKEK KF
Subjt:  ASMAFMIYTGSLHPPAASLPILFIDGAKLQHLNFWFALFPGAAGCILLCFIQELVVYLKEKFKF

XP_011651222.2 uncharacterized protein LOC105434855 [Cucumis sativus]7.7e-10077.44Show/hide
Query:  MSLQLKPIHHYF-------CH--QQFQPSYRVELQAPSASLLPNHSFVSLLPNCHFLNGIQGISADGL------LRRRRRGVG-IGRWSIVASGIAGVPI
        MSLQLKPIHH+        CH  + +QPSY   +Q PS  +L NHSFVSLLP+CH LNG +GISA  L       RRR RG   IG  SIVAS IAG P+
Subjt:  MSLQLKPIHHYF-------CH--QQFQPSYRVELQAPSASLLPNHSFVSLLPNCHFLNGIQGISADGL------LRRRRRGVG-IGRWSIVASGIAGVPI

Query:  SDGSKPEKVSVSPPLSDILWPSAGAFAAMAVLGKMDQILAAKGLSMTIAPLGAVCAVLFATPSSPAARKYNIFMAQIGCAAIGVLALALLGPGWLARSSA
        SDGSKPEK  VSPPLSDILWPSAGAFAAMA+LGKMDQILA KGLSMTIAPLGAVCAVLFATPS+PAARKYNIF+AQIGCAAIGVLA  LLGPGWLARSSA
Subjt:  SDGSKPEKVSVSPPLSDILWPSAGAFAAMAVLGKMDQILAAKGLSMTIAPLGAVCAVLFATPSSPAARKYNIFMAQIGCAAIGVLALALLGPGWLARSSA

Query:  LAASMAFMIYTGSLHPPAASLPILFIDGAKLQHLNFWFALFPGAAGCILLCFIQELVVYLKEKFKF
        LAASMAFMIYTGS HPPAASLPILFIDGAK+Q LNFW+ALFPGAAGCILLC IQELVV LKEK KF
Subjt:  LAASMAFMIYTGSLHPPAASLPILFIDGAKLQHLNFWFALFPGAAGCILLCFIQELVVYLKEKFKF

XP_023001510.1 uncharacterized protein LOC111495629 [Cucurbita maxima]1.3e-9976.14Show/hide
Query:  MSLQLKPIHHYFCHQQFQPSYRVELQAPSASLLPNHSFVSLLPNCHFLNGIQGISADGLL---------RRRRR-----GVGIGRWSIVASGIAGVPISD
        M+LQLKPIHH    Q +QPS+RV           NHSF+SLLPNCH LNG +G S DG +         RRRRR     G GIG  SIVASGIAG PISD
Subjt:  MSLQLKPIHHYFCHQQFQPSYRVELQAPSASLLPNHSFVSLLPNCHFLNGIQGISADGLL---------RRRRR-----GVGIGRWSIVASGIAGVPISD

Query:  GSKPEKVSVSPPLSDILWPSAGAFAAMAVLGKMDQILAAKGLSMTIAPLGAVCAVLFATPSSPAARKYNIFMAQIGCAAIGVLALALLGPGWLARSSALA
        GSKP+K  VSPPLSDILWPSAGAFAAMA+LGKMDQ+LA KGLSMTIAPLGAVCA+LFA PSSPAARKYN+FMAQIGCAAIGVLA  LLGPGWLARSSALA
Subjt:  GSKPEKVSVSPPLSDILWPSAGAFAAMAVLGKMDQILAAKGLSMTIAPLGAVCAVLFATPSSPAARKYNIFMAQIGCAAIGVLALALLGPGWLARSSALA

Query:  ASMAFMIYTGSLHPPAASLPILFIDGAKLQHLNFWFALFPGAAGCILLCFIQELVVYLKEKFKF
        ASMAFMIYTGS HPPAASLP++FIDGAK+QHLNFW+ALFPGAAGC+LLCFIQE+VVYLKEKFKF
Subjt:  ASMAFMIYTGSLHPPAASLPILFIDGAKLQHLNFWFALFPGAAGCILLCFIQELVVYLKEKFKF

XP_023519271.1 uncharacterized protein LOC111782708 isoform X1 [Cucurbita pepo subsp. pepo]7.7e-10076.43Show/hide
Query:  MSLQLKPIHHYFCHQQFQPSYRVELQAPSASLLPNHSFVSLLPNCHFLNGIQGISADGLL---------RRRRR----GVGIGRWSIVASGIAGVPISDG
        MSLQLKPIHH    Q +QPS+RV           NHSF+SLLPNCH LNG +G+S DG +         RRRRR    G G+   SIVASGIAG PISDG
Subjt:  MSLQLKPIHHYFCHQQFQPSYRVELQAPSASLLPNHSFVSLLPNCHFLNGIQGISADGLL---------RRRRR----GVGIGRWSIVASGIAGVPISDG

Query:  SKPEKVSVSPPLSDILWPSAGAFAAMAVLGKMDQILAAKGLSMTIAPLGAVCAVLFATPSSPAARKYNIFMAQIGCAAIGVLALALLGPGWLARSSALAA
        SKP+K  VSPPLSDILWPSAGAFAAMA+LGKMDQILA KGLSMTIAPLGAVCA+LFA PSSPAARKYN+FMAQIGCAAIGVLA  LLGPGWLARSSALAA
Subjt:  SKPEKVSVSPPLSDILWPSAGAFAAMAVLGKMDQILAAKGLSMTIAPLGAVCAVLFATPSSPAARKYNIFMAQIGCAAIGVLALALLGPGWLARSSALAA

Query:  SMAFMIYTGSLHPPAASLPILFIDGAKLQHLNFWFALFPGAAGCILLCFIQELVVYLKEKFKF
        SMAFMIYTGS HPPAASLP++FIDGAK+QHLNFW+ALFPGAAGC+LLCFIQE+VVYLKEKFKF
Subjt:  SMAFMIYTGSLHPPAASLPILFIDGAKLQHLNFWFALFPGAAGCILLCFIQELVVYLKEKFKF

XP_038894638.1 uncharacterized protein LOC120083135 [Benincasa hispida]1.0e-10479.47Show/hide
Query:  MSLQLKPIHHYF-------CHQQ--FQPSYRVELQAPSASLLPNHSFVSLLPNCHFLNGIQGISADGLLRRRRRGVG----IGRWSIVASGIAGVPISDG
        MSLQLKPIHH+        CH Q  +QPSYRV++QAPSASL  NHSFVSLLPNCH LNG +G+S      RR+R  G    IG   IVASGIAG PISDG
Subjt:  MSLQLKPIHHYF-------CHQQ--FQPSYRVELQAPSASLLPNHSFVSLLPNCHFLNGIQGISADGLLRRRRRGVG----IGRWSIVASGIAGVPISDG

Query:  SKPEKVSVSPPLSDILWPSAGAFAAMAVLGKMDQILAAKGLSMTIAPLGAVCAVLFATPSSPAARKYNIFMAQIGCAAIGVLALALLGPGWLARSSALAA
        SK EK  VSPPLSDILWPSAGAFAAMA+LGKMDQILA KGLSMTIAPLGAVCAVLFATPSSPAARKYN+F+AQIGCAAIGVLA  LLGPGWLARSSALAA
Subjt:  SKPEKVSVSPPLSDILWPSAGAFAAMAVLGKMDQILAAKGLSMTIAPLGAVCAVLFATPSSPAARKYNIFMAQIGCAAIGVLALALLGPGWLARSSALAA

Query:  SMAFMIYTGSLHPPAASLPILFIDGAKLQHLNFWFALFPGAAGCILLCFIQELVVYLKEKFKF
        SMAFMIYTGS HPPAASLPILFIDGAK+Q LNFW+ALFPGAAGCILLC IQELV++LKEKFKF
Subjt:  SMAFMIYTGSLHPPAASLPILFIDGAKLQHLNFWFALFPGAAGCILLCFIQELVVYLKEKFKF

TrEMBL top hitse value%identityAlignment
A0A0A0LR37 Uncharacterized protein3.7e-10077.44Show/hide
Query:  MSLQLKPIHHYF-------CH--QQFQPSYRVELQAPSASLLPNHSFVSLLPNCHFLNGIQGISADGL------LRRRRRGVG-IGRWSIVASGIAGVPI
        MSLQLKPIHH+        CH  + +QPSY   +Q PS  +L NHSFVSLLP+CH LNG +GISA  L       RRR RG   IG  SIVAS IAG P+
Subjt:  MSLQLKPIHHYF-------CH--QQFQPSYRVELQAPSASLLPNHSFVSLLPNCHFLNGIQGISADGL------LRRRRRGVG-IGRWSIVASGIAGVPI

Query:  SDGSKPEKVSVSPPLSDILWPSAGAFAAMAVLGKMDQILAAKGLSMTIAPLGAVCAVLFATPSSPAARKYNIFMAQIGCAAIGVLALALLGPGWLARSSA
        SDGSKPEK  VSPPLSDILWPSAGAFAAMA+LGKMDQILA KGLSMTIAPLGAVCAVLFATPS+PAARKYNIF+AQIGCAAIGVLA  LLGPGWLARSSA
Subjt:  SDGSKPEKVSVSPPLSDILWPSAGAFAAMAVLGKMDQILAAKGLSMTIAPLGAVCAVLFATPSSPAARKYNIFMAQIGCAAIGVLALALLGPGWLARSSA

Query:  LAASMAFMIYTGSLHPPAASLPILFIDGAKLQHLNFWFALFPGAAGCILLCFIQELVVYLKEKFKF
        LAASMAFMIYTGS HPPAASLPILFIDGAK+Q LNFW+ALFPGAAGCILLC IQELVV LKEK KF
Subjt:  LAASMAFMIYTGSLHPPAASLPILFIDGAKLQHLNFWFALFPGAAGCILLCFIQELVVYLKEKFKF

A0A1S3AUM8 uncharacterized protein LOC1034829852.3e-10278.79Show/hide
Query:  MSLQLKPIHHYF-------CHQQFQPSYRVELQAPSASLLPNHSFVSLLPNCHFLNGIQGISADGL------LRRRRRGV-GIGRWSIVASGIAGVPISD
        MSLQLKPIHH+        CH+ +QPSYR ++QAPSA +L NHS VSLLP CH LNG +GI    L       RRR RG  GIG  SIVAS IAG P+SD
Subjt:  MSLQLKPIHHYF-------CHQQFQPSYRVELQAPSASLLPNHSFVSLLPNCHFLNGIQGISADGL------LRRRRRGV-GIGRWSIVASGIAGVPISD

Query:  GSKPEKVSVSPPLSDILWPSAGAFAAMAVLGKMDQILAAKGLSMTIAPLGAVCAVLFATPSSPAARKYNIFMAQIGCAAIGVLALALLGPGWLARSSALA
        GSKPEK  VSPPLSDILWPSAGAFAAMA+LGKMDQILA KGLSMTIAPLGAVCAVLFATPS+PAARKYNIFMAQIGCAAIGVLA  LLGPGWLARSSALA
Subjt:  GSKPEKVSVSPPLSDILWPSAGAFAAMAVLGKMDQILAAKGLSMTIAPLGAVCAVLFATPSSPAARKYNIFMAQIGCAAIGVLALALLGPGWLARSSALA

Query:  ASMAFMIYTGSLHPPAASLPILFIDGAKLQHLNFWFALFPGAAGCILLCFIQELVVYLKEKFKF
        ASMAFMIYTGS HPPAASLPILFIDGAK+Q LNFW+ALFPGAAGCILLC IQELVV LKEK KF
Subjt:  ASMAFMIYTGSLHPPAASLPILFIDGAKLQHLNFWFALFPGAAGCILLCFIQELVVYLKEKFKF

A0A6J1E7R0 uncharacterized protein LOC111431576 isoform X11.4e-9976.43Show/hide
Query:  MSLQLKPIHHYFCHQQFQPSYRVELQAPSASLLPNHSFVSLLPNCHFLNGIQGISADGLL---------RRRRR----GVGIGRWSIVASGIAGVPISDG
        MSLQLKPIHH    Q +QPS+RV           NHSF+SLLPNCH LNG +G+S DG +         RRRRR    G G G  SIVASGIA  PISDG
Subjt:  MSLQLKPIHHYFCHQQFQPSYRVELQAPSASLLPNHSFVSLLPNCHFLNGIQGISADGLL---------RRRRR----GVGIGRWSIVASGIAGVPISDG

Query:  SKPEKVSVSPPLSDILWPSAGAFAAMAVLGKMDQILAAKGLSMTIAPLGAVCAVLFATPSSPAARKYNIFMAQIGCAAIGVLALALLGPGWLARSSALAA
        SKP+K  VSPPLSDILWPSAGAFAAMA+LGKMDQILA KGLSMTIAPLGAVCA+LFA PSSPAARKYN+FMAQIGCAAIGVLA  LLGPGWLARSSALAA
Subjt:  SKPEKVSVSPPLSDILWPSAGAFAAMAVLGKMDQILAAKGLSMTIAPLGAVCAVLFATPSSPAARKYNIFMAQIGCAAIGVLALALLGPGWLARSSALAA

Query:  SMAFMIYTGSLHPPAASLPILFIDGAKLQHLNFWFALFPGAAGCILLCFIQELVVYLKEKFKF
        SMAFMIYTGS HPPAASLP++FIDGAK+QHLNFW+ALFPGAAGC+LLCFIQE+VVYLKEKFKF
Subjt:  SMAFMIYTGSLHPPAASLPILFIDGAKLQHLNFWFALFPGAAGCILLCFIQELVVYLKEKFKF

A0A6J1EB70 uncharacterized protein LOC111431576 isoform X21.7e-9776.05Show/hide
Query:  MSLQLKPIHHYFCHQQFQPSYRVELQAPSASLLPNHSFVSLLPNCHFLNGIQGISADGLL---------RRRRR----GVGIGRWSIVASGIAGVPISDG
        MSLQLKPIHH    Q +QPS+RV           NHSF+SLLPNCH LNG +G+S DG +         RRRRR    G G G  SIVASGIA  PISDG
Subjt:  MSLQLKPIHHYFCHQQFQPSYRVELQAPSASLLPNHSFVSLLPNCHFLNGIQGISADGLL---------RRRRR----GVGIGRWSIVASGIAGVPISDG

Query:  SKPEKVSVSPPLSDILWPSAGAFAAMAVLGKMDQILAAKGLSMTIAPLGAVCAVLFATPSSPAARKYNIFMAQIGCAAIGVLALALLGPGWLARSSALAA
        SKP+K  VSPPLSDILWPSAGAFAAMA+LGKMDQILA KGLSMTIAPLGAVCA+LFA PSSPAAR YN+FMAQIGCAAIGVLA  LLGPGWLARSSALAA
Subjt:  SKPEKVSVSPPLSDILWPSAGAFAAMAVLGKMDQILAAKGLSMTIAPLGAVCAVLFATPSSPAARKYNIFMAQIGCAAIGVLALALLGPGWLARSSALAA

Query:  SMAFMIYTGSLHPPAASLPILFIDGAKLQHLNFWFALFPGAAGCILLCFIQELVVYLKEKFKF
        SMAFMIYTGS HPPAASLP++FIDGAK+QHLNFW+ALFPGAAGC+LLCFIQE+VVYLKEKFKF
Subjt:  SMAFMIYTGSLHPPAASLPILFIDGAKLQHLNFWFALFPGAAGCILLCFIQELVVYLKEKFKF

A0A6J1KLD7 uncharacterized protein LOC1114956296.3e-10076.14Show/hide
Query:  MSLQLKPIHHYFCHQQFQPSYRVELQAPSASLLPNHSFVSLLPNCHFLNGIQGISADGLL---------RRRRR-----GVGIGRWSIVASGIAGVPISD
        M+LQLKPIHH    Q +QPS+RV           NHSF+SLLPNCH LNG +G S DG +         RRRRR     G GIG  SIVASGIAG PISD
Subjt:  MSLQLKPIHHYFCHQQFQPSYRVELQAPSASLLPNHSFVSLLPNCHFLNGIQGISADGLL---------RRRRR-----GVGIGRWSIVASGIAGVPISD

Query:  GSKPEKVSVSPPLSDILWPSAGAFAAMAVLGKMDQILAAKGLSMTIAPLGAVCAVLFATPSSPAARKYNIFMAQIGCAAIGVLALALLGPGWLARSSALA
        GSKP+K  VSPPLSDILWPSAGAFAAMA+LGKMDQ+LA KGLSMTIAPLGAVCA+LFA PSSPAARKYN+FMAQIGCAAIGVLA  LLGPGWLARSSALA
Subjt:  GSKPEKVSVSPPLSDILWPSAGAFAAMAVLGKMDQILAAKGLSMTIAPLGAVCAVLFATPSSPAARKYNIFMAQIGCAAIGVLALALLGPGWLARSSALA

Query:  ASMAFMIYTGSLHPPAASLPILFIDGAKLQHLNFWFALFPGAAGCILLCFIQELVVYLKEKFKF
        ASMAFMIYTGS HPPAASLP++FIDGAK+QHLNFW+ALFPGAAGC+LLCFIQE+VVYLKEKFKF
Subjt:  ASMAFMIYTGSLHPPAASLPILFIDGAKLQHLNFWFALFPGAAGCILLCFIQELVVYLKEKFKF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G47980.1 Integral membrane HPP family protein3.6e-6356.49Show/hide
Query:  LQAPSASLLPNHSFVSLLPNCHFLNGIQ--GISADGLLR----------RRRRGVGIGRWSIVASGIAGVPIS-DGSKPEKVSVSPPLSDILWPSAGAFA
        LQ  S +L+   S V++    H   G+   G+  D  +R          RRR     G    VAS      +S +  KPEK +V+P LSD++WP+AGAFA
Subjt:  LQAPSASLLPNHSFVSLLPNCHFLNGIQ--GISADGLLR----------RRRRGVGIGRWSIVASGIAGVPIS-DGSKPEKVSVSPPLSDILWPSAGAFA

Query:  AMAVLGKMDQILAAKGLSMTIAPLGAVCAVLFATPSSPAARKYNIFMAQIGCAAIGVLALALLGPGWLARSSALAASMAFMIYTGSLHPPAASLPILFID
        AMA++G++DQ+L  KG+SM++APLGAV A+LF TPS+PAARKYN+F AQIGCAAIGVLA +  GP WLARS+ALAAS+AFM+ T + HPPAASLP+LFID
Subjt:  AMAVLGKMDQILAAKGLSMTIAPLGAVCAVLFATPSSPAARKYNIFMAQIGCAAIGVLALALLGPGWLARSSALAASMAFMIYTGSLHPPAASLPILFID

Query:  GAKLQHLNFWFALFPGAAGCILLCFIQELVVYLKEKFKF
        GAKL  LNFW+ALFPGAA CILLCF+Q +V YLKE  KF
Subjt:  GAKLQHLNFWFALFPGAAGCILLCFIQELVVYLKEKFKF

AT5G62720.1 Integral membrane HPP family protein4.0e-6264.25Show/hide
Query:  IVASGIAGVPISDGSKPEKVSVSPP--LSDILWPSAGAFAAMAVLGKMDQILAAKGLSMTIAPLGAVCAVLFATPSSPAARKYNIFMAQIGCAAIGVLAL
        + ++G    P  D  KP+K + +    LSD++WP+AGAFAAMA+LG+MDQ+L+ KG+SM++APLGAV A+LF TPS+PAARKYNIF+AQIGCAAIGV+A 
Subjt:  IVASGIAGVPISDGSKPEKVSVSPP--LSDILWPSAGAFAAMAVLGKMDQILAAKGLSMTIAPLGAVCAVLFATPSSPAARKYNIFMAQIGCAAIGVLAL

Query:  ALLGPGWLARSSALAASMAFMIYTGSLHPPAASLPILFIDGAKLQHLNFWFALFPGAAGCILLCFIQELVVYLKEKFKF
        ++ GPGWLARS ALAAS+AFM+ T + HPPAASLP++FIDGAK  HLNFW+ALFPGAA C++LC +Q +V YLKE  KF
Subjt:  ALLGPGWLARSSALAASMAFMIYTGSLHPPAASLPILFIDGAKLQHLNFWFALFPGAAGCILLCFIQELVVYLKEKFKF

AT5G62720.2 Integral membrane HPP family protein4.7e-3961.19Show/hide
Query:  IVASGIAGVPISDGSKPEKVSVSPP--LSDILWPSAGAFAAMAVLGKMDQILAAKGLSMTIAPLGAVCAVLFATPSSPAARKYNIFMAQIGCAAIGVLAL
        + ++G    P  D  KP+K + +    LSD++WP+AGAFAAMA+LG+MDQ+L+ KG+SM++APLGAV A+LF TPS+PAARKYNIF+AQIGCAAIGV+A 
Subjt:  IVASGIAGVPISDGSKPEKVSVSPP--LSDILWPSAGAFAAMAVLGKMDQILAAKGLSMTIAPLGAVCAVLFATPSSPAARKYNIFMAQIGCAAIGVLAL

Query:  ALLGPGWLARSSALAASMAFMIYTGSLHPPAASL
        ++ GPGWLARS ALAAS+AFM+ T + HPP   L
Subjt:  ALLGPGWLARSSALAASMAFMIYTGSLHPPAASL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCCTGCAATTGAAGCCGATCCATCACTATTTCTGTCACCAACAATTTCAACCCAGTTACAGAGTAGAATTACAAGCTCCATCGGCGTCTCTGCTCCCAAATCATTC
ATTTGTTTCTTTGCTGCCGAATTGCCATTTCTTGAATGGAATACAAGGGATTTCAGCGGATGGGTTGTTGAGAAGACGAAGGCGCGGTGTCGGAATCGGTCGCTGGAGCA
TTGTAGCGTCTGGCATTGCTGGTGTACCGATATCAGATGGGTCGAAACCCGAAAAGGTCTCTGTTTCTCCTCCTCTCAGTGACATCCTTTGGCCTTCTGCAGGGGCATTT
GCAGCAATGGCAGTACTGGGAAAAATGGATCAAATCCTAGCAGCAAAGGGGCTTTCGATGACAATTGCACCACTTGGAGCCGTCTGTGCCGTCCTGTTCGCAACACCATC
ATCCCCTGCCGCTCGAAAGTACAACATCTTCATGGCACAAATTGGGTGTGCGGCAATCGGGGTTTTGGCGCTTGCCTTGTTGGGGCCGGGATGGCTGGCTCGAAGCTCTG
CTCTTGCTGCTTCAATGGCGTTCATGATCTATACTGGTTCATTGCATCCGCCAGCTGCAAGTTTGCCGATTCTGTTCATCGATGGAGCTAAGTTGCAGCATCTGAATTTC
TGGTTTGCTCTGTTTCCGGGAGCTGCTGGATGTATTCTTCTTTGCTTCATACAAGAATTAGTGGTGTACTTGAAGGAAAAATTCAAATTCTAA
mRNA sequenceShow/hide mRNA sequence
ATGAGCCTGCAATTGAAGCCGATCCATCACTATTTCTGTCACCAACAATTTCAACCCAGTTACAGAGTAGAATTACAAGCTCCATCGGCGTCTCTGCTCCCAAATCATTC
ATTTGTTTCTTTGCTGCCGAATTGCCATTTCTTGAATGGAATACAAGGGATTTCAGCGGATGGGTTGTTGAGAAGACGAAGGCGCGGTGTCGGAATCGGTCGCTGGAGCA
TTGTAGCGTCTGGCATTGCTGGTGTACCGATATCAGATGGGTCGAAACCCGAAAAGGTCTCTGTTTCTCCTCCTCTCAGTGACATCCTTTGGCCTTCTGCAGGGGCATTT
GCAGCAATGGCAGTACTGGGAAAAATGGATCAAATCCTAGCAGCAAAGGGGCTTTCGATGACAATTGCACCACTTGGAGCCGTCTGTGCCGTCCTGTTCGCAACACCATC
ATCCCCTGCCGCTCGAAAGTACAACATCTTCATGGCACAAATTGGGTGTGCGGCAATCGGGGTTTTGGCGCTTGCCTTGTTGGGGCCGGGATGGCTGGCTCGAAGCTCTG
CTCTTGCTGCTTCAATGGCGTTCATGATCTATACTGGTTCATTGCATCCGCCAGCTGCAAGTTTGCCGATTCTGTTCATCGATGGAGCTAAGTTGCAGCATCTGAATTTC
TGGTTTGCTCTGTTTCCGGGAGCTGCTGGATGTATTCTTCTTTGCTTCATACAAGAATTAGTGGTGTACTTGAAGGAAAAATTCAAATTCTAA
Protein sequenceShow/hide protein sequence
MSLQLKPIHHYFCHQQFQPSYRVELQAPSASLLPNHSFVSLLPNCHFLNGIQGISADGLLRRRRRGVGIGRWSIVASGIAGVPISDGSKPEKVSVSPPLSDILWPSAGAF
AAMAVLGKMDQILAAKGLSMTIAPLGAVCAVLFATPSSPAARKYNIFMAQIGCAAIGVLALALLGPGWLARSSALAASMAFMIYTGSLHPPAASLPILFIDGAKLQHLNF
WFALFPGAAGCILLCFIQELVVYLKEKFKF