; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0022347 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0022347
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
DescriptionIntegral membrane HPP family protein
Genome locationchr12:25546884..25550650
RNA-Seq ExpressionPay0022347
SyntenyPay0022347
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR007065 - HPP


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008437643.1 PREDICTED: uncharacterized protein LOC103482985 [Cucumis melo]6.1e-14097.72Show/hide
Query:  MSLQLKPIHHHLHHHGGHHCPKPYKPSYREKIQAPSACMLNHSFVSLLPICHLLNGKRGISVRSLGLFNDWRRRRSRGSDGIGHRSIVASSIAGTPVSDE
        MSLQLKPIHHHLHHHGG HC KPY+PSYREKIQAPSACMLNHS VSLLPICHLLNGKRGI VRSLGLFNDWRRRRSRGSDGIGHRSIVASSIAGTPVSD 
Subjt:  MSLQLKPIHHHLHHHGGHHCPKPYKPSYREKIQAPSACMLNHSFVSLLPICHLLNGKRGISVRSLGLFNDWRRRRSRGSDGIGHRSIVASSIAGTPVSDE

Query:  SKPEKGFVSPPLSDILWPSAGAFAAMALLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSAPAARKYNIFMAQIGCAAIGVLAFTLLGPGWLARSSALAA
        SKPEKGFVSPPLSDILWPSAGAFAAMALLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSAPAARKYNIFMAQIGCAAIGVLAFTLLGPGWLARSSALAA
Subjt:  SKPEKGFVSPPLSDILWPSAGAFAAMALLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSAPAARKYNIFMAQIGCAAIGVLAFTLLGPGWLARSSALAA

Query:  SMAFMIYTGSTHPPAASLPILFIDGAKMQQLNFWYALFPGAAGCILLCLIQELVVLLKEKIKF
        SMAFMIYTGSTHPPAASLPILFIDGAKMQQLNFWYALFPGAAGCILLCLIQELVVLLKEKIKF
Subjt:  SMAFMIYTGSTHPPAASLPILFIDGAKMQQLNFWYALFPGAAGCILLCLIQELVVLLKEKIKF

XP_011651222.2 uncharacterized protein LOC105434855 [Cucumis sativus]2.5e-13393.58Show/hide
Query:  MSLQLKPIHHHLHHHGGHHC--PKPYKPSYREKIQAPSACMLNHSFVSLLPICHLLNGKRGISVRSLGLFNDWRRRRSRGSDGIGHRSIVASSIAGTPVS
        MSLQLKPIHHHLHH+GG HC   +PY+PSY E IQ PS CMLNHSFVSLLP CHLLNGKRGIS RSLGLFNDWRRRR+RGSD IGHRSIVASSIAGTPVS
Subjt:  MSLQLKPIHHHLHHHGGHHC--PKPYKPSYREKIQAPSACMLNHSFVSLLPICHLLNGKRGISVRSLGLFNDWRRRRSRGSDGIGHRSIVASSIAGTPVS

Query:  DESKPEKGFVSPPLSDILWPSAGAFAAMALLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSAPAARKYNIFMAQIGCAAIGVLAFTLLGPGWLARSSAL
        D SKPEKGFVSPPLSDILWPSAGAFAAMALLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSAPAARKYNIF+AQIGCAAIGVLAFTLLGPGWLARSSAL
Subjt:  DESKPEKGFVSPPLSDILWPSAGAFAAMALLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSAPAARKYNIFMAQIGCAAIGVLAFTLLGPGWLARSSAL

Query:  AASMAFMIYTGSTHPPAASLPILFIDGAKMQQLNFWYALFPGAAGCILLCLIQELVVLLKEKIKF
        AASMAFMIYTGSTHPPAASLPILFIDGAKMQQLNFWYALFPGAAGCILLCLIQELVVLLKEKIKF
Subjt:  AASMAFMIYTGSTHPPAASLPILFIDGAKMQQLNFWYALFPGAAGCILLCLIQELVVLLKEKIKF

XP_023001510.1 uncharacterized protein LOC111495629 [Cucurbita maxima]1.2e-10376.67Show/hide
Query:  MSLQLKPIHHHLHHHGGHHCPKPYKPSYREKIQAPSACMLNHSFVSLLPICHLLNGKRGI----SVRSLG-LFNDWRRRRS--RGSDGIGHRSIVASSIA
        M+LQLKPIH        H   +PY+PS+R          +NHSF+SLLP CHLLNG RG     SVR LG L ND RRRR+   G DGIG+RSIVAS IA
Subjt:  MSLQLKPIHHHLHHHGGHHCPKPYKPSYREKIQAPSACMLNHSFVSLLPICHLLNGKRGI----SVRSLG-LFNDWRRRRS--RGSDGIGHRSIVASSIA

Query:  GTPVSDESKPEKGFVSPPLSDILWPSAGAFAAMALLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSAPAARKYNIFMAQIGCAAIGVLAFTLLGPGWLA
        G P+SD SKP+KGFVSPPLSDILWPSAGAFAAMA+LGKMDQ+LAPKGLSMTIAPLGAVCA+LFA PS+PAARKYN+FMAQIGCAAIGVLAFTLLGPGWLA
Subjt:  GTPVSDESKPEKGFVSPPLSDILWPSAGAFAAMALLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSAPAARKYNIFMAQIGCAAIGVLAFTLLGPGWLA

Query:  RSSALAASMAFMIYTGSTHPPAASLPILFIDGAKMQQLNFWYALFPGAAGCILLCLIQELVVLLKEKIKF
        RSSALAASMAFMIYTGSTHPPAASLP++FIDGAKMQ LNFWYALFPGAAGC+LLC IQE+VV LKEK KF
Subjt:  RSSALAASMAFMIYTGSTHPPAASLPILFIDGAKMQQLNFWYALFPGAAGCILLCLIQELVVLLKEKIKF

XP_023519271.1 uncharacterized protein LOC111782708 isoform X1 [Cucurbita pepo subsp. pepo]1.2e-10376.95Show/hide
Query:  MSLQLKPIHHHLHHHGGHHCPKPYKPSYREKIQAPSACMLNHSFVSLLPICHLLNGKRGI----SVRSLG-LFNDWRRRRS-RGSDGIGHRSIVASSIAG
        MSLQLKPIH        H   +PY+PS+R          +NHSF+SLLP CHLLNGKRG+    SVR LG L ND RRRR+  G  G+ +RSIVAS IAG
Subjt:  MSLQLKPIHHHLHHHGGHHCPKPYKPSYREKIQAPSACMLNHSFVSLLPICHLLNGKRGI----SVRSLG-LFNDWRRRRS-RGSDGIGHRSIVASSIAG

Query:  TPVSDESKPEKGFVSPPLSDILWPSAGAFAAMALLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSAPAARKYNIFMAQIGCAAIGVLAFTLLGPGWLAR
         P+SD SKP+KGFVSPPLSDILWPSAGAFAAMA+LGKMDQILAPKGLSMTIAPLGAVCA+LFA PS+PAARKYN+FMAQIGCAAIGVLAFTLLGPGWLAR
Subjt:  TPVSDESKPEKGFVSPPLSDILWPSAGAFAAMALLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSAPAARKYNIFMAQIGCAAIGVLAFTLLGPGWLAR

Query:  SSALAASMAFMIYTGSTHPPAASLPILFIDGAKMQQLNFWYALFPGAAGCILLCLIQELVVLLKEKIKF
        SSALAASMAFMIYTGSTHPPAASLP++FIDGAKMQ LNFWYALFPGAAGC+LLC IQE+VV LKEK KF
Subjt:  SSALAASMAFMIYTGSTHPPAASLPILFIDGAKMQQLNFWYALFPGAAGCILLCLIQELVVLLKEKIKF

XP_038894638.1 uncharacterized protein LOC120083135 [Benincasa hispida]5.9e-11985.71Show/hide
Query:  MSLQLKPIHHHLHHHGGHHC--PKPYKPSYREKIQAPSACM-LNHSFVSLLPICHLLNGKRGISVRSLGLFNDWRRRRSRGSDGIGHRSIVASSIAGTPV
        MSLQLKPIHHHLHHHG  HC   +PY+PSYR +IQAPSA + LNHSFVSLLP CHLLNG RG+   SLGLFN+ R+RR  G   IGHR IVAS IAGTP+
Subjt:  MSLQLKPIHHHLHHHGGHHC--PKPYKPSYREKIQAPSACM-LNHSFVSLLPICHLLNGKRGISVRSLGLFNDWRRRRSRGSDGIGHRSIVASSIAGTPV

Query:  SDESKPEKGFVSPPLSDILWPSAGAFAAMALLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSAPAARKYNIFMAQIGCAAIGVLAFTLLGPGWLARSSA
        SD SK EKGFVSPPLSDILWPSAGAFAAMA+LGKMDQILAPKGLSMTIAPLGAVCAVLFATPS+PAARKYN+F+AQIGCAAIGVLAFTLLGPGWLARSSA
Subjt:  SDESKPEKGFVSPPLSDILWPSAGAFAAMALLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSAPAARKYNIFMAQIGCAAIGVLAFTLLGPGWLARSSA

Query:  LAASMAFMIYTGSTHPPAASLPILFIDGAKMQQLNFWYALFPGAAGCILLCLIQELVVLLKEKIKF
        LAASMAFMIYTGSTHPPAASLPILFIDGAKMQQLNFWYALFPGAAGCILLCLIQELV+ LKEK KF
Subjt:  LAASMAFMIYTGSTHPPAASLPILFIDGAKMQQLNFWYALFPGAAGCILLCLIQELVVLLKEKIKF

TrEMBL top hitse value%identityAlignment
A0A0A0LR37 Uncharacterized protein1.7e-13293.21Show/hide
Query:  MSLQLKPIHHHLHHHGGHHC--PKPYKPSYREKIQAPSACMLNHSFVSLLPICHLLNGKRGISVRSLGLFNDWRRRRSRGSDGIGHRSIVASSIAGTPVS
        MSLQLKPIHHHLHH+GG  C   +PY+PSY E IQ PS CMLNHSFVSLLP CHLLNGKRGIS RSLGLFNDWRRRR+RGSD IGHRSIVASSIAGTPVS
Subjt:  MSLQLKPIHHHLHHHGGHHC--PKPYKPSYREKIQAPSACMLNHSFVSLLPICHLLNGKRGISVRSLGLFNDWRRRRSRGSDGIGHRSIVASSIAGTPVS

Query:  DESKPEKGFVSPPLSDILWPSAGAFAAMALLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSAPAARKYNIFMAQIGCAAIGVLAFTLLGPGWLARSSAL
        D SKPEKGFVSPPLSDILWPSAGAFAAMALLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSAPAARKYNIF+AQIGCAAIGVLAFTLLGPGWLARSSAL
Subjt:  DESKPEKGFVSPPLSDILWPSAGAFAAMALLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSAPAARKYNIFMAQIGCAAIGVLAFTLLGPGWLARSSAL

Query:  AASMAFMIYTGSTHPPAASLPILFIDGAKMQQLNFWYALFPGAAGCILLCLIQELVVLLKEKIKF
        AASMAFMIYTGSTHPPAASLPILFIDGAKMQQLNFWYALFPGAAGCILLCLIQELVVLLKEKIKF
Subjt:  AASMAFMIYTGSTHPPAASLPILFIDGAKMQQLNFWYALFPGAAGCILLCLIQELVVLLKEKIKF

A0A1S3AUM8 uncharacterized protein LOC1034829853.0e-14097.72Show/hide
Query:  MSLQLKPIHHHLHHHGGHHCPKPYKPSYREKIQAPSACMLNHSFVSLLPICHLLNGKRGISVRSLGLFNDWRRRRSRGSDGIGHRSIVASSIAGTPVSDE
        MSLQLKPIHHHLHHHGG HC KPY+PSYREKIQAPSACMLNHS VSLLPICHLLNGKRGI VRSLGLFNDWRRRRSRGSDGIGHRSIVASSIAGTPVSD 
Subjt:  MSLQLKPIHHHLHHHGGHHCPKPYKPSYREKIQAPSACMLNHSFVSLLPICHLLNGKRGISVRSLGLFNDWRRRRSRGSDGIGHRSIVASSIAGTPVSDE

Query:  SKPEKGFVSPPLSDILWPSAGAFAAMALLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSAPAARKYNIFMAQIGCAAIGVLAFTLLGPGWLARSSALAA
        SKPEKGFVSPPLSDILWPSAGAFAAMALLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSAPAARKYNIFMAQIGCAAIGVLAFTLLGPGWLARSSALAA
Subjt:  SKPEKGFVSPPLSDILWPSAGAFAAMALLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSAPAARKYNIFMAQIGCAAIGVLAFTLLGPGWLARSSALAA

Query:  SMAFMIYTGSTHPPAASLPILFIDGAKMQQLNFWYALFPGAAGCILLCLIQELVVLLKEKIKF
        SMAFMIYTGSTHPPAASLPILFIDGAKMQQLNFWYALFPGAAGCILLCLIQELVVLLKEKIKF
Subjt:  SMAFMIYTGSTHPPAASLPILFIDGAKMQQLNFWYALFPGAAGCILLCLIQELVVLLKEKIKF

A0A6J1E7R0 uncharacterized protein LOC111431576 isoform X12.2e-10376.95Show/hide
Query:  MSLQLKPIHHHLHHHGGHHCPKPYKPSYREKIQAPSACMLNHSFVSLLPICHLLNGKRGI----SVRSLG-LFNDWRRRRS-RGSDGIGHRSIVASSIAG
        MSLQLKPIH        H   +PY+PS+R          +NHSF+SLLP CHLLNGKRG+    SVR LG L ND RRRR+  G  G G+RSIVAS IA 
Subjt:  MSLQLKPIHHHLHHHGGHHCPKPYKPSYREKIQAPSACMLNHSFVSLLPICHLLNGKRGI----SVRSLG-LFNDWRRRRS-RGSDGIGHRSIVASSIAG

Query:  TPVSDESKPEKGFVSPPLSDILWPSAGAFAAMALLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSAPAARKYNIFMAQIGCAAIGVLAFTLLGPGWLAR
         P+SD SKP+KGFVSPPLSDILWPSAGAFAAMA+LGKMDQILAPKGLSMTIAPLGAVCA+LFA PS+PAARKYN+FMAQIGCAAIGVLAFTLLGPGWLAR
Subjt:  TPVSDESKPEKGFVSPPLSDILWPSAGAFAAMALLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSAPAARKYNIFMAQIGCAAIGVLAFTLLGPGWLAR

Query:  SSALAASMAFMIYTGSTHPPAASLPILFIDGAKMQQLNFWYALFPGAAGCILLCLIQELVVLLKEKIKF
        SSALAASMAFMIYTGSTHPPAASLP++FIDGAKMQ LNFWYALFPGAAGC+LLC IQE+VV LKEK KF
Subjt:  SSALAASMAFMIYTGSTHPPAASLPILFIDGAKMQQLNFWYALFPGAAGCILLCLIQELVVLLKEKIKF

A0A6J1EB70 uncharacterized protein LOC111431576 isoform X22.7e-10176.58Show/hide
Query:  MSLQLKPIHHHLHHHGGHHCPKPYKPSYREKIQAPSACMLNHSFVSLLPICHLLNGKRGI----SVRSLG-LFNDWRRRRS-RGSDGIGHRSIVASSIAG
        MSLQLKPIH        H   +PY+PS+R          +NHSF+SLLP CHLLNGKRG+    SVR LG L ND RRRR+  G  G G+RSIVAS IA 
Subjt:  MSLQLKPIHHHLHHHGGHHCPKPYKPSYREKIQAPSACMLNHSFVSLLPICHLLNGKRGI----SVRSLG-LFNDWRRRRS-RGSDGIGHRSIVASSIAG

Query:  TPVSDESKPEKGFVSPPLSDILWPSAGAFAAMALLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSAPAARKYNIFMAQIGCAAIGVLAFTLLGPGWLAR
         P+SD SKP+KGFVSPPLSDILWPSAGAFAAMA+LGKMDQILAPKGLSMTIAPLGAVCA+LFA PS+PAAR YN+FMAQIGCAAIGVLAFTLLGPGWLAR
Subjt:  TPVSDESKPEKGFVSPPLSDILWPSAGAFAAMALLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSAPAARKYNIFMAQIGCAAIGVLAFTLLGPGWLAR

Query:  SSALAASMAFMIYTGSTHPPAASLPILFIDGAKMQQLNFWYALFPGAAGCILLCLIQELVVLLKEKIKF
        SSALAASMAFMIYTGSTHPPAASLP++FIDGAKMQ LNFWYALFPGAAGC+LLC IQE+VV LKEK KF
Subjt:  SSALAASMAFMIYTGSTHPPAASLPILFIDGAKMQQLNFWYALFPGAAGCILLCLIQELVVLLKEKIKF

A0A6J1KLD7 uncharacterized protein LOC1114956295.8e-10476.67Show/hide
Query:  MSLQLKPIHHHLHHHGGHHCPKPYKPSYREKIQAPSACMLNHSFVSLLPICHLLNGKRGI----SVRSLG-LFNDWRRRRS--RGSDGIGHRSIVASSIA
        M+LQLKPIH        H   +PY+PS+R          +NHSF+SLLP CHLLNG RG     SVR LG L ND RRRR+   G DGIG+RSIVAS IA
Subjt:  MSLQLKPIHHHLHHHGGHHCPKPYKPSYREKIQAPSACMLNHSFVSLLPICHLLNGKRGI----SVRSLG-LFNDWRRRRS--RGSDGIGHRSIVASSIA

Query:  GTPVSDESKPEKGFVSPPLSDILWPSAGAFAAMALLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSAPAARKYNIFMAQIGCAAIGVLAFTLLGPGWLA
        G P+SD SKP+KGFVSPPLSDILWPSAGAFAAMA+LGKMDQ+LAPKGLSMTIAPLGAVCA+LFA PS+PAARKYN+FMAQIGCAAIGVLAFTLLGPGWLA
Subjt:  GTPVSDESKPEKGFVSPPLSDILWPSAGAFAAMALLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSAPAARKYNIFMAQIGCAAIGVLAFTLLGPGWLA

Query:  RSSALAASMAFMIYTGSTHPPAASLPILFIDGAKMQQLNFWYALFPGAAGCILLCLIQELVVLLKEKIKF
        RSSALAASMAFMIYTGSTHPPAASLP++FIDGAKMQ LNFWYALFPGAAGC+LLC IQE+VV LKEK KF
Subjt:  RSSALAASMAFMIYTGSTHPPAASLPILFIDGAKMQQLNFWYALFPGAAGCILLCLIQELVVLLKEKIKF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G47980.1 Integral membrane HPP family protein2.4e-6567.89Show/hide
Query:  RSRGSDGIGHRSIVASSIAGTPVSDES-KPEKGFVSPPLSDILWPSAGAFAAMALLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSAPAARKYNIFMAQ
        R R S   G    VASS     VS ES KPEK  V+P LSD++WP+AGAFAAMA++G++DQ+L PKG+SM++APLGAV A+LF TPSAPAARKYN+F AQ
Subjt:  RSRGSDGIGHRSIVASSIAGTPVSDES-KPEKGFVSPPLSDILWPSAGAFAAMALLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSAPAARKYNIFMAQ

Query:  IGCAAIGVLAFTLLGPGWLARSSALAASMAFMIYTGSTHPPAASLPILFIDGAKMQQLNFWYALFPGAAGCILLCLIQELVVLLKEKIKF
        IGCAAIGVLAF+  GP WLARS+ALAAS+AFM+ T + HPPAASLP+LFIDGAK+ +LNFWYALFPGAA CILLC +Q +V  LKE +KF
Subjt:  IGCAAIGVLAFTLLGPGWLARSSALAASMAFMIYTGSTHPPAASLPILFIDGAKMQQLNFWYALFPGAAGCILLCLIQELVVLLKEKIKF

AT5G62720.1 Integral membrane HPP family protein4.4e-6464.55Show/hide
Query:  IGHRSIVASSIAG-----TPVSDESKPEKGFVSPP--LSDILWPSAGAFAAMALLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSAPAARKYNIFMAQI
        + HR  V++ +A       P  D  KP+K   +    LSD++WP+AGAFAAMALLG+MDQ+L+PKG+SM++APLGAV A+LF TPSAPAARKYNIF+AQI
Subjt:  IGHRSIVASSIAG-----TPVSDESKPEKGFVSPP--LSDILWPSAGAFAAMALLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSAPAARKYNIFMAQI

Query:  GCAAIGVLAFTLLGPGWLARSSALAASMAFMIYTGSTHPPAASLPILFIDGAKMQQLNFWYALFPGAAGCILLCLIQELVVLLKEKIKF
        GCAAIGV+AF++ GPGWLARS ALAAS+AFM+ T + HPPAASLP++FIDGAK   LNFWYALFPGAA C++LCL+Q +V  LKE +KF
Subjt:  GCAAIGVLAFTLLGPGWLARSSALAASMAFMIYTGSTHPPAASLPILFIDGAKMQQLNFWYALFPGAAGCILLCLIQELVVLLKEKIKF

AT5G62720.2 Integral membrane HPP family protein4.8e-4261.81Show/hide
Query:  IGHRSIVASSIAG-----TPVSDESKPEKGFVSPP--LSDILWPSAGAFAAMALLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSAPAARKYNIFMAQI
        + HR  V++ +A       P  D  KP+K   +    LSD++WP+AGAFAAMALLG+MDQ+L+PKG+SM++APLGAV A+LF TPSAPAARKYNIF+AQI
Subjt:  IGHRSIVASSIAG-----TPVSDESKPEKGFVSPP--LSDILWPSAGAFAAMALLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSAPAARKYNIFMAQI

Query:  GCAAIGVLAFTLLGPGWLARSSALAASMAFMIYTGSTHPPAASL
        GCAAIGV+AF++ GPGWLARS ALAAS+AFM+ T + HPP   L
Subjt:  GCAAIGVLAFTLLGPGWLARSSALAASMAFMIYTGSTHPPAASL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCCTGCAATTGAAGCCAATTCATCACCACCTCCACCACCATGGCGGCCACCATTGCCCCAAGCCGTATAAACCCAGTTACCGAGAAAAAATCCAAGCTCCATCCGC
ATGCATGCTCAACCATTCATTCGTTTCGTTGCTGCCGATTTGCCATTTATTGAATGGAAAACGAGGGATTTCCGTTAGGTCGCTGGGATTATTCAACGATTGGAGAAGAA
GACGAAGCAGGGGCAGCGACGGAATCGGTCACCGGAGTATTGTGGCGTCCAGCATTGCTGGGACACCGGTTTCAGATGAGTCAAAACCAGAGAAAGGCTTTGTTTCTCCT
CCCCTCAGTGACATCCTTTGGCCTTCTGCAGGGGCATTTGCAGCAATGGCATTGCTGGGGAAAATGGATCAGATTCTAGCGCCAAAGGGGCTTTCTATGACAATTGCACC
ATTAGGAGCCGTCTGTGCTGTCCTGTTCGCAACACCCTCAGCCCCTGCAGCTCGAAAATACAACATTTTCATGGCTCAGATTGGGTGTGCGGCAATTGGGGTTTTGGCGT
TCACTTTGTTGGGGCCTGGATGGCTGGCTAGAAGCTCTGCTTTGGCTGCATCAATGGCGTTTATGATCTATACTGGTTCGACGCACCCACCAGCTGCAAGTTTGCCGATA
CTGTTCATCGATGGAGCTAAGATGCAACAGCTCAATTTCTGGTACGCTCTGTTTCCCGGTGCCGCCGGCTGTATTCTCCTTTGCTTAATACAAGAGCTAGTGGTGTTGTT
GAAGGAGAAGATCAAATTTTGA
mRNA sequenceShow/hide mRNA sequence
GATTTTTTTAGTAATCCTAGATAAGGACAAGCACGATGGGACAGAGAAAAATAGAGAGGCTGCTATTGTTTATTCATTTTCCCATTATTAATGTGATTTAGCCTTTGGGT
TTCTTTTTAATGTTTGAGAAGTGGCAAAGATACAGAAATGGAGTTGTCTCTTTATGGGAATGACCTTTGGACGTCGTCAAAGGTCAAGTCTGCAACTCAGCTTCGAGTAT
TTTTTTCAAGCCTCTGCCATTCTTTCATAACTTTTCCCCAAATCTCATTCCCCTCCAAATGGTGTCTTTGTCAGATTCACGTTTGAATTTACATTGCATGCTATTAATAT
ATTGACAATATTCTATTTTCTATATATTTTTATTTTATATCACGAACTTCGTTTTGCGGCTTTTGCCATACTTTCTCTGTTTCTCTCGATCTTCCCCATATTTTGTCGCC
AAATATCCAAAAAACGTCAGGTTGAGATAGAACTACAGAGGGTTGGACGTGGGTCAAGGGTAGAATCCGGAATGAGCCTGCAATTGAAGCCAATTCATCACCACCTCCAC
CACCATGGCGGCCACCATTGCCCCAAGCCGTATAAACCCAGTTACCGAGAAAAAATCCAAGCTCCATCCGCATGCATGCTCAACCATTCATTCGTTTCGTTGCTGCCGAT
TTGCCATTTATTGAATGGAAAACGAGGGATTTCCGTTAGGTCGCTGGGATTATTCAACGATTGGAGAAGAAGACGAAGCAGGGGCAGCGACGGAATCGGTCACCGGAGTA
TTGTGGCGTCCAGCATTGCTGGGACACCGGTTTCAGATGAGTCAAAACCAGAGAAAGGCTTTGTTTCTCCTCCCCTCAGTGACATCCTTTGGCCTTCTGCAGGGGCATTT
GCAGCAATGGCATTGCTGGGGAAAATGGATCAGATTCTAGCGCCAAAGGGGCTTTCTATGACAATTGCACCATTAGGAGCCGTCTGTGCTGTCCTGTTCGCAACACCCTC
AGCCCCTGCAGCTCGAAAATACAACATTTTCATGGCTCAGATTGGGTGTGCGGCAATTGGGGTTTTGGCGTTCACTTTGTTGGGGCCTGGATGGCTGGCTAGAAGCTCTG
CTTTGGCTGCATCAATGGCGTTTATGATCTATACTGGTTCGACGCACCCACCAGCTGCAAGTTTGCCGATACTGTTCATCGATGGAGCTAAGATGCAACAGCTCAATTTC
TGGTACGCTCTGTTTCCCGGTGCCGCCGGCTGTATTCTCCTTTGCTTAATACAAGAGCTAGTGGTGTTGTTGAAGGAGAAGATCAAATTTTGAAGTGTGGGAGAAAGAGT
TGAACAGCCATTATTGAAGGTCTCCCATAGCCTTATGATCTGATGGAAGAAAACTAATTCATTCCCCCAAACCAAATATTTTGGACCCATCTTCATAAATTCAACCTCAC
ACACACCACATTCAATCAATTGCCTCTTCATTCATTCTTAATTTAAAGTTGTAAATTTCACCCCCCCACCCACCATACTTTGTGTTTTTGTTTTTGTATCACTAAAAGGT
TTCTTGCTTTGGGTTATGAAAGTATGATGTGATAAGTTCTTTTTTTCCTCAAAATGAGTCCAACTACATTTATGGTTAAAATGGGTTTGGCTTAGTAGTGTATATATATG
TTGACATCCATGCTGAAGACTGTTGACTAGATTGAGATATATGTATGGGCAAAACGACGGGGGTTTGAATGTTGATGGATGTACTCATATTATTTTGCACATTTTCTATA
GTCAAGTTTTGAAATTAATACTACTCCAAAATACACAAAAGGAAAATGCCTATTCCTATAGTTTTATCGATT
Protein sequenceShow/hide protein sequence
MSLQLKPIHHHLHHHGGHHCPKPYKPSYREKIQAPSACMLNHSFVSLLPICHLLNGKRGISVRSLGLFNDWRRRRSRGSDGIGHRSIVASSIAGTPVSDESKPEKGFVSP
PLSDILWPSAGAFAAMALLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSAPAARKYNIFMAQIGCAAIGVLAFTLLGPGWLARSSALAASMAFMIYTGSTHPPAASLPI
LFIDGAKMQQLNFWYALFPGAAGCILLCLIQELVVLLKEKIKF