CuGenDBv2

Clc01G14200 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc01G14200
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: Integral membrane HPP family protein
Genome location: ClcChr01:27031879..27034065
RNA-Seq Expression: Clc01G14200
Synteny: Clc01G14200
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR007065 - HPP
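
The genome location field uses a `seqid:start..end` layout. A minimal Python sketch for reading it is given here; the helper names `parse_location` and `span_length` are hypothetical, and the span calculation assumes 1-based, inclusive coordinates, which this record does not state.

```python
import re


def parse_location(loc: str):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc.strip())
    if m is None:
        raise ValueError(f"unrecognised location: {loc!r}")
    seqid, start, end = m.group(1), int(m.group(2)), int(m.group(3))
    if start > end:
        raise ValueError("start is greater than end")
    return seqid, start, end


def span_length(start: int, end: int) -> int:
    # Assumption: coordinates are 1-based and inclusive on both ends.
    return end - start + 1


seqid, start, end = parse_location("ClcChr01:27031879..27034065")
print(seqid, start, end, span_length(start, end))
# prints: ClcChr01 27031879 27034065 2187
```

Under that coordinate assumption the gene spans 2187 bp on ClcChr01.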


Homology
GenBank top hits
XP_008437643.1 PREDICTED: uncharacterized protein LOC103482985 [Cucumis melo] (e value: 3.3e-114; % identity: 83.77)
Query:  MSLQLKPIHHHLHHRGLRHCHSQEPYRSSFRVQIALSASLHLNHSFISLLPDFHLLNGKRGI---SLGLFNG-RRRRCGGGGRISHRSIAASGIAGTPVS
        MSLQLKPIHHHLHH G RHCH  +PY+ S+R +I   ++  LNHS +SLLP  HLLNGKRGI   SLGLFN  RRRR  G   I HRSI AS IAGTPVS
Subjt:  MSLQLKPIHHHLHHRGLRHCHSQEPYRSSFRVQIALSASLHLNHSFISLLPDFHLLNGKRGI---SLGLFNG-RRRRCGGGGRISHRSIAASGIAGTPVS

Query:  DGSKPDKGFVSPPLSDILWPSAGAFVAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSSPAARKYNMFMAQIGCAAIGVLAFTLLGPGWLARSSAL
        DGSKP+KGFVSPPLSDILWPSAGAF AMA+LGKMDQILAPKGLSMTIAPLGAVCAVLFATPS+PAARKYN+FMAQIGCAAIGVLAFTLLGPGWLARSSAL
Subjt:  DGSKPDKGFVSPPLSDILWPSAGAFVAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSSPAARKYNMFMAQIGCAAIGVLAFTLLGPGWLARSSAL

Query:  AASMAFMIYTGSTHPPAASLPILFIDGAKMQQLNFWYALFPGAAGCVLLCLVQELVVYLKEKFKF
        AASMAFMIYTGSTHPPAASLPILFIDGAKMQQLNFWYALFPGAAGC+LLCL+QELVV LKEK KF
Subjt:  AASMAFMIYTGSTHPPAASLPILFIDGAKMQQLNFWYALFPGAAGCVLLCLVQELVVYLKEKFKF

XP_011651222.2 uncharacterized protein LOC105434855 [Cucumis sativus] (e value: 2.1e-116; % identity: 84.15)
Query:  MSLQLKPIHHHLHHRGLRHCHSQEPYRSSFRVQIALSASLHLNHSFISLLPDFHLLNGKRGI---SLGLFNG-RRRRCGGGGRISHRSIAASGIAGTPVS
        MSLQLKPIHHHLHH G RHCH+ EPY+ S+   I + +   LNHSF+SLLP  HLLNGKRGI   SLGLFN  RRRR  G  RI HRSI AS IAGTPVS
Subjt:  MSLQLKPIHHHLHHRGLRHCHSQEPYRSSFRVQIALSASLHLNHSFISLLPDFHLLNGKRGI---SLGLFNG-RRRRCGGGGRISHRSIAASGIAGTPVS

Query:  DGSKPDKGFVSPPLSDILWPSAGAFVAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSSPAARKYNMFMAQIGCAAIGVLAFTLLGPGWLARSSAL
        DGSKP+KGFVSPPLSDILWPSAGAF AMA+LGKMDQILAPKGLSMTIAPLGAVCAVLFATPS+PAARKYN+F+AQIGCAAIGVLAFTLLGPGWLARSSAL
Subjt:  DGSKPDKGFVSPPLSDILWPSAGAFVAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSSPAARKYNMFMAQIGCAAIGVLAFTLLGPGWLARSSAL

Query:  AASMAFMIYTGSTHPPAASLPILFIDGAKMQQLNFWYALFPGAAGCVLLCLVQELVVYLKEKFKF
        AASMAFMIYTGSTHPPAASLPILFIDGAKMQQLNFWYALFPGAAGC+LLCL+QELVV LKEK KF
Subjt:  AASMAFMIYTGSTHPPAASLPILFIDGAKMQQLNFWYALFPGAAGCVLLCLVQELVVYLKEKFKF

XP_023519271.1 uncharacterized protein LOC111782708 isoform X1 [Cucurbita pepo subsp. pepo] (e value: 4.0e-107; % identity: 78.6)
Query:  MSLQLKPIHHHLHHRGLRHCHSQEPYRSSFRVQIALSASLHLNHSFISLLPDFHLLNGKRGISLG--------LFNGRRRR--CGGGGRISHRSIAASGI
        MSLQLKPIHH            Q+PY+ SFRV          NHSFISLLP+ HLLNGKRG+S+         L N RRRR   GGGG +S+RSI ASGI
Subjt:  MSLQLKPIHHHLHHRGLRHCHSQEPYRSSFRVQIALSASLHLNHSFISLLPDFHLLNGKRGISLG--------LFNGRRRR--CGGGGRISHRSIAASGI

Query:  AGTPVSDGSKPDKGFVSPPLSDILWPSAGAFVAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSSPAARKYNMFMAQIGCAAIGVLAFTLLGPGWL
        AG P+SDGSKPDKGFVSPPLSDILWPSAGAF AMAMLGKMDQILAPKGLSMTIAPLGAVCA+LFA PSSPAARKYNMFMAQIGCAAIGVLAFTLLGPGWL
Subjt:  AGTPVSDGSKPDKGFVSPPLSDILWPSAGAFVAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSSPAARKYNMFMAQIGCAAIGVLAFTLLGPGWL

Query:  ARSSALAASMAFMIYTGSTHPPAASLPILFIDGAKMQQLNFWYALFPGAAGCVLLCLVQELVVYLKEKFKF
        ARSSALAASMAFMIYTGSTHPPAASLP++FIDGAKMQ LNFWYALFPGAAGCVLLC +QE+VVYLKEKFKF
Subjt:  ARSSALAASMAFMIYTGSTHPPAASLPILFIDGAKMQQLNFWYALFPGAAGCVLLCLVQELVVYLKEKFKF

XP_023519272.1 uncharacterized protein LOC111782708 isoform X2 [Cucurbita pepo subsp. pepo] (e value: 3.7e-105; % identity: 78.23)
Query:  MSLQLKPIHHHLHHRGLRHCHSQEPYRSSFRVQIALSASLHLNHSFISLLPDFHLLNGKRGISLG--------LFNGRRRR--CGGGGRISHRSIAASGI
        MSLQLKPIHH            Q+PY+ SFRV          NHSFISLLP+ HLLNGKRG+S+         L N RRRR   GGGG +S+RSI ASGI
Subjt:  MSLQLKPIHHHLHHRGLRHCHSQEPYRSSFRVQIALSASLHLNHSFISLLPDFHLLNGKRGISLG--------LFNGRRRR--CGGGGRISHRSIAASGI

Query:  AGTPVSDGSKPDKGFVSPPLSDILWPSAGAFVAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSSPAARKYNMFMAQIGCAAIGVLAFTLLGPGWL
        AG P+SDGSKPDKGFVSPPLSDILWPSAGAF AMAMLGKMDQILAPKGLSMTIAPLGAVCA+LFA PSSPAAR YNMFMAQIGCAAIGVLAFTLLGPGWL
Subjt:  AGTPVSDGSKPDKGFVSPPLSDILWPSAGAFVAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSSPAARKYNMFMAQIGCAAIGVLAFTLLGPGWL

Query:  ARSSALAASMAFMIYTGSTHPPAASLPILFIDGAKMQQLNFWYALFPGAAGCVLLCLVQELVVYLKEKFKF
        ARSSALAASMAFMIYTGSTHPPAASLP++FIDGAKMQ LNFWYALFPGAAGCVLLC +QE+VVYLKEKFKF
Subjt:  ARSSALAASMAFMIYTGSTHPPAASLPILFIDGAKMQQLNFWYALFPGAAGCVLLCLVQELVVYLKEKFKF

XP_038894638.1 uncharacterized protein LOC120083135 [Benincasa hispida] (e value: 2.0e-127; % identity: 89.73)
Query:  MSLQLKPIHHHLHHRGLRHCHSQEPYRSSFRVQI-ALSASLHLNHSFISLLPDFHLLNGKRGISLGLFNGRR-RRCGGGGRISHRSIAASGIAGTPVSDG
        MSLQLKPIHHHLHH G RHCHSQEPY+ S+RVQI A SASLHLNHSF+SLLP+ HLLNG RG+SLGLFN RR RRCGGGGRI HR I ASGIAGTP+SDG
Subjt:  MSLQLKPIHHHLHHRGLRHCHSQEPYRSSFRVQI-ALSASLHLNHSFISLLPDFHLLNGKRGISLGLFNGRR-RRCGGGGRISHRSIAASGIAGTPVSDG

Query:  SKPDKGFVSPPLSDILWPSAGAFVAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSSPAARKYNMFMAQIGCAAIGVLAFTLLGPGWLARSSALAA
        SK +KGFVSPPLSDILWPSAGAF AMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSSPAARKYN+F+AQIGCAAIGVLAFTLLGPGWLARSSALAA
Subjt:  SKPDKGFVSPPLSDILWPSAGAFVAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSSPAARKYNMFMAQIGCAAIGVLAFTLLGPGWLARSSALAA

Query:  SMAFMIYTGSTHPPAASLPILFIDGAKMQQLNFWYALFPGAAGCVLLCLVQELVVYLKEKFKF
        SMAFMIYTGSTHPPAASLPILFIDGAKMQQLNFWYALFPGAAGC+LLCL+QELV++LKEKFKF
Subjt:  SMAFMIYTGSTHPPAASLPILFIDGAKMQQLNFWYALFPGAAGCVLLCLVQELVVYLKEKFKF

TrEMBL top hits
A0A0A0LR37 Uncharacterized protein (e value: 1.5e-115; % identity: 83.77)
Query:  MSLQLKPIHHHLHHRGLRHCHSQEPYRSSFRVQIALSASLHLNHSFISLLPDFHLLNGKRGI---SLGLFNG-RRRRCGGGGRISHRSIAASGIAGTPVS
        MSLQLKPIHHHLHH G R CH+ EPY+ S+   I + +   LNHSF+SLLP  HLLNGKRGI   SLGLFN  RRRR  G  RI HRSI AS IAGTPVS
Subjt:  MSLQLKPIHHHLHHRGLRHCHSQEPYRSSFRVQIALSASLHLNHSFISLLPDFHLLNGKRGI---SLGLFNG-RRRRCGGGGRISHRSIAASGIAGTPVS

Query:  DGSKPDKGFVSPPLSDILWPSAGAFVAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSSPAARKYNMFMAQIGCAAIGVLAFTLLGPGWLARSSAL
        DGSKP+KGFVSPPLSDILWPSAGAF AMA+LGKMDQILAPKGLSMTIAPLGAVCAVLFATPS+PAARKYN+F+AQIGCAAIGVLAFTLLGPGWLARSSAL
Subjt:  DGSKPDKGFVSPPLSDILWPSAGAFVAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSSPAARKYNMFMAQIGCAAIGVLAFTLLGPGWLARSSAL

Query:  AASMAFMIYTGSTHPPAASLPILFIDGAKMQQLNFWYALFPGAAGCVLLCLVQELVVYLKEKFKF
        AASMAFMIYTGSTHPPAASLPILFIDGAKMQQLNFWYALFPGAAGC+LLCL+QELVV LKEK KF
Subjt:  AASMAFMIYTGSTHPPAASLPILFIDGAKMQQLNFWYALFPGAAGCVLLCLVQELVVYLKEKFKF

A0A1S3AUM8 uncharacterized protein LOC103482985 (e value: 1.6e-114; % identity: 83.77)
Query:  MSLQLKPIHHHLHHRGLRHCHSQEPYRSSFRVQIALSASLHLNHSFISLLPDFHLLNGKRGI---SLGLFNG-RRRRCGGGGRISHRSIAASGIAGTPVS
        MSLQLKPIHHHLHH G RHCH  +PY+ S+R +I   ++  LNHS +SLLP  HLLNGKRGI   SLGLFN  RRRR  G   I HRSI AS IAGTPVS
Subjt:  MSLQLKPIHHHLHHRGLRHCHSQEPYRSSFRVQIALSASLHLNHSFISLLPDFHLLNGKRGI---SLGLFNG-RRRRCGGGGRISHRSIAASGIAGTPVS

Query:  DGSKPDKGFVSPPLSDILWPSAGAFVAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSSPAARKYNMFMAQIGCAAIGVLAFTLLGPGWLARSSAL
        DGSKP+KGFVSPPLSDILWPSAGAF AMA+LGKMDQILAPKGLSMTIAPLGAVCAVLFATPS+PAARKYN+FMAQIGCAAIGVLAFTLLGPGWLARSSAL
Subjt:  DGSKPDKGFVSPPLSDILWPSAGAFVAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSSPAARKYNMFMAQIGCAAIGVLAFTLLGPGWLARSSAL

Query:  AASMAFMIYTGSTHPPAASLPILFIDGAKMQQLNFWYALFPGAAGCVLLCLVQELVVYLKEKFKF
        AASMAFMIYTGSTHPPAASLPILFIDGAKMQQLNFWYALFPGAAGC+LLCL+QELVV LKEK KF
Subjt:  AASMAFMIYTGSTHPPAASLPILFIDGAKMQQLNFWYALFPGAAGCVLLCLVQELVVYLKEKFKF

A0A6J1E7R0 uncharacterized protein LOC111431576 isoform X1 (e value: 1.8e-105; % identity: 77.49)
Query:  MSLQLKPIHHHLHHRGLRHCHSQEPYRSSFRVQIALSASLHLNHSFISLLPDFHLLNGKRGISLG--------LFNGRRRR--CGGGGRISHRSIAASGI
        MSLQLKPIHH            Q+PY+ SFRV          NHSFISLLP+ HLLNGKRG+S+         L N RRRR   GGGG   +RSI ASGI
Subjt:  MSLQLKPIHHHLHHRGLRHCHSQEPYRSSFRVQIALSASLHLNHSFISLLPDFHLLNGKRGISLG--------LFNGRRRR--CGGGGRISHRSIAASGI

Query:  AGTPVSDGSKPDKGFVSPPLSDILWPSAGAFVAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSSPAARKYNMFMAQIGCAAIGVLAFTLLGPGWL
        A  P+SDGSKPDKGFVSPPLSDILWPSAGAF AMAMLGKMDQILAPKGLSMTIAPLGAVCA+LFA PSSPAARKYNMFMAQIGCAAIGVLAFTLLGPGWL
Subjt:  AGTPVSDGSKPDKGFVSPPLSDILWPSAGAFVAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSSPAARKYNMFMAQIGCAAIGVLAFTLLGPGWL

Query:  ARSSALAASMAFMIYTGSTHPPAASLPILFIDGAKMQQLNFWYALFPGAAGCVLLCLVQELVVYLKEKFKF
        ARSSALAASMAFMIYTGSTHPPAASLP++FIDGAKMQ LNFWYALFPGAAGC+LLC +QE+VVYLKEKFKF
Subjt:  ARSSALAASMAFMIYTGSTHPPAASLPILFIDGAKMQQLNFWYALFPGAAGCVLLCLVQELVVYLKEKFKF

A0A6J1EB70 uncharacterized protein LOC111431576 isoform X2 (e value: 1.7e-103; % identity: 77.12)
Query:  MSLQLKPIHHHLHHRGLRHCHSQEPYRSSFRVQIALSASLHLNHSFISLLPDFHLLNGKRGISLG--------LFNGRRRR--CGGGGRISHRSIAASGI
        MSLQLKPIHH            Q+PY+ SFRV          NHSFISLLP+ HLLNGKRG+S+         L N RRRR   GGGG   +RSI ASGI
Subjt:  MSLQLKPIHHHLHHRGLRHCHSQEPYRSSFRVQIALSASLHLNHSFISLLPDFHLLNGKRGISLG--------LFNGRRRR--CGGGGRISHRSIAASGI

Query:  AGTPVSDGSKPDKGFVSPPLSDILWPSAGAFVAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSSPAARKYNMFMAQIGCAAIGVLAFTLLGPGWL
        A  P+SDGSKPDKGFVSPPLSDILWPSAGAF AMAMLGKMDQILAPKGLSMTIAPLGAVCA+LFA PSSPAAR YNMFMAQIGCAAIGVLAFTLLGPGWL
Subjt:  AGTPVSDGSKPDKGFVSPPLSDILWPSAGAFVAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSSPAARKYNMFMAQIGCAAIGVLAFTLLGPGWL

Query:  ARSSALAASMAFMIYTGSTHPPAASLPILFIDGAKMQQLNFWYALFPGAAGCVLLCLVQELVVYLKEKFKF
        ARSSALAASMAFMIYTGSTHPPAASLP++FIDGAKMQ LNFWYALFPGAAGC+LLC +QE+VVYLKEKFKF
Subjt:  ARSSALAASMAFMIYTGSTHPPAASLPILFIDGAKMQQLNFWYALFPGAAGCVLLCLVQELVVYLKEKFKF

A0A6J1KLD7 uncharacterized protein LOC111495629 (e value: 8.9e-105; % identity: 76.84)
Query:  MSLQLKPIHHHLHHRGLRHCHSQEPYRSSFRVQIALSASLHLNHSFISLLPDFHLLNGKRGISLG--------LFNGRRRRCGGGG---RISHRSIAASG
        M+LQLKPIHH            Q+PY+ SFRV          NHSFISLLP+ HLLNG RG S+         L N RRRR  GGG    I +RSI ASG
Subjt:  MSLQLKPIHHHLHHRGLRHCHSQEPYRSSFRVQIALSASLHLNHSFISLLPDFHLLNGKRGISLG--------LFNGRRRRCGGGG---RISHRSIAASG

Query:  IAGTPVSDGSKPDKGFVSPPLSDILWPSAGAFVAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSSPAARKYNMFMAQIGCAAIGVLAFTLLGPGW
        IAG P+SDGSKPDKGFVSPPLSDILWPSAGAF AMAMLGKMDQ+LAPKGLSMTIAPLGAVCA+LFA PSSPAARKYNMFMAQIGCAAIGVLAFTLLGPGW
Subjt:  IAGTPVSDGSKPDKGFVSPPLSDILWPSAGAFVAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSSPAARKYNMFMAQIGCAAIGVLAFTLLGPGW

Query:  LARSSALAASMAFMIYTGSTHPPAASLPILFIDGAKMQQLNFWYALFPGAAGCVLLCLVQELVVYLKEKFKF
        LARSSALAASMAFMIYTGSTHPPAASLP++FIDGAKMQ LNFWYALFPGAAGCVLLC +QE+VVYLKEKFKF
Subjt:  LARSSALAASMAFMIYTGSTHPPAASLPILFIDGAKMQQLNFWYALFPGAAGCVLLCLVQELVVYLKEKFKF

SwissProt top hits
No hits found

Arabidopsis top hits
AT3G47980.1 Integral membrane HPP family protein (e value: 2.0e-64; % identity: 70.99)
Query:  KPDKGFVSPPLSDILWPSAGAFVAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSSPAARKYNMFMAQIGCAAIGVLAFTLLGPGWLARSSALAAS
        KP+K  V+P LSD++WP+AGAF AMA++G++DQ+L PKG+SM++APLGAV A+LF TPS+PAARKYNMF AQIGCAAIGVLAF+  GP WLARS+ALAAS
Subjt:  KPDKGFVSPPLSDILWPSAGAFVAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSSPAARKYNMFMAQIGCAAIGVLAFTLLGPGWLARSSALAAS

Query:  MAFMIYTGSTHPPAASLPILFIDGAKMQQLNFWYALFPGAAGCVLLCLVQELVVYLKEKFKF
        +AFM+ T + HPPAASLP+LFIDGAK+ +LNFWYALFPGAA C+LLC +Q +V YLKE  KF
Subjt:  MAFMIYTGSTHPPAASLPILFIDGAKMQQLNFWYALFPGAAGCVLLCLVQELVVYLKEKFKF

AT5G62720.1 Integral membrane HPP family protein (e value: 1.2e-64; % identity: 66.48)
Query:  IAASGIAGTPVSDGSKPDKGFVSPP--LSDILWPSAGAFVAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSSPAARKYNMFMAQIGCAAIGVLAF
        +A++G    P  D  KPDK   +    LSD++WP+AGAF AMA+LG+MDQ+L+PKG+SM++APLGAV A+LF TPS+PAARKYN+F+AQIGCAAIGV+AF
Subjt:  IAASGIAGTPVSDGSKPDKGFVSPP--LSDILWPSAGAFVAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSSPAARKYNMFMAQIGCAAIGVLAF

Query:  TLLGPGWLARSSALAASMAFMIYTGSTHPPAASLPILFIDGAKMQQLNFWYALFPGAAGCVLLCLVQELVVYLKEKFKF
        ++ GPGWLARS ALAAS+AFM+ T + HPPAASLP++FIDGAK   LNFWYALFPGAA CV+LCL+Q +V YLKE  KF
Subjt:  TLLGPGWLARSSALAASMAFMIYTGSTHPPAASLPILFIDGAKMQQLNFWYALFPGAAGCVLLCLVQELVVYLKEKFKF

AT5G62720.2 Integral membrane HPP family protein (e value: 2.4e-41; % identity: 62.69)
Query:  IAASGIAGTPVSDGSKPDKGFVSPP--LSDILWPSAGAFVAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSSPAARKYNMFMAQIGCAAIGVLAF
        +A++G    P  D  KPDK   +    LSD++WP+AGAF AMA+LG+MDQ+L+PKG+SM++APLGAV A+LF TPS+PAARKYN+F+AQIGCAAIGV+AF
Subjt:  IAASGIAGTPVSDGSKPDKGFVSPP--LSDILWPSAGAFVAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSSPAARKYNMFMAQIGCAAIGVLAF

Query:  TLLGPGWLARSSALAASMAFMIYTGSTHPPAASL
        ++ GPGWLARS ALAAS+AFM+ T + HPP   L
Subjt:  TLLGPGWLARSSALAASMAFMIYTGSTHPPAASL


Sequences
CDS sequence
ATGAGCCTGCAATTGAAGCCAATTCATCACCACCTCCACCACCGTGGCCTCCGCCATTGCCACAGTCAGGAGCCGTATCGATCCAGTTTCAGAGTCCAAATTGCTCTATC
GGCATCTCTGCACCTGAACCATTCATTCATTTCTCTGCTTCCGGATTTCCATTTATTGAACGGAAAACGAGGGATTTCGTTGGGATTATTCAACGGTAGGAGGAGAAGAT
GTGGGGGCGGCGGCAGAATCAGTCACCGGAGTATTGCGGCGTCCGGCATTGCTGGAACACCGGTTTCAGATGGGTCAAAACCGGACAAAGGCTTTGTTTCTCCTCCCCTC
AGTGACATCCTTTGGCCTTCTGCAGGGGCATTTGTAGCAATGGCAATGCTGGGGAAAATGGATCAGATTCTAGCGCCAAAGGGACTTTCAATGACAATTGCGCCATTAGG
AGCCGTTTGTGCTGTCCTGTTCGCAACACCTTCATCCCCTGCTGCTCGAAAATACAACATGTTCATGGCCCAGATTGGGTGTGCGGCAATTGGAGTTTTGGCGTTTACTT
TGTTGGGGCCTGGATGGCTGGCTAGAAGCTCTGCTCTGGCTGCATCCATGGCGTTTATGATCTATACTGGTTCGACGCACCCACCAGCTGCAAGTTTGCCGATATTGTTC
ATCGATGGCGCTAAGATGCAACAGCTTAATTTCTGGTATGCTTTGTTTCCCGGTGCCGCTGGATGTGTTCTCCTTTGCTTAGTACAAGAGTTAGTGGTGTACTTAAAGGA
GAAGTTCAAATTTTAG
mRNA sequence
ATGAGCCTGCAATTGAAGCCAATTCATCACCACCTCCACCACCGTGGCCTCCGCCATTGCCACAGTCAGGAGCCGTATCGATCCAGTTTCAGAGTCCAAATTGCTCTATC
GGCATCTCTGCACCTGAACCATTCATTCATTTCTCTGCTTCCGGATTTCCATTTATTGAACGGAAAACGAGGGATTTCGTTGGGATTATTCAACGGTAGGAGGAGAAGAT
GTGGGGGCGGCGGCAGAATCAGTCACCGGAGTATTGCGGCGTCCGGCATTGCTGGAACACCGGTTTCAGATGGGTCAAAACCGGACAAAGGCTTTGTTTCTCCTCCCCTC
AGTGACATCCTTTGGCCTTCTGCAGGGGCATTTGTAGCAATGGCAATGCTGGGGAAAATGGATCAGATTCTAGCGCCAAAGGGACTTTCAATGACAATTGCGCCATTAGG
AGCCGTTTGTGCTGTCCTGTTCGCAACACCTTCATCCCCTGCTGCTCGAAAATACAACATGTTCATGGCCCAGATTGGGTGTGCGGCAATTGGAGTTTTGGCGTTTACTT
TGTTGGGGCCTGGATGGCTGGCTAGAAGCTCTGCTCTGGCTGCATCCATGGCGTTTATGATCTATACTGGTTCGACGCACCCACCAGCTGCAAGTTTGCCGATATTGTTC
ATCGATGGCGCTAAGATGCAACAGCTTAATTTCTGGTATGCTTTGTTTCCCGGTGCCGCTGGATGTGTTCTCCTTTGCTTAGTACAAGAGTTAGTGGTGTACTTAAAGGA
GAAGTTCAAATTTTAG
Protein sequence
MSLQLKPIHHHLHHRGLRHCHSQEPYRSSFRVQIALSASLHLNHSFISLLPDFHLLNGKRGISLGLFNGRRRRCGGGGRISHRSIAASGIAGTPVSDGSKPDKGFVSPPL
SDILWPSAGAFVAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSSPAARKYNMFMAQIGCAAIGVLAFTLLGPGWLARSSALAASMAFMIYTGSTHPPAASLPILF
IDGAKMQQLNFWYALFPGAAGCVLLCLVQELVVYLKEKFKF
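
The protein sequence should correspond to a translation of the CDS. A minimal Python sketch for spot-checking this is shown below; it assumes the standard genetic code (the record does not name a translation table), and `translate` is a hypothetical helper rather than part of any database tooling.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 18 bases of the CDS listed above.
print(translate("ATGAGCCTGCAATTGAAGCCAATTCATCAC"))  # prints: MSLQLKPIHH
print(translate("GAGAAGTTCAAATTTTAG"))              # prints: EKFKF
```

Both fragments agree with the start (`MSLQLKPIHH`) and end (`EKFKF`) of the protein sequence; passing the full CDS to `translate` would extend the check to every residue.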