; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CcUC01G014080 (gene) of Watermelon (PI 537277) v1 genome

Gene IDCcUC01G014080
OrganismCitrullus colocynthis (Watermelon (PI 537277) v1)
DescriptionIntegral membrane HPP family protein
Genome locationCicolChr01:26855537..26858118
RNA-Seq ExpressionCcUC01G014080
SyntenyCcUC01G014080
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR007065 - HPP


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008437643.1 PREDICTED: uncharacterized protein LOC103482985 [Cucumis melo]7.3e-10683.33Show/hide
Query:  MSLQLKPIHHHLHHRGLRHCHSQEPYRSSYRVQIALSASLHLNHSFISLLPNFHLLNGKRGI---SLGLFNG--RRICGGGGRIGHRSIAASGIAGRPVS
        MSLQLKPIHHHLHH G RHCH  +PY+ SYR +I   ++  LNHS +SLLP  HLLNGKRGI   SLGLFN   RR   G   IGHRSI AS IAG PVS
Subjt:  MSLQLKPIHHHLHHRGLRHCHSQEPYRSSYRVQIALSASLHLNHSFISLLPNFHLLNGKRGI---SLGLFNG--RRICGGGGRIGHRSIAASGIAGRPVS

Query:  DGSKPDKGSVSPPLSDILWPSAGAFVAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSSPAARKYNMFMAQIGCAAIGVLAFTLLGPGWLARSSAL
        DGSKP+KG VSPPLSDILWPSAGAF AMA+LGKMDQILAPKGLSMTIAPLGAVCAVLFATPS+PAARKYN+FMAQIGCAAIGVLAFTLLGPGWLARSSAL
Subjt:  DGSKPDKGSVSPPLSDILWPSAGAFVAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSSPAARKYNMFMAQIGCAAIGVLAFTLLGPGWLARSSAL

Query:  AASMAFMIYTGSTHPPAASLPILFIDGAKMQQLNFWYALFPGAAGCILLCLV
        AASMAFMIYTGSTHPPAASLPILFIDGAKMQQLNFWYALFPGAAGCILLCL+
Subjt:  AASMAFMIYTGSTHPPAASLPILFIDGAKMQQLNFWYALFPGAAGCILLCLV

XP_011651222.2 uncharacterized protein LOC105434855 [Cucumis sativus]3.5e-10883.73Show/hide
Query:  MSLQLKPIHHHLHHRGLRHCHSQEPYRSSYRVQIALSASLHLNHSFISLLPNFHLLNGKRGI---SLGLFNG--RRICGGGGRIGHRSIAASGIAGRPVS
        MSLQLKPIHHHLHH G RHCH+ EPY+ SY   I + +   LNHSF+SLLP+ HLLNGKRGI   SLGLFN   RR   G  RIGHRSI AS IAG PVS
Subjt:  MSLQLKPIHHHLHHRGLRHCHSQEPYRSSYRVQIALSASLHLNHSFISLLPNFHLLNGKRGI---SLGLFNG--RRICGGGGRIGHRSIAASGIAGRPVS

Query:  DGSKPDKGSVSPPLSDILWPSAGAFVAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSSPAARKYNMFMAQIGCAAIGVLAFTLLGPGWLARSSAL
        DGSKP+KG VSPPLSDILWPSAGAF AMA+LGKMDQILAPKGLSMTIAPLGAVCAVLFATPS+PAARKYN+F+AQIGCAAIGVLAFTLLGPGWLARSSAL
Subjt:  DGSKPDKGSVSPPLSDILWPSAGAFVAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSSPAARKYNMFMAQIGCAAIGVLAFTLLGPGWLARSSAL

Query:  AASMAFMIYTGSTHPPAASLPILFIDGAKMQQLNFWYALFPGAAGCILLCLV
        AASMAFMIYTGSTHPPAASLPILFIDGAKMQQLNFWYALFPGAAGCILLCL+
Subjt:  AASMAFMIYTGSTHPPAASLPILFIDGAKMQQLNFWYALFPGAAGCILLCLV

XP_022924027.1 uncharacterized protein LOC111431576 isoform X1 [Cucurbita moschata]5.2e-9676.36Show/hide
Query:  MSLQLKPIHHHLHHRGLRHCHSQEPYRSSYRVQIALSASLHLNHSFISLLPNFHLLNGKRGIS-------LGLF----NGRRICGGGGRIGHRSIAASGI
        MSLQLKPIHH            Q+PY+ S+RV          NHSFISLLPN HLLNGKRG+S       LGL       RR  GGGG  G+RSI ASGI
Subjt:  MSLQLKPIHHHLHHRGLRHCHSQEPYRSSYRVQIALSASLHLNHSFISLLPNFHLLNGKRGIS-------LGLF----NGRRICGGGGRIGHRSIAASGI

Query:  AGRPVSDGSKPDKGSVSPPLSDILWPSAGAFVAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSSPAARKYNMFMAQIGCAAIGVLAFTLLGPGWL
        A  P+SDGSKPDKG VSPPLSDILWPSAGAF AMAMLGKMDQILAPKGLSMTIAPLGAVCA+LFA PSSPAARKYNMFMAQIGCAAIGVLAFTLLGPGWL
Subjt:  AGRPVSDGSKPDKGSVSPPLSDILWPSAGAFVAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSSPAARKYNMFMAQIGCAAIGVLAFTLLGPGWL

Query:  ARSSALAASMAFMIYTGSTHPPAASLPILFIDGAKMQQLNFWYALFPGAAGCILLCLV
        ARSSALAASMAFMIYTGSTHPPAASLP++FIDGAKMQ LNFWYALFPGAAGC+LLC +
Subjt:  ARSSALAASMAFMIYTGSTHPPAASLPILFIDGAKMQQLNFWYALFPGAAGCILLCLV

XP_023519271.1 uncharacterized protein LOC111782708 isoform X1 [Cucurbita pepo subsp. pepo]1.4e-9676.36Show/hide
Query:  MSLQLKPIHHHLHHRGLRHCHSQEPYRSSYRVQIALSASLHLNHSFISLLPNFHLLNGKRGIS-------LGLF----NGRRICGGGGRIGHRSIAASGI
        MSLQLKPIHH            Q+PY+ S+RV          NHSFISLLPN HLLNGKRG+S       LGL       RR  GGGG + +RSI ASGI
Subjt:  MSLQLKPIHHHLHHRGLRHCHSQEPYRSSYRVQIALSASLHLNHSFISLLPNFHLLNGKRGIS-------LGLF----NGRRICGGGGRIGHRSIAASGI

Query:  AGRPVSDGSKPDKGSVSPPLSDILWPSAGAFVAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSSPAARKYNMFMAQIGCAAIGVLAFTLLGPGWL
        AG P+SDGSKPDKG VSPPLSDILWPSAGAF AMAMLGKMDQILAPKGLSMTIAPLGAVCA+LFA PSSPAARKYNMFMAQIGCAAIGVLAFTLLGPGWL
Subjt:  AGRPVSDGSKPDKGSVSPPLSDILWPSAGAFVAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSSPAARKYNMFMAQIGCAAIGVLAFTLLGPGWL

Query:  ARSSALAASMAFMIYTGSTHPPAASLPILFIDGAKMQQLNFWYALFPGAAGCILLCLV
        ARSSALAASMAFMIYTGSTHPPAASLP++FIDGAKMQ LNFWYALFPGAAGC+LLC +
Subjt:  ARSSALAASMAFMIYTGSTHPPAASLPILFIDGAKMQQLNFWYALFPGAAGCILLCLV

XP_038894638.1 uncharacterized protein LOC120083135 [Benincasa hispida]7.5e-11990Show/hide
Query:  MSLQLKPIHHHLHHRGLRHCHSQEPYRSSYRVQI-ALSASLHLNHSFISLLPNFHLLNGKRGISLGLFNGR--RICGGGGRIGHRSIAASGIAGRPVSDG
        MSLQLKPIHHHLHH G RHCHSQEPY+ SYRVQI A SASLHLNHSF+SLLPN HLLNG RG+SLGLFN R  R CGGGGRIGHR I ASGIAG P+SDG
Subjt:  MSLQLKPIHHHLHHRGLRHCHSQEPYRSSYRVQI-ALSASLHLNHSFISLLPNFHLLNGKRGISLGLFNGR--RICGGGGRIGHRSIAASGIAGRPVSDG

Query:  SKPDKGSVSPPLSDILWPSAGAFVAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSSPAARKYNMFMAQIGCAAIGVLAFTLLGPGWLARSSALAA
        SK +KG VSPPLSDILWPSAGAF AMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSSPAARKYN+F+AQIGCAAIGVLAFTLLGPGWLARSSALAA
Subjt:  SKPDKGSVSPPLSDILWPSAGAFVAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSSPAARKYNMFMAQIGCAAIGVLAFTLLGPGWLARSSALAA

Query:  SMAFMIYTGSTHPPAASLPILFIDGAKMQQLNFWYALFPGAAGCILLCLV
        SMAFMIYTGSTHPPAASLPILFIDGAKMQQLNFWYALFPGAAGCILLCL+
Subjt:  SMAFMIYTGSTHPPAASLPILFIDGAKMQQLNFWYALFPGAAGCILLCLV

TrEMBL top hitse value%identityAlignment
A0A0A0LR37 Uncharacterized protein2.4e-10783.33Show/hide
Query:  MSLQLKPIHHHLHHRGLRHCHSQEPYRSSYRVQIALSASLHLNHSFISLLPNFHLLNGKRGI---SLGLFNG--RRICGGGGRIGHRSIAASGIAGRPVS
        MSLQLKPIHHHLHH G R CH+ EPY+ SY   I + +   LNHSF+SLLP+ HLLNGKRGI   SLGLFN   RR   G  RIGHRSI AS IAG PVS
Subjt:  MSLQLKPIHHHLHHRGLRHCHSQEPYRSSYRVQIALSASLHLNHSFISLLPNFHLLNGKRGI---SLGLFNG--RRICGGGGRIGHRSIAASGIAGRPVS

Query:  DGSKPDKGSVSPPLSDILWPSAGAFVAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSSPAARKYNMFMAQIGCAAIGVLAFTLLGPGWLARSSAL
        DGSKP+KG VSPPLSDILWPSAGAF AMA+LGKMDQILAPKGLSMTIAPLGAVCAVLFATPS+PAARKYN+F+AQIGCAAIGVLAFTLLGPGWLARSSAL
Subjt:  DGSKPDKGSVSPPLSDILWPSAGAFVAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSSPAARKYNMFMAQIGCAAIGVLAFTLLGPGWLARSSAL

Query:  AASMAFMIYTGSTHPPAASLPILFIDGAKMQQLNFWYALFPGAAGCILLCLV
        AASMAFMIYTGSTHPPAASLPILFIDGAKMQQLNFWYALFPGAAGCILLCL+
Subjt:  AASMAFMIYTGSTHPPAASLPILFIDGAKMQQLNFWYALFPGAAGCILLCLV

A0A1S3AUM8 uncharacterized protein LOC1034829853.5e-10683.33Show/hide
Query:  MSLQLKPIHHHLHHRGLRHCHSQEPYRSSYRVQIALSASLHLNHSFISLLPNFHLLNGKRGI---SLGLFNG--RRICGGGGRIGHRSIAASGIAGRPVS
        MSLQLKPIHHHLHH G RHCH  +PY+ SYR +I   ++  LNHS +SLLP  HLLNGKRGI   SLGLFN   RR   G   IGHRSI AS IAG PVS
Subjt:  MSLQLKPIHHHLHHRGLRHCHSQEPYRSSYRVQIALSASLHLNHSFISLLPNFHLLNGKRGI---SLGLFNG--RRICGGGGRIGHRSIAASGIAGRPVS

Query:  DGSKPDKGSVSPPLSDILWPSAGAFVAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSSPAARKYNMFMAQIGCAAIGVLAFTLLGPGWLARSSAL
        DGSKP+KG VSPPLSDILWPSAGAF AMA+LGKMDQILAPKGLSMTIAPLGAVCAVLFATPS+PAARKYN+FMAQIGCAAIGVLAFTLLGPGWLARSSAL
Subjt:  DGSKPDKGSVSPPLSDILWPSAGAFVAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSSPAARKYNMFMAQIGCAAIGVLAFTLLGPGWLARSSAL

Query:  AASMAFMIYTGSTHPPAASLPILFIDGAKMQQLNFWYALFPGAAGCILLCLV
        AASMAFMIYTGSTHPPAASLPILFIDGAKMQQLNFWYALFPGAAGCILLCL+
Subjt:  AASMAFMIYTGSTHPPAASLPILFIDGAKMQQLNFWYALFPGAAGCILLCLV

A0A6J1C6M6 uncharacterized protein LOC1110088886.2e-9578.8Show/hide
Query:  MSLQLKPIHHHLHHRGLRHCHSQEPY---RSSYRVQIALSASLHLNHSFISLLPNFHLLNGKRGISLGLFNGRRICGGGGRIGHRSIAASGIAGRPVSDG
        MSLQLKPIHHHL HRG RH H Q+ Y    S+ R+Q A SAS   N SF+SLLPNFHL N  RG  + LF  RR      R GHR IAASGI G  VSDG
Subjt:  MSLQLKPIHHHLHHRGLRHCHSQEPY---RSSYRVQIALSASLHLNHSFISLLPNFHLLNGKRGISLGLFNGRRICGGGGRIGHRSIAASGIAGRPVSDG

Query:  SKPDKGSVSPPLSDILWPSAGAFVAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSSPAARKYNMFMAQIGCAAIGVLAFTLLGPGWLARSSALAA
        +KP+KGS SP LSDILWPSAGAF AMAMLGKMDQILA KGLSMTIAPLGAVCAVLFATPS+PAARKYN+FMAQIGCAAIGV AFTLLGPGWLARSSALAA
Subjt:  SKPDKGSVSPPLSDILWPSAGAFVAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSSPAARKYNMFMAQIGCAAIGVLAFTLLGPGWLARSSALAA

Query:  SMAFMIYTGSTHPPAASLPILFIDGAKMQQLNFWYALFPGAAGCILLCLV
        SMAFMI TGSTHPPAASLPILFIDGAK+Q LNFWYALFPGAAGCILLCL+
Subjt:  SMAFMIYTGSTHPPAASLPILFIDGAKMQQLNFWYALFPGAAGCILLCLV

A0A6J1E7R0 uncharacterized protein LOC111431576 isoform X12.5e-9676.36Show/hide
Query:  MSLQLKPIHHHLHHRGLRHCHSQEPYRSSYRVQIALSASLHLNHSFISLLPNFHLLNGKRGIS-------LGLF----NGRRICGGGGRIGHRSIAASGI
        MSLQLKPIHH            Q+PY+ S+RV          NHSFISLLPN HLLNGKRG+S       LGL       RR  GGGG  G+RSI ASGI
Subjt:  MSLQLKPIHHHLHHRGLRHCHSQEPYRSSYRVQIALSASLHLNHSFISLLPNFHLLNGKRGIS-------LGLF----NGRRICGGGGRIGHRSIAASGI

Query:  AGRPVSDGSKPDKGSVSPPLSDILWPSAGAFVAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSSPAARKYNMFMAQIGCAAIGVLAFTLLGPGWL
        A  P+SDGSKPDKG VSPPLSDILWPSAGAF AMAMLGKMDQILAPKGLSMTIAPLGAVCA+LFA PSSPAARKYNMFMAQIGCAAIGVLAFTLLGPGWL
Subjt:  AGRPVSDGSKPDKGSVSPPLSDILWPSAGAFVAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSSPAARKYNMFMAQIGCAAIGVLAFTLLGPGWL

Query:  ARSSALAASMAFMIYTGSTHPPAASLPILFIDGAKMQQLNFWYALFPGAAGCILLCLV
        ARSSALAASMAFMIYTGSTHPPAASLP++FIDGAKMQ LNFWYALFPGAAGC+LLC +
Subjt:  ARSSALAASMAFMIYTGSTHPPAASLPILFIDGAKMQQLNFWYALFPGAAGCILLCLV

A0A6J1KLD7 uncharacterized protein LOC1114956294.8e-9574.9Show/hide
Query:  MSLQLKPIHHHLHHRGLRHCHSQEPYRSSYRVQIALSASLHLNHSFISLLPNFHLLNGKRGISLG--------LFNGRRI----CGGGGRIGHRSIAASG
        M+LQLKPIHH            Q+PY+ S+RV          NHSFISLLPN HLLNG RG S+         L N RR      GGG  IG+RSI ASG
Subjt:  MSLQLKPIHHHLHHRGLRHCHSQEPYRSSYRVQIALSASLHLNHSFISLLPNFHLLNGKRGISLG--------LFNGRRI----CGGGGRIGHRSIAASG

Query:  IAGRPVSDGSKPDKGSVSPPLSDILWPSAGAFVAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSSPAARKYNMFMAQIGCAAIGVLAFTLLGPGW
        IAG P+SDGSKPDKG VSPPLSDILWPSAGAF AMAMLGKMDQ+LAPKGLSMTIAPLGAVCA+LFA PSSPAARKYNMFMAQIGCAAIGVLAFTLLGPGW
Subjt:  IAGRPVSDGSKPDKGSVSPPLSDILWPSAGAFVAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSSPAARKYNMFMAQIGCAAIGVLAFTLLGPGW

Query:  LARSSALAASMAFMIYTGSTHPPAASLPILFIDGAKMQQLNFWYALFPGAAGCILLCLV
        LARSSALAASMAFMIYTGSTHPPAASLP++FIDGAKMQ LNFWYALFPGAAGC+LLC +
Subjt:  LARSSALAASMAFMIYTGSTHPPAASLPILFIDGAKMQQLNFWYALFPGAAGCILLCLV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G47980.1 Integral membrane HPP family protein2.2e-6070.32Show/hide
Query:  KPDKGSVSPPLSDILWPSAGAFVAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSSPAARKYNMFMAQIGCAAIGVLAFTLLGPGWLARSSALAAS
        KP+K +V+P LSD++WP+AGAF AMA++G++DQ+L PKG+SM++APLGAV A+LF TPS+PAARKYNMF AQIGCAAIGVLAF+  GP WLARS+ALAAS
Subjt:  KPDKGSVSPPLSDILWPSAGAFVAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSSPAARKYNMFMAQIGCAAIGVLAFTLLGPGWLARSSALAAS

Query:  MAFMIYTGSTHPPAASLPILFIDGAKMQQLNFWYALFPGAAGCILLCLVAWSLGW
        +AFM+ T + HPPAASLP+LFIDGAK+ +LNFWYALFPGAA CILLC +   +G+
Subjt:  MAFMIYTGSTHPPAASLPILFIDGAKMQQLNFWYALFPGAAGCILLCLVAWSLGW

AT5G62720.1 Integral membrane HPP family protein1.1e-5966.27Show/hide
Query:  IAASGIAGRPVSDGSKPDKGSVSPP--LSDILWPSAGAFVAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSSPAARKYNMFMAQIGCAAIGVLAF
        +A++G    P  D  KPDK + +    LSD++WP+AGAF AMA+LG+MDQ+L+PKG+SM++APLGAV A+LF TPS+PAARKYN+F+AQIGCAAIGV+AF
Subjt:  IAASGIAGRPVSDGSKPDKGSVSPP--LSDILWPSAGAFVAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSSPAARKYNMFMAQIGCAAIGVLAF

Query:  TLLGPGWLARSSALAASMAFMIYTGSTHPPAASLPILFIDGAKMQQLNFWYALFPGAAGCILLCLV
        ++ GPGWLARS ALAAS+AFM+ T + HPPAASLP++FIDGAK   LNFWYALFPGAA C++LCL+
Subjt:  TLLGPGWLARSSALAASMAFMIYTGSTHPPAASLPILFIDGAKMQQLNFWYALFPGAAGCILLCLV

AT5G62720.2 Integral membrane HPP family protein2.3e-4162.69Show/hide
Query:  IAASGIAGRPVSDGSKPDKGSVSPP--LSDILWPSAGAFVAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSSPAARKYNMFMAQIGCAAIGVLAF
        +A++G    P  D  KPDK + +    LSD++WP+AGAF AMA+LG+MDQ+L+PKG+SM++APLGAV A+LF TPS+PAARKYN+F+AQIGCAAIGV+AF
Subjt:  IAASGIAGRPVSDGSKPDKGSVSPP--LSDILWPSAGAFVAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSSPAARKYNMFMAQIGCAAIGVLAF

Query:  TLLGPGWLARSSALAASMAFMIYTGSTHPPAASL
        ++ GPGWLARS ALAAS+AFM+ T + HPP   L
Subjt:  TLLGPGWLARSSALAASMAFMIYTGSTHPPAASL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCCTGCAATTGAAGCCAATTCATCACCACCTCCACCACCGTGGTCTCCGCCATTGCCACAGTCAGGAGCCGTATCGATCCAGTTACAGAGTCCAAATTGCTCTATC
GGCATCTCTACACCTGAACCATTCATTCATTTCTCTGCTTCCGAATTTCCATTTATTGAACGGAAAACGAGGGATTTCGTTGGGATTATTCAACGGTAGGAGAATATGTG
GGGGCGGCGGGAGAATCGGTCACCGGAGTATTGCGGCGTCCGGCATTGCTGGAAGACCGGTTTCAGATGGGTCAAAACCGGACAAAGGCTCTGTTTCTCCTCCCCTCAGT
GACATCCTTTGGCCTTCTGCAGGGGCATTTGTAGCAATGGCAATGCTGGGGAAAATGGATCAGATTCTAGCGCCAAAGGGACTTTCAATGACAATTGCACCATTAGGAGC
CGTTTGTGCTGTCCTGTTCGCAACACCTTCATCCCCTGCTGCTCGAAAATACAACATGTTCATGGCCCAGATTGGGTGTGCGGCAATTGGAGTTTTGGCGTTTACTTTGT
TGGGGCCTGGATGGCTGGCTAGAAGCTCTGCTCTGGCTGCATCCATGGCGTTTATGATCTATACTGGTTCGACGCACCCACCAGCTGCAAGTTTGCCGATATTGTTCATC
GATGGAGCTAAGATGCAACAGCTAAATTTCTGGTATGCTTTGTTTCCCGGTGCCGCTGGATGTATTCTCCTTTGCTTAGTAGCTTGGAGCCTTGGATGGAACTCATAA
mRNA sequenceShow/hide mRNA sequence
GGAATGGAGTTGTCTCTTTATAGGAATGACCTTTGGACCTCTTCAAAGGCCAAGTCTGCAACTCAGCTTCAAGTTTTTATTTTTTTTCAATCCTCTGGCATTCTTCGCAG
CTTTTCCCCAAATCCCATTCCCCTACAAATGGTGTCTTTGTCAGATTCTCGTTTGAATTTACATTGCATGCGATTAATATATTGACAATATTCTATATTTTATCTTTTAT
CTTTTATATCATAAACTTCGTTTTGCGGCTTTTGCCATATTTTCTCTGTTTCTCTCGACCTTCCCCATATTTTGTCGCGAAATATCCAAAACAACGTCTGTTTTAGATAG
AACTACAGAAGCTTGGTGGTGGGTCAAGGGTAAAATCCGGAATGAGCCTGCAATTGAAGCCAATTCATCACCACCTCCACCACCGTGGTCTCCGCCATTGCCACAGTCAG
GAGCCGTATCGATCCAGTTACAGAGTCCAAATTGCTCTATCGGCATCTCTACACCTGAACCATTCATTCATTTCTCTGCTTCCGAATTTCCATTTATTGAACGGAAAACG
AGGGATTTCGTTGGGATTATTCAACGGTAGGAGAATATGTGGGGGCGGCGGGAGAATCGGTCACCGGAGTATTGCGGCGTCCGGCATTGCTGGAAGACCGGTTTCAGATG
GGTCAAAACCGGACAAAGGCTCTGTTTCTCCTCCCCTCAGTGACATCCTTTGGCCTTCTGCAGGGGCATTTGTAGCAATGGCAATGCTGGGGAAAATGGATCAGATTCTA
GCGCCAAAGGGACTTTCAATGACAATTGCACCATTAGGAGCCGTTTGTGCTGTCCTGTTCGCAACACCTTCATCCCCTGCTGCTCGAAAATACAACATGTTCATGGCCCA
GATTGGGTGTGCGGCAATTGGAGTTTTGGCGTTTACTTTGTTGGGGCCTGGATGGCTGGCTAGAAGCTCTGCTCTGGCTGCATCCATGGCGTTTATGATCTATACTGGTT
CGACGCACCCACCAGCTGCAAGTTTGCCGATATTGTTCATCGATGGAGCTAAGATGCAACAGCTAAATTTCTGGTATGCTTTGTTTCCCGGTGCCGCTGGATGTATTCTC
CTTTGCTTAGTAGCTTGGAGCCTTGGATGGAACTCATAATTCAACCAAATGCATTTATTTCATATGGCATTACTTATTAATATGTTAATAATAGAGGACTGAAAAAAAAA
AGTTGATAAAGCAATCATTTAAGGATACAATACCTAGGCCAAATATATTTTTGGTATTTGGGATTTCTAAGAACTTAGTTTGTAGAAGGGCTGTTTTTAATGTTTGAGAT
TTGAATTTAGTTTTTATTTGGTTAAGCTTGATATGTTTTATCCTTAGGACTTAGGACTTAAATTTAATTTATTACTCGTTTTGATTTTTCATTGGGTATGTATGGAGCAT
AAAAGTAAATTAATAAATCAACTCATTTTTTAAAAAGAAAATATATATCAATAAAGTTTAGATGTATAAATTTGTACATCGTCATTTGGTTTAGATTATTGACAATCGAA
CCACAAAAGAGACATTTGGAAGCTAATCTCAAATGCTATGGATTGATGAGGACTTATATTCAGGGTAATTACAACGGGTAGAACTTTTAGAACTAATAATTAAGTGTATA
GTAACATTTTTAAAGAATTGTAAATATAACAAAATTTAGAGAAATTATTTTAAAGGACAAAACTATTGAAAATATTTGCAAATAATAAGAAAATTTTATAAATTAAGGTT
GGATTTAGTATATTTTACTCTAAATCCTATTAAACAATTTTCAATAGAACTTACCAAAGGTCATAACTAGGAAATTTAAAAAATTGAGGTTGGATTGAGTAATATAAAAT
GTTAAGAATGAAATTGAGAAGCGCTTATATATATATATATATATATATATGTTATGATGTACTAGACAGAAAATTGGAATTTTAGATATTCATGTGAATTTTTTTTTATA
TATACAATTTTAAAAATTTAGGACTAAAATTGTAATTATGGC
Protein sequenceShow/hide protein sequence
MSLQLKPIHHHLHHRGLRHCHSQEPYRSSYRVQIALSASLHLNHSFISLLPNFHLLNGKRGISLGLFNGRRICGGGGRIGHRSIAASGIAGRPVSDGSKPDKGSVSPPLS
DILWPSAGAFVAMAMLGKMDQILAPKGLSMTIAPLGAVCAVLFATPSSPAARKYNMFMAQIGCAAIGVLAFTLLGPGWLARSSALAASMAFMIYTGSTHPPAASLPILFI
DGAKMQQLNFWYALFPGAAGCILLCLVAWSLGWNS