CuGenDBv2

CSPI07G04600 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI07G04600
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: GPI-anchored protein LORELEI-like isoform X2
Genome location: Chr7:3429588..3430727
RNA-Seq Expression: CSPI07G04600
Synteny: CSPI07G04600
Gene Ontology terms: GO:0016020 - membrane (cellular component)
InterPro domains: IPR039307 - GPI-anchored protein LORELEI-like


Homology
GenBank top hits | e value | % identity
XP_004137022.1 GPI-anchored protein LLG1 [Cucumis sativus] | 1.2e-91 | 100
Query:  MALNQHSYCSYFALLFLFLAFAYSHSPFLSYEALKSGQFTGRSLLQAKKSCPVDMEGQNYTILTSKCKGPKYPAALCCEALLEFCCGFVDELNDMTNNCA
        MALNQHSYCSYFALLFLFLAFAYSHSPFLSYEALKSGQFTGRSLLQAKKSCPVDMEGQNYTILTSKCKGPKYPAALCCEALLEFCCGFVDELNDMTNNCA
Subjt:  MALNQHSYCSYFALLFLFLAFAYSHSPFLSYEALKSGQFTGRSLLQAKKSCPVDMEGQNYTILTSKCKGPKYPAALCCEALLEFCCGFVDELNDMTNNCA

Query:  ETMFSYINLYGQYPPGLFANQCKEGKDGLSCDNALKAQAEKAQIKASSAASSLLTPRRLPSLALASTFILYLLL
        ETMFSYINLYGQYPPGLFANQCKEGKDGLSCDNALKAQAEKAQIKASSAASSLLTPRRLPSLALASTFILYLLL
Subjt:  ETMFSYINLYGQYPPGLFANQCKEGKDGLSCDNALKAQAEKAQIKASSAASSLLTPRRLPSLALASTFILYLLL

XP_016901693.1 PREDICTED: GPI-anchored protein LORELEI-like isoform X1 [Cucumis melo] | 5.4e-84 | 93.14
Query:  MALNQHSYCSYFALLFLFLAFAYSHSPFLSYEALKSGQFTGRSLLQAKKSCPVDMEGQNYTILTSKCKGPKYPAALCCEALLEFCCGFVDELNDMTNNCA
        MALNQHSYCSYF LLFLFLAFAYSHSPFLSYEALKSGQFTGRSLLQAKK+CPVDMEGQNYTILTSKCKGPKYPA LCCEALLEFCCGFVDELNDMTNNCA
Subjt:  MALNQHSYCSYFALLFLFLAFAYSHSPFLSYEALKSGQFTGRSLLQAKKSCPVDMEGQNYTILTSKCKGPKYPAALCCEALLEFCCGFVDELNDMTNNCA

Query:  ETMFSYINLYGQYPPGLFANQCKEGKDGLSCDNALKAQAEKAQIK-ASSAASSLLTPRRLPSLALASTFILYLLL
        ETMFSYINLYGQYPPGLFANQCKEGK GLSCD AL+AQAEKA+IK ASSA+S+LLTP RL SLALASTFILYLLL
Subjt:  ETMFSYINLYGQYPPGLFANQCKEGKDGLSCDNALKAQAEKAQIK-ASSAASSLLTPRRLPSLALASTFILYLLL

XP_016901694.1 PREDICTED: GPI-anchored protein LORELEI-like isoform X2 [Cucumis melo] | 5.4e-84 | 93.14
Query:  MALNQHSYCSYFALLFLFLAFAYSHSPFLSYEALKSGQFTGRSLLQAKKSCPVDMEGQNYTILTSKCKGPKYPAALCCEALLEFCCGFVDELNDMTNNCA
        MALNQHSYCSYF LLFLFLAFAYSHSPFLSYEALKSGQFTGRSLLQAKK+CPVDMEGQNYTILTSKCKGPKYPA LCCEALLEFCCGFVDELNDMTNNCA
Subjt:  MALNQHSYCSYFALLFLFLAFAYSHSPFLSYEALKSGQFTGRSLLQAKKSCPVDMEGQNYTILTSKCKGPKYPAALCCEALLEFCCGFVDELNDMTNNCA

Query:  ETMFSYINLYGQYPPGLFANQCKEGKDGLSCDNALKAQAEKAQIK-ASSAASSLLTPRRLPSLALASTFILYLLL
        ETMFSYINLYGQYPPGLFANQCKEGK GLSCD AL+AQAEKA+IK ASSA+S+LLTP RL SLALASTFILYLLL
Subjt:  ETMFSYINLYGQYPPGLFANQCKEGKDGLSCDNALKAQAEKAQIK-ASSAASSLLTPRRLPSLALASTFILYLLL

XP_016901695.1 PREDICTED: GPI-anchored protein LORELEI-like isoform X3 [Cucumis melo] | 4.6e-67 | 92.36
Query:  EALKSGQFTGRSLLQAKKSCPVDMEGQNYTILTSKCKGPKYPAALCCEALLEFCCGFVDELNDMTNNCAETMFSYINLYGQYPPGLFANQCKEGKDGLSC
        EALKSGQFTGRSLLQAKK+CPVDMEGQNYTILTSKCKGPKYPA LCCEALLEFCCGFVDELNDMTNNCAETMFSYINLYGQYPPGLFANQCKEGK GLSC
Subjt:  EALKSGQFTGRSLLQAKKSCPVDMEGQNYTILTSKCKGPKYPAALCCEALLEFCCGFVDELNDMTNNCAETMFSYINLYGQYPPGLFANQCKEGKDGLSC

Query:  DNALKAQAEKAQIK-ASSAASSLLTPRRLPSLALASTFILYLLL
        D AL+AQAEKA+IK ASSA+S+LLTP RL SLALASTFILYLLL
Subjt:  DNALKAQAEKAQIK-ASSAASSLLTPRRLPSLALASTFILYLLL

XP_038888702.1 GPI-anchored protein LLG1-like isoform X1 [Benincasa hispida] | 1.9e-76 | 86.93
Query:  MALNQHSYCSYFALLFLFLAFAYSHSPFLSYEALKSGQFTGRSLLQAKKSCPVDMEGQNYTILTSKCKGPKYPAALCCEALLEFCCGFVDELNDMTNNCA
        MA NQ  YC YFALLFLF+AFAYSHSPFLSYEALK+G+FTGRSLLQAKKSCPVDMEGQNYTILTSKCKGP+YP ALCCEALLEFCCGFVDELNDMTNNCA
Subjt:  MALNQHSYCSYFALLFLFLAFAYSHSPFLSYEALKSGQFTGRSLLQAKKSCPVDMEGQNYTILTSKCKGPKYPAALCCEALLEFCCGFVDELNDMTNNCA

Query:  ETMFSYINLYGQYPPGLFANQCKEGKDGLSCDNALKAQAEKAQIKASSAASSLLTPRRLPS--LALASTFILYLLL
        ETMFSYINLYGQYPPGLFANQCKEG+ GLSCD ALK QAEKA+IKASS+ASSLLT   L S  LALA TF+LYLLL
Subjt:  ETMFSYINLYGQYPPGLFANQCKEGKDGLSCDNALKAQAEKAQIKASSAASSLLTPRRLPS--LALASTFILYLLL

TrEMBL top hits | e value | % identity
A0A0A0K5Q4 Uncharacterized protein | 5.8e-92 | 100
Query:  MALNQHSYCSYFALLFLFLAFAYSHSPFLSYEALKSGQFTGRSLLQAKKSCPVDMEGQNYTILTSKCKGPKYPAALCCEALLEFCCGFVDELNDMTNNCA
        MALNQHSYCSYFALLFLFLAFAYSHSPFLSYEALKSGQFTGRSLLQAKKSCPVDMEGQNYTILTSKCKGPKYPAALCCEALLEFCCGFVDELNDMTNNCA
Subjt:  MALNQHSYCSYFALLFLFLAFAYSHSPFLSYEALKSGQFTGRSLLQAKKSCPVDMEGQNYTILTSKCKGPKYPAALCCEALLEFCCGFVDELNDMTNNCA

Query:  ETMFSYINLYGQYPPGLFANQCKEGKDGLSCDNALKAQAEKAQIKASSAASSLLTPRRLPSLALASTFILYLLL
        ETMFSYINLYGQYPPGLFANQCKEGKDGLSCDNALKAQAEKAQIKASSAASSLLTPRRLPSLALASTFILYLLL
Subjt:  ETMFSYINLYGQYPPGLFANQCKEGKDGLSCDNALKAQAEKAQIKASSAASSLLTPRRLPSLALASTFILYLLL

A0A1S4E0D8 GPI-anchored protein LORELEI-like isoform X1 | 2.6e-84 | 93.14
Query:  MALNQHSYCSYFALLFLFLAFAYSHSPFLSYEALKSGQFTGRSLLQAKKSCPVDMEGQNYTILTSKCKGPKYPAALCCEALLEFCCGFVDELNDMTNNCA
        MALNQHSYCSYF LLFLFLAFAYSHSPFLSYEALKSGQFTGRSLLQAKK+CPVDMEGQNYTILTSKCKGPKYPA LCCEALLEFCCGFVDELNDMTNNCA
Subjt:  MALNQHSYCSYFALLFLFLAFAYSHSPFLSYEALKSGQFTGRSLLQAKKSCPVDMEGQNYTILTSKCKGPKYPAALCCEALLEFCCGFVDELNDMTNNCA

Query:  ETMFSYINLYGQYPPGLFANQCKEGKDGLSCDNALKAQAEKAQIK-ASSAASSLLTPRRLPSLALASTFILYLLL
        ETMFSYINLYGQYPPGLFANQCKEGK GLSCD AL+AQAEKA+IK ASSA+S+LLTP RL SLALASTFILYLLL
Subjt:  ETMFSYINLYGQYPPGLFANQCKEGKDGLSCDNALKAQAEKAQIK-ASSAASSLLTPRRLPSLALASTFILYLLL

A0A1S4E0F2 GPI-anchored protein LORELEI-like isoform X2 | 2.6e-84 | 93.14
Query:  MALNQHSYCSYFALLFLFLAFAYSHSPFLSYEALKSGQFTGRSLLQAKKSCPVDMEGQNYTILTSKCKGPKYPAALCCEALLEFCCGFVDELNDMTNNCA
        MALNQHSYCSYF LLFLFLAFAYSHSPFLSYEALKSGQFTGRSLLQAKK+CPVDMEGQNYTILTSKCKGPKYPA LCCEALLEFCCGFVDELNDMTNNCA
Subjt:  MALNQHSYCSYFALLFLFLAFAYSHSPFLSYEALKSGQFTGRSLLQAKKSCPVDMEGQNYTILTSKCKGPKYPAALCCEALLEFCCGFVDELNDMTNNCA

Query:  ETMFSYINLYGQYPPGLFANQCKEGKDGLSCDNALKAQAEKAQIK-ASSAASSLLTPRRLPSLALASTFILYLLL
        ETMFSYINLYGQYPPGLFANQCKEGK GLSCD AL+AQAEKA+IK ASSA+S+LLTP RL SLALASTFILYLLL
Subjt:  ETMFSYINLYGQYPPGLFANQCKEGKDGLSCDNALKAQAEKAQIK-ASSAASSLLTPRRLPSLALASTFILYLLL

A0A1S4E132 GPI-anchored protein LORELEI-like isoform X3 | 2.2e-67 | 92.36
Query:  EALKSGQFTGRSLLQAKKSCPVDMEGQNYTILTSKCKGPKYPAALCCEALLEFCCGFVDELNDMTNNCAETMFSYINLYGQYPPGLFANQCKEGKDGLSC
        EALKSGQFTGRSLLQAKK+CPVDMEGQNYTILTSKCKGPKYPA LCCEALLEFCCGFVDELNDMTNNCAETMFSYINLYGQYPPGLFANQCKEGK GLSC
Subjt:  EALKSGQFTGRSLLQAKKSCPVDMEGQNYTILTSKCKGPKYPAALCCEALLEFCCGFVDELNDMTNNCAETMFSYINLYGQYPPGLFANQCKEGKDGLSC

Query:  DNALKAQAEKAQIK-ASSAASSLLTPRRLPSLALASTFILYLLL
        D AL+AQAEKA+IK ASSA+S+LLTP RL SLALASTFILYLLL
Subjt:  DNALKAQAEKAQIK-ASSAASSLLTPRRLPSLALASTFILYLLL

A0A5A7SQ23 GPI-anchored protein LORELEI-like isoform X3 | 2.7e-57 | 92
Query:  CPVDMEGQNYTILTSKCKGPKYPAALCCEALLEFCCGFVDELNDMTNNCAETMFSYINLYGQYPPGLFANQCKEGKDGLSCDNALKAQAEKAQIK-ASSA
        CPVDMEGQNYTILTSKCKGPKYPA LCCEALLEFCCGFVDELNDMTNNCAETMFSYINLYGQYPPGLFANQCKEGK GLSCD AL+AQAEKA+IK ASSA
Subjt:  CPVDMEGQNYTILTSKCKGPKYPAALCCEALLEFCCGFVDELNDMTNNCAETMFSYINLYGQYPPGLFANQCKEGKDGLSCDNALKAQAEKAQIK-ASSA

Query:  ASSLLTPRRLPSLALASTFILYLLL
        +S+LLTP RL SLALASTFILYLLL
Subjt:  ASSLLTPRRLPSLALASTFILYLLL

SwissProt top hits | e value | % identity
B3GS44 GPI-anchored protein LORELEI | 1.7e-27 | 44.24
Query:  LLFLF---LAFAYSHSPFLSYEALKS-GQFTGRSLLQAKKSCPVDMEGQNYTILTSKCKGPKYPAALCCEALLEFCCGFVDELNDMTNNCAETMFSYINL
        LLF F   L  + S S  +S    +S    +GR+L  AKK C V+ E  +Y +LT +CKGP +PA  CC A  EF C +V ++NDM ++CA+TMFSY+N+
Subjt:  LLFLF---LAFAYSHSPFLSYEALKS-GQFTGRSLLQAKKSCPVDMEGQNYTILTSKCKGPKYPAALCCEALLEFCCGFVDELNDMTNNCAETMFSYINL

Query:  YGQYPPGLFANQCKEGKDGLSCDNALKAQAEKAQIKASSAASSLLTPRRLPSLALASTFILYLLL
        YG YP GLFAN+C+E KDGL C            + AS+A S   TPR +  L  A+T +  LL+
Subjt:  YGQYPPGLFANQCKEGKDGLSCDNALKAQAEKAQIKASSAASSLLTPRRLPSLALASTFILYLLL

Q6NLF4 GPI-anchored protein LLG2 | 1.5e-33 | 46.71
Query:  YCSYFALLFLFLAFAYSHSPFLSYEALKSGQFTGRSLLQAKKSCPVDMEGQNYTILTSKCKGPKYPAALCCEALLEFCCGFVDELNDMTNNCAETMFSYI
        YC   +LL +FL   +S    LSY+       T R+LLQ + +C  D   +NYTI+TS+CKGP YPA +CC A  +F C F + LND  N+CA TMFSYI
Subjt:  YCSYFALLFLFLAFAYSHSPFLSYEALKSGQFTGRSLLQAKKSCPVDMEGQNYTILTSKCKGPKYPAALCCEALLEFCCGFVDELNDMTNNCAETMFSYI

Query:  NLYGQYPPGLFANQCKEGKDGLSCDNALKAQAEKAQIKASSAASSLLTPRRLPSLALASTFILYLLL
        NLYG+YPPG+FAN CKEGK+GL C +  ++        AS+ + S+       SLA+ STF++  LL
Subjt:  NLYGQYPPGLFANQCKEGKDGLSCDNALKAQAEKAQIKASSAASSLLTPRRLPSLALASTFILYLLL

Q9FKT1 GPI-anchored protein LLG1 | 1.9e-36 | 53.1
Query:  FALLFLFLAFAYSHSPFLSYEALKSGQFT-GRSLLQAKKSCPVDMEGQNYTILTSKCKGPKYPAALCCEALLEFCCGFVDELNDMTNNCAETMFSYINLY
        F  L L +  ++S S F+S    +S     GR+LLQ KK+CPV+ E  NYTI+TSKCKGPKYP   CC A  +F C + D+LND++++CA TMFSYINLY
Subjt:  FALLFLFLAFAYSHSPFLSYEALKSGQFT-GRSLLQAKKSCPVDMEGQNYTILTSKCKGPKYPAALCCEALLEFCCGFVDELNDMTNNCAETMFSYINLY

Query:  GQYPPGLFANQCKEGKDGLSCDNALKAQAE-KAQIKASSAASSLL
        G+YPPGLFANQCKEGK+GL C    +   E  A++ A++ +SS L
Subjt:  GQYPPGLFANQCKEGKDGLSCDNALKAQAE-KAQIKASSAASSLL

Q9M0I0 GPI-anchored protein LLG3 | 1.8e-34 | 48.39
Query:  MALNQHSYCSYFALLFLFLAFAYSHSPFLSYEALKSGQFTGRSLLQAKKSCPVDMEGQNYTILTSKCKGPKYPAALCCEALLEFCCGFVDELNDMTNNCA
        M +  H   S  ++L L   FA+SH   +S +  +S   T R+LLQAK +C  D   +NYTI+TSKCKGP YPA +CC A  +F C F + LND   +CA
Subjt:  MALNQHSYCSYFALLFLFLAFAYSHSPFLSYEALKSGQFTGRSLLQAKKSCPVDMEGQNYTILTSKCKGPKYPAALCCEALLEFCCGFVDELNDMTNNCA

Query:  ETMFSYINLYGQYPPGLFANQCKEGKDGLSCDNALKAQAEKAQIKASSAASSLLT
         TMFSYINLYG+YPPG+FAN CKEGK+GL C +     +  A I   S    L+T
Subjt:  ETMFSYINLYGQYPPGLFANQCKEGKDGLSCDNALKAQAEKAQIKASSAASSLLT

Arabidopsis top hits | e value | % identity
AT2G20700.1 LORELEI-LIKE-GPI ANCHORED PROTEIN 2 | 1.1e-34 | 46.71
Query:  YCSYFALLFLFLAFAYSHSPFLSYEALKSGQFTGRSLLQAKKSCPVDMEGQNYTILTSKCKGPKYPAALCCEALLEFCCGFVDELNDMTNNCAETMFSYI
        YC   +LL +FL   +S    LSY+       T R+LLQ + +C  D   +NYTI+TS+CKGP YPA +CC A  +F C F + LND  N+CA TMFSYI
Subjt:  YCSYFALLFLFLAFAYSHSPFLSYEALKSGQFTGRSLLQAKKSCPVDMEGQNYTILTSKCKGPKYPAALCCEALLEFCCGFVDELNDMTNNCAETMFSYI

Query:  NLYGQYPPGLFANQCKEGKDGLSCDNALKAQAEKAQIKASSAASSLLTPRRLPSLALASTFILYLLL
        NLYG+YPPG+FAN CKEGK+GL C +  ++        AS+ + S+       SLA+ STF++  LL
Subjt:  NLYGQYPPGLFANQCKEGKDGLSCDNALKAQAEKAQIKASSAASSLLTPRRLPSLALASTFILYLLL

AT4G26466.1 lorelei | 1.2e-28 | 44.24
Query:  LLFLF---LAFAYSHSPFLSYEALKS-GQFTGRSLLQAKKSCPVDMEGQNYTILTSKCKGPKYPAALCCEALLEFCCGFVDELNDMTNNCAETMFSYINL
        LLF F   L  + S S  +S    +S    +GR+L  AKK C V+ E  +Y +LT +CKGP +PA  CC A  EF C +V ++NDM ++CA+TMFSY+N+
Subjt:  LLFLF---LAFAYSHSPFLSYEALKS-GQFTGRSLLQAKKSCPVDMEGQNYTILTSKCKGPKYPAALCCEALLEFCCGFVDELNDMTNNCAETMFSYINL

Query:  YGQYPPGLFANQCKEGKDGLSCDNALKAQAEKAQIKASSAASSLLTPRRLPSLALASTFILYLLL
        YG YP GLFAN+C+E KDGL C            + AS+A S   TPR +  L  A+T +  LL+
Subjt:  YGQYPPGLFANQCKEGKDGLSCDNALKAQAEKAQIKASSAASSLLTPRRLPSLALASTFILYLLL

AT4G28280.1 LORELEI-LIKE-GPI ANCHORED PROTEIN 3 | 3.7e-30 | 53.77
Query:  SCPVDMEGQNYTILTSKCKGPKYPAALCCEALLEFCCGFVDELNDMTNNCAETMFSYINLYGQYPPGLFANQCKEGKDGLSCDNALKAQAEKAQIKASSA
        +C  D   +NYTI+TSKCKGP YPA +CC A  +F C F + LND   +CA TMFSYINLYG+YPPG+FAN CKEGK+GL C +     +  A I   S 
Subjt:  SCPVDMEGQNYTILTSKCKGPKYPAALCCEALLEFCCGFVDELNDMTNNCAETMFSYINLYGQYPPGLFANQCKEGKDGLSCDNALKAQAEKAQIKASSA

Query:  ASSLLT
           L+T
Subjt:  ASSLLT

AT4G28280.2 LORELEI-LIKE-GPI ANCHORED PROTEIN 3 | 1.3e-35 | 48.39
Query:  MALNQHSYCSYFALLFLFLAFAYSHSPFLSYEALKSGQFTGRSLLQAKKSCPVDMEGQNYTILTSKCKGPKYPAALCCEALLEFCCGFVDELNDMTNNCA
        M +  H   S  ++L L   FA+SH   +S +  +S   T R+LLQAK +C  D   +NYTI+TSKCKGP YPA +CC A  +F C F + LND   +CA
Subjt:  MALNQHSYCSYFALLFLFLAFAYSHSPFLSYEALKSGQFTGRSLLQAKKSCPVDMEGQNYTILTSKCKGPKYPAALCCEALLEFCCGFVDELNDMTNNCA

Query:  ETMFSYINLYGQYPPGLFANQCKEGKDGLSCDNALKAQAEKAQIKASSAASSLLT
         TMFSYINLYG+YPPG+FAN CKEGK+GL C +     +  A I   S    L+T
Subjt:  ETMFSYINLYGQYPPGLFANQCKEGKDGLSCDNALKAQAEKAQIKASSAASSLLT

AT5G56170.1 LORELEI-LIKE-GPI-ANCHORED PROTEIN 1 | 1.4e-37 | 53.1
Query:  FALLFLFLAFAYSHSPFLSYEALKSGQFT-GRSLLQAKKSCPVDMEGQNYTILTSKCKGPKYPAALCCEALLEFCCGFVDELNDMTNNCAETMFSYINLY
        F  L L +  ++S S F+S    +S     GR+LLQ KK+CPV+ E  NYTI+TSKCKGPKYP   CC A  +F C + D+LND++++CA TMFSYINLY
Subjt:  FALLFLFLAFAYSHSPFLSYEALKSGQFT-GRSLLQAKKSCPVDMEGQNYTILTSKCKGPKYPAALCCEALLEFCCGFVDELNDMTNNCAETMFSYINLY

Query:  GQYPPGLFANQCKEGKDGLSCDNALKAQAE-KAQIKASSAASSLL
        G+YPPGLFANQCKEGK+GL C    +   E  A++ A++ +SS L
Subjt:  GQYPPGLFANQCKEGKDGLSCDNALKAQAE-KAQIKASSAASSLL
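
Note on the % identity values: a minimal sketch of how such a figure can be derived from an alignment midline, assuming the usual BLAST-style convention (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch or gap). The function name `percent_identity` is hypothetical and not part of CuGenDB. The example uses only the final 25-column block of the A0A5A7SQ23 alignment, so its result is for that fragment, not the overall 92 reported for the hit.

```python
def percent_identity(midline: str) -> float:
    """Percent of alignment columns whose midline character is a letter.

    Assumes a BLAST-style midline: letter = identity, '+' = conservative
    substitution, space = mismatch or gap. The denominator is the number
    of alignment columns (gaps included).
    """
    if not midline:
        raise ValueError("empty midline")
    identical = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identical / len(midline), 2)


# Final block of the A0A5A7SQ23 alignment (25 columns).
query = "ASSLLTPRRLPSLALASTFILYLLL"
midline = "+S+LLTP RL SLALASTFILYLLL"
assert len(query) == len(midline)

print(percent_identity(midline))  # 21 identical columns out of 25
```

To score a whole hit, the midline rows of all its blocks would be concatenated before calling the function; trailing spaces in a midline row must be preserved, since they count as alignment columns.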


Sequences
CDS sequence
ATGGCATTGAATCAACATTCTTATTGCTCCTATTTTGCCCTTCTCTTCCTTTTCCTTGCCTTTGCTTACTCTCACTCCCCATTTCTCTCCTATGAGGCACTCAAATCTGG
GCAATTTACTGGGCGCTCCCTTCTTCAGGCCAAAAAGAGTTGCCCAGTAGACATGGAAGGGCAGAATTACACAATCCTAACAAGTAAATGCAAAGGCCCTAAATACCCAG
CAGCACTCTGTTGTGAGGCATTGTTGGAATTTTGCTGTGGGTTTGTTGATGAATTAAATGACATGACCAACAATTGTGCTGAGACAATGTTTAGTTACATAAATCTCTAT
GGCCAATACCCTCCTGGCCTCTTTGCCAATCAATGCAAAGAAGGGAAAGATGGTCTTTCTTGTGACAATGCCCTGAAAGCCCAGGCCGAGAAAGCCCAGATCAAAGCTTC
TTCTGCTGCTTCGTCTCTACTAACTCCACGCCGACTCCCTTCTCTCGCTCTCGCTTCTACCTTTATACTCTATTTGTTGTTATAG
mRNA sequence
AATCCATAAAAATAAAAAATATTTGGTTTGATTATCGTAAGCCGTTTGCAGGCGGAGACAGAAAGCTCCACCGCCAACGGCTACCGGAGAGAACACAAAGTTTAGAAGAG
AAGAGAAATGGCATTGAATCAACATTCTTATTGCTCCTATTTTGCCCTTCTCTTCCTTTTCCTTGCCTTTGCTTACTCTCACTCCCCATTTCTCTCCTATGAGGCACTCA
AATCTGGGCAATTTACTGGGCGCTCCCTTCTTCAGGCCAAAAAGAGTTGCCCAGTAGACATGGAAGGGCAGAATTACACAATCCTAACAAGTAAATGCAAAGGCCCTAAA
TACCCAGCAGCACTCTGTTGTGAGGCATTGTTGGAATTTTGCTGTGGGTTTGTTGATGAATTAAATGACATGACCAACAATTGTGCTGAGACAATGTTTAGTTACATAAA
TCTCTATGGCCAATACCCTCCTGGCCTCTTTGCCAATCAATGCAAAGAAGGGAAAGATGGTCTTTCTTGTGACAATGCCCTGAAAGCCCAGGCCGAGAAAGCCCAGATCA
AAGCTTCTTCTGCTGCTTCGTCTCTACTAACTCCACGCCGACTCCCTTCTCTCGCTCTCGCTTCTACCTTTATACTCTATTTGTTGTTATAGAATGCAAAAGTAACCACA
ACCACCACCCCCTTCGTTATCATAACTTGCTTATATGATGCCAATGCCAAGGCCGTCTCAAACCTCTTCTTGTGTTCTGTTTGTTTGATGGATTTTCTGTATAACTATTA
ACATCCAACTTGGTGGGATTTGGGTAAAGAAACTTAAATTTATTTTATATTTCTTTTGTTAAATTACAAATTGAGTTCCATTCTCAAATTTTAAATTTTGTACTTGTTTA
CTAAACATTTTTTAAAGTTGAACG
Protein sequence
MALNQHSYCSYFALLFLFLAFAYSHSPFLSYEALKSGQFTGRSLLQAKKSCPVDMEGQNYTILTSKCKGPKYPAALCCEALLEFCCGFVDELNDMTNNCAETMFSYINLY
GQYPPGLFANQCKEGKDGLSCDNALKAQAEKAQIKASSAASSLLTPRRLPSLALASTFILYLLL
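
Note on the sequences: the CDS and protein records above can be cross-checked by translating the CDS with the standard genetic code. The following is a minimal sketch using only the Python standard library; `translate` and `CODON_TABLE` are illustrative names, not a CuGenDB tool, and the example translates just the first 30 nucleotides and the last 12 nucleotides of the CDS.

```python
# Standard genetic code (NCBI translation table 1), bases ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for pos in range(0, len(cds), 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First 30 nt of the CDS: matches the start of the protein record.
print(translate("ATGGCATTGAATCAACATTCTTATTGCTCC"))  # MALNQHSYCS
# Last 12 nt of the CDS, ending in the TAG stop codon.
print(translate("TTGTTGTTATAG"))  # LLL
```

Passing the full CDS (line breaks are ignored) and comparing the result with the protein sequence gives a quick consistency check between the two records.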