; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cucsat.G21707 (gene) of Cucumber (B10) v3 genome

Gene IDCucsat.G21707
OrganismCucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
DescriptionLEA_2 domain-containing protein
Genome locationctg999:207602..210533
RNA-Seq ExpressionCucsat.G21707
SyntenyCucsat.G21707
Gene Ontology termsGO:0009269 - response to desiccation (biological process)
GO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR004864 - Late embryogenesis abundant protein, LEA_2 subgroup


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0046596.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family isoform 1 [Cucumis melo var. makuwa]2.35e-11895.81Show/hide
Query:  MTSSSGDDSVPVPYTLIPSNAAQQNVVVLSLYRPPPCRHRRLLRLCAFYSAAFLLLFAVAFLLFPSDPSLQLVRLKLNRVKVHLVPVVSLDLSFSVSLRV
        MT+SSGDDSVPVPYTL+ SNAAQQNVVVLSLYRP PCRHRRLLRL AFYSAAFLLLFAVAFLLFPSDPSLQLVRLKLNRVKVHLVP VSLDLSFSVSLRV
Subjt:  MTSSSGDDSVPVPYTLIPSNAAQQNVVVLSLYRPPPCRHRRLLRLCAFYSAAFLLLFAVAFLLFPSDPSLQLVRLKLNRVKVHLVPVVSLDLSFSVSLRV

Query:  RNKNFFSLNYNFLGVSVGYRGRRLGYVSSEGGRVSARGSSYVNATLDLNGLEVVHDVLYLLADLGKGIIPFDTETDVEGSMGLFFIKIPIK
        RNKNFFSLNYNFLGVSVGYRGRRLGYVSS GGRVSARGSSYVNATLDLNGLEVVHDVLYLLADLGKGIIPFDTET+VEGSMGLFFIKIPIK
Subjt:  RNKNFFSLNYNFLGVSVGYRGRRLGYVSSEGGRVSARGSSYVNATLDLNGLEVVHDVLYLLADLGKGIIPFDTETDVEGSMGLFFIKIPIK

XP_004148717.1 uncharacterized protein LOC101219269 [Cucumis sativus]7.49e-146100Show/hide
Query:  MTSSSGDDSVPVPYTLIPSNAAQQNVVVLSLYRPPPCRHRRLLRLCAFYSAAFLLLFAVAFLLFPSDPSLQLVRLKLNRVKVHLVPVVSLDLSFSVSLRV
        MTSSSGDDSVPVPYTLIPSNAAQQNVVVLSLYRPPPCRHRRLLRLCAFYSAAFLLLFAVAFLLFPSDPSLQLVRLKLNRVKVHLVPVVSLDLSFSVSLRV
Subjt:  MTSSSGDDSVPVPYTLIPSNAAQQNVVVLSLYRPPPCRHRRLLRLCAFYSAAFLLLFAVAFLLFPSDPSLQLVRLKLNRVKVHLVPVVSLDLSFSVSLRV

Query:  RNKNFFSLNYNFLGVSVGYRGRRLGYVSSEGGRVSARGSSYVNATLDLNGLEVVHDVLYLLADLGKGIIPFDTETDVEGSMGLFFIKIPIKARVSCEVLV
        RNKNFFSLNYNFLGVSVGYRGRRLGYVSSEGGRVSARGSSYVNATLDLNGLEVVHDVLYLLADLGKGIIPFDTETDVEGSMGLFFIKIPIKARVSCEVLV
Subjt:  RNKNFFSLNYNFLGVSVGYRGRRLGYVSSEGGRVSARGSSYVNATLDLNGLEVVHDVLYLLADLGKGIIPFDTETDVEGSMGLFFIKIPIKARVSCEVLV

Query:  NTNNQTIEHQDCYPE
        NTNNQTIEHQDCYPE
Subjt:  NTNNQTIEHQDCYPE

XP_008463384.1 PREDICTED: uncharacterized protein LOC103501551 [Cucumis melo]1.52e-13896.28Show/hide
Query:  MTSSSGDDSVPVPYTLIPSNAAQQNVVVLSLYRPPPCRHRRLLRLCAFYSAAFLLLFAVAFLLFPSDPSLQLVRLKLNRVKVHLVPVVSLDLSFSVSLRV
        MT+SSGDDSVPVPYTL+ SNAAQQNVVVLSLYRP PCRHRRLLRL AFYSAAFLLLFAVAFLLFPSDPSLQLVRLKLNRVKVHLVP VSLDLSFSVSLRV
Subjt:  MTSSSGDDSVPVPYTLIPSNAAQQNVVVLSLYRPPPCRHRRLLRLCAFYSAAFLLLFAVAFLLFPSDPSLQLVRLKLNRVKVHLVPVVSLDLSFSVSLRV

Query:  RNKNFFSLNYNFLGVSVGYRGRRLGYVSSEGGRVSARGSSYVNATLDLNGLEVVHDVLYLLADLGKGIIPFDTETDVEGSMGLFFIKIPIKARVSCEVLV
        RNKNFFSLNYNFLGVSVGYRGRRLGYVSS GGRVSARGSSYVNATLDLNGLEVVHDVLYLLADLGKGIIPFDTET+VEGSMGLFFIKIPIKARVSCEVLV
Subjt:  RNKNFFSLNYNFLGVSVGYRGRRLGYVSSEGGRVSARGSSYVNATLDLNGLEVVHDVLYLLADLGKGIIPFDTETDVEGSMGLFFIKIPIKARVSCEVLV

Query:  NTNNQTIEHQDCYPE
        NTNNQTIEHQDCYPE
Subjt:  NTNNQTIEHQDCYPE

XP_022144909.1 uncharacterized protein LOC111014473 [Momordica charantia]6.43e-11982.79Show/hide
Query:  MTSSSGDDSVPVPYTLIPSNAAQQNVVVLSLYRPPPCRHRRLLRLCAFYSAAFLLLFAVAFLLFPSDPSLQLVRLKLNRVKVHLVPVVSLDLSFSVSLRV
        MTSSS DDSVPVPY+L+P NAA QNVVVLSLYRPP  R RRLLRLCAFYSAAFLLL AVAFLLFP+DPSLQLVRLKLNR+KV L+PV+ LDLSFS S+RV
Subjt:  MTSSSGDDSVPVPYTLIPSNAAQQNVVVLSLYRPPPCRHRRLLRLCAFYSAAFLLLFAVAFLLFPSDPSLQLVRLKLNRVKVHLVPVVSLDLSFSVSLRV

Query:  RNKNFFSLNYNFLGVSVGYRGRRLGYVSSEGGRVSARGSSYVNATLDLNGLEVVHDVLYLLADLGKGIIPFDTETDVEGSMGLFFIKIPIKARVSCEVLV
        RN NFFSL+YN+LGVSVGYRGRRLG+VSSEGGRVSARG SYVNATLDLNG EV+HD +YL+ DL  GI+PFDTET+VEG MGLFFIK PIKARVSCEV V
Subjt:  RNKNFFSLNYNFLGVSVGYRGRRLGYVSSEGGRVSARGSSYVNATLDLNGLEVVHDVLYLLADLGKGIIPFDTETDVEGSMGLFFIKIPIKARVSCEVLV

Query:  NTNNQTIEHQDCYPE
        NTN++TIEHQDCYPE
Subjt:  NTNNQTIEHQDCYPE

XP_038878687.1 uncharacterized protein LOC120070868 [Benincasa hispida]1.51e-12686.61Show/hide
Query:  PKLWQTNMTSSS-GDDSVPVPYTLIPSNAAQQNVVVLSLYRPPPCRHRRLLRLCAFYSAAFLLLFAVAFLLFPSDPSLQLVRLKLNRVKVHLVPV-VSLD
        P   Q  MTSSS  DDSVPVPYTL+P NAAQQNVVVLSLYR PPC+H RLLRLCA YSAAFLLLFAVAFLLFP+DPS QLVRLKLN VKVHLVP  VSLD
Subjt:  PKLWQTNMTSSS-GDDSVPVPYTLIPSNAAQQNVVVLSLYRPPPCRHRRLLRLCAFYSAAFLLLFAVAFLLFPSDPSLQLVRLKLNRVKVHLVPV-VSLD

Query:  LSFSVSLRVRNKNFFSLNYNFLGVSVGYRGRRLGYVSSEGGRVSARGSSYVNATLDLNGLEVVHDVLYLLADLGKGIIPFDTETDVEGSMGLFFIKIPIK
        LSF  SLRVRNKNFFSL+Y+++GVSVGYRG+RLG+VSSEGGRVSARGSSYVNATLDLNGLEVVHDVLYLLADLGKGIIPFDTET+VEGSMGLFFIK PIK
Subjt:  LSFSVSLRVRNKNFFSLNYNFLGVSVGYRGRRLGYVSSEGGRVSARGSSYVNATLDLNGLEVVHDVLYLLADLGKGIIPFDTETDVEGSMGLFFIKIPIK

Query:  ARVSCEVLVNTNNQTIEHQDCYPE
        A+VSCEVLVN NNQTIEHQDCYPE
Subjt:  ARVSCEVLVNTNNQTIEHQDCYPE

TrEMBL top hitse value%identityAlignment
A0A0A0LTV4 LEA_2 domain-containing protein3.63e-146100Show/hide
Query:  MTSSSGDDSVPVPYTLIPSNAAQQNVVVLSLYRPPPCRHRRLLRLCAFYSAAFLLLFAVAFLLFPSDPSLQLVRLKLNRVKVHLVPVVSLDLSFSVSLRV
        MTSSSGDDSVPVPYTLIPSNAAQQNVVVLSLYRPPPCRHRRLLRLCAFYSAAFLLLFAVAFLLFPSDPSLQLVRLKLNRVKVHLVPVVSLDLSFSVSLRV
Subjt:  MTSSSGDDSVPVPYTLIPSNAAQQNVVVLSLYRPPPCRHRRLLRLCAFYSAAFLLLFAVAFLLFPSDPSLQLVRLKLNRVKVHLVPVVSLDLSFSVSLRV

Query:  RNKNFFSLNYNFLGVSVGYRGRRLGYVSSEGGRVSARGSSYVNATLDLNGLEVVHDVLYLLADLGKGIIPFDTETDVEGSMGLFFIKIPIKARVSCEVLV
        RNKNFFSLNYNFLGVSVGYRGRRLGYVSSEGGRVSARGSSYVNATLDLNGLEVVHDVLYLLADLGKGIIPFDTETDVEGSMGLFFIKIPIKARVSCEVLV
Subjt:  RNKNFFSLNYNFLGVSVGYRGRRLGYVSSEGGRVSARGSSYVNATLDLNGLEVVHDVLYLLADLGKGIIPFDTETDVEGSMGLFFIKIPIKARVSCEVLV

Query:  NTNNQTIEHQDCYPE
        NTNNQTIEHQDCYPE
Subjt:  NTNNQTIEHQDCYPE

A0A1S3CJK6 uncharacterized protein LOC1035015517.37e-13996.28Show/hide
Query:  MTSSSGDDSVPVPYTLIPSNAAQQNVVVLSLYRPPPCRHRRLLRLCAFYSAAFLLLFAVAFLLFPSDPSLQLVRLKLNRVKVHLVPVVSLDLSFSVSLRV
        MT+SSGDDSVPVPYTL+ SNAAQQNVVVLSLYRP PCRHRRLLRL AFYSAAFLLLFAVAFLLFPSDPSLQLVRLKLNRVKVHLVP VSLDLSFSVSLRV
Subjt:  MTSSSGDDSVPVPYTLIPSNAAQQNVVVLSLYRPPPCRHRRLLRLCAFYSAAFLLLFAVAFLLFPSDPSLQLVRLKLNRVKVHLVPVVSLDLSFSVSLRV

Query:  RNKNFFSLNYNFLGVSVGYRGRRLGYVSSEGGRVSARGSSYVNATLDLNGLEVVHDVLYLLADLGKGIIPFDTETDVEGSMGLFFIKIPIKARVSCEVLV
        RNKNFFSLNYNFLGVSVGYRGRRLGYVSS GGRVSARGSSYVNATLDLNGLEVVHDVLYLLADLGKGIIPFDTET+VEGSMGLFFIKIPIKARVSCEVLV
Subjt:  RNKNFFSLNYNFLGVSVGYRGRRLGYVSSEGGRVSARGSSYVNATLDLNGLEVVHDVLYLLADLGKGIIPFDTETDVEGSMGLFFIKIPIKARVSCEVLV

Query:  NTNNQTIEHQDCYPE
        NTNNQTIEHQDCYPE
Subjt:  NTNNQTIEHQDCYPE

A0A5A7TX90 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family isoform 11.14e-11895.81Show/hide
Query:  MTSSSGDDSVPVPYTLIPSNAAQQNVVVLSLYRPPPCRHRRLLRLCAFYSAAFLLLFAVAFLLFPSDPSLQLVRLKLNRVKVHLVPVVSLDLSFSVSLRV
        MT+SSGDDSVPVPYTL+ SNAAQQNVVVLSLYRP PCRHRRLLRL AFYSAAFLLLFAVAFLLFPSDPSLQLVRLKLNRVKVHLVP VSLDLSFSVSLRV
Subjt:  MTSSSGDDSVPVPYTLIPSNAAQQNVVVLSLYRPPPCRHRRLLRLCAFYSAAFLLLFAVAFLLFPSDPSLQLVRLKLNRVKVHLVPVVSLDLSFSVSLRV

Query:  RNKNFFSLNYNFLGVSVGYRGRRLGYVSSEGGRVSARGSSYVNATLDLNGLEVVHDVLYLLADLGKGIIPFDTETDVEGSMGLFFIKIPIK
        RNKNFFSLNYNFLGVSVGYRGRRLGYVSS GGRVSARGSSYVNATLDLNGLEVVHDVLYLLADLGKGIIPFDTET+VEGSMGLFFIKIPIK
Subjt:  RNKNFFSLNYNFLGVSVGYRGRRLGYVSSEGGRVSARGSSYVNATLDLNGLEVVHDVLYLLADLGKGIIPFDTETDVEGSMGLFFIKIPIK

A0A6J1CTN0 uncharacterized protein LOC1110144733.11e-11982.79Show/hide
Query:  MTSSSGDDSVPVPYTLIPSNAAQQNVVVLSLYRPPPCRHRRLLRLCAFYSAAFLLLFAVAFLLFPSDPSLQLVRLKLNRVKVHLVPVVSLDLSFSVSLRV
        MTSSS DDSVPVPY+L+P NAA QNVVVLSLYRPP  R RRLLRLCAFYSAAFLLL AVAFLLFP+DPSLQLVRLKLNR+KV L+PV+ LDLSFS S+RV
Subjt:  MTSSSGDDSVPVPYTLIPSNAAQQNVVVLSLYRPPPCRHRRLLRLCAFYSAAFLLLFAVAFLLFPSDPSLQLVRLKLNRVKVHLVPVVSLDLSFSVSLRV

Query:  RNKNFFSLNYNFLGVSVGYRGRRLGYVSSEGGRVSARGSSYVNATLDLNGLEVVHDVLYLLADLGKGIIPFDTETDVEGSMGLFFIKIPIKARVSCEVLV
        RN NFFSL+YN+LGVSVGYRGRRLG+VSSEGGRVSARG SYVNATLDLNG EV+HD +YL+ DL  GI+PFDTET+VEG MGLFFIK PIKARVSCEV V
Subjt:  RNKNFFSLNYNFLGVSVGYRGRRLGYVSSEGGRVSARGSSYVNATLDLNGLEVVHDVLYLLADLGKGIIPFDTETDVEGSMGLFFIKIPIKARVSCEVLV

Query:  NTNNQTIEHQDCYPE
        NTN++TIEHQDCYPE
Subjt:  NTNNQTIEHQDCYPE

A0A6J1JI07 uncharacterized protein LOC1114852801.07e-11281.31Show/hide
Query:  SSSGDDSVPVPYTLIPSNAAQ-QNVVVLSLYRPPPCRHRRLLRLCAFYSAAFLLLFAVAFLLFPSDPSLQLVRLKLNRVKVHLVPVVSLDLSFSVSLRVR
        S S D S+PVPY+ IP NAA  QNVVVLSLYRPP  R RRLLRLCA YSAAFLLL AV FLLFPSDPSLQLVRLKLN VKV L+P V LDLSFS S+RVR
Subjt:  SSSGDDSVPVPYTLIPSNAAQ-QNVVVLSLYRPPPCRHRRLLRLCAFYSAAFLLLFAVAFLLFPSDPSLQLVRLKLNRVKVHLVPVVSLDLSFSVSLRVR

Query:  NKNFFSLNYNFLGVSVGYRGRRLGYVSSEGGRVSARGSSYVNATLDLNGLEVVHDVLYLLADLGKGIIPFDTETDVEGSMGLFFIKIPIKARVSCEVLVN
        NKNFFSL+YN+LGVSVG+RGRRLG+VSS+GGRVSARGSSYVNATLDLNGL+++HDV +LL DL KGIIPFDTET+VEGSMGLFFIK PIKA VSCEV V+
Subjt:  NKNFFSLNYNFLGVSVGYRGRRLGYVSSEGGRVSARGSSYVNATLDLNGLEVVHDVLYLLADLGKGIIPFDTETDVEGSMGLFFIKIPIKARVSCEVLVN

Query:  TNNQTIEHQDCYPE
        TN+QTIEHQDCYPE
Subjt:  TNNQTIEHQDCYPE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G52330.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family4.1e-4040.49Show/hide
Query:  YTLIPSNAAQQ--NVVVLSLYRPPPCRHRRLLRLCAFYSAAFLLLFA--VAFLLFPSDPSLQLVRLKLNRVKVHLVPVVSLDLSFSVSLRVRNKNFFSLN
        Y  +PS+++ +  + V++S +  PP R R       F  + FL+ FA  + ++ +PSDP ++++R+K++ V VH  PV S+D++  V+L+V N + +S +
Subjt:  YTLIPSNAAQQ--NVVVLSLYRPPPCRHRRLLRLCAFYSAAFLLLFA--VAFLLFPSDPSLQLVRLKLNRVKVHLVPVVSLDLSFSVSLRVRNKNFFSLN

Query:  YNFLGVSVGYRGRRLGYVSSEGGRVSARGSSYVNATLDLNGLEVVHDVLYLLADLGKGIIPFDTETDVEGSMGLFFIKIPIKARVSCEVLVNTNNQTIEH
        +  L V++ YRG+ LG+VSS+GG V+A GSSY++A  +L+G+ V  DV++L+ DL KG + FDT T+  G +G+ F + P+KA+V+C +LV+T NQTI  
Subjt:  YNFLGVSVGYRGRRLGYVSSEGGRVSARGSSYVNATLDLNGLEVVHDVLYLLADLGKGIIPFDTETDVEGSMGLFFIKIPIKARVSCEVLVNTNNQTIEH

Query:  QDCYP
        Q C P
Subjt:  QDCYP

AT1G52330.2 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family2.4e-3238.59Show/hide
Query:  YTLIPSNAAQQ--NVVVLSLYRPPPCRHRRLLRLCAFYSAAFLLLFA--VAFLLFPSDPSLQLVRLKLNRVKVHLVPVVSLDLSFSVSLRVRNKNFFSLN
        Y  +PS+++ +  + V++S +  PP R R       F  + FL+ FA  + ++ +PSDP ++++R+K++ V VH  PV S+D++  V+L+V N + +S +
Subjt:  YTLIPSNAAQQ--NVVVLSLYRPPPCRHRRLLRLCAFYSAAFLLLFA--VAFLLFPSDPSLQLVRLKLNRVKVHLVPVVSLDLSFSVSLRVRNKNFFSLN

Query:  YNFLGVSVGYRGRRLGYVSSEGGRVSARGSSYVNATLDLNGLEVVHDVLYLLADLGKGIIPFDTETDVEGSMGLFFIKIPIKAR
        +  L V++ YRG+ LG+VSS+GG V+A GSSY++A  +L+G+ V  DV++L+ DL KG + FDT T+  G +G+ F + P+K R
Subjt:  YNFLGVSVGYRGRRLGYVSSEGGRVSARGSSYVNATLDLNGLEVVHDVLYLLADLGKGIIPFDTETDVEGSMGLFFIKIPIKAR

AT4G13270.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family9.1e-5650.93Show/hide
Query:  SSSGDDSVPVPYTLIPSNAAQQNVVVLSLYRPPPCRHR-----RLLRLCAFYSAAFLLLFAVAFLLFPSDPSLQLVRLKLNRVKVHLVPVVSLDLSFSVS
        +SS  +   +PYT +PS+   Q+V++L+ YR    RHR     R LR    ++A  LLL A  +LL+PSDP + + R+ LN + V     ++LDLSFS++
Subjt:  SSSGDDSVPVPYTLIPSNAAQQNVVVLSLYRPPPCRHR-----RLLRLCAFYSAAFLLLFAVAFLLFPSDPSLQLVRLKLNRVKVHLVPVVSLDLSFSVS

Query:  LRVRNKNFFSLNYNFLGVSVGYRGRRLGYVSSEGGRVSARGSSYVNATLDLNGLEVVHDVLYLLADLGKGIIPFDTETDVEGSMGLFFIKIPIKARVSCE
        ++VRN++FFSL+Y+ L VS+GYRGR LG V S+GG + AR SSY++ATL+L+GLEVVHDV+YL+ DL KG+IPFDT   V+G +G+    IPI+ +VSCE
Subjt:  LRVRNKNFFSLNYNFLGVSVGYRGRRLGYVSSEGGRVSARGSSYVNATLDLNGLEVVHDVLYLLADLGKGIIPFDTETDVEGSMGLFFIKIPIKARVSCE

Query:  VLVNTNNQTIEHQDCY
        V VN NNQ I HQDC+
Subjt:  VLVNTNNQTIEHQDCY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATTCTAATTCTAATTCTCCCTAAGCTTTGGCAAACAAACATGACCTCCAGCTCCGGGGACGATTCGGTCCCGGTGCCCTACACTCTCATTCCCTCAAATGCTGCGCAACA
AAACGTCGTCGTTTTATCCCTCTACCGTCCCCCTCCATGCCGACACCGCCGCCTTCTCCGCCTCTGTGCCTTCTACTCTGCCGCCTTCCTCCTCCTCTTCGCCGTTGCTT
TTCTTCTTTTTCCCTCCGATCCCTCCCTCCAACTTGTCCGTTTGAAACTCAATCGCGTCAAAGTCCATTTGGTGCCTGTTGTTTCTCTTGACCTTTCTTTTTCTGTTTCT
CTTAGGGTTCGCAATAAGAACTTCTTCTCTCTTAATTACAATTTCCTTGGCGTTTCCGTTGGCTACCGGGGAAGACGACTTGGATATGTGAGCTCTGAAGGCGGTCGAGT
TTCTGCTCGAGGTTCTTCTTATGTGAATGCCACTCTCGATTTGAATGGGTTGGAGGTTGTTCATGATGTCTTGTACTTGCTTGCGGATCTGGGGAAGGGTATTATTCCCT
TCGATACGGAGACCGATGTGGAAGGATCCATGGGGCTCTTCTTTATCAAAATCCCGATTAAGGCAAGAGTGTCATGTGAGGTACTTGTGAATACAAATAACCAGACAATT
GAACATCAAGATTGCTACCCTGAGGTGAAAATTCATCATGCTGCTCACTTTTGTATTGATATTTCATATTTCTGCCTAAATTTCAACCTCACAGCCCCTGGGATCCTTTT
TCTTGTATATGCAGTGAAGGGAAGAAGGAAATTGGGTTTTCATTGTTACTTTTGTGACACGATGTTGAAGCTGGAAAGTGGGAACTCCTCTGATGTTGCTGAATTTGGCT
GTAAATATCAATTGCAGAGCGTTTGGGGCTCATTGTCATATGTAGGATTTGATAGAAAAAGGAAATGGTAA
mRNA sequenceShow/hide mRNA sequence
ATTCTAATTCTAATTCTCCCTAAGCTTTGGCAAACAAACATGACCTCCAGCTCCGGGGACGATTCGGTCCCGGTGCCCTACACTCTCATTCCCTCAAATGCTGCGCAACA
AAACGTCGTCGTTTTATCCCTCTACCGTCCCCCTCCATGCCGACACCGCCGCCTTCTCCGCCTCTGTGCCTTCTACTCTGCCGCCTTCCTCCTCCTCTTCGCCGTTGCTT
TTCTTCTTTTTCCCTCCGATCCCTCCCTCCAACTTGTCCGTTTGAAACTCAATCGCGTCAAAGTCCATTTGGTGCCTGTTGTTTCTCTTGACCTTTCTTTTTCTGTTTCT
CTTAGGGTTCGCAATAAGAACTTCTTCTCTCTTAATTACAATTTCCTTGGCGTTTCCGTTGGCTACCGGGGAAGACGACTTGGATATGTGAGCTCTGAAGGCGGTCGAGT
TTCTGCTCGAGGTTCTTCTTATGTGAATGCCACTCTCGATTTGAATGGGTTGGAGGTTGTTCATGATGTCTTGTACTTGCTTGCGGATCTGGGGAAGGGTATTATTCCCT
TCGATACGGAGACCGATGTGGAAGGATCCATGGGGCTCTTCTTTATCAAAATCCCGATTAAGGCAAGAGTGTCATGTGAGGTACTTGTGAATACAAATAACCAGACAATT
GAACATCAAGATTGCTACCCTGAGGTGAAAATTCATCATGCTGCTCACTTTTGTATTGATATTTCATATTTCTGCCTAAATTTCAACCTCACAGCCCCTGGGATCCTTTT
TCTTGTATATGCAGTGAAGGGAAGAAGGAAATTGGGTTTTCATTGTTACTTTTGTGACACGATGTTGAAGCTGGAAAGTGGGAACTCCTCTGATGTTGCTGAATTTGGCT
GTAAATATCAATTGCAGAGCGTTTGGGGCTCATTGTCATATGTAGGATTTGATAGAAAAAGGAAATGGTAA
Protein sequenceShow/hide protein sequence
ILILILPKLWQTNMTSSSGDDSVPVPYTLIPSNAAQQNVVVLSLYRPPPCRHRRLLRLCAFYSAAFLLLFAVAFLLFPSDPSLQLVRLKLNRVKVHLVPVVSLDLSFSVS
LRVRNKNFFSLNYNFLGVSVGYRGRRLGYVSSEGGRVSARGSSYVNATLDLNGLEVVHDVLYLLADLGKGIIPFDTETDVEGSMGLFFIKIPIKARVSCEVLVNTNNQTI
EHQDCYPEVKIHHAAHFCIDISYFCLNFNLTAPGILFLVYAVKGRRKLGFHCYFCDTMLKLESGNSSDVAEFGCKYQLQSVWGSLSYVGFDRKRKW