CuGenDBv2

Chy2G041830 (gene) of Cucumber (hystrix) v1 genome

Gene ID: Chy2G041830
Organism: Cucumis hystrix (Cucumber (hystrix) v1)
Description: LEA_2 domain-containing protein
Genome location: chrH02:22393143..22395237
RNA-Seq Expression: Chy2G041830
Synteny: Chy2G041830
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR004864 - Late embryogenesis abundant protein, LEA_2 subgroup


Homology
GenBank top hits (e value, % identity, alignment)
KAA0046596.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family isoform 1 [Cucumis melo var. makuwa] (e value 3.45e-116, identity 94.76%)
Query:  MTSSSGDDSVPVPYTLIPPNAAQQNVVVLSLYRPPPCRHRRLLRLCAFYSAAFLPLFAVAFLLFPSDPSLQLVRLKLNRVKVHLVPVVSLDLSFSVSLRV
        MT+SSGDDSVPVPYTL+  NAAQQNVVVLSLYRP PCRHRRLLRL AFYSAAFL LFAVAFLLFPSDPSLQLVRLKLNRVKVHLVP VSLDLSFSVSLRV
Subjt:  MTSSSGDDSVPVPYTLIPPNAAQQNVVVLSLYRPPPCRHRRLLRLCAFYSAAFLPLFAVAFLLFPSDPSLQLVRLKLNRVKVHLVPVVSLDLSFSVSLRV

Query:  RNKNFFSLNYNFLGVSVGYRGRRLGYVSSESGRVSARGSSYVNATLDLNGLEVVHDVLYLLADLGKGIIPFDTETEVEGSMGLFFIKIPIK
        RNKNFFSLNYNFLGVSVGYRGRRLGYVSS  GRVSARGSSYVNATLDLNGLEVVHDVLYLLADLGKGIIPFDTETEVEGSMGLFFIKIPIK
Subjt:  RNKNFFSLNYNFLGVSVGYRGRRLGYVSSESGRVSARGSSYVNATLDLNGLEVVHDVLYLLADLGKGIIPFDTETEVEGSMGLFFIKIPIK

XP_004148717.1 uncharacterized protein LOC101219269 [Cucumis sativus] (e value 1.24e-142, identity 98.14%)
Query:  MTSSSGDDSVPVPYTLIPPNAAQQNVVVLSLYRPPPCRHRRLLRLCAFYSAAFLPLFAVAFLLFPSDPSLQLVRLKLNRVKVHLVPVVSLDLSFSVSLRV
        MTSSSGDDSVPVPYTLIP NAAQQNVVVLSLYRPPPCRHRRLLRLCAFYSAAFL LFAVAFLLFPSDPSLQLVRLKLNRVKVHLVPVVSLDLSFSVSLRV
Subjt:  MTSSSGDDSVPVPYTLIPPNAAQQNVVVLSLYRPPPCRHRRLLRLCAFYSAAFLPLFAVAFLLFPSDPSLQLVRLKLNRVKVHLVPVVSLDLSFSVSLRV

Query:  RNKNFFSLNYNFLGVSVGYRGRRLGYVSSESGRVSARGSSYVNATLDLNGLEVVHDVLYLLADLGKGIIPFDTETEVEGSMGLFFIKIPIKARVSCEVLV
        RNKNFFSLNYNFLGVSVGYRGRRLGYVSSE GRVSARGSSYVNATLDLNGLEVVHDVLYLLADLGKGIIPFDTET+VEGSMGLFFIKIPIKARVSCEVLV
Subjt:  RNKNFFSLNYNFLGVSVGYRGRRLGYVSSESGRVSARGSSYVNATLDLNGLEVVHDVLYLLADLGKGIIPFDTETEVEGSMGLFFIKIPIKARVSCEVLV

Query:  NTNNQTIEHQDCYPE
        NTNNQTIEHQDCYPE
Subjt:  NTNNQTIEHQDCYPE

XP_008463384.1 PREDICTED: uncharacterized protein LOC103501551 [Cucumis melo] (e value 2.17e-136, identity 95.35%)
Query:  MTSSSGDDSVPVPYTLIPPNAAQQNVVVLSLYRPPPCRHRRLLRLCAFYSAAFLPLFAVAFLLFPSDPSLQLVRLKLNRVKVHLVPVVSLDLSFSVSLRV
        MT+SSGDDSVPVPYTL+  NAAQQNVVVLSLYRP PCRHRRLLRL AFYSAAFL LFAVAFLLFPSDPSLQLVRLKLNRVKVHLVP VSLDLSFSVSLRV
Subjt:  MTSSSGDDSVPVPYTLIPPNAAQQNVVVLSLYRPPPCRHRRLLRLCAFYSAAFLPLFAVAFLLFPSDPSLQLVRLKLNRVKVHLVPVVSLDLSFSVSLRV

Query:  RNKNFFSLNYNFLGVSVGYRGRRLGYVSSESGRVSARGSSYVNATLDLNGLEVVHDVLYLLADLGKGIIPFDTETEVEGSMGLFFIKIPIKARVSCEVLV
        RNKNFFSLNYNFLGVSVGYRGRRLGYVSS  GRVSARGSSYVNATLDLNGLEVVHDVLYLLADLGKGIIPFDTETEVEGSMGLFFIKIPIKARVSCEVLV
Subjt:  RNKNFFSLNYNFLGVSVGYRGRRLGYVSSESGRVSARGSSYVNATLDLNGLEVVHDVLYLLADLGKGIIPFDTETEVEGSMGLFFIKIPIKARVSCEVLV

Query:  NTNNQTIEHQDCYPE
        NTNNQTIEHQDCYPE
Subjt:  NTNNQTIEHQDCYPE

XP_022144909.1 uncharacterized protein LOC111014473 [Momordica charantia] (e value 9.91e-119, identity 82.79%)
Query:  MTSSSGDDSVPVPYTLIPPNAAQQNVVVLSLYRPPPCRHRRLLRLCAFYSAAFLPLFAVAFLLFPSDPSLQLVRLKLNRVKVHLVPVVSLDLSFSVSLRV
        MTSSS DDSVPVPY+L+PPNAA QNVVVLSLYRPP  R RRLLRLCAFYSAAFL L AVAFLLFP+DPSLQLVRLKLNR+KV L+PV+ LDLSFS S+RV
Subjt:  MTSSSGDDSVPVPYTLIPPNAAQQNVVVLSLYRPPPCRHRRLLRLCAFYSAAFLPLFAVAFLLFPSDPSLQLVRLKLNRVKVHLVPVVSLDLSFSVSLRV

Query:  RNKNFFSLNYNFLGVSVGYRGRRLGYVSSESGRVSARGSSYVNATLDLNGLEVVHDVLYLLADLGKGIIPFDTETEVEGSMGLFFIKIPIKARVSCEVLV
        RN NFFSL+YN+LGVSVGYRGRRLG+VSSE GRVSARG SYVNATLDLNG EV+HD +YL+ DL  GI+PFDTETEVEG MGLFFIK PIKARVSCEV V
Subjt:  RNKNFFSLNYNFLGVSVGYRGRRLGYVSSESGRVSARGSSYVNATLDLNGLEVVHDVLYLLADLGKGIIPFDTETEVEGSMGLFFIKIPIKARVSCEVLV

Query:  NTNNQTIEHQDCYPE
        NTN++TIEHQDCYPE
Subjt:  NTNNQTIEHQDCYPE

XP_038878687.1 uncharacterized protein LOC120070868 [Benincasa hispida] (e value 8.86e-124, identity 88.02%)
Query:  MTSSS-GDDSVPVPYTLIPPNAAQQNVVVLSLYRPPPCRHRRLLRLCAFYSAAFLPLFAVAFLLFPSDPSLQLVRLKLNRVKVHLVPV-VSLDLSFSVSL
        MTSSS  DDSVPVPYTL+P NAAQQNVVVLSLYR PPC+H RLLRLCA YSAAFL LFAVAFLLFP+DPS QLVRLKLN VKVHLVP  VSLDLSF  SL
Subjt:  MTSSS-GDDSVPVPYTLIPPNAAQQNVVVLSLYRPPPCRHRRLLRLCAFYSAAFLPLFAVAFLLFPSDPSLQLVRLKLNRVKVHLVPV-VSLDLSFSVSL

Query:  RVRNKNFFSLNYNFLGVSVGYRGRRLGYVSSESGRVSARGSSYVNATLDLNGLEVVHDVLYLLADLGKGIIPFDTETEVEGSMGLFFIKIPIKARVSCEV
        RVRNKNFFSL+Y+++GVSVGYRG+RLG+VSSE GRVSARGSSYVNATLDLNGLEVVHDVLYLLADLGKGIIPFDTETEVEGSMGLFFIK PIKA+VSCEV
Subjt:  RVRNKNFFSLNYNFLGVSVGYRGRRLGYVSSESGRVSARGSSYVNATLDLNGLEVVHDVLYLLADLGKGIIPFDTETEVEGSMGLFFIKIPIKARVSCEV

Query:  LVNTNNQTIEHQDCYPE
        LVN NNQTIEHQDCYPE
Subjt:  LVNTNNQTIEHQDCYPE

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LTV4 LEA_2 domain-containing protein (e value 1.8e-111, identity 98.14%)
Query:  MTSSSGDDSVPVPYTLIPPNAAQQNVVVLSLYRPPPCRHRRLLRLCAFYSAAFLPLFAVAFLLFPSDPSLQLVRLKLNRVKVHLVPVVSLDLSFSVSLRV
        MTSSSGDDSVPVPYTLIP NAAQQNVVVLSLYRPPPCRHRRLLRLCAFYSAAFL LFAVAFLLFPSDPSLQLVRLKLNRVKVHLVPVVSLDLSFSVSLRV
Subjt:  MTSSSGDDSVPVPYTLIPPNAAQQNVVVLSLYRPPPCRHRRLLRLCAFYSAAFLPLFAVAFLLFPSDPSLQLVRLKLNRVKVHLVPVVSLDLSFSVSLRV

Query:  RNKNFFSLNYNFLGVSVGYRGRRLGYVSSESGRVSARGSSYVNATLDLNGLEVVHDVLYLLADLGKGIIPFDTETEVEGSMGLFFIKIPIKARVSCEVLV
        RNKNFFSLNYNFLGVSVGYRGRRLGYVSSE GRVSARGSSYVNATLDLNGLEVVHDVLYLLADLGKGIIPFDTET+VEGSMGLFFIKIPIKARVSCEVLV
Subjt:  RNKNFFSLNYNFLGVSVGYRGRRLGYVSSESGRVSARGSSYVNATLDLNGLEVVHDVLYLLADLGKGIIPFDTETEVEGSMGLFFIKIPIKARVSCEVLV

Query:  NTNNQTIEHQDCYPE
        NTNNQTIEHQDCYPE
Subjt:  NTNNQTIEHQDCYPE

A0A1S3CJK6 uncharacterized protein LOC103501551 (e value 1.0e-106, identity 95.35%)
Query:  MTSSSGDDSVPVPYTLIPPNAAQQNVVVLSLYRPPPCRHRRLLRLCAFYSAAFLPLFAVAFLLFPSDPSLQLVRLKLNRVKVHLVPVVSLDLSFSVSLRV
        MT+SSGDDSVPVPYTL+  NAAQQNVVVLSLYRP PCRHRRLLRL AFYSAAFL LFAVAFLLFPSDPSLQLVRLKLNRVKVHLVP VSLDLSFSVSLRV
Subjt:  MTSSSGDDSVPVPYTLIPPNAAQQNVVVLSLYRPPPCRHRRLLRLCAFYSAAFLPLFAVAFLLFPSDPSLQLVRLKLNRVKVHLVPVVSLDLSFSVSLRV

Query:  RNKNFFSLNYNFLGVSVGYRGRRLGYVSSESGRVSARGSSYVNATLDLNGLEVVHDVLYLLADLGKGIIPFDTETEVEGSMGLFFIKIPIKARVSCEVLV
        RNKNFFSLNYNFLGVSVGYRGRRLGYVSS  GRVSARGSSYVNATLDLNGLEVVHDVLYLLADLGKGIIPFDTETEVEGSMGLFFIKIPIKARVSCEVLV
Subjt:  RNKNFFSLNYNFLGVSVGYRGRRLGYVSSESGRVSARGSSYVNATLDLNGLEVVHDVLYLLADLGKGIIPFDTETEVEGSMGLFFIKIPIKARVSCEVLV

Query:  NTNNQTIEHQDCYPE
        NTNNQTIEHQDCYPE
Subjt:  NTNNQTIEHQDCYPE

A0A5A7TX90 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family isoform 1 (e value 2.7e-91, identity 94.76%)
Query:  MTSSSGDDSVPVPYTLIPPNAAQQNVVVLSLYRPPPCRHRRLLRLCAFYSAAFLPLFAVAFLLFPSDPSLQLVRLKLNRVKVHLVPVVSLDLSFSVSLRV
        MT+SSGDDSVPVPYTL+  NAAQQNVVVLSLYRP PCRHRRLLRL AFYSAAFL LFAVAFLLFPSDPSLQLVRLKLNRVKVHLVP VSLDLSFSVSLRV
Subjt:  MTSSSGDDSVPVPYTLIPPNAAQQNVVVLSLYRPPPCRHRRLLRLCAFYSAAFLPLFAVAFLLFPSDPSLQLVRLKLNRVKVHLVPVVSLDLSFSVSLRV

Query:  RNKNFFSLNYNFLGVSVGYRGRRLGYVSSESGRVSARGSSYVNATLDLNGLEVVHDVLYLLADLGKGIIPFDTETEVEGSMGLFFIKIPIK
        RNKNFFSLNYNFLGVSVGYRGRRLGYVSS  GRVSARGSSYVNATLDLNGLEVVHDVLYLLADLGKGIIPFDTETEVEGSMGLFFIKIPIK
Subjt:  RNKNFFSLNYNFLGVSVGYRGRRLGYVSSESGRVSARGSSYVNATLDLNGLEVVHDVLYLLADLGKGIIPFDTETEVEGSMGLFFIKIPIK

A0A6J1CTN0 uncharacterized protein LOC111014473 (e value 2.9e-93, identity 82.79%)
Query:  MTSSSGDDSVPVPYTLIPPNAAQQNVVVLSLYRPPPCRHRRLLRLCAFYSAAFLPLFAVAFLLFPSDPSLQLVRLKLNRVKVHLVPVVSLDLSFSVSLRV
        MTSSS DDSVPVPY+L+PPNAA QNVVVLSLYRPP  R RRLLRLCAFYSAAFL L AVAFLLFP+DPSLQLVRLKLNR+KV L+PV+ LDLSFS S+RV
Subjt:  MTSSSGDDSVPVPYTLIPPNAAQQNVVVLSLYRPPPCRHRRLLRLCAFYSAAFLPLFAVAFLLFPSDPSLQLVRLKLNRVKVHLVPVVSLDLSFSVSLRV

Query:  RNKNFFSLNYNFLGVSVGYRGRRLGYVSSESGRVSARGSSYVNATLDLNGLEVVHDVLYLLADLGKGIIPFDTETEVEGSMGLFFIKIPIKARVSCEVLV
        RN NFFSL+YN+LGVSVGYRGRRLG+VSSE GRVSARG SYVNATLDLNG EV+HD +YL+ DL  GI+PFDTETEVEG MGLFFIK PIKARVSCEV V
Subjt:  RNKNFFSLNYNFLGVSVGYRGRRLGYVSSESGRVSARGSSYVNATLDLNGLEVVHDVLYLLADLGKGIIPFDTETEVEGSMGLFFIKIPIKARVSCEVLV

Query:  NTNNQTIEHQDCYPE
        NTN++TIEHQDCYPE
Subjt:  NTNNQTIEHQDCYPE

A0A6J1JI07 uncharacterized protein LOC111485280 (e value 2.8e-88, identity 81.31%)
Query:  SSSGDDSVPVPYTLIPPN-AAQQNVVVLSLYRPPPCRHRRLLRLCAFYSAAFLPLFAVAFLLFPSDPSLQLVRLKLNRVKVHLVPVVSLDLSFSVSLRVR
        S S D S+PVPY+ IPPN AA QNVVVLSLYRPP  R RRLLRLCA YSAAFL L AV FLLFPSDPSLQLVRLKLN VKV L+P V LDLSFS S+RVR
Subjt:  SSSGDDSVPVPYTLIPPN-AAQQNVVVLSLYRPPPCRHRRLLRLCAFYSAAFLPLFAVAFLLFPSDPSLQLVRLKLNRVKVHLVPVVSLDLSFSVSLRVR

Query:  NKNFFSLNYNFLGVSVGYRGRRLGYVSSESGRVSARGSSYVNATLDLNGLEVVHDVLYLLADLGKGIIPFDTETEVEGSMGLFFIKIPIKARVSCEVLVN
        NKNFFSL+YN+LGVSVG+RGRRLG+VSS+ GRVSARGSSYVNATLDLNGL+++HDV +LL DL KGIIPFDTETEVEGSMGLFFIK PIKA VSCEV V+
Subjt:  NKNFFSLNYNFLGVSVGYRGRRLGYVSSESGRVSARGSSYVNATLDLNGLEVVHDVLYLLADLGKGIIPFDTETEVEGSMGLFFIKIPIKARVSCEVLVN

Query:  TNNQTIEHQDCYPE
        TN+QTIEHQDCYPE
Subjt:  TNNQTIEHQDCYPE

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT1G52330.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family (e value 9.0e-39, identity 40%)
Query:  YTLIPPNAAQQ--NVVVLSLYRPPPCRHRRLLRLCAFYSAAFLPLFA--VAFLLFPSDPSLQLVRLKLNRVKVHLVPVVSLDLSFSVSLRVRNKNFFSLN
        Y  +P +++ +  + V++S +  PP R R       F  + FL  FA  + ++ +PSDP ++++R+K++ V VH  PV S+D++  V+L+V N + +S +
Subjt:  YTLIPPNAAQQ--NVVVLSLYRPPPCRHRRLLRLCAFYSAAFLPLFA--VAFLLFPSDPSLQLVRLKLNRVKVHLVPVVSLDLSFSVSLRVRNKNFFSLN

Query:  YNFLGVSVGYRGRRLGYVSSESGRVSARGSSYVNATLDLNGLEVVHDVLYLLADLGKGIIPFDTETEVEGSMGLFFIKIPIKARVSCEVLVNTNNQTIEH
        +  L V++ YRG+ LG+VSS+ G V+A GSSY++A  +L+G+ V  DV++L+ DL KG + FDT TE  G +G+ F + P+KA+V+C +LV+T NQTI  
Subjt:  YNFLGVSVGYRGRRLGYVSSESGRVSARGSSYVNATLDLNGLEVVHDVLYLLADLGKGIIPFDTETEVEGSMGLFFIKIPIKARVSCEVLVNTNNQTIEH

Query:  QDCYP
        Q C P
Subjt:  QDCYP

AT1G52330.2 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family (e value 4.1e-31, identity 38.04%)
Query:  YTLIPPNAAQQ--NVVVLSLYRPPPCRHRRLLRLCAFYSAAFLPLFA--VAFLLFPSDPSLQLVRLKLNRVKVHLVPVVSLDLSFSVSLRVRNKNFFSLN
        Y  +P +++ +  + V++S +  PP R R       F  + FL  FA  + ++ +PSDP ++++R+K++ V VH  PV S+D++  V+L+V N + +S +
Subjt:  YTLIPPNAAQQ--NVVVLSLYRPPPCRHRRLLRLCAFYSAAFLPLFA--VAFLLFPSDPSLQLVRLKLNRVKVHLVPVVSLDLSFSVSLRVRNKNFFSLN

Query:  YNFLGVSVGYRGRRLGYVSSESGRVSARGSSYVNATLDLNGLEVVHDVLYLLADLGKGIIPFDTETEVEGSMGLFFIKIPIKAR
        +  L V++ YRG+ LG+VSS+ G V+A GSSY++A  +L+G+ V  DV++L+ DL KG + FDT TE  G +G+ F + P+K R
Subjt:  YNFLGVSVGYRGRRLGYVSSESGRVSARGSSYVNATLDLNGLEVVHDVLYLLADLGKGIIPFDTETEVEGSMGLFFIKIPIKAR

AT4G13270.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family (e value 5.8e-54, identity 49.54%)
Query:  SSSGDDSVPVPYTLIPPNAAQQNVVVLSLYRPPPCRHR-----RLLRLCAFYSAAFLPLFAVAFLLFPSDPSLQLVRLKLNRVKVHLVPVVSLDLSFSVS
        +SS  +   +PYT +P +   Q+V++L+ YR    RHR     R LR    ++A  L L A  +LL+PSDP + + R+ LN + V     ++LDLSFS++
Subjt:  SSSGDDSVPVPYTLIPPNAAQQNVVVLSLYRPPPCRHR-----RLLRLCAFYSAAFLPLFAVAFLLFPSDPSLQLVRLKLNRVKVHLVPVVSLDLSFSVS

Query:  LRVRNKNFFSLNYNFLGVSVGYRGRRLGYVSSESGRVSARGSSYVNATLDLNGLEVVHDVLYLLADLGKGIIPFDTETEVEGSMGLFFIKIPIKARVSCE
        ++VRN++FFSL+Y+ L VS+GYRGR LG V S+ G + AR SSY++ATL+L+GLEVVHDV+YL+ DL KG+IPFDT  +V+G +G+    IPI+ +VSCE
Subjt:  LRVRNKNFFSLNYNFLGVSVGYRGRRLGYVSSESGRVSARGSSYVNATLDLNGLEVVHDVLYLLADLGKGIIPFDTETEVEGSMGLFFIKIPIKARVSCE

Query:  VLVNTNNQTIEHQDCY
        V VN NNQ I HQDC+
Subjt:  VLVNTNNQTIEHQDCY


Sequences
CDS sequence
ATGACCTCCAGCTCCGGGGACGATTCGGTCCCGGTGCCCTACACTCTCATTCCCCCAAATGCTGCGCAACAAAACGTCGTCGTTTTATCCCTCTACCGTCCCCCTCCATG
CCGACACCGCCGCCTTCTCCGCCTCTGTGCCTTCTACTCTGCCGCCTTCCTCCCCCTCTTCGCCGTTGCTTTTCTTCTTTTTCCCTCCGATCCCTCCCTCCAACTCGTCC
GTTTGAAACTCAATCGCGTCAAAGTCCATTTGGTGCCTGTTGTTTCCCTTGACCTTTCTTTTTCTGTTTCTCTTAGGGTTCGCAATAAGAACTTCTTCTCTCTTAATTAC
AATTTCCTTGGCGTTTCCGTTGGCTACCGGGGAAGACGACTTGGATATGTGAGCTCTGAAAGCGGTCGAGTTTCTGCTCGAGGTTCTTCTTATGTGAATGCCACTCTCGA
TTTGAATGGGTTGGAGGTTGTTCATGATGTCTTGTACTTGCTTGCGGATCTGGGGAAGGGTATTATTCCCTTCGATACGGAGACCGAAGTGGAAGGATCCATGGGGCTCT
TCTTTATCAAAATCCCGATTAAGGCAAGAGTGTCATGTGAGGTACTTGTGAATACAAATAACCAAACAATTGAACATCAAGATTGCTACCCTGAGGTGAAAATTCATCAT
GCTGCTCACTTTTGTATTGATATTTCATATTTCTGCCTAAATCTCAACCTCACAGCCCCTTGGATCCTTTTTCTCGTATTTGCAGTGAAGGGAAGAAGGAAATTGGGTTT
TCATTGTTACTTTTGTGACACGATGTTGAAGCTGGAAAGTGGGAACTCCTCTGATGTTGCTGAATTTGGCT
mRNA sequence
ATGACCTCCAGCTCCGGGGACGATTCGGTCCCGGTGCCCTACACTCTCATTCCCCCAAATGCTGCGCAACAAAACGTCGTCGTTTTATCCCTCTACCGTCCCCCTCCATG
CCGACACCGCCGCCTTCTCCGCCTCTGTGCCTTCTACTCTGCCGCCTTCCTCCCCCTCTTCGCCGTTGCTTTTCTTCTTTTTCCCTCCGATCCCTCCCTCCAACTCGTCC
GTTTGAAACTCAATCGCGTCAAAGTCCATTTGGTGCCTGTTGTTTCCCTTGACCTTTCTTTTTCTGTTTCTCTTAGGGTTCGCAATAAGAACTTCTTCTCTCTTAATTAC
AATTTCCTTGGCGTTTCCGTTGGCTACCGGGGAAGACGACTTGGATATGTGAGCTCTGAAAGCGGTCGAGTTTCTGCTCGAGGTTCTTCTTATGTGAATGCCACTCTCGA
TTTGAATGGGTTGGAGGTTGTTCATGATGTCTTGTACTTGCTTGCGGATCTGGGGAAGGGTATTATTCCCTTCGATACGGAGACCGAAGTGGAAGGATCCATGGGGCTCT
TCTTTATCAAAATCCCGATTAAGGCAAGAGTGTCATGTGAGGTACTTGTGAATACAAATAACCAAACAATTGAACATCAAGATTGCTACCCTGAGGTGAAAATTCATCAT
GCTGCTCACTTTTGTATTGATATTTCATATTTCTGCCTAAATCTCAACCTCACAGCCCCTTGGATCCTTTTTCTCGTATTTGCAGTGAAGGGAAGAAGGAAATTGGGTTT
TCATTGTTACTTTTGTGACACGATGTTGAAGCTGGAAAGTGGGAACTCCTCTGATGTTGCTGAATTTGGCT
Protein sequence
MTSSSGDDSVPVPYTLIPPNAAQQNVVVLSLYRPPPCRHRRLLRLCAFYSAAFLPLFAVAFLLFPSDPSLQLVRLKLNRVKVHLVPVVSLDLSFSVSLRVRNKNFFSLNY
NFLGVSVGYRGRRLGYVSSESGRVSARGSSYVNATLDLNGLEVVHDVLYLLADLGKGIIPFDTETEVEGSMGLFFIKIPIKARVSCEVLVNTNNQTIEHQDCYPEVKIHH
AAHFCIDISYFCLNLNLTAPWILFLVFAVKGRRKLGFHCYFCDTMLKLESGNSSDVAEFGX