CuGenDBv2

Cp4.1LG18g04410 (gene) of Cucurbita pepo (MU-CU-16) v4.1 genome

Gene ID: Cp4.1LG18g04410
Organism: Cucurbita pepo var. pepo MU-CU-16 (Cucurbita pepo (MU-CU-16) v4.1)
Description: LEA_2 domain-containing protein
Genome location: Cp4.1LG18:5453580..5455536
RNA-Seq Expression: Cp4.1LG18g04410
Synteny: Cp4.1LG18g04410
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR004864 - Late embryogenesis abundant protein, LEA_2 subgroup
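
The genome location field can be split into a sequence name and a coordinate pair. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record itself does not state the convention); `parse_location` is a hypothetical helper name, not part of any CuGenDB tooling.

```python
import re

def parse_location(location):
    """Split a 'seqid:start..end' string into its parts.

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    seqid = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return seqid, start, end, end - start + 1

seqid, start, end, span = parse_location("Cp4.1LG18:5453580..5455536")
print(seqid, start, end, span)
```

Under that assumption the gene spans 1957 bp on Cp4.1LG18.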


Homology
GenBank top hits (e value, % identity, alignment)
KAG6589903.1 hypothetical protein SDJN03_15326, partial [Cucurbita argyrosperma subsp. sororia] (e value: 3.44e-143; % identity: 97.21)
Query:  MSCSKDGSIPVPYSPIPANATAPQNVVVLSLYRPPLYRHRRLLRLCALYSVAFLLLSAVVFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSASVRV
        MSCSKDGSIPVPYSPIP NA APQNVVVLSLYRPPLYR RRLLRLCALYS AFLLLSA VFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSASVRV
Subjt:  MSCSKDGSIPVPYSPIPANATAPQNVVVLSLYRPPLYRHRRLLRLCALYSVAFLLLSAVVFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSASVRV

Query:  RNKNFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIKATVSCEVLV
        RNKNFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIKATVSCEV V
Subjt:  RNKNFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIKATVSCEVLV

Query:  DTNSQTIEHQDCYPE
        DTNSQTIEHQDCYPE
Subjt:  DTNSQTIEHQDCYPE

KAG7023573.1 hypothetical protein SDJN02_14599, partial [Cucurbita argyrosperma subsp. argyrosperma] (e value: 4.96e-127; % identity: 76.23)
Query:  MSCSKDGSIPVPYSPIPANATAPQNVVVLSLYRPPLYRHRRLLRLCALYSVAFLLLSAVVFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSASVRV
        MSCSKDGSIPVPYSPIP NA APQNVVVLSLYRPPLYR RRLLRLCALYS AFLLLSA VFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSASVRV
Subjt:  MSCSKDGSIPVPYSPIPANATAPQNVVVLSLYRPPLYRHRRLLRLCALYSVAFLLLSAVVFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSASVRV

Query:  RNKNFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIKAT-------
        RNKNFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIK +       
Subjt:  RNKNFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIKAT-------

Query:  ----------------------VSCEVLVDTNSQT-----IEHQDCYPELKLGSGNARDIVEFEC
                                  + + T S+       ++  C  +LKLGSGN+ DIVE EC
Subjt:  ----------------------VSCEVLVDTNSQT-----IEHQDCYPELKLGSGNARDIVEFEC

XP_022960913.1 uncharacterized protein LOC111461574 [Cucurbita moschata] (e value: 4.01e-142; % identity: 96.28)
Query:  MSCSKDGSIPVPYSPIPANATAPQNVVVLSLYRPPLYRHRRLLRLCALYSVAFLLLSAVVFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSASVRV
        MSCSKDGSIPVPYSPIP NA APQN+VVLSLYRPPLYR RRLLRLC LYS AFLLLSAVVFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSASVRV
Subjt:  MSCSKDGSIPVPYSPIPANATAPQNVVVLSLYRPPLYRHRRLLRLCALYSVAFLLLSAVVFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSASVRV

Query:  RNKNFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIKATVSCEVLV
        RN NFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIKATVSCEV V
Subjt:  RNKNFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIKATVSCEVLV

Query:  DTNSQTIEHQDCYPE
        DTNSQTIEHQDCYPE
Subjt:  DTNSQTIEHQDCYPE

XP_022987870.1 uncharacterized protein LOC111485280 [Cucurbita maxima] (e value: 1.99e-142; % identity: 96.74)
Query:  MSCSKDGSIPVPYSPIPANATAPQNVVVLSLYRPPLYRHRRLLRLCALYSVAFLLLSAVVFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSASVRV
        MSCSKDGSIPVPYSPIP NA APQNVVVLSLYRPPLYR RRLLRLCALYS AFLLLSAVVFLLFPSDPSLQLVRLKLNGV VRLLPAVVLDLSFSASVRV
Subjt:  MSCSKDGSIPVPYSPIPANATAPQNVVVLSLYRPPLYRHRRLLRLCALYSVAFLLLSAVVFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSASVRV

Query:  RNKNFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIKATVSCEVLV
        RNKNFFSLDYNYLGVSVG+RGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIKATVSCEV V
Subjt:  RNKNFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIKATVSCEVLV

Query:  DTNSQTIEHQDCYPE
        DTNSQTIEHQDCYPE
Subjt:  DTNSQTIEHQDCYPE

XP_023515526.1 uncharacterized protein LOC111779657 [Cucurbita pepo subsp. pepo] (e value: 6.47e-148; % identity: 100)
Query:  MSCSKDGSIPVPYSPIPANATAPQNVVVLSLYRPPLYRHRRLLRLCALYSVAFLLLSAVVFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSASVRV
        MSCSKDGSIPVPYSPIPANATAPQNVVVLSLYRPPLYRHRRLLRLCALYSVAFLLLSAVVFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSASVRV
Subjt:  MSCSKDGSIPVPYSPIPANATAPQNVVVLSLYRPPLYRHRRLLRLCALYSVAFLLLSAVVFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSASVRV

Query:  RNKNFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIKATVSCEVLV
        RNKNFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIKATVSCEVLV
Subjt:  RNKNFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIKATVSCEVLV

Query:  DTNSQTIEHQDCYPE
        DTNSQTIEHQDCYPE
Subjt:  DTNSQTIEHQDCYPE

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LTV4 LEA_2 domain-containing protein (e value: 3.12e-116; % identity: 81.78)
Query:  SCSKDGSIPVPYSPIPANATAPQNVVVLSLYRPPLYRHRRLLRLCALYSVAFLLLSAVVFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSASVRVR
        S S D S+PVPY+ IP+NA A QNVVVLSLYRPP  RHRRLLRLCA YS AFLLL AV FLLFPSDPSLQLVRLKLN V V L+P V LDLSFS S+RVR
Subjt:  SCSKDGSIPVPYSPIPANATAPQNVVVLSLYRPPLYRHRRLLRLCALYSVAFLLLSAVVFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSASVRVR

Query:  NKNFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIKATVSCEVLVD
        NKNFFSL+YN+LGVSVGYRGRRLG+VSS+GGRVSARGSSYVNATLDLNGL+++HDV +LL DL KGIIPFDTET+VEGSMGLFFIK PIKA VSCEVLV+
Subjt:  NKNFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIKATVSCEVLVD

Query:  TNSQTIEHQDCYPE
        TN+QTIEHQDCYPE
Subjt:  TNSQTIEHQDCYPE

A0A1S3CJK6 uncharacterized protein LOC103501551 (e value: 4.69e-111; % identity: 80.66)
Query:  SKDGSIPVPYSPIPANATAPQNVVVLSLYRPPLYRHRRLLRLCALYSVAFLLLSAVVFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSASVRVRNK
        S D S+PVPY+ + +NA A QNVVVLSLYRP   RHRRLLRL A YS AFLLL AV FLLFPSDPSLQLVRLKLN V V L+P V LDLSFS S+RVRNK
Subjt:  SKDGSIPVPYSPIPANATAPQNVVVLSLYRPPLYRHRRLLRLCALYSVAFLLLSAVVFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSASVRVRNK

Query:  NFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIKATVSCEVLVDTN
        NFFSL+YN+LGVSVGYRGRRLG+VSS GGRVSARGSSYVNATLDLNGL+++HDV +LL DL KGIIPFDTETEVEGSMGLFFIK PIKA VSCEVLV+TN
Subjt:  NFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIKATVSCEVLVDTN

Query:  SQTIEHQDCYPE
        +QTIEHQDCYPE
Subjt:  SQTIEHQDCYPE

A0A6J1CTN0 uncharacterized protein LOC111014473 (e value: 6.29e-116; % identity: 80.84)
Query:  SCSKDGSIPVPYSPIPANATAPQNVVVLSLYRPPLYRHRRLLRLCALYSVAFLLLSAVVFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSASVRVR
        S S+D S+PVPYS +P NA A QNVVVLSLYRPP +R RRLLRLCA YS AFLLLSAV FLLFP+DPSLQLVRLKLN + VRLLP ++LDLSFSASVRVR
Subjt:  SCSKDGSIPVPYSPIPANATAPQNVVVLSLYRPPLYRHRRLLRLCALYSVAFLLLSAVVFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSASVRVR

Query:  NKNFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIKATVSCEVLVD
        N NFFSLDYNYLGVSVGYRGRRLGFVSS+GGRVSARG SYVNATLDLNG ++IHD  +L+EDL  GI+PFDTETEVEG MGLFFIKFPIKA VSCEV V+
Subjt:  NKNFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIKATVSCEVLVD

Query:  TNSQTIEHQDCYPE
        TN +TIEHQDCYPE
Subjt:  TNSQTIEHQDCYPE

A0A6J1HAC8 uncharacterized protein LOC111461574 (e value: 1.94e-142; % identity: 96.28)
Query:  MSCSKDGSIPVPYSPIPANATAPQNVVVLSLYRPPLYRHRRLLRLCALYSVAFLLLSAVVFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSASVRV
        MSCSKDGSIPVPYSPIP NA APQN+VVLSLYRPPLYR RRLLRLC LYS AFLLLSAVVFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSASVRV
Subjt:  MSCSKDGSIPVPYSPIPANATAPQNVVVLSLYRPPLYRHRRLLRLCALYSVAFLLLSAVVFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSASVRV

Query:  RNKNFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIKATVSCEVLV
        RN NFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIKATVSCEV V
Subjt:  RNKNFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIKATVSCEVLV

Query:  DTNSQTIEHQDCYPE
        DTNSQTIEHQDCYPE
Subjt:  DTNSQTIEHQDCYPE

A0A6J1JI07 uncharacterized protein LOC111485280 (e value: 9.63e-143; % identity: 96.74)
Query:  MSCSKDGSIPVPYSPIPANATAPQNVVVLSLYRPPLYRHRRLLRLCALYSVAFLLLSAVVFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSASVRV
        MSCSKDGSIPVPYSPIP NA APQNVVVLSLYRPPLYR RRLLRLCALYS AFLLLSAVVFLLFPSDPSLQLVRLKLNGV VRLLPAVVLDLSFSASVRV
Subjt:  MSCSKDGSIPVPYSPIPANATAPQNVVVLSLYRPPLYRHRRLLRLCALYSVAFLLLSAVVFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSASVRV

Query:  RNKNFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIKATVSCEVLV
        RNKNFFSLDYNYLGVSVG+RGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIKATVSCEV V
Subjt:  RNKNFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIKATVSCEVLV

Query:  DTNSQTIEHQDCYPE
        DTNSQTIEHQDCYPE
Subjt:  DTNSQTIEHQDCYPE

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G52330.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family (e value: 9.7e-39; % identity: 39.6)
Query:  YSPIPANATAPQNVVVLSLYRPPLYRHRRLLRLCALYSVAFLLLSAVVFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSASVRVRNKNFFSLDYNY
        Y P+P++++   N  VL    P     RR +    L S A    S ++++ +PSDP ++++R+K++ V+V   P   +D++   +++V N + +S D+  
Subjt:  YSPIPANATAPQNVVVLSLYRPPLYRHRRLLRLCALYSVAFLLLSAVVFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSASVRVRNKNFFSLDYNY

Query:  LGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIKATVSCEVLVDTNSQTIEHQDC
        L V++ YRG+ LG VSSDGG V+A GSSY++A  +L+G+ +  DV  L+ DL KG + FDT TE  G +G+ F +FP+KA V+C +LVDT +QTI  Q C
Subjt:  LGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIKATVSCEVLVDTNSQTIEHQDC

Query:  YP
         P
Subjt:  YP

AT1G52330.2 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family (e value: 9.7e-31; % identity: 37.43)
Query:  YSPIPANATAPQNVVVLSLYRPPLYRHRRLLRLCALYSVAFLLLSAVVFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSASVRVRNKNFFSLDYNY
        Y P+P++++   N  VL    P     RR +    L S A    S ++++ +PSDP ++++R+K++ V+V   P   +D++   +++V N + +S D+  
Subjt:  YSPIPANATAPQNVVVLSLYRPPLYRHRRLLRLCALYSVAFLLLSAVVFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSASVRVRNKNFFSLDYNY

Query:  LGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIK
        L V++ YRG+ LG VSSDGG V+A GSSY++A  +L+G+ +  DV  L+ DL KG + FDT TE  G +G+ F +FP+K
Subjt:  LGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIK

AT4G13270.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family (e value: 1.8e-53; % identity: 48.62)
Query:  MSCSKDGSIPVPYSPIPANATAPQNVVVLSLYRPPLYRHR-----RLLRLCALYSVAFLLLSAVVFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFS
        M+ SK     +PY+P+P++  + Q+V++L+ YR    RHR     R LR   L++   LLLSA V+LL+PSDP + + R+ LN ++V     + LDLSFS
Subjt:  MSCSKDGSIPVPYSPIPANATAPQNVVVLSLYRPPLYRHR-----RLLRLCALYSVAFLLLSAVVFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFS

Query:  ASVRVRNKNFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIKATVS
         +++VRN++FFSLDY+ L VS+GYRGR LG V S GG + AR SSY++ATL+L+GL+++HDV +L+ DL KG+IPFDT  +V+G +G+     PI+  VS
Subjt:  ASVRVRNKNFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIKATVS

Query:  CEVLVDTNSQTIEHQDCY
        CEV V+ N+Q I HQDC+
Subjt:  CEVLVDTNSQTIEHQDCY


Sequences
CDS sequence
ATGAGCTGCTCTAAGGACGGTTCGATCCCTGTTCCTTACTCTCCTATTCCCGCAAATGCTACTGCACCGCAAAACGTTGTCGTTTTATCTCTCTATCGTCCCCCT
CTCTACCGGCACCGGCGGCTTCTTCGCCTCTGTGCCCTCTACTCCGTCGCTTTCCTCCTCCTCTCCGCCGTTGTTTTTCTACTTTTCCCGTCCGATCCCTCGCTC
CAACTCGTTCGATTGAAACTCAATGGGGTGAATGTCCGTTTGTTGCCTGCTGTTGTCCTTGACCTTTCTTTCTCTGCTTCTGTTAGGGTTCGGAATAAGAACTTT
TTTTCTCTCGATTACAATTACCTTGGCGTTTCGGTCGGCTACCGGGGAAGACGACTTGGATTTGTGAGCTCTGATGGCGGTCGTGTCTCTGCTCGAGGCTCCTCT
TACGTGAACGCCACTCTCGATTTGAATGGGTTACAGATCATTCACGATGTCTTTTTCTTGCTTGAGGATCTGAGGAAGGGTATAATTCCTTTCGATACGGAGACA
GAAGTGGAAGGATCCATGGGGCTTTTCTTTATCAAATTCCCAATTAAGGCTACAGTATCATGTGAGGTACTTGTGGATACAAATAGCCAAACAATTGAGCATCAA
GATTGCTACCCTGAGCTGAAACTGGGAAGCGGGAACGCCCGTGATATTGTTGAATTCGAGTGTTAA
mRNA sequence
TGGCATAGGCTTTATTGGGACGTGTAAATTCTTGATAGGCTGAGGCGGGTAGCTTAGGGAGGGGCGAGTGTTTTCGCCACGTTACTCATAAAAATCAGTTCCCAT
TTCAATTCTCTCTAAAACAAAGCATGAGCTGCTCTAAGGACGGTTCGATCCCTGTTCCTTACTCTCCTATTCCCGCAAATGCTACTGCACCGCAAAACGTTGTCG
TTTTATCTCTCTATCGTCCCCCTCTCTACCGGCACCGGCGGCTTCTTCGCCTCTGTGCCCTCTACTCCGTCGCTTTCCTCCTCCTCTCCGCCGTTGTTTTTCTAC
TTTTCCCGTCCGATCCCTCGCTCCAACTCGTTCGATTGAAACTCAATGGGGTGAATGTCCGTTTGTTGCCTGCTGTTGTCCTTGACCTTTCTTTCTCTGCTTCTG
TTAGGGTTCGGAATAAGAACTTTTTTTCTCTCGATTACAATTACCTTGGCGTTTCGGTCGGCTACCGGGGAAGACGACTTGGATTTGTGAGCTCTGATGGCGGTC
GTGTCTCTGCTCGAGGCTCCTCTTACGTGAACGCCACTCTCGATTTGAATGGGTTACAGATCATTCACGATGTCTTTTTCTTGCTTGAGGATCTGAGGAAGGGTA
TAATTCCTTTCGATACGGAGACAGAAGTGGAAGGATCCATGGGGCTTTTCTTTATCAAATTCCCAATTAAGGCTACAGTATCATGTGAGGTACTTGTGGATACAA
ATAGCCAAACAATTGAGCATCAAGATTGCTACCCTGAGCTGAAACTGGGAAGCGGGAACGCCCGTGATATTGTTGAATTCGAGTGTTAA
Protein sequence
MSCSKDGSIPVPYSPIPANATAPQNVVVLSLYRPPLYRHRRLLRLCALYSVAFLLLSAVVFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSASVRVRNKNF
FSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIKATVSCEVLVDTNSQTIEHQ
DCYPELKLGSGNARDIVEFEC
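
The protein sequence corresponds to a translation of the CDS sequence. The following is a minimal sketch of that translation, assuming the standard genetic code (NCBI translation table 1); `translate` is a hypothetical helper written for illustration, and only the first and last few codons of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "")
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

# First seven codons and last five codons of the CDS above.
print(translate("ATGAGCTGCTCTAAGGACGGT"))
print(translate("GAATTCGAGTGTTAA"))
```

The two fragments translate to MSCSKDG and EFEC, matching the start and end of the listed protein sequence.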