CuGenDBv2

Carg09863 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene ID: Carg09863
Organism: Cucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
Description: LEA_2 domain-containing protein
Genome location: Carg_Chr10:2845232..2847740
RNA-Seq Expression: Carg09863
Synteny: Carg09863
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR004864 - Late embryogenesis abundant protein, LEA_2 subgroup
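
The genome location above has the form seqid:start..end. A minimal sketch of reading it, assuming the coordinates are 1-based and inclusive (the record does not state this); `parse_location` is a hypothetical helper, not part of CuGenDB:

```python
import re

def parse_location(location):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)

seqid, start, end = parse_location("Carg_Chr10:2845232..2847740")
# Assumption: 1-based inclusive coordinates, so both endpoints are counted.
span = end - start + 1
print(seqid, span)
```

Under that assumption the gene spans 2509 bases on Carg_Chr10.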


Homology
GenBank top hits (e-value, % identity, alignment)
KAG6589903.1 hypothetical protein SDJN03_15326, partial [Cucurbita argyrosperma subsp. sororia] | e-value 3.4e-98 | identity 98.96%
Query:  MSCSKDGSIPVPYSPIPPNAAAPQNVVVLSLYRPPLYRQRRLLRLCALYSAAFLLLSAFVFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSASVRV
        MSCSKDGSIPVPYSPIPPNAAAPQNVVVLSLYRPPLYRQRRLLRLCALYSAAFLLLSAFVFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSASVRV
Subjt:  MSCSKDGSIPVPYSPIPPNAAAPQNVVVLSLYRPPLYRQRRLLRLCALYSAAFLLLSAFVFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSASVRV

Query:  RNKNFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIKTS
        RNKNFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIK +
Subjt:  RNKNFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIKTS

KAG7023573.1 hypothetical protein SDJN02_14599, partial [Cucurbita argyrosperma subsp. argyrosperma] | e-value 6.6e-142 | identity 100%
Query:  MSCSKDGSIPVPYSPIPPNAAAPQNVVVLSLYRPPLYRQRRLLRLCALYSAAFLLLSAFVFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSASVRV
        MSCSKDGSIPVPYSPIPPNAAAPQNVVVLSLYRPPLYRQRRLLRLCALYSAAFLLLSAFVFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSASVRV
Subjt:  MSCSKDGSIPVPYSPIPPNAAAPQNVVVLSLYRPPLYRQRRLLRLCALYSAAFLLLSAFVFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSASVRV

Query:  RNKNFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIKTSNVFLVEY
        RNKNFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIKTSNVFLVEY
Subjt:  RNKNFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIKTSNVFLVEY

Query:  VIRVKRIPVIPYHVRYLWIQIAKQLSIKIATLSEPKMGIRFDYDSCDVKLKLGSGNSPDIVESEC
        VIRVKRIPVIPYHVRYLWIQIAKQLSIKIATLSEPKMGIRFDYDSCDVKLKLGSGNSPDIVESEC
Subjt:  VIRVKRIPVIPYHVRYLWIQIAKQLSIKIATLSEPKMGIRFDYDSCDVKLKLGSGNSPDIVESEC

XP_022960913.1 uncharacterized protein LOC111461574 [Cucurbita moschata] | e-value 5.4e-96 | identity 96.89%
Query:  MSCSKDGSIPVPYSPIPPNAAAPQNVVVLSLYRPPLYRQRRLLRLCALYSAAFLLLSAFVFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSASVRV
        MSCSKDGSIPVPYSPIPPNAAAPQN+VVLSLYRPPLYRQRRLLRLC LYSAAFLLLSA VFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSASVRV
Subjt:  MSCSKDGSIPVPYSPIPPNAAAPQNVVVLSLYRPPLYRQRRLLRLCALYSAAFLLLSAFVFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSASVRV

Query:  RNKNFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIKTS
        RN NFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIK +
Subjt:  RNKNFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIKTS

XP_022987870.1 uncharacterized protein LOC111485280 [Cucurbita maxima] | e-value 3.2e-96 | identity 97.41%
Query:  MSCSKDGSIPVPYSPIPPNAAAPQNVVVLSLYRPPLYRQRRLLRLCALYSAAFLLLSAFVFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSASVRV
        MSCSKDGSIPVPYSPIPPNAAAPQNVVVLSLYRPPLYRQRRLLRLCALYSAAFLLLSA VFLLFPSDPSLQLVRLKLNGV VRLLPAVVLDLSFSASVRV
Subjt:  MSCSKDGSIPVPYSPIPPNAAAPQNVVVLSLYRPPLYRQRRLLRLCALYSAAFLLLSAFVFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSASVRV

Query:  RNKNFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIKTS
        RNKNFFSLDYNYLGVSVG+RGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIK +
Subjt:  RNKNFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIKTS

XP_023515526.1 uncharacterized protein LOC111779657 [Cucurbita pepo subsp. pepo] | e-value 6.0e-95 | identity 96.37%
Query:  MSCSKDGSIPVPYSPIPPNAAAPQNVVVLSLYRPPLYRQRRLLRLCALYSAAFLLLSAFVFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSASVRV
        MSCSKDGSIPVPYSPIP NA APQNVVVLSLYRPPLYR RRLLRLCALYS AFLLLSA VFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSASVRV
Subjt:  MSCSKDGSIPVPYSPIPPNAAAPQNVVVLSLYRPPLYRQRRLLRLCALYSAAFLLLSAFVFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSASVRV

Query:  RNKNFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIKTS
        RNKNFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIK +
Subjt:  RNKNFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIKTS
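
The % identity values listed with each hit can be related to the unlabeled middle row of each alignment block. A minimal sketch, assuming the usual BLAST convention that a letter in the middle row marks an identical residue, "+" a conservative substitution and a space a mismatch or gap, and that each middle row is padded to the full width of its block; `percent_identity` is a hypothetical helper:

```python
def percent_identity(midline_blocks):
    """Percent identity from BLAST-style middle rows.

    Only letters count as identical columns; '+' and spaces do not.
    The denominator is the total number of alignment columns.
    """
    midline = "".join(midline_blocks)
    identical = sum(ch.isalpha() for ch in midline)
    return 100.0 * identical / len(midline)

# Toy middle row, not taken from the record: 4 identical columns out of 6.
print(round(percent_identity(["FPIK +"]), 2))
```

Counting the two middle rows of the first GenBank hit as printed gives 191 letters over 193 columns, which is consistent with the 98.96 listed for that hit.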

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0LTV4 LEA_2 domain-containing protein | e-value 4.8e-74 | identity 80.53%
Query:  SCSKDGSIPVPYSPIPPNAAAPQNVVVLSLYRPPLYRQRRLLRLCALYSAAFLLLSAFVFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSASVRVR
        S S D S+PVPY+ IP N AA QNVVVLSLYRPP  R RRLLRLCA YSAAFLLL A  FLLFPSDPSLQLVRLKLN V V L+P V LDLSFS S+RVR
Subjt:  SCSKDGSIPVPYSPIPPNAAAPQNVVVLSLYRPPLYRQRRLLRLCALYSAAFLLLSAFVFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSASVRVR

Query:  NKNFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIK
        NKNFFSL+YN+LGVSVGYRGRRLG+VSS+GGRVSARGSSYVNATLDLNGL+++HDV +LL DL KGIIPFDTET+VEGSMGLFFIK PIK
Subjt:  NKNFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIK

A0A5A7TX90 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family isoform 1 | e-value 3.2e-70 | identity 79.26%
Query:  SKDGSIPVPYSPIPPNAAAPQNVVVLSLYRPPLYRQRRLLRLCALYSAAFLLLSAFVFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSASVRVRNK
        S D S+PVPY+ +  N AA QNVVVLSLYRP   R RRLLRL A YSAAFLLL A  FLLFPSDPSLQLVRLKLN V V L+P V LDLSFS S+RVRNK
Subjt:  SKDGSIPVPYSPIPPNAAAPQNVVVLSLYRPPLYRQRRLLRLCALYSAAFLLLSAFVFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSASVRVRNK

Query:  NFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIK
        NFFSL+YN+LGVSVGYRGRRLG+VSS GGRVSARGSSYVNATLDLNGL+++HDV +LL DL KGIIPFDTETEVEGSMGLFFIK PIK
Subjt:  NFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIK

A0A6J1CTN0 uncharacterized protein LOC111014473 | e-value 4.7e-77 | identity 81.58%
Query:  SCSKDGSIPVPYSPIPPNAAAPQNVVVLSLYRPPLYRQRRLLRLCALYSAAFLLLSAFVFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSASVRVR
        S S+D S+PVPYS +PPN AA QNVVVLSLYRPP +R+RRLLRLCA YSAAFLLLSA  FLLFP+DPSLQLVRLKLN + VRLLP ++LDLSFSASVRVR
Subjt:  SCSKDGSIPVPYSPIPPNAAAPQNVVVLSLYRPPLYRQRRLLRLCALYSAAFLLLSAFVFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSASVRVR

Query:  NKNFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIK
        N NFFSLDYNYLGVSVGYRGRRLGFVSS+GGRVSARG SYVNATLDLNG ++IHD  +L+EDL  GI+PFDTETEVEG MGLFFIKFPIK
Subjt:  NKNFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIK

A0A6J1HAC8 uncharacterized protein LOC111461574 | e-value 2.6e-96 | identity 96.89%
Query:  MSCSKDGSIPVPYSPIPPNAAAPQNVVVLSLYRPPLYRQRRLLRLCALYSAAFLLLSAFVFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSASVRV
        MSCSKDGSIPVPYSPIPPNAAAPQN+VVLSLYRPPLYRQRRLLRLC LYSAAFLLLSA VFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSASVRV
Subjt:  MSCSKDGSIPVPYSPIPPNAAAPQNVVVLSLYRPPLYRQRRLLRLCALYSAAFLLLSAFVFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSASVRV

Query:  RNKNFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIKTS
        RN NFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIK +
Subjt:  RNKNFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIKTS

A0A6J1JI07 uncharacterized protein LOC111485280 | e-value 1.5e-96 | identity 97.41%
Query:  MSCSKDGSIPVPYSPIPPNAAAPQNVVVLSLYRPPLYRQRRLLRLCALYSAAFLLLSAFVFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSASVRV
        MSCSKDGSIPVPYSPIPPNAAAPQNVVVLSLYRPPLYRQRRLLRLCALYSAAFLLLSA VFLLFPSDPSLQLVRLKLNGV VRLLPAVVLDLSFSASVRV
Subjt:  MSCSKDGSIPVPYSPIPPNAAAPQNVVVLSLYRPPLYRQRRLLRLCALYSAAFLLLSAFVFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSASVRV

Query:  RNKNFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIKTS
        RNKNFFSLDYNYLGVSVG+RGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIK +
Subjt:  RNKNFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIKTS

SwissProt top hits (e-value, % identity, alignment)
No hits found

Arabidopsis top hits (e-value, % identity, alignment)
AT1G52330.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family | e-value 1.2e-29 | identity 37.43%
Query:  YSPIPPNAAAPQNVVVLSLYRPPLYRQRRLLRLCALYSAAFLLLSAFVFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSASVRVRNKNFFSLDYNY
        Y P+P +++   N  VL    P    +RR +    L S A +L    +++ +PSDP ++++R+K++ V+V   P   +D++   +++V N + +S D+  
Subjt:  YSPIPPNAAAPQNVVVLSLYRPPLYRQRRLLRLCALYSAAFLLLSAFVFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSASVRVRNKNFFSLDYNY

Query:  LGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIK
        L V++ YRG+ LG VSSDGG V+A GSSY++A  +L+G+ +  DV  L+ DL KG + FDT TE  G +G+ F +FP+K
Subjt:  LGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIK

AT1G52330.2 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family | e-value 4.2e-30 | identity 35.75%
Query:  YSPIPPNAAAPQNVVVLSLYRPPLYRQRRLLRLCALYSAAFLLLSAFVFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSASVRVRNKNFFSLDYNY
        Y P+P +++   N  VL    P    +RR +    L S A +L    +++ +PSDP ++++R+K++ V+V   P   +D++   +++V N + +S D+  
Subjt:  YSPIPPNAAAPQNVVVLSLYRPPLYRQRRLLRLCALYSAAFLLLSAFVFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSASVRVRNKNFFSLDYNY

Query:  LGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIKTSNVFLVEYVIRVK
        L V++ YRG+ LG VSSDGG V+A GSSY++A  +L+G+ +  DV  L+ DL KG + FDT TE  G +G+ F +FP+K  N+     +  +K
Subjt:  LGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIKTSNVFLVEYVIRVK

AT4G13270.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family | e-value 8.8e-44 | identity 47.69%
Query:  MSCSKDGSIPVPYSPIPPNAAAPQNVVVLSLY----RPPLYRQRRLLRLCALYSAAFLLLSAFVFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSA
        M+ SK     +PY+P+ P++   Q+V++L+ Y    RP L R    LR   L++A  LLLSA V+LL+PSDP + + R+ LN ++V     + LDLSFS 
Subjt:  MSCSKDGSIPVPYSPIPPNAAAPQNVVVLSLY----RPPLYRQRRLLRLCALYSAAFLLLSAFVFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSA

Query:  SVRVRNKNFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIK
        +++VRN++FFSLDY+ L VS+GYRGR LG V S GG + AR SSY++ATL+L+GL+++HDV +L+ DL KG+IPFDT  +V+G +G+     PI+
Subjt:  SVRVRNKNFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIK


Sequences
CDS sequence
ATGAGCTGCTCTAAGGACGGTTCGATCCCTGTTCCTTACTCTCCTATTCCCCCAAATGCTGCTGCACCGCAAAACGTTGTCGTTTTATCTCTCTATCGTCCCCCTCTCTA
CCGGCAGCGGCGGCTTCTTCGCCTCTGTGCCCTCTACTCCGCCGCTTTCCTCCTCCTCTCCGCCTTTGTTTTTCTACTTTTTCCGTCCGATCCCTCGCTCCAACTCGTTC
GATTGAAACTCAATGGGGTGAACGTCCGTTTGTTGCCTGCTGTCGTCCTTGACCTTTCTTTCTCTGCTTCTGTTAGGGTTCGGAATAAGAACTTTTTTTCTCTCGATTAT
AATTACCTTGGCGTTTCGGTCGGCTACCGGGGAAGACGACTTGGATTTGTGAGCTCTGATGGCGGTCGAGTTTCTGCTCGAGGCTCCTCTTACGTGAACGCCACTCTCGA
TTTGAATGGGTTACAGATCATTCACGACGTCTTTTTCTTGCTTGAGGATCTAAGGAAGGGTATAATTCCTTTCGATACGGAGACAGAAGTGGAAGGATCCATGGGGCTTT
TCTTTATCAAATTCCCAATTAAGACCTCGAATGTTTTTCTAGTGGAATATGTCATCCGAGTTAAAAGAATCCCTGTAATCCCTTATCATGTGAGGTATTTGTGGATACAA
ATAGCCAAACAATTGAGCATCAAGATTGCTACCCTGAGTGAACCAAAGATGGGAATTCGGTTTGATTATGACTCTTGTGACGTGAAGCTGAAACTGGGAAGTGGGAACTC
CCCTGATATTGTTGAATCTGAGTGTTAA
mRNA sequence
ATGAGCTGCTCTAAGGACGGTTCGATCCCTGTTCCTTACTCTCCTATTCCCCCAAATGCTGCTGCACCGCAAAACGTTGTCGTTTTATCTCTCTATCGTCCCCCTCTCTA
CCGGCAGCGGCGGCTTCTTCGCCTCTGTGCCCTCTACTCCGCCGCTTTCCTCCTCCTCTCCGCCTTTGTTTTTCTACTTTTTCCGTCCGATCCCTCGCTCCAACTCGTTC
GATTGAAACTCAATGGGGTGAACGTCCGTTTGTTGCCTGCTGTCGTCCTTGACCTTTCTTTCTCTGCTTCTGTTAGGGTTCGGAATAAGAACTTTTTTTCTCTCGATTAT
AATTACCTTGGCGTTTCGGTCGGCTACCGGGGAAGACGACTTGGATTTGTGAGCTCTGATGGCGGTCGAGTTTCTGCTCGAGGCTCCTCTTACGTGAACGCCACTCTCGA
TTTGAATGGGTTACAGATCATTCACGACGTCTTTTTCTTGCTTGAGGATCTAAGGAAGGGTATAATTCCTTTCGATACGGAGACAGAAGTGGAAGGATCCATGGGGCTTT
TCTTTATCAAATTCCCAATTAAGACCTCGAATGTTTTTCTAGTGGAATATGTCATCCGAGTTAAAAGAATCCCTGTAATCCCTTATCATGTGAGGTATTTGTGGATACAA
ATAGCCAAACAATTGAGCATCAAGATTGCTACCCTGAGTGAACCAAAGATGGGAATTCGGTTTGATTATGACTCTTGTGACGTGAAGCTGAAACTGGGAAGTGGGAACTC
CCCTGATATTGTTGAATCTGAGTGTTAATATTACTCGCAGAAAGTTAAGCTCCATTGTTCATATGCTAGAGATTTAAAAGAAAAAGTAAGTTCCGAATATCTTTGTGGAT
TATTACTTATAAACTATGCTTTTATAATGTAGTCAGATTGTTACACAAGTTAAAGTTCATTTGTTATGTGTAGTAGTAGGCTGTTGTTGCCATTGCAATAATGAGAGTAA
ATTCTGTGCTTTACTCGTGCTTCTGTGTTCATTCGTGTGAAGAGTTTCTTACCCATTTGGGAATTAAACATTCTGAAATAAAGATAAATCTATGTGGAATCACATCATAT
GGGAAATTAGTCGAAGAAATGAAGTTTTGAAAACATTCAAGCAAATTAAAAGCCACTGCTATTCATTTTCTTCTGCAATTACATTCCGGCGATTCAAACACCAATTACTC
ATCGTTAAACAGAGCGACCTTTTTACTCGCCGCCGGTCCACTTTTCACCATCCGCGGCCTCCCAACCCAAACGTACTCACTTCGGTGCAGCCTGCAACGAAACGACCCAG
GGTGCCTCGTCGGCGAGCACAAGCACCGCTTTTTCACCGACTCGCCACCGCCACTCTCAACCGCCGTCGCCGCCGTCTGCAAGAGGACCCAATTGCATTGAACATTCAAA
AACTCCCTCC
Protein sequence
MSCSKDGSIPVPYSPIPPNAAAPQNVVVLSLYRPPLYRQRRLLRLCALYSAAFLLLSAFVFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSASVRVRNKNFFSLDY
NYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIKTSNVFLVEYVIRVKRIPVIPYHVRYLWIQ
IAKQLSIKIATLSEPKMGIRFDYDSCDVKLKLGSGNSPDIVESEC
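
The protein sequence corresponds to the CDS read codon by codon. A minimal sketch of that relationship, assuming the standard nuclear genetic code; `translate` is a hypothetical helper, and only the first 30 and last 15 bases of the CDS are used here to keep the example short:

```python
from itertools import product

BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Standard genetic code, with codons enumerated in TCAG order.
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGAGCTGCTCTAAGGACGGTTCGATCCCT"))  # first 30 bases of the CDS
print(translate("GAATCTGAGTGTTAA"))                 # last 15 bases of the CDS
```

The two calls print MSCSKDGSIP and ESEC, matching the start and end of the protein sequence above; the final TAA codon is the stop and is not translated.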