; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh10G006450 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh10G006450
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionLEA_2 domain-containing protein
Genome locationCmo_Chr10:2965232..2967320
RNA-Seq ExpressionCmoCh10G006450
SyntenyCmoCh10G006450
Gene Ontology termsGO:0009269 - response to desiccation (biological process)
GO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR004864 - Late embryogenesis abundant protein, LEA_2 subgroup


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6589903.1 hypothetical protein SDJN03_15326, partial [Cucurbita argyrosperma subsp. sororia]3.2e-11198.14Show/hide
Query:  MSCSKDGSIPVPYSPIPPNAAAPQNLVVLSLYRPPLYRQRRLLRLCVLYSAAFLLLSAVVFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSASVRV
        MSCSKDGSIPVPYSPIPPNAAAPQN+VVLSLYRPPLYRQRRLLRLC LYSAAFLLLSA VFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSASVRV
Subjt:  MSCSKDGSIPVPYSPIPPNAAAPQNLVVLSLYRPPLYRQRRLLRLCVLYSAAFLLLSAVVFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSASVRV

Query:  RNNNFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIKATVSCEVFV
        RN NFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIKATVSCEVFV
Subjt:  RNNNFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIKATVSCEVFV

Query:  DTNSQTIEHQDCYPE
        DTNSQTIEHQDCYPE
Subjt:  DTNSQTIEHQDCYPE

KAG7023573.1 hypothetical protein SDJN02_14599, partial [Cucurbita argyrosperma subsp. argyrosperma]1.1e-10680.38Show/hide
Query:  MSCSKDGSIPVPYSPIPPNAAAPQNLVVLSLYRPPLYRQRRLLRLCVLYSAAFLLLSAVVFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSASVRV
        MSCSKDGSIPVPYSPIPPNAAAPQN+VVLSLYRPPLYRQRRLLRLC LYSAAFLLLSA VFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSASVRV
Subjt:  MSCSKDGSIPVPYSPIPPNAAAPQNLVVLSLYRPPLYRQRRLLRLCVLYSAAFLLLSAVVFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSASVRV

Query:  RNNNFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIKAT-------
        RN NFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIK +       
Subjt:  RNNNFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIKAT-------

Query:  ------------------VSCEVFVDTNSQTIEHQDCYPEFDYDSCDVKLKLGSGNSPDIVESEC
                          +     +     T+        FDYDSCDVKLKLGSGNSPDIVESEC
Subjt:  ------------------VSCEVFVDTNSQTIEHQDCYPEFDYDSCDVKLKLGSGNSPDIVESEC

XP_022960913.1 uncharacterized protein LOC111461574 [Cucurbita moschata]2.6e-113100Show/hide
Query:  MSCSKDGSIPVPYSPIPPNAAAPQNLVVLSLYRPPLYRQRRLLRLCVLYSAAFLLLSAVVFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSASVRV
        MSCSKDGSIPVPYSPIPPNAAAPQNLVVLSLYRPPLYRQRRLLRLCVLYSAAFLLLSAVVFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSASVRV
Subjt:  MSCSKDGSIPVPYSPIPPNAAAPQNLVVLSLYRPPLYRQRRLLRLCVLYSAAFLLLSAVVFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSASVRV

Query:  RNNNFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIKATVSCEVFV
        RNNNFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIKATVSCEVFV
Subjt:  RNNNFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIKATVSCEVFV

Query:  DTNSQTIEHQDCYPE
        DTNSQTIEHQDCYPE
Subjt:  DTNSQTIEHQDCYPE

XP_022987870.1 uncharacterized protein LOC111485280 [Cucurbita maxima]1.2e-11097.67Show/hide
Query:  MSCSKDGSIPVPYSPIPPNAAAPQNLVVLSLYRPPLYRQRRLLRLCVLYSAAFLLLSAVVFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSASVRV
        MSCSKDGSIPVPYSPIPPNAAAPQN+VVLSLYRPPLYRQRRLLRLC LYSAAFLLLSAVVFLLFPSDPSLQLVRLKLNGV VRLLPAVVLDLSFSASVRV
Subjt:  MSCSKDGSIPVPYSPIPPNAAAPQNLVVLSLYRPPLYRQRRLLRLCVLYSAAFLLLSAVVFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSASVRV

Query:  RNNNFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIKATVSCEVFV
        RN NFFSLDYNYLGVSVG+RGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIKATVSCEVFV
Subjt:  RNNNFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIKATVSCEVFV

Query:  DTNSQTIEHQDCYPE
        DTNSQTIEHQDCYPE
Subjt:  DTNSQTIEHQDCYPE

XP_023515526.1 uncharacterized protein LOC111779657 [Cucurbita pepo subsp. pepo]1.1e-10896.28Show/hide
Query:  MSCSKDGSIPVPYSPIPPNAAAPQNLVVLSLYRPPLYRQRRLLRLCVLYSAAFLLLSAVVFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSASVRV
        MSCSKDGSIPVPYSPIP NA APQN+VVLSLYRPPLYR RRLLRLC LYS AFLLLSAVVFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSASVRV
Subjt:  MSCSKDGSIPVPYSPIPPNAAAPQNLVVLSLYRPPLYRQRRLLRLCVLYSAAFLLLSAVVFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSASVRV

Query:  RNNNFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIKATVSCEVFV
        RN NFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIKATVSCEV V
Subjt:  RNNNFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIKATVSCEVFV

Query:  DTNSQTIEHQDCYPE
        DTNSQTIEHQDCYPE
Subjt:  DTNSQTIEHQDCYPE

TrEMBL top hitse value%identityAlignment
A0A0A0LTV4 LEA_2 domain-containing protein2.2e-8679.91Show/hide
Query:  SCSKDGSIPVPYSPIPPNAAAPQNLVVLSLYRPPLYRQRRLLRLCVLYSAAFLLLSAVVFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSASVRVR
        S S D S+PVPY+ IP N AA QN+VVLSLYRPP  R RRLLRLC  YSAAFLLL AV FLLFPSDPSLQLVRLKLN V V L+P V LDLSFS S+RVR
Subjt:  SCSKDGSIPVPYSPIPPNAAAPQNLVVLSLYRPPLYRQRRLLRLCVLYSAAFLLLSAVVFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSASVRVR

Query:  NNNFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIKATVSCEVFVD
        N NFFSL+YN+LGVSVGYRGRRLG+VSS+GGRVSARGSSYVNATLDLNGL+++HDV +LL DL KGIIPFDTET+VEGSMGLFFIK PIKA VSCEV V+
Subjt:  NNNFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIKATVSCEVFVD

Query:  TNSQTIEHQDCYPE
        TN+QTIEHQDCYPE
Subjt:  TNSQTIEHQDCYPE

A0A1S3CJK6 uncharacterized protein LOC1035015511.5e-8278.77Show/hide
Query:  SKDGSIPVPYSPIPPNAAAPQNLVVLSLYRPPLYRQRRLLRLCVLYSAAFLLLSAVVFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSASVRVRNN
        S D S+PVPY+ +  N AA QN+VVLSLYRP   R RRLLRL   YSAAFLLL AV FLLFPSDPSLQLVRLKLN V V L+P V LDLSFS S+RVRN 
Subjt:  SKDGSIPVPYSPIPPNAAAPQNLVVLSLYRPPLYRQRRLLRLCVLYSAAFLLLSAVVFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSASVRVRNN

Query:  NFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIKATVSCEVFVDTN
        NFFSL+YN+LGVSVGYRGRRLG+VSS GGRVSARGSSYVNATLDLNGL+++HDV +LL DL KGIIPFDTETEVEGSMGLFFIK PIKA VSCEV V+TN
Subjt:  NFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIKATVSCEVFVDTN

Query:  SQTIEHQDCYPE
        +QTIEHQDCYPE
Subjt:  SQTIEHQDCYPE

A0A6J1CTN0 uncharacterized protein LOC1110144738.8e-9181.78Show/hide
Query:  SCSKDGSIPVPYSPIPPNAAAPQNLVVLSLYRPPLYRQRRLLRLCVLYSAAFLLLSAVVFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSASVRVR
        S S+D S+PVPYS +PPN AA QN+VVLSLYRPP +R+RRLLRLC  YSAAFLLLSAV FLLFP+DPSLQLVRLKLN + VRLLP ++LDLSFSASVRVR
Subjt:  SCSKDGSIPVPYSPIPPNAAAPQNLVVLSLYRPPLYRQRRLLRLCVLYSAAFLLLSAVVFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSASVRVR

Query:  NNNFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIKATVSCEVFVD
        NNNFFSLDYNYLGVSVGYRGRRLGFVSS+GGRVSARG SYVNATLDLNG ++IHD  +L+EDL  GI+PFDTETEVEG MGLFFIKFPIKA VSCEVFV+
Subjt:  NNNFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIKATVSCEVFVD

Query:  TNSQTIEHQDCYPE
        TN +TIEHQDCYPE
Subjt:  TNSQTIEHQDCYPE

A0A6J1HAC8 uncharacterized protein LOC1114615741.3e-113100Show/hide
Query:  MSCSKDGSIPVPYSPIPPNAAAPQNLVVLSLYRPPLYRQRRLLRLCVLYSAAFLLLSAVVFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSASVRV
        MSCSKDGSIPVPYSPIPPNAAAPQNLVVLSLYRPPLYRQRRLLRLCVLYSAAFLLLSAVVFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSASVRV
Subjt:  MSCSKDGSIPVPYSPIPPNAAAPQNLVVLSLYRPPLYRQRRLLRLCVLYSAAFLLLSAVVFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSASVRV

Query:  RNNNFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIKATVSCEVFV
        RNNNFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIKATVSCEVFV
Subjt:  RNNNFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIKATVSCEVFV

Query:  DTNSQTIEHQDCYPE
        DTNSQTIEHQDCYPE
Subjt:  DTNSQTIEHQDCYPE

A0A6J1JI07 uncharacterized protein LOC1114852805.9e-11197.67Show/hide
Query:  MSCSKDGSIPVPYSPIPPNAAAPQNLVVLSLYRPPLYRQRRLLRLCVLYSAAFLLLSAVVFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSASVRV
        MSCSKDGSIPVPYSPIPPNAAAPQN+VVLSLYRPPLYRQRRLLRLC LYSAAFLLLSAVVFLLFPSDPSLQLVRLKLNGV VRLLPAVVLDLSFSASVRV
Subjt:  MSCSKDGSIPVPYSPIPPNAAAPQNLVVLSLYRPPLYRQRRLLRLCVLYSAAFLLLSAVVFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSASVRV

Query:  RNNNFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIKATVSCEVFV
        RN NFFSLDYNYLGVSVG+RGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIKATVSCEVFV
Subjt:  RNNNFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIKATVSCEVFV

Query:  DTNSQTIEHQDCYPE
        DTNSQTIEHQDCYPE
Subjt:  DTNSQTIEHQDCYPE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G52330.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family1.9e-3739.11Show/hide
Query:  YSPIPPNAAAPQNLVVLSLYRPPLYRQRRLLRLCVLYSAAFLLLSAVVFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSASVRVRNNNFFSLDYNY
        Y P+P +++   N  VL    P    +RR +    L S A    S ++++ +PSDP ++++R+K++ V+V   P   +D++   +++V N + +S D+  
Subjt:  YSPIPPNAAAPQNLVVLSLYRPPLYRQRRLLRLCVLYSAAFLLLSAVVFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSASVRVRNNNFFSLDYNY

Query:  LGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIKATVSCEVFVDTNSQTIEHQDC
        L V++ YRG+ LG VSSDGG V+A GSSY++A  +L+G+ +  DV  L+ DL KG + FDT TE  G +G+ F +FP+KA V+C + VDT +QTI  Q C
Subjt:  LGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIKATVSCEVFVDTNSQTIEHQDC

Query:  YP
         P
Subjt:  YP

AT1G52330.2 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family5.0e-3037.43Show/hide
Query:  YSPIPPNAAAPQNLVVLSLYRPPLYRQRRLLRLCVLYSAAFLLLSAVVFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSASVRVRNNNFFSLDYNY
        Y P+P +++   N  VL    P    +RR +    L S A    S ++++ +PSDP ++++R+K++ V+V   P   +D++   +++V N + +S D+  
Subjt:  YSPIPPNAAAPQNLVVLSLYRPPLYRQRRLLRLCVLYSAAFLLLSAVVFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSASVRVRNNNFFSLDYNY

Query:  LGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIK
        L V++ YRG+ LG VSSDGG V+A GSSY++A  +L+G+ +  DV  L+ DL KG + FDT TE  G +G+ F +FP+K
Subjt:  LGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIK

AT4G13270.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family7.2e-5348.39Show/hide
Query:  MSCSKDGSIPVPYSPIPPNAAAPQNLVVLSLY----RPPLYRQRRLLRLCVLYSAAFLLLSAVVFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSA
        M+ SK     +PY+P+ P++   Q++++L+ Y    RP L R    LR  +L++A  LLLSA V+LL+PSDP + + R+ LN ++V     + LDLSFS 
Subjt:  MSCSKDGSIPVPYSPIPPNAAAPQNLVVLSLY----RPPLYRQRRLLRLCVLYSAAFLLLSAVVFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSA

Query:  SVRVRNNNFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIKATVSC
        +++VRN +FFSLDY+ L VS+GYRGR LG V S GG + AR SSY++ATL+L+GL+++HDV +L+ DL KG+IPFDT  +V+G +G+     PI+  VSC
Subjt:  SVRVRNNNFFSLDYNYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIKATVSC

Query:  EVFVDTNSQTIEHQDCY
        EV+V+ N+Q I HQDC+
Subjt:  EVFVDTNSQTIEHQDCY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCTGCTCTAAGGACGGTTCGATCCCTGTTCCTTACTCTCCTATTCCCCCAAATGCTGCTGCACCGCAAAACCTTGTCGTTTTATCTCTCTATCGTCCCCCTCTCTA
CCGGCAGCGGCGGCTTCTTCGCCTCTGTGTCCTCTACTCCGCCGCTTTCCTCCTCCTCTCCGCCGTTGTTTTTCTACTTTTTCCGTCCGATCCCTCGCTCCAACTCGTTC
GATTGAAACTCAATGGGGTGAACGTCCGTTTGTTGCCTGCTGTCGTCCTTGACCTTTCTTTCTCTGCTTCTGTTAGGGTTCGGAATAACAACTTTTTTTCTCTCGATTAC
AATTACCTTGGCGTTTCGGTCGGCTACCGGGGAAGACGACTTGGATTTGTGAGCTCTGATGGCGGTCGTGTCTCTGCTCGAGGCTCCTCTTACGTGAACGCCACTCTCGA
TTTGAATGGGTTACAGATCATTCACGACGTCTTTTTCTTGCTCGAGGATCTGAGGAAGGGTATAATTCCTTTCGATACGGAGACAGAAGTGGAAGGATCCATGGGGCTTT
TCTTTATCAAATTCCCAATTAAGGCTACAGTATCATGTGAGGTATTTGTGGATACAAATAGCCAAACAATTGAGCATCAAGATTGCTACCCTGAGTTTGATTATGACTCT
TGTGACGTGAAGCTGAAACTGGGAAGTGGGAACTCCCCTGATATTGTTGAATCTGAGTGTTAA
mRNA sequenceShow/hide mRNA sequence
ATATTGTAATTTATATTTTAGTTTTCAAAACTTTAGAAATTATAACAAATTTCAAAGCAATTATTGTTTGGAAACAAGTAAGTAATTTGAAAGTTGTGGCATAGGCTTTA
TTGGGACGTGTAAATTCTTGATAGGCTGAGGCGGGTAGCTTAGGGAGGGGCGAGAGTTTTTGCCACGTTACTCATAAAGATCAGTTCCCATTTCAATTCTCTCTAACAAA
GCATGAGCTGCTCTAAGGACGGTTCGATCCCTGTTCCTTACTCTCCTATTCCCCCAAATGCTGCTGCACCGCAAAACCTTGTCGTTTTATCTCTCTATCGTCCCCCTCTC
TACCGGCAGCGGCGGCTTCTTCGCCTCTGTGTCCTCTACTCCGCCGCTTTCCTCCTCCTCTCCGCCGTTGTTTTTCTACTTTTTCCGTCCGATCCCTCGCTCCAACTCGT
TCGATTGAAACTCAATGGGGTGAACGTCCGTTTGTTGCCTGCTGTCGTCCTTGACCTTTCTTTCTCTGCTTCTGTTAGGGTTCGGAATAACAACTTTTTTTCTCTCGATT
ACAATTACCTTGGCGTTTCGGTCGGCTACCGGGGAAGACGACTTGGATTTGTGAGCTCTGATGGCGGTCGTGTCTCTGCTCGAGGCTCCTCTTACGTGAACGCCACTCTC
GATTTGAATGGGTTACAGATCATTCACGACGTCTTTTTCTTGCTCGAGGATCTGAGGAAGGGTATAATTCCTTTCGATACGGAGACAGAAGTGGAAGGATCCATGGGGCT
TTTCTTTATCAAATTCCCAATTAAGGCTACAGTATCATGTGAGGTATTTGTGGATACAAATAGCCAAACAATTGAGCATCAAGATTGCTACCCTGAGTTTGATTATGACT
CTTGTGACGTGAAGCTGAAACTGGGAAGTGGGAACTCCCCTGATATTGTTGAATCTGAGTGTTAA
Protein sequenceShow/hide protein sequence
MSCSKDGSIPVPYSPIPPNAAAPQNLVVLSLYRPPLYRQRRLLRLCVLYSAAFLLLSAVVFLLFPSDPSLQLVRLKLNGVNVRLLPAVVLDLSFSASVRVRNNNFFSLDY
NYLGVSVGYRGRRLGFVSSDGGRVSARGSSYVNATLDLNGLQIIHDVFFLLEDLRKGIIPFDTETEVEGSMGLFFIKFPIKATVSCEVFVDTNSQTIEHQDCYPEFDYDS
CDVKLKLGSGNSPDIVESEC