CuGenDBv2

Carg01974 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene ID: Carg01974
Organism: Cucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
Description: Yae1_N domain-containing protein
Genome location: Carg_Chr04:8272769..8275813
RNA-Seq Expression: Carg01974
Synteny: Carg01974
Gene Ontology terms:
GO:0000398 - mRNA splicing, via spliceosome (biological process)
GO:0005681 - spliceosomal complex (cellular component)
GO:0005737 - cytoplasm (cellular component)
GO:0004386 - helicase activity (molecular function)
InterPro domains:
IPR019191 - Essential protein Yae1, N-terminal
IPR038881 - Yae1-like
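
The genome location string in the record above can be split into a chromosome name and a coordinate pair. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive (the page does not state its convention); `parse_location` and `span_length` are hypothetical helper names, not part of CuGenDB.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(start: int, end: int) -> int:
    """Length of the locus, assuming 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("Carg_Chr04:8272769..8275813")
print(chrom, span_length(start, end))  # Carg_Chr04 3045
```

Under that assumption the locus spans 3045 bp, which is longer than the mRNA listed further down and would be consistent with the gene containing introns.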


Homology
GenBank top hits (columns: hit | e value | % identity; the alignment follows each hit)
KAG6601147.1 Crossover junction endonuclease MUS81, partial [Cucurbita argyrosperma subsp. sororia] | e value: 3.0e-116 | % identity: 100
Query:  MEGTIAEELYSEGLQSTKSKLADISLSDYRQNRSSGSAVNDSCEDDGSLWGGSDEGLEETSDLDREWHRRHDQFHTIGYRDGLIAGKEAASQEGFNVGFK
        MEGTIAEELYSEGLQSTKSKLADISLSDYRQNRSSGSAVNDSCEDDGSLWGGSDEGLEETSDLDREWHRRHDQFHTIGYRDGLIAGKEAASQEGFNVGFK
Subjt:  MEGTIAEELYSEGLQSTKSKLADISLSDYRQNRSSGSAVNDSCEDDGSLWGGSDEGLEETSDLDREWHRRHDQFHTIGYRDGLIAGKEAASQEGFNVGFK

Query:  QSVSIGYKLGLVRGVSSVLASLPDDLKEKLVGIEENRSKFQSLYESVNSLSTADALRLFHDDIMAQCTKEECVDANTSSQTIDLLKQNSDNSRLGQFYGE
        QSVSIGYKLGLVRGVSSVLASLPDDLKEKLVGIEENRSKFQSLYESVNSLSTADALRLFHDDIMAQCTKEECVDANTSSQTIDLLKQNSDNSRLGQFYGE
Subjt:  QSVSIGYKLGLVRGVSSVLASLPDDLKEKLVGIEENRSKFQSLYESVNSLSTADALRLFHDDIMAQCTKEECVDANTSSQTIDLLKQNSDNSRLGQFYGE

Query:  LQALLPTSPALNLHLHEEQ
        LQALLPTSPALNLHLHEEQ
Subjt:  LQALLPTSPALNLHLHEEQ

KAG7031946.1 hypothetical protein SDJN02_05988 [Cucurbita argyrosperma subsp. argyrosperma] | e value: 3.0e-116 | % identity: 100
Query:  MEGTIAEELYSEGLQSTKSKLADISLSDYRQNRSSGSAVNDSCEDDGSLWGGSDEGLEETSDLDREWHRRHDQFHTIGYRDGLIAGKEAASQEGFNVGFK
        MEGTIAEELYSEGLQSTKSKLADISLSDYRQNRSSGSAVNDSCEDDGSLWGGSDEGLEETSDLDREWHRRHDQFHTIGYRDGLIAGKEAASQEGFNVGFK
Subjt:  MEGTIAEELYSEGLQSTKSKLADISLSDYRQNRSSGSAVNDSCEDDGSLWGGSDEGLEETSDLDREWHRRHDQFHTIGYRDGLIAGKEAASQEGFNVGFK

Query:  QSVSIGYKLGLVRGVSSVLASLPDDLKEKLVGIEENRSKFQSLYESVNSLSTADALRLFHDDIMAQCTKEECVDANTSSQTIDLLKQNSDNSRLGQFYGE
        QSVSIGYKLGLVRGVSSVLASLPDDLKEKLVGIEENRSKFQSLYESVNSLSTADALRLFHDDIMAQCTKEECVDANTSSQTIDLLKQNSDNSRLGQFYGE
Subjt:  QSVSIGYKLGLVRGVSSVLASLPDDLKEKLVGIEENRSKFQSLYESVNSLSTADALRLFHDDIMAQCTKEECVDANTSSQTIDLLKQNSDNSRLGQFYGE

Query:  LQALLPTSPALNLHLHEEQ
        LQALLPTSPALNLHLHEEQ
Subjt:  LQALLPTSPALNLHLHEEQ

XP_022990784.1 uncharacterized protein LOC111487537 isoform X1 [Cucurbita maxima] | e value: 6.5e-111 | % identity: 97.26
Query:  MEGTIAEELYSEGLQSTKSKLADISLSDYRQNRSSGSAVNDSCEDDGSLWGGSDEGLEETSDLDREWHRRHDQFHTIGYRDGLIAGKEAASQEGFNVGFK
        MEGTIAEELYSE LQSTKSKLAD+SLSDYRQNR SGSAVNDSCEDDGSLWGGSDEGLEETSDLDREWHRRHDQFHTIGYRDGLIAGKEAASQEGFNVGFK
Subjt:  MEGTIAEELYSEGLQSTKSKLADISLSDYRQNRSSGSAVNDSCEDDGSLWGGSDEGLEETSDLDREWHRRHDQFHTIGYRDGLIAGKEAASQEGFNVGFK

Query:  QSVSIGYKLGLVRGVSSVLASLPDDLKEKLVGIEENRSKFQSLYESVNSLSTADALRLFHDDIMAQCTKEECVDANTSSQTIDLLKQNSDNSRLGQFYGE
        QSVSIGYKLGLVRGVSSVLASLPDDLKEKLVGIEENRSKFQSLYESVNSLSTADALRLFHDDIMAQ TKEE VDANTSSQTIDLLKQNSD SRLGQFYGE
Subjt:  QSVSIGYKLGLVRGVSSVLASLPDDLKEKLVGIEENRSKFQSLYESVNSLSTADALRLFHDDIMAQCTKEECVDANTSSQTIDLLKQNSDNSRLGQFYGE

Query:  LQALLPTSPALNLHLHEEQ
        LQALLPTSPALNLHLHEEQ
Subjt:  LQALLPTSPALNLHLHEEQ

XP_023514282.1 uncharacterized protein LOC111778596 isoform X1 [Cucurbita pepo subsp. pepo] | e value: 3.4e-112 | % identity: 97.26
Query:  MEGTIAEELYSEGLQSTKSKLADISLSDYRQNRSSGSAVNDSCEDDGSLWGGSDEGLEETSDLDREWHRRHDQFHTIGYRDGLIAGKEAASQEGFNVGFK
        MEGTIAEELYSE LQSTKSKLAD+SLSDYRQNR SGSAVNDSCEDDGSLWGGSDEGLEETSDLDREWHRRHDQFHTIGYRDGLIAGKEAASQEGFNVGFK
Subjt:  MEGTIAEELYSEGLQSTKSKLADISLSDYRQNRSSGSAVNDSCEDDGSLWGGSDEGLEETSDLDREWHRRHDQFHTIGYRDGLIAGKEAASQEGFNVGFK

Query:  QSVSIGYKLGLVRGVSSVLASLPDDLKEKLVGIEENRSKFQSLYESVNSLSTADALRLFHDDIMAQCTKEECVDANTSSQTIDLLKQNSDNSRLGQFYGE
        QSVSIGYKLGLVRGVSSVLASLPDDLKEKLVGIEENRSKFQSLYESVNSLSTADALRLFH+DIMAQ TKEECVDANTSSQTIDLLKQNSDNSRLG+FYGE
Subjt:  QSVSIGYKLGLVRGVSSVLASLPDDLKEKLVGIEENRSKFQSLYESVNSLSTADALRLFHDDIMAQCTKEECVDANTSSQTIDLLKQNSDNSRLGQFYGE

Query:  LQALLPTSPALNLHLHEEQ
        LQALLPTSPALNLHLHEEQ
Subjt:  LQALLPTSPALNLHLHEEQ

XP_023514291.1 uncharacterized protein LOC111778596 isoform X2 [Cucurbita pepo subsp. pepo] | e value: 3.4e-112 | % identity: 97.26
Query:  MEGTIAEELYSEGLQSTKSKLADISLSDYRQNRSSGSAVNDSCEDDGSLWGGSDEGLEETSDLDREWHRRHDQFHTIGYRDGLIAGKEAASQEGFNVGFK
        MEGTIAEELYSE LQSTKSKLAD+SLSDYRQNR SGSAVNDSCEDDGSLWGGSDEGLEETSDLDREWHRRHDQFHTIGYRDGLIAGKEAASQEGFNVGFK
Subjt:  MEGTIAEELYSEGLQSTKSKLADISLSDYRQNRSSGSAVNDSCEDDGSLWGGSDEGLEETSDLDREWHRRHDQFHTIGYRDGLIAGKEAASQEGFNVGFK

Query:  QSVSIGYKLGLVRGVSSVLASLPDDLKEKLVGIEENRSKFQSLYESVNSLSTADALRLFHDDIMAQCTKEECVDANTSSQTIDLLKQNSDNSRLGQFYGE
        QSVSIGYKLGLVRGVSSVLASLPDDLKEKLVGIEENRSKFQSLYESVNSLSTADALRLFH+DIMAQ TKEECVDANTSSQTIDLLKQNSDNSRLG+FYGE
Subjt:  QSVSIGYKLGLVRGVSSVLASLPDDLKEKLVGIEENRSKFQSLYESVNSLSTADALRLFHDDIMAQCTKEECVDANTSSQTIDLLKQNSDNSRLGQFYGE

Query:  LQALLPTSPALNLHLHEEQ
        LQALLPTSPALNLHLHEEQ
Subjt:  LQALLPTSPALNLHLHEEQ

TrEMBL top hits (columns: hit | e value | % identity; the alignment follows each hit)
A0A1S4DX60 uncharacterized protein LOC103489487 | e value: 1.0e-93 | % identity: 82.19
Query:  MEGTIAEELYSEGLQSTKSKLADISLSDYRQNRSSGSAVNDSCEDDGSLWGGSDEGLEETSDLDREWHRRHDQFHTIGYRDGLIAGKEAASQEGFNVGFK
        MEGTIAEELYSE LQSTKSKL DISLSDY+++R SGS V+DSC+DDGSLWGGSDEGLEET+DLDREW RRHDQFHTIGYRDGLIAGKEAA+QEGFN+GFK
Subjt:  MEGTIAEELYSEGLQSTKSKLADISLSDYRQNRSSGSAVNDSCEDDGSLWGGSDEGLEETSDLDREWHRRHDQFHTIGYRDGLIAGKEAASQEGFNVGFK

Query:  QSVSIGYKLGLVRGVSSVLASLPDDLKEKLVGIEENRSKFQSLYESVNSLSTADALRLFHDDIMAQCTKEECVDANTSSQTIDLLKQNSDNSRLGQFYGE
        QSVS+GYKLGLVRGVSSVLASLPDDLKEKL G  EN+SKFQSLYESVNSLST DALRLF+ DI  Q TKEE V ANT+SQT+DLLK+N D  RLG+FY E
Subjt:  QSVSIGYKLGLVRGVSSVLASLPDDLKEKLVGIEENRSKFQSLYESVNSLSTADALRLFHDDIMAQCTKEECVDANTSSQTIDLLKQNSDNSRLGQFYGE

Query:  LQALLPTSPALNLHLHEEQ
        L A LP SPALN+HLHEE+
Subjt:  LQALLPTSPALNLHLHEEQ

A0A6J1H273 uncharacterized protein LOC111458903 | e value: 7.7e-110 | % identity: 96.35
Query:  MEGTIAEELYSEGLQSTKSKLADISLSDYRQNRSSGSAVNDSCEDDGSLWGGSDEGLEETSDLDREWHRRHDQFHTIGYRDGLIAGKEAASQEGFNVGFK
        MEGTIAEELYSE LQSTKSKLADISLSDYRQNR SGSAVNDSCEDDGSLWGGSDE LEETSDLDREWHRRHDQFHTIGYRDGLIAGKEAASQEGFNVGFK
Subjt:  MEGTIAEELYSEGLQSTKSKLADISLSDYRQNRSSGSAVNDSCEDDGSLWGGSDEGLEETSDLDREWHRRHDQFHTIGYRDGLIAGKEAASQEGFNVGFK

Query:  QSVSIGYKLGLVRGVSSVLASLPDDLKEKLVGIEENRSKFQSLYESVNSLSTADALRLFHDDIMAQCTKEECVDANTSSQTIDLLKQNSDNSRLGQFYGE
        QSVSIGYKLGLVRGVSSVLASLPDDLKEKLVGIEE RSKFQSLYESVNSLSTADALRLFHDDIMAQ TKEECVDANTSSQTIDLLKQNSD  RLGQFYGE
Subjt:  QSVSIGYKLGLVRGVSSVLASLPDDLKEKLVGIEENRSKFQSLYESVNSLSTADALRLFHDDIMAQCTKEECVDANTSSQTIDLLKQNSDNSRLGQFYGE

Query:  LQALLPTSPALNLHLHEEQ
        LQALLP SPALNLHLHEEQ
Subjt:  LQALLPTSPALNLHLHEEQ

A0A6J1JR95 uncharacterized protein LOC111487537 isoform X2 | e value: 3.1e-111 | % identity: 97.26
Query:  MEGTIAEELYSEGLQSTKSKLADISLSDYRQNRSSGSAVNDSCEDDGSLWGGSDEGLEETSDLDREWHRRHDQFHTIGYRDGLIAGKEAASQEGFNVGFK
        MEGTIAEELYSE LQSTKSKLAD+SLSDYRQNR SGSAVNDSCEDDGSLWGGSDEGLEETSDLDREWHRRHDQFHTIGYRDGLIAGKEAASQEGFNVGFK
Subjt:  MEGTIAEELYSEGLQSTKSKLADISLSDYRQNRSSGSAVNDSCEDDGSLWGGSDEGLEETSDLDREWHRRHDQFHTIGYRDGLIAGKEAASQEGFNVGFK

Query:  QSVSIGYKLGLVRGVSSVLASLPDDLKEKLVGIEENRSKFQSLYESVNSLSTADALRLFHDDIMAQCTKEECVDANTSSQTIDLLKQNSDNSRLGQFYGE
        QSVSIGYKLGLVRGVSSVLASLPDDLKEKLVGIEENRSKFQSLYESVNSLSTADALRLFHDDIMAQ TKEE VDANTSSQTIDLLKQNSD SRLGQFYGE
Subjt:  QSVSIGYKLGLVRGVSSVLASLPDDLKEKLVGIEENRSKFQSLYESVNSLSTADALRLFHDDIMAQCTKEECVDANTSSQTIDLLKQNSDNSRLGQFYGE

Query:  LQALLPTSPALNLHLHEEQ
        LQALLPTSPALNLHLHEEQ
Subjt:  LQALLPTSPALNLHLHEEQ

A0A6J1JRH1 uncharacterized protein LOC111487537 isoform X3 | e value: 3.1e-111 | % identity: 97.26
Query:  MEGTIAEELYSEGLQSTKSKLADISLSDYRQNRSSGSAVNDSCEDDGSLWGGSDEGLEETSDLDREWHRRHDQFHTIGYRDGLIAGKEAASQEGFNVGFK
        MEGTIAEELYSE LQSTKSKLAD+SLSDYRQNR SGSAVNDSCEDDGSLWGGSDEGLEETSDLDREWHRRHDQFHTIGYRDGLIAGKEAASQEGFNVGFK
Subjt:  MEGTIAEELYSEGLQSTKSKLADISLSDYRQNRSSGSAVNDSCEDDGSLWGGSDEGLEETSDLDREWHRRHDQFHTIGYRDGLIAGKEAASQEGFNVGFK

Query:  QSVSIGYKLGLVRGVSSVLASLPDDLKEKLVGIEENRSKFQSLYESVNSLSTADALRLFHDDIMAQCTKEECVDANTSSQTIDLLKQNSDNSRLGQFYGE
        QSVSIGYKLGLVRGVSSVLASLPDDLKEKLVGIEENRSKFQSLYESVNSLSTADALRLFHDDIMAQ TKEE VDANTSSQTIDLLKQNSD SRLGQFYGE
Subjt:  QSVSIGYKLGLVRGVSSVLASLPDDLKEKLVGIEENRSKFQSLYESVNSLSTADALRLFHDDIMAQCTKEECVDANTSSQTIDLLKQNSDNSRLGQFYGE

Query:  LQALLPTSPALNLHLHEEQ
        LQALLPTSPALNLHLHEEQ
Subjt:  LQALLPTSPALNLHLHEEQ

A0A6J1JSZ3 uncharacterized protein LOC111487537 isoform X1 | e value: 3.1e-111 | % identity: 97.26
Query:  MEGTIAEELYSEGLQSTKSKLADISLSDYRQNRSSGSAVNDSCEDDGSLWGGSDEGLEETSDLDREWHRRHDQFHTIGYRDGLIAGKEAASQEGFNVGFK
        MEGTIAEELYSE LQSTKSKLAD+SLSDYRQNR SGSAVNDSCEDDGSLWGGSDEGLEETSDLDREWHRRHDQFHTIGYRDGLIAGKEAASQEGFNVGFK
Subjt:  MEGTIAEELYSEGLQSTKSKLADISLSDYRQNRSSGSAVNDSCEDDGSLWGGSDEGLEETSDLDREWHRRHDQFHTIGYRDGLIAGKEAASQEGFNVGFK

Query:  QSVSIGYKLGLVRGVSSVLASLPDDLKEKLVGIEENRSKFQSLYESVNSLSTADALRLFHDDIMAQCTKEECVDANTSSQTIDLLKQNSDNSRLGQFYGE
        QSVSIGYKLGLVRGVSSVLASLPDDLKEKLVGIEENRSKFQSLYESVNSLSTADALRLFHDDIMAQ TKEE VDANTSSQTIDLLKQNSD SRLGQFYGE
Subjt:  QSVSIGYKLGLVRGVSSVLASLPDDLKEKLVGIEENRSKFQSLYESVNSLSTADALRLFHDDIMAQCTKEECVDANTSSQTIDLLKQNSDNSRLGQFYGE

Query:  LQALLPTSPALNLHLHEEQ
        LQALLPTSPALNLHLHEEQ
Subjt:  LQALLPTSPALNLHLHEEQ
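
The % identity column can be related to the middle line of each alignment block. The sketch below assumes the usual BLAST-style convention, where a letter on the middle line marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap, and it assumes identity is reported as identical positions divided by alignment length. The helper names are hypothetical, and the sample middle line is the final block of the A0A1S4DX60 alignment.

```python
def alignment_stats(midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, length) for one alignment middle line.

    Letters count as identities; '+' counts as a positive (conservative)
    match; spaces count as mismatches or gaps.
    """
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives, len(midline)


def percent_identity(identities: int, length: int) -> float:
    """Identity as a percentage, rounded to two decimals."""
    return round(100.0 * identities / length, 2)


# Middle line of the last block of the A0A1S4DX60 alignment (19 columns).
identities, positives, length = alignment_stats("L A LP SPALN+HLHEE+")
print(identities, positives, length)  # 14 16 19

# A full-length hit with 6 non-identical columns out of 219:
print(percent_identity(213, 219))  # 97.26
```

Summing the per-block counts over a whole alignment gives the totals to pass to `percent_identity`; the 97.26 values listed for the 219-residue hits appear consistent with 213 identical positions.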

SwissProt top hits (columns: hit | e value | % identity; the alignment follows each hit)
Q9NRH1 Protein YAE1 homolog | e value: 8.5e-05 | % identity: 38.24
Query:  DEGLEETSDLDREWHRRHDQFHTIGYRDGLIAGKEAASQEGFNVGFKQSVSIGYKLGLVRGVSSVLAS
        DE  +E+    REW     +    GYRDG+ AGK    Q+GFN G+K+   +    G +RG  S L S
Subjt:  DEGLEETSDLDREWHRRHDQFHTIGYRDGLIAGKEAASQEGFNVGFKQSVSIGYKLGLVRGVSSVLAS

Arabidopsis top hits (columns: hit | e value | % identity; the alignment follows each hit)
AT1G34550.1 Protein of unknown function (DUF616) | e value: 2.1e-06 | % identity: 34.74
Query:  SVLASLPDDLKEKLVGIEENRSKFQSLYESVNSLSTADALRLFHDDIMAQCTKEECVDANTSSQTIDLLKQNSDNSRLGQFYGELQALLPTSPAL
        +VLA LPD+L+EKL+  +E R KFQ L+  V++LST  A++ F+  +    T +E +  +  + T D           G +  EL +LL  SP +
Subjt:  SVLASLPDDLKEKLVGIEENRSKFQSLYESVNSLSTADALRLFHDDIMAQCTKEECVDANTSSQTIDLLKQNSDNSRLGQFYGELQALLPTSPAL

AT1G34570.1 Essential protein Yae1, N-terminal | e value: 9.2e-31 | % identity: 37.8
Query:  AEELYSEGLQSTKSKLADISLSDYRQNRSSGSAVNDSCEDDGSLWGGSDEGLEETSDLDREWHRRHDQFHTIGYRDGLIAGKEAASQEGFNVGFKQSVSI
        A+ELY E LQ +K    D  L +                 D   +G SDE   E   LD E  +R  +FH+ GYRDG++ GKEA +QEG+N G+K+SV  
Subjt:  AEELYSEGLQSTKSKLADISLSDYRQNRSSGSAVNDSCEDDGSLWGGSDEGLEETSDLDREWHRRHDQFHTIGYRDGLIAGKEAASQEGFNVGFKQSVSI

Query:  GYKLGLVRGVSSVLASLPDDLKEKLVGIEENRSKFQSLYESVNSLSTADALRLFHDDIMAQCTKEECVDANTSSQTID-LLKQNSDNSRLGQFYGELQAL
        GYK G+VRGVSS LA LP + +EKL+  +E R KFQ L+ SV++LST  A++ F++ +  +  +E+  +    S ++       +  + LG +  EL +L
Subjt:  GYKLGLVRGVSSVLASLPDDLKEKLVGIEENRSKFQSLYESVNSLSTADALRLFHDDIMAQCTKEECVDANTSSQTID-LLKQNSDNSRLGQFYGELQAL

Query:  LPTSPALNL
        L  SP + +
Subjt:  LPTSPALNL

AT3G15750.1 Essential protein Yae1, N-terminal | e value: 6.6e-29 | % identity: 39.78
Query:  GSAVNDSCEDDGSLWGGSDEGLEETSDLDREWHRRHDQFHTIGYRDGLIAGKEAASQEGFNVGFKQSVSIGYKLGLVRGVSSVLASLPDDLKEKLVGIEE
        G +V  S  DD       +E   E   L  E  +R  +FH+ GYRDG++AGKEA +QEG+N G+K+SV  GY+ GLVRGVSS LA LPD+L+EKL+  +E
Subjt:  GSAVNDSCEDDGSLWGGSDEGLEETSDLDREWHRRHDQFHTIGYRDGLIAGKEAASQEGFNVGFKQSVSIGYKLGLVRGVSSVLASLPDDLKEKLVGIEE

Query:  NRSKFQSLYESVNSLSTADALRLFHDDIMAQCTKEE--------CVDANTSSQTIDLLKQNSDNSRLGQFYGELQALLPTSPALNL
         R KFQ L+ SV++LST  A++ F++ +  +  +E+        C D+ + S         +  + LG +  EL +LL  SP + +
Subjt:  NRSKFQSLYESVNSLSTADALRLFHDDIMAQCTKEE--------CVDANTSSQTIDLLKQNSDNSRLGQFYGELQALLPTSPALNL


Sequences
CDS sequence
ATGGAGGGTACCATTGCTGAAGAACTTTATTCCGAGGGTTTGCAGTCAACAAAATCAAAATTGGCTGACATATCATTATCTGATTACAGACAAAATCGGTCATCAGGTTC
TGCTGTGAATGACTCGTGTGAAGATGATGGATCTTTATGGGGTGGTTCGGATGAAGGCTTGGAGGAAACATCTGATCTGGATAGGGAGTGGCATAGAAGACACGACCAAT
TTCATACGATTGGGTACCGTGATGGTTTAATTGCTGGGAAAGAAGCTGCATCTCAAGAGGGATTTAATGTTGGCTTCAAGCAGTCAGTCTCTATCGGATATAAGTTGGGT
CTTGTCAGAGGTGTTAGCAGTGTGCTTGCTTCCCTTCCTGATGACTTGAAAGAGAAGCTAGTAGGGATTGAAGAAAACAGAAGTAAATTCCAAAGCTTGTATGAATCTGT
GAACTCTCTTTCGACAGCAGATGCACTTAGGCTATTCCACGACGATATTATGGCACAATGCACGAAAGAAGAATGCGTCGATGCAAATACGAGTTCGCAAACAATAGATT
TGTTGAAGCAAAATTCAGATAACAGTCGCCTAGGACAGTTCTATGGAGAGCTTCAAGCACTGTTGCCCACATCACCTGCTCTGAATCTTCATTTACATGAAGAACAGTAG
mRNA sequence
TGTGTACATTGAATTCAATCTACACAATGGGAACCTGTATATAAAGATTGATAATGTATGCCCGTTTGATACTAAGATACTCAGTGAACATGAATTGTTGGTGTTTAGAC
GTTAATAGAAATGCTATCCTACGGCTGGTTATTTTGATTTAATAGCTGATAGTATCTTTAGTATTTATGACGTGGTTGGAGAACCCATTTTGAGTTATTTCCCTTAGAAA
TTAGTAGACGTGAATTAAATTGGAATTTTGCATGTTGAGTACCGACCTTAGGCTGGAAACTGAATAATAGGAATGATGCCCAATGCTCTAGGTGTTCTTGCTTGGTTGTA
ATTCATCAATTTCACAGCTGGTGATCCGTTCTGACACAGAATCTGAGTCTGGGTTTTAGGAACTGATTGAGAAGAAGTTTTGTTTGTGATGGATTGATGATATAAATGGT
TTATGATTAGCTCAGGTTTTGCGGATTTAGCGAGATAAGTGAATCTCGCGATGTTCATAATTTTGATGGGTATCCCACCTCTCTTCCTTTCCTTCTCGACGTACAATTGA
TCTTTCTGCATTTCGGTTATTGGGGTGTTTAAGCTTTCTTTGAGCAAAATTGGGGCTTTAATTTGTTGTCTAGTTCTAGTTTTATGAAAAGATAATGGCATGGGAAATTG
TTGATGGGCGAAGATGTTCTGCTGACACAGATGTTGCCAATCGTTTAGTTCTAATATAACCATTGTTGTATGAAATGCTTATCTGTTTTTGTGGAATTGAGTGTATAGGC
CCAATGCTTCTACTAATTGGTTTTGTCGAGAACCCTTCGTTTATTTTTTTTTTTTCCCGCTATGTTGCGATAATTTAAGTAAGTCATTGTACCTTGAAATCAAGTTCACT
ATAAGGTTTTCTTTTATTCGTTGCTGAAGATTGCAATCGCAATCGACCTAATTCTGTTAGATGGAGGGTACCATTGCTGAAGAACTTTATTCCGAGGGTTTGCAGTCAAC
AAAATCAAAATTGGCTGACATATCATTATCTGATTACAGACAAAATCGGTCATCAGGTTCTGCTGTGAATGACTCGTGTGAAGATGATGGATCTTTATGGGGTGGTTCGG
ATGAAGGCTTGGAGGAAACATCTGATCTGGATAGGGAGTGGCATAGAAGACACGACCAATTTCATACGATTGGGTACCGTGATGGTTTAATTGCTGGGAAAGAAGCTGCA
TCTCAAGAGGGATTTAATGTTGGCTTCAAGCAGTCAGTCTCTATCGGATATAAGTTGGGTCTTGTCAGAGGTGTTAGCAGTGTGCTTGCTTCCCTTCCTGATGACTTGAA
AGAGAAGCTAGTAGGGATTGAAGAAAACAGAAGTAAATTCCAAAGCTTGTATGAATCTGTGAACTCTCTTTCGACAGCAGATGCACTTAGGCTATTCCACGACGATATTA
TGGCACAATGCACGAAAGAAGAATGCGTCGATGCAAATACGAGTTCGCAAACAATAGATTTGTTGAAGCAAAATTCAGATAACAGTCGCCTAGGACAGTTCTATGGAGAG
CTTCAAGCACTGTTGCCCACATCACCTGCTCTGAATCTTCATTTACATGAAGAACAGTAGATTCAATGCAACTTTTGTAGGTTCAGCATTTGTTAGGTTGAAATTTTGGG
TATTGATCTCTTCTGTCAGTTATGTATAATATTTACATTTGGGAATTTCCATATTCTCCTTTAATCCCGAACCAACCAACTCAAAGGAAAGTTTCGAAATCGATATGCTC
TAATTTTCTATT
Protein sequence
MEGTIAEELYSEGLQSTKSKLADISLSDYRQNRSSGSAVNDSCEDDGSLWGGSDEGLEETSDLDREWHRRHDQFHTIGYRDGLIAGKEAASQEGFNVGFKQSVSIGYKLG
LVRGVSSVLASLPDDLKEKLVGIEENRSKFQSLYESVNSLSTADALRLFHDDIMAQCTKEECVDANTSSQTIDLLKQNSDNSRLGQFYGELQALLPTSPALNLHLHEEQ
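
The protein sequence can be cross-checked against the CDS by translating it codon by codon. This is a minimal Python sketch that assumes the standard nuclear genetic code and reading frame 0; `translate` is a hypothetical helper, and only the first 30 and last 15 bases of the CDS above are used to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip(("".join(codon) for codon in product(BASES, repeat=3)), AMINO_ACIDS)
)


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Raises KeyError for codons containing characters other than A, C, G, T.
    """
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    residues = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":
            break
        residues.append(residue)
    return "".join(residues)


print(translate("ATGGAGGGTACCATTGCTGAAGAACTTTAT"))  # MEGTIAEELY
print(translate("CATGAAGAACAGTAG"))                 # HEEQ
```

Both fragments match the start and end of the protein sequence above; passing the full CDS (line breaks included) to `translate` would allow the whole record to be compared.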