; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0019229 (gene) of Snake gourd v1 genome

Gene IDTan0019229
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionYae1_N domain-containing protein
Genome locationLG09:73772071..73775698
RNA-Seq ExpressionTan0019229
SyntenyTan0019229
Gene Ontology termsGO:0000398 - mRNA splicing, via spliceosome (biological process)
GO:0005681 - spliceosomal complex (cellular component)
GO:0005737 - cytoplasm (cellular component)
GO:0004386 - helicase activity (molecular function)
InterPro domainsIPR019191 - Essential protein Yae1, N-terminal
IPR038881 - Yae1-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022990784.1 uncharacterized protein LOC111487537 isoform X1 [Cucurbita maxima]3.7e-9382.19Show/hide
Query:  MEGTMAEELYSESLQSTKPKLGDMSSSDCRQNRPSGSGVDDSCQDDGSLWGGSDEGLEETCDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFK
        MEGT+AEELYSESLQSTK KL DMS SD RQNRPSGS V+DSC+DDGSLWGGSDEGLEET DLDREW RRHDQFHTIGYRDGLIAGKEAA+QEGFNVGFK
Subjt:  MEGTMAEELYSESLQSTKPKLGDMSSSDCRQNRPSGSGVDDSCQDDGSLWGGSDEGLEETCDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFK

Query:  QSVSVGYKLGLVRGVSSVLASLPDHLKEKLTGSEKNRSKFQSLYESVNSLSTAGALRLFNDEITTQGMKEECIDAGVNSQPIDMLKQNSDYDRLGKFYGE
        QSVS+GYKLGLVRGVSSVLASLPD LKEKL G E+NRSKFQSLYESVNSLSTA ALRLF+D+I  Q  KEE +DA  +SQ ID+LKQNSDY RLG+FYGE
Subjt:  QSVSVGYKLGLVRGVSSVLASLPDHLKEKLTGSEKNRSKFQSLYESVNSLSTAGALRLFNDEITTQGMKEECIDAGVNSQPIDMLKQNSDYDRLGKFYGE

Query:  LQELLPKSPALNVHHKDQQ
        LQ LLP SPALN+H  ++Q
Subjt:  LQELLPKSPALNVHHKDQQ

XP_022990940.1 uncharacterized protein LOC111487537 isoform X3 [Cucurbita maxima]3.7e-9382.19Show/hide
Query:  MEGTMAEELYSESLQSTKPKLGDMSSSDCRQNRPSGSGVDDSCQDDGSLWGGSDEGLEETCDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFK
        MEGT+AEELYSESLQSTK KL DMS SD RQNRPSGS V+DSC+DDGSLWGGSDEGLEET DLDREW RRHDQFHTIGYRDGLIAGKEAA+QEGFNVGFK
Subjt:  MEGTMAEELYSESLQSTKPKLGDMSSSDCRQNRPSGSGVDDSCQDDGSLWGGSDEGLEETCDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFK

Query:  QSVSVGYKLGLVRGVSSVLASLPDHLKEKLTGSEKNRSKFQSLYESVNSLSTAGALRLFNDEITTQGMKEECIDAGVNSQPIDMLKQNSDYDRLGKFYGE
        QSVS+GYKLGLVRGVSSVLASLPD LKEKL G E+NRSKFQSLYESVNSLSTA ALRLF+D+I  Q  KEE +DA  +SQ ID+LKQNSDY RLG+FYGE
Subjt:  QSVSVGYKLGLVRGVSSVLASLPDHLKEKLTGSEKNRSKFQSLYESVNSLSTAGALRLFNDEITTQGMKEECIDAGVNSQPIDMLKQNSDYDRLGKFYGE

Query:  LQELLPKSPALNVHHKDQQ
        LQ LLP SPALN+H  ++Q
Subjt:  LQELLPKSPALNVHHKDQQ

XP_023514282.1 uncharacterized protein LOC111778596 isoform X1 [Cucurbita pepo subsp. pepo]2.1e-9382.19Show/hide
Query:  MEGTMAEELYSESLQSTKPKLGDMSSSDCRQNRPSGSGVDDSCQDDGSLWGGSDEGLEETCDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFK
        MEGT+AEELYSESLQSTK KL DMS SD RQNRPSGS V+DSC+DDGSLWGGSDEGLEET DLDREW RRHDQFHTIGYRDGLIAGKEAA+QEGFNVGFK
Subjt:  MEGTMAEELYSESLQSTKPKLGDMSSSDCRQNRPSGSGVDDSCQDDGSLWGGSDEGLEETCDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFK

Query:  QSVSVGYKLGLVRGVSSVLASLPDHLKEKLTGSEKNRSKFQSLYESVNSLSTAGALRLFNDEITTQGMKEECIDAGVNSQPIDMLKQNSDYDRLGKFYGE
        QSVS+GYKLGLVRGVSSVLASLPD LKEKL G E+NRSKFQSLYESVNSLSTA ALRLF+++I  Q  KEEC+DA  +SQ ID+LKQNSD  RLGKFYGE
Subjt:  QSVSVGYKLGLVRGVSSVLASLPDHLKEKLTGSEKNRSKFQSLYESVNSLSTAGALRLFNDEITTQGMKEECIDAGVNSQPIDMLKQNSDYDRLGKFYGE

Query:  LQELLPKSPALNVHHKDQQ
        LQ LLP SPALN+H  ++Q
Subjt:  LQELLPKSPALNVHHKDQQ

XP_023514291.1 uncharacterized protein LOC111778596 isoform X2 [Cucurbita pepo subsp. pepo]2.1e-9382.19Show/hide
Query:  MEGTMAEELYSESLQSTKPKLGDMSSSDCRQNRPSGSGVDDSCQDDGSLWGGSDEGLEETCDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFK
        MEGT+AEELYSESLQSTK KL DMS SD RQNRPSGS V+DSC+DDGSLWGGSDEGLEET DLDREW RRHDQFHTIGYRDGLIAGKEAA+QEGFNVGFK
Subjt:  MEGTMAEELYSESLQSTKPKLGDMSSSDCRQNRPSGSGVDDSCQDDGSLWGGSDEGLEETCDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFK

Query:  QSVSVGYKLGLVRGVSSVLASLPDHLKEKLTGSEKNRSKFQSLYESVNSLSTAGALRLFNDEITTQGMKEECIDAGVNSQPIDMLKQNSDYDRLGKFYGE
        QSVS+GYKLGLVRGVSSVLASLPD LKEKL G E+NRSKFQSLYESVNSLSTA ALRLF+++I  Q  KEEC+DA  +SQ ID+LKQNSD  RLGKFYGE
Subjt:  QSVSVGYKLGLVRGVSSVLASLPDHLKEKLTGSEKNRSKFQSLYESVNSLSTAGALRLFNDEITTQGMKEECIDAGVNSQPIDMLKQNSDYDRLGKFYGE

Query:  LQELLPKSPALNVHHKDQQ
        LQ LLP SPALN+H  ++Q
Subjt:  LQELLPKSPALNVHHKDQQ

XP_038891531.1 protein YAE1-like [Benincasa hispida]2.5e-9483.49Show/hide
Query:  MEGTMAEELYSESLQSTKPKLGDMSSSDCRQNRPSGSGVDDSCQDDGSLWGGSDEGLEETCDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFK
        MEGT+AEELYS+SLQS K KLGDMS S+ RQNRPSGSGVDDSC+DDGSLWGGSDEGLE T DLDREWQRRH QFHTIGYRDG+IAGKEAAAQEGFNVGFK
Subjt:  MEGTMAEELYSESLQSTKPKLGDMSSSDCRQNRPSGSGVDDSCQDDGSLWGGSDEGLEETCDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFK

Query:  QSVSVGYKLGLVRGVSSVLASLPDHLKEKLTGSEKNRSKFQSLYESVNSLSTAGALRLFNDEITTQGMKEECIDAGVNSQPIDMLKQNSDYDRLGKFYGE
        QSVS+GYK GLVRGVSSVLASLPD LKEKL G E+NRSKFQSLYESVNSLST  ALRLFNDEITTQ MKEECI+A  N++ ID+LKQ SDY RLGKFY E
Subjt:  QSVSVGYKLGLVRGVSSVLASLPDHLKEKLTGSEKNRSKFQSLYESVNSLSTAGALRLFNDEITTQGMKEECIDAGVNSQPIDMLKQNSDYDRLGKFYGE

Query:  LQELLPKSPALNVHHKDQ
        L+ LLPKSPALNVH  +Q
Subjt:  LQELLPKSPALNVHHKDQ

TrEMBL top hitse value%identityAlignment
A0A1S4DX60 uncharacterized protein LOC1034894876.3e-9179.91Show/hide
Query:  MEGTMAEELYSESLQSTKPKLGDMSSSDCRQNRPSGSGVDDSCQDDGSLWGGSDEGLEETCDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFK
        MEGT+AEELYSESLQSTK KLGD+S SD +++RPSGSGVDDSC+DDGSLWGGSDEGLEET DLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFN+GFK
Subjt:  MEGTMAEELYSESLQSTKPKLGDMSSSDCRQNRPSGSGVDDSCQDDGSLWGGSDEGLEETCDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFK

Query:  QSVSVGYKLGLVRGVSSVLASLPDHLKEKLTGSEKNRSKFQSLYESVNSLSTAGALRLFNDEITTQGMKEECIDAGVNSQPIDMLKQNSDYDRLGKFYGE
        QSVSVGYKLGLVRGVSSVLASLPD LKEKL G+ +N+SKFQSLYESVNSLST  ALRLFN +ITTQ  KEE + A  NSQ +D+LK+N DY RLGKFY E
Subjt:  QSVSVGYKLGLVRGVSSVLASLPDHLKEKLTGSEKNRSKFQSLYESVNSLSTAGALRLFNDEITTQGMKEECIDAGVNSQPIDMLKQNSDYDRLGKFYGE

Query:  LQELLPKSPALNVHHKDQQ
        L   LP+SPALNVH  +++
Subjt:  LQELLPKSPALNVHHKDQQ

A0A6J1H273 uncharacterized protein LOC1114589035.2e-9381.74Show/hide
Query:  MEGTMAEELYSESLQSTKPKLGDMSSSDCRQNRPSGSGVDDSCQDDGSLWGGSDEGLEETCDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFK
        MEGT+AEELYSESLQSTK KL D+S SD RQNRPSGS V+DSC+DDGSLWGGSDE LEET DLDREW RRHDQFHTIGYRDGLIAGKEAA+QEGFNVGFK
Subjt:  MEGTMAEELYSESLQSTKPKLGDMSSSDCRQNRPSGSGVDDSCQDDGSLWGGSDEGLEETCDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFK

Query:  QSVSVGYKLGLVRGVSSVLASLPDHLKEKLTGSEKNRSKFQSLYESVNSLSTAGALRLFNDEITTQGMKEECIDAGVNSQPIDMLKQNSDYDRLGKFYGE
        QSVS+GYKLGLVRGVSSVLASLPD LKEKL G E+ RSKFQSLYESVNSLSTA ALRLF+D+I  Q  KEEC+DA  +SQ ID+LKQNSDY RLG+FYGE
Subjt:  QSVSVGYKLGLVRGVSSVLASLPDHLKEKLTGSEKNRSKFQSLYESVNSLSTAGALRLFNDEITTQGMKEECIDAGVNSQPIDMLKQNSDYDRLGKFYGE

Query:  LQELLPKSPALNVHHKDQQ
        LQ LLPKSPALN+H  ++Q
Subjt:  LQELLPKSPALNVHHKDQQ

A0A6J1JR95 uncharacterized protein LOC111487537 isoform X21.8e-9382.19Show/hide
Query:  MEGTMAEELYSESLQSTKPKLGDMSSSDCRQNRPSGSGVDDSCQDDGSLWGGSDEGLEETCDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFK
        MEGT+AEELYSESLQSTK KL DMS SD RQNRPSGS V+DSC+DDGSLWGGSDEGLEET DLDREW RRHDQFHTIGYRDGLIAGKEAA+QEGFNVGFK
Subjt:  MEGTMAEELYSESLQSTKPKLGDMSSSDCRQNRPSGSGVDDSCQDDGSLWGGSDEGLEETCDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFK

Query:  QSVSVGYKLGLVRGVSSVLASLPDHLKEKLTGSEKNRSKFQSLYESVNSLSTAGALRLFNDEITTQGMKEECIDAGVNSQPIDMLKQNSDYDRLGKFYGE
        QSVS+GYKLGLVRGVSSVLASLPD LKEKL G E+NRSKFQSLYESVNSLSTA ALRLF+D+I  Q  KEE +DA  +SQ ID+LKQNSDY RLG+FYGE
Subjt:  QSVSVGYKLGLVRGVSSVLASLPDHLKEKLTGSEKNRSKFQSLYESVNSLSTAGALRLFNDEITTQGMKEECIDAGVNSQPIDMLKQNSDYDRLGKFYGE

Query:  LQELLPKSPALNVHHKDQQ
        LQ LLP SPALN+H  ++Q
Subjt:  LQELLPKSPALNVHHKDQQ

A0A6J1JRH1 uncharacterized protein LOC111487537 isoform X31.8e-9382.19Show/hide
Query:  MEGTMAEELYSESLQSTKPKLGDMSSSDCRQNRPSGSGVDDSCQDDGSLWGGSDEGLEETCDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFK
        MEGT+AEELYSESLQSTK KL DMS SD RQNRPSGS V+DSC+DDGSLWGGSDEGLEET DLDREW RRHDQFHTIGYRDGLIAGKEAA+QEGFNVGFK
Subjt:  MEGTMAEELYSESLQSTKPKLGDMSSSDCRQNRPSGSGVDDSCQDDGSLWGGSDEGLEETCDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFK

Query:  QSVSVGYKLGLVRGVSSVLASLPDHLKEKLTGSEKNRSKFQSLYESVNSLSTAGALRLFNDEITTQGMKEECIDAGVNSQPIDMLKQNSDYDRLGKFYGE
        QSVS+GYKLGLVRGVSSVLASLPD LKEKL G E+NRSKFQSLYESVNSLSTA ALRLF+D+I  Q  KEE +DA  +SQ ID+LKQNSDY RLG+FYGE
Subjt:  QSVSVGYKLGLVRGVSSVLASLPDHLKEKLTGSEKNRSKFQSLYESVNSLSTAGALRLFNDEITTQGMKEECIDAGVNSQPIDMLKQNSDYDRLGKFYGE

Query:  LQELLPKSPALNVHHKDQQ
        LQ LLP SPALN+H  ++Q
Subjt:  LQELLPKSPALNVHHKDQQ

A0A6J1JSZ3 uncharacterized protein LOC111487537 isoform X11.8e-9382.19Show/hide
Query:  MEGTMAEELYSESLQSTKPKLGDMSSSDCRQNRPSGSGVDDSCQDDGSLWGGSDEGLEETCDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFK
        MEGT+AEELYSESLQSTK KL DMS SD RQNRPSGS V+DSC+DDGSLWGGSDEGLEET DLDREW RRHDQFHTIGYRDGLIAGKEAA+QEGFNVGFK
Subjt:  MEGTMAEELYSESLQSTKPKLGDMSSSDCRQNRPSGSGVDDSCQDDGSLWGGSDEGLEETCDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFK

Query:  QSVSVGYKLGLVRGVSSVLASLPDHLKEKLTGSEKNRSKFQSLYESVNSLSTAGALRLFNDEITTQGMKEECIDAGVNSQPIDMLKQNSDYDRLGKFYGE
        QSVS+GYKLGLVRGVSSVLASLPD LKEKL G E+NRSKFQSLYESVNSLSTA ALRLF+D+I  Q  KEE +DA  +SQ ID+LKQNSDY RLG+FYGE
Subjt:  QSVSVGYKLGLVRGVSSVLASLPDHLKEKLTGSEKNRSKFQSLYESVNSLSTAGALRLFNDEITTQGMKEECIDAGVNSQPIDMLKQNSDYDRLGKFYGE

Query:  LQELLPKSPALNVHHKDQQ
        LQ LLP SPALN+H  ++Q
Subjt:  LQELLPKSPALNVHHKDQQ

SwissProt top hitse value%identityAlignment
Q9NRH1 Protein YAE1 homolog7.9e-0641.18Show/hide
Query:  DEGLEETCDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFKQSVSVGYKLGLVRGVSSVLAS
        DE  +E+    REWQ    +    GYRDG+ AGK    Q+GFN G+K+   V    G +RG  S L S
Subjt:  DEGLEETCDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFKQSVSVGYKLGLVRGVSSVLAS

Arabidopsis top hitse value%identityAlignment
AT1G34550.1 Protein of unknown function (DUF616)3.9e-0840Show/hide
Query:  SVLASLPDHLKEKLTGSEKNRSKFQSLYESVNSLSTAGALRLFNDEITTQGMKEECIDAGVNSQPIDMLKQNSDYDRLGKFYGELQELLPKSPAL
        +VLA LPD L+EKL   ++ R KFQ L+  V++LST  A++ F   +TT   KE    +GVN+         SD+   G +  EL  LL KSP +
Subjt:  SVLASLPDHLKEKLTGSEKNRSKFQSLYESVNSLSTAGALRLFNDEITTQGMKEECIDAGVNSQPIDMLKQNSDYDRLGKFYGELQELLPKSPAL

AT1G34570.1 Essential protein Yae1, N-terminal2.3e-3241.15Show/hide
Query:  AEELYSESLQSTKPKLGDMSSSDCRQNRPSGSGVDDSCQDDGSLWGGSDEGLEETCDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFKQSVSV
        A+ELY ESLQ +K     +S+ D         G+++    D   +G SDE   E   LD E ++R  +FH+ GYRDG++ GKEA AQEG+N G+K+SV  
Subjt:  AEELYSESLQSTKPKLGDMSSSDCRQNRPSGSGVDDSCQDDGSLWGGSDEGLEETCDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFKQSVSV

Query:  GYKLGLVRGVSSVLASLPDHLKEKLTGSEKNRSKFQSLYESVNSLSTAGALRLFNDEITTQGMKEECIDAGVNSQPIDMLKQNSDYDR-LGKFYGELQEL
        GYK G+VRGVSS LA LP   +EKL   ++ R KFQ L+ SV++LST  A++ F + +TT+  +E+  + G +S  +       +    LG +  EL  L
Subjt:  GYKLGLVRGVSSVLASLPDHLKEKLTGSEKNRSKFQSLYESVNSLSTAGALRLFNDEITTQGMKEECIDAGVNSQPIDMLKQNSDYDR-LGKFYGELQEL

Query:  LPKSPALNV
        L KSP + V
Subjt:  LPKSPALNV

AT3G15750.1 Essential protein Yae1, N-terminal2.8e-3043.01Show/hide
Query:  GSGVDDSCQDDGSLWGGSDEGLEETCDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFKQSVSVGYKLGLVRGVSSVLASLPDHLKEKLTGSEK
        G  V  S  DD       +E   E   L  E ++R  +FH+ GYRDG++AGKEA AQEG+N G+K+SV  GY+ GLVRGVSS LA LPD L+EKL   ++
Subjt:  GSGVDDSCQDDGSLWGGSDEGLEETCDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFKQSVSVGYKLGLVRGVSSVLASLPDHLKEKLTGSEK

Query:  NRSKFQSLYESVNSLSTAGALRLFNDEITTQGMKEE--------CIDAGVNSQPIDMLKQNSDYDRLGKFYGELQELLPKSPALNV
         R KFQ L+ SV++LST  A++ F + +TT+  +E+        C D+G  S     +   +D   LG +  EL  LL KSP + V
Subjt:  NRSKFQSLYESVNSLSTAGALRLFNDEITTQGMKEE--------CIDAGVNSQPIDMLKQNSDYDRLGKFYGELQELLPKSPALNV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGGAGGGTACCATGGCCGAAGAACTTTATTCCGAAAGCTTACAGTCAACAAAACCAAAATTAGGTGATATGTCATCTTCTGATTGCAGACAAAATCGGCCATCAGG
TTCTGGTGTGGATGACTCGTGCCAAGATGATGGATCTTTATGGGGTGGTTCTGATGAAGGCTTGGAGGAAACATGTGATTTGGACAGGGAGTGGCAGAGGAGACATGACC
AATTTCACACAATTGGATACCGCGATGGTTTAATCGCTGGGAAAGAAGCTGCAGCTCAAGAGGGATTTAATGTTGGCTTCAAGCAGTCAGTCTCCGTCGGGTATAAGTTG
GGTCTTGTCAGAGGTGTTAGCAGTGTGCTTGCTTCCCTTCCTGATCACTTGAAAGAGAAACTAACAGGAAGTGAAAAGAACAGAAGTAAATTCCAAAGCTTGTATGAATC
TGTGAACTCTCTTTCGACAGCAGGAGCGCTTAGGCTATTCAATGACGAGATTACGACACAAGGAATGAAAGAGGAGTGTATCGATGCCGGTGTAAATTCCCAACCAATAG
ACATGTTGAAGCAAAATTCAGATTATGATCGTCTAGGAAAGTTCTATGGAGAGCTTCAAGAACTTTTACCCAAATCACCTGCTCTAAATGTTCATCACAAAGATCAGCAG
AAATCTGCTGTAACTTAG
mRNA sequenceShow/hide mRNA sequence
AAATTTGTAATTTAACGAAATAAATTTGCTAGAAGCCTAGAACTCGATTGCCAGTTCTCTCTCTACACGGTGTCCTCTTCTCAGTTCTCACCAATCTCAATCTCTGCTTT
ATCCCACTCGACGGTCGTCCCATCTCTCAGTGATCGTCATCGTCACGCACGGCTGACCTAGTTTTCGCCTGGCAGGTTCAGTCCTCACGCCCCTTCTTTCTCGATATTCT
CAGTTTGGTTGCTTTATATTATTGTCTCTCCGCGTACGTTGAATTGAATTCACCCAAGGATCGAATGGAACCTGTGTAAAAAGTTTGATAATGCATACTCACTTGATCCT
TAGATACTCCGTGAACATGAATTGTCGATATGATATGATATGATGGAGGGTACCATGGCCGAAGAACTTTATTCCGAAAGCTTACAGTCAACAAAACCAAAATTAGGTGA
TATGTCATCTTCTGATTGCAGACAAAATCGGCCATCAGGTTCTGGTGTGGATGACTCGTGCCAAGATGATGGATCTTTATGGGGTGGTTCTGATGAAGGCTTGGAGGAAA
CATGTGATTTGGACAGGGAGTGGCAGAGGAGACATGACCAATTTCACACAATTGGATACCGCGATGGTTTAATCGCTGGGAAAGAAGCTGCAGCTCAAGAGGGATTTAAT
GTTGGCTTCAAGCAGTCAGTCTCCGTCGGGTATAAGTTGGGTCTTGTCAGAGGTGTTAGCAGTGTGCTTGCTTCCCTTCCTGATCACTTGAAAGAGAAACTAACAGGAAG
TGAAAAGAACAGAAGTAAATTCCAAAGCTTGTATGAATCTGTGAACTCTCTTTCGACAGCAGGAGCGCTTAGGCTATTCAATGACGAGATTACGACACAAGGAATGAAAG
AGGAGTGTATCGATGCCGGTGTAAATTCCCAACCAATAGACATGTTGAAGCAAAATTCAGATTATGATCGTCTAGGAAAGTTCTATGGAGAGCTTCAAGAACTTTTACCC
AAATCACCTGCTCTAAATGTTCATCACAAAGATCAGCAGAAATCTGCTGTAACTTAGCATTGATATTTTTGCTTAGGAGAAAGAGGTTTTGTTAGTAATTTTATCCACGC
AACTTTGGTAGATTTAGCATTTATGTGGAAATTTTAAATATTGACCTTGTCTGATATTTGTTAAAATCGACATGTTATCCTTTCAACTTGTGATGGATTTCATTGTGGTT
GGAGGGATAATATATATACATATACATACATATATATGAGGAGCCAATCTTTCGTTGGGAAACATGAAGGATTTACAATGGCATAAAAAAC
Protein sequenceShow/hide protein sequence
MMEGTMAEELYSESLQSTKPKLGDMSSSDCRQNRPSGSGVDDSCQDDGSLWGGSDEGLEETCDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFKQSVSVGYKL
GLVRGVSSVLASLPDHLKEKLTGSEKNRSKFQSLYESVNSLSTAGALRLFNDEITTQGMKEECIDAGVNSQPIDMLKQNSDYDRLGKFYGELQELLPKSPALNVHHKDQQ
KSAVT