CuGenDBv2

MS004252 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS004252
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: Yae1_N domain-containing protein
Genome location: scaffold92:1189530..1192188
RNA-Seq Expression: MS004252
Synteny: MS004252
Gene Ontology terms:
GO:0000398 - mRNA splicing, via spliceosome (biological process)
GO:0005681 - spliceosomal complex (cellular component)
GO:0005737 - cytoplasm (cellular component)
GO:0004386 - helicase activity (molecular function)
InterPro domains:
IPR019191 - Essential protein Yae1, N-terminal
IPR038881 - Yae1-like
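
The genome location in this record follows a `scaffold:start..end` pattern. The following is a minimal sketch for splitting such a string into its fields. The function names `parse_location` and `span_length` are hypothetical, and the span calculation assumes 1-based, inclusive coordinates, which the record itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'scaffold:start..end' string into (scaffold, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    scaffold, start, end = match.groups()
    return scaffold, int(start), int(end)


def span_length(start: int, end: int) -> int:
    """Length of the genomic span.

    Assumption: coordinates are 1-based and inclusive (not confirmed
    by the source record).
    """
    return end - start + 1


scaffold, start, end = parse_location("scaffold92:1189530..1192188")
print(scaffold, start, end, span_length(start, end))
```

Under the stated coordinate assumption, the span covers the whole gene locus, introns included, so it is expected to be longer than the CDS.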


Homology
GenBank top hits (e value, % identity, alignment)
KAG6601147.1 Crossover junction endonuclease MUS81, partial [Cucurbita argyrosperma subsp. sororia] (e value: 7.4e-91; % identity: 80.82)
Query:  MEGTLAEELYSESLQSTNSKLGATLSSDWGQNRSSGSGVDDLCQDDGSLWGGSDEGSEEISDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFK
        MEGT+AEELYSE LQST SKL     SD+ QNRSSGS V+D C+DDGSLWGGSDEG EE SDLDREW RRHDQFHTIGYRDGLIAGKEAA+QEGFNVGFK
Subjt:  MEGTLAEELYSESLQSTNSKLGATLSSDWGQNRSSGSGVDDLCQDDGSLWGGSDEGSEEISDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFK

Query:  QSVFIGYKLGLVRGVSSVLARLPDDLKEKLMGNEENSSKFQSLYECTNSLSTADALRLFNDEILAQGTTEECVDADTNSRTIGLLKQNPDYGRLGKFYGE
        QSV IGYKLGLVRGVSSVLA LPDDLKEKL+G EEN SKFQSLYE  NSLSTADALRLF+D+I+AQ T EECVDA+T+S+TI LLKQN D  RLG+FYGE
Subjt:  QSVFIGYKLGLVRGVSSVLARLPDDLKEKLMGNEENSSKFQSLYECTNSLSTADALRLFNDEILAQGTTEECVDADTNSRTIGLLKQNPDYGRLGKFYGE

Query:  LQALLPKSPALKVHLHEDQ
        LQALLP SPAL +HLHE+Q
Subjt:  LQALLPKSPALKVHLHEDQ

KAG7031946.1 hypothetical protein SDJN02_05988 [Cucurbita argyrosperma subsp. argyrosperma] (e value: 7.4e-91; % identity: 80.82)
Query:  MEGTLAEELYSESLQSTNSKLGATLSSDWGQNRSSGSGVDDLCQDDGSLWGGSDEGSEEISDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFK
        MEGT+AEELYSE LQST SKL     SD+ QNRSSGS V+D C+DDGSLWGGSDEG EE SDLDREW RRHDQFHTIGYRDGLIAGKEAA+QEGFNVGFK
Subjt:  MEGTLAEELYSESLQSTNSKLGATLSSDWGQNRSSGSGVDDLCQDDGSLWGGSDEGSEEISDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFK

Query:  QSVFIGYKLGLVRGVSSVLARLPDDLKEKLMGNEENSSKFQSLYECTNSLSTADALRLFNDEILAQGTTEECVDADTNSRTIGLLKQNPDYGRLGKFYGE
        QSV IGYKLGLVRGVSSVLA LPDDLKEKL+G EEN SKFQSLYE  NSLSTADALRLF+D+I+AQ T EECVDA+T+S+TI LLKQN D  RLG+FYGE
Subjt:  QSVFIGYKLGLVRGVSSVLARLPDDLKEKLMGNEENSSKFQSLYECTNSLSTADALRLFNDEILAQGTTEECVDADTNSRTIGLLKQNPDYGRLGKFYGE

Query:  LQALLPKSPALKVHLHEDQ
        LQALLP SPAL +HLHE+Q
Subjt:  LQALLPKSPALKVHLHEDQ

XP_022138890.1 uncharacterized protein LOC111009960 [Momordica charantia] (e value: 4.8e-114; % identity: 97.26)
Query:  MEGTLAEELYSESLQSTNSKLGATLSSDWGQNRSSGSGVDDLCQDDGSLWGGSDEGSEEISDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFK
        MEGTLAEELYSESLQSTNSKLGATLSSDWGQNRSSGSGVDDLCQDDGSLWGGSDEG EEISDLDREWQRRHDQFHTIGYRDGLI+GKEAAAQEGFNVGFK
Subjt:  MEGTLAEELYSESLQSTNSKLGATLSSDWGQNRSSGSGVDDLCQDDGSLWGGSDEGSEEISDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFK

Query:  QSVFIGYKLGLVRGVSSVLARLPDDLKEKLMGNEENSSKFQSLYECTNSLSTADALRLFNDEILAQGTTEECVDADTNSRTIGLLKQNPDYGRLGKFYGE
        QSVFIGYKLGLVRGVSSVLA LPDDLKEKLMGNEENSSKFQSLYECTNS+STADALRLFNDEILAQ TTEECVDADTNSRTIGLLKQNPD GRLGKFYGE
Subjt:  QSVFIGYKLGLVRGVSSVLARLPDDLKEKLMGNEENSSKFQSLYECTNSLSTADALRLFNDEILAQGTTEECVDADTNSRTIGLLKQNPDYGRLGKFYGE

Query:  LQALLPKSPALKVHLHEDQ
        LQALLPKSPALKVHLHEDQ
Subjt:  LQALLPKSPALKVHLHEDQ

XP_022957528.1 uncharacterized protein LOC111458903 [Cucurbita moschata] (e value: 2.6e-91; % identity: 81.28)
Query:  MEGTLAEELYSESLQSTNSKLGATLSSDWGQNRSSGSGVDDLCQDDGSLWGGSDEGSEEISDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFK
        MEGT+AEELYSESLQST SKL     SD+ QNR SGS V+D C+DDGSLWGGSDE  EE SDLDREW RRHDQFHTIGYRDGLIAGKEAA+QEGFNVGFK
Subjt:  MEGTLAEELYSESLQSTNSKLGATLSSDWGQNRSSGSGVDDLCQDDGSLWGGSDEGSEEISDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFK

Query:  QSVFIGYKLGLVRGVSSVLARLPDDLKEKLMGNEENSSKFQSLYECTNSLSTADALRLFNDEILAQGTTEECVDADTNSRTIGLLKQNPDYGRLGKFYGE
        QSV IGYKLGLVRGVSSVLA LPDDLKEKL+G EE  SKFQSLYE  NSLSTADALRLF+D+I+AQ T EECVDA+T+S+TI LLKQN DYGRLG+FYGE
Subjt:  QSVFIGYKLGLVRGVSSVLARLPDDLKEKLMGNEENSSKFQSLYECTNSLSTADALRLFNDEILAQGTTEECVDADTNSRTIGLLKQNPDYGRLGKFYGE

Query:  LQALLPKSPALKVHLHEDQ
        LQALLPKSPAL +HLHE+Q
Subjt:  LQALLPKSPALKVHLHEDQ

XP_023514291.1 uncharacterized protein LOC111778596 isoform X2 [Cucurbita pepo subsp. pepo] (e value: 9.7e-91; % identity: 80.82)
Query:  MEGTLAEELYSESLQSTNSKLGATLSSDWGQNRSSGSGVDDLCQDDGSLWGGSDEGSEEISDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFK
        MEGT+AEELYSESLQST SKL     SD+ QNR SGS V+D C+DDGSLWGGSDEG EE SDLDREW RRHDQFHTIGYRDGLIAGKEAA+QEGFNVGFK
Subjt:  MEGTLAEELYSESLQSTNSKLGATLSSDWGQNRSSGSGVDDLCQDDGSLWGGSDEGSEEISDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFK

Query:  QSVFIGYKLGLVRGVSSVLARLPDDLKEKLMGNEENSSKFQSLYECTNSLSTADALRLFNDEILAQGTTEECVDADTNSRTIGLLKQNPDYGRLGKFYGE
        QSV IGYKLGLVRGVSSVLA LPDDLKEKL+G EEN SKFQSLYE  NSLSTADALRLF+++I+AQ T EECVDA+T+S+TI LLKQN D  RLGKFYGE
Subjt:  QSVFIGYKLGLVRGVSSVLARLPDDLKEKLMGNEENSSKFQSLYECTNSLSTADALRLFNDEILAQGTTEECVDADTNSRTIGLLKQNPDYGRLGKFYGE

Query:  LQALLPKSPALKVHLHEDQ
        LQALLP SPAL +HLHE+Q
Subjt:  LQALLPKSPALKVHLHEDQ

TrEMBL top hits (e value, % identity, alignment)
A0A6J1CAS2 uncharacterized protein LOC111009960 (e value: 2.3e-114; % identity: 97.26)
Query:  MEGTLAEELYSESLQSTNSKLGATLSSDWGQNRSSGSGVDDLCQDDGSLWGGSDEGSEEISDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFK
        MEGTLAEELYSESLQSTNSKLGATLSSDWGQNRSSGSGVDDLCQDDGSLWGGSDEG EEISDLDREWQRRHDQFHTIGYRDGLI+GKEAAAQEGFNVGFK
Subjt:  MEGTLAEELYSESLQSTNSKLGATLSSDWGQNRSSGSGVDDLCQDDGSLWGGSDEGSEEISDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFK

Query:  QSVFIGYKLGLVRGVSSVLARLPDDLKEKLMGNEENSSKFQSLYECTNSLSTADALRLFNDEILAQGTTEECVDADTNSRTIGLLKQNPDYGRLGKFYGE
        QSVFIGYKLGLVRGVSSVLA LPDDLKEKLMGNEENSSKFQSLYECTNS+STADALRLFNDEILAQ TTEECVDADTNSRTIGLLKQNPD GRLGKFYGE
Subjt:  QSVFIGYKLGLVRGVSSVLARLPDDLKEKLMGNEENSSKFQSLYECTNSLSTADALRLFNDEILAQGTTEECVDADTNSRTIGLLKQNPDYGRLGKFYGE

Query:  LQALLPKSPALKVHLHEDQ
        LQALLPKSPALKVHLHEDQ
Subjt:  LQALLPKSPALKVHLHEDQ

A0A6J1H273 uncharacterized protein LOC111458903 (e value: 1.2e-91; % identity: 81.28)
Query:  MEGTLAEELYSESLQSTNSKLGATLSSDWGQNRSSGSGVDDLCQDDGSLWGGSDEGSEEISDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFK
        MEGT+AEELYSESLQST SKL     SD+ QNR SGS V+D C+DDGSLWGGSDE  EE SDLDREW RRHDQFHTIGYRDGLIAGKEAA+QEGFNVGFK
Subjt:  MEGTLAEELYSESLQSTNSKLGATLSSDWGQNRSSGSGVDDLCQDDGSLWGGSDEGSEEISDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFK

Query:  QSVFIGYKLGLVRGVSSVLARLPDDLKEKLMGNEENSSKFQSLYECTNSLSTADALRLFNDEILAQGTTEECVDADTNSRTIGLLKQNPDYGRLGKFYGE
        QSV IGYKLGLVRGVSSVLA LPDDLKEKL+G EE  SKFQSLYE  NSLSTADALRLF+D+I+AQ T EECVDA+T+S+TI LLKQN DYGRLG+FYGE
Subjt:  QSVFIGYKLGLVRGVSSVLARLPDDLKEKLMGNEENSSKFQSLYECTNSLSTADALRLFNDEILAQGTTEECVDADTNSRTIGLLKQNPDYGRLGKFYGE

Query:  LQALLPKSPALKVHLHEDQ
        LQALLPKSPAL +HLHE+Q
Subjt:  LQALLPKSPALKVHLHEDQ

A0A6J1JR95 uncharacterized protein LOC111487537 isoform X2 (e value: 8.0e-91; % identity: 80.82)
Query:  MEGTLAEELYSESLQSTNSKLGATLSSDWGQNRSSGSGVDDLCQDDGSLWGGSDEGSEEISDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFK
        MEGT+AEELYSESLQST SKL     SD+ QNR SGS V+D C+DDGSLWGGSDEG EE SDLDREW RRHDQFHTIGYRDGLIAGKEAA+QEGFNVGFK
Subjt:  MEGTLAEELYSESLQSTNSKLGATLSSDWGQNRSSGSGVDDLCQDDGSLWGGSDEGSEEISDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFK

Query:  QSVFIGYKLGLVRGVSSVLARLPDDLKEKLMGNEENSSKFQSLYECTNSLSTADALRLFNDEILAQGTTEECVDADTNSRTIGLLKQNPDYGRLGKFYGE
        QSV IGYKLGLVRGVSSVLA LPDDLKEKL+G EEN SKFQSLYE  NSLSTADALRLF+D+I+AQ T EE VDA+T+S+TI LLKQN DY RLG+FYGE
Subjt:  QSVFIGYKLGLVRGVSSVLARLPDDLKEKLMGNEENSSKFQSLYECTNSLSTADALRLFNDEILAQGTTEECVDADTNSRTIGLLKQNPDYGRLGKFYGE

Query:  LQALLPKSPALKVHLHEDQ
        LQALLP SPAL +HLHE+Q
Subjt:  LQALLPKSPALKVHLHEDQ

A0A6J1JRH1 uncharacterized protein LOC111487537 isoform X3 (e value: 8.0e-91; % identity: 80.82)
Query:  MEGTLAEELYSESLQSTNSKLGATLSSDWGQNRSSGSGVDDLCQDDGSLWGGSDEGSEEISDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFK
        MEGT+AEELYSESLQST SKL     SD+ QNR SGS V+D C+DDGSLWGGSDEG EE SDLDREW RRHDQFHTIGYRDGLIAGKEAA+QEGFNVGFK
Subjt:  MEGTLAEELYSESLQSTNSKLGATLSSDWGQNRSSGSGVDDLCQDDGSLWGGSDEGSEEISDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFK

Query:  QSVFIGYKLGLVRGVSSVLARLPDDLKEKLMGNEENSSKFQSLYECTNSLSTADALRLFNDEILAQGTTEECVDADTNSRTIGLLKQNPDYGRLGKFYGE
        QSV IGYKLGLVRGVSSVLA LPDDLKEKL+G EEN SKFQSLYE  NSLSTADALRLF+D+I+AQ T EE VDA+T+S+TI LLKQN DY RLG+FYGE
Subjt:  QSVFIGYKLGLVRGVSSVLARLPDDLKEKLMGNEENSSKFQSLYECTNSLSTADALRLFNDEILAQGTTEECVDADTNSRTIGLLKQNPDYGRLGKFYGE

Query:  LQALLPKSPALKVHLHEDQ
        LQALLP SPAL +HLHE+Q
Subjt:  LQALLPKSPALKVHLHEDQ

A0A6J1JSZ3 uncharacterized protein LOC111487537 isoform X1 (e value: 8.0e-91; % identity: 80.82)
Query:  MEGTLAEELYSESLQSTNSKLGATLSSDWGQNRSSGSGVDDLCQDDGSLWGGSDEGSEEISDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFK
        MEGT+AEELYSESLQST SKL     SD+ QNR SGS V+D C+DDGSLWGGSDEG EE SDLDREW RRHDQFHTIGYRDGLIAGKEAA+QEGFNVGFK
Subjt:  MEGTLAEELYSESLQSTNSKLGATLSSDWGQNRSSGSGVDDLCQDDGSLWGGSDEGSEEISDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFK

Query:  QSVFIGYKLGLVRGVSSVLARLPDDLKEKLMGNEENSSKFQSLYECTNSLSTADALRLFNDEILAQGTTEECVDADTNSRTIGLLKQNPDYGRLGKFYGE
        QSV IGYKLGLVRGVSSVLA LPDDLKEKL+G EEN SKFQSLYE  NSLSTADALRLF+D+I+AQ T EE VDA+T+S+TI LLKQN DY RLG+FYGE
Subjt:  QSVFIGYKLGLVRGVSSVLARLPDDLKEKLMGNEENSSKFQSLYECTNSLSTADALRLFNDEILAQGTTEECVDADTNSRTIGLLKQNPDYGRLGKFYGE

Query:  LQALLPKSPALKVHLHEDQ
        LQALLP SPAL +HLHE+Q
Subjt:  LQALLPKSPALKVHLHEDQ

SwissProt top hits (e value, % identity, alignment)
Q9NRH1 Protein YAE1 homolog (e value: 8.5e-05; % identity: 37.18)
Query:  LCQDDGSLWGGSDEGSEEISDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFKQSVFIGYKLGLVRGVSSVL
        L Q  G      DE ++E     REWQ    +    GYRDG+ AGK    Q+GFN G+K+   +    G +RG  S L
Subjt:  LCQDDGSLWGGSDEGSEEISDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFKQSVFIGYKLGLVRGVSSVL

Arabidopsis top hits (e value, % identity, alignment)
AT1G34550.1 Protein of unknown function (DUF616) (e value: 6.0e-06; % identity: 33.33)
Query:  SVLARLPDDLKEKLMGNEENSSKFQSLYECTNSLSTADALRLFNDEILAQGTTEECVDADTNSRTIGLLKQNPDYGRLGKFYGELQALLPKSPALKVHLH
        +VLA LPD+L+EKL+  +E   KFQ L+   ++LST  A++ F   +    TT+E +     + T             G +  EL +LL KSP ++  L 
Subjt:  SVLARLPDDLKEKLMGNEENSSKFQSLYECTNSLSTADALRLFNDEILAQGTTEECVDADTNSRTIGLLKQNPDYGRLGKFYGELQALLPKSPALKVHLH

Query:  ED
        +D
Subjt:  ED

AT1G34570.1 Essential protein Yae1, N-terminal (e value: 2.1e-30; % identity: 39.71)
Query:  AEELYSESLQSTNSKLGATLSSDWGQNRSSGSGVDDLCQDDGSLWGGSDEGSEEISDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFKQSVFI
        A+ELY ESLQ   SKL            +   G+++L   D   +G SDE   E   LD E ++R  +FH+ GYRDG++ GKEA AQEG+N G+K+SV  
Subjt:  AEELYSESLQSTNSKLGATLSSDWGQNRSSGSGVDDLCQDDGSLWGGSDEGSEEISDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFKQSVFI

Query:  GYKLGLVRGVSSVLARLPDDLKEKLMGNEENSSKFQSLYECTNSLSTADALRLFNDEILAQGTTEECVDADTNSRTI-GLLKQNPDYGRLGKFYGELQAL
        GYK G+VRGVSS LA LP + +EKL+  +E   KFQ L+   ++LST  A++ F + +  +   E+  +   +S ++ G          LG +  EL +L
Subjt:  GYKLGLVRGVSSVLARLPDDLKEKLMGNEENSSKFQSLYECTNSLSTADALRLFNDEILAQGTTEECVDADTNSRTI-GLLKQNPDYGRLGKFYGELQAL

Query:  LPKSPALKV
        L KSP ++V
Subjt:  LPKSPALKV

AT3G15750.1 Essential protein Yae1, N-terminal (e value: 1.7e-29; % identity: 38.07)
Query:  MEGTLAEELYSESLQSTNSKLGATLSSDWGQNRSSGSGVDDLCQDDGSLWGGSDEGS-EEISDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGF
        ++  LA+ELY ES+Q                          L + D   +G SDE    E   L  E ++R  +FH+ GYRDG++AGKEA AQEG+N G+
Subjt:  MEGTLAEELYSESLQSTNSKLGATLSSDWGQNRSSGSGVDDLCQDDGSLWGGSDEGS-EEISDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGF

Query:  KQSVFIGYKLGLVRGVSSVLARLPDDLKEKLMGNEENSSKFQSLYECTNSLSTADALRLFNDEILA----QGTTEECVDADTNSRTIGLLKQNPDYGRLG
        K+SV  GY+ GLVRGVSS LA LPD+L+EKL+  +E   KFQ L+   ++LST  A++ F + +      + + EE  D+ ++S +   +    D   LG
Subjt:  KQSVFIGYKLGLVRGVSSVLARLPDDLKEKLMGNEENSSKFQSLYECTNSLSTADALRLFNDEILA----QGTTEECVDADTNSRTIGLLKQNPDYGRLG

Query:  KFYGELQALLPKSPALKV
         +  EL +LL KSP ++V
Subjt:  KFYGELQALLPKSPALKV
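
The alignments above carry a middle line between the Query and Subjt rows. The sketch below shows one way to count identities from a single alignment block. It assumes BLAST-style midline notation (a letter for an identical residue, `+` for a conservative substitution, a space for a mismatch or gap) and takes percent identity as identities divided by block length. Both are assumptions about how this page was generated, and `alignment_stats` is a hypothetical helper name.

```python
def alignment_stats(query: str, midline: str) -> dict:
    """Count identities and positives in one alignment block.

    Assumption: BLAST-style midline, where a letter marks an identical
    residue, '+' a conservative substitution, and a space a mismatch or gap.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    # An identity is a midline letter that repeats the query residue.
    identities = sum(
        1 for q, m in zip(query, midline) if m.isalpha() and m == q
    )
    positives = identities + midline.count("+")
    return {
        "length": len(query),
        "identities": identities,
        "positives": positives,
        "percent_identity": round(100.0 * identities / len(query), 2),
    }


# Last block of the KAG6601147.1 alignment, copied from this record.
query = "LQALLPKSPALKVHLHEDQ"
midline = "LQALLP SPAL +HLHE+Q"
print(alignment_stats(query, midline))
```

To compare against a hit's reported % identity, the blocks of that hit would need to be concatenated first, since the reported value refers to the full alignment rather than to one block.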


Sequences
CDS sequence
ATGGAGGGTACTCTTGCTGAAGAACTTTATTCCGAGAGTCTGCAGTCAACAAATTCAAAATTGGGTGCTACATTGTCTTCTGATTGGGGACAAAATCGGTCATCAGGTTC
TGGTGTGGATGACTTGTGCCAAGATGATGGATCTTTATGGGGTGGTTCTGATGAAGGCTCGGAGGAAATATCTGATTTGGACAGGGAGTGGCAGAGGAGACATGACCAAT
TCCATACGATTGGATACCGTGATGGTTTAATCGCTGGTAAAGAAGCTGCAGCTCAAGAGGGATTTAATGTTGGCTTCAAGCAGTCAGTCTTTATTGGGTATAAGTTGGGT
CTTGTCAGAGGTGTTAGCAGTGTGCTTGCTCGCCTTCCTGATGACTTGAAAGAGAAGCTAATGGGAAATGAAGAGAACAGTAGTAAATTCCAAAGCTTGTATGAATGTAC
GAACTCTCTTTCGACAGCAGATGCGCTTAGACTATTCAATGACGAGATTTTGGCACAAGGCACGACAGAAGAGTGTGTCGACGCAGATACTAATTCCCGAACGATAGGTT
TGCTGAAGCAAAATCCAGATTATGGACGTCTAGGGAAGTTCTATGGAGAGCTTCAAGCACTTTTACCAAAATCACCTGCTCTGAAAGTTCATCTACATGAAGACCAG
mRNA sequence
ATGGAGGGTACTCTTGCTGAAGAACTTTATTCCGAGAGTCTGCAGTCAACAAATTCAAAATTGGGTGCTACATTGTCTTCTGATTGGGGACAAAATCGGTCATCAGGTTC
TGGTGTGGATGACTTGTGCCAAGATGATGGATCTTTATGGGGTGGTTCTGATGAAGGCTCGGAGGAAATATCTGATTTGGACAGGGAGTGGCAGAGGAGACATGACCAAT
TCCATACGATTGGATACCGTGATGGTTTAATCGCTGGTAAAGAAGCTGCAGCTCAAGAGGGATTTAATGTTGGCTTCAAGCAGTCAGTCTTTATTGGGTATAAGTTGGGT
CTTGTCAGAGGTGTTAGCAGTGTGCTTGCTCGCCTTCCTGATGACTTGAAAGAGAAGCTAATGGGAAATGAAGAGAACAGTAGTAAATTCCAAAGCTTGTATGAATGTAC
GAACTCTCTTTCGACAGCAGATGCGCTTAGACTATTCAATGACGAGATTTTGGCACAAGGCACGACAGAAGAGTGTGTCGACGCAGATACTAATTCCCGAACGATAGGTT
TGCTGAAGCAAAATCCAGATTATGGACGTCTAGGGAAGTTCTATGGAGAGCTTCAAGCACTTTTACCAAAATCACCTGCTCTGAAAGTTCATCTACATGAAGACCAG
Protein sequence
MEGTLAEELYSESLQSTNSKLGATLSSDWGQNRSSGSGVDDLCQDDGSLWGGSDEGSEEISDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFKQSVFIGYKLG
LVRGVSSVLARLPDDLKEKLMGNEENSSKFQSLYECTNSLSTADALRLFNDEILAQGTTEECVDADTNSRTIGLLKQNPDYGRLGKFYGELQALLPKSPALKVHLHEDQ
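
The CDS and protein sequences above can be checked against each other by translating the CDS. The following is a minimal sketch using the standard genetic code (NCBI translation table 1). Use of the standard table for this nuclear gene is an assumption, `translate` is a hypothetical helper name, and only the first 48 nucleotides of the CDS are used as sample input.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code (NCBI translation table 1). Amino acids are listed
# in codon order TTT, TTC, TTA, TTG, TCT, ... with '*' marking stop codons.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, reading frame 1, into one-letter residues."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Codons containing ambiguous bases (e.g. N) are reported as 'X'.
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3)
    )


# First 48 nucleotides of the CDS sequence in this record.
cds_start = "ATGGAGGGTACTCTTGCTGAAGAACTTTATTCCGAGAGTCTGCAGTCA"
print(translate(cds_start))
```

Because `translate` strips whitespace, the six CDS lines can be pasted in as one multi-line string and the result compared with the protein sequence.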