; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0003240 (gene) of Snake gourd v1 genome

Gene IDTan0003240
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionYae1_N domain-containing protein
Genome locationLG04:4782417..4784626
RNA-Seq ExpressionTan0003240
SyntenyTan0003240
Gene Ontology termsGO:0000398 - mRNA splicing, via spliceosome (biological process)
GO:0005681 - spliceosomal complex (cellular component)
GO:0005737 - cytoplasm (cellular component)
GO:0004386 - helicase activity (molecular function)
InterPro domainsIPR019191 - Essential protein Yae1, N-terminal
IPR038881 - Yae1-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022990784.1 uncharacterized protein LOC111487537 isoform X1 [Cucurbita maxima]1.1e-9282.65Show/hide
Query:  MEGALAEELYSESIQSTKSKLGAMSFSDCKQNRPSGSGVDDSCQDDGSLWGGSDEGWEETSDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFK
        MEG +AEELYSES+QSTKSKL  MS SD +QNRPSGS V+DSC+DDGSLWGGSDEG EETSDLDREW RRHDQFHTIGYRDGLIAGKEAA+QEGFNVGFK
Subjt:  MEGALAEELYSESIQSTKSKLGAMSFSDCKQNRPSGSGVDDSCQDDGSLWGGSDEGWEETSDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFK

Query:  QSVSVGYKLGLVRGVSSVLASLPDDLKEKLTGSEEKRSKFQSLYESVNSLSTAGALRLFNDKITVQGTEEECGDANTNSQTIGLLKQNSDYDHLGKFYRE
        QSVS+GYKLGLVRGVSSVLASLPDDLKEKL G EE RSKFQSLYESVNSLSTA ALRLF+D I  Q T+EE  DANT+SQTI LLKQNSDY  LG+FY E
Subjt:  QSVSVGYKLGLVRGVSSVLASLPDDLKEKLTGSEEKRSKFQSLYESVNSLSTAGALRLFNDKITVQGTEEECGDANTNSQTIGLLKQNSDYDHLGKFYRE

Query:  LQELLPQSPALNVHLHKDQ
        LQ LLP SPALN+HLH++Q
Subjt:  LQELLPQSPALNVHLHKDQ

XP_022990860.1 uncharacterized protein LOC111487537 isoform X2 [Cucurbita maxima]1.1e-9282.65Show/hide
Query:  MEGALAEELYSESIQSTKSKLGAMSFSDCKQNRPSGSGVDDSCQDDGSLWGGSDEGWEETSDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFK
        MEG +AEELYSES+QSTKSKL  MS SD +QNRPSGS V+DSC+DDGSLWGGSDEG EETSDLDREW RRHDQFHTIGYRDGLIAGKEAA+QEGFNVGFK
Subjt:  MEGALAEELYSESIQSTKSKLGAMSFSDCKQNRPSGSGVDDSCQDDGSLWGGSDEGWEETSDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFK

Query:  QSVSVGYKLGLVRGVSSVLASLPDDLKEKLTGSEEKRSKFQSLYESVNSLSTAGALRLFNDKITVQGTEEECGDANTNSQTIGLLKQNSDYDHLGKFYRE
        QSVS+GYKLGLVRGVSSVLASLPDDLKEKL G EE RSKFQSLYESVNSLSTA ALRLF+D I  Q T+EE  DANT+SQTI LLKQNSDY  LG+FY E
Subjt:  QSVSVGYKLGLVRGVSSVLASLPDDLKEKLTGSEEKRSKFQSLYESVNSLSTAGALRLFNDKITVQGTEEECGDANTNSQTIGLLKQNSDYDHLGKFYRE

Query:  LQELLPQSPALNVHLHKDQ
        LQ LLP SPALN+HLH++Q
Subjt:  LQELLPQSPALNVHLHKDQ

XP_022990940.1 uncharacterized protein LOC111487537 isoform X3 [Cucurbita maxima]1.1e-9282.65Show/hide
Query:  MEGALAEELYSESIQSTKSKLGAMSFSDCKQNRPSGSGVDDSCQDDGSLWGGSDEGWEETSDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFK
        MEG +AEELYSES+QSTKSKL  MS SD +QNRPSGS V+DSC+DDGSLWGGSDEG EETSDLDREW RRHDQFHTIGYRDGLIAGKEAA+QEGFNVGFK
Subjt:  MEGALAEELYSESIQSTKSKLGAMSFSDCKQNRPSGSGVDDSCQDDGSLWGGSDEGWEETSDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFK

Query:  QSVSVGYKLGLVRGVSSVLASLPDDLKEKLTGSEEKRSKFQSLYESVNSLSTAGALRLFNDKITVQGTEEECGDANTNSQTIGLLKQNSDYDHLGKFYRE
        QSVS+GYKLGLVRGVSSVLASLPDDLKEKL G EE RSKFQSLYESVNSLSTA ALRLF+D I  Q T+EE  DANT+SQTI LLKQNSDY  LG+FY E
Subjt:  QSVSVGYKLGLVRGVSSVLASLPDDLKEKLTGSEEKRSKFQSLYESVNSLSTAGALRLFNDKITVQGTEEECGDANTNSQTIGLLKQNSDYDHLGKFYRE

Query:  LQELLPQSPALNVHLHKDQ
        LQ LLP SPALN+HLH++Q
Subjt:  LQELLPQSPALNVHLHKDQ

XP_023514282.1 uncharacterized protein LOC111778596 isoform X1 [Cucurbita pepo subsp. pepo]6.2e-9382.65Show/hide
Query:  MEGALAEELYSESIQSTKSKLGAMSFSDCKQNRPSGSGVDDSCQDDGSLWGGSDEGWEETSDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFK
        MEG +AEELYSES+QSTKSKL  MS SD +QNRPSGS V+DSC+DDGSLWGGSDEG EETSDLDREW RRHDQFHTIGYRDGLIAGKEAA+QEGFNVGFK
Subjt:  MEGALAEELYSESIQSTKSKLGAMSFSDCKQNRPSGSGVDDSCQDDGSLWGGSDEGWEETSDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFK

Query:  QSVSVGYKLGLVRGVSSVLASLPDDLKEKLTGSEEKRSKFQSLYESVNSLSTAGALRLFNDKITVQGTEEECGDANTNSQTIGLLKQNSDYDHLGKFYRE
        QSVS+GYKLGLVRGVSSVLASLPDDLKEKL G EE RSKFQSLYESVNSLSTA ALRLF++ I  Q T+EEC DANT+SQTI LLKQNSD   LGKFY E
Subjt:  QSVSVGYKLGLVRGVSSVLASLPDDLKEKLTGSEEKRSKFQSLYESVNSLSTAGALRLFNDKITVQGTEEECGDANTNSQTIGLLKQNSDYDHLGKFYRE

Query:  LQELLPQSPALNVHLHKDQ
        LQ LLP SPALN+HLH++Q
Subjt:  LQELLPQSPALNVHLHKDQ

XP_023514291.1 uncharacterized protein LOC111778596 isoform X2 [Cucurbita pepo subsp. pepo]6.2e-9382.65Show/hide
Query:  MEGALAEELYSESIQSTKSKLGAMSFSDCKQNRPSGSGVDDSCQDDGSLWGGSDEGWEETSDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFK
        MEG +AEELYSES+QSTKSKL  MS SD +QNRPSGS V+DSC+DDGSLWGGSDEG EETSDLDREW RRHDQFHTIGYRDGLIAGKEAA+QEGFNVGFK
Subjt:  MEGALAEELYSESIQSTKSKLGAMSFSDCKQNRPSGSGVDDSCQDDGSLWGGSDEGWEETSDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFK

Query:  QSVSVGYKLGLVRGVSSVLASLPDDLKEKLTGSEEKRSKFQSLYESVNSLSTAGALRLFNDKITVQGTEEECGDANTNSQTIGLLKQNSDYDHLGKFYRE
        QSVS+GYKLGLVRGVSSVLASLPDDLKEKL G EE RSKFQSLYESVNSLSTA ALRLF++ I  Q T+EEC DANT+SQTI LLKQNSD   LGKFY E
Subjt:  QSVSVGYKLGLVRGVSSVLASLPDDLKEKLTGSEEKRSKFQSLYESVNSLSTAGALRLFNDKITVQGTEEECGDANTNSQTIGLLKQNSDYDHLGKFYRE

Query:  LQELLPQSPALNVHLHKDQ
        LQ LLP SPALN+HLH++Q
Subjt:  LQELLPQSPALNVHLHKDQ

TrEMBL top hitse value%identityAlignment
A0A1S4DX60 uncharacterized protein LOC1034894879.6e-9281.28Show/hide
Query:  MEGALAEELYSESIQSTKSKLGAMSFSDCKQNRPSGSGVDDSCQDDGSLWGGSDEGWEETSDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFK
        MEG +AEELYSES+QSTKSKLG +S SD K++RPSGSGVDDSC+DDGSLWGGSDEG EET+DLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFN+GFK
Subjt:  MEGALAEELYSESIQSTKSKLGAMSFSDCKQNRPSGSGVDDSCQDDGSLWGGSDEGWEETSDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFK

Query:  QSVSVGYKLGLVRGVSSVLASLPDDLKEKLTGSEEKRSKFQSLYESVNSLSTAGALRLFNDKITVQGTEEECGDANTNSQTIGLLKQNSDYDHLGKFYRE
        QSVSVGYKLGLVRGVSSVLASLPDDLKEKL G+ E +SKFQSLYESVNSLST  ALRLFN  IT Q T+EE   ANTNSQT+ LLK+N DY  LGKFY E
Subjt:  QSVSVGYKLGLVRGVSSVLASLPDDLKEKLTGSEEKRSKFQSLYESVNSLSTAGALRLFNDKITVQGTEEECGDANTNSQTIGLLKQNSDYDHLGKFYRE

Query:  LQELLPQSPALNVHLHKDQ
        L   LPQSPALNVHLH+++
Subjt:  LQELLPQSPALNVHLHKDQ

A0A6J1H273 uncharacterized protein LOC1114589038.7e-9382.19Show/hide
Query:  MEGALAEELYSESIQSTKSKLGAMSFSDCKQNRPSGSGVDDSCQDDGSLWGGSDEGWEETSDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFK
        MEG +AEELYSES+QSTKSKL  +S SD +QNRPSGS V+DSC+DDGSLWGGSDE  EETSDLDREW RRHDQFHTIGYRDGLIAGKEAA+QEGFNVGFK
Subjt:  MEGALAEELYSESIQSTKSKLGAMSFSDCKQNRPSGSGVDDSCQDDGSLWGGSDEGWEETSDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFK

Query:  QSVSVGYKLGLVRGVSSVLASLPDDLKEKLTGSEEKRSKFQSLYESVNSLSTAGALRLFNDKITVQGTEEECGDANTNSQTIGLLKQNSDYDHLGKFYRE
        QSVS+GYKLGLVRGVSSVLASLPDDLKEKL G EE RSKFQSLYESVNSLSTA ALRLF+D I  Q T+EEC DANT+SQTI LLKQNSDY  LG+FY E
Subjt:  QSVSVGYKLGLVRGVSSVLASLPDDLKEKLTGSEEKRSKFQSLYESVNSLSTAGALRLFNDKITVQGTEEECGDANTNSQTIGLLKQNSDYDHLGKFYRE

Query:  LQELLPQSPALNVHLHKDQ
        LQ LLP+SPALN+HLH++Q
Subjt:  LQELLPQSPALNVHLHKDQ

A0A6J1JR95 uncharacterized protein LOC111487537 isoform X25.1e-9382.65Show/hide
Query:  MEGALAEELYSESIQSTKSKLGAMSFSDCKQNRPSGSGVDDSCQDDGSLWGGSDEGWEETSDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFK
        MEG +AEELYSES+QSTKSKL  MS SD +QNRPSGS V+DSC+DDGSLWGGSDEG EETSDLDREW RRHDQFHTIGYRDGLIAGKEAA+QEGFNVGFK
Subjt:  MEGALAEELYSESIQSTKSKLGAMSFSDCKQNRPSGSGVDDSCQDDGSLWGGSDEGWEETSDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFK

Query:  QSVSVGYKLGLVRGVSSVLASLPDDLKEKLTGSEEKRSKFQSLYESVNSLSTAGALRLFNDKITVQGTEEECGDANTNSQTIGLLKQNSDYDHLGKFYRE
        QSVS+GYKLGLVRGVSSVLASLPDDLKEKL G EE RSKFQSLYESVNSLSTA ALRLF+D I  Q T+EE  DANT+SQTI LLKQNSDY  LG+FY E
Subjt:  QSVSVGYKLGLVRGVSSVLASLPDDLKEKLTGSEEKRSKFQSLYESVNSLSTAGALRLFNDKITVQGTEEECGDANTNSQTIGLLKQNSDYDHLGKFYRE

Query:  LQELLPQSPALNVHLHKDQ
        LQ LLP SPALN+HLH++Q
Subjt:  LQELLPQSPALNVHLHKDQ

A0A6J1JRH1 uncharacterized protein LOC111487537 isoform X35.1e-9382.65Show/hide
Query:  MEGALAEELYSESIQSTKSKLGAMSFSDCKQNRPSGSGVDDSCQDDGSLWGGSDEGWEETSDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFK
        MEG +AEELYSES+QSTKSKL  MS SD +QNRPSGS V+DSC+DDGSLWGGSDEG EETSDLDREW RRHDQFHTIGYRDGLIAGKEAA+QEGFNVGFK
Subjt:  MEGALAEELYSESIQSTKSKLGAMSFSDCKQNRPSGSGVDDSCQDDGSLWGGSDEGWEETSDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFK

Query:  QSVSVGYKLGLVRGVSSVLASLPDDLKEKLTGSEEKRSKFQSLYESVNSLSTAGALRLFNDKITVQGTEEECGDANTNSQTIGLLKQNSDYDHLGKFYRE
        QSVS+GYKLGLVRGVSSVLASLPDDLKEKL G EE RSKFQSLYESVNSLSTA ALRLF+D I  Q T+EE  DANT+SQTI LLKQNSDY  LG+FY E
Subjt:  QSVSVGYKLGLVRGVSSVLASLPDDLKEKLTGSEEKRSKFQSLYESVNSLSTAGALRLFNDKITVQGTEEECGDANTNSQTIGLLKQNSDYDHLGKFYRE

Query:  LQELLPQSPALNVHLHKDQ
        LQ LLP SPALN+HLH++Q
Subjt:  LQELLPQSPALNVHLHKDQ

A0A6J1JSZ3 uncharacterized protein LOC111487537 isoform X15.1e-9382.65Show/hide
Query:  MEGALAEELYSESIQSTKSKLGAMSFSDCKQNRPSGSGVDDSCQDDGSLWGGSDEGWEETSDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFK
        MEG +AEELYSES+QSTKSKL  MS SD +QNRPSGS V+DSC+DDGSLWGGSDEG EETSDLDREW RRHDQFHTIGYRDGLIAGKEAA+QEGFNVGFK
Subjt:  MEGALAEELYSESIQSTKSKLGAMSFSDCKQNRPSGSGVDDSCQDDGSLWGGSDEGWEETSDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFK

Query:  QSVSVGYKLGLVRGVSSVLASLPDDLKEKLTGSEEKRSKFQSLYESVNSLSTAGALRLFNDKITVQGTEEECGDANTNSQTIGLLKQNSDYDHLGKFYRE
        QSVS+GYKLGLVRGVSSVLASLPDDLKEKL G EE RSKFQSLYESVNSLSTA ALRLF+D I  Q T+EE  DANT+SQTI LLKQNSDY  LG+FY E
Subjt:  QSVSVGYKLGLVRGVSSVLASLPDDLKEKLTGSEEKRSKFQSLYESVNSLSTAGALRLFNDKITVQGTEEECGDANTNSQTIGLLKQNSDYDHLGKFYRE

Query:  LQELLPQSPALNVHLHKDQ
        LQ LLP SPALN+HLH++Q
Subjt:  LQELLPQSPALNVHLHKDQ

SwissProt top hitse value%identityAlignment
Q9NRH1 Protein YAE1 homolog3.9e-0541.18Show/hide
Query:  DEGWEETSDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFKQSVSVGYKLGLVRGVSSVLAS
        DE  +E+    REWQ    +    GYRDG+ AGK    Q+GFN G+K+   V    G +RG  S L S
Subjt:  DEGWEETSDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFKQSVSVGYKLGLVRGVSSVLAS

Arabidopsis top hitse value%identityAlignment
AT1G34550.1 Protein of unknown function (DUF616)2.1e-0636.84Show/hide
Query:  SVLASLPDDLKEKLTGSEEKRSKFQSLYESVNSLSTAGALRLFNDKITVQGTEEECGDANTNSQTIGLLKQNSDYDHLGKFYRELQELLPQSPAL
        +VLA LPD+L+EKL   +E R KFQ L+  V++LST  A++ F   +T   T+E    +  N+         SD+   G +  EL  LL +SP +
Subjt:  SVLASLPDDLKEKLTGSEEKRSKFQSLYESVNSLSTAGALRLFNDKITVQGTEEECGDANTNSQTIGLLKQNSDYDHLGKFYRELQELLPQSPAL

AT1G34570.1 Essential protein Yae1, N-terminal5.9e-3340.48Show/hide
Query:  AEELYSESIQSTKSKLGAMSFSDCKQNRPSGSGVDDSCQD-DGSLWGGSDEGWEETSDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFKQSVS
        A+ELY ES+Q +K                  S  D   ++ D   +G SDE + E   LD E ++R  +FH+ GYRDG++ GKEA AQEG+N G+K+SV 
Subjt:  AEELYSESIQSTKSKLGAMSFSDCKQNRPSGSGVDDSCQD-DGSLWGGSDEGWEETSDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFKQSVS

Query:  VGYKLGLVRGVSSVLASLPDDLKEKLTGSEEKRSKFQSLYESVNSLSTAGALRLFNDKITVQGTEEECGDANTNSQTI-GLLKQNSDYDHLGKFYRELQE
         GYK G+VRGVSS LA LP + +EKL   +E R KFQ L+ SV++LST  A++ F + +T +  EE+ G+   +S ++ G     +    LG +  EL  
Subjt:  VGYKLGLVRGVSSVLASLPDDLKEKLTGSEEKRSKFQSLYESVNSLSTAGALRLFNDKITVQGTEEECGDANTNSQTI-GLLKQNSDYDHLGKFYRELQE

Query:  LLPQSPALNV
        LL +SP + V
Subjt:  LLPQSPALNV

AT3G15750.1 Essential protein Yae1, N-terminal1.1e-3140.76Show/hide
Query:  LAEELYSESIQSTKSKLGAMSFSDCKQNRPSGSGVDDSCQDDGSLWGGSD-EGWEETSDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFKQSV
        LA+ELY ES+Q +K                           D   +G SD E + E   L  E ++R  +FH+ GYRDG++AGKEA AQEG+N G+K+SV
Subjt:  LAEELYSESIQSTKSKLGAMSFSDCKQNRPSGSGVDDSCQDDGSLWGGSD-EGWEETSDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFKQSV

Query:  SVGYKLGLVRGVSSVLASLPDDLKEKLTGSEEKRSKFQSLYESVNSLSTAGALRLFNDKITVQGTEEECGDANTNS-QTIGLLKQNSDYDHLGKFYRELQ
          GY+ GLVRGVSS LA LPD+L+EKL   +E R KFQ L+ SV++LST  A++ F + +T +  EE+ G+   +S    G     +    LG +  EL 
Subjt:  SVGYKLGLVRGVSSVLASLPDDLKEKLTGSEEKRSKFQSLYESVNSLSTAGALRLFNDKITVQGTEEECGDANTNS-QTIGLLKQNSDYDHLGKFYRELQ

Query:  ELLPQSPALNV
         LL +SP + V
Subjt:  ELLPQSPALNV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGGGTGCCCTCGCTGAAGAACTTTATTCCGAGAGTATACAGTCAACGAAATCAAAATTAGGTGCTATGTCATTTTCTGATTGCAAACAAAATCGGCCATCAGGTTC
TGGTGTGGATGACTCGTGCCAAGATGATGGATCTTTATGGGGTGGTTCTGATGAAGGCTGGGAGGAAACATCTGATTTGGACAGGGAGTGGCAGAGGAGACATGACCAAT
TTCACACGATTGGATACCGTGATGGTTTAATCGCTGGGAAAGAAGCTGCAGCTCAAGAGGGATTTAATGTTGGGTTCAAGCAGTCAGTCTCTGTCGGGTATAAGTTGGGT
CTTGTCCGAGGTGTCAGCAGTGTGCTTGCTTCCCTCCCTGATGACTTGAAAGAGAAGCTAACAGGAAGTGAAGAGAAGAGAAGTAAATTCCAAAGCTTGTATGAATCTGT
GAACTCTCTTTCGACAGCAGGAGCGCTTAGGCTATTCAACGACAAGATTACGGTACAAGGCACAGAAGAAGAATGTGGCGATGCCAATACAAATTCCCAAACGATAGGTT
TGTTGAAGCAAAATTCAGATTATGATCATCTAGGAAAGTTCTATAGAGAGCTTCAAGAACTTTTACCCCAATCACCTGCTCTAAATGTTCATCTACACAAAGATCAGCAG
ATTTTGCTGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAGGGTGCCCTCGCTGAAGAACTTTATTCCGAGAGTATACAGTCAACGAAATCAAAATTAGGTGCTATGTCATTTTCTGATTGCAAACAAAATCGGCCATCAGGTTC
TGGTGTGGATGACTCGTGCCAAGATGATGGATCTTTATGGGGTGGTTCTGATGAAGGCTGGGAGGAAACATCTGATTTGGACAGGGAGTGGCAGAGGAGACATGACCAAT
TTCACACGATTGGATACCGTGATGGTTTAATCGCTGGGAAAGAAGCTGCAGCTCAAGAGGGATTTAATGTTGGGTTCAAGCAGTCAGTCTCTGTCGGGTATAAGTTGGGT
CTTGTCCGAGGTGTCAGCAGTGTGCTTGCTTCCCTCCCTGATGACTTGAAAGAGAAGCTAACAGGAAGTGAAGAGAAGAGAAGTAAATTCCAAAGCTTGTATGAATCTGT
GAACTCTCTTTCGACAGCAGGAGCGCTTAGGCTATTCAACGACAAGATTACGGTACAAGGCACAGAAGAAGAATGTGGCGATGCCAATACAAATTCCCAAACGATAGGTT
TGTTGAAGCAAAATTCAGATTATGATCATCTAGGAAAGTTCTATAGAGAGCTTCAAGAACTTTTACCCCAATCACCTGCTCTAAATGTTCATCTACACAAAGATCAGCAG
ATTTTGCTGTAA
Protein sequenceShow/hide protein sequence
MEGALAEELYSESIQSTKSKLGAMSFSDCKQNRPSGSGVDDSCQDDGSLWGGSDEGWEETSDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFKQSVSVGYKLG
LVRGVSSVLASLPDDLKEKLTGSEEKRSKFQSLYESVNSLSTAGALRLFNDKITVQGTEEECGDANTNSQTIGLLKQNSDYDHLGKFYRELQELLPQSPALNVHLHKDQQ
ILL