; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0040531 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0040531
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionYae1_N domain-containing protein
Genome locationchr13:5699617..5701897
RNA-Seq ExpressionLag0040531
SyntenyLag0040531
Gene Ontology termsGO:0000398 - mRNA splicing, via spliceosome (biological process)
GO:0005681 - spliceosomal complex (cellular component)
GO:0005737 - cytoplasm (cellular component)
GO:0004386 - helicase activity (molecular function)
InterPro domainsIPR019191 - Essential protein Yae1, N-terminal
IPR038881 - Yae1-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022957528.1 uncharacterized protein LOC111458903 [Cucurbita moschata]4.2e-9785.39Show/hide
Query:  MEGTFAEELYSESLQSTKSKLGDMSFSDYRQNQSSGSGVDDSCQDDGSLWGGSDEGLEETSDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNIGFK
        MEGT AEELYSESLQSTKSKL D+S SDYRQN+ SGS V+DSC+DDGSLWGGSDE LEETSDLDREW RRHDQFHTIGYRDGLIAGKEAA+QEGFN+GFK
Subjt:  MEGTFAEELYSESLQSTKSKLGDMSFSDYRQNQSSGSGVDDSCQDDGSLWGGSDEGLEETSDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNIGFK

Query:  QSVSVGYKLGLVRGVSSVLASLPDDLKEKLTGIEENRSKFQNLYESVNSLSTAGALRLFNDDITAQGRKEDRVDANTNSQTIDLPKQISDYGHLGKFYGE
        QSVS+GYKLGLVRGVSSVLASLPDDLKEKL GIEE RSKFQ+LYESVNSLSTA ALRLF+DDI AQ  KE+ VDANT+SQTIDL KQ SDYG LG+FYGE
Subjt:  QSVSVGYKLGLVRGVSSVLASLPDDLKEKLTGIEENRSKFQNLYESVNSLSTAGALRLFNDDITAQGRKEDRVDANTNSQTIDLPKQISDYGHLGKFYGE

Query:  LQALLPKSPALNVHLQEDQ
        LQALLPKSPALN+HL E+Q
Subjt:  LQALLPKSPALNVHLQEDQ

XP_022990784.1 uncharacterized protein LOC111487537 isoform X1 [Cucurbita maxima]2.9e-9885.84Show/hide
Query:  MEGTFAEELYSESLQSTKSKLGDMSFSDYRQNQSSGSGVDDSCQDDGSLWGGSDEGLEETSDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNIGFK
        MEGT AEELYSESLQSTKSKL DMS SDYRQN+ SGS V+DSC+DDGSLWGGSDEGLEETSDLDREW RRHDQFHTIGYRDGLIAGKEAA+QEGFN+GFK
Subjt:  MEGTFAEELYSESLQSTKSKLGDMSFSDYRQNQSSGSGVDDSCQDDGSLWGGSDEGLEETSDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNIGFK

Query:  QSVSVGYKLGLVRGVSSVLASLPDDLKEKLTGIEENRSKFQNLYESVNSLSTAGALRLFNDDITAQGRKEDRVDANTNSQTIDLPKQISDYGHLGKFYGE
        QSVS+GYKLGLVRGVSSVLASLPDDLKEKL GIEENRSKFQ+LYESVNSLSTA ALRLF+DDI AQ  KE+ VDANT+SQTIDL KQ SDY  LG+FYGE
Subjt:  QSVSVGYKLGLVRGVSSVLASLPDDLKEKLTGIEENRSKFQNLYESVNSLSTAGALRLFNDDITAQGRKEDRVDANTNSQTIDLPKQISDYGHLGKFYGE

Query:  LQALLPKSPALNVHLQEDQ
        LQALLP SPALN+HL E+Q
Subjt:  LQALLPKSPALNVHLQEDQ

XP_022990860.1 uncharacterized protein LOC111487537 isoform X2 [Cucurbita maxima]2.9e-9885.84Show/hide
Query:  MEGTFAEELYSESLQSTKSKLGDMSFSDYRQNQSSGSGVDDSCQDDGSLWGGSDEGLEETSDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNIGFK
        MEGT AEELYSESLQSTKSKL DMS SDYRQN+ SGS V+DSC+DDGSLWGGSDEGLEETSDLDREW RRHDQFHTIGYRDGLIAGKEAA+QEGFN+GFK
Subjt:  MEGTFAEELYSESLQSTKSKLGDMSFSDYRQNQSSGSGVDDSCQDDGSLWGGSDEGLEETSDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNIGFK

Query:  QSVSVGYKLGLVRGVSSVLASLPDDLKEKLTGIEENRSKFQNLYESVNSLSTAGALRLFNDDITAQGRKEDRVDANTNSQTIDLPKQISDYGHLGKFYGE
        QSVS+GYKLGLVRGVSSVLASLPDDLKEKL GIEENRSKFQ+LYESVNSLSTA ALRLF+DDI AQ  KE+ VDANT+SQTIDL KQ SDY  LG+FYGE
Subjt:  QSVSVGYKLGLVRGVSSVLASLPDDLKEKLTGIEENRSKFQNLYESVNSLSTAGALRLFNDDITAQGRKEDRVDANTNSQTIDLPKQISDYGHLGKFYGE

Query:  LQALLPKSPALNVHLQEDQ
        LQALLP SPALN+HL E+Q
Subjt:  LQALLPKSPALNVHLQEDQ

XP_022990940.1 uncharacterized protein LOC111487537 isoform X3 [Cucurbita maxima]2.9e-9885.84Show/hide
Query:  MEGTFAEELYSESLQSTKSKLGDMSFSDYRQNQSSGSGVDDSCQDDGSLWGGSDEGLEETSDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNIGFK
        MEGT AEELYSESLQSTKSKL DMS SDYRQN+ SGS V+DSC+DDGSLWGGSDEGLEETSDLDREW RRHDQFHTIGYRDGLIAGKEAA+QEGFN+GFK
Subjt:  MEGTFAEELYSESLQSTKSKLGDMSFSDYRQNQSSGSGVDDSCQDDGSLWGGSDEGLEETSDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNIGFK

Query:  QSVSVGYKLGLVRGVSSVLASLPDDLKEKLTGIEENRSKFQNLYESVNSLSTAGALRLFNDDITAQGRKEDRVDANTNSQTIDLPKQISDYGHLGKFYGE
        QSVS+GYKLGLVRGVSSVLASLPDDLKEKL GIEENRSKFQ+LYESVNSLSTA ALRLF+DDI AQ  KE+ VDANT+SQTIDL KQ SDY  LG+FYGE
Subjt:  QSVSVGYKLGLVRGVSSVLASLPDDLKEKLTGIEENRSKFQNLYESVNSLSTAGALRLFNDDITAQGRKEDRVDANTNSQTIDLPKQISDYGHLGKFYGE

Query:  LQALLPKSPALNVHLQEDQ
        LQALLP SPALN+HL E+Q
Subjt:  LQALLPKSPALNVHLQEDQ

XP_023514291.1 uncharacterized protein LOC111778596 isoform X2 [Cucurbita pepo subsp. pepo]5.5e-9785.39Show/hide
Query:  MEGTFAEELYSESLQSTKSKLGDMSFSDYRQNQSSGSGVDDSCQDDGSLWGGSDEGLEETSDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNIGFK
        MEGT AEELYSESLQSTKSKL DMS SDYRQN+ SGS V+DSC+DDGSLWGGSDEGLEETSDLDREW RRHDQFHTIGYRDGLIAGKEAA+QEGFN+GFK
Subjt:  MEGTFAEELYSESLQSTKSKLGDMSFSDYRQNQSSGSGVDDSCQDDGSLWGGSDEGLEETSDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNIGFK

Query:  QSVSVGYKLGLVRGVSSVLASLPDDLKEKLTGIEENRSKFQNLYESVNSLSTAGALRLFNDDITAQGRKEDRVDANTNSQTIDLPKQISDYGHLGKFYGE
        QSVS+GYKLGLVRGVSSVLASLPDDLKEKL GIEENRSKFQ+LYESVNSLSTA ALRLF++DI AQ  KE+ VDANT+SQTIDL KQ SD   LGKFYGE
Subjt:  QSVSVGYKLGLVRGVSSVLASLPDDLKEKLTGIEENRSKFQNLYESVNSLSTAGALRLFNDDITAQGRKEDRVDANTNSQTIDLPKQISDYGHLGKFYGE

Query:  LQALLPKSPALNVHLQEDQ
        LQALLP SPALN+HL E+Q
Subjt:  LQALLPKSPALNVHLQEDQ

TrEMBL top hitse value%identityAlignment
A0A1S4DX60 uncharacterized protein LOC1034894872.1e-9483.11Show/hide
Query:  MEGTFAEELYSESLQSTKSKLGDMSFSDYRQNQSSGSGVDDSCQDDGSLWGGSDEGLEETSDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNIGFK
        MEGT AEELYSESLQSTKSKLGD+S SDY++++ SGSGVDDSC+DDGSLWGGSDEGLEET+DLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNIGFK
Subjt:  MEGTFAEELYSESLQSTKSKLGDMSFSDYRQNQSSGSGVDDSCQDDGSLWGGSDEGLEETSDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNIGFK

Query:  QSVSVGYKLGLVRGVSSVLASLPDDLKEKLTGIEENRSKFQNLYESVNSLSTAGALRLFNDDITAQGRKEDRVDANTNSQTIDLPKQISDYGHLGKFYGE
        QSVSVGYKLGLVRGVSSVLASLPDDLKEKL G  EN+SKFQ+LYESVNSLST  ALRLFN DIT Q  KE+ V ANTNSQT+DL K+  DYG LGKFY E
Subjt:  QSVSVGYKLGLVRGVSSVLASLPDDLKEKLTGIEENRSKFQNLYESVNSLSTAGALRLFNDDITAQGRKEDRVDANTNSQTIDLPKQISDYGHLGKFYGE

Query:  LQALLPKSPALNVHLQEDQ
        L A LP+SPALNVHL E++
Subjt:  LQALLPKSPALNVHLQEDQ

A0A6J1H273 uncharacterized protein LOC1114589032.0e-9785.39Show/hide
Query:  MEGTFAEELYSESLQSTKSKLGDMSFSDYRQNQSSGSGVDDSCQDDGSLWGGSDEGLEETSDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNIGFK
        MEGT AEELYSESLQSTKSKL D+S SDYRQN+ SGS V+DSC+DDGSLWGGSDE LEETSDLDREW RRHDQFHTIGYRDGLIAGKEAA+QEGFN+GFK
Subjt:  MEGTFAEELYSESLQSTKSKLGDMSFSDYRQNQSSGSGVDDSCQDDGSLWGGSDEGLEETSDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNIGFK

Query:  QSVSVGYKLGLVRGVSSVLASLPDDLKEKLTGIEENRSKFQNLYESVNSLSTAGALRLFNDDITAQGRKEDRVDANTNSQTIDLPKQISDYGHLGKFYGE
        QSVS+GYKLGLVRGVSSVLASLPDDLKEKL GIEE RSKFQ+LYESVNSLSTA ALRLF+DDI AQ  KE+ VDANT+SQTIDL KQ SDYG LG+FYGE
Subjt:  QSVSVGYKLGLVRGVSSVLASLPDDLKEKLTGIEENRSKFQNLYESVNSLSTAGALRLFNDDITAQGRKEDRVDANTNSQTIDLPKQISDYGHLGKFYGE

Query:  LQALLPKSPALNVHLQEDQ
        LQALLPKSPALN+HL E+Q
Subjt:  LQALLPKSPALNVHLQEDQ

A0A6J1JR95 uncharacterized protein LOC111487537 isoform X21.4e-9885.84Show/hide
Query:  MEGTFAEELYSESLQSTKSKLGDMSFSDYRQNQSSGSGVDDSCQDDGSLWGGSDEGLEETSDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNIGFK
        MEGT AEELYSESLQSTKSKL DMS SDYRQN+ SGS V+DSC+DDGSLWGGSDEGLEETSDLDREW RRHDQFHTIGYRDGLIAGKEAA+QEGFN+GFK
Subjt:  MEGTFAEELYSESLQSTKSKLGDMSFSDYRQNQSSGSGVDDSCQDDGSLWGGSDEGLEETSDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNIGFK

Query:  QSVSVGYKLGLVRGVSSVLASLPDDLKEKLTGIEENRSKFQNLYESVNSLSTAGALRLFNDDITAQGRKEDRVDANTNSQTIDLPKQISDYGHLGKFYGE
        QSVS+GYKLGLVRGVSSVLASLPDDLKEKL GIEENRSKFQ+LYESVNSLSTA ALRLF+DDI AQ  KE+ VDANT+SQTIDL KQ SDY  LG+FYGE
Subjt:  QSVSVGYKLGLVRGVSSVLASLPDDLKEKLTGIEENRSKFQNLYESVNSLSTAGALRLFNDDITAQGRKEDRVDANTNSQTIDLPKQISDYGHLGKFYGE

Query:  LQALLPKSPALNVHLQEDQ
        LQALLP SPALN+HL E+Q
Subjt:  LQALLPKSPALNVHLQEDQ

A0A6J1JRH1 uncharacterized protein LOC111487537 isoform X31.4e-9885.84Show/hide
Query:  MEGTFAEELYSESLQSTKSKLGDMSFSDYRQNQSSGSGVDDSCQDDGSLWGGSDEGLEETSDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNIGFK
        MEGT AEELYSESLQSTKSKL DMS SDYRQN+ SGS V+DSC+DDGSLWGGSDEGLEETSDLDREW RRHDQFHTIGYRDGLIAGKEAA+QEGFN+GFK
Subjt:  MEGTFAEELYSESLQSTKSKLGDMSFSDYRQNQSSGSGVDDSCQDDGSLWGGSDEGLEETSDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNIGFK

Query:  QSVSVGYKLGLVRGVSSVLASLPDDLKEKLTGIEENRSKFQNLYESVNSLSTAGALRLFNDDITAQGRKEDRVDANTNSQTIDLPKQISDYGHLGKFYGE
        QSVS+GYKLGLVRGVSSVLASLPDDLKEKL GIEENRSKFQ+LYESVNSLSTA ALRLF+DDI AQ  KE+ VDANT+SQTIDL KQ SDY  LG+FYGE
Subjt:  QSVSVGYKLGLVRGVSSVLASLPDDLKEKLTGIEENRSKFQNLYESVNSLSTAGALRLFNDDITAQGRKEDRVDANTNSQTIDLPKQISDYGHLGKFYGE

Query:  LQALLPKSPALNVHLQEDQ
        LQALLP SPALN+HL E+Q
Subjt:  LQALLPKSPALNVHLQEDQ

A0A6J1JSZ3 uncharacterized protein LOC111487537 isoform X11.4e-9885.84Show/hide
Query:  MEGTFAEELYSESLQSTKSKLGDMSFSDYRQNQSSGSGVDDSCQDDGSLWGGSDEGLEETSDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNIGFK
        MEGT AEELYSESLQSTKSKL DMS SDYRQN+ SGS V+DSC+DDGSLWGGSDEGLEETSDLDREW RRHDQFHTIGYRDGLIAGKEAA+QEGFN+GFK
Subjt:  MEGTFAEELYSESLQSTKSKLGDMSFSDYRQNQSSGSGVDDSCQDDGSLWGGSDEGLEETSDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNIGFK

Query:  QSVSVGYKLGLVRGVSSVLASLPDDLKEKLTGIEENRSKFQNLYESVNSLSTAGALRLFNDDITAQGRKEDRVDANTNSQTIDLPKQISDYGHLGKFYGE
        QSVS+GYKLGLVRGVSSVLASLPDDLKEKL GIEENRSKFQ+LYESVNSLSTA ALRLF+DDI AQ  KE+ VDANT+SQTIDL KQ SDY  LG+FYGE
Subjt:  QSVSVGYKLGLVRGVSSVLASLPDDLKEKLTGIEENRSKFQNLYESVNSLSTAGALRLFNDDITAQGRKEDRVDANTNSQTIDLPKQISDYGHLGKFYGE

Query:  LQALLPKSPALNVHLQEDQ
        LQALLP SPALN+HL E+Q
Subjt:  LQALLPKSPALNVHLQEDQ

SwissProt top hitse value%identityAlignment
Q9NRH1 Protein YAE1 homolog1.8e-0541.18Show/hide
Query:  DEGLEETSDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNIGFKQSVSVGYKLGLVRGVSSVLAS
        DE  +E+    REWQ    +    GYRDG+ AGK    Q+GFN G+K+   V    G +RG  S L S
Subjt:  DEGLEETSDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNIGFKQSVSVGYKLGLVRGVSSVLAS

Arabidopsis top hitse value%identityAlignment
AT1G34550.1 Protein of unknown function (DUF616)4.3e-0737.25Show/hide
Query:  SVLASLPDDLKEKLTGIEENRSKFQNLYESVNSLSTAGALRLFNDDITAQGRKEDRVDANTNSQTIDLPKQISDYGHLGKFYGELQALLPKSPALNVHLQ
        +VLA LPD+L+EKL   +E R KFQ L+  V++LST  A++ F   +T    KE    +  N+         SD+   G +  EL +LL KSP +   L 
Subjt:  SVLASLPDDLKEKLTGIEENRSKFQNLYESVNSLSTAGALRLFNDDITAQGRKEDRVDANTNSQTIDLPKQISDYGHLGKFYGELQALLPKSPALNVHLQ

Query:  ED
        +D
Subjt:  ED

AT1G34570.1 Essential protein Yae1, N-terminal1.6e-3340.95Show/hide
Query:  FAEELYSESLQSTKSKLGDMSFSDYRQNQSSGSGVDDSCQDDGSLWGGSDEGLEETSDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNIGFKQSVS
        FA+ELY ESLQ +K     +S  D+        G+++    D   +G SDE   E   LD E ++R  +FH+ GYRDG++ GKEA AQEG+N G+K+SV 
Subjt:  FAEELYSESLQSTKSKLGDMSFSDYRQNQSSGSGVDDSCQDDGSLWGGSDEGLEETSDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNIGFKQSVS

Query:  VGYKLGLVRGVSSVLASLPDDLKEKLTGIEENRSKFQNLYESVNSLSTAGALRLFNDDITA-QGRKEDRVDANTNSQTIDLPKQISDYGHLGKFYGELQA
         GYK G+VRGVSS LA LP + +EKL   +E R KFQ L+ SV++LST  A++ F + +T  QG ++   +   +         ++    LG +  EL +
Subjt:  VGYKLGLVRGVSSVLASLPDDLKEKLTGIEENRSKFQNLYESVNSLSTAGALRLFNDDITA-QGRKEDRVDANTNSQTIDLPKQISDYGHLGKFYGELQA

Query:  LLPKSPALNV
        LL KSP + V
Subjt:  LLPKSPALNV

AT3G15750.1 Essential protein Yae1, N-terminal9.5e-3142.46Show/hide
Query:  GSGVDDSCQDDGSLWGGSDEGLEETSDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNIGFKQSVSVGYKLGLVRGVSSVLASLPDDLKEKLTGIEE
        G  V  S  DD       +E   E   L  E ++R  +FH+ GYRDG++AGKEA AQEG+N G+K+SV  GY+ GLVRGVSS LA LPD+L+EKL   +E
Subjt:  GSGVDDSCQDDGSLWGGSDEGLEETSDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNIGFKQSVSVGYKLGLVRGVSSVLASLPDDLKEKLTGIEE

Query:  NRSKFQNLYESVNSLSTAGALRLFNDDITA-QGRKEDRVDANTNSQTIDLPKQISDYGHLGKFYGELQALLPKSPALNV
         R KFQ L+ SV++LST  A++ F + +T  QG ++   +   +         ++    LG +  EL +LL KSP + V
Subjt:  NRSKFQNLYESVNSLSTAGALRLFNDDITA-QGRKEDRVDANTNSQTIDLPKQISDYGHLGKFYGELQALLPKSPALNV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGGGTACCTTTGCTGAAGAACTTTATTCCGAGAGTTTGCAATCAACAAAATCAAAATTGGGTGATATGTCATTTTCTGATTACAGACAAAATCAGTCATCAGGTTC
TGGTGTGGATGACTCGTGCCAAGATGATGGATCTTTATGGGGTGGTTCTGATGAAGGCTTGGAGGAAACATCTGATCTGGACAGGGAGTGGCAGAGGAGACATGACCAAT
TTCATACGATTGGATACCGTGATGGTTTAATCGCTGGGAAAGAAGCTGCAGCTCAAGAGGGATTTAATATTGGCTTCAAGCAGTCGGTCTCTGTCGGGTATAAGTTGGGT
CTTGTCAGAGGTGTTAGCAGTGTGCTTGCTTCCCTTCCTGATGACTTGAAAGAAAAGCTAACGGGAATCGAAGAGAACAGAAGTAAATTCCAAAACTTGTATGAATCTGT
GAACTCTCTTTCAACAGCAGGTGCGCTTAGGCTATTCAACGACGATATTACGGCGCAAGGCAGGAAAGAAGATCGTGTCGATGCAAATACAAATTCCCAAACAATAGATT
TGCCGAAGCAAATATCAGATTATGGTCATCTAGGGAAGTTCTATGGAGAGCTTCAAGCACTTTTACCCAAATCACCTGCTCTGAATGTTCATCTACAAGAAGATCAGCCA
AATTTTACAGTGGCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAGGGTACCTTTGCTGAAGAACTTTATTCCGAGAGTTTGCAATCAACAAAATCAAAATTGGGTGATATGTCATTTTCTGATTACAGACAAAATCAGTCATCAGGTTC
TGGTGTGGATGACTCGTGCCAAGATGATGGATCTTTATGGGGTGGTTCTGATGAAGGCTTGGAGGAAACATCTGATCTGGACAGGGAGTGGCAGAGGAGACATGACCAAT
TTCATACGATTGGATACCGTGATGGTTTAATCGCTGGGAAAGAAGCTGCAGCTCAAGAGGGATTTAATATTGGCTTCAAGCAGTCGGTCTCTGTCGGGTATAAGTTGGGT
CTTGTCAGAGGTGTTAGCAGTGTGCTTGCTTCCCTTCCTGATGACTTGAAAGAAAAGCTAACGGGAATCGAAGAGAACAGAAGTAAATTCCAAAACTTGTATGAATCTGT
GAACTCTCTTTCAACAGCAGGTGCGCTTAGGCTATTCAACGACGATATTACGGCGCAAGGCAGGAAAGAAGATCGTGTCGATGCAAATACAAATTCCCAAACAATAGATT
TGCCGAAGCAAATATCAGATTATGGTCATCTAGGGAAGTTCTATGGAGAGCTTCAAGCACTTTTACCCAAATCACCTGCTCTGAATGTTCATCTACAAGAAGATCAGCCA
AATTTTACAGTGGCTTAG
Protein sequenceShow/hide protein sequence
MEGTFAEELYSESLQSTKSKLGDMSFSDYRQNQSSGSGVDDSCQDDGSLWGGSDEGLEETSDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNIGFKQSVSVGYKLG
LVRGVSSVLASLPDDLKEKLTGIEENRSKFQNLYESVNSLSTAGALRLFNDDITAQGRKEDRVDANTNSQTIDLPKQISDYGHLGKFYGELQALLPKSPALNVHLQEDQP
NFTVA