; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0005632 (gene) of Snake gourd v1 genome

Gene IDTan0005632
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionprotein LSM12 homolog A
Genome locationLG04:11114024..11121079
RNA-Seq ExpressionTan0005632
SyntenyTan0005632
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0030198 - extracellular matrix organization (biological process)
GO:0030574 - collagen catabolic process (biological process)
GO:0031012 - extracellular matrix (cellular component)
GO:0004222 - metalloendopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR019181 - Anticodon-binding domain
IPR039683 - Protein Lsm12-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004135229.1 protein LSM12 homolog A [Cucumis sativus]3.0e-9095.58Show/hide
Query:  MALDGSVNGDDFTVGSFFSIKTTLGDEFQGQVITFDRPSNILVLQEGSKPGPRRNIRLLKANYIKEFSFLGHGEDPLDLKKCYLDLNTLRAREELAIRQA
        MALDGS NGDDF+VGSFFSIKTTLGDEFQGQVITFDRPSNILVLQEGSKPGPRRNIRLLKANYIKEFSFLGHGEDPLDLK CYLDLNTLRAREELAIRQA
Subjt:  MALDGSVNGDDFTVGSFFSIKTTLGDEFQGQVITFDRPSNILVLQEGSKPGPRRNIRLLKANYIKEFSFLGHGEDPLDLKKCYLDLNTLRAREELAIRQA

Query:  EAEAERIGVGVTSEAQTIFDALSKTLPVRWDKTVIVVMNEVRVSNPYLPESVTGGTPAANERVKKVLELERKRLQVRGGSQ
        EAEAERIGVGVTSEAQ+IFDALSKTLPVRWDKTVIVVMNEVRVS+PYLPESV+GGTPAANERVKKVLE ERKRLQVRGG Q
Subjt:  EAEAERIGVGVTSEAQTIFDALSKTLPVRWDKTVIVVMNEVRVSNPYLPESVTGGTPAANERVKKVLELERKRLQVRGGSQ

XP_022151106.1 protein LSM12 homolog [Momordica charantia]1.5e-8995.03Show/hide
Query:  MALDGSVNGDDFTVGSFFSIKTTLGDEFQGQVITFDRPSNILVLQEGSKPGPRRNIRLLKANYIKEFSFLGHGEDPLDLKKCYLDLNTLRAREELAIRQA
        MALDGS NGDDF VGSFFSIKTTLGDEFQGQVITFDRPSNILVLQEGSKPGP RNIRLLKANYIKEF+FLGHGEDPLDLKKCYLDLN+LRAREELAIRQA
Subjt:  MALDGSVNGDDFTVGSFFSIKTTLGDEFQGQVITFDRPSNILVLQEGSKPGPRRNIRLLKANYIKEFSFLGHGEDPLDLKKCYLDLNTLRAREELAIRQA

Query:  EAEAERIGVGVTSEAQTIFDALSKTLPVRWDKTVIVVMNEVRVSNPYLPESVTGGTPAANERVKKVLELERKRLQVRGGSQ
        EAEAERIGVGVTSEAQ+IFDALSKTLPVRWDKTVIVVMNEVRVS+PYL ESVTGGTPAANERVKKVLELERKRLQVRGG Q
Subjt:  EAEAERIGVGVTSEAQTIFDALSKTLPVRWDKTVIVVMNEVRVSNPYLPESVTGGTPAANERVKKVLELERKRLQVRGGSQ

XP_022942326.1 protein LSM12 homolog A [Cucurbita moschata]8.3e-8892.27Show/hide
Query:  MALDGSVNGDDFTVGSFFSIKTTLGDEFQGQVITFDRPSNILVLQEGSKPGPRRNIRLLKANYIKEFSFLGHGEDPLDLKKCYLDLNTLRAREELAIRQA
        MALDG  NGDDFTVGSFFSIKTTLGDEFQGQVITFD  SNIL+LQEGSK GPRRNIRLLKANYIKEFSFLGHGEDP+DLKKCYLD+NTLRAREELAIRQA
Subjt:  MALDGSVNGDDFTVGSFFSIKTTLGDEFQGQVITFDRPSNILVLQEGSKPGPRRNIRLLKANYIKEFSFLGHGEDPLDLKKCYLDLNTLRAREELAIRQA

Query:  EAEAERIGVGVTSEAQTIFDALSKTLPVRWDKTVIVVMNEVRVSNPYLPESVTGGTPAANERVKKVLELERKRLQVRGGSQ
        E +AERIGVGVTSEAQ+IFDALSKTLPVRWDK+VIVVMNEVRVSNPYLPESVTGGTPAAN+RVKKVLELERKRLQVRGG Q
Subjt:  EAEAERIGVGVTSEAQTIFDALSKTLPVRWDKTVIVVMNEVRVSNPYLPESVTGGTPAANERVKKVLELERKRLQVRGGSQ

XP_022985626.1 protein LSM12 homolog A [Cucurbita maxima]3.7e-8892.82Show/hide
Query:  MALDGSVNGDDFTVGSFFSIKTTLGDEFQGQVITFDRPSNILVLQEGSKPGPRRNIRLLKANYIKEFSFLGHGEDPLDLKKCYLDLNTLRAREELAIRQA
        MALDG  NGDDFTVGSFFSIKTTLGDEFQGQVITFD  SNIL+LQEGSK GPRRNIRLLKANYIKEFSFLGHGEDP+DLKKCYLD+NTLRAREELAIRQA
Subjt:  MALDGSVNGDDFTVGSFFSIKTTLGDEFQGQVITFDRPSNILVLQEGSKPGPRRNIRLLKANYIKEFSFLGHGEDPLDLKKCYLDLNTLRAREELAIRQA

Query:  EAEAERIGVGVTSEAQTIFDALSKTLPVRWDKTVIVVMNEVRVSNPYLPESVTGGTPAANERVKKVLELERKRLQVRGGSQ
        E EAERIGVGVTSEAQ+IFDALSKTLPVRWDK+VIVVMNEVRVSNPYLPESVTGGTPAAN+RVKKVLELERKRLQVRGG Q
Subjt:  EAEAERIGVGVTSEAQTIFDALSKTLPVRWDKTVIVVMNEVRVSNPYLPESVTGGTPAANERVKKVLELERKRLQVRGGSQ

XP_038892233.1 protein LSM12 homolog A-like [Benincasa hispida]3.0e-9095.58Show/hide
Query:  MALDGSVNGDDFTVGSFFSIKTTLGDEFQGQVITFDRPSNILVLQEGSKPGPRRNIRLLKANYIKEFSFLGHGEDPLDLKKCYLDLNTLRAREELAIRQA
        MALDGS NGDDF+VGSFFSIKTTLGDEFQGQVITFDRPSNILVLQEGSKPGPRRNIRLLKANYIKEFSFLGHGEDPLDLK CYLDLNTLRAREELAIRQA
Subjt:  MALDGSVNGDDFTVGSFFSIKTTLGDEFQGQVITFDRPSNILVLQEGSKPGPRRNIRLLKANYIKEFSFLGHGEDPLDLKKCYLDLNTLRAREELAIRQA

Query:  EAEAERIGVGVTSEAQTIFDALSKTLPVRWDKTVIVVMNEVRVSNPYLPESVTGGTPAANERVKKVLELERKRLQVRGGSQ
        EAEAERIGVGVTSEAQ+IFDALSKTLPVRWDKTVIVVMNEVRVS+PYLPESV+GGTPAANERVKKVLELERKRLQVRG  Q
Subjt:  EAEAERIGVGVTSEAQTIFDALSKTLPVRWDKTVIVVMNEVRVSNPYLPESVTGGTPAANERVKKVLELERKRLQVRGGSQ

TrEMBL top hitse value%identityAlignment
A0A0A0KQA9 AD domain-containing protein1.5e-9095.58Show/hide
Query:  MALDGSVNGDDFTVGSFFSIKTTLGDEFQGQVITFDRPSNILVLQEGSKPGPRRNIRLLKANYIKEFSFLGHGEDPLDLKKCYLDLNTLRAREELAIRQA
        MALDGS NGDDF+VGSFFSIKTTLGDEFQGQVITFDRPSNILVLQEGSKPGPRRNIRLLKANYIKEFSFLGHGEDPLDLK CYLDLNTLRAREELAIRQA
Subjt:  MALDGSVNGDDFTVGSFFSIKTTLGDEFQGQVITFDRPSNILVLQEGSKPGPRRNIRLLKANYIKEFSFLGHGEDPLDLKKCYLDLNTLRAREELAIRQA

Query:  EAEAERIGVGVTSEAQTIFDALSKTLPVRWDKTVIVVMNEVRVSNPYLPESVTGGTPAANERVKKVLELERKRLQVRGGSQ
        EAEAERIGVGVTSEAQ+IFDALSKTLPVRWDKTVIVVMNEVRVS+PYLPESV+GGTPAANERVKKVLE ERKRLQVRGG Q
Subjt:  EAEAERIGVGVTSEAQTIFDALSKTLPVRWDKTVIVVMNEVRVSNPYLPESVTGGTPAANERVKKVLELERKRLQVRGGSQ

A0A6J1DDK3 protein LSM12 homolog7.3e-9095.03Show/hide
Query:  MALDGSVNGDDFTVGSFFSIKTTLGDEFQGQVITFDRPSNILVLQEGSKPGPRRNIRLLKANYIKEFSFLGHGEDPLDLKKCYLDLNTLRAREELAIRQA
        MALDGS NGDDF VGSFFSIKTTLGDEFQGQVITFDRPSNILVLQEGSKPGP RNIRLLKANYIKEF+FLGHGEDPLDLKKCYLDLN+LRAREELAIRQA
Subjt:  MALDGSVNGDDFTVGSFFSIKTTLGDEFQGQVITFDRPSNILVLQEGSKPGPRRNIRLLKANYIKEFSFLGHGEDPLDLKKCYLDLNTLRAREELAIRQA

Query:  EAEAERIGVGVTSEAQTIFDALSKTLPVRWDKTVIVVMNEVRVSNPYLPESVTGGTPAANERVKKVLELERKRLQVRGGSQ
        EAEAERIGVGVTSEAQ+IFDALSKTLPVRWDKTVIVVMNEVRVS+PYL ESVTGGTPAANERVKKVLELERKRLQVRGG Q
Subjt:  EAEAERIGVGVTSEAQTIFDALSKTLPVRWDKTVIVVMNEVRVSNPYLPESVTGGTPAANERVKKVLELERKRLQVRGGSQ

A0A6J1FNK2 protein LSM12 homolog A4.0e-8892.27Show/hide
Query:  MALDGSVNGDDFTVGSFFSIKTTLGDEFQGQVITFDRPSNILVLQEGSKPGPRRNIRLLKANYIKEFSFLGHGEDPLDLKKCYLDLNTLRAREELAIRQA
        MALDG  NGDDFTVGSFFSIKTTLGDEFQGQVITFD  SNIL+LQEGSK GPRRNIRLLKANYIKEFSFLGHGEDP+DLKKCYLD+NTLRAREELAIRQA
Subjt:  MALDGSVNGDDFTVGSFFSIKTTLGDEFQGQVITFDRPSNILVLQEGSKPGPRRNIRLLKANYIKEFSFLGHGEDPLDLKKCYLDLNTLRAREELAIRQA

Query:  EAEAERIGVGVTSEAQTIFDALSKTLPVRWDKTVIVVMNEVRVSNPYLPESVTGGTPAANERVKKVLELERKRLQVRGGSQ
        E +AERIGVGVTSEAQ+IFDALSKTLPVRWDK+VIVVMNEVRVSNPYLPESVTGGTPAAN+RVKKVLELERKRLQVRGG Q
Subjt:  EAEAERIGVGVTSEAQTIFDALSKTLPVRWDKTVIVVMNEVRVSNPYLPESVTGGTPAANERVKKVLELERKRLQVRGGSQ

A0A6J1GZX6 protein LSM12 homolog A2.6e-8794.97Show/hide
Query:  MALDGSVNGDDFTVGSFFSIKTTLGDEFQGQVITFDRPSNILVLQEGSKPGPRRNIRLLKANYIKEFSFLGHGEDPLDLKKCYLDLNTLRAREELAIRQA
        MALD S NGDDF+VGSFFSIKTTLGDEFQ QVITFDRPSNILVLQEGSKPGPRRNIRLLKANYIKEFSFLGHGEDPLDLK CYLDLNTLRAREELAIRQA
Subjt:  MALDGSVNGDDFTVGSFFSIKTTLGDEFQGQVITFDRPSNILVLQEGSKPGPRRNIRLLKANYIKEFSFLGHGEDPLDLKKCYLDLNTLRAREELAIRQA

Query:  EAEAERIGVGVTSEAQTIFDALSKTLPVRWDKTVIVVMNEVRVSNPYLPESVTGGTPAANERVKKVLELERKRLQVRGG
        E EAERIGVGVTSEAQ+IFDALSKTLPVRWDKTVIVVMNEVRVS+PYLPESV+GGTPAANERVKKVLELERKRLQVRGG
Subjt:  EAEAERIGVGVTSEAQTIFDALSKTLPVRWDKTVIVVMNEVRVSNPYLPESVTGGTPAANERVKKVLELERKRLQVRGG

A0A6J1J8S1 protein LSM12 homolog A1.8e-8892.82Show/hide
Query:  MALDGSVNGDDFTVGSFFSIKTTLGDEFQGQVITFDRPSNILVLQEGSKPGPRRNIRLLKANYIKEFSFLGHGEDPLDLKKCYLDLNTLRAREELAIRQA
        MALDG  NGDDFTVGSFFSIKTTLGDEFQGQVITFD  SNIL+LQEGSK GPRRNIRLLKANYIKEFSFLGHGEDP+DLKKCYLD+NTLRAREELAIRQA
Subjt:  MALDGSVNGDDFTVGSFFSIKTTLGDEFQGQVITFDRPSNILVLQEGSKPGPRRNIRLLKANYIKEFSFLGHGEDPLDLKKCYLDLNTLRAREELAIRQA

Query:  EAEAERIGVGVTSEAQTIFDALSKTLPVRWDKTVIVVMNEVRVSNPYLPESVTGGTPAANERVKKVLELERKRLQVRGGSQ
        E EAERIGVGVTSEAQ+IFDALSKTLPVRWDK+VIVVMNEVRVSNPYLPESVTGGTPAAN+RVKKVLELERKRLQVRGG Q
Subjt:  EAEAERIGVGVTSEAQTIFDALSKTLPVRWDKTVIVVMNEVRVSNPYLPESVTGGTPAANERVKKVLELERKRLQVRGGSQ

SwissProt top hitse value%identityAlignment
Q0VCF9 Protein LSM12 homolog4.5e-1231.71Show/hide
Query:  GDDFTVGSFFSIKTTLGDEFQGQVITFDRPSNILVLQEGSKPGP--RRNIRLLKANYIKEFSFLG-HGEDPLDLKKCYLDLNTLRAREELAIRQAEAEAE
        G+ F+VGS  S +T      QG+V+ FD  S +L L+  S  G     +I L+   Y+ E   +    E P  L    +     +AR E   + ++A A 
Subjt:  GDDFTVGSFFSIKTTLGDEFQGQVITFDRPSNILVLQEGSKPGP--RRNIRLLKANYIKEFSFLG-HGEDPLDLKKCYLDLNTLRAREELAIRQAEAEAE

Query:  RIGVGVTSEAQTIFDALSKTL-PVRWDKTVIVVMNEVRVSNPYLPESVTGGTPAANERVKKVLE
         I  GV+ E Q +F  + KT+   +W +  IVVM EV ++ PY  E+  G   +A   V+K++E
Subjt:  RIGVGVTSEAQTIFDALSKTL-PVRWDKTVIVVMNEVRVSNPYLPESVTGGTPAANERVKKVLE

Q5ZML5 Protein LSM12 homolog2.0e-1231.1Show/hide
Query:  GDDFTVGSFFSIKTTLGDEFQGQVITFDRPSNILVLQEGSKPGP--RRNIRLLKANYIKEFSFLG-HGEDPLDLKKCYLDLNTLRAREELAIRQAEAEAE
        G+ F+VGS  S +T      QG+V+ FD PS +L L+  S  G     +I L+   Y+ E   +    E P  L    L+++ L  +  +   +  ++A 
Subjt:  GDDFTVGSFFSIKTTLGDEFQGQVITFDRPSNILVLQEGSKPGP--RRNIRLLKANYIKEFSFLG-HGEDPLDLKKCYLDLNTLRAREELAIRQAEAEAE

Query:  RIGVGVTSEAQTIFDALSKTL-PVRWDKTVIVVMNEVRVSNPYLPESVTGGTPAANERVKKVLE
         I  GV+ E Q +F  + KT+   +W +  IVVM EV ++ PY  E+  G   +A   V+K++E
Subjt:  RIGVGVTSEAQTIFDALSKTL-PVRWDKTVIVVMNEVRVSNPYLPESVTGGTPAANERVKKVLE

Q6GP89 Protein LSM12 homolog1.2e-1229.38Show/hide
Query:  GDDFTVGSFFSIKTTLGDEFQGQVITFDRPSNILVLQEGSKPGP--RRNIRLLKANYIKEFSFLG-HGEDPLDLKKCYLDLNTLRAREELAIRQAEAEAE
        G+ F +G++ S +T      QG+V+ FD PS +L L+  S  G     +I LL  +Y+ +   +    + P  L    L++  L +R  L   +  ++A 
Subjt:  GDDFTVGSFFSIKTTLGDEFQGQVITFDRPSNILVLQEGSKPGP--RRNIRLLKANYIKEFSFLG-HGEDPLDLKKCYLDLNTLRAREELAIRQAEAEAE

Query:  RIGVGVTSEAQTIFDALSKTL-PVRWDKTVIVVMNEVRVSNPYLPESVTGGTPAANERVKKVLELERKRLQVRGGSQ
         I  GV+ + Q +F  + KT+   +W +  IVVM+EV +S PY  E+  G    A   V K++E   + ++ + G Q
Subjt:  RIGVGVTSEAQTIFDALSKTL-PVRWDKTVIVVMNEVRVSNPYLPESVTGGTPAANERVKKVLELERKRLQVRGGSQ

Q6NSN1 Protein LSM12 homolog B7.0e-1331.1Show/hide
Query:  GDDFTVGSFFSIKTTLGDEFQGQVITFDRPSNILVLQEGSKPGPRR--NIRLLKANYIKEFSFL-GHGEDPLDLKKCYLDLNTLRAREELAIRQAEAEAE
        G+ F+VGS  S  T LG   QG+V+ FD PS +L L+     G     ++ L+   Y+ E   +    E P  L     +    RAR E   + + A A 
Subjt:  GDDFTVGSFFSIKTTLGDEFQGQVITFDRPSNILVLQEGSKPGPRR--NIRLLKANYIKEFSFL-GHGEDPLDLKKCYLDLNTLRAREELAIRQAEAEAE

Query:  RIGVGVTSEAQTIFDALSKTL-PVRWDKTVIVVMNEVRVSNPYLPESVTGGTPAANERVKKVLE
         +  GV+ E Q +F  + KT+   +W +  I+VM++V +S PY  ++  G   +A   V+K++E
Subjt:  RIGVGVTSEAQTIFDALSKTL-PVRWDKTVIVVMNEVRVSNPYLPESVTGGTPAANERVKKVLE

Q6PBA2 Protein LSM12 homolog A4.8e-1432.32Show/hide
Query:  GDDFTVGSFFSIKTTLGDEFQGQVITFDRPSNILVLQEGSKPGPRR--NIRLLKANYIKEFSFLG-HGEDPLDLKKCYLDLNTLRAREELAIRQAEAEAE
        G+ F+VGS  S  T LG   QG+V+ FD PS +L L+  S  G     ++ L+   Y+ E   +    E P  L    +     RAR E   + ++A A 
Subjt:  GDDFTVGSFFSIKTTLGDEFQGQVITFDRPSNILVLQEGSKPGPRR--NIRLLKANYIKEFSFLG-HGEDPLDLKKCYLDLNTLRAREELAIRQAEAEAE

Query:  RIGVGVTSEAQTIFDALSKTL-PVRWDKTVIVVMNEVRVSNPYLPESVTGGTPAANERVKKVLE
         I  GV+ E Q +F  + KT+   +W +  I+VM++V +S PY  E+  G   +A   ++K++E
Subjt:  RIGVGVTSEAQTIFDALSKTL-PVRWDKTVIVVMNEVRVSNPYLPESVTGGTPAANERVKKVLE

Arabidopsis top hitse value%identityAlignment
AT1G24050.1 RNA-processing, Lsm domain7.3e-5058.14Show/hide
Query:  GDDFTVGSFFSIKTTLGDEFQGQVITFDRPSNILVLQEGSKPGP--RRNIRLLKANYIKEFSFLGHGEDPLDLKKCYLDLNTLRAREELAIRQAEAEAER
        G+ F VG+ +S+K   GDEF+G V+ +D   N +  +EG+KP P   +N R++ A++I   S+LG  EDPLD     +DLN LRA+E LAIRQAEA+AER
Subjt:  GDDFTVGSFFSIKTTLGDEFQGQVITFDRPSNILVLQEGSKPGP--RRNIRLLKANYIKEFSFLGHGEDPLDLKKCYLDLNTLRAREELAIRQAEAEAER

Query:  IGVGVTSEAQTIFDALSKTLPVRWDKTVIVVMNEVRVSNPYLPESVTGGTPAANERVKKVLELERKRLQVRG
        +GVGVT+EAQ+IFDALSKTLPV+W+ + I+VM EVRV +PYL + V GGT AAN RVKKVLELER+RLQ+ G
Subjt:  IGVGVTSEAQTIFDALSKTLPVRWDKTVIVVMNEVRVSNPYLPESVTGGTPAANERVKKVLELERKRLQVRG

AT1G70220.1 RNA-processing, Lsm domain1.6e-2533.8Show/hide
Query:  LDGSVNGDDFTVGSFFSIKTTLGDEFQGQVITFDRPSN----------------------------------------ILVLQEGSKPGP--RRNIRLLK
        +D     + F VG  +++K T GD+F G V+ +D   N                                        +++LQEG+KP P   +++R++ 
Subjt:  LDGSVNGDDFTVGSFFSIKTTLGDEFQGQVITFDRPSN----------------------------------------ILVLQEGSKPGP--RRNIRLLK

Query:  ANYIKEFSFLGHGEDPLDLKKCYLDLNTLRAREELAIRQAEAEAERIGVGVTSEAQTIFDALSKTLPVRWDKTVIVVMNEVRVSNPYLPESVTGGTPAAN
         NYI E   LG  ++ L  K   ++L+ L  +E  AI    +  E+IG GVT+E Q IFDA+SKTLP+RW    ++VM +V + +PY  + V GG    N
Subjt:  ANYIKEFSFLGHGEDPLDLKKCYLDLNTLRAREELAIRQAEAEAERIGVGVTSEAQTIFDALSKTLPVRWDKTVIVVMNEVRVSNPYLPESVTGGTPAAN

Query:  ERVKKVLELERKRLQV
        ERVK VL  ERK+LQ+
Subjt:  ERVKKVLELERKRLQV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCACTGGACGGTAGCGTCAATGGGGATGACTTCACTGTTGGGTCCTTCTTCTCCATTAAGACGACCTTAGGCGATGAATTTCAAGGACAAGTCATTACCTTTGACCG
CCCCTCCAACATCCTCGTCCTTCAGGAGGGTTCGAAGCCAGGACCTCGTAGGAACATAAGGCTGCTGAAGGCCAATTATATTAAGGAGTTTTCGTTTTTGGGACATGGTG
AAGATCCTCTTGATCTCAAGAAGTGTTACCTCGATCTCAACACTCTCCGTGCTCGAGAGGAACTGGCCATTAGGCAAGCGGAGGCAGAGGCGGAGAGGATAGGAGTGGGT
GTGACCAGCGAGGCCCAGACTATTTTTGACGCCTTATCTAAAACGCTTCCTGTGCGCTGGGACAAGACTGTCATAGTTGTAATGAATGAAGTACGTGTTAGCAATCCGTA
TCTACCAGAATCTGTTACTGGAGGCACCCCTGCTGCCAATGAGCGGGTGAAGAAAGTGCTTGAACTGGAGAGGAAGCGGTTGCAAGTTCGTGGCGGTAGTCAGTGA
mRNA sequenceShow/hide mRNA sequence
GTTGAAGAAAAGCCCTAATTTATATGGAGATTAGAAAATTAGAGTCGTTACCCAACTCCTTCAGGAGGAGGAAGTGACGGTTCCAACTTTTACAGCGAGGAAGAAGGCGA
AAATGGCACTGGACGGTAGCGTCAATGGGGATGACTTCACTGTTGGGTCCTTCTTCTCCATTAAGACGACCTTAGGCGATGAATTTCAAGGACAAGTCATTACCTTTGAC
CGCCCCTCCAACATCCTCGTCCTTCAGGAGGGTTCGAAGCCAGGACCTCGTAGGAACATAAGGCTGCTGAAGGCCAATTATATTAAGGAGTTTTCGTTTTTGGGACATGG
TGAAGATCCTCTTGATCTCAAGAAGTGTTACCTCGATCTCAACACTCTCCGTGCTCGAGAGGAACTGGCCATTAGGCAAGCGGAGGCAGAGGCGGAGAGGATAGGAGTGG
GTGTGACCAGCGAGGCCCAGACTATTTTTGACGCCTTATCTAAAACGCTTCCTGTGCGCTGGGACAAGACTGTCATAGTTGTAATGAATGAAGTACGTGTTAGCAATCCG
TATCTACCAGAATCTGTTACTGGAGGCACCCCTGCTGCCAATGAGCGGGTGAAGAAAGTGCTTGAACTGGAGAGGAAGCGGTTGCAAGTTCGTGGCGGTAGTCAGTGATG
GTTATTCATCCCTATCCTTCTTTACTGGGTGACAGGCGATTGCCTGTACTGAGCCTTTTTGCTAATGGGGGGAAGGTATAATCTCTTTTAGAGGCATCTCAGTGTCAGCA
ATCATGAAGAGATTTCGTGTTTAAGTGGTACCTAATCTGTTGCCAAAGTTTTTATGTTTATTGGACTTCTGGTTGTTGATTTTCTCCTGTTTATGGACTGCACTGGACAT
ACCCTGGCAAAGCCTTTTGGGGGTCCATATTTAAAGTTTTCCAAGATTTGCTCTGTTGCAATGTATCAAATTTGAAATGAGCAAACTCGATGTAATAACTCCCCACCTTT
TTTTCAATCATAATGTGGAAAAACTAACCCTGATGTATTTGGGAAACTGCAAGAATAGGTTTCTCATGTTTTACCTCTCAACCTTTTCTCTGTTAAGTATCAAGCATCAT
GAGCAGGGACAGGTTTCGAAATATTTGACGCACGGGGTGTCTAATGGCCGAACATTGATAATCGTTTGGGGCCTAAGATCAGCCCTATATTTGTCATCTGCTCATCTTTC
GATCTCATTAGTTAGAGTTATGAAAAAACCCATTTTGTAGATCTTGTGGATATTTTCTGTTGAAGAAGAAAGCCGTAGAGGCAAATGGAAAGCTTCAACAGGGTACTTGG
TATGGCGCCAAAAGAACGCTGATTTTTCTTTTTCATACCCTTTCTTTTTCCCTTTGTTATGCTTGGCCCTTTGCTTTTCCTAATGGCC
Protein sequenceShow/hide protein sequence
MALDGSVNGDDFTVGSFFSIKTTLGDEFQGQVITFDRPSNILVLQEGSKPGPRRNIRLLKANYIKEFSFLGHGEDPLDLKKCYLDLNTLRAREELAIRQAEAEAERIGVG
VTSEAQTIFDALSKTLPVRWDKTVIVVMNEVRVSNPYLPESVTGGTPAANERVKKVLELERKRLQVRGGSQ