; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0000859 (gene) of Snake gourd v1 genome

Gene IDTan0000859
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionMetal-independent phosphoserine phosphatase
Genome locationLG05:8976902..8984406
RNA-Seq ExpressionTan0000859
SyntenyTan0000859
Gene Ontology termsNA
InterPro domainsIPR013078 - Histidine phosphatase superfamily, clade-1
IPR029033 - Histidine phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6604968.1 hypothetical protein SDJN03_02285, partial [Cucurbita argyrosperma subsp. sororia]4.5e-11592.07Show/hide
Query:  MATASFLSNRYWILRHGKSIPNEKGLIVSSIENGTLPEYQLASEGVGQAQLAGEQFLKELKENSIPLENVQICYSPFSRTIHTARVAASAMNLPFEGPQC
        MATA FL N+YWILRHGKSIPNEKGLIVSSIENGTLPEYQLASEGVGQAQLAGEQFLKELKENSIPLENV+ICYSPFSRTIHTA+VAASA+NLPFE PQC
Subjt:  MATASFLSNRYWILRHGKSIPNEKGLIVSSIENGTLPEYQLASEGVGQAQLAGEQFLKELKENSIPLENVQICYSPFSRTIHTARVAASAMNLPFEGPQC

Query:  KMMEDLRERYFGPSFELLSHDKYAEIWALDEEDPFKRPEGGESVEDVASRLAKAILQIESQFQGCAILVVSHGDPLQIFQTVVGAAKQENGSNSDELSSR
        KMM+DLRERYFGPSFELLSHDKYAEIWALDEEDPFKRPEGGESVEDVASRLAKA+LQ+ESQFQGCA+LVVSHGDPLQIFQTV+GAA  ENGS SDEL+SR
Subjt:  KMMEDLRERYFGPSFELLSHDKYAEIWALDEEDPFKRPEGGESVEDVASRLAKAILQIESQFQGCAILVVSHGDPLQIFQTVVGAAKQENGSNSDELSSR

Query:  LQAVITKPILSQHRKFALLTGELRPVV
        LQA ITKPILSQHRKFALLTGELR VV
Subjt:  LQAVITKPILSQHRKFALLTGELRPVV

XP_022140404.1 uncharacterized protein LOC111011089 [Momordica charantia]7.9e-11289.91Show/hide
Query:  MATASFLSNRYWILRHGKSIPNEKGLIVSSIENGTLPEYQLASEGVGQAQLAGEQFLKELKENSIPLENVQICYSPFSRTIHTARVAASAMNLPFEGPQC
        M TASFL NRYW+LRHGKSIPNEKGLIVSSIENGTLPEYQLA+EGVGQAQLAGEQFLKELKENSI LENV+ICYSPFSRTIHTA+VAASA+N+PFEGPQC
Subjt:  MATASFLSNRYWILRHGKSIPNEKGLIVSSIENGTLPEYQLASEGVGQAQLAGEQFLKELKENSIPLENVQICYSPFSRTIHTARVAASAMNLPFEGPQC

Query:  KMMEDLRERYFGPSFELLSHDKYAEIWALDEEDPFKRP-EGGESVEDVASRLAKAILQIESQFQGCAILVVSHGDPLQIFQTVVGAAKQENGSNSDELSS
        KM+EDLRERYFGPSFEL+SHDKYA+IWALDEEDPFKRP EGGESVEDVASRLAKAILQIESQFQGCAIL+VSHGDPLQIFQTVVGAAKQE+ S+SDEL S
Subjt:  KMMEDLRERYFGPSFELLSHDKYAEIWALDEEDPFKRP-EGGESVEDVASRLAKAILQIESQFQGCAILVVSHGDPLQIFQTVVGAAKQENGSNSDELSS

Query:  RLQAVITKPILSQHRKFALLTGELRPVV
        +LQAVITK +LSQHRKFALLTGELR V+
Subjt:  RLQAVITKPILSQHRKFALLTGELRPVV

XP_022948064.1 uncharacterized protein LOC111451758 isoform X1 [Cucurbita moschata]1.0e-11492.07Show/hide
Query:  MATASFLSNRYWILRHGKSIPNEKGLIVSSIENGTLPEYQLASEGVGQAQLAGEQFLKELKENSIPLENVQICYSPFSRTIHTARVAASAMNLPFEGPQC
        MATA FL NRYWILRHGKSIPNEKGLIVSSIENGTLPEYQLASEGVGQAQLAGEQFLKELKENSIPLENV+ICYSPFSRTIHTA+VAASA+NLPFE P C
Subjt:  MATASFLSNRYWILRHGKSIPNEKGLIVSSIENGTLPEYQLASEGVGQAQLAGEQFLKELKENSIPLENVQICYSPFSRTIHTARVAASAMNLPFEGPQC

Query:  KMMEDLRERYFGPSFELLSHDKYAEIWALDEEDPFKRPEGGESVEDVASRLAKAILQIESQFQGCAILVVSHGDPLQIFQTVVGAAKQENGSNSDELSSR
        KMM+DLRERYFGPSFELLSHDKYAEIWALDEEDPFKRPEGGESVEDVASRLAKA+LQ+ESQFQGCA+LVVSHGDPLQIFQTV+GAA  ENGS SDEL+SR
Subjt:  KMMEDLRERYFGPSFELLSHDKYAEIWALDEEDPFKRPEGGESVEDVASRLAKAILQIESQFQGCAILVVSHGDPLQIFQTVVGAAKQENGSNSDELSSR

Query:  LQAVITKPILSQHRKFALLTGELRPVV
        LQA ITKPILSQHRKFALLTGELR VV
Subjt:  LQAVITKPILSQHRKFALLTGELRPVV

XP_022971095.1 uncharacterized protein LOC111469867 [Cucurbita maxima]7.7e-11592.51Show/hide
Query:  MATASFLSNRYWILRHGKSIPNEKGLIVSSIENGTLPEYQLASEGVGQAQLAGEQFLKELKENSIPLENVQICYSPFSRTIHTARVAASAMNLPFEGPQC
        MATA FL NRYWILRHGKSIPNEKGLIVSSIENGTLPEYQLASEGVGQAQLAGEQFLKELKENSIPLENV+ICYSPFSRTIHTA+V ASA+NLPFE PQC
Subjt:  MATASFLSNRYWILRHGKSIPNEKGLIVSSIENGTLPEYQLASEGVGQAQLAGEQFLKELKENSIPLENVQICYSPFSRTIHTARVAASAMNLPFEGPQC

Query:  KMMEDLRERYFGPSFELLSHDKYAEIWALDEEDPFKRPEGGESVEDVASRLAKAILQIESQFQGCAILVVSHGDPLQIFQTVVGAAKQENGSNSDELSSR
        KMMEDLRERYFGPSFELLSHDKYAEIWALDEEDPF RPEGGESVEDVASRLAKA+LQ+ESQFQGCAILVVSHGDPLQIFQTV+GAA  ENGS SDEL+SR
Subjt:  KMMEDLRERYFGPSFELLSHDKYAEIWALDEEDPFKRPEGGESVEDVASRLAKAILQIESQFQGCAILVVSHGDPLQIFQTVVGAAKQENGSNSDELSSR

Query:  LQAVITKPILSQHRKFALLTGELRPVV
        LQA ITKPILSQHRKFALLTGELR VV
Subjt:  LQAVITKPILSQHRKFALLTGELRPVV

XP_023532426.1 uncharacterized protein LOC111794603 [Cucurbita pepo subsp. pepo]5.3e-11692.95Show/hide
Query:  MATASFLSNRYWILRHGKSIPNEKGLIVSSIENGTLPEYQLASEGVGQAQLAGEQFLKELKENSIPLENVQICYSPFSRTIHTARVAASAMNLPFEGPQC
        MATA FL NRYWILRHGKSIPNEKGLIVSSIENGTLPEYQLASEGVGQAQLAGEQFLKELKENSIPLENV+ICYSPFSRTIHTA+VAASA+NLPFE PQC
Subjt:  MATASFLSNRYWILRHGKSIPNEKGLIVSSIENGTLPEYQLASEGVGQAQLAGEQFLKELKENSIPLENVQICYSPFSRTIHTARVAASAMNLPFEGPQC

Query:  KMMEDLRERYFGPSFELLSHDKYAEIWALDEEDPFKRPEGGESVEDVASRLAKAILQIESQFQGCAILVVSHGDPLQIFQTVVGAAKQENGSNSDELSSR
        KMMEDLRERYFGPSFELLSHDKYAEIWALDEEDPFKRPEGGESVEDVASRLAKA+LQ+ESQFQGCA+LVVSHGDPLQIFQTV+GAA  ENGS+SDEL+SR
Subjt:  KMMEDLRERYFGPSFELLSHDKYAEIWALDEEDPFKRPEGGESVEDVASRLAKAILQIESQFQGCAILVVSHGDPLQIFQTVVGAAKQENGSNSDELSSR

Query:  LQAVITKPILSQHRKFALLTGELRPVV
        LQA ITKPILSQHRKFALLTGELR VV
Subjt:  LQAVITKPILSQHRKFALLTGELRPVV

TrEMBL top hitse value%identityAlignment
A0A0A0KQV4 Uncharacterized protein4.0e-10182.46Show/hide
Query:  MATASFLSNRYWILRHGKSIPNEKGLIVSSIENGTLPEYQLASEGVGQAQLAGEQFLKELKENSIPLENVQICYSPFSRTIHTARVAASAMNLPFEGPQC
        MATASFL NRYWILRHGKSIPNEKGLIVSS ENG LPEYQLA EGV QA+LAG QFLKELKENSI LENV+ICYSPFSRTIHTA+VAAS +NLPFEGPQC
Subjt:  MATASFLSNRYWILRHGKSIPNEKGLIVSSIENGTLPEYQLASEGVGQAQLAGEQFLKELKENSIPLENVQICYSPFSRTIHTARVAASAMNLPFEGPQC

Query:  KMMEDLRERYFGPSFELLSHDKYAEIWALDEEDPFKRPEGGESVEDVASRLAKAILQIESQFQGCAILVVSHGDPLQIFQTVVGAAKQENGSN-SDELSS
        KM+E+LRERYFGPSFELLSHDKY EIWALDEED FKRPEGGESVEDVASRLAKAIL+IES FQGCAILVVSHGDPLQI Q ++G+  +++GS  S++LSS
Subjt:  KMMEDLRERYFGPSFELLSHDKYAEIWALDEEDPFKRPEGGESVEDVASRLAKAILQIESQFQGCAILVVSHGDPLQIFQTVVGAAKQENGSN-SDELSS

Query:  RLQAVITKPILSQHRKFALLTGELRPVV
         L+A++TKPILS HR+FALLTGELRP++
Subjt:  RLQAVITKPILSQHRKFALLTGELRPVV

A0A6J1CFM2 uncharacterized protein LOC1110110893.8e-11289.91Show/hide
Query:  MATASFLSNRYWILRHGKSIPNEKGLIVSSIENGTLPEYQLASEGVGQAQLAGEQFLKELKENSIPLENVQICYSPFSRTIHTARVAASAMNLPFEGPQC
        M TASFL NRYW+LRHGKSIPNEKGLIVSSIENGTLPEYQLA+EGVGQAQLAGEQFLKELKENSI LENV+ICYSPFSRTIHTA+VAASA+N+PFEGPQC
Subjt:  MATASFLSNRYWILRHGKSIPNEKGLIVSSIENGTLPEYQLASEGVGQAQLAGEQFLKELKENSIPLENVQICYSPFSRTIHTARVAASAMNLPFEGPQC

Query:  KMMEDLRERYFGPSFELLSHDKYAEIWALDEEDPFKRP-EGGESVEDVASRLAKAILQIESQFQGCAILVVSHGDPLQIFQTVVGAAKQENGSNSDELSS
        KM+EDLRERYFGPSFEL+SHDKYA+IWALDEEDPFKRP EGGESVEDVASRLAKAILQIESQFQGCAIL+VSHGDPLQIFQTVVGAAKQE+ S+SDEL S
Subjt:  KMMEDLRERYFGPSFELLSHDKYAEIWALDEEDPFKRP-EGGESVEDVASRLAKAILQIESQFQGCAILVVSHGDPLQIFQTVVGAAKQENGSNSDELSS

Query:  RLQAVITKPILSQHRKFALLTGELRPVV
        +LQAVITK +LSQHRKFALLTGELR V+
Subjt:  RLQAVITKPILSQHRKFALLTGELRPVV

A0A6J1G864 uncharacterized protein LOC1114517604.5e-10585.46Show/hide
Query:  MATASFLSNRYWILRHGKSIPNEKGLIVSSIENGTLPEYQLASEGVGQAQLAGEQFLKELKENSIPLENVQICYSPFSRTIHTARVAASAMNLPFEGPQC
        MA A FL NRYWILRHGKSIPNEKGLIVSSIENGTLPEYQL SEGV QAQLAGEQFLKELKEN IPLENV+ICYSPFSRTIHTA+ AASA+NLPFE PQC
Subjt:  MATASFLSNRYWILRHGKSIPNEKGLIVSSIENGTLPEYQLASEGVGQAQLAGEQFLKELKENSIPLENVQICYSPFSRTIHTARVAASAMNLPFEGPQC

Query:  KMMEDLRERYFGPSFELLSHDKYAEIWALDEEDPFKRPEGGESVEDVASRLAKAILQIESQFQGCAILVVSHGDPLQIFQTVVGAAKQENGSNSDELSSR
        KMMEDLRERYFGPS E LS  K  E+ A+DEEDPFKRPEGGESVEDVASRLAK +LQ+ESQFQGCAILV+SHGDPLQI QTV+GAA  ENGS SDEL+SR
Subjt:  KMMEDLRERYFGPSFELLSHDKYAEIWALDEEDPFKRPEGGESVEDVASRLAKAILQIESQFQGCAILVVSHGDPLQIFQTVVGAAKQENGSNSDELSSR

Query:  LQAVITKPILSQHRKFALLTGELRPVV
        LQA ITKPILSQHRKF+LLTGELR VV
Subjt:  LQAVITKPILSQHRKFALLTGELRPVV

A0A6J1G8R3 uncharacterized protein LOC111451758 isoform X14.8e-11592.07Show/hide
Query:  MATASFLSNRYWILRHGKSIPNEKGLIVSSIENGTLPEYQLASEGVGQAQLAGEQFLKELKENSIPLENVQICYSPFSRTIHTARVAASAMNLPFEGPQC
        MATA FL NRYWILRHGKSIPNEKGLIVSSIENGTLPEYQLASEGVGQAQLAGEQFLKELKENSIPLENV+ICYSPFSRTIHTA+VAASA+NLPFE P C
Subjt:  MATASFLSNRYWILRHGKSIPNEKGLIVSSIENGTLPEYQLASEGVGQAQLAGEQFLKELKENSIPLENVQICYSPFSRTIHTARVAASAMNLPFEGPQC

Query:  KMMEDLRERYFGPSFELLSHDKYAEIWALDEEDPFKRPEGGESVEDVASRLAKAILQIESQFQGCAILVVSHGDPLQIFQTVVGAAKQENGSNSDELSSR
        KMM+DLRERYFGPSFELLSHDKYAEIWALDEEDPFKRPEGGESVEDVASRLAKA+LQ+ESQFQGCA+LVVSHGDPLQIFQTV+GAA  ENGS SDEL+SR
Subjt:  KMMEDLRERYFGPSFELLSHDKYAEIWALDEEDPFKRPEGGESVEDVASRLAKAILQIESQFQGCAILVVSHGDPLQIFQTVVGAAKQENGSNSDELSSR

Query:  LQAVITKPILSQHRKFALLTGELRPVV
        LQA ITKPILSQHRKFALLTGELR VV
Subjt:  LQAVITKPILSQHRKFALLTGELRPVV

A0A6J1I4S5 uncharacterized protein LOC1114698673.7e-11592.51Show/hide
Query:  MATASFLSNRYWILRHGKSIPNEKGLIVSSIENGTLPEYQLASEGVGQAQLAGEQFLKELKENSIPLENVQICYSPFSRTIHTARVAASAMNLPFEGPQC
        MATA FL NRYWILRHGKSIPNEKGLIVSSIENGTLPEYQLASEGVGQAQLAGEQFLKELKENSIPLENV+ICYSPFSRTIHTA+V ASA+NLPFE PQC
Subjt:  MATASFLSNRYWILRHGKSIPNEKGLIVSSIENGTLPEYQLASEGVGQAQLAGEQFLKELKENSIPLENVQICYSPFSRTIHTARVAASAMNLPFEGPQC

Query:  KMMEDLRERYFGPSFELLSHDKYAEIWALDEEDPFKRPEGGESVEDVASRLAKAILQIESQFQGCAILVVSHGDPLQIFQTVVGAAKQENGSNSDELSSR
        KMMEDLRERYFGPSFELLSHDKYAEIWALDEEDPF RPEGGESVEDVASRLAKA+LQ+ESQFQGCAILVVSHGDPLQIFQTV+GAA  ENGS SDEL+SR
Subjt:  KMMEDLRERYFGPSFELLSHDKYAEIWALDEEDPFKRPEGGESVEDVASRLAKAILQIESQFQGCAILVVSHGDPLQIFQTVVGAAKQENGSNSDELSSR

Query:  LQAVITKPILSQHRKFALLTGELRPVV
        LQA ITKPILSQHRKFALLTGELR VV
Subjt:  LQAVITKPILSQHRKFALLTGELRPVV

SwissProt top hitse value%identityAlignment
F4KI56 Metal-independent phosphoserine phosphatase3.3e-0427.06Show/hide
Query:  ILRHGKSIPNEKGLIVSSIENGTLPEYQLASEGVGQAQLAGEQFLKELKENSIPLENVQICYSPFSRTIHTARVAASAMNLPFEGPQCKMMEDLRERYFG
        ++RHG++  N  G I   IE+       L   G+ QA    E+  KE +        V +  S   R   TA + A         P+   + DL+ER+ G
Subjt:  ILRHGKSIPNEKGLIVSSIENGTLPEYQLASEGVGQAQLAGEQFLKELKENSIPLENVQICYSPFSRTIHTARVAASAMNLPFEGPQCKMMEDLRERYFG

Query:  PSFELL---SHDKYAEIWA--LDEEDPFKRPEGGESVEDVASRLAKAILQIESQFQGCAILVVSHGDPLQ
            L      +K  E ++     ++  + P GGES + +A R   A+ QI  + +G  ++VV+HG  L+
Subjt:  PSFELL---SHDKYAEIWA--LDEEDPFKRPEGGESVEDVASRLAKAILQIESQFQGCAILVVSHGDPLQ

Arabidopsis top hitse value%identityAlignment
AT4G38370.1 Phosphoglycerate mutase family protein3.2e-8768.64Show/hide
Query:  SNRYWILRHGKSIPNEKGLIVSSIENGTLPEYQLASEGVGQAQLAGEQFLKELKENSIPLENVQICYSPFSRTIHTARVAASAMNLPFEGPQCKMMEDLR
        +NRYW+LRHGKSIPNE+GL+VSS+ENG LPEYQLA +GV QA+LAGE FL++LKE++I L+ V+ICYSPFSRT HTARV A  +NLPF+ PQCKMMEDLR
Subjt:  SNRYWILRHGKSIPNEKGLIVSSIENGTLPEYQLASEGVGQAQLAGEQFLKELKENSIPLENVQICYSPFSRTIHTARVAASAMNLPFEGPQCKMMEDLR

Query:  ERYFGPSFELLSHDKYAEIWALDEEDPFKRPEGGESVEDVASRLAKAILQIESQFQGCAILVVSHGDPLQIFQTVVGAAKQENGSNSDELSSRLQAVITK
        ERYFGP+FEL SHDKY EIWALDE+DPF  PEGGES +DV SRLA A+  +E+++Q CAILVVSHGDPLQ+ Q V  +AKQ+ G   D L+ + Q     
Subjt:  ERYFGPSFELLSHDKYAEIWALDEEDPFKRPEGGESVEDVASRLAKAILQIESQFQGCAILVVSHGDPLQIFQTVVGAAKQENGSNSDELSSRLQAVITK

Query:  PILSQHRKFALLTGELRPVV
         +LSQHRKFALLTGELRP++
Subjt:  PILSQHRKFALLTGELRPVV

AT5G04120.1 Phosphoglycerate mutase family protein2.4e-0527.06Show/hide
Query:  ILRHGKSIPNEKGLIVSSIENGTLPEYQLASEGVGQAQLAGEQFLKELKENSIPLENVQICYSPFSRTIHTARVAASAMNLPFEGPQCKMMEDLRERYFG
        ++RHG++  N  G I   IE+       L   G+ QA    E+  KE +        V +  S   R   TA + A         P+   + DL+ER+ G
Subjt:  ILRHGKSIPNEKGLIVSSIENGTLPEYQLASEGVGQAQLAGEQFLKELKENSIPLENVQICYSPFSRTIHTARVAASAMNLPFEGPQCKMMEDLRERYFG

Query:  PSFELL---SHDKYAEIWA--LDEEDPFKRPEGGESVEDVASRLAKAILQIESQFQGCAILVVSHGDPLQ
            L      +K  E ++     ++  + P GGES + +A R   A+ QI  + +G  ++VV+HG  L+
Subjt:  PSFELL---SHDKYAEIWA--LDEEDPFKRPEGGESVEDVASRLAKAILQIESQFQGCAILVVSHGDPLQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAACGGCGTCGTTTTTAAGCAACAGATACTGGATTCTCAGGCATGGCAAGAGCATTCCTAATGAGAAGGGCCTCATTGTTTCTTCTATAGAAAATGGTACCCTCCC
TGAGTATCAACTGGCCTCTGAGGGTGTTGGACAGGCGCAGTTGGCTGGAGAACAGTTCTTAAAGGAGTTGAAGGAAAATTCCATACCACTCGAAAATGTTCAGATTTGCT
ATTCACCATTCTCAAGAACTATCCATACAGCTAGAGTGGCTGCATCTGCAATGAATCTTCCATTTGAAGGCCCTCAGTGTAAGATGATGGAAGATCTCAGGGAACGCTAC
TTTGGTCCTTCATTTGAGCTCTTGTCTCATGATAAATATGCAGAAATCTGGGCTCTTGATGAGGAAGATCCATTCAAGCGGCCTGAAGGTGGAGAAAGCGTTGAAGATGT
TGCTTCAAGACTTGCCAAAGCAATTCTTCAAATAGAGTCGCAATTTCAAGGGTGTGCCATCTTGGTGGTCAGCCATGGGGACCCCCTGCAAATTTTTCAGACAGTGGTCG
GAGCAGCCAAGCAAGAAAATGGATCAAATTCAGATGAATTGTCATCAAGATTGCAAGCTGTCATCACCAAGCCCATTCTCTCGCAGCACCGGAAATTTGCCCTGCTCACC
GGAGAGCTTCGACCTGTCGTTTGA
mRNA sequenceShow/hide mRNA sequence
TGGATGAATGGGACTGTTTGGGTGATCGAATTTTTTTTGAACACCCCTATTCAAAAGCTTGTACAACAAAATCCCAATAATGTATGTACATAAAATTCACAGAAGCCATT
GAAAAATACTGGTGTAAACGACTGAAGAGCTTCAGGGAGGGGAGTTTGAAGAATGGCAACGGCGTCGTTTTTAAGCAACAGATACTGGATTCTCAGGCATGGCAAGAGCA
TTCCTAATGAGAAGGGCCTCATTGTTTCTTCTATAGAAAATGGTACCCTCCCTGAGTATCAACTGGCCTCTGAGGGTGTTGGACAGGCGCAGTTGGCTGGAGAACAGTTC
TTAAAGGAGTTGAAGGAAAATTCCATACCACTCGAAAATGTTCAGATTTGCTATTCACCATTCTCAAGAACTATCCATACAGCTAGAGTGGCTGCATCTGCAATGAATCT
TCCATTTGAAGGCCCTCAGTGTAAGATGATGGAAGATCTCAGGGAACGCTACTTTGGTCCTTCATTTGAGCTCTTGTCTCATGATAAATATGCAGAAATCTGGGCTCTTG
ATGAGGAAGATCCATTCAAGCGGCCTGAAGGTGGAGAAAGCGTTGAAGATGTTGCTTCAAGACTTGCCAAAGCAATTCTTCAAATAGAGTCGCAATTTCAAGGGTGTGCC
ATCTTGGTGGTCAGCCATGGGGACCCCCTGCAAATTTTTCAGACAGTGGTCGGAGCAGCCAAGCAAGAAAATGGATCAAATTCAGATGAATTGTCATCAAGATTGCAAGC
TGTCATCACCAAGCCCATTCTCTCGCAGCACCGGAAATTTGCCCTGCTCACCGGAGAGCTTCGACCTGTCGTTTGAATTCCGGTATGTTCTCGCCGGACTTATTGAAGAA
GATGAACTAAAAAATCGTTTGTTCTTCTTCTTCTTTTTACTTCGGAAATAATTATTATTTTTCTTTTATATACATATATAAAAGAATGATGATATAATAATTAATATATG
CTAACTATAATCTTAGGGGTTGTTTGGTCCCCTGTCTTGGAATAAACCCCTGGAGGTTGTTCTTGTTCTAAAACC
Protein sequenceShow/hide protein sequence
MATASFLSNRYWILRHGKSIPNEKGLIVSSIENGTLPEYQLASEGVGQAQLAGEQFLKELKENSIPLENVQICYSPFSRTIHTARVAASAMNLPFEGPQCKMMEDLRERY
FGPSFELLSHDKYAEIWALDEEDPFKRPEGGESVEDVASRLAKAILQIESQFQGCAILVVSHGDPLQIFQTVVGAAKQENGSNSDELSSRLQAVITKPILSQHRKFALLT
GELRPVV