CuGenDBv2

Tan0010695 (gene) of Snake gourd v1 genome

Gene ID: Tan0010695
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Unknown protein
Genome location: LG09:68455728..68456862
RNA-Seq Expression: Tan0010695
Synteny: Tan0010695
Gene Ontology terms: NA
InterPro domains: NA
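The genome location above packs a linkage group and a coordinate range into one string. The following is a minimal sketch of how it could be split and its span computed. It assumes the coordinates are 1-based and inclusive, which this page does not state; `parse_location` and `span_length` are hypothetical helper names, not part of CuGenDB.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'SEQID:START..END' string into its three parts."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("LG09:68455728..68456862")
print(seqid, start, end, span_length(start, end))
# prints: LG09 68455728 68456862 1135
```

Under the inclusive-coordinate assumption the gene spans 1135 bp on LG09.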


Homology
GenBank top hits (e value, % identity, alignment)
KAG6589708.1 hypothetical protein SDJN03_15131, partial [Cucurbita argyrosperma subsp. sororia]; e value: 3.0e-118; % identity: 85.77
Query:  MDDTSESNPLTSQH-------DDDDEDEAEESLSFSDLPLDKEKSDDQTLESFRKNPRRSSSEPLDLFEFFTTGFITSEISPAEDLIFCGRLLPLNDHSP
        MDDTSES PLTSQH       +D+DE+EAEE+LSFSDLPLDK+KSD  T ESFRKNPRRSSSEPLDLFEFF+ GFITSEISPAEDLIFCGRLLPLNDHS 
Subjt:  MDDTSESNPLTSQH-------DDDDEDEAEESLSFSDLPLDKEKSDDQTLESFRKNPRRSSSEPLDLFEFFTTGFITSEISPAEDLIFCGRLLPLNDHSP

Query:  LTSRT--SSEKNFWKDESRKQTGFRKRSESLSGLQSSVSRSNSAKINLKRNSRSLDYRKLYRQANSIFSPAAEIDRNCSIKSGLKPDPQNRKASSKPRWY
        LT+ T  ++EK+FWKDESRKQT FRKRSESLSGLQSSVSRSN+AKINLKRNSRSLDYRKLYRQ NSIFSP AEIDRNCSIK+GLKPDP N+KASSKPRWY
Subjt:  LTSRT--SSEKNFWKDESRKQTGFRKRSESLSGLQSSVSRSNSAKINLKRNSRSLDYRKLYRQANSIFSPAAEIDRNCSIKSGLKPDPQNRKASSKPRWY

Query:  LLMFGMVKFPAEMELSDIKSRQVRRSSSALFPANESKGKFQC-NRSSGEAAWRILRALSCKNHASVDVTASLTA
        LLMFGMVKFPAEM+LSDIKSRQVRRSSSALFPANESKGK+ C NRSSGEA WRILRALSCKN+ASVDVTASLTA
Subjt:  LLMFGMVKFPAEMELSDIKSRQVRRSSSALFPANESKGKFQC-NRSSGEAAWRILRALSCKNHASVDVTASLTA

KAG7023388.1 hypothetical protein SDJN02_14413 [Cucurbita argyrosperma subsp. argyrosperma]; e value: 1.2e-119; % identity: 85.87
Query:  MFMDDTSESNPLTSQH-------DDDDEDEAEESLSFSDLPLDKEKSDDQTLESFRKNPRRSSSEPLDLFEFFTTGFITSEISPAEDLIFCGRLLPLNDH
        MFMDDTSES PLTSQH       +D+DE+EAEE+LSFSDLPLDK+KSD  T ESFRKNPRRSSSEPLDLFEFF+ GFITSEISPAEDLIFCGRLLPLNDH
Subjt:  MFMDDTSESNPLTSQH-------DDDDEDEAEESLSFSDLPLDKEKSDDQTLESFRKNPRRSSSEPLDLFEFFTTGFITSEISPAEDLIFCGRLLPLNDH

Query:  SPLTSRT--SSEKNFWKDESRKQTGFRKRSESLSGLQSSVSRSNSAKINLKRNSRSLDYRKLYRQANSIFSPAAEIDRNCSIKSGLKPDPQNRKASSKPR
        S LT+ T  ++EK+FWKDESRKQT FRKRSESLSGLQSSVSRSN+AKINLKRNSRSLDYRKLYRQ NSIFSP AEIDRNCSIK+GLKPDP N+KASSKPR
Subjt:  SPLTSRT--SSEKNFWKDESRKQTGFRKRSESLSGLQSSVSRSNSAKINLKRNSRSLDYRKLYRQANSIFSPAAEIDRNCSIKSGLKPDPQNRKASSKPR

Query:  WYLLMFGMVKFPAEMELSDIKSRQVRRSSSALFPANESKGKFQC-NRSSGEAAWRILRALSCKNHASVDVTASLTA
        WYLLMFGMVKFPAEM+LSDIKSRQVRRSSSALFPANESKGK+ C NRSSGEA WRILRALSCKN+ASVDVTASLTA
Subjt:  WYLLMFGMVKFPAEMELSDIKSRQVRRSSSALFPANESKGKFQC-NRSSGEAAWRILRALSCKNHASVDVTASLTA

XP_022921809.1 uncharacterized protein LOC111429953 [Cucurbita moschata]; e value: 2.7e-119; % identity: 85.87
Query:  MFMDDTSESNPLTSQH-------DDDDEDEAEESLSFSDLPLDKEKSDDQTLESFRKNPRRSSSEPLDLFEFFTTGFITSEISPAEDLIFCGRLLPLNDH
        MFMDDTSES PLTSQH       +D DE+EAEE+LSFSDLPLDK+KSD  T ESFRKNPRRSSSEPLDLFEFF+ GFITSEISPAEDLIFCGRLLPLNDH
Subjt:  MFMDDTSESNPLTSQH-------DDDDEDEAEESLSFSDLPLDKEKSDDQTLESFRKNPRRSSSEPLDLFEFFTTGFITSEISPAEDLIFCGRLLPLNDH

Query:  SPLTSRT--SSEKNFWKDESRKQTGFRKRSESLSGLQSSVSRSNSAKINLKRNSRSLDYRKLYRQANSIFSPAAEIDRNCSIKSGLKPDPQNRKASSKPR
        S LT+ T  ++EK+FWKDESRKQT FRKRSESLSGLQSSVSRSN+AKINLKRNSRSLDYRKLYRQ NSIFSP AEIDRNCSIK+GLKPDP N+KASSKPR
Subjt:  SPLTSRT--SSEKNFWKDESRKQTGFRKRSESLSGLQSSVSRSNSAKINLKRNSRSLDYRKLYRQANSIFSPAAEIDRNCSIKSGLKPDPQNRKASSKPR

Query:  WYLLMFGMVKFPAEMELSDIKSRQVRRSSSALFPANESKGKFQC-NRSSGEAAWRILRALSCKNHASVDVTASLTA
        WYLLMFGMVKFPAEM+LSDIKSRQVRRSSSALFPANESKGK+ C NRSSGEA WRILRALSCKN+ASVDVTASLTA
Subjt:  WYLLMFGMVKFPAEMELSDIKSRQVRRSSSALFPANESKGKFQC-NRSSGEAAWRILRALSCKNHASVDVTASLTA

XP_022987522.1 uncharacterized protein LOC111485063 [Cucurbita maxima]; e value: 9.6e-117; % identity: 84.78
Query:  MFMDDTSESNPLTSQH-------DDDDEDEAEESLSFSDLPLDKEKSDDQTLESFRKNPRRSSSEPLDLFEFFTTGFITSEISPAEDLIFCGRLLPLNDH
        MFMDDTSES PLTSQH       +D  E+EAEE+LSFSDLPLDK+KSD  T ESFRKNPRRSSSEPLDLFEFF+ GFITSEISPAEDLIFCGRLLPLND 
Subjt:  MFMDDTSESNPLTSQH-------DDDDEDEAEESLSFSDLPLDKEKSDDQTLESFRKNPRRSSSEPLDLFEFFTTGFITSEISPAEDLIFCGRLLPLNDH

Query:  SPLTSRT--SSEKNFWKDESRKQTGFRKRSESLSGLQSSVSRSNSAKINLKRNSRSLDYRKLYRQANSIFSPAAEIDRNCSIKSGLKPDPQNRKASSKPR
        S LT+ T  +++K+FWKDESRKQT FRKRSESLSGLQSSVSRSN+AKINLKRNSRSLDYRKLYRQANSIFSP AEIDRNCSIK+GLKPDP N+KASSKPR
Subjt:  SPLTSRT--SSEKNFWKDESRKQTGFRKRSESLSGLQSSVSRSNSAKINLKRNSRSLDYRKLYRQANSIFSPAAEIDRNCSIKSGLKPDPQNRKASSKPR

Query:  WYLLMFGMVKFPAEMELSDIKSRQVRRSSSALFPANESKGKFQC-NRSSGEAAWRILRALSCKNHASVDVTASLTA
        WYLLMFGMVKFPAEM+LSDIKSRQVRRSSSALFPA+ESKGK+ C NRSSGEA WRILRALSCKN+ASVDVTASLTA
Subjt:  WYLLMFGMVKFPAEMELSDIKSRQVRRSSSALFPANESKGKFQC-NRSSGEAAWRILRALSCKNHASVDVTASLTA

XP_023516073.1 uncharacterized protein LOC111780044 [Cucurbita pepo subsp. pepo]; e value: 8.7e-118; % identity: 84.84
Query:  MFMDDTSESNPLTSQH-------DDDDEDEAEESLSFSDLPLDKEKSDDQTLESFRKNPRRSSSEPLDLFEFFTTGFITSEISPAEDLIFCGRLLPLNDH
        MFMDDTSES PLTSQH       +D  E+EAEE+LSFSDLPLDK+KSD  T ESFRKNPRRSSSEPLDLFEFF+ GFITSEISPAEDLIFCGRLLPLNDH
Subjt:  MFMDDTSESNPLTSQH-------DDDDEDEAEESLSFSDLPLDKEKSDDQTLESFRKNPRRSSSEPLDLFEFFTTGFITSEISPAEDLIFCGRLLPLNDH

Query:  SPLTSRT---SSEKNFWKDESRKQTGFRKRSESLSGLQSSVSRSNSAKINLKRNSRSLDYRKLYRQANSIFSPAAEIDRNCSIKSGLKPDPQNRKASSKP
        S LT+ T   ++EK+FWKDESRKQ  FRKRSESLSGLQSSVSRSN+AKINLKRNSRSLDYRKLYRQ NSIFSP AEIDRNCSIK+GLKPDP N+KASSKP
Subjt:  SPLTSRT---SSEKNFWKDESRKQTGFRKRSESLSGLQSSVSRSNSAKINLKRNSRSLDYRKLYRQANSIFSPAAEIDRNCSIKSGLKPDPQNRKASSKP

Query:  RWYLLMFGMVKFPAEMELSDIKSRQVRRSSSALFPANESKGKFQC-NRSSGEAAWRILRALSCKNHASVDVTASLTA
        RWYLLMFGMVKFPAEM+LSDIKSRQVRRSSSALFPANESKGK+ C NRSSGEA WRILRALSCKN+ASVDVTASLTA
Subjt:  RWYLLMFGMVKFPAEMELSDIKSRQVRRSSSALFPANESKGKFQC-NRSSGEAAWRILRALSCKNHASVDVTASLTA

TrEMBL top hits (e value, % identity, alignment)
A0A1S3BW36 uncharacterized protein LOC103494265; e value: 7.7e-112; % identity: 83.27
Query:  MDDTSESNPLTS---QHDDDDEDEAEESLSFSDLPLDKEKSDDQT-LESFRKNPRRSSSEPLDLFEFFTTGFITSEISPAEDLIFCGRLLPLNDHSPLTS
        MDDTSESNPLTS   QH+DDD+ E++ESLSFSDLP+D+E SD  T  +SFRKNPRRSSSEPLDLFEFF+ GFITSEISPAEDLIFCGRLLPLNDHS   +
Subjt:  MDDTSESNPLTS---QHDDDDEDEAEESLSFSDLPLDKEKSDDQT-LESFRKNPRRSSSEPLDLFEFFTTGFITSEISPAEDLIFCGRLLPLNDHSPLTS

Query:  RTSSEKNFWKDE-SRKQTGFRKRSESLSGLQSSVSRSNSAKINLKRNSRSLDYRKLYRQANSIFSPAAEIDRNCSIKSGLKPDPQNRKASSKPRWYLLMF
        R +++K+FWK+E SRKQT FRKRSESLSGLQSSVSRSNSAK NLKRNSRSLDYR+LYRQANSIFSP AEIDRNCSIK+GLKPD  N+K SSKPRWYLLMF
Subjt:  RTSSEKNFWKDE-SRKQTGFRKRSESLSGLQSSVSRSNSAKINLKRNSRSLDYRKLYRQANSIFSPAAEIDRNCSIKSGLKPDPQNRKASSKPRWYLLMF

Query:  GMVKFPAEMELSDIKSRQVRRSSSALFPANESKGKFQCNRSSGEAAWRILRALSCKNHASVDVTASLTA
        GMVKFPAEMELSDIKSRQVRRSSS LFP+NE+K KF C RSSGEA WRILRALSCKNHASVDVTASLTA
Subjt:  GMVKFPAEMELSDIKSRQVRRSSSALFPANESKGKFQCNRSSGEAAWRILRALSCKNHASVDVTASLTA

A0A5A7UVS6 Uncharacterized protein; e value: 7.7e-112; % identity: 83.27
Query:  MDDTSESNPLTS---QHDDDDEDEAEESLSFSDLPLDKEKSDDQT-LESFRKNPRRSSSEPLDLFEFFTTGFITSEISPAEDLIFCGRLLPLNDHSPLTS
        MDDTSESNPLTS   QH+DDD+ E++ESLSFSDLP+D+E SD  T  +SFRKNPRRSSSEPLDLFEFF+ GFITSEISPAEDLIFCGRLLPLNDHS   +
Subjt:  MDDTSESNPLTS---QHDDDDEDEAEESLSFSDLPLDKEKSDDQT-LESFRKNPRRSSSEPLDLFEFFTTGFITSEISPAEDLIFCGRLLPLNDHSPLTS

Query:  RTSSEKNFWKDE-SRKQTGFRKRSESLSGLQSSVSRSNSAKINLKRNSRSLDYRKLYRQANSIFSPAAEIDRNCSIKSGLKPDPQNRKASSKPRWYLLMF
        R +++K+FWK+E SRKQT FRKRSESLSGLQSSVSRSNSAK NLKRNSRSLDYR+LYRQANSIFSP AEIDRNCSIK+GLKPD  N+K SSKPRWYLLMF
Subjt:  RTSSEKNFWKDE-SRKQTGFRKRSESLSGLQSSVSRSNSAKINLKRNSRSLDYRKLYRQANSIFSPAAEIDRNCSIKSGLKPDPQNRKASSKPRWYLLMF

Query:  GMVKFPAEMELSDIKSRQVRRSSSALFPANESKGKFQCNRSSGEAAWRILRALSCKNHASVDVTASLTA
        GMVKFPAEMELSDIKSRQVRRSSS LFP+NE+K KF C RSSGEA WRILRALSCKNHASVDVTASLTA
Subjt:  GMVKFPAEMELSDIKSRQVRRSSSALFPANESKGKFQCNRSSGEAAWRILRALSCKNHASVDVTASLTA

A0A5D3CL04 Uncharacterized protein; e value: 4.2e-110; % identity: 82.9
Query:  MDDTSESNPLTS---QHDDDDEDEAEESLSFSDLPLDKEKSDDQT-LESFRKNPRRSSSEPLDLFEFFTTGFITSEISPAEDLIFCGRLLPLNDHSPLTS
        MDDTSESNPLTS   QH+DDD+ E++ESLSFSDLP+D+E SD  T  +SFRKNPRRSSSEPLDLFEFF+ GFITSEISPAEDLIFCGRLLPLNDHS   +
Subjt:  MDDTSESNPLTS---QHDDDDEDEAEESLSFSDLPLDKEKSDDQT-LESFRKNPRRSSSEPLDLFEFFTTGFITSEISPAEDLIFCGRLLPLNDHSPLTS

Query:  RTSSEKNFWKDE-SRKQTGFRKRSESLSGLQSSVSRSNSAKINLKRNSRSLDYRKLYRQANSIFSPAAEIDRNCSIKSGLKPDPQNRKASSKPRWYLLMF
        R +++K+FWK+E SRKQT FRKRSESLSGLQSSVSRSNSAK NLKRNSRSLDYR+LYRQANSIFSP AEIDRNCSIK+GLKPD  N+K +SKPRWYLLMF
Subjt:  RTSSEKNFWKDE-SRKQTGFRKRSESLSGLQSSVSRSNSAKINLKRNSRSLDYRKLYRQANSIFSPAAEIDRNCSIKSGLKPDPQNRKASSKPRWYLLMF

Query:  GMVKFPAEMELSDIKSRQVRRSSSALFPANESKGKFQCNRSSGEAAWRILRALSCKNHASVDVTASLTA
        GMVKFPAEMELSDIKSRQVRRSSS LFP+NE+K KF C RSSGEA WRILRALSCKNHASVDVTASLTA
Subjt:  GMVKFPAEMELSDIKSRQVRRSSSALFPANESKGKFQCNRSSGEAAWRILRALSCKNHASVDVTASLTA

A0A6J1E1L2 uncharacterized protein LOC111429953; e value: 1.3e-119; % identity: 85.87
Query:  MFMDDTSESNPLTSQH-------DDDDEDEAEESLSFSDLPLDKEKSDDQTLESFRKNPRRSSSEPLDLFEFFTTGFITSEISPAEDLIFCGRLLPLNDH
        MFMDDTSES PLTSQH       +D DE+EAEE+LSFSDLPLDK+KSD  T ESFRKNPRRSSSEPLDLFEFF+ GFITSEISPAEDLIFCGRLLPLNDH
Subjt:  MFMDDTSESNPLTSQH-------DDDDEDEAEESLSFSDLPLDKEKSDDQTLESFRKNPRRSSSEPLDLFEFFTTGFITSEISPAEDLIFCGRLLPLNDH

Query:  SPLTSRT--SSEKNFWKDESRKQTGFRKRSESLSGLQSSVSRSNSAKINLKRNSRSLDYRKLYRQANSIFSPAAEIDRNCSIKSGLKPDPQNRKASSKPR
        S LT+ T  ++EK+FWKDESRKQT FRKRSESLSGLQSSVSRSN+AKINLKRNSRSLDYRKLYRQ NSIFSP AEIDRNCSIK+GLKPDP N+KASSKPR
Subjt:  SPLTSRT--SSEKNFWKDESRKQTGFRKRSESLSGLQSSVSRSNSAKINLKRNSRSLDYRKLYRQANSIFSPAAEIDRNCSIKSGLKPDPQNRKASSKPR

Query:  WYLLMFGMVKFPAEMELSDIKSRQVRRSSSALFPANESKGKFQC-NRSSGEAAWRILRALSCKNHASVDVTASLTA
        WYLLMFGMVKFPAEM+LSDIKSRQVRRSSSALFPANESKGK+ C NRSSGEA WRILRALSCKN+ASVDVTASLTA
Subjt:  WYLLMFGMVKFPAEMELSDIKSRQVRRSSSALFPANESKGKFQC-NRSSGEAAWRILRALSCKNHASVDVTASLTA

A0A6J1JAL3 uncharacterized protein LOC111485063; e value: 4.6e-117; % identity: 84.78
Query:  MFMDDTSESNPLTSQH-------DDDDEDEAEESLSFSDLPLDKEKSDDQTLESFRKNPRRSSSEPLDLFEFFTTGFITSEISPAEDLIFCGRLLPLNDH
        MFMDDTSES PLTSQH       +D  E+EAEE+LSFSDLPLDK+KSD  T ESFRKNPRRSSSEPLDLFEFF+ GFITSEISPAEDLIFCGRLLPLND 
Subjt:  MFMDDTSESNPLTSQH-------DDDDEDEAEESLSFSDLPLDKEKSDDQTLESFRKNPRRSSSEPLDLFEFFTTGFITSEISPAEDLIFCGRLLPLNDH

Query:  SPLTSRT--SSEKNFWKDESRKQTGFRKRSESLSGLQSSVSRSNSAKINLKRNSRSLDYRKLYRQANSIFSPAAEIDRNCSIKSGLKPDPQNRKASSKPR
        S LT+ T  +++K+FWKDESRKQT FRKRSESLSGLQSSVSRSN+AKINLKRNSRSLDYRKLYRQANSIFSP AEIDRNCSIK+GLKPDP N+KASSKPR
Subjt:  SPLTSRT--SSEKNFWKDESRKQTGFRKRSESLSGLQSSVSRSNSAKINLKRNSRSLDYRKLYRQANSIFSPAAEIDRNCSIKSGLKPDPQNRKASSKPR

Query:  WYLLMFGMVKFPAEMELSDIKSRQVRRSSSALFPANESKGKFQC-NRSSGEAAWRILRALSCKNHASVDVTASLTA
        WYLLMFGMVKFPAEM+LSDIKSRQVRRSSSALFPA+ESKGK+ C NRSSGEA WRILRALSCKN+ASVDVTASLTA
Subjt:  WYLLMFGMVKFPAEMELSDIKSRQVRRSSSALFPANESKGKFQC-NRSSGEAAWRILRALSCKNHASVDVTASLTA

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT4G30230.1 unknown protein; e value: 4.7e-29; % identity: 38.1
Query:  EDEAEESLSFSDLPLDKEKSDDQTLESFRKNPRRSSSEPLDLFEFFTTGFITSEISPAEDLIFCGRLLPLNDHSPLTSRTSSEKNFWKDESRKQTGFRKR
        E+E E++LS  DLPL K K+ + T     K P        +LFEF T+   + +++PAE++IF G+L+PLN           +  F+          R R
Subjt:  EDEAEESLSFSDLPLDKEKSDDQTLESFRKNPRRSSSEPLDLFEFFTTGFITSEISPAEDLIFCGRLLPLNDHSPLTSRTSSEKNFWKDESRKQTGFRKR

Query:  SESLSGLQS-SVSRSNSAKINLKRN------SRSLDYRKLYRQANSIFSPAAEIDRNCSIKSGLKPDPQNRKA--SSKPRWYLLMFGMVKFPAEMELSDI
        SESLS +Q   ++R  S  +  + N      SRSLDYRKL R   ++ SP    + + S K+  KP+  +  +  S +PRWY++MFGMVKFP E+EL DI
Subjt:  SESLSGLQS-SVSRSNSAKINLKRN------SRSLDYRKLYRQANSIFSPAAEIDRNCSIKSGLKPDPQNRKA--SSKPRWYLLMFGMVKFPAEMELSDI

Query:  KSRQVRRS-SSALFPANESKGKFQCNRSSGEAAWRILRALSCKNHASVDVTA
        KSRQ+RR+    +FP+  ++        S   +WR L ALSCK   SV  TA
Subjt:  KSRQVRRS-SSALFPANESKGKFQCNRSSGEAAWRILRALSCKNHASVDVTA
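
Each alignment above has an unlabelled middle line between Query and Subjt. In the usual BLAST pairwise layout, a letter on that line marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The sketch below shows how a percent identity could be derived from one alignment block under that convention. It assumes identity is counted over the full alignment length including gap columns; this page does not say how its % identity values were computed, and `percent_identity` is a hypothetical helper.

```python
def percent_identity(query_aln: str, midline: str) -> float:
    """Percent of alignment columns whose midline character is a letter.

    query_aln: the aligned query string (residues and '-' gap characters).
    midline:   the match line for the same columns; trailing spaces may
               have been stripped, so the query defines the column count.
    """
    columns = len(query_aln)
    if columns == 0:
        raise ValueError("empty alignment")
    identical = sum(1 for ch in midline[:columns] if ch.isalpha())
    return 100.0 * identical / columns


# Toy five-column alignment: three identities, one '+', one mismatch.
print(percent_identity("ACDEF", "AC+ F"))
# prints: 60.0
```

To score a whole hit, the Query segments and their middle lines would be concatenated across all blocks of that hit before calling the function.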


Sequences
CDS sequence
ATGTTCATGGACGACACCAGCGAATCCAATCCTCTAACCTCACAACACGACGACGATGACGAAGACGAAGCAGAGGAATCCCTCTCCTTCTCCGATCTTCCACTCGATAA
GGAAAAATCTGACGACCAAACTTTGGAAAGCTTCCGCAAGAATCCACGTAGATCCTCCTCCGAGCCTCTCGATCTCTTCGAATTCTTCACCACTGGATTCATCACCTCTG
AAATATCTCCGGCTGAGGATTTGATCTTCTGCGGCAGATTGCTTCCGCTCAACGATCACTCTCCGCTTACTTCGCGTACCAGCTCGGAGAAGAATTTCTGGAAGGATGAG
AGCCGAAAGCAGACTGGATTTCGGAAACGCTCTGAGTCATTGTCTGGATTGCAGAGCTCTGTTTCTCGATCGAACAGTGCAAAGATCAATCTCAAGCGAAACAGCCGATC
GCTCGATTACCGCAAGCTCTATCGTCAAGCGAATTCGATTTTCTCGCCGGCGGCCGAAATCGATCGTAATTGTTCGATCAAGAGCGGATTGAAGCCTGATCCGCAGAACA
GAAAGGCGTCGTCGAAGCCGCGGTGGTACTTGCTAATGTTCGGAATGGTGAAGTTCCCGGCGGAGATGGAACTCAGCGACATTAAGAGCAGACAAGTCCGCCGCAGTTCG
TCGGCACTTTTTCCGGCGAATGAGAGTAAAGGTAAATTCCAGTGCAACCGGAGCTCCGGCGAGGCGGCTTGGAGGATCCTTCGGGCGCTCAGCTGCAAGAACCACGCTAG
TGTAGATGTAACGGCGTCGTTAACTGCCTGA
mRNA sequence
CGTCGTCACAAAATTAAACTTGGCATAATTCCAAACACAGCCTTTTCCTTATCTTCCTTCCACAGAATGTTCATGGACGACACCAGCGAATCCAATCCTCTAACCTCACA
ACACGACGACGATGACGAAGACGAAGCAGAGGAATCCCTCTCCTTCTCCGATCTTCCACTCGATAAGGAAAAATCTGACGACCAAACTTTGGAAAGCTTCCGCAAGAATC
CACGTAGATCCTCCTCCGAGCCTCTCGATCTCTTCGAATTCTTCACCACTGGATTCATCACCTCTGAAATATCTCCGGCTGAGGATTTGATCTTCTGCGGCAGATTGCTT
CCGCTCAACGATCACTCTCCGCTTACTTCGCGTACCAGCTCGGAGAAGAATTTCTGGAAGGATGAGAGCCGAAAGCAGACTGGATTTCGGAAACGCTCTGAGTCATTGTC
TGGATTGCAGAGCTCTGTTTCTCGATCGAACAGTGCAAAGATCAATCTCAAGCGAAACAGCCGATCGCTCGATTACCGCAAGCTCTATCGTCAAGCGAATTCGATTTTCT
CGCCGGCGGCCGAAATCGATCGTAATTGTTCGATCAAGAGCGGATTGAAGCCTGATCCGCAGAACAGAAAGGCGTCGTCGAAGCCGCGGTGGTACTTGCTAATGTTCGGA
ATGGTGAAGTTCCCGGCGGAGATGGAACTCAGCGACATTAAGAGCAGACAAGTCCGCCGCAGTTCGTCGGCACTTTTTCCGGCGAATGAGAGTAAAGGTAAATTCCAGTG
CAACCGGAGCTCCGGCGAGGCGGCTTGGAGGATCCTTCGGGCGCTCAGCTGCAAGAACCACGCTAGTGTAGATGTAACGGCGTCGTTAACTGCCTGAATTGTGTATCACG
TGCATCGCACGTGACTGGCGGTTGCATTTGGTACTATATTGCCCTCCCGATGCTGCGAAATTCCAGTACCGCGAAAAGGTCAGTTTGGGAATTTAAAAATTAAAATGAAA
AATATATATATATAATTATGCTTCACGTCGAAGGGGAGGGTAGTGTACAGTGGCGAAGCGTGGCTGATGATGGTGCGATCTCTTTAAAATCTTTTTTATTTTAATTTTTA
CTTTTTATTGTTTTGAAAGAATGGTTTGGTTCGTT
Protein sequence
MFMDDTSESNPLTSQHDDDDEDEAEESLSFSDLPLDKEKSDDQTLESFRKNPRRSSSEPLDLFEFFTTGFITSEISPAEDLIFCGRLLPLNDHSPLTSRTSSEKNFWKDE
SRKQTGFRKRSESLSGLQSSVSRSNSAKINLKRNSRSLDYRKLYRQANSIFSPAAEIDRNCSIKSGLKPDPQNRKASSKPRWYLLMFGMVKFPAEMELSDIKSRQVRRSS
SALFPANESKGKFQCNRSSGEAAWRILRALSCKNHASVDVTASLTA
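
The CDS and protein sequences above can be checked against each other by translating the CDS. The following is a minimal sketch that assumes the standard nuclear genetic code (the page does not name a translation table) and stops at the first stop codon; `translate` is a hypothetical helper written for illustration.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from wrapped text
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for pos in range(0, len(cds), 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 27 and last 18 nucleotides of the CDS listed above.
print(translate("ATGTTCATGGACGACACCAGCGAATCC"))  # prints: MFMDDTSES
print(translate("GCGTCGTTAACTGCCTGA"))           # prints: ASLTA
```

The two fragments reproduce the first nine and last five residues of the protein sequence; passing the full CDS text, with its line breaks, should yield the whole protein if the assumed code table applies.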