; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0009337 (gene) of Snake gourd v1 genome

Gene IDTan0009337
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionDNA-directed RNA polymerase III subunit RPC7-like isoform X1
Genome locationLG06:1904817..1907823
RNA-Seq ExpressionTan0009337
SyntenyTan0009337
Gene Ontology termsGO:0006383 - transcription by RNA polymerase III (biological process)
GO:0005666 - RNA polymerase III complex (cellular component)
GO:0003899 - DNA-directed 5'-3' RNA polymerase activity (molecular function)
InterPro domainsIPR024661 - DNA-directed RNA polymerase III, subunit Rpc31


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6575499.1 hypothetical protein SDJN03_26138, partial [Cucurbita argyrosperma subsp. sororia]6.3e-9289.32Show/hide
Query:  MAFRGRGRGRGGGGGTFQYAKQEPFELFPENVTLPSVSDVPEEKGLVICNSKLLNYWKASPFYLEENIMKKMQRTEVEKFSDRSKSNSTLKRDSLAQILQ
        MAFRGRGRGR GGGG+FQYAKQEPFELFPENVTLP+VSD+PE KGLVICNS+LLNYWKASPFYLEEN+MKKMQ+TE+E+FSDR+KSNSTLKRDSLAQILQ
Subjt:  MAFRGRGRGRGGGGGTFQYAKQEPFELFPENVTLPSVSDVPEEKGLVICNSKLLNYWKASPFYLEENIMKKMQRTEVEKFSDRSKSNSTLKRDSLAQILQ

Query:  LTTRNFPEELVEGFKGKLRNKRKVQWNPESGLQKLDFFEKREESLKGQDKDGKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDEDDYNMEDDG
        LT+RNFPEELVEGFKGKLR+KRKVQWNPESGL KLDF EKREESLKGQ+KD KEKKEGE  EDEDEE+DDAQSEELTDDDYYQNEYFDDDEDDYNME++G
Subjt:  LTTRNFPEELVEGFKGKLRNKRKVQWNPESGLQKLDFFEKREESLKGQDKDGKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDEDDYNMEDDG

Query:  GDEPEY
        GDEPEY
Subjt:  GDEPEY

XP_022154143.1 ribosomal L1 domain-containing protein CG13096-like [Momordica charantia]1.9e-9693.24Show/hide
Query:  MAFR-GRGRGRGGGGGTFQYAKQEPFELFPENVTLPSVSDVPEEKGLVICNSKLLNYWKASPFYLEENIMKKMQRTEVEKFSDRSKSNSTLKRDSLAQIL
        MAFR GRGRGRGGGGG FQYAKQEPFELFPENVTLPSVSDVPEEKGLVICNS+LLNYWKASPF+LEEN++KKMQRTE+EKFSDRSK NSTLKRDSLAQIL
Subjt:  MAFR-GRGRGRGGGGGTFQYAKQEPFELFPENVTLPSVSDVPEEKGLVICNSKLLNYWKASPFYLEENIMKKMQRTEVEKFSDRSKSNSTLKRDSLAQIL

Query:  QLTTRNFPEELVEGFKGKLRNKRKVQWNPESGLQKLDFFEKREESLKGQDKDGKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDEDDYNMEDD
        QLT+RNFPEELVEGFKGKLRNKRKVQWNPESGLQKLDF EKREESLKGQDKD KEKKEGEEGEDE++EEDDAQSEELTDDDYYQNEYFDDDEDDYNMEDD
Subjt:  QLTTRNFPEELVEGFKGKLRNKRKVQWNPESGLQKLDFFEKREESLKGQDKDGKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDEDDYNMEDD

Query:  GGDEPEY
        GGDEP Y
Subjt:  GGDEPEY

XP_022953194.1 glutamic acid-rich protein-like [Cucurbita moschata]6.3e-9289.32Show/hide
Query:  MAFRGRGRGRGGGGGTFQYAKQEPFELFPENVTLPSVSDVPEEKGLVICNSKLLNYWKASPFYLEENIMKKMQRTEVEKFSDRSKSNSTLKRDSLAQILQ
        MAFRGRGRGR GGGG+FQYAKQEPFELFPENVTLP+VSD+PE KGLVICNS+LLNYWKASPFYLEEN+MKKMQ+TE+E+FSDR+KSNSTLKRDSLAQILQ
Subjt:  MAFRGRGRGRGGGGGTFQYAKQEPFELFPENVTLPSVSDVPEEKGLVICNSKLLNYWKASPFYLEENIMKKMQRTEVEKFSDRSKSNSTLKRDSLAQILQ

Query:  LTTRNFPEELVEGFKGKLRNKRKVQWNPESGLQKLDFFEKREESLKGQDKDGKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDEDDYNMEDDG
        LT+RNFPEELVEGFKGKLR+KRKVQWNPESGL KLDF EKREESLKGQ+KD KEKKEGE  EDEDEE+DDAQSEELTDDDYYQNEYFDDDEDDYNME++G
Subjt:  LTTRNFPEELVEGFKGKLRNKRKVQWNPESGLQKLDFFEKREESLKGQDKDGKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDEDDYNMEDDG

Query:  GDEPEY
        GDEPEY
Subjt:  GDEPEY

XP_023548752.1 DNA-directed RNA polymerase III subunit rpc31-like [Cucurbita pepo subsp. pepo]1.3e-9290.29Show/hide
Query:  MAFRGRGRGRGGGGGTFQYAKQEPFELFPENVTLPSVSDVPEEKGLVICNSKLLNYWKASPFYLEENIMKKMQRTEVEKFSDRSKSNSTLKRDSLAQILQ
        MAFRGRGRGR GGGG+FQYAKQEPFELFPENVTLP+VSD+PE KGLVICNS+LLNYWKASPFYLEEN+MKKMQRTE+E+FSDR+KSNSTLKRDSLAQILQ
Subjt:  MAFRGRGRGRGGGGGTFQYAKQEPFELFPENVTLPSVSDVPEEKGLVICNSKLLNYWKASPFYLEENIMKKMQRTEVEKFSDRSKSNSTLKRDSLAQILQ

Query:  LTTRNFPEELVEGFKGKLRNKRKVQWNPESGLQKLDFFEKREESLKGQDKDGKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDEDDYNMEDDG
        LT+RNFPEELVEGFKGKLR+KRKVQWNPESGL KLDF EKREESLKGQ+KD KEKKEGE  EDEDEEEDDAQSEELTDDDYYQNEYFDDDEDDYNME++G
Subjt:  LTTRNFPEELVEGFKGKLRNKRKVQWNPESGLQKLDFFEKREESLKGQDKDGKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDEDDYNMEDDG

Query:  GDEPEY
        GDEPEY
Subjt:  GDEPEY

XP_038898864.1 glutamic acid-rich protein-like isoform X2 [Benincasa hispida]6.1e-9592.23Show/hide
Query:  MAFRGRGRGRGGGGGTFQYAKQEPFELFPENVTLPSVSDVPEEKGLVICNSKLLNYWKASPFYLEENIMKKMQRTEVEKFSDRSKSNSTLKRDSLAQILQ
        MAFRGRGRGRGGGGG FQYAKQEPFELFPENVTLP VSD+PEEK L I N+K LNYWKASPFYLEEN+MKKMQRTEVEKFSDRSKSNSTLKRDSLAQILQ
Subjt:  MAFRGRGRGRGGGGGTFQYAKQEPFELFPENVTLPSVSDVPEEKGLVICNSKLLNYWKASPFYLEENIMKKMQRTEVEKFSDRSKSNSTLKRDSLAQILQ

Query:  LTTRNFPEELVEGFKGKLRNKRKVQWNPESGLQKLDFFEKREESLKGQDKDGKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDEDDYNMEDDG
        LT+RNFPEELV+GFKGKLR KRKVQWNPESGLQKLDF EKREESLKGQDKD KEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDEDDYNME++G
Subjt:  LTTRNFPEELVEGFKGKLRNKRKVQWNPESGLQKLDFFEKREESLKGQDKDGKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDEDDYNMEDDG

Query:  GDEPEY
        GDEPEY
Subjt:  GDEPEY

TrEMBL top hitse value%identityAlignment
A0A1S3CGV4 DNA-directed RNA polymerase III subunit3.2e-8986.41Show/hide
Query:  MAFRGRGRGRGGGGGTFQYAKQEPFELFPENVTLPSVSDVPEEKGLVICNSKLLNYWKASPFYLEENIMKKMQRTEVEKFSDRSKSNSTLKRDSLAQILQ
        MAFRGRGRGRGGGGG+FQYAKQEPFELFPENVTLPSVS++PEE  L +     L YWKASPFYLEEN+MKKMQRTE+EKFSDR K NSTLKRDSLAQI+Q
Subjt:  MAFRGRGRGRGGGGGTFQYAKQEPFELFPENVTLPSVSDVPEEKGLVICNSKLLNYWKASPFYLEENIMKKMQRTEVEKFSDRSKSNSTLKRDSLAQILQ

Query:  LTTRNFPEELVEGFKGKLRNKRKVQWNPESGLQKLDFFEKREESLKGQDKDGKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDEDDYNMEDDG
        LT+RNFPEELVEGFKGKLR KRKVQWNPESGL+K+DF EKREESLKGQDK+ KEKKEGEEGEDEDEEE+DAQSEELTDDDYYQNEYFDDDEDDYNME++G
Subjt:  LTTRNFPEELVEGFKGKLRNKRKVQWNPESGLQKLDFFEKREESLKGQDKDGKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDEDDYNMEDDG

Query:  GDEPEY
        GDEPEY
Subjt:  GDEPEY

A0A5D3BVU1 DNA-directed RNA polymerase III subunit RPC7-like isoform X17.3e-8679.46Show/hide
Query:  MAFRGRGRGRGGGGGTFQYAKQEPFELFPENVTLPSVSDVPEEKGLVICNSKLLNYWKASPFYLEENIMKKMQRTEVEKFSDRSKSNSTLKRDSLAQILQ
        MAFRGRGRGRGGGGG+FQYAKQEPFELFPENVTLPSVS++PEE  L +     L YWKASPFYLEEN+MKKMQRTE+EKFSDR K NSTLKRDSLAQI+Q
Subjt:  MAFRGRGRGRGGGGGTFQYAKQEPFELFPENVTLPSVSDVPEEKGLVICNSKLLNYWKASPFYLEENIMKKMQRTEVEKFSDRSKSNSTLKRDSLAQILQ

Query:  LTTRNFPEELVEGFKGKLRNKRKVQWNPESGLQKLDFFEKREESLKGQDKDGKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDEDDYNMEDDG
        LT+RNFPEELVEGFKGKLR KRKVQWNPESGL+K+DF EKREESLKGQDK+ KEKKEGEEGEDEDEEE+DAQSEELTDDDYYQNEYFDDDEDDYNME++G
Subjt:  LTTRNFPEELVEGFKGKLRNKRKVQWNPESGLQKLDFFEKREESLKGQDKDGKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDEDDYNMEDDG

Query:  G------------------DEPEY
        G                  DEPEY
Subjt:  G------------------DEPEY

A0A6J1DMV9 ribosomal L1 domain-containing protein CG13096-like9.2e-9793.24Show/hide
Query:  MAFR-GRGRGRGGGGGTFQYAKQEPFELFPENVTLPSVSDVPEEKGLVICNSKLLNYWKASPFYLEENIMKKMQRTEVEKFSDRSKSNSTLKRDSLAQIL
        MAFR GRGRGRGGGGG FQYAKQEPFELFPENVTLPSVSDVPEEKGLVICNS+LLNYWKASPF+LEEN++KKMQRTE+EKFSDRSK NSTLKRDSLAQIL
Subjt:  MAFR-GRGRGRGGGGGTFQYAKQEPFELFPENVTLPSVSDVPEEKGLVICNSKLLNYWKASPFYLEENIMKKMQRTEVEKFSDRSKSNSTLKRDSLAQIL

Query:  QLTTRNFPEELVEGFKGKLRNKRKVQWNPESGLQKLDFFEKREESLKGQDKDGKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDEDDYNMEDD
        QLT+RNFPEELVEGFKGKLRNKRKVQWNPESGLQKLDF EKREESLKGQDKD KEKKEGEEGEDE++EEDDAQSEELTDDDYYQNEYFDDDEDDYNMEDD
Subjt:  QLTTRNFPEELVEGFKGKLRNKRKVQWNPESGLQKLDFFEKREESLKGQDKDGKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDEDDYNMEDD

Query:  GGDEPEY
        GGDEP Y
Subjt:  GGDEPEY

A0A6J1GNY2 glutamic acid-rich protein-like3.1e-9289.32Show/hide
Query:  MAFRGRGRGRGGGGGTFQYAKQEPFELFPENVTLPSVSDVPEEKGLVICNSKLLNYWKASPFYLEENIMKKMQRTEVEKFSDRSKSNSTLKRDSLAQILQ
        MAFRGRGRGR GGGG+FQYAKQEPFELFPENVTLP+VSD+PE KGLVICNS+LLNYWKASPFYLEEN+MKKMQ+TE+E+FSDR+KSNSTLKRDSLAQILQ
Subjt:  MAFRGRGRGRGGGGGTFQYAKQEPFELFPENVTLPSVSDVPEEKGLVICNSKLLNYWKASPFYLEENIMKKMQRTEVEKFSDRSKSNSTLKRDSLAQILQ

Query:  LTTRNFPEELVEGFKGKLRNKRKVQWNPESGLQKLDFFEKREESLKGQDKDGKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDEDDYNMEDDG
        LT+RNFPEELVEGFKGKLR+KRKVQWNPESGL KLDF EKREESLKGQ+KD KEKKEGE  EDEDEE+DDAQSEELTDDDYYQNEYFDDDEDDYNME++G
Subjt:  LTTRNFPEELVEGFKGKLRNKRKVQWNPESGLQKLDFFEKREESLKGQDKDGKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDEDDYNMEDDG

Query:  GDEPEY
        GDEPEY
Subjt:  GDEPEY

A0A6J1JVL5 DNA-directed RNA polymerase III subunit rpc31-like6.8e-9289.32Show/hide
Query:  MAFRGRGRGRGGGGGTFQYAKQEPFELFPENVTLPSVSDVPEEKGLVICNSKLLNYWKASPFYLEENIMKKMQRTEVEKFSDRSKSNSTLKRDSLAQILQ
        MAFRGRGRGR GGGG+FQYAKQEPFELFPENVTLP+VSD+PE KGLVICNS+LLNYWKASPFYLEEN+MKKMQ+ E+E+FSDR+KSNSTLKRDSLAQILQ
Subjt:  MAFRGRGRGRGGGGGTFQYAKQEPFELFPENVTLPSVSDVPEEKGLVICNSKLLNYWKASPFYLEENIMKKMQRTEVEKFSDRSKSNSTLKRDSLAQILQ

Query:  LTTRNFPEELVEGFKGKLRNKRKVQWNPESGLQKLDFFEKREESLKGQDKDGKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDEDDYNMEDDG
        LT+RNFPEELVEGFKGKLR+KRKVQWNPESGL KLDF EKREESLKGQ+KD KEKKEGE  EDEDEEED+AQSEELTDDDYYQNEYFDDDEDDYNMED+G
Subjt:  LTTRNFPEELVEGFKGKLRNKRKVQWNPESGLQKLDFFEKREESLKGQDKDGKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDEDDYNMEDDG

Query:  GDEPEY
        GDEPEY
Subjt:  GDEPEY

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G01590.1 unknown protein5.3e-2039.52Show/hide
Query:  MAFRG-RGRGRGGGGGTFQYAKQEPFELFPENVTLPSVSDVPEEKGLVICNSKLLNYWKASPFYLEENIMKKMQR--TEVEKFSDRSKSN-STLKRDSLA
        M+++G RG+ +G GG    Y K EPF +FPE +TLP    +  +  LV        +W+ SP++L +  + K ++    +E++SD  K    + K  S  
Subjt:  MAFRG-RGRGRGGGGGTFQYAKQEPFELFPENVTLPSVSDVPEEKGLVICNSKLLNYWKASPFYLEENIMKKMQR--TEVEKFSDRSKSN-STLKRDSLA

Query:  QILQLTTRNFPEELVEGFKGKLRNKRKVQWNPESGLQKLDFFEKREESLKGQDKDGKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDEDDYNM
          L L   NFP+EL+   + + R  ++ +W+ E+ LQKLD FEK E   K    +GKE+K  EEGED DEE  +++ EE  + DY QN+ FDDD+DDYN 
Subjt:  QILQLTTRNFPEELVEGFKGKLRNKRKVQWNPESGLQKLDFFEKREESLKGQDKDGKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDEDDYNM

Query:  EDDGGDEPEY
        EDDG  E  Y
Subjt:  EDDGGDEPEY

AT4G01590.2 unknown protein2.4e-2039.61Show/hide
Query:  MAFRG-RGRGRGGGGGTFQYAKQEPFELFPENVTLPSVSDVPEEKGLVICNSKLLNYWKASPFYLEENIMKKMQR--TEVEKFSDRSKSN-STLKRDSLA
        M+++G RG+ +G GG    Y K EPF +FPE +TLP    +  +  LV        +W+ SP++L +  + K ++    +E++SD  K    + K  S  
Subjt:  MAFRG-RGRGRGGGGGTFQYAKQEPFELFPENVTLPSVSDVPEEKGLVICNSKLLNYWKASPFYLEENIMKKMQR--TEVEKFSDRSKSN-STLKRDSLA

Query:  QILQLTTRNFPEELVEGFKGKLRNKRKVQWNPESGLQKLDFFEKREESLKGQDKDGKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDEDDYNM
          L L   NFP+EL+   + + R  ++ +W+ E+ LQKLD FEK E   K    +GKE+K  EEGED DEE  +++ EE  + DY QN+ FDDD+DDYN 
Subjt:  QILQLTTRNFPEELVEGFKGKLRNKRKVQWNPESGLQKLDFFEKREESLKGQDKDGKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDEDDYNM

Query:  EDDGGDE
        EDDG +E
Subjt:  EDDGGDE

AT4G01590.3 unknown protein5.3e-2039.52Show/hide
Query:  MAFRG-RGRGRGGGGGTFQYAKQEPFELFPENVTLPSVSDVPEEKGLVICNSKLLNYWKASPFYLEENIMKKMQR--TEVEKFSDRSKSN-STLKRDSLA
        M+++G RG+ +G GG    Y K EPF +FPE +TLP    +  +  LV        +W+ SP++L +  + K ++    +E++SD  K    + K  S  
Subjt:  MAFRG-RGRGRGGGGGTFQYAKQEPFELFPENVTLPSVSDVPEEKGLVICNSKLLNYWKASPFYLEENIMKKMQR--TEVEKFSDRSKSN-STLKRDSLA

Query:  QILQLTTRNFPEELVEGFKGKLRNKRKVQWNPESGLQKLDFFEKREESLKGQDKDGKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDEDDYNM
          L L   NFP+EL+   + + R  ++ +W+ E+ LQKLD FEK E   K    +GKE+K  EEGED DEE  +++ EE  + DY QN+ FDDD+DDYN 
Subjt:  QILQLTTRNFPEELVEGFKGKLRNKRKVQWNPESGLQKLDFFEKREESLKGQDKDGKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDEDDYNM

Query:  EDDGGDEPEY
        EDDG  E  Y
Subjt:  EDDGGDEPEY

AT4G35680.1 Arabidopsis protein of unknown function (DUF241)1.3e-1537.25Show/hide
Query:  GRGRGRGGGGGTFQYAKQEPFELFPENVTLPSVSDVPEEKGLVICNSKLL--NYWKASPFYLEENIMKKMQRTEVEKFSDRSKSNSTLKRDSLAQILQLT
        GRG+ +G GG    Y K EPF +FPE +TLP    +  +  LV+  S      +W  SP++L +  + K           + K++  ++R          
Subjt:  GRGRGRGGGGGTFQYAKQEPFELFPENVTLPSVSDVPEEKGLVICNSKLL--NYWKASPFYLEENIMKKMQRTEVEKFSDRSKSNSTLKRDSLAQILQLT

Query:  TRNFPEELVEGFKGKLRNKRKVQWNPESGLQKLDFFEKREESLKGQDKDGKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDEDDYNMEDDGGD
          NF +ELV   + + R  ++ +W+ E+ LQKLD FEK E   K Q   G E+K  E+GED DE+  +++ EE  + DY QN+ FDDDEDDYN E+DGG 
Subjt:  TRNFPEELVEGFKGKLRNKRKVQWNPESGLQKLDFFEKREESLKGQDKDGKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDEDDYNMEDDGGD

Query:  EPEY
        E  Y
Subjt:  EPEY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCATTTAGAGGACGAGGGCGAGGACGAGGTGGCGGTGGAGGGACCTTTCAGTATGCCAAGCAAGAACCCTTCGAGCTTTTCCCGGAGAATGTAACCCTACCCAGCGT
CAGTGATGTGCCTGAAGAAAAAGGCTTGGTTATTTGTAACTCCAAGTTGCTGAATTATTGGAAGGCCTCTCCTTTTTATCTAGAGGAGAATATCATGAAAAAGATGCAAA
GAACTGAGGTAGAGAAATTTTCCGATAGATCCAAGTCGAATAGTACATTGAAGCGTGATTCCCTTGCACAAATTCTACAGCTCACAACAAGGAACTTTCCTGAAGAATTG
GTTGAAGGTTTCAAAGGGAAGTTGAGGAACAAACGAAAAGTTCAATGGAATCCTGAGTCAGGGCTGCAAAAATTGGACTTTTTCGAGAAGCGTGAAGAATCTCTCAAGGG
ACAGGATAAGGATGGTAAGGAGAAGAAAGAAGGAGAAGAAGGTGAAGACGAGGATGAAGAAGAAGATGATGCACAGTCTGAGGAACTTACCGATGATGATTATTATCAGA
ACGAATACTTCGACGATGATGAAGATGATTACAACATGGAAGATGATGGAGGAGATGAACCAGAATATTAG
mRNA sequenceShow/hide mRNA sequence
GTGAGAATCTCGTCTCGTTCGCCGTCTTCGACTCTTCCTCTCCTCTCTCTCGTATCATCGATTCGTTCTGCAGACTCCTCTCTCTCGTTCGCCGCTCGTGAGTGAGATCC
AGATCGCCGTCGACTCTTCCTCTCCCTCGTATCATAGTATCGATTCCCTCGGCTACTCGTGCTCTGTCGACTTGACTCCCAGTGTCACCAACTCACCGCCGTTGATCCGT
AGTCGAAGTTCTTTTCCATTAGATGATGAAAACAACTGTCTAGAATATAGATTTAAGGGGGATGGCATTTAGAGGACGAGGGCGAGGACGAGGTGGCGGTGGAGGGACCT
TTCAGTATGCCAAGCAAGAACCCTTCGAGCTTTTCCCGGAGAATGTAACCCTACCCAGCGTCAGTGATGTGCCTGAAGAAAAAGGCTTGGTTATTTGTAACTCCAAGTTG
CTGAATTATTGGAAGGCCTCTCCTTTTTATCTAGAGGAGAATATCATGAAAAAGATGCAAAGAACTGAGGTAGAGAAATTTTCCGATAGATCCAAGTCGAATAGTACATT
GAAGCGTGATTCCCTTGCACAAATTCTACAGCTCACAACAAGGAACTTTCCTGAAGAATTGGTTGAAGGTTTCAAAGGGAAGTTGAGGAACAAACGAAAAGTTCAATGGA
ATCCTGAGTCAGGGCTGCAAAAATTGGACTTTTTCGAGAAGCGTGAAGAATCTCTCAAGGGACAGGATAAGGATGGTAAGGAGAAGAAAGAAGGAGAAGAAGGTGAAGAC
GAGGATGAAGAAGAAGATGATGCACAGTCTGAGGAACTTACCGATGATGATTATTATCAGAACGAATACTTCGACGATGATGAAGATGATTACAACATGGAAGATGATGG
AGGAGATGAACCAGAATATTAGTCACTATGAAAGGCGGTGGTAAGATTGGTGTAGGGCATTGGATTGATTAGCCAGATTTAATTATTTCATAAGGTTTAAATTTCCTCTT
TCTTTATTTTTTTCCTTTTTTTTAAAAAAATCTGAGACTGTAAAGTATTCTTTTTTTTTTAATAATTAAATTTAAGTTCAAATC
Protein sequenceShow/hide protein sequence
MAFRGRGRGRGGGGGTFQYAKQEPFELFPENVTLPSVSDVPEEKGLVICNSKLLNYWKASPFYLEENIMKKMQRTEVEKFSDRSKSNSTLKRDSLAQILQLTTRNFPEEL
VEGFKGKLRNKRKVQWNPESGLQKLDFFEKREESLKGQDKDGKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDEDDYNMEDDGGDEPEY