; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0005656 (gene) of Snake gourd v1 genome

Gene IDTan0005656
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionWIYLD domain-containing protein
Genome locationLG11:2114530..2117623
RNA-Seq ExpressionTan0005656
SyntenyTan0005656
Gene Ontology termsGO:0034968 - histone lysine methylation (biological process)
GO:0016020 - membrane (cellular component)
GO:0018024 - histone-lysine N-methyltransferase activity (molecular function)
InterPro domainsIPR018848 - WIYLD domain
IPR043017 - WIYLD domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7033305.1 hypothetical protein SDJN02_07360 [Cucurbita argyrosperma subsp. argyrosperma]8.9e-8776.62Show/hide
Query:  MAPRVRSKKRGNLRIDAALDAMNPFGFPPKLVRDTVKELLSVYGGDDGWVFIEEGSYTLLIDTLLEKQKDGAIEKVHEENGRAADHQETSVAGCSSSVIE
        MAPR R KKRGNLRIDAALDAMNPFGF PKLVRDTVKELLSVYGGD+GWVFIEEGSYTLLIDTLLEK+KDGAIEKVHEE+GR AD QETS AGCSS+   
Subjt:  MAPRVRSKKRGNLRIDAALDAMNPFGFPPKLVRDTVKELLSVYGGDDGWVFIEEGSYTLLIDTLLEKQKDGAIEKVHEENGRAADHQETSVAGCSSSVIE

Query:  EASSSYPGAESTVKQSEVLISSYVDNEAFRTTTSLPGNDSDEKYWK-DDIASG-DDDHF-GSSLNQSLV-AHIPKIRRRKPYHGWISS-SDDREDLVQLT
                    VK +E +ISSYVDNE FR T ++P NDSDE+YWK +DI SG  D+HF  SS+NQSL+ AH PKIRRRKPYHGWISS  DDREDLV LT
Subjt:  EASSSYPGAESTVKQSEVLISSYVDNEAFRTTTSLPGNDSDEKYWK-DDIASG-DDDHF-GSSLNQSLV-AHIPKIRRRKPYHGWISS-SDDREDLVQLT

Query:  PAQLPEEFAKLFIPHAQRKRKKRWDVKPSES
        PAQLPEEFA+L IPHAQRKRK RWDVKPSES
Subjt:  PAQLPEEFAKLFIPHAQRKRKKRWDVKPSES

XP_022953876.1 uncharacterized protein LOC111456280 isoform X1 [Cucurbita moschata]6.8e-9580.95Show/hide
Query:  MAPRVRSKKRGNLRIDAALDAMNPFGFPPKLVRDTVKELLSVYGGDDGWVFIEEGSYTLLIDTLLEKQKDGAIEKVHEENGRAADHQETSVAGCSSSVIE
        MAPR R KKRGNLRIDAALDAMNPFGF PKLVRDTVKELLSVYGGD+GWVFIEEGSYTLLIDTLLEK+KDGAIEKVHEE+GR AD QETS AGCSS+V+ 
Subjt:  MAPRVRSKKRGNLRIDAALDAMNPFGFPPKLVRDTVKELLSVYGGDDGWVFIEEGSYTLLIDTLLEKQKDGAIEKVHEENGRAADHQETSVAGCSSSVIE

Query:  EASSSYPGAESTVKQSEVLISSYVDNEAFRTTTSLPGNDSDEKYWK-DDIASG-DDDHF-GSSLNQSLV-AHIPKIRRRKPYHGWISS-SDDREDLVQLT
        EAS+S PGAE TVK +E +ISSYVDNE FR T ++P NDSDE+YWK +DI SG  D+HF  SS+NQSL+ AH PKIRRRKPYHGWISS  DDREDLV LT
Subjt:  EASSSYPGAESTVKQSEVLISSYVDNEAFRTTTSLPGNDSDEKYWK-DDIASG-DDDHF-GSSLNQSLV-AHIPKIRRRKPYHGWISS-SDDREDLVQLT

Query:  PAQLPEEFAKLFIPHAQRKRKKRWDVKPSES
        PAQLPEEFA+L IPHAQRKRK RWDVKPSES
Subjt:  PAQLPEEFAKLFIPHAQRKRKKRWDVKPSES

XP_022990775.1 uncharacterized protein LOC111487557 [Cucurbita maxima]2.4e-8480.95Show/hide
Query:  MAPRVRSKKRGNLRIDAALDAMNPFGFPPKLVRDTVKELLSVYGGDDGWVFIEEGSYTLLIDTLLEKQKDGAIEKVHEENGRAADHQETSVAGCSSSVIE
        MAPR RSKKRGNLRIDAALDAMNPFGF PKLVRDTVKELLSVYGGD+GWVFIEEGSYTLLIDTLLEK+KDGAIEKVHEE+GR AD QETS AGCSS+V+ 
Subjt:  MAPRVRSKKRGNLRIDAALDAMNPFGFPPKLVRDTVKELLSVYGGDDGWVFIEEGSYTLLIDTLLEKQKDGAIEKVHEENGRAADHQETSVAGCSSSVIE

Query:  EASSSYPGAESTVKQSEVLISSYVDNEAFRTTTSLPGNDSDEKYWK-DDIASG-DDDHF-GSSLNQSLV-AHIPKIRRRKPYHGWISS-SDDREDLVQLT
        EAS+S PGAE TVK +E +ISSYVDNE FR T ++P NDSDE+YWK +DI SG  D+HF  SS+NQSL+ AHIPKIRRRKPYHGWISS  DDREDLV LT
Subjt:  EASSSYPGAESTVKQSEVLISSYVDNEAFRTTTSLPGNDSDEKYWK-DDIASG-DDDHF-GSSLNQSLV-AHIPKIRRRKPYHGWISS-SDDREDLVQLT

Query:  PAQLPEEFAK
        PAQLPEEFA+
Subjt:  PAQLPEEFAK

XP_023522527.1 uncharacterized protein LOC111786513 [Cucurbita pepo subsp. pepo]2.1e-8380.48Show/hide
Query:  MAPRVRSKKRGNLRIDAALDAMNPFGFPPKLVRDTVKELLSVYGGDDGWVFIEEGSYTLLIDTLLEKQKDGAIEKVHEENGRAADHQETSVAGCSSSVIE
        MAPR R KKRGNLRIDAALDAMNPFGF PKLVRDTVKELLSVYGGD+GWVFIEEGSYTLLIDTLLEK+KDGAIEKVHEE+GR AD QETS AGCSS+V+ 
Subjt:  MAPRVRSKKRGNLRIDAALDAMNPFGFPPKLVRDTVKELLSVYGGDDGWVFIEEGSYTLLIDTLLEKQKDGAIEKVHEENGRAADHQETSVAGCSSSVIE

Query:  EASSSYPGAESTVKQSEVLISSYVDNEAFRTTTSLPGNDSDEKYWK-DDIASG-DDDHF-GSSLNQSLV-AHIPKIRRRKPYHGWISSS-DDREDLVQLT
        EAS+S PGAE TVK +E +ISSYVDNE FR T ++P NDSDE+YWK +DI SG  D+HF  SS+NQSL+ AH PKIRRRKPYHGWISSS DDREDLV LT
Subjt:  EASSSYPGAESTVKQSEVLISSYVDNEAFRTTTSLPGNDSDEKYWK-DDIASG-DDDHF-GSSLNQSLV-AHIPKIRRRKPYHGWISSS-DDREDLVQLT

Query:  PAQLPEEFAK
        PAQLPEEFA+
Subjt:  PAQLPEEFAK

XP_023531273.1 uncharacterized protein LOC111793562 isoform X1 [Cucurbita pepo subsp. pepo]2.3e-9581.39Show/hide
Query:  MAPRVRSKKRGNLRIDAALDAMNPFGFPPKLVRDTVKELLSVYGGDDGWVFIEEGSYTLLIDTLLEKQKDGAIEKVHEENGRAADHQETSVAGCSSSVIE
        MAPR R KKRGNLRIDAALDAMNPFGF PKLVRDTVKELLSVYGGD+GWVFIEEGSYTLLIDTLLEK+KDGAIEKVHEE+GR AD QETS AGCSS+V+ 
Subjt:  MAPRVRSKKRGNLRIDAALDAMNPFGFPPKLVRDTVKELLSVYGGDDGWVFIEEGSYTLLIDTLLEKQKDGAIEKVHEENGRAADHQETSVAGCSSSVIE

Query:  EASSSYPGAESTVKQSEVLISSYVDNEAFRTTTSLPGNDSDEKYWK-DDIASG-DDDHF-GSSLNQSLV-AHIPKIRRRKPYHGWISSS-DDREDLVQLT
        EAS+S PGAE TVK +E +ISSYVDNE FR T ++P NDSDE+YWK +DI SG  D+HF  SS+NQSL+ AH PKIRRRKPYHGWISSS DDREDLV LT
Subjt:  EASSSYPGAESTVKQSEVLISSYVDNEAFRTTTSLPGNDSDEKYWK-DDIASG-DDDHF-GSSLNQSLV-AHIPKIRRRKPYHGWISSS-DDREDLVQLT

Query:  PAQLPEEFAKLFIPHAQRKRKKRWDVKPSES
        PAQLPEEFA+L IPHAQRKRK RWDVKPSES
Subjt:  PAQLPEEFAKLFIPHAQRKRKKRWDVKPSES

TrEMBL top hitse value%identityAlignment
A0A6J1BXH3 uncharacterized protein LOC111006514 isoform X27.4e-7167.54Show/hide
Query:  MAPRVRSKKRGNLRIDAALDAMNPFGFPPKLVRDTVKELLSVYGGDDGWVFIEEGSYTLLIDTLLEKQKDGAIEKVHEENGRAADHQETSVAGCSSSVIE
        MAPRVRS+KRGNLRIDAALDAM PFGF PKLVRDTVKELLSVYGGDDGWVFIEEGSYTLLIDT+L+K KDG    VHEEN RA  H+ETS+AGCSS+   
Subjt:  MAPRVRSKKRGNLRIDAALDAMNPFGFPPKLVRDTVKELLSVYGGDDGWVFIEEGSYTLLIDTLLEKQKDGAIEKVHEENGRAADHQETSVAGCSSSVIE

Query:  EASSSYPGAESTVKQ-SEVLISSYVDNEAFRTTTSLPGNDSDEKYWKDDIASGDDDHFGSSLNQSL-VAHIPKIRRRKPYHGWISSSDDREDLVQLTPAQ
              P  E TVK+  +VLIS Y DNEAFR TT L   DS+ +Y  DD   GDDDHF S  NQS   AH PKI RR PYHGWISS+D +EDLV L P  
Subjt:  EASSSYPGAESTVKQ-SEVLISSYVDNEAFRTTTSLPGNDSDEKYWKDDIASGDDDHFGSSLNQSL-VAHIPKIRRRKPYHGWISSSDDREDLVQLTPAQ

Query:  LPEEFAKLFIPHAQRKRKKRWDVKPSES
           EFA+L +   QRKRK+RWDVKP+ES
Subjt:  LPEEFAKLFIPHAQRKRKKRWDVKPSES

A0A6J1BY42 uncharacterized protein LOC111006514 isoform X16.1e-7367.98Show/hide
Query:  MAPRVRSKKRGNLRIDAALDAMNPFGFPPKLVRDTVKELLSVYGGDDGWVFIEEGSYTLLIDTLLEKQKDGAIEKVHEENGRAADHQETSVAGCSSSVIE
        MAPRVRS+KRGNLRIDAALDAM PFGF PKLVRDTVKELLSVYGGDDGWVFIEEGSYTLLIDT+L+K KDG I +VHEEN RA  H+ETS+AGCSS+   
Subjt:  MAPRVRSKKRGNLRIDAALDAMNPFGFPPKLVRDTVKELLSVYGGDDGWVFIEEGSYTLLIDTLLEKQKDGAIEKVHEENGRAADHQETSVAGCSSSVIE

Query:  EASSSYPGAESTVKQ-SEVLISSYVDNEAFRTTTSLPGNDSDEKYWKDDIASGDDDHFGSSLNQSL-VAHIPKIRRRKPYHGWISSSDDREDLVQLTPAQ
              P  E TVK+  +VLIS Y DNEAFR TT L   DS+ +Y  DD   GDDDHF S  NQS   AH PKI RR PYHGWISS+D +EDLV L P  
Subjt:  EASSSYPGAESTVKQ-SEVLISSYVDNEAFRTTTSLPGNDSDEKYWKDDIASGDDDHFGSSLNQSL-VAHIPKIRRRKPYHGWISSSDDREDLVQLTPAQ

Query:  LPEEFAKLFIPHAQRKRKKRWDVKPSES
           EFA+L +   QRKRK+RWDVKP+ES
Subjt:  LPEEFAKLFIPHAQRKRKKRWDVKPSES

A0A6J1GPA9 uncharacterized protein LOC111456280 isoform X13.3e-9580.95Show/hide
Query:  MAPRVRSKKRGNLRIDAALDAMNPFGFPPKLVRDTVKELLSVYGGDDGWVFIEEGSYTLLIDTLLEKQKDGAIEKVHEENGRAADHQETSVAGCSSSVIE
        MAPR R KKRGNLRIDAALDAMNPFGF PKLVRDTVKELLSVYGGD+GWVFIEEGSYTLLIDTLLEK+KDGAIEKVHEE+GR AD QETS AGCSS+V+ 
Subjt:  MAPRVRSKKRGNLRIDAALDAMNPFGFPPKLVRDTVKELLSVYGGDDGWVFIEEGSYTLLIDTLLEKQKDGAIEKVHEENGRAADHQETSVAGCSSSVIE

Query:  EASSSYPGAESTVKQSEVLISSYVDNEAFRTTTSLPGNDSDEKYWK-DDIASG-DDDHF-GSSLNQSLV-AHIPKIRRRKPYHGWISS-SDDREDLVQLT
        EAS+S PGAE TVK +E +ISSYVDNE FR T ++P NDSDE+YWK +DI SG  D+HF  SS+NQSL+ AH PKIRRRKPYHGWISS  DDREDLV LT
Subjt:  EASSSYPGAESTVKQSEVLISSYVDNEAFRTTTSLPGNDSDEKYWK-DDIASG-DDDHF-GSSLNQSLV-AHIPKIRRRKPYHGWISS-SDDREDLVQLT

Query:  PAQLPEEFAKLFIPHAQRKRKKRWDVKPSES
        PAQLPEEFA+L IPHAQRKRK RWDVKPSES
Subjt:  PAQLPEEFAKLFIPHAQRKRKKRWDVKPSES

A0A6J1GQX1 uncharacterized protein LOC111456280 isoform X22.9e-8380Show/hide
Query:  MAPRVRSKKRGNLRIDAALDAMNPFGFPPKLVRDTVKELLSVYGGDDGWVFIEEGSYTLLIDTLLEKQKDGAIEKVHEENGRAADHQETSVAGCSSSVIE
        MAPR R KKRGNLRIDAALDAMNPFGF PKLVRDTVKELLSVYGGD+GWVFIEEGSYTLLIDTLLEK+KDGAIEKVHEE+GR AD QETS AGCSS+V+ 
Subjt:  MAPRVRSKKRGNLRIDAALDAMNPFGFPPKLVRDTVKELLSVYGGDDGWVFIEEGSYTLLIDTLLEKQKDGAIEKVHEENGRAADHQETSVAGCSSSVIE

Query:  EASSSYPGAESTVKQSEVLISSYVDNEAFRTTTSLPGNDSDEKYWK-DDIASG-DDDHF-GSSLNQSLV-AHIPKIRRRKPYHGWISS-SDDREDLVQLT
        EAS+S PGAE TVK +E +ISSYVDNE FR T ++P NDSDE+YWK +DI SG  D+HF  SS+NQSL+ AH PKIRRRKPYHGWISS  DDREDLV LT
Subjt:  EASSSYPGAESTVKQSEVLISSYVDNEAFRTTTSLPGNDSDEKYWK-DDIASG-DDDHF-GSSLNQSLV-AHIPKIRRRKPYHGWISS-SDDREDLVQLT

Query:  PAQLPEEFAK
        PAQLPEEFA+
Subjt:  PAQLPEEFAK

A0A6J1JR10 uncharacterized protein LOC1114875571.2e-8480.95Show/hide
Query:  MAPRVRSKKRGNLRIDAALDAMNPFGFPPKLVRDTVKELLSVYGGDDGWVFIEEGSYTLLIDTLLEKQKDGAIEKVHEENGRAADHQETSVAGCSSSVIE
        MAPR RSKKRGNLRIDAALDAMNPFGF PKLVRDTVKELLSVYGGD+GWVFIEEGSYTLLIDTLLEK+KDGAIEKVHEE+GR AD QETS AGCSS+V+ 
Subjt:  MAPRVRSKKRGNLRIDAALDAMNPFGFPPKLVRDTVKELLSVYGGDDGWVFIEEGSYTLLIDTLLEKQKDGAIEKVHEENGRAADHQETSVAGCSSSVIE

Query:  EASSSYPGAESTVKQSEVLISSYVDNEAFRTTTSLPGNDSDEKYWK-DDIASG-DDDHF-GSSLNQSLV-AHIPKIRRRKPYHGWISS-SDDREDLVQLT
        EAS+S PGAE TVK +E +ISSYVDNE FR T ++P NDSDE+YWK +DI SG  D+HF  SS+NQSL+ AHIPKIRRRKPYHGWISS  DDREDLV LT
Subjt:  EASSSYPGAESTVKQSEVLISSYVDNEAFRTTTSLPGNDSDEKYWK-DDIASG-DDDHF-GSSLNQSLV-AHIPKIRRRKPYHGWISS-SDDREDLVQLT

Query:  PAQLPEEFAK
        PAQLPEEFA+
Subjt:  PAQLPEEFAK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G45248.2 Nucleolar histone methyltransferase-related protein4.1e-0534.92Show/hide
Query:  KKRGNLRIDAALDAMNPFGFPPKLVRDTVKELLSVYGGDDGWVFIEEGSYTLLIDTLLEKQKD
        K  G  R DAA D M  FGF   ++  ++K++L VY G+D W  IE+ +Y + +   L+ +++
Subjt:  KKRGNLRIDAALDAMNPFGFPPKLVRDTVKELLSVYGGDDGWVFIEEGSYTLLIDTLLEKQKD

AT1G45248.3 Nucleolar histone methyltransferase-related protein4.1e-0534.92Show/hide
Query:  KKRGNLRIDAALDAMNPFGFPPKLVRDTVKELLSVYGGDDGWVFIEEGSYTLLIDTLLEKQKD
        K  G  R DAA D M  FGF   ++  ++K++L VY G+D W  IE+ +Y + +   L+ +++
Subjt:  KKRGNLRIDAALDAMNPFGFPPKLVRDTVKELLSVYGGDDGWVFIEEGSYTLLIDTLLEKQKD

AT2G40020.1 Nucleolar histone methyltransferase-related protein2.0e-0725.11Show/hide
Query:  LRIDAALDAMNPFGFPPKLVRDTVKELLSVYGGDDGWVFIEEGSYTLLIDTLLEKQKD----------GAIEKVHEENGRAADHQETSVAGCSSSVIEEA
        +R DAA D M  FGF   ++ +++KELL VY  +D W  IE+ SY  L+   LEKQ++            + + H E   A + Q   +A       +E 
Subjt:  LRIDAALDAMNPFGFPPKLVRDTVKELLSVYGGDDGWVFIEEGSYTLLIDTLLEKQKD----------GAIEKVHEENGRAADHQETSVAGCSSSVIEEA

Query:  SSSYPGAESTVKQSEVLISSYVDNEAFRTTTSLPGNDSDEKYWKDDIASGDDDHFGSSLNQSLV-----------AHIPKIRRRKPYHGWISSSDDREDL
                   +  E+ I+S   N+       L    SD  Y +  +   ++ H G  +++                 P+  + K       S  D +++
Subjt:  SSSYPGAESTVKQSEVLISSYVDNEAFRTTTSLPGNDSDEKYWKDDIASGDDDHFGSSLNQSLV-----------AHIPKIRRRKPYHGWISSSDDREDL

Query:  VQLTPAQLPEEFAKLFIPHAQRKRKKR
        +QLTP  L EE  +L      +KR+K+
Subjt:  VQLTPAQLPEEFAKLFIPHAQRKRKKR

AT2G40020.2 Nucleolar histone methyltransferase-related protein3.8e-1148.57Show/hide
Query:  MAPRVRSKKRGNLRIDAALDAMNPFGFPPKLVRDTVKELLSVYGGDDGWVFIEEGSYTLLIDTLLEKQKD
        MAPR R KK G +R DAA D M  FGF   ++ +++KELL VY  +D W  IE+ SY  L+   LEKQ++
Subjt:  MAPRVRSKKRGNLRIDAALDAMNPFGFPPKLVRDTVKELLSVYGGDDGWVFIEEGSYTLLIDTLLEKQKD

AT2G40020.3 Nucleolar histone methyltransferase-related protein2.2e-1127.2Show/hide
Query:  MAPRVRSKKRGNLRIDAALDAMNPFGFPPKLVRDTVKELLSVYGGDDGWVFIEEGSYTLLIDTLLEKQKD----------GAIEKVHEENGRAADHQETS
        MAPR R KK G +R DAA D M  FGF   ++ +++KELL VY  +D W  IE+ SY  L+   LEKQ++            + + H E   A + Q   
Subjt:  MAPRVRSKKRGNLRIDAALDAMNPFGFPPKLVRDTVKELLSVYGGDDGWVFIEEGSYTLLIDTLLEKQKD----------GAIEKVHEENGRAADHQETS

Query:  VAGCSSSVIEEASSSYPGAESTVKQSEVLISSYVDNEAFRTTTSLPGNDSDEKYWKDDIASGDDDHFGSSLNQSLV-----------AHIPKIRRRKPYH
        +A       +E            +  E+ I+S   N+       L    SD  Y +  +   ++ H G  +++                 P+  + K   
Subjt:  VAGCSSSVIEEASSSYPGAESTVKQSEVLISSYVDNEAFRTTTSLPGNDSDEKYWKDDIASGDDDHFGSSLNQSLV-----------AHIPKIRRRKPYH

Query:  GWISSSDDREDLVQLTPAQLPEEFAKLFIPHAQRKRKKR
            S  D ++++QLTP  L EE  +L      +KR+K+
Subjt:  GWISSSDDREDLVQLTPAQLPEEFAKLFIPHAQRKRKKR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTCCCAGAGTACGAAGCAAAAAGAGGGGTAACTTACGAATTGATGCTGCGCTCGATGCAATGAACCCTTTCGGATTTCCTCCAAAGTTGGTTCGTGACACGGTCAA
GGAACTCCTCAGTGTCTATGGAGGAGATGATGGATGGGTATTCATTGAAGAAGGCTCTTATACTCTCTTGATTGATACCCTTCTCGAGAAACAGAAAGATGGTGCAATAG
AGAAGGTTCATGAAGAGAATGGAAGAGCTGCAGATCATCAGGAGACCTCAGTAGCTGGCTGTTCATCGAGTGTTATCGAGGAAGCTTCCTCATCTTATCCTGGGGCTGAG
AGTACTGTGAAACAGAGTGAGGTTTTAATTTCATCATATGTGGATAATGAAGCTTTCAGGACCACAACCTCATTACCTGGAAATGATTCAGATGAAAAATACTGGAAGGA
CGATATAGCTTCTGGTGATGATGACCATTTTGGAAGTTCTCTGAACCAATCTTTAGTGGCACATATCCCCAAAATAAGGAGGCGAAAACCTTATCACGGCTGGATCTCCT
CGAGCGACGACAGGGAAGATCTCGTGCAGTTGACACCAGCTCAATTGCCTGAAGAGTTTGCCAAGTTATTCATTCCTCATGCACAGAGAAAAAGAAAGAAGCGTTGGGAT
GTGAAGCCTTCAGAATCAGGAGCTTTTATTTGA
mRNA sequenceShow/hide mRNA sequence
CTTAAACGAAGCCCAAATCACTCGTCTTCCTCCTGCGCGCTTATCGCCTGCGTTACCAAAAACCGCCAAGTTTTCGACTTCACTCTTCTTCCCCTTCCTTCTGGACACAT
TCTCTCCTCATTTCTCTTAACTTTGATCTTGATCCCCACTCCATAGTCCATCCCACCCATTCTTCTGCTCATGGTTTCTCGATTACACTAGCACGAGAAAGTGGTCGACG
TTTCTTTGTTGGGATTCTGAGGTCCAAAAAATGGCTCCCAGAGTACGAAGCAAAAAGAGGGGTAACTTACGAATTGATGCTGCGCTCGATGCAATGAACCCTTTCGGATT
TCCTCCAAAGTTGGTTCGTGACACGGTCAAGGAACTCCTCAGTGTCTATGGAGGAGATGATGGATGGGTATTCATTGAAGAAGGCTCTTATACTCTCTTGATTGATACCC
TTCTCGAGAAACAGAAAGATGGTGCAATAGAGAAGGTTCATGAAGAGAATGGAAGAGCTGCAGATCATCAGGAGACCTCAGTAGCTGGCTGTTCATCGAGTGTTATCGAG
GAAGCTTCCTCATCTTATCCTGGGGCTGAGAGTACTGTGAAACAGAGTGAGGTTTTAATTTCATCATATGTGGATAATGAAGCTTTCAGGACCACAACCTCATTACCTGG
AAATGATTCAGATGAAAAATACTGGAAGGACGATATAGCTTCTGGTGATGATGACCATTTTGGAAGTTCTCTGAACCAATCTTTAGTGGCACATATCCCCAAAATAAGGA
GGCGAAAACCTTATCACGGCTGGATCTCCTCGAGCGACGACAGGGAAGATCTCGTGCAGTTGACACCAGCTCAATTGCCTGAAGAGTTTGCCAAGTTATTCATTCCTCAT
GCACAGAGAAAAAGAAAGAAGCGTTGGGATGTGAAGCCTTCAGAATCAGGAGCTTTTATTTGACTGCTTTGTTTTGTTATTGGATTTTGGTGATCGGAGGGAGGATAAGT
GAGAGTGTAAATGGTAGTTAAGAGTTTTGAGTTGTAGAACGAGGCTGTAAGTTGTGTAGTTTAGATGGTGTACTACAGGTGAGTTCTTCAGACTTCCTGGACCTGATATA
GAAAATGATCATTACATTCGCGGATTTGGTTTCAGCCCTAAGACGAAGGAGTACAAATTGTTTAAAACTTTAAGGAAAGCAAGGATTTGTCATATATGCTCTTGATGTTG
AAACTGAAAAAATGAAGTTGATGCTGTCTGGCTCTCGTTCATGGCGCCATGTCTTAAGATTCAAGCGTGGGGAATGCAAAAGAAACATTCATGGAGTAGAGATTTTGTGG
TTGGTGACATAACCAAGGCCCCACGATTTTACTTGGTTAAAGTTTCTGAAGATGGAGAGAGATAATAAGGCTGGGAGGAGGAAGATCAACATAAAAACATTGATTCAAAA
TGGGACCATACTTTACCATCTTTGTCATATAGAATCCTTGCAATTTGGTTTTCTCCCAACCATTCTGCGTAGATAGAACATTATGTTTTTTTTTTCATTTAATAAATTAT
GTTTGACTTTCCTCACTATGAGAAAGTTTGGTTGATCAAATTGGATGATTACGGAGA
Protein sequenceShow/hide protein sequence
MAPRVRSKKRGNLRIDAALDAMNPFGFPPKLVRDTVKELLSVYGGDDGWVFIEEGSYTLLIDTLLEKQKDGAIEKVHEENGRAADHQETSVAGCSSSVIEEASSSYPGAE
STVKQSEVLISSYVDNEAFRTTTSLPGNDSDEKYWKDDIASGDDDHFGSSLNQSLVAHIPKIRRRKPYHGWISSSDDREDLVQLTPAQLPEEFAKLFIPHAQRKRKKRWD
VKPSESGAFI