; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr023903 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr023903
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionWIYLD domain-containing protein
Genome locationtig00001047:1274753..1277611
RNA-Seq ExpressionSgr023903
SyntenySgr023903
Gene Ontology termsGO:0034968 - histone lysine methylation (biological process)
GO:0018024 - histone-lysine N-methyltransferase activity (molecular function)
InterPro domainsIPR018848 - WIYLD domain
IPR043017 - WIYLD domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7033305.1 hypothetical protein SDJN02_07360 [Cucurbita argyrosperma subsp. argyrosperma]1.2e-7066.52Show/hide
Query:  EKWLREGNLRIDAALDAMSPFGFPPKLVRDTVKDLLSVYGGDDGWVFIEEGSYTLLIDTILEKLKDGTIEKGHEENGRDEDHKETSVAGCSSSGITETSS
        E+  + GNLRIDAALDAM+PFGF PKLVRDTVK+LLSVYGGD+GWVFIEEGSYTLLIDT+LEK KDG IEK HEE+GR  D +ETS AGCSS+       
Subjt:  EKWLREGNLRIDAALDAMSPFGFPPKLVRDTVKDLLSVYGGDDGWVFIEEGSYTLLIDTILEKLKDGTIEKGHEENGRDEDHKETSVAGCSSSGITETSS

Query:  SNPGAEVTVKQSNNVLLSSYVDNEAFRITTTLPTNDSEERYWKDDSISGG---DDRCRSTLNQS-SPAHTPTISRGKPYHGWISS--DDKEDLVYLTPAP
                VK +  V +SSYVDNE FRIT T+P NDS+ERYWK++ I  G   +   RS++NQS   AHTP I R KPYHGWISS  DD+EDLV+LTPA 
Subjt:  SNPGAEVTVKQSNNVLLSSYVDNEAFRITTTLPTNDSEERYWKDDSISGG---DDRCRSTLNQS-SPAHTPTISRGKPYHGWISS--DDKEDLVYLTPAP

Query:  LPEEFAKLLMPDAQRKRKKRWDVK
        LPEEFA+LL+P AQRKRK RWDVK
Subjt:  LPEEFAKLLMPDAQRKRKKRWDVK

XP_022953876.1 uncharacterized protein LOC111456280 isoform X1 [Cucurbita moschata]9.4e-7970.54Show/hide
Query:  EKWLREGNLRIDAALDAMSPFGFPPKLVRDTVKDLLSVYGGDDGWVFIEEGSYTLLIDTILEKLKDGTIEKGHEENGRDEDHKETSVAGCSSSGITETSS
        E+  + GNLRIDAALDAM+PFGF PKLVRDTVK+LLSVYGGD+GWVFIEEGSYTLLIDT+LEK KDG IEK HEE+GR  D +ETS AGCSS+ + E S+
Subjt:  EKWLREGNLRIDAALDAMSPFGFPPKLVRDTVKDLLSVYGGDDGWVFIEEGSYTLLIDTILEKLKDGTIEKGHEENGRDEDHKETSVAGCSSSGITETSS

Query:  SNPGAEVTVKQSNNVLLSSYVDNEAFRITTTLPTNDSEERYWKDDSISGG---DDRCRSTLNQS-SPAHTPTISRGKPYHGWISS--DDKEDLVYLTPAP
        SNPGAE+TVK +  V +SSYVDNE FRIT T+P NDS+ERYWK++ I  G   +   RS++NQS   AHTP I R KPYHGWISS  DD+EDLV+LTPA 
Subjt:  SNPGAEVTVKQSNNVLLSSYVDNEAFRITTTLPTNDSEERYWKDDSISGG---DDRCRSTLNQS-SPAHTPTISRGKPYHGWISS--DDKEDLVYLTPAP

Query:  LPEEFAKLLMPDAQRKRKKRWDVK
        LPEEFA+LL+P AQRKRK RWDVK
Subjt:  LPEEFAKLLMPDAQRKRKKRWDVK

XP_022953890.1 uncharacterized protein LOC111456280 isoform X2 [Cucurbita moschata]2.1e-7069.57Show/hide
Query:  EKWLREGNLRIDAALDAMSPFGFPPKLVRDTVKDLLSVYGGDDGWVFIEEGSYTLLIDTILEKLKDGTIEKGHEENGRDEDHKETSVAGCSSSGITETSS
        E+  + GNLRIDAALDAM+PFGF PKLVRDTVK+LLSVYGGD+GWVFIEEGSYTLLIDT+LEK KDG IEK HEE+GR  D +ETS AGCSS+ + E S+
Subjt:  EKWLREGNLRIDAALDAMSPFGFPPKLVRDTVKDLLSVYGGDDGWVFIEEGSYTLLIDTILEKLKDGTIEKGHEENGRDEDHKETSVAGCSSSGITETSS

Query:  SNPGAEVTVKQSNNVLLSSYVDNEAFRITTTLPTNDSEERYWKDDSISGG---DDRCRSTLNQS-SPAHTPTISRGKPYHGWISS--DDKEDLVYLTPAP
        SNPGAE+TVK +  V +SSYVDNE FRIT T+P NDS+ERYWK++ I  G   +   RS++NQS   AHTP I R KPYHGWISS  DD+EDLV+LTPA 
Subjt:  SNPGAEVTVKQSNNVLLSSYVDNEAFRITTTLPTNDSEERYWKDDSISGG---DDRCRSTLNQS-SPAHTPTISRGKPYHGWISS--DDKEDLVYLTPAP

Query:  LPEEFAK
        LPEEFA+
Subjt:  LPEEFAK

XP_023522527.1 uncharacterized protein LOC111786513 [Cucurbita pepo subsp. pepo]2.1e-7069.57Show/hide
Query:  EKWLREGNLRIDAALDAMSPFGFPPKLVRDTVKDLLSVYGGDDGWVFIEEGSYTLLIDTILEKLKDGTIEKGHEENGRDEDHKETSVAGCSSSGITETSS
        E+  + GNLRIDAALDAM+PFGF PKLVRDTVK+LLSVYGGD+GWVFIEEGSYTLLIDT+LEK KDG IEK HEE+GR  D +ETS AGCSS+ + E S+
Subjt:  EKWLREGNLRIDAALDAMSPFGFPPKLVRDTVKDLLSVYGGDDGWVFIEEGSYTLLIDTILEKLKDGTIEKGHEENGRDEDHKETSVAGCSSSGITETSS

Query:  SNPGAEVTVKQSNNVLLSSYVDNEAFRITTTLPTNDSEERYWKDDSISGG---DDRCRSTLNQS-SPAHTPTISRGKPYHGWISS--DDKEDLVYLTPAP
        SNPGAE+TVK +  V +SSYVDNE FRIT T+P NDS+ERYWK++ I  G   +   RS++NQS   AHTP I R KPYHGWISS  DD+EDLV+LTPA 
Subjt:  SNPGAEVTVKQSNNVLLSSYVDNEAFRITTTLPTNDSEERYWKDDSISGG---DDRCRSTLNQS-SPAHTPTISRGKPYHGWISS--DDKEDLVYLTPAP

Query:  LPEEFAK
        LPEEFA+
Subjt:  LPEEFAK

XP_023531273.1 uncharacterized protein LOC111793562 isoform X1 [Cucurbita pepo subsp. pepo]9.4e-7970.54Show/hide
Query:  EKWLREGNLRIDAALDAMSPFGFPPKLVRDTVKDLLSVYGGDDGWVFIEEGSYTLLIDTILEKLKDGTIEKGHEENGRDEDHKETSVAGCSSSGITETSS
        E+  + GNLRIDAALDAM+PFGF PKLVRDTVK+LLSVYGGD+GWVFIEEGSYTLLIDT+LEK KDG IEK HEE+GR  D +ETS AGCSS+ + E S+
Subjt:  EKWLREGNLRIDAALDAMSPFGFPPKLVRDTVKDLLSVYGGDDGWVFIEEGSYTLLIDTILEKLKDGTIEKGHEENGRDEDHKETSVAGCSSSGITETSS

Query:  SNPGAEVTVKQSNNVLLSSYVDNEAFRITTTLPTNDSEERYWKDDSISGG---DDRCRSTLNQS-SPAHTPTISRGKPYHGWISS--DDKEDLVYLTPAP
        SNPGAE+TVK +  V +SSYVDNE FRIT T+P NDS+ERYWK++ I  G   +   RS++NQS   AHTP I R KPYHGWISS  DD+EDLV+LTPA 
Subjt:  SNPGAEVTVKQSNNVLLSSYVDNEAFRITTTLPTNDSEERYWKDDSISGG---DDRCRSTLNQS-SPAHTPTISRGKPYHGWISS--DDKEDLVYLTPAP

Query:  LPEEFAKLLMPDAQRKRKKRWDVK
        LPEEFA+LL+P AQRKRK RWDVK
Subjt:  LPEEFAKLLMPDAQRKRKKRWDVK

TrEMBL top hitse value%identityAlignment
A0A6J1BXH3 uncharacterized protein LOC111006514 isoform X24.7e-6868.81Show/hide
Query:  REGNLRIDAALDAMSPFGFPPKLVRDTVKDLLSVYGGDDGWVFIEEGSYTLLIDTILEKLKDGTIEKGHEENGRDEDHKETSVAGCSSSGITETSSSNPG
        + GNLRIDAALDAM+PFGF PKLVRDTVK+LLSVYGGDDGWVFIEEGSYTLLIDTIL+KLKDG     HEEN R E H+ETS+AGC         SSNP 
Subjt:  REGNLRIDAALDAMSPFGFPPKLVRDTVKDLLSVYGGDDGWVFIEEGSYTLLIDTILEKLKDGTIEKGHEENGRDEDHKETSVAGCSSSGITETSSSNPG

Query:  AEVTVKQSNNVLLSSYVDNEAFRITTTLPTNDSEERYWKDDSISGGDDRCRSTLNQSSP-AHTPTISRGKPYHGWISSDD-KEDLVYLTPAPLPEEFAKL
         E+TVK+ ++VL+S Y DNEAFRITT L T DSE RY  DD+  G DD  RS  NQS+P AHTP ISR +PYHGWISS+D KEDLV+L P P   EFA+L
Subjt:  AEVTVKQSNNVLLSSYVDNEAFRITTTLPTNDSEERYWKDDSISGGDDRCRSTLNQSSP-AHTPTISRGKPYHGWISSDD-KEDLVYLTPAPLPEEFAKL

Query:  LMPDAQRKRKKRWDVKLA
        LM   QRKRK+RWDVK A
Subjt:  LMPDAQRKRKKRWDVKLA

A0A6J1BY42 uncharacterized protein LOC111006514 isoform X18.6e-7069.27Show/hide
Query:  REGNLRIDAALDAMSPFGFPPKLVRDTVKDLLSVYGGDDGWVFIEEGSYTLLIDTILEKLKDGTIEKGHEENGRDEDHKETSVAGCSSSGITETSSSNPG
        + GNLRIDAALDAM+PFGF PKLVRDTVK+LLSVYGGDDGWVFIEEGSYTLLIDTIL+KLKDG I + HEEN R E H+ETS+AGC         SSNP 
Subjt:  REGNLRIDAALDAMSPFGFPPKLVRDTVKDLLSVYGGDDGWVFIEEGSYTLLIDTILEKLKDGTIEKGHEENGRDEDHKETSVAGCSSSGITETSSSNPG

Query:  AEVTVKQSNNVLLSSYVDNEAFRITTTLPTNDSEERYWKDDSISGGDDRCRSTLNQSSP-AHTPTISRGKPYHGWISSDD-KEDLVYLTPAPLPEEFAKL
         E+TVK+ ++VL+S Y DNEAFRITT L T DSE RY  DD+  G DD  RS  NQS+P AHTP ISR +PYHGWISS+D KEDLV+L P P   EFA+L
Subjt:  AEVTVKQSNNVLLSSYVDNEAFRITTTLPTNDSEERYWKDDSISGGDDRCRSTLNQSSP-AHTPTISRGKPYHGWISSDD-KEDLVYLTPAPLPEEFAKL

Query:  LMPDAQRKRKKRWDVKLA
        LM   QRKRK+RWDVK A
Subjt:  LMPDAQRKRKKRWDVKLA

A0A6J1GPA9 uncharacterized protein LOC111456280 isoform X14.5e-7970.54Show/hide
Query:  EKWLREGNLRIDAALDAMSPFGFPPKLVRDTVKDLLSVYGGDDGWVFIEEGSYTLLIDTILEKLKDGTIEKGHEENGRDEDHKETSVAGCSSSGITETSS
        E+  + GNLRIDAALDAM+PFGF PKLVRDTVK+LLSVYGGD+GWVFIEEGSYTLLIDT+LEK KDG IEK HEE+GR  D +ETS AGCSS+ + E S+
Subjt:  EKWLREGNLRIDAALDAMSPFGFPPKLVRDTVKDLLSVYGGDDGWVFIEEGSYTLLIDTILEKLKDGTIEKGHEENGRDEDHKETSVAGCSSSGITETSS

Query:  SNPGAEVTVKQSNNVLLSSYVDNEAFRITTTLPTNDSEERYWKDDSISGG---DDRCRSTLNQS-SPAHTPTISRGKPYHGWISS--DDKEDLVYLTPAP
        SNPGAE+TVK +  V +SSYVDNE FRIT T+P NDS+ERYWK++ I  G   +   RS++NQS   AHTP I R KPYHGWISS  DD+EDLV+LTPA 
Subjt:  SNPGAEVTVKQSNNVLLSSYVDNEAFRITTTLPTNDSEERYWKDDSISGG---DDRCRSTLNQS-SPAHTPTISRGKPYHGWISS--DDKEDLVYLTPAP

Query:  LPEEFAKLLMPDAQRKRKKRWDVK
        LPEEFA+LL+P AQRKRK RWDVK
Subjt:  LPEEFAKLLMPDAQRKRKKRWDVK

A0A6J1GQX1 uncharacterized protein LOC111456280 isoform X21.0e-7069.57Show/hide
Query:  EKWLREGNLRIDAALDAMSPFGFPPKLVRDTVKDLLSVYGGDDGWVFIEEGSYTLLIDTILEKLKDGTIEKGHEENGRDEDHKETSVAGCSSSGITETSS
        E+  + GNLRIDAALDAM+PFGF PKLVRDTVK+LLSVYGGD+GWVFIEEGSYTLLIDT+LEK KDG IEK HEE+GR  D +ETS AGCSS+ + E S+
Subjt:  EKWLREGNLRIDAALDAMSPFGFPPKLVRDTVKDLLSVYGGDDGWVFIEEGSYTLLIDTILEKLKDGTIEKGHEENGRDEDHKETSVAGCSSSGITETSS

Query:  SNPGAEVTVKQSNNVLLSSYVDNEAFRITTTLPTNDSEERYWKDDSISGG---DDRCRSTLNQS-SPAHTPTISRGKPYHGWISS--DDKEDLVYLTPAP
        SNPGAE+TVK +  V +SSYVDNE FRIT T+P NDS+ERYWK++ I  G   +   RS++NQS   AHTP I R KPYHGWISS  DD+EDLV+LTPA 
Subjt:  SNPGAEVTVKQSNNVLLSSYVDNEAFRITTTLPTNDSEERYWKDDSISGG---DDRCRSTLNQS-SPAHTPTISRGKPYHGWISS--DDKEDLVYLTPAP

Query:  LPEEFAK
        LPEEFA+
Subjt:  LPEEFAK

A0A6J1JR10 uncharacterized protein LOC1114875575.0e-7069.08Show/hide
Query:  EKWLREGNLRIDAALDAMSPFGFPPKLVRDTVKDLLSVYGGDDGWVFIEEGSYTLLIDTILEKLKDGTIEKGHEENGRDEDHKETSVAGCSSSGITETSS
        E+  + GNLRIDAALDAM+PFGF PKLVRDTVK+LLSVYGGD+GWVFIEEGSYTLLIDT+LEK KDG IEK HEE+GR  D +ETS AGCSS+ + E S+
Subjt:  EKWLREGNLRIDAALDAMSPFGFPPKLVRDTVKDLLSVYGGDDGWVFIEEGSYTLLIDTILEKLKDGTIEKGHEENGRDEDHKETSVAGCSSSGITETSS

Query:  SNPGAEVTVKQSNNVLLSSYVDNEAFRITTTLPTNDSEERYWKDDSISGG---DDRCRSTLNQS-SPAHTPTISRGKPYHGWISS--DDKEDLVYLTPAP
        SNPGAE+TVK +  V +SSYVDNE FRIT T+P NDS+ERYWK++ I  G   +   RS++NQS   AH P I R KPYHGWISS  DD+EDLV+LTPA 
Subjt:  SNPGAEVTVKQSNNVLLSSYVDNEAFRITTTLPTNDSEERYWKDDSISGG---DDRCRSTLNQS-SPAHTPTISRGKPYHGWISS--DDKEDLVYLTPAP

Query:  LPEEFAK
        LPEEFA+
Subjt:  LPEEFAK

SwissProt top hitse value%identityAlignment
Q946J2 Probable inactive histone-lysine N-methyltransferase SUVR15.5e-0535.71Show/hide
Query:  NLRIDAALDAMSPFGFPPKLVRDTVKDLLSVYGGDDGWVFIEEGSYTLLIDTILEKLKDGTIEKG---HEENGRDEDHKETSVA
        NLRI  A DAM   G      R  ++ LL  Y  ++ W FIEE +Y +L+D I ++    + EK     E+  ++E+ K  SVA
Subjt:  NLRIDAALDAMSPFGFPPKLVRDTVKDLLSVYGGDDGWVFIEEGSYTLLIDTILEKLKDGTIEKG---HEENGRDEDHKETSVA

Arabidopsis top hitse value%identityAlignment
AT1G04050.1 homolog of SU(var)3-9 13.9e-0635.71Show/hide
Query:  NLRIDAALDAMSPFGFPPKLVRDTVKDLLSVYGGDDGWVFIEEGSYTLLIDTILEKLKDGTIEKG---HEENGRDEDHKETSVA
        NLRI  A DAM   G      R  ++ LL  Y  ++ W FIEE +Y +L+D I ++    + EK     E+  ++E+ K  SVA
Subjt:  NLRIDAALDAMSPFGFPPKLVRDTVKDLLSVYGGDDGWVFIEEGSYTLLIDTILEKLKDGTIEKG---HEENGRDEDHKETSVA

AT1G45248.2 Nucleolar histone methyltransferase-related protein2.1e-0442.55Show/hide
Query:  GNLRIDAALDAMSPFGFPPKLVRDTVKDLLSVYGGDDGWVFIEEGSY
        G  R DAA D M  FGF   ++  ++K +L VY G+D W  IE+ +Y
Subjt:  GNLRIDAALDAMSPFGFPPKLVRDTVKDLLSVYGGDDGWVFIEEGSY

AT2G40020.1 Nucleolar histone methyltransferase-related protein6.6e-0622.75Show/hide
Query:  LRIDAALDAMSPFGFPPKLVRDTVKDLLSVYGGDDGWVFIEEGSYTLLIDTILEKLKD----------GTIEKGHEENGRDEDHKETSVAGCSSSGITET
        +R DAA D M  FGF   ++ +++K+LL VY  +D W  IE+ SY  L+   LEK ++            + + H E   +E+               + 
Subjt:  LRIDAALDAMSPFGFPPKLVRDTVKDLLSVYGGDDGWVFIEEGSYTLLIDTILEKLKD----------GTIEKGHEENGRDEDHKETSVAGCSSSGITET

Query:  SSSNPGAEVTVKQSNNVLLSSYVDNEAFRITTTLPTNDSEERY------------WKDDSISGGDDRCRSTLNQSSPAHTPTISRGKPYHGWISSDDKED
           +   E    Q++ + ++S   N+       L    S+               W  D      +       + +    P     +P     S  D ++
Subjt:  SSSNPGAEVTVKQSNNVLLSSYVDNEAFRITTTLPTNDSEERY------------WKDDSISGGDDRCRSTLNQSSPAHTPTISRGKPYHGWISSDDKED

Query:  LVYLTPAPLPEEFAKLLMP---DAQRKRKKRWD
        ++ LTP PL EE  +LL       +RK++ RWD
Subjt:  LVYLTPAPLPEEFAKLLMP---DAQRKRKKRWD

AT2G40020.2 Nucleolar histone methyltransferase-related protein2.3e-0635.8Show/hide
Query:  REGNLRIDAALDAMSPFGFPPKLVRDTVKDLLSVYGGDDGWVFIEEGSYTLLIDTILEKLKDGTIEKGH-EENGRDEDHKE
        + G +R DAA D M  FGF   ++ +++K+LL VY  +D W  IE+ SY  L+   LEK ++   +    + N   E+H E
Subjt:  REGNLRIDAALDAMSPFGFPPKLVRDTVKDLLSVYGGDDGWVFIEEGSYTLLIDTILEKLKDGTIEKGH-EENGRDEDHKE

AT2G40020.3 Nucleolar histone methyltransferase-related protein1.3e-0622.78Show/hide
Query:  REGNLRIDAALDAMSPFGFPPKLVRDTVKDLLSVYGGDDGWVFIEEGSYTLLIDTILEKLKD----------GTIEKGHEENGRDEDHKETSVAGCSSSG
        + G +R DAA D M  FGF   ++ +++K+LL VY  +D W  IE+ SY  L+   LEK ++            + + H E   +E+             
Subjt:  REGNLRIDAALDAMSPFGFPPKLVRDTVKDLLSVYGGDDGWVFIEEGSYTLLIDTILEKLKD----------GTIEKGHEENGRDEDHKETSVAGCSSSG

Query:  ITETSSSNPGAEVTVKQSNNVLLSSYVDNEAFRITTTLPTNDSEERY------------WKDDSISGGDDRCRSTLNQSSPAHTPTISRGKPYHGWISSD
          +    +   E    Q++ + ++S   N+       L    S+               W  D      +       + +    P     +P     S  
Subjt:  ITETSSSNPGAEVTVKQSNNVLLSSYVDNEAFRITTTLPTNDSEERY------------WKDDSISGGDDRCRSTLNQSSPAHTPTISRGKPYHGWISSD

Query:  DKEDLVYLTPAPLPEEFAKLLMP---DAQRKRKKRWD
        D ++++ LTP PL EE  +LL       +RK++ RWD
Subjt:  DKEDLVYLTPAPLPEEFAKLLMP---DAQRKRKKRWD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGAAGCTTTCTTTCTGAGTTTCTAGCTTGTTCCACGACTCTTTTCCGGGTTCTGAGTGTTGAGAAATGGCTCCGAGAGGGCAACTTACGAATTGATGCAGCACTCGA
CGCTATGAGCCCTTTCGGATTTCCTCCGAAGTTGGTTCGCGACACGGTGAAGGACCTCCTCAGTGTCTATGGAGGAGACGATGGATGGGTATTCATTGAAGAAGGCTCTT
ATACTCTCTTGATCGATACCATTCTCGAGAAACTGAAAGATGGTACAATAGAGAAGGGTCATGAAGAGAATGGAAGAGATGAAGATCACAAGGAGACCTCAGTAGCTGGC
TGTTCATCAAGTGGTATCACTGAAACTTCCTCATCTAATCCTGGGGCTGAGGTTACTGTGAAGCAGAGTAATAATGTTTTACTTTCTTCATATGTGGACAATGAAGCTTT
CAGGATCACGACCACATTGCCTACAAATGATTCAGAAGAAAGATACTGGAAGGACGATAGCATTTCTGGAGGCGACGACCGTTGTAGGAGTACTCTTAACCAGTCTTCGC
CAGCACATACCCCCACAATTAGTAGGGGAAAACCTTATCATGGCTGGATCTCTAGCGACGACAAGGAAGATCTCGTGTACTTAACACCAGCCCCATTGCCTGAAGAGTTC
GCCAAGTTACTCATGCCTGATGCACAGAGAAAACGCAAGAAGCGTTGGGATGTGAAGCTTGCATAA
mRNA sequenceShow/hide mRNA sequence
ATGAGAAGCTTTCTTTCTGAGTTTCTAGCTTGTTCCACGACTCTTTTCCGGGTTCTGAGTGTTGAGAAATGGCTCCGAGAGGGCAACTTACGAATTGATGCAGCACTCGA
CGCTATGAGCCCTTTCGGATTTCCTCCGAAGTTGGTTCGCGACACGGTGAAGGACCTCCTCAGTGTCTATGGAGGAGACGATGGATGGGTATTCATTGAAGAAGGCTCTT
ATACTCTCTTGATCGATACCATTCTCGAGAAACTGAAAGATGGTACAATAGAGAAGGGTCATGAAGAGAATGGAAGAGATGAAGATCACAAGGAGACCTCAGTAGCTGGC
TGTTCATCAAGTGGTATCACTGAAACTTCCTCATCTAATCCTGGGGCTGAGGTTACTGTGAAGCAGAGTAATAATGTTTTACTTTCTTCATATGTGGACAATGAAGCTTT
CAGGATCACGACCACATTGCCTACAAATGATTCAGAAGAAAGATACTGGAAGGACGATAGCATTTCTGGAGGCGACGACCGTTGTAGGAGTACTCTTAACCAGTCTTCGC
CAGCACATACCCCCACAATTAGTAGGGGAAAACCTTATCATGGCTGGATCTCTAGCGACGACAAGGAAGATCTCGTGTACTTAACACCAGCCCCATTGCCTGAAGAGTTC
GCCAAGTTACTCATGCCTGATGCACAGAGAAAACGCAAGAAGCGTTGGGATGTGAAGCTTGCATAA
Protein sequenceShow/hide protein sequence
MRSFLSEFLACSTTLFRVLSVEKWLREGNLRIDAALDAMSPFGFPPKLVRDTVKDLLSVYGGDDGWVFIEEGSYTLLIDTILEKLKDGTIEKGHEENGRDEDHKETSVAG
CSSSGITETSSSNPGAEVTVKQSNNVLLSSYVDNEAFRITTTLPTNDSEERYWKDDSISGGDDRCRSTLNQSSPAHTPTISRGKPYHGWISSDDKEDLVYLTPAPLPEEF
AKLLMPDAQRKRKKRWDVKLA