; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10018424 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10018424
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionWIYLD domain-containing protein
Genome locationChr04:4032173..4035865
RNA-Seq ExpressionHG10018424
SyntenyHG10018424
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0034968 - histone lysine methylation (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0031969 - chloroplast membrane (cellular component)
GO:0008237 - metallopeptidase activity (molecular function)
GO:0018024 - histone-lysine N-methyltransferase activity (molecular function)
InterPro domainsIPR018848 - WIYLD domain
IPR043017 - WIYLD domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK24425.1 ubiquitin-binding WIYLD domain protein [Cucumis melo var. makuwa]1.6e-8080.69Show/hide
Query:  RVNLRIDAALDAMKPFGFPLKLVRDTVKELLSVYGGDDGWVFIEEGSYTLLIDTLLEKQNKGAIELVHDNERAKDHQETSVASCSSSAIGET--SETTRF
        R NLRIDAALDAMK FGFP KLVRDTVKELL VYGGDDGWVFIEEGSYTLLIDTLLEKQN+GAIE+VHDNER  DHQ TSVA CS SAI  T  +E    
Subjt:  RVNLRIDAALDAMKPFGFPLKLVRDTVKELLSVYGGDDGWVFIEEGSYTLLIDTLLEKQNKGAIELVHDNERAKDHQETSVASCSSSAIGET--SETTRF

Query:  TATLPANDSDTLFPGDESYWKDDKASIDDDHFRRSTLNQSLPAYTPKIRRRKPYHGWIGKDDNEEDLVYLTPDHLPEEFAKLLGSHPLKKRKKRWDVEPV
        TATLPAN+ DTLFPGDESYW ++KAS+DDDHF RST NQSLPAYTPKIRRRK YHGWIGKD+ EEDLVYLTPDHLPEE AKLL    LKKRKKRWDVE  
Subjt:  TATLPANDSDTLFPGDESYWKDDKASIDDDHFRRSTLNQSLPAYTPKIRRRKPYHGWIGKDDNEEDLVYLTPDHLPEEFAKLLGSHPLKKRKKRWDVEPV

Query:  GS
         S
Subjt:  GS

XP_008441447.1 PREDICTED: uncharacterized protein LOC103485563 isoform X1 [Cucumis melo]1.6e-8080.69Show/hide
Query:  RVNLRIDAALDAMKPFGFPLKLVRDTVKELLSVYGGDDGWVFIEEGSYTLLIDTLLEKQNKGAIELVHDNERAKDHQETSVASCSSSAIGET--SETTRF
        R NLRIDAALDAMK FGFP KLVRDTVKELL VYGGDDGWVFIEEGSYTLLIDTLLEKQN+GAIE+VHDNER  DHQ TSVA CS SAI  T  +E    
Subjt:  RVNLRIDAALDAMKPFGFPLKLVRDTVKELLSVYGGDDGWVFIEEGSYTLLIDTLLEKQNKGAIELVHDNERAKDHQETSVASCSSSAIGET--SETTRF

Query:  TATLPANDSDTLFPGDESYWKDDKASIDDDHFRRSTLNQSLPAYTPKIRRRKPYHGWIGKDDNEEDLVYLTPDHLPEEFAKLLGSHPLKKRKKRWDVEPV
        TATLPAN+ DTLFPGDESYW ++KAS+DDDHF RST NQSLPAYTPKIRRRK YHGWIGKD+ EEDLVYLTPDHLPEE AKLL    LKKRKKRWDVE  
Subjt:  TATLPANDSDTLFPGDESYWKDDKASIDDDHFRRSTLNQSLPAYTPKIRRRKPYHGWIGKDDNEEDLVYLTPDHLPEEFAKLLGSHPLKKRKKRWDVEPV

Query:  GS
         S
Subjt:  GS

XP_038884313.1 uncharacterized protein LOC120075188 isoform X1 [Benincasa hispida]1.0e-9084.13Show/hide
Query:  RVNLRIDAALDAMKPFGFPLKLVRDTVKELLSVYGGDDGWVFIEEGSYTLLIDTLLEKQNKGAIELVHDNERAKDHQETSVASCSSSAIGE--TSETTRF
        RVNLRIDAALDAMKPFGFPLKLVRDTVKELLSVYGGDDGWVFIEEGSYTLLIDTLLEKQ +GAIELVHDNERAKDHQETS+ASCSSSAI E  ++E T  
Subjt:  RVNLRIDAALDAMKPFGFPLKLVRDTVKELLSVYGGDDGWVFIEEGSYTLLIDTLLEKQNKGAIELVHDNERAKDHQETSVASCSSSAIGE--TSETTRF

Query:  TATLPANDS------DTLFPGDESYWKDDKASIDDDHFRRSTLNQSLPAYTPKIRRRKPYHGWIGKDDNEEDLVYLTPDHLPEEFAKLLGSHPLKKRKKR
        TATLPANDS      DTLFPGDESYWK DKAS+D DH  RST NQSLPAYTPKIRRRKPYHGWIG DD EEDLVYLTPDHLPEEFAKLL +   +KRKKR
Subjt:  TATLPANDS------DTLFPGDESYWKDDKASIDDDHFRRSTLNQSLPAYTPKIRRRKPYHGWIGKDDNEEDLVYLTPDHLPEEFAKLLGSHPLKKRKKR

Query:  WDVEPVGS
        WDVEP+ S
Subjt:  WDVEPVGS

XP_038884315.1 uncharacterized protein LOC120075188 isoform X2 [Benincasa hispida]1.1e-9286.63Show/hide
Query:  RVNLRIDAALDAMKPFGFPLKLVRDTVKELLSVYGGDDGWVFIEEGSYTLLIDTLLEKQNKGAIELVHDNERAKDHQETSVASCSSSAIGE--TSETTRF
        RVNLRIDAALDAMKPFGFPLKLVRDTVKELLSVYGGDDGWVFIEEGSYTLLIDTLLEKQ +GAIELVHDNERAKDHQETS+ASCSSSAI E  ++E T  
Subjt:  RVNLRIDAALDAMKPFGFPLKLVRDTVKELLSVYGGDDGWVFIEEGSYTLLIDTLLEKQNKGAIELVHDNERAKDHQETSVASCSSSAIGE--TSETTRF

Query:  TATLPANDSDTLFPGDESYWKDDKASIDDDHFRRSTLNQSLPAYTPKIRRRKPYHGWIGKDDNEEDLVYLTPDHLPEEFAKLLGSHPLKKRKKRWDVEPV
        TATLPANDSDTLFPGDESYWK DKAS+D DH  RST NQSLPAYTPKIRRRKPYHGWIG DD EEDLVYLTPDHLPEEFAKLL +   +KRKKRWDVEP+
Subjt:  TATLPANDSDTLFPGDESYWKDDKASIDDDHFRRSTLNQSLPAYTPKIRRRKPYHGWIGKDDNEEDLVYLTPDHLPEEFAKLLGSHPLKKRKKRWDVEPV

Query:  GS
         S
Subjt:  GS

XP_038884316.1 uncharacterized protein LOC120075188 isoform X3 [Benincasa hispida]5.4e-8483.16Show/hide
Query:  MKPFGFPLKLVRDTVKELLSVYGGDDGWVFIEEGSYTLLIDTLLEKQNKGAIELVHDNERAKDHQETSVASCSSSAIGE--TSETTRFTATLPANDS---
        MKPFGFPLKLVRDTVKELLSVYGGDDGWVFIEEGSYTLLIDTLLEKQ +GAIELVHDNERAKDHQETS+ASCSSSAI E  ++E T  TATLPANDS   
Subjt:  MKPFGFPLKLVRDTVKELLSVYGGDDGWVFIEEGSYTLLIDTLLEKQNKGAIELVHDNERAKDHQETSVASCSSSAIGE--TSETTRFTATLPANDS---

Query:  ---DTLFPGDESYWKDDKASIDDDHFRRSTLNQSLPAYTPKIRRRKPYHGWIGKDDNEEDLVYLTPDHLPEEFAKLLGSHPLKKRKKRWDVEPVGS
           DTLFPGDESYWK DKAS+D DH  RST NQSLPAYTPKIRRRKPYHGWIG DD EEDLVYLTPDHLPEEFAKLL +   +KRKKRWDVEP+ S
Subjt:  ---DTLFPGDESYWKDDKASIDDDHFRRSTLNQSLPAYTPKIRRRKPYHGWIGKDDNEEDLVYLTPDHLPEEFAKLLGSHPLKKRKKRWDVEPVGS

TrEMBL top hitse value%identityAlignment
A0A0A0K7Q2 WIYLD domain-containing protein3.0e-8079.4Show/hide
Query:  RVNLRIDAALDAMKPFGFPLKLVRDTVKELLSVYGGDDGWVFIEEGSYTLLIDTLLEKQNKGAIELVHDNERAKDHQETSVASCSSSA--IGETSETTRF
        R NLRIDAALDAMK FGFP KLVRDTVKELL VYGGDDGWVFIEEGSY LLID+LLEKQN+GAIE+VHDNER  DHQ TSVA CS SA  +  ++E    
Subjt:  RVNLRIDAALDAMKPFGFPLKLVRDTVKELLSVYGGDDGWVFIEEGSYTLLIDTLLEKQNKGAIELVHDNERAKDHQETSVASCSSSA--IGETSETTRF

Query:  TATLPANDSDTLFPGDESYWKDDKASIDDDHFRRSTLNQSLPAYTPKIRRRKPYHGWIGKDDNEEDLVYLTPDHLPEEFAKLLGSHPLKKRKKRWDVEP
        TA LPAND DTLFPGDESYW ++K S+DDDHF RST NQSLPAYTPKIRRRK YHGW+GKDD EEDLVYLTPDHLPEE AKLL +  LKKRKKRWDVEP
Subjt:  TATLPANDSDTLFPGDESYWKDDKASIDDDHFRRSTLNQSLPAYTPKIRRRKPYHGWIGKDDNEEDLVYLTPDHLPEEFAKLLGSHPLKKRKKRWDVEP

A0A1S3B3F8 uncharacterized protein LOC103485563 isoform X17.8e-8180.69Show/hide
Query:  RVNLRIDAALDAMKPFGFPLKLVRDTVKELLSVYGGDDGWVFIEEGSYTLLIDTLLEKQNKGAIELVHDNERAKDHQETSVASCSSSAIGET--SETTRF
        R NLRIDAALDAMK FGFP KLVRDTVKELL VYGGDDGWVFIEEGSYTLLIDTLLEKQN+GAIE+VHDNER  DHQ TSVA CS SAI  T  +E    
Subjt:  RVNLRIDAALDAMKPFGFPLKLVRDTVKELLSVYGGDDGWVFIEEGSYTLLIDTLLEKQNKGAIELVHDNERAKDHQETSVASCSSSAIGET--SETTRF

Query:  TATLPANDSDTLFPGDESYWKDDKASIDDDHFRRSTLNQSLPAYTPKIRRRKPYHGWIGKDDNEEDLVYLTPDHLPEEFAKLLGSHPLKKRKKRWDVEPV
        TATLPAN+ DTLFPGDESYW ++KAS+DDDHF RST NQSLPAYTPKIRRRK YHGWIGKD+ EEDLVYLTPDHLPEE AKLL    LKKRKKRWDVE  
Subjt:  TATLPANDSDTLFPGDESYWKDDKASIDDDHFRRSTLNQSLPAYTPKIRRRKPYHGWIGKDDNEEDLVYLTPDHLPEEFAKLLGSHPLKKRKKRWDVEPV

Query:  GS
         S
Subjt:  GS

A0A1S3B448 uncharacterized protein LOC103485563 isoform X23.2e-7479.89Show/hide
Query:  KPFGFPLKLVRDTVKELLSVYGGDDGWVFIEEGSYTLLIDTLLEKQNKGAIELVHDNERAKDHQETSVASCSSSAIGET--SETTRFTATLPANDSDTLF
        K FGFP KLVRDTVKELL VYGGDDGWVFIEEGSYTLLIDTLLEKQN+GAIE+VHDNER  DHQ TSVA CS SAI  T  +E    TATLPAN+ DTLF
Subjt:  KPFGFPLKLVRDTVKELLSVYGGDDGWVFIEEGSYTLLIDTLLEKQNKGAIELVHDNERAKDHQETSVASCSSSAIGET--SETTRFTATLPANDSDTLF

Query:  PGDESYWKDDKASIDDDHFRRSTLNQSLPAYTPKIRRRKPYHGWIGKDDNEEDLVYLTPDHLPEEFAKLLGSHPLKKRKKRWDVEPVGS
        PGDESYW ++KAS+DDDHF RST NQSLPAYTPKIRRRK YHGWIGKD+ EEDLVYLTPDHLPEE AKLL    LKKRKKRWDVE   S
Subjt:  PGDESYWKDDKASIDDDHFRRSTLNQSLPAYTPKIRRRKPYHGWIGKDDNEEDLVYLTPDHLPEEFAKLLGSHPLKKRKKRWDVEPVGS

A0A1S3B481 uncharacterized protein LOC103485563 isoform X39.3e-7480.21Show/hide
Query:  FGFPLKLVRDTVKELLSVYGGDDGWVFIEEGSYTLLIDTLLEKQNKGAIELVHDNERAKDHQETSVASCSSSAIGET--SETTRFTATLPANDSDTLFPG
        FGFP KLVRDTVKELL VYGGDDGWVFIEEGSYTLLIDTLLEKQN+GAIE+VHDNER  DHQ TSVA CS SAI  T  +E    TATLPAN+ DTLFPG
Subjt:  FGFPLKLVRDTVKELLSVYGGDDGWVFIEEGSYTLLIDTLLEKQNKGAIELVHDNERAKDHQETSVASCSSSAIGET--SETTRFTATLPANDSDTLFPG

Query:  DESYWKDDKASIDDDHFRRSTLNQSLPAYTPKIRRRKPYHGWIGKDDNEEDLVYLTPDHLPEEFAKLLGSHPLKKRKKRWDVEPVGS
        DESYW ++KAS+DDDHF RST NQSLPAYTPKIRRRK YHGWIGKD+ EEDLVYLTPDHLPEE AKLL    LKKRKKRWDVE   S
Subjt:  DESYWKDDKASIDDDHFRRSTLNQSLPAYTPKIRRRKPYHGWIGKDDNEEDLVYLTPDHLPEEFAKLLGSHPLKKRKKRWDVEPVGS

A0A5D3DLD6 Ubiquitin-binding WIYLD domain protein7.8e-8180.69Show/hide
Query:  RVNLRIDAALDAMKPFGFPLKLVRDTVKELLSVYGGDDGWVFIEEGSYTLLIDTLLEKQNKGAIELVHDNERAKDHQETSVASCSSSAIGET--SETTRF
        R NLRIDAALDAMK FGFP KLVRDTVKELL VYGGDDGWVFIEEGSYTLLIDTLLEKQN+GAIE+VHDNER  DHQ TSVA CS SAI  T  +E    
Subjt:  RVNLRIDAALDAMKPFGFPLKLVRDTVKELLSVYGGDDGWVFIEEGSYTLLIDTLLEKQNKGAIELVHDNERAKDHQETSVASCSSSAIGET--SETTRF

Query:  TATLPANDSDTLFPGDESYWKDDKASIDDDHFRRSTLNQSLPAYTPKIRRRKPYHGWIGKDDNEEDLVYLTPDHLPEEFAKLLGSHPLKKRKKRWDVEPV
        TATLPAN+ DTLFPGDESYW ++KAS+DDDHF RST NQSLPAYTPKIRRRK YHGWIGKD+ EEDLVYLTPDHLPEE AKLL    LKKRKKRWDVE  
Subjt:  TATLPANDSDTLFPGDESYWKDDKASIDDDHFRRSTLNQSLPAYTPKIRRRKPYHGWIGKDDNEEDLVYLTPDHLPEEFAKLLGSHPLKKRKKRWDVEPV

Query:  GS
         S
Subjt:  GS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G45248.2 Nucleolar histone methyltransferase-related protein4.8e-0644.83Show/hide
Query:  RIDAALDAMKPFGFPLKLVRDTVKELLSVYGGDDGWVFIEEGSYTL-LIDTLLEKQNK
        R DAA D MK FGF   ++  ++K++L VY G+D W  IE+ +Y + LI  L  K+NK
Subjt:  RIDAALDAMKPFGFPLKLVRDTVKELLSVYGGDDGWVFIEEGSYTL-LIDTLLEKQNK

AT1G45248.3 Nucleolar histone methyltransferase-related protein4.8e-0644.83Show/hide
Query:  RIDAALDAMKPFGFPLKLVRDTVKELLSVYGGDDGWVFIEEGSYTL-LIDTLLEKQNK
        R DAA D MK FGF   ++  ++K++L VY G+D W  IE+ +Y + LI  L  K+NK
Subjt:  RIDAALDAMKPFGFPLKLVRDTVKELLSVYGGDDGWVFIEEGSYTL-LIDTLLEKQNK

AT2G40020.1 Nucleolar histone methyltransferase-related protein3.2e-1026.61Show/hide
Query:  LRIDAALDAMKPFGFPLKLVRDTVKELLSVYGGDDGWVFIEEGSYTLLIDTLLEKQNKGAIELV----------HDNERAKDHQETSVA-----------
        +R DAA D M+ FGF   ++ +++KELL VY  +D W  IE+ SY  L+   LEKQ +   +L           H+ E A++ Q   +A           
Subjt:  LRIDAALDAMKPFGFPLKLVRDTVKELLSVYGGDDGWVFIEEGSYTLLIDTLLEKQNKGAIELV----------HDNERAKDHQETSVA-----------

Query:  ---------------SCSSSAIGETSETTRFTATLPANDSDTLFPGDESY---WKDDKASIDDDHFRRSTLNQSLPAYTPKIRRRKPYHGWIGKDDNEED
                       S +S A+ + S          ++ + +   G E+    W  D+   D +        ++ P   PK +  +P     G DD   +
Subjt:  ---------------SCSSSAIGETSETTRFTATLPANDSDTLFPGDESY---WKDDKASIDDDHFRRSTLNQSLPAYTPKIRRRKPYHGWIGKDDNEED

Query:  LVYLTPDHLPEEFAKLLGSHPLKKRKK---RWD
        ++ LTP+ L EE  +LL     +KR+K   RWD
Subjt:  LVYLTPDHLPEEFAKLLGSHPLKKRKK---RWD

AT2G40020.2 Nucleolar histone methyltransferase-related protein1.9e-0740.26Show/hide
Query:  LRIDAALDAMKPFGFPLKLVRDTVKELLSVYGGDDGWVFIEEGSYTLLIDTLLEKQNKGAIEL--VHDNERAKDHQE
        +R DAA D M+ FGF   ++ +++KELL VY  +D W  IE+ SY  L+   LEKQ +   +L  V  N+  ++H E
Subjt:  LRIDAALDAMKPFGFPLKLVRDTVKELLSVYGGDDGWVFIEEGSYTLLIDTLLEKQNKGAIEL--VHDNERAKDHQE

AT2G40020.3 Nucleolar histone methyltransferase-related protein3.2e-1026.61Show/hide
Query:  LRIDAALDAMKPFGFPLKLVRDTVKELLSVYGGDDGWVFIEEGSYTLLIDTLLEKQNKGAIELV----------HDNERAKDHQETSVA-----------
        +R DAA D M+ FGF   ++ +++KELL VY  +D W  IE+ SY  L+   LEKQ +   +L           H+ E A++ Q   +A           
Subjt:  LRIDAALDAMKPFGFPLKLVRDTVKELLSVYGGDDGWVFIEEGSYTLLIDTLLEKQNKGAIELV----------HDNERAKDHQETSVA-----------

Query:  ---------------SCSSSAIGETSETTRFTATLPANDSDTLFPGDESY---WKDDKASIDDDHFRRSTLNQSLPAYTPKIRRRKPYHGWIGKDDNEED
                       S +S A+ + S          ++ + +   G E+    W  D+   D +        ++ P   PK +  +P     G DD   +
Subjt:  ---------------SCSSSAIGETSETTRFTATLPANDSDTLFPGDESY---WKDDKASIDDDHFRRSTLNQSLPAYTPKIRRRKPYHGWIGKDDNEED

Query:  LVYLTPDHLPEEFAKLLGSHPLKKRKK---RWD
        ++ LTP+ L EE  +LL     +KR+K   RWD
Subjt:  LVYLTPDHLPEEFAKLLGSHPLKKRKK---RWD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTCCCAACATTAAGGGGTGAAAGGAACCGGTCGAACCGGAACCGTCGGTTCGGCTGCGCTCACGCTTTTCGATTCGTAGGGTTTCTCGTCGTTCGTCTTTTCATTTC
CGTTTCGTCAAACCCAAATGTAGTTTGCACTGGTCGTTCGTCTCTCTCTCTCAAGTCTTTCGTGGCCTTACTCACTCTCAGCCACCGCGTTGTCGACATTGTTGAAGCTA
GGAGCTTCGGCCTCTTCTCTGCTAACATCGACAGTATCTGCGTCGACTTCGGAAAAGTAACGGTCGTCGGAACGAGGGTTAACTTACGAATTGATGCTGCGCTCGATGCT
ATGAAACCATTTGGATTTCCTCTGAAGTTGGTTCGTGACACGGTCAAGGAGCTCCTTAGTGTCTATGGAGGAGACGATGGATGGGTATTCATTGAAGAAGGCTCTTATAC
TCTCTTGATCGATACCCTTCTCGAGAAACAGAACAAGGGTGCAATAGAGTTGGTTCATGATAATGAAAGAGCTAAAGATCATCAGGAGACCTCAGTAGCTAGCTGTTCAT
CGAGTGCTATCGGTGAAACTAGTGAAACTACAAGGTTCACAGCCACATTGCCTGCAAATGATTCAGATACATTATTTCCTGGAGATGAAAGTTATTGGAAGGACGATAAA
GCTTCTATTGATGATGACCATTTTAGGAGGAGTACTCTTAACCAGTCTTTACCTGCATATACCCCCAAAATACGAAGGCGAAAACCTTATCATGGCTGGATTGGTAAAGA
CGACAATGAGGAGGATCTTGTGTACTTAACCCCTGATCATTTGCCTGAAGAGTTTGCCAAGTTACTCGGGTCTCATCCACTAAAAAAAAGAAAGAAGCGTTGGGATGTGG
AACCTGTAGGATCATGA
mRNA sequenceShow/hide mRNA sequence
ATGTTCCCAACATTAAGGGGTGAAAGGAACCGGTCGAACCGGAACCGTCGGTTCGGCTGCGCTCACGCTTTTCGATTCGTAGGGTTTCTCGTCGTTCGTCTTTTCATTTC
CGTTTCGTCAAACCCAAATGTAGTTTGCACTGGTCGTTCGTCTCTCTCTCTCAAGTCTTTCGTGGCCTTACTCACTCTCAGCCACCGCGTTGTCGACATTGTTGAAGCTA
GGAGCTTCGGCCTCTTCTCTGCTAACATCGACAGTATCTGCGTCGACTTCGGAAAAGTAACGGTCGTCGGAACGAGGGTTAACTTACGAATTGATGCTGCGCTCGATGCT
ATGAAACCATTTGGATTTCCTCTGAAGTTGGTTCGTGACACGGTCAAGGAGCTCCTTAGTGTCTATGGAGGAGACGATGGATGGGTATTCATTGAAGAAGGCTCTTATAC
TCTCTTGATCGATACCCTTCTCGAGAAACAGAACAAGGGTGCAATAGAGTTGGTTCATGATAATGAAAGAGCTAAAGATCATCAGGAGACCTCAGTAGCTAGCTGTTCAT
CGAGTGCTATCGGTGAAACTAGTGAAACTACAAGGTTCACAGCCACATTGCCTGCAAATGATTCAGATACATTATTTCCTGGAGATGAAAGTTATTGGAAGGACGATAAA
GCTTCTATTGATGATGACCATTTTAGGAGGAGTACTCTTAACCAGTCTTTACCTGCATATACCCCCAAAATACGAAGGCGAAAACCTTATCATGGCTGGATTGGTAAAGA
CGACAATGAGGAGGATCTTGTGTACTTAACCCCTGATCATTTGCCTGAAGAGTTTGCCAAGTTACTCGGGTCTCATCCACTAAAAAAAAGAAAGAAGCGTTGGGATGTGG
AACCTGTAGGATCATGA
Protein sequenceShow/hide protein sequence
MFPTLRGERNRSNRNRRFGCAHAFRFVGFLVVRLFISVSSNPNVVCTGRSSLSLKSFVALLTLSHRVVDIVEARSFGLFSANIDSICVDFGKVTVVGTRVNLRIDAALDA
MKPFGFPLKLVRDTVKELLSVYGGDDGWVFIEEGSYTLLIDTLLEKQNKGAIELVHDNERAKDHQETSVASCSSSAIGETSETTRFTATLPANDSDTLFPGDESYWKDDK
ASIDDDHFRRSTLNQSLPAYTPKIRRRKPYHGWIGKDDNEEDLVYLTPDHLPEEFAKLLGSHPLKKRKKRWDVEPVGS