; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc11G06710 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc11G06710
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionWIYLD domain-containing protein
Genome locationClcChr11:6581084..6583770
RNA-Seq ExpressionClc11G06710
SyntenyClc11G06710
Gene Ontology termsGO:0019538 - protein metabolic process (biological process)
GO:0016020 - membrane (cellular component)
GO:0018024 - histone-lysine N-methyltransferase activity (molecular function)
InterPro domainsIPR018848 - WIYLD domain
IPR043017 - WIYLD domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004138569.1 uncharacterized protein LOC101218050 isoform X1 [Cucumis sativus]9.2e-7464.52Show/hide
Query:  MAPR-----RVNLRIDAALDAMKPFGFPLKLVRDTVKELLSVGFSDFLSVIVIVIVFVCASSVAVVCRNVYGGDEGWVFIEEGSYTLLIDTLLEKQKEAA
        MAPR     R NLRIDAALDAMK FGFP KLVRDTVKELL                            +VYGGD+GWVFIEEGSY LLID+LLEKQ E A
Subjt:  MAPR-----RVNLRIDAALDAMKPFGFPLKLVRDTVKELLSVGFSDFLSVIVIVIVFVCASSVAVVCRNVYGGDEGWVFIEEGSYTLLIDTLLEKQKEAA

Query:  IELVHDNERAKDHQETSVAGCSSSAIGETSLYEATRITATLPANDSNTLFPGDESYWKGDKASVDDEHFRSTFNQSLPAYSPIIRQPLPAYTPKIRRRKP
        IE+VHDNER  DHQ TSVAGCS SA   T   EA   TA LPAND +TLFPGDESYW  +K SVDD+HFRSTFNQS           LPAYTPKIRRRK 
Subjt:  IELVHDNERAKDHQETSVAGCSSSAIGETSLYEATRITATLPANDSNTLFPGDESYWKGDKASVDDEHFRSTFNQSLPAYSPIIRQPLPAYTPKIRRRKP

Query:  YHGWISNNDKEEDLVYLTPDSLPEEFAELLMSHALRKRKKRWDVEPEE
        YHGW+  +DKEEDLVYLTPD LPEE A+LL++ AL+KRKKRWDVEP E
Subjt:  YHGWISNNDKEEDLVYLTPDSLPEEFAELLMSHALRKRKKRWDVEPEE

XP_008441447.1 PREDICTED: uncharacterized protein LOC103485563 isoform X1 [Cucumis melo]7.0e-7465.73Show/hide
Query:  MAPR-----RVNLRIDAALDAMKPFGFPLKLVRDTVKELLSVGFSDFLSVIVIVIVFVCASSVAVVCRNVYGGDEGWVFIEEGSYTLLIDTLLEKQKEAA
        MAPR     R NLRIDAALDAMK FGFP KLVRDTVKELL                            +VYGGD+GWVFIEEGSYTLLIDTLLEKQ E A
Subjt:  MAPR-----RVNLRIDAALDAMKPFGFPLKLVRDTVKELLSVGFSDFLSVIVIVIVFVCASSVAVVCRNVYGGDEGWVFIEEGSYTLLIDTLLEKQKEAA

Query:  IELVHDNERAKDHQETSVAGCSSSAIGETSLYEATRITATLPANDSNTLFPGDESYWKGDKASVDDEHFRSTFNQSLPAYSPIIRQPLPAYTPKIRRRKP
        IE+VHDNER  DHQ TSVAGCS SAI  T   EA   TATLPAN+ +TLFPGDESYW  +KASVDD+HFRSTFNQS           LPAYTPKIRRRK 
Subjt:  IELVHDNERAKDHQETSVAGCSSSAIGETSLYEATRITATLPANDSNTLFPGDESYWKGDKASVDDEHFRSTFNQSLPAYSPIIRQPLPAYTPKIRRRKP

Query:  YHGWISNNDKEEDLVYLTPDSLPEEFAELLMSHALRKRKKRWDVEPEE
        YHGWI  ++KEEDLVYLTPD LPEE A+LL+  AL+KRKKRWDVE  E
Subjt:  YHGWISNNDKEEDLVYLTPDSLPEEFAELLMSHALRKRKKRWDVEPEE

XP_038884313.1 uncharacterized protein LOC120075188 isoform X1 [Benincasa hispida]1.5e-8470.63Show/hide
Query:  MAPR-----RVNLRIDAALDAMKPFGFPLKLVRDTVKELLSVGFSDFLSVIVIVIVFVCASSVAVVCRNVYGGDEGWVFIEEGSYTLLIDTLLEKQKEAA
        MAPR     RVNLRIDAALDAMKPFGFPLKLVRDTVKELLS                            VYGGD+GWVFIEEGSYTLLIDTLLEKQKE A
Subjt:  MAPR-----RVNLRIDAALDAMKPFGFPLKLVRDTVKELLSVGFSDFLSVIVIVIVFVCASSVAVVCRNVYGGDEGWVFIEEGSYTLLIDTLLEKQKEAA

Query:  IELVHDNERAKDHQETSVAGCSSSAIGETSLYEATRITATLPANDS------NTLFPGDESYWKGDKASVDDEHFRSTFNQSLPAYSPIIRQPLPAYTPK
        IELVHDNERAKDHQETS+A CSSSAI E    E T ITATLPANDS      +TLFPGDESYWK DKASVD +H RSTFNQS           LPAYTPK
Subjt:  IELVHDNERAKDHQETSVAGCSSSAIGETSLYEATRITATLPANDS------NTLFPGDESYWKGDKASVDDEHFRSTFNQSLPAYSPIIRQPLPAYTPK

Query:  IRRRKPYHGWISNNDKEEDLVYLTPDSLPEEFAELLMSHALRKRKKRWDVEP
        IRRRKPYHGWI N+DKEEDLVYLTPD LPEEFA+LL  +A RKRKKRWDVEP
Subjt:  IRRRKPYHGWISNNDKEEDLVYLTPDSLPEEFAELLMSHALRKRKKRWDVEP

XP_038884315.1 uncharacterized protein LOC120075188 isoform X2 [Benincasa hispida]1.6e-8672.36Show/hide
Query:  MAPR-----RVNLRIDAALDAMKPFGFPLKLVRDTVKELLSVGFSDFLSVIVIVIVFVCASSVAVVCRNVYGGDEGWVFIEEGSYTLLIDTLLEKQKEAA
        MAPR     RVNLRIDAALDAMKPFGFPLKLVRDTVKELLS                            VYGGD+GWVFIEEGSYTLLIDTLLEKQKE A
Subjt:  MAPR-----RVNLRIDAALDAMKPFGFPLKLVRDTVKELLSVGFSDFLSVIVIVIVFVCASSVAVVCRNVYGGDEGWVFIEEGSYTLLIDTLLEKQKEAA

Query:  IELVHDNERAKDHQETSVAGCSSSAIGETSLYEATRITATLPANDSNTLFPGDESYWKGDKASVDDEHFRSTFNQSLPAYSPIIRQPLPAYTPKIRRRKP
        IELVHDNERAKDHQETS+A CSSSAI E    E T ITATLPANDS+TLFPGDESYWK DKASVD +H RSTFNQS           LPAYTPKIRRRKP
Subjt:  IELVHDNERAKDHQETSVAGCSSSAIGETSLYEATRITATLPANDSNTLFPGDESYWKGDKASVDDEHFRSTFNQSLPAYSPIIRQPLPAYTPKIRRRKP

Query:  YHGWISNNDKEEDLVYLTPDSLPEEFAELLMSHALRKRKKRWDVEP
        YHGWI N+DKEEDLVYLTPD LPEEFA+LL  +A RKRKKRWDVEP
Subjt:  YHGWISNNDKEEDLVYLTPDSLPEEFAELLMSHALRKRKKRWDVEP

XP_038884316.1 uncharacterized protein LOC120075188 isoform X3 [Benincasa hispida]2.3e-7770.13Show/hide
Query:  MKPFGFPLKLVRDTVKELLSVGFSDFLSVIVIVIVFVCASSVAVVCRNVYGGDEGWVFIEEGSYTLLIDTLLEKQKEAAIELVHDNERAKDHQETSVAGC
        MKPFGFPLKLVRDTVKELLS                            VYGGD+GWVFIEEGSYTLLIDTLLEKQKE AIELVHDNERAKDHQETS+A C
Subjt:  MKPFGFPLKLVRDTVKELLSVGFSDFLSVIVIVIVFVCASSVAVVCRNVYGGDEGWVFIEEGSYTLLIDTLLEKQKEAAIELVHDNERAKDHQETSVAGC

Query:  SSSAIGETSLYEATRITATLPANDS------NTLFPGDESYWKGDKASVDDEHFRSTFNQSLPAYSPIIRQPLPAYTPKIRRRKPYHGWISNNDKEEDLV
        SSSAI E    E T ITATLPANDS      +TLFPGDESYWK DKASVD +H RSTFNQS           LPAYTPKIRRRKPYHGWI N+DKEEDLV
Subjt:  SSSAIGETSLYEATRITATLPANDS------NTLFPGDESYWKGDKASVDDEHFRSTFNQSLPAYSPIIRQPLPAYTPKIRRRKPYHGWISNNDKEEDLV

Query:  YLTPDSLPEEFAELLMSHALRKRKKRWDVEP
        YLTPD LPEEFA+LL  +A RKRKKRWDVEP
Subjt:  YLTPDSLPEEFAELLMSHALRKRKKRWDVEP

TrEMBL top hitse value%identityAlignment
A0A0A0K7Q2 WIYLD domain-containing protein4.5e-7464.52Show/hide
Query:  MAPR-----RVNLRIDAALDAMKPFGFPLKLVRDTVKELLSVGFSDFLSVIVIVIVFVCASSVAVVCRNVYGGDEGWVFIEEGSYTLLIDTLLEKQKEAA
        MAPR     R NLRIDAALDAMK FGFP KLVRDTVKELL                            +VYGGD+GWVFIEEGSY LLID+LLEKQ E A
Subjt:  MAPR-----RVNLRIDAALDAMKPFGFPLKLVRDTVKELLSVGFSDFLSVIVIVIVFVCASSVAVVCRNVYGGDEGWVFIEEGSYTLLIDTLLEKQKEAA

Query:  IELVHDNERAKDHQETSVAGCSSSAIGETSLYEATRITATLPANDSNTLFPGDESYWKGDKASVDDEHFRSTFNQSLPAYSPIIRQPLPAYTPKIRRRKP
        IE+VHDNER  DHQ TSVAGCS SA   T   EA   TA LPAND +TLFPGDESYW  +K SVDD+HFRSTFNQS           LPAYTPKIRRRK 
Subjt:  IELVHDNERAKDHQETSVAGCSSSAIGETSLYEATRITATLPANDSNTLFPGDESYWKGDKASVDDEHFRSTFNQSLPAYSPIIRQPLPAYTPKIRRRKP

Query:  YHGWISNNDKEEDLVYLTPDSLPEEFAELLMSHALRKRKKRWDVEPEE
        YHGW+  +DKEEDLVYLTPD LPEE A+LL++ AL+KRKKRWDVEP E
Subjt:  YHGWISNNDKEEDLVYLTPDSLPEEFAELLMSHALRKRKKRWDVEPEE

A0A1S3B3F8 uncharacterized protein LOC103485563 isoform X13.4e-7465.73Show/hide
Query:  MAPR-----RVNLRIDAALDAMKPFGFPLKLVRDTVKELLSVGFSDFLSVIVIVIVFVCASSVAVVCRNVYGGDEGWVFIEEGSYTLLIDTLLEKQKEAA
        MAPR     R NLRIDAALDAMK FGFP KLVRDTVKELL                            +VYGGD+GWVFIEEGSYTLLIDTLLEKQ E A
Subjt:  MAPR-----RVNLRIDAALDAMKPFGFPLKLVRDTVKELLSVGFSDFLSVIVIVIVFVCASSVAVVCRNVYGGDEGWVFIEEGSYTLLIDTLLEKQKEAA

Query:  IELVHDNERAKDHQETSVAGCSSSAIGETSLYEATRITATLPANDSNTLFPGDESYWKGDKASVDDEHFRSTFNQSLPAYSPIIRQPLPAYTPKIRRRKP
        IE+VHDNER  DHQ TSVAGCS SAI  T   EA   TATLPAN+ +TLFPGDESYW  +KASVDD+HFRSTFNQS           LPAYTPKIRRRK 
Subjt:  IELVHDNERAKDHQETSVAGCSSSAIGETSLYEATRITATLPANDSNTLFPGDESYWKGDKASVDDEHFRSTFNQSLPAYSPIIRQPLPAYTPKIRRRKP

Query:  YHGWISNNDKEEDLVYLTPDSLPEEFAELLMSHALRKRKKRWDVEPEE
        YHGWI  ++KEEDLVYLTPD LPEE A+LL+  AL+KRKKRWDVE  E
Subjt:  YHGWISNNDKEEDLVYLTPDSLPEEFAELLMSHALRKRKKRWDVEPEE

A0A1S3B448 uncharacterized protein LOC103485563 isoform X24.0e-6765.04Show/hide
Query:  KPFGFPLKLVRDTVKELLSVGFSDFLSVIVIVIVFVCASSVAVVCRNVYGGDEGWVFIEEGSYTLLIDTLLEKQKEAAIELVHDNERAKDHQETSVAGCS
        K FGFP KLVRDTVKELL                            +VYGGD+GWVFIEEGSYTLLIDTLLEKQ E AIE+VHDNER  DHQ TSVAGCS
Subjt:  KPFGFPLKLVRDTVKELLSVGFSDFLSVIVIVIVFVCASSVAVVCRNVYGGDEGWVFIEEGSYTLLIDTLLEKQKEAAIELVHDNERAKDHQETSVAGCS

Query:  SSAIGETSLYEATRITATLPANDSNTLFPGDESYWKGDKASVDDEHFRSTFNQSLPAYSPIIRQPLPAYTPKIRRRKPYHGWISNNDKEEDLVYLTPDSL
         SAI  T   EA   TATLPAN+ +TLFPGDESYW  +KASVDD+HFRSTFNQS           LPAYTPKIRRRK YHGWI  ++KEEDLVYLTPD L
Subjt:  SSAIGETSLYEATRITATLPANDSNTLFPGDESYWKGDKASVDDEHFRSTFNQSLPAYSPIIRQPLPAYTPKIRRRKPYHGWISNNDKEEDLVYLTPDSL

Query:  PEEFAELLMSHALRKRKKRWDVEPEE
        PEE A+LL+  AL+KRKKRWDVE  E
Subjt:  PEEFAELLMSHALRKRKKRWDVEPEE

A0A1S3B481 uncharacterized protein LOC103485563 isoform X31.2e-6665.18Show/hide
Query:  FGFPLKLVRDTVKELLSVGFSDFLSVIVIVIVFVCASSVAVVCRNVYGGDEGWVFIEEGSYTLLIDTLLEKQKEAAIELVHDNERAKDHQETSVAGCSSS
        FGFP KLVRDTVKELL                            +VYGGD+GWVFIEEGSYTLLIDTLLEKQ E AIE+VHDNER  DHQ TSVAGCS S
Subjt:  FGFPLKLVRDTVKELLSVGFSDFLSVIVIVIVFVCASSVAVVCRNVYGGDEGWVFIEEGSYTLLIDTLLEKQKEAAIELVHDNERAKDHQETSVAGCSSS

Query:  AIGETSLYEATRITATLPANDSNTLFPGDESYWKGDKASVDDEHFRSTFNQSLPAYSPIIRQPLPAYTPKIRRRKPYHGWISNNDKEEDLVYLTPDSLPE
        AI  T   EA   TATLPAN+ +TLFPGDESYW  +KASVDD+HFRSTFNQS           LPAYTPKIRRRK YHGWI  ++KEEDLVYLTPD LPE
Subjt:  AIGETSLYEATRITATLPANDSNTLFPGDESYWKGDKASVDDEHFRSTFNQSLPAYSPIIRQPLPAYTPKIRRRKPYHGWISNNDKEEDLVYLTPDSLPE

Query:  EFAELLMSHALRKRKKRWDVEPEE
        E A+LL+  AL+KRKKRWDVE  E
Subjt:  EFAELLMSHALRKRKKRWDVEPEE

A0A5D3DLD6 Ubiquitin-binding WIYLD domain protein1.3e-7366.53Show/hide
Query:  RVNLRIDAALDAMKPFGFPLKLVRDTVKELLSVGFSDFLSVIVIVIVFVCASSVAVVCRNVYGGDEGWVFIEEGSYTLLIDTLLEKQKEAAIELVHDNER
        R NLRIDAALDAMK FGFP KLVRDTVKELL                            +VYGGD+GWVFIEEGSYTLLIDTLLEKQ E AIE+VHDNER
Subjt:  RVNLRIDAALDAMKPFGFPLKLVRDTVKELLSVGFSDFLSVIVIVIVFVCASSVAVVCRNVYGGDEGWVFIEEGSYTLLIDTLLEKQKEAAIELVHDNER

Query:  AKDHQETSVAGCSSSAIGETSLYEATRITATLPANDSNTLFPGDESYWKGDKASVDDEHFRSTFNQSLPAYSPIIRQPLPAYTPKIRRRKPYHGWISNND
          DHQ TSVAGCS SAI  T   EA   TATLPAN+ +TLFPGDESYW  +KASVDD+HFRSTFNQS           LPAYTPKIRRRK YHGWI  ++
Subjt:  AKDHQETSVAGCSSSAIGETSLYEATRITATLPANDSNTLFPGDESYWKGDKASVDDEHFRSTFNQSLPAYSPIIRQPLPAYTPKIRRRKPYHGWISNND

Query:  KEEDLVYLTPDSLPEEFAELLMSHALRKRKKRWDVEPEE
        KEEDLVYLTPD LPEE A+LL+  AL+KRKKRWDVE  E
Subjt:  KEEDLVYLTPDSLPEEFAELLMSHALRKRKKRWDVEPEE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G40020.1 Nucleolar histone methyltransferase-related protein3.7e-0423.27Show/hide
Query:  LRIDAALDAMKPFGFPLKLVRDTVKELLSVGFSDFLSVIVIVIVFVCASSVAVVCRNVYGGDEGWVFIEEGSYTLLIDTLLEKQKEAAIELV--------
        +R DAA D M+ FGF   ++ +++KELL V                               ++ W  IE+ SY  L+   LEKQ+E   +L         
Subjt:  LRIDAALDAMKPFGFPLKLVRDTVKELLSVGFSDFLSVIVIVIVFVCASSVAVVCRNVYGGDEGWVFIEEGSYTLLIDTLLEKQKEAAIELV--------

Query:  --HDNERAKDHQETSVA--------------------------GCSSSAIGETS-----LYEATRITATLPANDSNTLFPGDESYWKGDKASVDDEHFRS
          H+ E A++ Q   +A                            +S A+ + S     L EA+   A         L  G    W  D+   D E    
Subjt:  --HDNERAKDHQETSVA--------------------------GCSSSAIGETS-----LYEATRITATLPANDSNTLFPGDESYWKGDKASVDDEHFRS

Query:  TFNQSLPAYSPIIRQPLPAYTPKIRRRKPYHGWISNNDKEEDLVYLTPDSLPEEFAELLMSHALRKRKK---RWD
                  P  ++  P   PK +  +P      ++  +++++ LTP+ L EE  ELL     +KR+K   RWD
Subjt:  TFNQSLPAYSPIIRQPLPAYTPKIRRRKPYHGWISNNDKEEDLVYLTPDSLPEEFAELLMSHALRKRKK---RWD

AT2G40020.3 Nucleolar histone methyltransferase-related protein3.7e-0423.27Show/hide
Query:  LRIDAALDAMKPFGFPLKLVRDTVKELLSVGFSDFLSVIVIVIVFVCASSVAVVCRNVYGGDEGWVFIEEGSYTLLIDTLLEKQKEAAIELV--------
        +R DAA D M+ FGF   ++ +++KELL V                               ++ W  IE+ SY  L+   LEKQ+E   +L         
Subjt:  LRIDAALDAMKPFGFPLKLVRDTVKELLSVGFSDFLSVIVIVIVFVCASSVAVVCRNVYGGDEGWVFIEEGSYTLLIDTLLEKQKEAAIELV--------

Query:  --HDNERAKDHQETSVA--------------------------GCSSSAIGETS-----LYEATRITATLPANDSNTLFPGDESYWKGDKASVDDEHFRS
          H+ E A++ Q   +A                            +S A+ + S     L EA+   A         L  G    W  D+   D E    
Subjt:  --HDNERAKDHQETSVA--------------------------GCSSSAIGETS-----LYEATRITATLPANDSNTLFPGDESYWKGDKASVDDEHFRS

Query:  TFNQSLPAYSPIIRQPLPAYTPKIRRRKPYHGWISNNDKEEDLVYLTPDSLPEEFAELLMSHALRKRKK---RWD
                  P  ++  P   PK +  +P      ++  +++++ LTP+ L EE  ELL     +KR+K   RWD
Subjt:  TFNQSLPAYSPIIRQPLPAYTPKIRRRKPYHGWISNNDKEEDLVYLTPDSLPEEFAELLMSHALRKRKK---RWD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTCCTAGAAGGGTTAACTTACGCATTGATGCTGCGCTCGATGCTATGAAACCATTCGGATTTCCTCTGAAGCTGGTTCGTGACACGGTCAAGGAGCTCCTTAGTGT
AGGTTTTTCAGATTTCCTCTCTGTCATTGTTATTGTTATTGTTTTTGTGTGTGCTTCTTCTGTGGCTGTAGTTTGTAGAAATGTCTATGGAGGAGACGAGGGGTGGGTAT
TCATTGAAGAAGGCTCTTATACTCTCTTGATCGATACCCTTCTCGAGAAACAGAAAGAGGCTGCAATAGAGTTGGTTCATGATAATGAAAGAGCTAAAGATCATCAGGAG
ACCTCAGTAGCTGGCTGTTCATCGAGTGCTATCGGTGAAACTTCCTTATATGAAGCTACAAGGATCACAGCCACATTGCCTGCAAATGATTCAAATACATTATTTCCTGG
AGATGAAAGTTATTGGAAGGGCGATAAAGCTTCTGTTGATGATGAACATTTTAGGAGTACTTTTAACCAGTCTTTACCAGCATATAGCCCCATAATAAGACAGCCTTTAC
CAGCATATACCCCGAAAATACGAAGGCGAAAACCTTATCATGGCTGGATCAGTAACAATGACAAGGAGGAAGATCTTGTGTACTTAACCCCAGATTCTTTGCCTGAAGAG
TTTGCCGAGTTACTCATGTCTCATGCACTGAGAAAAAGAAAGAAGCGTTGGGATGTGGAACCTGAAGAATTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTCCTAGAAGGGTTAACTTACGCATTGATGCTGCGCTCGATGCTATGAAACCATTCGGATTTCCTCTGAAGCTGGTTCGTGACACGGTCAAGGAGCTCCTTAGTGT
AGGTTTTTCAGATTTCCTCTCTGTCATTGTTATTGTTATTGTTTTTGTGTGTGCTTCTTCTGTGGCTGTAGTTTGTAGAAATGTCTATGGAGGAGACGAGGGGTGGGTAT
TCATTGAAGAAGGCTCTTATACTCTCTTGATCGATACCCTTCTCGAGAAACAGAAAGAGGCTGCAATAGAGTTGGTTCATGATAATGAAAGAGCTAAAGATCATCAGGAG
ACCTCAGTAGCTGGCTGTTCATCGAGTGCTATCGGTGAAACTTCCTTATATGAAGCTACAAGGATCACAGCCACATTGCCTGCAAATGATTCAAATACATTATTTCCTGG
AGATGAAAGTTATTGGAAGGGCGATAAAGCTTCTGTTGATGATGAACATTTTAGGAGTACTTTTAACCAGTCTTTACCAGCATATAGCCCCATAATAAGACAGCCTTTAC
CAGCATATACCCCGAAAATACGAAGGCGAAAACCTTATCATGGCTGGATCAGTAACAATGACAAGGAGGAAGATCTTGTGTACTTAACCCCAGATTCTTTGCCTGAAGAG
TTTGCCGAGTTACTCATGTCTCATGCACTGAGAAAAAGAAAGAAGCGTTGGGATGTGGAACCTGAAGAATTTTGAGTTTTTATCTGGCTCGTTCGTCTGTTTGTTGGATG
TTGGTGACTTGAGGGAAGAAGATAGGTGAAAATGTAAATGGTAATTTTTTTGTAGTATAATAGGGGTTAGTGAATTCTAACCACTAACCTCTTAATGCTTACTTTAGCTC
TATGATTCTTTTTCTCTCTCCTCAGGGAGCCTGCTCCATTTCCTTGTGAGACAAGGGAAAACTGTGTCATGAATTTCACCATCTCTTATCTGTTAGGGTAACATTGGAGG
TAACTTTCAAGCGTTTTAGTTGAATTTGCCCACAAAAAGGTGTTTTTGTTTTAGTGGTGATGGATTATGAGGATGATGTAAATGTAGCTCACTCATATCCCACTTTTGTA
AAATGAATACACCTCTTTTAGCAACTTTCTTCCATATGATCTTGCGTGAGAGATAAAAGGAATCCAGAAAATCTGTGATTTGCGTTGGGTGATCGCTTTTTCACCATAAT
GAACTTTTGTTATACCTCTCTTTTAAATCTAAATGTAAAATGGAAAAGTGACAAGATGGATTAAAAAAAGACACTTCACGT
Protein sequenceShow/hide protein sequence
MAPRRVNLRIDAALDAMKPFGFPLKLVRDTVKELLSVGFSDFLSVIVIVIVFVCASSVAVVCRNVYGGDEGWVFIEEGSYTLLIDTLLEKQKEAAIELVHDNERAKDHQE
TSVAGCSSSAIGETSLYEATRITATLPANDSNTLFPGDESYWKGDKASVDDEHFRSTFNQSLPAYSPIIRQPLPAYTPKIRRRKPYHGWISNNDKEEDLVYLTPDSLPEE
FAELLMSHALRKRKKRWDVEPEEF