; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc11G10610 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc11G10610
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionNucleolar protein 58-like
Genome locationClcChr11:16283387..16285651
RNA-Seq ExpressionClc11G10610
SyntenyClc11G10610
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]7.5e-2330.47Show/hide
Query:  LMIEVGFFLEEA----QVPQYITEIIDHHGWRNFCSSTSWIQPDIVRNFYNGRIDEEKDVVVVDETEVPFNAREINEIYELRDNPDAEGNKIIESTPTKL
        L  E GF L+ +    Q+P +I ++I  H W+ FC+        +VR FY    D E++ V V   +V ++   IN ++ L D P  E ++ I++   + 
Subjt:  LMIEVGFFLEEA----QVPQYITEIIDHHGWRNFCSSTSWIQPDIVRNFYNGRIDEEKDVVVVDETEVPFNAREINEIYELRDNPDAEGNKIIESTPTKL

Query:  MEDAIRVMAKPGAKWDVSPTGIKMLSASSLRPEANLWVYLVKKWLIPTTHDKTVSRDRVMTAYCIMRRITFDVGRIIAAQIRGVFSKPRGQLFFPTLIMN
        +   +  +A  GA+W+VS  G      S+L P A +W + +K  L+PTTH KTVS+DR++  + ++   + +VGR+I ++IR   ++  G LFFP+LI  
Subjt:  MEDAIRVMAKPGAKWDVSPTGIKMLSASSLRPEANLWVYLVKKWLIPTTHDKTVSRDRVMTAYCIMRRITFDVGRIIAAQIRGVFSKPRGQLFFPTLIMN

Query:  LYANASIGGPTEVLDAEKMIEVSVVIAHKTLRRLMRGSQHLPEPAKATKRPHQPIP
        L  NA       +++ EK+     + A    R    G      P ++T++P    P
Subjt:  LYANASIGGPTEVLDAEKMIEVSVVIAHKTLRRLMRGSQHLPEPAKATKRPHQPIP

WP_217833304.1 hypothetical protein, partial [Synechococcus sp. PCC 7002]3.0e-3282.69Show/hide
Query:  KKRRVSPETVEDEEPNSTEQGDRPESPIAATSFDRLAEKEKKKRAMLRKELRTLRQGVEKIAKDALSRKKARREVRPQEVGETQGLLDPQIEEAALAFEE
        KKR+VSPET +DEEP STEQGDRPESPIAATSFDRLAEK KKKR  LRKELR LR+  EKIAKD LS KKAR+EV+PQE GETQGL DPQIEEAALAFEE
Subjt:  KKRRVSPETVEDEEPNSTEQGDRPESPIAATSFDRLAEKEKKKRAMLRKELRTLRQGVEKIAKDALSRKKARREVRPQEVGETQGLLDPQIEEAALAFEE

Query:  EIRE
        E+RE
Subjt:  EIRE

XP_038876674.1 chromatin assembly factor 1 subunit A-like, partial [Benincasa hispida]3.7e-3031.35Show/hide
Query:  PIRKIVMAAEAKRKRDEEEEVVPLLRKKRRVS---PETVEDEEPNSTEQGDRPESPIAATSFDRLAEKEKKKRAMLRKE---LRTLRQGVEKIAKDALSR
        P+   V++ E+++KR +++EV   +          PE     +P ST+     E  +  T  + +AE+     A++ +E     TL     ++  +A + 
Subjt:  PIRKIVMAAEAKRKRDEEEEVVPLLRKKRRVS---PETVEDEEPNSTEQGDRPESPIAATSFDRLAEKEKKKRAMLRKE---LRTLRQGVEKIAKDALSR

Query:  KKARREVRPQEVGETQGLLDPQIEEAALAFEEEIREEERIEQAAEELERELQIEEDEELATRIRAQEEEEGKKKKKKKSRKERIGEDAREEQGVSTLEKK
         K  R V               +EE  +A           E A EE+E +L++ E+                KKKKKKS++ + GE   E       E++
Subjt:  KKARREVRPQEVGETQGLLDPQIEEAALAFEEEIREEERIEQAAEELERELQIEEDEELATRIRAQEEEEGKKKKKKKSRKERIGEDAREEQGVSTLEKK

Query:  KRKEKEVSVTKLPSTHNEKRKGKEKVTEAPVKVAESKKRTVNEIPAKAKGKEKVADASPQQKEGKGRRRAFPNLMIEVGFFLEEAQVPQYITEIIDHHGW
        +++E+E    K        ++ KEK  EA  +  + KK    E+  + K +++      +++E K R+ A   L  E G  +E++  P            
Subjt:  KRKEKEVSVTKLPSTHNEKRKGKEKVTEAPVKVAESKKRTVNEIPAKAKGKEKVADASPQQKEGKGRRRAFPNLMIEVGFFLEEAQVPQYITEIIDHHGW

Query:  RNFCSSTSWIQPDIVRNFYNGRIDEEKDVVVVDETEVPFNAREINEIYELRDNPDAEGNKIIESTPTKLMEDAIRVMAKPGAKWDVSPTGIKMLSASSLR
                     +VR+FY GR+   +D V +    V F+AR+INEIY+++DNP A GNKII+    + MEDA++V+ + G KW VS  G+  L++ SL 
Subjt:  RNFCSSTSWIQPDIVRNFYNGRIDEEKDVVVVDETEVPFNAREINEIYELRDNPDAEGNKIIESTPTKLMEDAIRVMAKPGAKWDVSPTGIKMLSASSLR

Query:  PEANLWVYLVKKWLIPTTHDKTVSRDRVMTAYCIMRRITFDVGRIIAAQIRGV
         E  LWVYLVKK LI TTHDKTVSRDRVM  YCI+R I  DVG++IA Q+R +
Subjt:  PEANLWVYLVKKWLIPTTHDKTVSRDRVMTAYCIMRRITFDVGRIIAAQIRGV

XP_038898613.1 LOW QUALITY PROTEIN: uncharacterized protein LOC120086174 [Benincasa hispida]5.0e-3537.86Show/hide
Query:  ERELQIEEDEELATRIRAQEEEEGKKKKKKKSRKERIGEDAREEQGVSTLEKKKRKEKEVSVTKLPSTHNEKRKGKEKVTEAPVKVAESKKRTVNEIPAK
        E E  +  +E +   + A EE  G    K+  +KE    +A +E GV  L+K    EKE    K+     E+RK +EK      ++A+++ + V E    
Subjt:  ERELQIEEDEELATRIRAQEEEEGKKKKKKKSRKERIGEDAREEQGVSTLEKKKRKEKEVSVTKLPSTHNEKRKGKEKVTEAPVKVAESKKRTVNEIPAK

Query:  AKGKEKVADASPQQKEGKGRRRAFPNLMIEVGFFLEEAQVPQYITEIIDHHGWRNFCSSTSWIQPDIVRNFYNGRIDEEKDVVVVDETEVPFNAREINEI
        A  K++ A           +R+   +++IE+GF+     +P  IT II  HGW  F    S I P +V +FY G + E +D V      V F+ + IN +
Subjt:  AKGKEKVADASPQQKEGKGRRRAFPNLMIEVGFFLEEAQVPQYITEIIDHHGWRNFCSSTSWIQPDIVRNFYNGRIDEEKDVVVVDETEVPFNAREINEI

Query:  YELRDNPDAEGNKIIESTPTKLMEDAIRVMAKPGAKWDVSPTGIKMLSASSLRPEANLWVYLVKKWLIPTTHDKTVSRDR
        Y++RDNPDA GNKII+    +LM +A++V+A+PG +W VSP GI+ L + SL  +  LWVYLVKK LIPTTHD+TVS+D+
Subjt:  YELRDNPDAEGNKIIESTPTKLMEDAIRVMAKPGAKWDVSPTGIKMLSASSLRPEANLWVYLVKKWLIPTTHDKTVSRDR

XP_038904385.1 uncharacterized protein LOC120090747 [Benincasa hispida]2.7e-4139.29Show/hide
Query:  RIRAQEEEEGKKKKKKKSRKERIGEDAREEQGVSTLEKKKRKEK-EVSVTKLPSTHNEKRKGKEKVTEAPVKVAESKKRTVNE-------IPAKAKGKEK
        R+   EE   KK+KK+K   E+  E  REE+ +   EK+KR E  +V      ++  E    +EK +  P + +   +  V E        P   + KE 
Subjt:  RIRAQEEEEGKKKKKKKSRKERIGEDAREEQGVSTLEKKKRKEK-EVSVTKLPSTHNEKRKGKEKVTEAPVKVAESKKRTVNE-------IPAKAKGKEK

Query:  VADASPQ-QKEGKGRRRAFPNLMIEVGFFLEEAQVPQYITEIIDHHGWRNFCSSTSWIQPDIVRNFYNGRIDEEKDVVVVDETEVPFNAREINEIYELRD
             P   K     ++   ++M+E+GF      +P + T ++  HGW  F    S I P +VR FY GR+   KD V++    VPF+AR+INE+Y+++D
Subjt:  VADASPQ-QKEGKGRRRAFPNLMIEVGFFLEEAQVPQYITEIIDHHGWRNFCSSTSWIQPDIVRNFYNGRIDEEKDVVVVDETEVPFNAREINEIYELRD

Query:  NPDAEGNKIIESTPTKLMEDAIRVMAKPGAKWDVSPTGIKMLSASSLRPEANLWVYLVKKWLIPTTHDKTVSRDRVMTAYCIMRRITFDVGRIIAAQIRG
         PDA GNKII+    + MEDA+R + + G +W VS  GIK L++S L PEA LWVYLVK+ +IPT+HDKTVSRDRVM AYCI   I  DV  +IAAQ + 
Subjt:  NPDAEGNKIIESTPTKLMEDAIRVMAKPGAKWDVSPTGIKMLSASSLRPEANLWVYLVKKWLIPTTHDKTVSRDRVMTAYCIMRRITFDVGRIIAAQIRG

Query:  VFSKPRGQ
           + + Q
Subjt:  VFSKPRGQ

TrEMBL top hitse value%identityAlignment
A0A2P5AGA5 Uncharacterized protein (Fragment)1.8e-2230.47Show/hide
Query:  LMIEVGFFLEEA----QVPQYITEIIDHHGWRNFCSSTSWIQPDIVRNFYNGRIDEEKDVVVVDETEVPFNAREINEIYELRDNPDAEGNKIIESTPTKL
        L  E GF L+ +    Q+P +I ++I  H W+ FC+        +VR FY    D  ++ V V   +V ++   IN ++ L D P  E ++ IE+     
Subjt:  LMIEVGFFLEEA----QVPQYITEIIDHHGWRNFCSSTSWIQPDIVRNFYNGRIDEEKDVVVVDETEVPFNAREINEIYELRDNPDAEGNKIIESTPTKL

Query:  MEDAIRVMAKPGAKWDVSPTGIKMLSASSLRPEANLWVYLVKKWLIPTTHDKTVSRDRVMTAYCIMRRITFDVGRIIAAQIRGVFSKPRGQLFFPTLIMN
        +   +  +A  GA+W+VS  G      S+L P A +W + +K  L+PTTH KTVS+DR++  + ++   + +VGR+I ++IR   ++  G LFFP+LI  
Subjt:  MEDAIRVMAKPGAKWDVSPTGIKMLSASSLRPEANLWVYLVKKWLIPTTHDKTVSRDRVMTAYCIMRRITFDVGRIIAAQIRGVFSKPRGQLFFPTLIMN

Query:  LYANASIGGPTEVLDAEKMIEVSVVIAHKTLRRLMRGSQHLPEPAKATKRPHQPIP
        L  NA       +++ EK+     + A    R    G      P ++T++P    P
Subjt:  LYANASIGGPTEVLDAEKMIEVSVVIAHKTLRRLMRGSQHLPEPAKATKRPHQPIP

A0A2P5BCG4 Uncharacterized protein (Fragment)3.6e-2330.47Show/hide
Query:  LMIEVGFFLEEA----QVPQYITEIIDHHGWRNFCSSTSWIQPDIVRNFYNGRIDEEKDVVVVDETEVPFNAREINEIYELRDNPDAEGNKIIESTPTKL
        L  E GF L+ +    Q+P +I ++I  H W+ FC+        +VR FY    D E++ V V   +V ++   IN ++ L D P  E ++ I++   + 
Subjt:  LMIEVGFFLEEA----QVPQYITEIIDHHGWRNFCSSTSWIQPDIVRNFYNGRIDEEKDVVVVDETEVPFNAREINEIYELRDNPDAEGNKIIESTPTKL

Query:  MEDAIRVMAKPGAKWDVSPTGIKMLSASSLRPEANLWVYLVKKWLIPTTHDKTVSRDRVMTAYCIMRRITFDVGRIIAAQIRGVFSKPRGQLFFPTLIMN
        +   +  +A  GA+W+VS  G      S+L P A +W + +K  L+PTTH KTVS+DR++  + ++   + +VGR+I ++IR   ++  G LFFP+LI  
Subjt:  MEDAIRVMAKPGAKWDVSPTGIKMLSASSLRPEANLWVYLVKKWLIPTTHDKTVSRDRVMTAYCIMRRITFDVGRIIAAQIRGVFSKPRGQLFFPTLIMN

Query:  LYANASIGGPTEVLDAEKMIEVSVVIAHKTLRRLMRGSQHLPEPAKATKRPHQPIP
        L  NA       +++ EK+     + A    R    G      P ++T++P    P
Subjt:  LYANASIGGPTEVLDAEKMIEVSVVIAHKTLRRLMRGSQHLPEPAKATKRPHQPIP

A0A2P5DAQ2 Uncharacterized protein1.2e-2132.8Show/hide
Query:  PQYITEIIDHHGWRNFCSSTSWIQPDIVRNFYNGRIDEEKDVVVVDETEVPFNAREINEIYELRDNPDAEGNKIIESTPTKLMEDAIRVMAKPGAKWDVS
        P +I ++I  H W+ FC+        +VR FY    + + D V +   +VP +   IN I+ L D P  E ++ +E      +   +  +A  GA+W+VS
Subjt:  PQYITEIIDHHGWRNFCSSTSWIQPDIVRNFYNGRIDEEKDVVVVDETEVPFNAREINEIYELRDNPDAEGNKIIESTPTKLMEDAIRVMAKPGAKWDVS

Query:  PTGIKMLSASSLRPEANLWVYLVKKWLIPTTHDKTVSRDRVMTAYCIMRRITFDVGRIIAAQIRGVFSKPRGQLFFPTLIMNLYAN
          G      SSL P A +W + +K  L+PTTH KTVS++ V   Y ++   + +VGR+I  +I    ++  G LFFP+LI ++  N
Subjt:  PTGIKMLSASSLRPEANLWVYLVKKWLIPTTHDKTVSRDRVMTAYCIMRRITFDVGRIIAAQIRGVFSKPRGQLFFPTLIMNLYAN

A0A5D3DVQ6 Uncharacterized protein9.9e-2132.83Show/hide
Query:  NLMIEVGFFLEEAQVPQYITEIIDHHGWRNFCSSTSWIQPDIVRNFYNGRIDEEKDVVVVDETEVPFNAREINEIYELRDNPDAEGNKIIESTPTKLMED
        + M+E GFF+ E Q+  ++   I   GW+ F      I+  +V+ FYNG+ID EK   +V E   P +                             M++
Subjt:  NLMIEVGFFLEEAQVPQYITEIIDHHGWRNFCSSTSWIQPDIVRNFYNGRIDEEKDVVVVDETEVPFNAREINEIYELRDNPDAEGNKIIESTPTKLMED

Query:  AIRVMAKPGAKWDVSPTGIKMLSASSLRPEANLWVYLVKKWLIPTTHDKTVSRDRVMTAYCIMRRITFDVGRIIAAQIRGVFSKPRGQLFFPTLIMNL
        A+  +A    KWDV+      L   +L  EA++W+  +KK L+PT HD T+S +R+M  YCIM  I  DV  II   I+     PRG   FP LI  L
Subjt:  AIRVMAKPGAKWDVSPTGIKMLSASSLRPEANLWVYLVKKWLIPTTHDKTVSRDRVMTAYCIMRRITFDVGRIIAAQIRGVFSKPRGQLFFPTLIMNL

W9QTD9 Uncharacterized protein8.1e-2334.21Show/hide
Query:  PQYITEIIDHHGWRNFCSSTSWIQPDIVRNFYNGRIDEEKDVVVVDETEVPFNAREINEIYELRDNPDAEGNKIIESTPTKLMEDAIRVMAKPGAKWDVS
        P +IT +I  HGWR FC   S     +VR FY   +D  ++ V V   +VPF AR IN I+ L +  D   +   E T  +L E  +  +A  GA W +S
Subjt:  PQYITEIIDHHGWRNFCSSTSWIQPDIVRNFYNGRIDEEKDVVVVDETEVPFNAREINEIYELRDNPDAEGNKIIESTPTKLMEDAIRVMAKPGAKWDVS

Query:  PTGIKMLSASSLRPEANLWVYLVKKWLIPTTHDKTVSRDRVMTAYCIMRRITFDVGRIIAAQIRGV-FSKPRGQLFFPTLIMNLYANASI
        P G        L+  A +W + +    +P+TH KTV++DRV+  Y I+  I+ ++  I   +I+    ++ RG L+FP+LI  L+  A++
Subjt:  PTGIKMLSASSLRPEANLWVYLVKKWLIPTTHDKTVSRDRVMTAYCIMRRITFDVGRIIAAQIRGV-FSKPRGQLFFPTLIMNLYANASI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACCTTGGACTCCAATGAAGGACCAATAAGAAAAATTGTGATGGCCGCAGAGGCCAAGAGAAAAAGAGATGAGGAAGAGGAAGTGGTACCCCTGTTGAGGAAGAAACG
AAGGGTATCACCGGAAACTGTCGAGGATGAAGAACCCAATAGCACAGAACAAGGAGACAGACCTGAGTCCCCAATTGCAGCCACATCCTTTGATAGGCTGGCAGAGAAAG
AGAAGAAGAAGAGGGCCATGTTGAGAAAAGAGCTCAGGACCCTAAGACAAGGAGTTGAGAAAATTGCCAAGGACGCATTGTCCCGCAAGAAGGCGCGCAGGGAAGTGCGG
CCTCAAGAAGTAGGGGAGACGCAAGGCCTTCTTGATCCTCAAATAGAAGAAGCCGCATTGGCATTTGAAGAGGAAATAAGGGAAGAAGAAAGAATAGAACAAGCAGCCGA
GGAGCTTGAAAGAGAGCTACAAATTGAAGAAGATGAAGAGCTGGCTACTAGAATAAGAGCCCAAGAGGAAGAAGAGGGCAAGAAGAAAAAGAAAAAGAAAAGTAGGAAGG
AACGCATCGGGGAGGATGCGAGGGAAGAACAAGGTGTTAGCACCTTGGAGAAGAAGAAGAGGAAAGAAAAGGAGGTGTCAGTCACCAAACTCCCAAGCACGCATAATGAG
AAACGCAAAGGGAAGGAGAAGGTGACTGAAGCACCAGTTAAAGTGGCCGAGAGCAAGAAGAGAACTGTGAATGAGATTCCAGCTAAGGCCAAAGGAAAAGAGAAGGTGGC
TGATGCGTCGCCACAACAAAAGGAAGGAAAAGGGAGACGCAGGGCGTTCCCTAACTTGATGATTGAGGTGGGATTTTTCCTTGAAGAAGCCCAAGTGCCGCAATACATCA
CCGAAATAATTGACCACCACGGCTGGAGAAACTTCTGTTCCAGCACCTCTTGGATCCAACCTGACATAGTCCGAAATTTTTACAATGGACGCATAGACGAAGAGAAAGAC
GTGGTGGTGGTGGATGAAACTGAAGTCCCATTCAATGCAAGAGAAATCAATGAAATATATGAGTTGAGGGACAACCCGGATGCGGAAGGAAACAAGATCATAGAATCAAC
TCCAACGAAGTTAATGGAGGACGCAATCCGGGTAATGGCGAAGCCAGGGGCCAAGTGGGATGTTTCACCCACAGGTATAAAAATGCTATCAGCTAGCAGCCTAAGACCAG
AGGCAAACCTGTGGGTGTACTTGGTGAAGAAGTGGCTGATCCCTACAACGCATGACAAGACAGTGTCAAGGGATCGGGTAATGACAGCATATTGCATTATGCGTCGCATC
ACCTTTGATGTTGGACGCATAATTGCGGCGCAGATAAGAGGGGTGTTCAGCAAGCCAAGGGGCCAACTCTTTTTCCCCACTCTTATCATGAACTTATATGCCAATGCTTC
CATCGGAGGGCCAACGGAAGTGTTAGACGCAGAGAAGATGATTGAAGTTAGTGTCGTCATCGCCCACAAGACTCTGCGTCGGTTGATGAGAGGATCACAACATCTGCCAG
AGCCTGCCAAGGCCACCAAAAGACCTCATCAACCCATTCCGCCTCCAAAGCCGCAACCGCAGAAAAGGGAACCAACTCTACCTCCACCTCCTCTTGAGACGCATCGCCCT
AAGCCACCTCCACCTCCTCATAAAACGCATCACCCCAAACATCCAAAACCTCGACCTTCCACAAAACCCTCCGCCCCAATAAGAGTTGATGACTCACACTTTGGGGATCT
TGGCGCGACTTTTATGCGTCCATCATCCCCTGCATTTGACGAAACTCCATCCACTCCCCCACCTCAGCCAAACCCACCTACTCCACCTCAGCCAAACTCGCCTACCCCAC
CAATGCAACCACAACCGCAAGAAGGCCAACTTGACCTTGAGTGGGAGATAGTGCAAACATATCTGCAGCAATCGATCTTGTGTCCTGTGATGCACTCATTGAATATGTTA
GCACGCCGGCAGACGGAGCAGTTCCAATTCACATTGAATTACGTTCATCAACTGCTTTCGATCCGACCAGAAATCCCGCCCCCTGATATTTCAGAACTCCTCCAACAGCC
CCTTGTCTTCCCAGTGCCGCAGCGCCCTACACCTAAGCAGCAACGCAGACCTGATGAAGACAGGACTGATGGTTAA
mRNA sequenceShow/hide mRNA sequence
ATGACCTTGGACTCCAATGAAGGACCAATAAGAAAAATTGTGATGGCCGCAGAGGCCAAGAGAAAAAGAGATGAGGAAGAGGAAGTGGTACCCCTGTTGAGGAAGAAACG
AAGGGTATCACCGGAAACTGTCGAGGATGAAGAACCCAATAGCACAGAACAAGGAGACAGACCTGAGTCCCCAATTGCAGCCACATCCTTTGATAGGCTGGCAGAGAAAG
AGAAGAAGAAGAGGGCCATGTTGAGAAAAGAGCTCAGGACCCTAAGACAAGGAGTTGAGAAAATTGCCAAGGACGCATTGTCCCGCAAGAAGGCGCGCAGGGAAGTGCGG
CCTCAAGAAGTAGGGGAGACGCAAGGCCTTCTTGATCCTCAAATAGAAGAAGCCGCATTGGCATTTGAAGAGGAAATAAGGGAAGAAGAAAGAATAGAACAAGCAGCCGA
GGAGCTTGAAAGAGAGCTACAAATTGAAGAAGATGAAGAGCTGGCTACTAGAATAAGAGCCCAAGAGGAAGAAGAGGGCAAGAAGAAAAAGAAAAAGAAAAGTAGGAAGG
AACGCATCGGGGAGGATGCGAGGGAAGAACAAGGTGTTAGCACCTTGGAGAAGAAGAAGAGGAAAGAAAAGGAGGTGTCAGTCACCAAACTCCCAAGCACGCATAATGAG
AAACGCAAAGGGAAGGAGAAGGTGACTGAAGCACCAGTTAAAGTGGCCGAGAGCAAGAAGAGAACTGTGAATGAGATTCCAGCTAAGGCCAAAGGAAAAGAGAAGGTGGC
TGATGCGTCGCCACAACAAAAGGAAGGAAAAGGGAGACGCAGGGCGTTCCCTAACTTGATGATTGAGGTGGGATTTTTCCTTGAAGAAGCCCAAGTGCCGCAATACATCA
CCGAAATAATTGACCACCACGGCTGGAGAAACTTCTGTTCCAGCACCTCTTGGATCCAACCTGACATAGTCCGAAATTTTTACAATGGACGCATAGACGAAGAGAAAGAC
GTGGTGGTGGTGGATGAAACTGAAGTCCCATTCAATGCAAGAGAAATCAATGAAATATATGAGTTGAGGGACAACCCGGATGCGGAAGGAAACAAGATCATAGAATCAAC
TCCAACGAAGTTAATGGAGGACGCAATCCGGGTAATGGCGAAGCCAGGGGCCAAGTGGGATGTTTCACCCACAGGTATAAAAATGCTATCAGCTAGCAGCCTAAGACCAG
AGGCAAACCTGTGGGTGTACTTGGTGAAGAAGTGGCTGATCCCTACAACGCATGACAAGACAGTGTCAAGGGATCGGGTAATGACAGCATATTGCATTATGCGTCGCATC
ACCTTTGATGTTGGACGCATAATTGCGGCGCAGATAAGAGGGGTGTTCAGCAAGCCAAGGGGCCAACTCTTTTTCCCCACTCTTATCATGAACTTATATGCCAATGCTTC
CATCGGAGGGCCAACGGAAGTGTTAGACGCAGAGAAGATGATTGAAGTTAGTGTCGTCATCGCCCACAAGACTCTGCGTCGGTTGATGAGAGGATCACAACATCTGCCAG
AGCCTGCCAAGGCCACCAAAAGACCTCATCAACCCATTCCGCCTCCAAAGCCGCAACCGCAGAAAAGGGAACCAACTCTACCTCCACCTCCTCTTGAGACGCATCGCCCT
AAGCCACCTCCACCTCCTCATAAAACGCATCACCCCAAACATCCAAAACCTCGACCTTCCACAAAACCCTCCGCCCCAATAAGAGTTGATGACTCACACTTTGGGGATCT
TGGCGCGACTTTTATGCGTCCATCATCCCCTGCATTTGACGAAACTCCATCCACTCCCCCACCTCAGCCAAACCCACCTACTCCACCTCAGCCAAACTCGCCTACCCCAC
CAATGCAACCACAACCGCAAGAAGGCCAACTTGACCTTGAGTGGGAGATAGTGCAAACATATCTGCAGCAATCGATCTTGTGTCCTGTGATGCACTCATTGAATATGTTA
GCACGCCGGCAGACGGAGCAGTTCCAATTCACATTGAATTACGTTCATCAACTGCTTTCGATCCGACCAGAAATCCCGCCCCCTGATATTTCAGAACTCCTCCAACAGCC
CCTTGTCTTCCCAGTGCCGCAGCGCCCTACACCTAAGCAGCAACGCAGACCTGATGAAGACAGGACTGATGGTTAA
Protein sequenceShow/hide protein sequence
MTLDSNEGPIRKIVMAAEAKRKRDEEEEVVPLLRKKRRVSPETVEDEEPNSTEQGDRPESPIAATSFDRLAEKEKKKRAMLRKELRTLRQGVEKIAKDALSRKKARREVR
PQEVGETQGLLDPQIEEAALAFEEEIREEERIEQAAEELERELQIEEDEELATRIRAQEEEEGKKKKKKKSRKERIGEDAREEQGVSTLEKKKRKEKEVSVTKLPSTHNE
KRKGKEKVTEAPVKVAESKKRTVNEIPAKAKGKEKVADASPQQKEGKGRRRAFPNLMIEVGFFLEEAQVPQYITEIIDHHGWRNFCSSTSWIQPDIVRNFYNGRIDEEKD
VVVVDETEVPFNAREINEIYELRDNPDAEGNKIIESTPTKLMEDAIRVMAKPGAKWDVSPTGIKMLSASSLRPEANLWVYLVKKWLIPTTHDKTVSRDRVMTAYCIMRRI
TFDVGRIIAAQIRGVFSKPRGQLFFPTLIMNLYANASIGGPTEVLDAEKMIEVSVVIAHKTLRRLMRGSQHLPEPAKATKRPHQPIPPPKPQPQKREPTLPPPPLETHRP
KPPPPPHKTHHPKHPKPRPSTKPSAPIRVDDSHFGDLGATFMRPSSPAFDETPSTPPPQPNPPTPPQPNSPTPPMQPQPQEGQLDLEWEIVQTYLQQSILCPVMHSLNML
ARRQTEQFQFTLNYVHQLLSIRPEIPPPDISELLQQPLVFPVPQRPTPKQQRRPDEDRTDG