; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc11G09280 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc11G09280
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description5'-3' exonuclease
Genome locationClcChr11:11862983..11866221
RNA-Seq ExpressionClc11G09280
SyntenyClc11G09280
Gene Ontology termsGO:0006261 - DNA-dependent DNA replication (biological process)
GO:0006302 - double-strand break repair (biological process)
GO:0071897 - DNA biosynthetic process (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0003677 - DNA binding (molecular function)
GO:0003887 - DNA-directed DNA polymerase activity (molecular function)
GO:0004527 - exonuclease activity (molecular function)
InterPro domainsIPR002298 - DNA polymerase A
IPR002421 - 5'-3' exonuclease
IPR020045 - DNA polymerase I-like, H3TH domain
IPR020046 - 5'-3' exonuclease, alpha-helical arch, N-terminal
IPR029060 - PIN-like domain superfamily
IPR036279 - 5'-3' exonuclease, C-terminal domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0037015.1 5'-3' exonuclease [Cucumis melo var. makuwa]1.5e-19290.72Show/hide
Query:  MAEASANIGVNFPPFLNSSSRSSLPSRTLKAESALTSNLKPFTWRTKPLKLTAFAASSRSTSAAFNQTDGGQFHPRIEADNTRKGRVFFLDVNPLCYQGN
        M EASA IGVNFPPFLNSSSR+ LPSRT     ALTS LKP TWRTKPLKLTAF  SSR TSAAFNQTD G+F PRIEADN RKGRVFFLDVNPLCYQGN
Subjt:  MAEASANIGVNFPPFLNSSSRSSLPSRTLKAESALTSNLKPFTWRTKPLKLTAFAASSRSTSAAFNQTDGGQFHPRIEADNTRKGRVFFLDVNPLCYQGN

Query:  RPSLHNFGRWVSIFFEEVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQPSSQRFTKGNSRRSYQVIRDALRNCNVPVVKVDGQEADDVVATLVE
        +PSL NFGRWVSIFFEEVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTR PSSQRFTKGN R SYQVIRDALR+CNVPVVKVDG EADDVVATLVE
Subjt:  RPSLHNFGRWVSIFFEEVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQPSSQRFTKGNSRRSYQVIRDALRNCNVPVVKVDGQEADDVVATLVE

Query:  QVLQKGVRVVIASPDKDFKQLISDNVQLVMPLPELNRWSFYTLRHYIAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTALKLLKKHGSLENLL
        QVLQ+GVRVV+ASPDKDFKQLIS++VQLVMPLPELNRWSFYT+RHY+AQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTALKLLKKHGSLENLL
Subjt:  QVLQKGVRVVIASPDKDFKQLISDNVQLVMPLPELNRWSFYTLRHYIAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTALKLLKKHGSLENLL

Query:  SAAAIRTVGKPYAQDTLTKYAEYLRTNYKVLALRRDVDVQFQDEWLVERDRQNDSTILSKFVENNDRNSLVQPSQQV
        SAAAIRTVGKPYAQD LTKYAEYLRTNYKVLALRRDVDVQFQDEWLVERDR+NDSTILSKFVENNDRN LVQPS+QV
Subjt:  SAAAIRTVGKPYAQDTLTKYAEYLRTNYKVLALRRDVDVQFQDEWLVERDRQNDSTILSKFVENNDRNSLVQPSQQV

XP_004138652.1 uncharacterized protein LOC101219234 [Cucumis sativus]2.6e-19289.66Show/hide
Query:  MAEASANIGVNFPPFLNSSSRSSLPSRTLKAESALTSNLKPFTWRTKPLKLTAFAASSRSTSAAFNQTDGGQFHPRIEADNTRKGRVFFLDVNPLCYQGN
        MAEASANIGVNFPPFLNSSS + LPSRTLK E  LTS LKP TWRTKPL LTAFA SSR TSAAF QTD G+F PRIEADN+R GRVFFLDVNPLCYQG+
Subjt:  MAEASANIGVNFPPFLNSSSRSSLPSRTLKAESALTSNLKPFTWRTKPLKLTAFAASSRSTSAAFNQTDGGQFHPRIEADNTRKGRVFFLDVNPLCYQGN

Query:  RPSLHNFGRWVSIFFEEVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQPSSQRFTKGNSRRSYQVIRDALRNCNVPVVKVDGQEADDVVATLVE
        +PSL NFGRWVSIFFEEVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTR PSS+RFTKGN R SYQVIRDALR+CNVPVV+V+G EADDV+ATLVE
Subjt:  RPSLHNFGRWVSIFFEEVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQPSSQRFTKGNSRRSYQVIRDALRNCNVPVVKVDGQEADDVVATLVE

Query:  QVLQKGVRVVIASPDKDFKQLISDNVQLVMPLPELNRWSFYTLRHYIAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTALKLLKKHGSLENLL
        QVLQ+GVRVV+ASPDKDFKQLIS+++QLVMPLPELNRWSFYTLRHY+AQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTALKLLKKHGSLENLL
Subjt:  QVLQKGVRVVIASPDKDFKQLISDNVQLVMPLPELNRWSFYTLRHYIAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTALKLLKKHGSLENLL

Query:  SAAAIRTVGKPYAQDTLTKYAEYLRTNYKVLALRRDVDVQFQDEWLVERDRQNDSTILSKFVENNDRNSLVQPSQQV
        SAAAIRTVGKPYAQD LTKYAEYLRTNYKVLALRRDVDVQFQDEWLVERDR+NDSTILSKFVENNDRN LVQPS+QV
Subjt:  SAAAIRTVGKPYAQDTLTKYAEYLRTNYKVLALRRDVDVQFQDEWLVERDRQNDSTILSKFVENNDRNSLVQPSQQV

XP_008441212.1 PREDICTED: 5'-3' exonuclease [Cucumis melo]1.5e-19290.72Show/hide
Query:  MAEASANIGVNFPPFLNSSSRSSLPSRTLKAESALTSNLKPFTWRTKPLKLTAFAASSRSTSAAFNQTDGGQFHPRIEADNTRKGRVFFLDVNPLCYQGN
        M EASA IGVNFPPFLNSSSR+ LPSRT     ALTS LKP TWRTKPLKLTAF  SSR TSAAFNQTD G+F PRIEADN RKGRVFFLDVNPLCYQGN
Subjt:  MAEASANIGVNFPPFLNSSSRSSLPSRTLKAESALTSNLKPFTWRTKPLKLTAFAASSRSTSAAFNQTDGGQFHPRIEADNTRKGRVFFLDVNPLCYQGN

Query:  RPSLHNFGRWVSIFFEEVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQPSSQRFTKGNSRRSYQVIRDALRNCNVPVVKVDGQEADDVVATLVE
        +PSL NFGRWVSIFFEEVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTR PSSQRFTKGN R SYQVIRDALR+CNVPVVKVDG EADDVVATLVE
Subjt:  RPSLHNFGRWVSIFFEEVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQPSSQRFTKGNSRRSYQVIRDALRNCNVPVVKVDGQEADDVVATLVE

Query:  QVLQKGVRVVIASPDKDFKQLISDNVQLVMPLPELNRWSFYTLRHYIAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTALKLLKKHGSLENLL
        QVLQ+GVRVV+ASPDKDFKQLIS++VQLVMPLPELNRWSFYT+RHY+AQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTALKLLKKHGSLENLL
Subjt:  QVLQKGVRVVIASPDKDFKQLISDNVQLVMPLPELNRWSFYTLRHYIAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTALKLLKKHGSLENLL

Query:  SAAAIRTVGKPYAQDTLTKYAEYLRTNYKVLALRRDVDVQFQDEWLVERDRQNDSTILSKFVENNDRNSLVQPSQQV
        SAAAIRTVGKPYAQD LTKYAEYLRTNYKVLALRRDVDVQFQDEWLVERDR+NDSTILSKFVENNDRN LVQPS+QV
Subjt:  SAAAIRTVGKPYAQDTLTKYAEYLRTNYKVLALRRDVDVQFQDEWLVERDRQNDSTILSKFVENNDRNSLVQPSQQV

XP_022152532.1 uncharacterized protein LOC111020233 [Momordica charantia]1.2e-18487Show/hide
Query:  MAEASANIGVNFPPFLNSSSRSSLPSRTLKAESALTSNLKPFTWRTKPLKLTAFAASSRSTSAAFNQTDGGQFHPRIEADNTRKGRVFFLDVNPLCYQGN
        MAEA ANIGVN PPFLNS+SRSSLPSRTLK ES LT+  K  +WRTK L+L+AF  +S ST   F QTDGG   P IEADN RKGRVFFLDVNPLCY+G+
Subjt:  MAEASANIGVNFPPFLNSSSRSSLPSRTLKAESALTSNLKPFTWRTKPLKLTAFAASSRSTSAAFNQTDGGQFHPRIEADNTRKGRVFFLDVNPLCYQGN

Query:  RPSLHNFGRWVSIFFEEVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQPSSQRFTKGNSRRSYQVIRDALRNCNVPVVKVDGQEADDVVATLVE
        RPSLHNFGRW SIFFE+VSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQ SSQR+TKGNSRR YQVIRDALRNCNVPVVKVDG EADDVVATLV+
Subjt:  RPSLHNFGRWVSIFFEEVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQPSSQRFTKGNSRRSYQVIRDALRNCNVPVVKVDGQEADDVVATLVE

Query:  QVLQKGVRVVIASPDKDFKQLISDNVQLVMPLPELNRWSFYTLRHYIAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTALKLLKKHGSLENLL
        QVLQ+G RVVIASPDKDFKQLIS++VQLVMPLPELNRWSFYTLRHY+AQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTA+KLLKKHGSLENLL
Subjt:  QVLQKGVRVVIASPDKDFKQLISDNVQLVMPLPELNRWSFYTLRHYIAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTALKLLKKHGSLENLL

Query:  SAAAIRTVGKPYAQDTLTKYAEYLRTNYKVLALRRDVDVQFQDEWLVERDRQNDSTILSKFVENNDRNSLVQPSQQV
        SAAAIRTVG+PYAQD LTKYA+YLRTNYKVLALRRDVDVQFQ+EWLVERDRQNDS ILSKFVENNDRNSLVQPS++V
Subjt:  SAAAIRTVGKPYAQDTLTKYAEYLRTNYKVLALRRDVDVQFQDEWLVERDRQNDSTILSKFVENNDRNSLVQPSQQV

XP_038884032.1 5'-3' exonuclease [Benincasa hispida]1.5e-19592.06Show/hide
Query:  MAEASANIGV-NFPPFLNSSSRSSLPSRTLKAESALTSNLKPFTWRTKPLKLTAFAASSRSTSAAFNQTDGGQFHPRIEADNTRKGRVFFLDVNPLCYQG
        MAEASANIGV N PPFLNSSSR+SLPSRTLKAE+A+TS LK  TWRTKPLKLT FAASSR TSAAFNQTD G+F PRIEADN R GRVFFLDVNPLCYQG
Subjt:  MAEASANIGV-NFPPFLNSSSRSSLPSRTLKAESALTSNLKPFTWRTKPLKLTAFAASSRSTSAAFNQTDGGQFHPRIEADNTRKGRVFFLDVNPLCYQG

Query:  NRPSLHNFGRWVSIFFEEVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQPSSQRFTKGNSRRSYQVIRDALRNCNVPVVKVDGQEADDVVATLV
        NRPSLHNFGRW+SIFFEEVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQPSSQRFTKGNSRRSYQVIRDALRNCNVPVVKVDGQEADDVVATLV
Subjt:  NRPSLHNFGRWVSIFFEEVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQPSSQRFTKGNSRRSYQVIRDALRNCNVPVVKVDGQEADDVVATLV

Query:  EQVLQKGVRVVIASPDKDFKQLISDNVQLVMPLPELNRWSFYTLRHYIAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTALKLLKKHGSLENL
        EQVLQ+GVRVVIASPDKDFKQLIS++VQLVMPLPELNRWSFYTLRHY+AQY+CDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTALKLLKKHGSLENL
Subjt:  EQVLQKGVRVVIASPDKDFKQLISDNVQLVMPLPELNRWSFYTLRHYIAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTALKLLKKHGSLENL

Query:  LSAAAIRTVGKPYAQDTLTKYAEYLRTNYKVLALRRDVDVQFQDEWLVERDRQNDSTILSKFVENNDRNSLVQPSQQV
        LSAAAIRTVGKPYAQ  LTKYAEYLRTNYKVLALRRDVDVQFQDEWLVERDRQND  ILSKFVEN +RNSL QPS++V
Subjt:  LSAAAIRTVGKPYAQDTLTKYAEYLRTNYKVLALRRDVDVQFQDEWLVERDRQNDSTILSKFVENNDRNSLVQPSQQV

TrEMBL top hitse value%identityAlignment
A0A0A0LQN4 53EXOc domain-containing protein1.3e-19289.66Show/hide
Query:  MAEASANIGVNFPPFLNSSSRSSLPSRTLKAESALTSNLKPFTWRTKPLKLTAFAASSRSTSAAFNQTDGGQFHPRIEADNTRKGRVFFLDVNPLCYQGN
        MAEASANIGVNFPPFLNSSS + LPSRTLK E  LTS LKP TWRTKPL LTAFA SSR TSAAF QTD G+F PRIEADN+R GRVFFLDVNPLCYQG+
Subjt:  MAEASANIGVNFPPFLNSSSRSSLPSRTLKAESALTSNLKPFTWRTKPLKLTAFAASSRSTSAAFNQTDGGQFHPRIEADNTRKGRVFFLDVNPLCYQGN

Query:  RPSLHNFGRWVSIFFEEVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQPSSQRFTKGNSRRSYQVIRDALRNCNVPVVKVDGQEADDVVATLVE
        +PSL NFGRWVSIFFEEVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTR PSS+RFTKGN R SYQVIRDALR+CNVPVV+V+G EADDV+ATLVE
Subjt:  RPSLHNFGRWVSIFFEEVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQPSSQRFTKGNSRRSYQVIRDALRNCNVPVVKVDGQEADDVVATLVE

Query:  QVLQKGVRVVIASPDKDFKQLISDNVQLVMPLPELNRWSFYTLRHYIAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTALKLLKKHGSLENLL
        QVLQ+GVRVV+ASPDKDFKQLIS+++QLVMPLPELNRWSFYTLRHY+AQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTALKLLKKHGSLENLL
Subjt:  QVLQKGVRVVIASPDKDFKQLISDNVQLVMPLPELNRWSFYTLRHYIAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTALKLLKKHGSLENLL

Query:  SAAAIRTVGKPYAQDTLTKYAEYLRTNYKVLALRRDVDVQFQDEWLVERDRQNDSTILSKFVENNDRNSLVQPSQQV
        SAAAIRTVGKPYAQD LTKYAEYLRTNYKVLALRRDVDVQFQDEWLVERDR+NDSTILSKFVENNDRN LVQPS+QV
Subjt:  SAAAIRTVGKPYAQDTLTKYAEYLRTNYKVLALRRDVDVQFQDEWLVERDRQNDSTILSKFVENNDRNSLVQPSQQV

A0A1S3B2X7 5'-3' exonuclease7.3e-19390.72Show/hide
Query:  MAEASANIGVNFPPFLNSSSRSSLPSRTLKAESALTSNLKPFTWRTKPLKLTAFAASSRSTSAAFNQTDGGQFHPRIEADNTRKGRVFFLDVNPLCYQGN
        M EASA IGVNFPPFLNSSSR+ LPSRT     ALTS LKP TWRTKPLKLTAF  SSR TSAAFNQTD G+F PRIEADN RKGRVFFLDVNPLCYQGN
Subjt:  MAEASANIGVNFPPFLNSSSRSSLPSRTLKAESALTSNLKPFTWRTKPLKLTAFAASSRSTSAAFNQTDGGQFHPRIEADNTRKGRVFFLDVNPLCYQGN

Query:  RPSLHNFGRWVSIFFEEVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQPSSQRFTKGNSRRSYQVIRDALRNCNVPVVKVDGQEADDVVATLVE
        +PSL NFGRWVSIFFEEVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTR PSSQRFTKGN R SYQVIRDALR+CNVPVVKVDG EADDVVATLVE
Subjt:  RPSLHNFGRWVSIFFEEVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQPSSQRFTKGNSRRSYQVIRDALRNCNVPVVKVDGQEADDVVATLVE

Query:  QVLQKGVRVVIASPDKDFKQLISDNVQLVMPLPELNRWSFYTLRHYIAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTALKLLKKHGSLENLL
        QVLQ+GVRVV+ASPDKDFKQLIS++VQLVMPLPELNRWSFYT+RHY+AQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTALKLLKKHGSLENLL
Subjt:  QVLQKGVRVVIASPDKDFKQLISDNVQLVMPLPELNRWSFYTLRHYIAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTALKLLKKHGSLENLL

Query:  SAAAIRTVGKPYAQDTLTKYAEYLRTNYKVLALRRDVDVQFQDEWLVERDRQNDSTILSKFVENNDRNSLVQPSQQV
        SAAAIRTVGKPYAQD LTKYAEYLRTNYKVLALRRDVDVQFQDEWLVERDR+NDSTILSKFVENNDRN LVQPS+QV
Subjt:  SAAAIRTVGKPYAQDTLTKYAEYLRTNYKVLALRRDVDVQFQDEWLVERDRQNDSTILSKFVENNDRNSLVQPSQQV

A0A5A7T6A6 5'-3' exonuclease7.3e-19390.72Show/hide
Query:  MAEASANIGVNFPPFLNSSSRSSLPSRTLKAESALTSNLKPFTWRTKPLKLTAFAASSRSTSAAFNQTDGGQFHPRIEADNTRKGRVFFLDVNPLCYQGN
        M EASA IGVNFPPFLNSSSR+ LPSRT     ALTS LKP TWRTKPLKLTAF  SSR TSAAFNQTD G+F PRIEADN RKGRVFFLDVNPLCYQGN
Subjt:  MAEASANIGVNFPPFLNSSSRSSLPSRTLKAESALTSNLKPFTWRTKPLKLTAFAASSRSTSAAFNQTDGGQFHPRIEADNTRKGRVFFLDVNPLCYQGN

Query:  RPSLHNFGRWVSIFFEEVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQPSSQRFTKGNSRRSYQVIRDALRNCNVPVVKVDGQEADDVVATLVE
        +PSL NFGRWVSIFFEEVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTR PSSQRFTKGN R SYQVIRDALR+CNVPVVKVDG EADDVVATLVE
Subjt:  RPSLHNFGRWVSIFFEEVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQPSSQRFTKGNSRRSYQVIRDALRNCNVPVVKVDGQEADDVVATLVE

Query:  QVLQKGVRVVIASPDKDFKQLISDNVQLVMPLPELNRWSFYTLRHYIAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTALKLLKKHGSLENLL
        QVLQ+GVRVV+ASPDKDFKQLIS++VQLVMPLPELNRWSFYT+RHY+AQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTALKLLKKHGSLENLL
Subjt:  QVLQKGVRVVIASPDKDFKQLISDNVQLVMPLPELNRWSFYTLRHYIAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTALKLLKKHGSLENLL

Query:  SAAAIRTVGKPYAQDTLTKYAEYLRTNYKVLALRRDVDVQFQDEWLVERDRQNDSTILSKFVENNDRNSLVQPSQQV
        SAAAIRTVGKPYAQD LTKYAEYLRTNYKVLALRRDVDVQFQDEWLVERDR+NDSTILSKFVENNDRN LVQPS+QV
Subjt:  SAAAIRTVGKPYAQDTLTKYAEYLRTNYKVLALRRDVDVQFQDEWLVERDRQNDSTILSKFVENNDRNSLVQPSQQV

A0A6J1DGI1 uncharacterized protein LOC1110202335.6e-18587Show/hide
Query:  MAEASANIGVNFPPFLNSSSRSSLPSRTLKAESALTSNLKPFTWRTKPLKLTAFAASSRSTSAAFNQTDGGQFHPRIEADNTRKGRVFFLDVNPLCYQGN
        MAEA ANIGVN PPFLNS+SRSSLPSRTLK ES LT+  K  +WRTK L+L+AF  +S ST   F QTDGG   P IEADN RKGRVFFLDVNPLCY+G+
Subjt:  MAEASANIGVNFPPFLNSSSRSSLPSRTLKAESALTSNLKPFTWRTKPLKLTAFAASSRSTSAAFNQTDGGQFHPRIEADNTRKGRVFFLDVNPLCYQGN

Query:  RPSLHNFGRWVSIFFEEVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQPSSQRFTKGNSRRSYQVIRDALRNCNVPVVKVDGQEADDVVATLVE
        RPSLHNFGRW SIFFE+VSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQ SSQR+TKGNSRR YQVIRDALRNCNVPVVKVDG EADDVVATLV+
Subjt:  RPSLHNFGRWVSIFFEEVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQPSSQRFTKGNSRRSYQVIRDALRNCNVPVVKVDGQEADDVVATLVE

Query:  QVLQKGVRVVIASPDKDFKQLISDNVQLVMPLPELNRWSFYTLRHYIAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTALKLLKKHGSLENLL
        QVLQ+G RVVIASPDKDFKQLIS++VQLVMPLPELNRWSFYTLRHY+AQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTA+KLLKKHGSLENLL
Subjt:  QVLQKGVRVVIASPDKDFKQLISDNVQLVMPLPELNRWSFYTLRHYIAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTALKLLKKHGSLENLL

Query:  SAAAIRTVGKPYAQDTLTKYAEYLRTNYKVLALRRDVDVQFQDEWLVERDRQNDSTILSKFVENNDRNSLVQPSQQV
        SAAAIRTVG+PYAQD LTKYA+YLRTNYKVLALRRDVDVQFQ+EWLVERDRQNDS ILSKFVENNDRNSLVQPS++V
Subjt:  SAAAIRTVGKPYAQDTLTKYAEYLRTNYKVLALRRDVDVQFQDEWLVERDRQNDSTILSKFVENNDRNSLVQPSQQV

A0A6J1JHT6 uncharacterized protein LOC1114871201.0e-18186.06Show/hide
Query:  MAEASANIGVNFPPFLNSSSRSSLPSRTLK-AESALTSNLKPFTWRTKPLKLTAFAASSRSTSAAFNQTDGGQFHPRIEADNTRKGRVFFLDVNPLCYQG
        MAEASANIG+N PPFLNS+S +SLPSRTLK AES  T+  K  +WRTKPLKL+ FAA+SRSTS+ F Q + G+  PR+EADN RKGRVFFLDVNPLCYQG
Subjt:  MAEASANIGVNFPPFLNSSSRSSLPSRTLK-AESALTSNLKPFTWRTKPLKLTAFAASSRSTSAAFNQTDGGQFHPRIEADNTRKGRVFFLDVNPLCYQG

Query:  NRPSLHNFGRWVSIFFEEVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQPSSQRFTKGNSRRSYQVIRDALRNCNVPVVKVDGQEADDVVATLV
        +RPSLHNFGRWVSIFFEEVSHSDPVIAV DGEGGSEHRRLLLPSYKAHRIKFTRQ SSQRFTKGNS RSYQVIRDALR+C+VPV+K+ G EADDVVATLV
Subjt:  NRPSLHNFGRWVSIFFEEVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQPSSQRFTKGNSRRSYQVIRDALRNCNVPVVKVDGQEADDVVATLV

Query:  EQVLQKGVRVVIASPDKDFKQLISDNVQLVMPLPELNRWSFYTLRHYIAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTALKLLKKHGSLENL
        EQVLQ+G R VIASPDKDFKQLIS++VQLVMPLPELNRWSFYTL+HY+AQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTA+KLLKKHGSLENL
Subjt:  EQVLQKGVRVVIASPDKDFKQLISDNVQLVMPLPELNRWSFYTLRHYIAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTALKLLKKHGSLENL

Query:  LSAAAIRTVGKPYAQDTLTKYAEYLRTNYKVLALRRDVDVQFQDEWLVERDRQNDSTILSKFVENNDRNSLVQ
        LSAAAIRTVGKPYAQD LTKYA+YLRTNYKVLALRRDVDVQF++EWLVERDRQNDSTILSKFVENNDRNSL +
Subjt:  LSAAAIRTVGKPYAQDTLTKYAEYLRTNYKVLALRRDVDVQFQDEWLVERDRQNDSTILSKFVENNDRNSLVQ

SwissProt top hitse value%identityAlignment
O67550 5'-3' exonuclease2.4e-2328.18Show/hide
Query:  FLDVNPLCYQGNRPSLHNFGRWVSIFFEEVSHSDP--VIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQPSSQRFTKGNSRRSYQVIRDALRNCNVPVVKV
        F  + PL      P+   +G ++ + F  +    P  ++ VFD    ++ R  +   YK  R K       Q            VI++ L+   +P++++
Subjt:  FLDVNPLCYQGNRPSLHNFGRWVSIFFEEVSHSDP--VIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQPSSQRFTKGNSRRSYQVIRDALRNCNVPVVKV

Query:  DGQEADDVVATLVEQVLQKGVRVVIASPDKDFKQLISDNVQLVMPLPELNRWSFYTLRHYIAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTA
         G EADDV+A L E+  QKG +V I SPDKD  QL+S+NV ++ P+ +      +T    I ++  +P        ++GD+VD VPGI+    G G KTA
Subjt:  DGQEADDVVATLVEQVLQKGVRVVIASPDKDFKQLISDNVQLVMPLPELNRWSFYTLRHYIAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTA

Query:  LKLLKKHGSLENLLSAAAIRTVGKPYAQDTLTKYAEYLRTNYKVLALRRDVDVQFQDEWLVERDRQNDSTILSKFVENNDRNSLVQPSQQV
        + +LKK+GS+EN+L         + + ++      E L  +YK++ L  D+D++  +E L  + ++ D   L + ++  +  SL++   ++
Subjt:  LKLLKKHGSLENLLSAAAIRTVGKPYAQDTLTKYAEYLRTNYKVLALRRDVDVQFQDEWLVERDRQNDSTILSKFVENNDRNSLVQPSQQV

P52026 DNA polymerase I4.7e-1924.74Show/hide
Query:  KGRVFFLDVNPLCYQG--NRPSLHN---------FGRWVSIFFEEVSHSDP--VIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQPSSQRFTKGNSRRSYQ
        K ++  +D N + Y+     P LHN         +G +  +  + ++   P  ++  FD  G +  R      YK  R +   + S Q          + 
Subjt:  KGRVFFLDVNPLCYQG--NRPSLHN---------FGRWVSIFFEEVSHSDP--VIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQPSSQRFTKGNSRRSYQ

Query:  VIRDALRNCNVPVVKVDGQEADDVVATLVEQVLQKGVRVVIASPDKDFKQLISDNVQLVMPLPELNRWSFYTLRHYIAQYNCDPCSDLSLRCIMGDEVDG
        ++R+ L+   +P  ++D  EADD++ T+  +  ++G  V + S D+D  QL S  V + +    +     YT    + +Y   P   + L+ +MGD+ D 
Subjt:  VIRDALRNCNVPVVKVDGQEADDVVATLVEQVLQKGVRVVIASPDKDFKQLISDNVQLVMPLPELNRWSFYTLRHYIAQYNCDPCSDLSLRCIMGDEVDG

Query:  VPGIQHVAPGFGRKTALKLLKKHGSLENLLSAAAIRTVGKPYAQDTLTKYAEYLRTNYKVLALRRDVDVQFQDEWLVERDRQNDSTI
        +PG+    PG G KTA+KLLK+ G++EN+L  A+I  +     ++ L +Y +    + ++ A+ RD  V+   + +V +    +  +
Subjt:  VPGIQHVAPGFGRKTALKLLKKHGSLENLLSAAAIRTVGKPYAQDTLTKYAEYLRTNYKVLALRRDVDVQFQDEWLVERDRQNDSTI

Q04957 DNA polymerase I1.2e-1929.78Show/hide
Query:  EEVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQPSSQRFTKGNSRRSYQVIRDALRNCNVPVVKVDGQEADDVVATLVEQVLQKGVRVVIASPD
        EE +H   ++  FD  G +  R      YK  R +   + S Q          + ++R+ LR   +P  +++  EADD++ TL  +  Q+G  V + S D
Subjt:  EEVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQPSSQRFTKGNSRRSYQVIRDALRNCNVPVVKVDGQEADDVVATLVEQVLQKGVRVVIASPD

Query:  KDFKQLISDNVQLVMPLPELNRWSFYTLRHYIAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTALKLLKKHGSLENLLSAAAIRTVGKPYAQD
        +D  QL S +V + +    +     YT      +Y   P   + L+ +MGD+ D +PG+    PG G KTA+KLL++ G++EN+L  A+I  +     ++
Subjt:  KDFKQLISDNVQLVMPLPELNRWSFYTLRHYIAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTALKLLKKHGSLENLLSAAAIRTVGKPYAQD

Query:  TLTKYAEYLRTNYKVLALRRDVDVQ
        TL ++ E    + K+ A+RRD  V+
Subjt:  TLTKYAEYLRTNYKVLALRRDVDVQ

Q92GB7 DNA polymerase I8.0e-1928.77Show/hide
Query:  VIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQPSSQRFTKGNSRRSYQVIRDALRNCNVPVVKVDGQEADDVVATLVEQVLQKGVRVVIASPDKDFKQLIS
        V  VFD  GG   R  + P YKA+R      P      +        ++RD   N N P+++ +G EADD++AT   +    G  VVI S DKD  QL++
Subjt:  VIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQPSSQRFTKGNSRRSYQVIRDALRNCNVPVVKVDGQEADDVVATLVEQVLQKGVRVVIASPDKDFKQLIS

Query:  DNVQLVMPLPELNRWSFYTLRHYIAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTALKLLKKHGSLENLLSAAAIRTVGKPYAQDTLTKYAEY
        +N+++  PL    +  + T    + ++         +  ++GD  D +PG+    P  G KTA  L+ + GS+EN+ +  ++  V     ++TL    E 
Subjt:  DNVQLVMPLPELNRWSFYTLRHYIAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTALKLLKKHGSLENLLSAAAIRTVGKPYAQDTLTKYAEY

Query:  LRTNYKVLALRRDVDVQFQ
           +++++ L  +VD+ FQ
Subjt:  LRTNYKVLALRRDVDVQFQ

Q9RLB6 DNA polymerase I2.7e-1929.68Show/hide
Query:  VIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQPSSQRFTKGNSRRSYQVIRDALRNCNVPVVKVDGQEADDVVATLVEQVLQKGVRVVIASPDKDFKQLIS
        V  VFD  GG   R  + P YKA+R      P      +        ++RD   N N P+++ +G EADD++AT   +    G  VVI S DKD  QL+S
Subjt:  VIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQPSSQRFTKGNSRRSYQVIRDALRNCNVPVVKVDGQEADDVVATLVEQVLQKGVRVVIASPDKDFKQLIS

Query:  DNVQLVMPLPELNRWSFYTLRHYIAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTALKLLKKHGSLENLLSAAAIRTVGKPYAQDTLTKYAEY
        +N+++  PL    R  + T    + ++         +  ++GD  D +PG+    P  G KTA  L+ + GS+EN+ +  ++  V     ++TL    E 
Subjt:  DNVQLVMPLPELNRWSFYTLRHYIAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTALKLLKKHGSLENLLSAAAIRTVGKPYAQDTLTKYAEY

Query:  LRTNYKVLALRRDVDVQFQ
           +++++ L  +VD+ FQ
Subjt:  LRTNYKVLALRRDVDVQFQ

Arabidopsis top hitse value%identityAlignment
AT1G34380.1 5'-3' exonuclease family protein4.9e-6455.61Show/hide
Query:  RTKPLKLTAFAASSRSTSAAFNQTDGGQFHPR------IEADNTRKGRVFFLDVNPLCYQGNRPSLHNFGRWVSIFFEEVSHSDPVIAVFDGEGGSEHRR
        RTK +  ++ + SS S+   F++T   Q   +       E    +  RVFFLDV+PLCY+GN+PS   FG W+S+FF +VS +DPVIAV DGE G++ RR
Subjt:  RTKPLKLTAFAASSRSTSAAFNQTDGGQFHPR------IEADNTRKGRVFFLDVNPLCYQGNRPSLHNFGRWVSIFFEEVSHSDPVIAVFDGEGGSEHRR

Query:  LLLPSYKAHRIKFTRQPSSQRFTKGNSRRSYQVIRDALRNCNVPVVKVDGQEADDVVATLVEQVLQKGVRVVIASPDKDFKQLISDNVQLVMPLPELNRW
         LLPSYKAHR    + P+  R+    S+R +Q + + LR CNVPVV+++G EADDVVATL+EQ +Q+G R VIASPDKDFKQLIS+NVQ+V+PL +L RW
Subjt:  LLLPSYKAHRIKFTRQPSSQRFTKGNSRRSYQVIRDALRNCNVPVVKVDGQEADDVVATLVEQVLQKGVRVVIASPDKDFKQLISDNVQLVMPLPELNRW

Query:  SFYTLRHYIAQYNCDPCSDLSLR
        SFYTL+HY AQYNCDP SDLS R
Subjt:  SFYTLRHYIAQYNCDPCSDLSLR

AT1G34380.2 5'-3' exonuclease family protein4.8e-11261.92Show/hide
Query:  RTKPLKLTAFAASSRSTSAAFNQTDGGQFHPR------IEADNTRKGRVFFLDVNPLCYQGNRPSLHNFGRWVSIFFEEVSHSDPVIAVFDGEGGSEHRR
        RTK +  ++ + SS S+   F++T   Q   +       E    +  RVFFLDV+PLCY+GN+PS   FG W+S+FF +VS +DPVIAV DGE G++ RR
Subjt:  RTKPLKLTAFAASSRSTSAAFNQTDGGQFHPR------IEADNTRKGRVFFLDVNPLCYQGNRPSLHNFGRWVSIFFEEVSHSDPVIAVFDGEGGSEHRR

Query:  LLLPSYKAHRIKFTRQPSSQRFTKGNSRRSYQVIRDALRNCNVPVVKVDGQEADDVVATLVEQVLQKGVRVVIASPDKDFKQLISDNVQLVMPLPELNRW
         LLPSYKAHR    + P+  R+    S+R +Q + + LR CNVPVV+++G EADDVVATL+EQ +Q+G R VIASPDKDFKQLIS+NVQ+V+PL +L RW
Subjt:  LLLPSYKAHRIKFTRQPSSQRFTKGNSRRSYQVIRDALRNCNVPVVKVDGQEADDVVATLVEQVLQKGVRVVIASPDKDFKQLISDNVQLVMPLPELNRW

Query:  SFYTLRHYIAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTALKLLKKHGSLENLLSAAAIRTVGKPYAQDTLTKYAEYLRTNYKVLALRRDVD
        SFYTL+HY AQYNCDP SDLS RCIMGDEVDGVPGIQH+ P FGRKTA+KL++KHGSLE+LLSAAA+RTVG+PYAQ+ LTKYA+YLR NY+VLAL RDV 
Subjt:  SFYTLRHYIAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTALKLLKKHGSLENLLSAAAIRTVGKPYAQDTLTKYAEYLRTNYKVLALRRDVD

Query:  VQFQDEWLVERDRQNDSTILSKF
        VQ Q+EWL+ERD  NDS +LS F
Subjt:  VQFQDEWLVERDRQNDSTILSKF

AT3G52050.2 5'-3' exonuclease family protein1.4e-1827.67Show/hide
Query:  GSEHRRLLLPSYKAHRIKFTRQPSSQRFTKGNSRRSYQVIRDALRNCNVPVVKVDGQEADDVVATLVEQVLQKGVRVVIASPDKDFKQLISDNVQLVMPL
        G   R  L P+YK++     R P+     +G      Q ++ +++  ++ V++V G EADDV+ TL  + +  G +V + SPDKDF Q++S +++L+   
Subjt:  GSEHRRLLLPSYKAHRIKFTRQPSSQRFTKGNSRRSYQVIRDALRNCNVPVVKVDGQEADDVVATLVEQVLQKGVRVVIASPDKDFKQLISDNVQLVMPL

Query:  PELNRWSFYTLRHYIAQY-NCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTALKLLKKHGSLENLLSAAAIRTVGKPYAQDTLTKYAEYLRTNYKVL
        P  +  + + +  +  ++ N +P   + +  + GD+ D +PG+     G G   A++L+ + G+LENLL +      GK   +++L   A+    + K+ 
Subjt:  PELNRWSFYTLRHYIAQY-NCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTALKLLKKHGSLENLLSAAAIRTVGKPYAQDTLTKYAEYLRTNYKVL

Query:  ALRRDV
         LR D+
Subjt:  ALRRDV

AT3G52050.4 5'-3' exonuclease family protein7.4e-2028.18Show/hide
Query:  VIAVFDGEG-----GSEHRRLLLPSYKAHRIKFTRQPSSQRFTKGNSRRSYQVIRDALRNCNVPVVKVDGQEADDVVATLVEQVLQKGVRVVIASPDKDF
        V  VFD +G     G   R  L P+YK++     R P+     +G      Q ++ +++  ++ V++V G EADDV+ TL  + +  G +V + SPDKDF
Subjt:  VIAVFDGEG-----GSEHRRLLLPSYKAHRIKFTRQPSSQRFTKGNSRRSYQVIRDALRNCNVPVVKVDGQEADDVVATLVEQVLQKGVRVVIASPDKDF

Query:  KQLISDNVQLVMPLPELNRWSFYTLRHYIAQY-NCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTALKLLKKHGSLENLLSAAAIRTVGKPYAQDTL
         Q++S +++L+   P  +  + + +  +  ++ N +P   + +  + GD+ D +PG+     G G   A++L+ + G+LENLL +      GK   +++L
Subjt:  KQLISDNVQLVMPLPELNRWSFYTLRHYIAQY-NCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTALKLLKKHGSLENLLSAAAIRTVGKPYAQDTL

Query:  TKYAEYLRTNYKVLALRRDV
           A+    + K+  LR D+
Subjt:  TKYAEYLRTNYKVLALRRDV

AT3G52050.5 5'-3' exonuclease family protein9.6e-2026.16Show/hide
Query:  GRVFFLDVNPLCYQ-----------GNRPSLHNFGRWVSIFFEEVSHSDPV-------IAVFDGEGGSEHRRLLLPSYKAHRIKFTRQPSSQRFTKGNSR
        GRV  +D   + Y+           G+         WV   F  +S    V       +AV     G   R  L P+YK++     R P+     +G   
Subjt:  GRVFFLDVNPLCYQ-----------GNRPSLHNFGRWVSIFFEEVSHSDPV-------IAVFDGEGGSEHRRLLLPSYKAHRIKFTRQPSSQRFTKGNSR

Query:  RSYQVIRDALRNCNVPVVKVDGQEADDVVATLVEQVLQKGVRVVIASPDKDFKQLISDNVQLVMPLPELNRWSFYTLRHYIAQY-NCDPCSDLSLRCIMG
           Q ++ +++  ++ V++V G EADDV+ TL  + +  G +V + SPDKDF Q++S +++L+   P  +  + + +  +  ++ N +P   + +  + G
Subjt:  RSYQVIRDALRNCNVPVVKVDGQEADDVVATLVEQVLQKGVRVVIASPDKDFKQLISDNVQLVMPLPELNRWSFYTLRHYIAQY-NCDPCSDLSLRCIMG

Query:  DEVDGVPGIQHVAPGFGRKTALKLLKKHGSLENLLSA
        D+ D +PG+     G G   A++L+ + G+LENLL +
Subjt:  DEVDGVPGIQHVAPGFGRKTALKLLKKHGSLENLLSA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTGAGGCGAGCGCCAACATAGGCGTTAATTTTCCGCCATTTTTGAACTCTTCATCGCGTAGTTCATTACCGTCAAGAACTCTGAAAGCTGAGTCCGCATTGACATC
CAATCTCAAACCCTTTACATGGAGAACCAAGCCGCTGAAGCTCACTGCCTTTGCAGCATCTTCTCGTTCTACTTCTGCGGCTTTTAATCAAACAGATGGTGGGCAGTTTC
ACCCAAGAATTGAAGCTGATAATACAAGAAAGGGAAGGGTTTTTTTCTTGGACGTAAACCCTTTATGCTACCAAGGTAACCGACCTAGTTTGCACAATTTTGGTCGCTGG
GTTTCCATCTTCTTCGAGGAAGTTAGCCACAGTGACCCTGTTATTGCTGTTTTTGATGGGGAAGGAGGTAGTGAGCATCGCAGGCTGTTGTTACCCTCATATAAAGCACA
TCGGATCAAATTCACAAGACAACCATCTTCACAAAGATTTACAAAGGGAAATTCTAGAAGGTCATATCAAGTGATAAGAGACGCTCTAAGAAACTGTAATGTTCCAGTTG
TAAAGGTTGATGGTCAGGAAGCAGACGATGTTGTTGCTACACTTGTGGAACAAGTTTTGCAAAAAGGGGTCCGGGTGGTAATAGCCTCTCCTGATAAAGATTTCAAGCAG
TTGATTTCAGACAATGTCCAACTCGTGATGCCTTTGCCGGAGCTCAATAGATGGTCCTTTTACACCTTAAGGCACTACATAGCTCAGTACAACTGTGATCCGTGCTCGGA
CTTGAGTCTTAGATGCATTATGGGTGACGAGGTAGATGGCGTTCCAGGAATCCAGCATGTTGCTCCTGGATTTGGTCGAAAAACAGCGTTGAAGCTCTTGAAGAAACATG
GTTCTTTGGAGAATCTACTCAGCGCTGCTGCAATCAGAACTGTGGGCAAACCTTATGCACAAGACACACTTACAAAGTATGCCGAATACCTGCGAACGAACTATAAAGTT
CTAGCCTTAAGGAGAGATGTTGATGTTCAATTTCAAGATGAGTGGTTAGTTGAAAGAGACAGGCAAAATGATTCGACAATTTTATCTAAGTTTGTAGAAAACAATGACAG
AAACTCGCTGGTTCAACCATCACAACAGGTCTAA
mRNA sequenceShow/hide mRNA sequence
TTCCTTCAGAATTTTTGTTTTTCCGCATCATTTTTCCTGATAGGAACCGTACTAAAATGGCTGAGGCGAGCGCCAACATAGGCGTTAATTTTCCGCCATTTTTGAACTCT
TCATCGCGTAGTTCATTACCGTCAAGAACTCTGAAAGCTGAGTCCGCATTGACATCCAATCTCAAACCCTTTACATGGAGAACCAAGCCGCTGAAGCTCACTGCCTTTGC
AGCATCTTCTCGTTCTACTTCTGCGGCTTTTAATCAAACAGATGGTGGGCAGTTTCACCCAAGAATTGAAGCTGATAATACAAGAAAGGGAAGGGTTTTTTTCTTGGACG
TAAACCCTTTATGCTACCAAGGTAACCGACCTAGTTTGCACAATTTTGGTCGCTGGGTTTCCATCTTCTTCGAGGAAGTTAGCCACAGTGACCCTGTTATTGCTGTTTTT
GATGGGGAAGGAGGTAGTGAGCATCGCAGGCTGTTGTTACCCTCATATAAAGCACATCGGATCAAATTCACAAGACAACCATCTTCACAAAGATTTACAAAGGGAAATTC
TAGAAGGTCATATCAAGTGATAAGAGACGCTCTAAGAAACTGTAATGTTCCAGTTGTAAAGGTTGATGGTCAGGAAGCAGACGATGTTGTTGCTACACTTGTGGAACAAG
TTTTGCAAAAAGGGGTCCGGGTGGTAATAGCCTCTCCTGATAAAGATTTCAAGCAGTTGATTTCAGACAATGTCCAACTCGTGATGCCTTTGCCGGAGCTCAATAGATGG
TCCTTTTACACCTTAAGGCACTACATAGCTCAGTACAACTGTGATCCGTGCTCGGACTTGAGTCTTAGATGCATTATGGGTGACGAGGTAGATGGCGTTCCAGGAATCCA
GCATGTTGCTCCTGGATTTGGTCGAAAAACAGCGTTGAAGCTCTTGAAGAAACATGGTTCTTTGGAGAATCTACTCAGCGCTGCTGCAATCAGAACTGTGGGCAAACCTT
ATGCACAAGACACACTTACAAAGTATGCCGAATACCTGCGAACGAACTATAAAGTTCTAGCCTTAAGGAGAGATGTTGATGTTCAATTTCAAGATGAGTGGTTAGTTGAA
AGAGACAGGCAAAATGATTCGACAATTTTATCTAAGTTTGTAGAAAACAATGACAGAAACTCGCTGGTTCAACCATCACAACAGGTCTAAATCTCCCTATGGTTGACAGA
AACTCGCTTGTTCAACCATCAAAATGGGTCTAAATCCCCCATTGGTGAATACATACGGGTAAAACTAGCTTAGCCTTTGTTAGATTATGATTTTATGCTCAGAAATATAC
ATACCTCCATACTATTAGTTGCTGAGGTAAAAAATGTATTATTCCTTTTCTCCTCATATGTTGTGGAATGAAACTAATCTATATGCAATATCAGGGCTCTAGTCGTACAC
TGTTTTCACTTTGAACAAATGTCTCTGTCATTGATTTGTTCAGGGAGGCAGCAGCATCTGGTTAGTTCAAAACATTTATAAACTAACTTTGCCTATACTTTGCCACCTTC
CAAT
Protein sequenceShow/hide protein sequence
MAEASANIGVNFPPFLNSSSRSSLPSRTLKAESALTSNLKPFTWRTKPLKLTAFAASSRSTSAAFNQTDGGQFHPRIEADNTRKGRVFFLDVNPLCYQGNRPSLHNFGRW
VSIFFEEVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQPSSQRFTKGNSRRSYQVIRDALRNCNVPVVKVDGQEADDVVATLVEQVLQKGVRVVIASPDKDFKQ
LISDNVQLVMPLPELNRWSFYTLRHYIAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTALKLLKKHGSLENLLSAAAIRTVGKPYAQDTLTKYAEYLRTNYKV
LALRRDVDVQFQDEWLVERDRQNDSTILSKFVENNDRNSLVQPSQQV