CuGenDBv2

Tan0016533 (gene) of Snake gourd v1 genome

Gene ID: Tan0016533
Organism: Trichosanthes anguina (Snake gourd v1)
Description: 53EXOc domain-containing protein
Genome location: LG11:823072..826938
RNA-Seq Expression: Tan0016533
Synteny: Tan0016533
Gene Ontology terms:
GO:0006261 - DNA-dependent DNA replication (biological process)
GO:0006302 - double-strand break repair (biological process)
GO:0071897 - DNA biosynthetic process (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0003677 - DNA binding (molecular function)
GO:0003887 - DNA-directed DNA polymerase activity (molecular function)
GO:0004527 - exonuclease activity (molecular function)
InterPro domains:
IPR002298 - DNA polymerase A
IPR002421 - 5'-3' exonuclease
IPR020045 - DNA polymerase I-like, H3TH domain
IPR020046 - 5'-3' exonuclease, alpha-helical arch, N-terminal
IPR029060 - PIN-like domain superfamily
IPR036279 - 5'-3' exonuclease, C-terminal domain superfamily
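The genome location string above follows a `sequence:start..end` pattern. As a minimal sketch of how such a string could be split into its parts, the snippet below parses it and reports the span length. The names `GenomeLocation` and `parse_location` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """Hypothetical container for a 'seqid:start..end' location."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> GenomeLocation:
    """Parse a location such as 'LG11:823072..826938'."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return GenomeLocation(match.group(1), int(match.group(2)), int(match.group(3)))


loc = parse_location("LG11:823072..826938")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption the gene spans 3867 bp on LG11; with half-open coordinates the figure would be one lower.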


Homology
GenBank top hits (e value, % identity, alignment)
XP_004138652.1 uncharacterized protein LOC101219234 [Cucumis sativus] | e value: 3.8e-180 | % identity: 85.45
Query:  MAGASANLGVNIPPFFNSASRTSLPPRTLNAESGITS--KANTWGTKPLKLSIFAASSSRSTSEAFKQTDGGKLQPRIEADESRKGRVFFLDVNPLCYQG
        MA ASAN+GVN PPF NS+S T LP RTL  E  +TS  K NTW TKPL L+ F A SSR TS AF QTD GK QPRIEAD SR GRVFFLDVNPLCYQG
Subjt:  MAGASANLGVNIPPFFNSASRTSLPPRTLNAESGITS--KANTWGTKPLKLSIFAASSSRSTSEAFKQTDGGKLQPRIEADESRKGRVFFLDVNPLCYQG

Query:  SRPSLHNFGRWVSIFFEEVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTR--QSQRFTKGNSRRSYQVIRDALRDCNVPVIKINGHEADDVVATLA
        S+PSL NFGRWVSIFFEEVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTR   S+RFTKGN R SYQVIRDALR CNVPV+++ GHEADDV+ATL 
Subjt:  SRPSLHNFGRWVSIFFEEVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTR--QSQRFTKGNSRRSYQVIRDALRDCNVPVIKINGHEADDVVATLA

Query:  EQVLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTALKLLKKHGSLENL
        EQVLQRG RVV+ASPDKDFKQLISED+QLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTALKLLKKHGSLENL
Subjt:  EQVLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTALKLLKKHGSLENL

Query:  LSAAAIRTVGRPYAQEALIKYADYLRTNYKVLALRRDVDVQFQDEWLVERDRQNDSTILSKFVENDDRNPLVQPSKRV
        LSAAAIRTVG+PYAQ+AL KYA+YLRTNYKVLALRRDVDVQFQDEWLVERDR+NDSTILSKFVEN+DRN LVQPSK+V
Subjt:  LSAAAIRTVGRPYAQEALIKYADYLRTNYKVLALRRDVDVQFQDEWLVERDRQNDSTILSKFVENDDRNPLVQPSKRV

XP_022152532.1 uncharacterized protein LOC111020233 [Momordica charantia] | e value: 5.7e-184 | % identity: 86.44
Query:  MAGASANLGVNIPPFFNSASRTSLPPRTLNAESGITSKANTWGTKPLKLSIFAASSSRSTSEAFKQTDGGKLQPRIEADESRKGRVFFLDVNPLCYQGSR
        MA A AN+GVNIPPF NSASR+SLP RTL  ES +T+K+N+W TK L+LS F  +S       FKQTDGG LQP IEAD  RKGRVFFLDVNPLCY+GSR
Subjt:  MAGASANLGVNIPPFFNSASRTSLPPRTLNAESGITSKANTWGTKPLKLSIFAASSSRSTSEAFKQTDGGKLQPRIEADESRKGRVFFLDVNPLCYQGSR

Query:  PSLHNFGRWVSIFFEEVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQ--SQRFTKGNSRRSYQVIRDALRDCNVPVIKINGHEADDVVATLAEQ
        PSLHNFGRW SIFFE+VSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQ  SQR+TKGNSRR YQVIRDALR+CNVPV+K++GHEADDVVATL +Q
Subjt:  PSLHNFGRWVSIFFEEVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQ--SQRFTKGNSRRSYQVIRDALRDCNVPVIKINGHEADDVVATLAEQ

Query:  VLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTALKLLKKHGSLENLLS
        VLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTA+KLLKKHGSLENLLS
Subjt:  VLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTALKLLKKHGSLENLLS

Query:  AAAIRTVGRPYAQEALIKYADYLRTNYKVLALRRDVDVQFQDEWLVERDRQNDSTILSKFVENDDRNPLVQPSKRV
        AAAIRTVGRPYAQ+AL KYADYLRTNYKVLALRRDVDVQFQ+EWLVERDRQNDS ILSKFVEN+DRN LVQPSKRV
Subjt:  AAAIRTVGRPYAQEALIKYADYLRTNYKVLALRRDVDVQFQDEWLVERDRQNDSTILSKFVENDDRNPLVQPSKRV

XP_022990137.1 uncharacterized protein LOC111487120 [Cucurbita maxima] | e value: 1.6e-181 | % identity: 87.63
Query:  MAGASANLGVNIPPFFNSASRTSLPPRTLN-AESGITSKANTWGTKPLKLSIFAASSSRSTSEAFKQTDGGKLQPRIEADESRKGRVFFLDVNPLCYQGS
        MA ASAN+G+NIPPF NSAS TSLP RTL  AES  T+KA +W TKPLKLS+FAA +SRSTS  FKQ + GKL PR+EAD  RKGRVFFLDVNPLCYQGS
Subjt:  MAGASANLGVNIPPFFNSASRTSLPPRTLN-AESGITSKANTWGTKPLKLSIFAASSSRSTSEAFKQTDGGKLQPRIEADESRKGRVFFLDVNPLCYQGS

Query:  RPSLHNFGRWVSIFFEEVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQ--SQRFTKGNSRRSYQVIRDALRDCNVPVIKINGHEADDVVATLAE
        RPSLHNFGRWVSIFFEEVSHSDPVIAV DGEGGSEHRRLLLPSYKAHRIKFTRQ  SQRFTKGNS RSYQVIRDALR C+VPVIKI GHEADDVVATL E
Subjt:  RPSLHNFGRWVSIFFEEVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQ--SQRFTKGNSRRSYQVIRDALRDCNVPVIKINGHEADDVVATLAE

Query:  QVLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTALKLLKKHGSLENLL
        QVLQRGFR VIASPDKDFKQLISEDVQLVMPLPELNRWSFYTL+HYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTA+KLLKKHGSLENLL
Subjt:  QVLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTALKLLKKHGSLENLL

Query:  SAAAIRTVGRPYAQEALIKYADYLRTNYKVLALRRDVDVQFQDEWLVERDRQNDSTILSKFVENDDRNPLVQ
        SAAAIRTVG+PYAQ+AL KYADYLRTNYKVLALRRDVDVQF++EWLVERDRQNDSTILSKFVEN+DRN L +
Subjt:  SAAAIRTVGRPYAQEALIKYADYLRTNYKVLALRRDVDVQFQDEWLVERDRQNDSTILSKFVENDDRNPLVQ

XP_023542307.1 uncharacterized protein LOC111802238 [Cucurbita pepo subsp. pepo] | e value: 4.5e-181 | % identity: 86.17
Query:  MAGASANLGVNIPPFFNSASRTSLPPRTLN-AESGITSKANTWGTKPLKLSIFAASSSRSTSEAFKQTDGGKLQPRIEADESRKGRVFFLDVNPLCYQGS
        MA ASAN+G+NIPPFFNSAS TSLP RTL  AES  T+KA +W TKPLKLS+FAA SSRSTS  FKQ + GKL PR+EAD  RKGRVFFLDVNPLCYQGS
Subjt:  MAGASANLGVNIPPFFNSASRTSLPPRTLN-AESGITSKANTWGTKPLKLSIFAASSSRSTSEAFKQTDGGKLQPRIEADESRKGRVFFLDVNPLCYQGS

Query:  RPSLHNFGRWVSIFFEEVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQ--SQRFTKGNSRRSYQVIRDALRDCNVPVIKINGHEADDVVATLAE
        RPSLHNFGRWVSIFFEEVSHSDPVIAV DGEGGSEHRRLLLPSYK+HRIKFTRQ  SQRFTKGNS RSYQVI+DALR CNVPVIKI GHEADDVVATL E
Subjt:  RPSLHNFGRWVSIFFEEVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQ--SQRFTKGNSRRSYQVIRDALRDCNVPVIKINGHEADDVVATLAE

Query:  QVLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTALKLLKKHGSLENLL
        QVLQRG R VIASPDKDFKQLISEDVQLVMPLPELNRWSFYTL+HY AQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTA+KLLKKHGSLENLL
Subjt:  QVLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTALKLLKKHGSLENLL

Query:  SAAAIRTVGRPYAQEALIKYADYLRTNYKVLALRRDVDVQFQDEWLVERDRQNDSTILSKFVENDDRNPLVQPSKR
        SAAAIRTVG+PYAQ+AL KYADYLRTNYKVLALRRDVDVQF++EWLVERDRQNDSTILSKFVEN+D+N L +   +
Subjt:  SAAAIRTVGRPYAQEALIKYADYLRTNYKVLALRRDVDVQFQDEWLVERDRQNDSTILSKFVENDDRNPLVQPSKR

XP_038884032.1 5'-3' exonuclease [Benincasa hispida] | e value: 4.4e-184 | % identity: 87.6
Query:  MAGASANLGV-NIPPFFNSASRTSLPPRTLNAESGITS--KANTWGTKPLKLSIFAASSSRSTSEAFKQTDGGKLQPRIEADESRKGRVFFLDVNPLCYQ
        MA ASAN+GV N+PPF NS+SRTSLP RTL AE+ +TS  KANTW TKPLKL++FAA SSR TS AF QTD GK QPRIEAD  R GRVFFLDVNPLCYQ
Subjt:  MAGASANLGV-NIPPFFNSASRTSLPPRTLNAESGITS--KANTWGTKPLKLSIFAASSSRSTSEAFKQTDGGKLQPRIEADESRKGRVFFLDVNPLCYQ

Query:  GSRPSLHNFGRWVSIFFEEVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQ--SQRFTKGNSRRSYQVIRDALRDCNVPVIKINGHEADDVVATL
        G+RPSLHNFGRW+SIFFEEVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQ  SQRFTKGNSRRSYQVIRDALR+CNVPV+K++G EADDVVATL
Subjt:  GSRPSLHNFGRWVSIFFEEVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQ--SQRFTKGNSRRSYQVIRDALRDCNVPVIKINGHEADDVVATL

Query:  AEQVLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTALKLLKKHGSLEN
         EQVLQRG RVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQY+CDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTALKLLKKHGSLEN
Subjt:  AEQVLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTALKLLKKHGSLEN

Query:  LLSAAAIRTVGRPYAQEALIKYADYLRTNYKVLALRRDVDVQFQDEWLVERDRQNDSTILSKFVENDDRNPLVQPSKRV
        LLSAAAIRTVG+PYAQ AL KYA+YLRTNYKVLALRRDVDVQFQDEWLVERDRQND  ILSKFVEN +RN L QPSKRV
Subjt:  LLSAAAIRTVGRPYAQEALIKYADYLRTNYKVLALRRDVDVQFQDEWLVERDRQNDSTILSKFVENDDRNPLVQPSKRV

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LQN4 53EXOc domain-containing protein | e value: 1.9e-180 | % identity: 85.45
Query:  MAGASANLGVNIPPFFNSASRTSLPPRTLNAESGITS--KANTWGTKPLKLSIFAASSSRSTSEAFKQTDGGKLQPRIEADESRKGRVFFLDVNPLCYQG
        MA ASAN+GVN PPF NS+S T LP RTL  E  +TS  K NTW TKPL L+ F A SSR TS AF QTD GK QPRIEAD SR GRVFFLDVNPLCYQG
Subjt:  MAGASANLGVNIPPFFNSASRTSLPPRTLNAESGITS--KANTWGTKPLKLSIFAASSSRSTSEAFKQTDGGKLQPRIEADESRKGRVFFLDVNPLCYQG

Query:  SRPSLHNFGRWVSIFFEEVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTR--QSQRFTKGNSRRSYQVIRDALRDCNVPVIKINGHEADDVVATLA
        S+PSL NFGRWVSIFFEEVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTR   S+RFTKGN R SYQVIRDALR CNVPV+++ GHEADDV+ATL 
Subjt:  SRPSLHNFGRWVSIFFEEVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTR--QSQRFTKGNSRRSYQVIRDALRDCNVPVIKINGHEADDVVATLA

Query:  EQVLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTALKLLKKHGSLENL
        EQVLQRG RVV+ASPDKDFKQLISED+QLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTALKLLKKHGSLENL
Subjt:  EQVLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTALKLLKKHGSLENL

Query:  LSAAAIRTVGRPYAQEALIKYADYLRTNYKVLALRRDVDVQFQDEWLVERDRQNDSTILSKFVENDDRNPLVQPSKRV
        LSAAAIRTVG+PYAQ+AL KYA+YLRTNYKVLALRRDVDVQFQDEWLVERDR+NDSTILSKFVEN+DRN LVQPSK+V
Subjt:  LSAAAIRTVGRPYAQEALIKYADYLRTNYKVLALRRDVDVQFQDEWLVERDRQNDSTILSKFVENDDRNPLVQPSKRV

A0A1S3B2X7 5'-3' exonuclease | e value: 1.0e-178 | % identity: 85.19
Query:  MAGASANLGVNIPPFFNSASRTSLPPRTLNAESGITS--KANTWGTKPLKLSIFAASSSRSTSEAFKQTDGGKLQPRIEADESRKGRVFFLDVNPLCYQG
        M  ASA +GVN PPF NS+SRT LP RT      +TS  K NTW TKPLKL+ F   SSR TS AF QTD GK QPRIEAD  RKGRVFFLDVNPLCYQG
Subjt:  MAGASANLGVNIPPFFNSASRTSLPPRTLNAESGITS--KANTWGTKPLKLSIFAASSSRSTSEAFKQTDGGKLQPRIEADESRKGRVFFLDVNPLCYQG

Query:  SRPSLHNFGRWVSIFFEEVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTR--QSQRFTKGNSRRSYQVIRDALRDCNVPVIKINGHEADDVVATLA
        ++PSL NFGRWVSIFFEEVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTR   SQRFTKGN R SYQVIRDALR CNVPV+K++GHEADDVVATL 
Subjt:  SRPSLHNFGRWVSIFFEEVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTR--QSQRFTKGNSRRSYQVIRDALRDCNVPVIKINGHEADDVVATLA

Query:  EQVLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTALKLLKKHGSLENL
        EQVLQRG RVV+ASPDKDFKQLISEDVQLVMPLPELNRWSFYT+RHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTALKLLKKHGSLENL
Subjt:  EQVLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTALKLLKKHGSLENL

Query:  LSAAAIRTVGRPYAQEALIKYADYLRTNYKVLALRRDVDVQFQDEWLVERDRQNDSTILSKFVENDDRNPLVQPSKRV
        LSAAAIRTVG+PYAQ+AL KYA+YLRTNYKVLALRRDVDVQFQDEWLVERDR+NDSTILSKFVEN+DRN LVQPSK+V
Subjt:  LSAAAIRTVGRPYAQEALIKYADYLRTNYKVLALRRDVDVQFQDEWLVERDRQNDSTILSKFVENDDRNPLVQPSKRV

A0A6J1DGI1 uncharacterized protein LOC111020233 | e value: 2.8e-184 | % identity: 86.44
Query:  MAGASANLGVNIPPFFNSASRTSLPPRTLNAESGITSKANTWGTKPLKLSIFAASSSRSTSEAFKQTDGGKLQPRIEADESRKGRVFFLDVNPLCYQGSR
        MA A AN+GVNIPPF NSASR+SLP RTL  ES +T+K+N+W TK L+LS F  +S       FKQTDGG LQP IEAD  RKGRVFFLDVNPLCY+GSR
Subjt:  MAGASANLGVNIPPFFNSASRTSLPPRTLNAESGITSKANTWGTKPLKLSIFAASSSRSTSEAFKQTDGGKLQPRIEADESRKGRVFFLDVNPLCYQGSR

Query:  PSLHNFGRWVSIFFEEVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQ--SQRFTKGNSRRSYQVIRDALRDCNVPVIKINGHEADDVVATLAEQ
        PSLHNFGRW SIFFE+VSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQ  SQR+TKGNSRR YQVIRDALR+CNVPV+K++GHEADDVVATL +Q
Subjt:  PSLHNFGRWVSIFFEEVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQ--SQRFTKGNSRRSYQVIRDALRDCNVPVIKINGHEADDVVATLAEQ

Query:  VLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTALKLLKKHGSLENLLS
        VLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTA+KLLKKHGSLENLLS
Subjt:  VLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTALKLLKKHGSLENLLS

Query:  AAAIRTVGRPYAQEALIKYADYLRTNYKVLALRRDVDVQFQDEWLVERDRQNDSTILSKFVENDDRNPLVQPSKRV
        AAAIRTVGRPYAQ+AL KYADYLRTNYKVLALRRDVDVQFQ+EWLVERDRQNDS ILSKFVEN+DRN LVQPSKRV
Subjt:  AAAIRTVGRPYAQEALIKYADYLRTNYKVLALRRDVDVQFQDEWLVERDRQNDSTILSKFVENDDRNPLVQPSKRV

A0A6J1HDC1 uncharacterized protein LOC111461781 | e value: 3.2e-180 | % identity: 85.64
Query:  MAGASANLGVNIPPFFNSASRTSLPPRTLN-AESGITSKANTWGTKPLKLSIFAASSSRSTSEAFKQTDGGKLQPRIEADESRKGRVFFLDVNPLCYQGS
        MA ASAN+G+NIPPF NS S TSLP RTL  AES  T+KA +W TKPLKLS+FAA SSRSTS  FKQ + GKL P +EAD  RKGRVFFLDVNPLCYQGS
Subjt:  MAGASANLGVNIPPFFNSASRTSLPPRTLN-AESGITSKANTWGTKPLKLSIFAASSSRSTSEAFKQTDGGKLQPRIEADESRKGRVFFLDVNPLCYQGS

Query:  RPSLHNFGRWVSIFFEEVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQ--SQRFTKGNSRRSYQVIRDALRDCNVPVIKINGHEADDVVATLAE
        RPSLHNFGRWVSIFFEEVS SDPVIAV DGEGGSEHRRLLLPSYKAHRIKFTRQ  SQRFTKGNS RSYQVIRDALR CNVPVIKI GHEADDVVATL E
Subjt:  RPSLHNFGRWVSIFFEEVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQ--SQRFTKGNSRRSYQVIRDALRDCNVPVIKINGHEADDVVATLAE

Query:  QVLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTALKLLKKHGSLENLL
        QVLQRGFR VIASPDKDFKQLISEDVQLVMPLPELNRWSFYTL+HYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTA+KLLKKHGSLENLL
Subjt:  QVLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTALKLLKKHGSLENLL

Query:  SAAAIRTVGRPYAQEALIKYADYLRTNYKVLALRRDVDVQFQDEWLVERDRQNDSTILSKFVENDDRNPLVQPSKR
        SAAA+RTVG+PYAQ+AL KYADYLRTNYKVLALRRD+DVQF++EWLV+RDRQNDSTILSKFVEN+DRN L +   +
Subjt:  SAAAIRTVGRPYAQEALIKYADYLRTNYKVLALRRDVDVQFQDEWLVERDRQNDSTILSKFVENDDRNPLVQPSKR

A0A6J1JHT6 uncharacterized protein LOC111487120 | e value: 7.6e-182 | % identity: 87.63
Query:  MAGASANLGVNIPPFFNSASRTSLPPRTLN-AESGITSKANTWGTKPLKLSIFAASSSRSTSEAFKQTDGGKLQPRIEADESRKGRVFFLDVNPLCYQGS
        MA ASAN+G+NIPPF NSAS TSLP RTL  AES  T+KA +W TKPLKLS+FAA +SRSTS  FKQ + GKL PR+EAD  RKGRVFFLDVNPLCYQGS
Subjt:  MAGASANLGVNIPPFFNSASRTSLPPRTLN-AESGITSKANTWGTKPLKLSIFAASSSRSTSEAFKQTDGGKLQPRIEADESRKGRVFFLDVNPLCYQGS

Query:  RPSLHNFGRWVSIFFEEVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQ--SQRFTKGNSRRSYQVIRDALRDCNVPVIKINGHEADDVVATLAE
        RPSLHNFGRWVSIFFEEVSHSDPVIAV DGEGGSEHRRLLLPSYKAHRIKFTRQ  SQRFTKGNS RSYQVIRDALR C+VPVIKI GHEADDVVATL E
Subjt:  RPSLHNFGRWVSIFFEEVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQ--SQRFTKGNSRRSYQVIRDALRDCNVPVIKINGHEADDVVATLAE

Query:  QVLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTALKLLKKHGSLENLL
        QVLQRGFR VIASPDKDFKQLISEDVQLVMPLPELNRWSFYTL+HYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTA+KLLKKHGSLENLL
Subjt:  QVLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTALKLLKKHGSLENLL

Query:  SAAAIRTVGRPYAQEALIKYADYLRTNYKVLALRRDVDVQFQDEWLVERDRQNDSTILSKFVENDDRNPLVQ
        SAAAIRTVG+PYAQ+AL KYADYLRTNYKVLALRRDVDVQF++EWLVERDRQNDSTILSKFVEN+DRN L +
Subjt:  SAAAIRTVGRPYAQEALIKYADYLRTNYKVLALRRDVDVQFQDEWLVERDRQNDSTILSKFVENDDRNPLVQ

SwissProt top hits (e value, % identity, alignment)
O67550 5'-3' exonuclease | e value: 9.6e-25 | % identity: 29.69
Query:  FLDVNPLCYQGSRPSLHNFGRWVSIFFEEVSHSDP--VIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSQRFTKGNSRRSYQVIRDALRDCNVPVIKING
        F  + PL      P+   +G ++ + F  +    P  ++ VFD    ++ R  +   YK  R K             +    VI++ L+   +P++++ G
Subjt:  FLDVNPLCYQGSRPSLHNFGRWVSIFFEEVSHSDP--VIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSQRFTKGNSRRSYQVIRDALRDCNVPVIKING

Query:  HEADDVVATLAEQVLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTALK
        +EADDV+A LAE+  Q+GF+V I SPDKD  QL+SE+V ++ P+ +      +T    + ++  +P        ++GD+VD VPGI+    G G KTA+ 
Subjt:  HEADDVVATLAEQVLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTALK

Query:  LLKKHGSLENLLSAAAIRTVGRPYAQEALIKYADYLRTNYKVLALRRDVDVQFQDE
        +LKK+GS+EN+L           + +E      + L  +YK++ L  D+D++  +E
Subjt:  LLKKHGSLENLLSAAAIRTVGRPYAQEALIKYADYLRTNYKVLALRRDVDVQFQDE

P52026 DNA polymerase I | e value: 4.9e-21 | % identity: 25
Query:  KGRVFFLDVNPLCYQG--SRPSLHNFGRWVSIFFEEVSHSDPVIA---VFDGEGGSEHRRLLLPSYKAHRIKFTRQSQRFTKGNSRRS-------YQVIR
        K ++  +D N + Y+   + P LHN         ++  H++ V     + +     E    +L ++ A +  F  ++ +  KG  +++       + ++R
Subjt:  KGRVFFLDVNPLCYQG--SRPSLHNFGRWVSIFFEEVSHSDPVIA---VFDGEGGSEHRRLLLPSYKAHRIKFTRQSQRFTKGNSRRS-------YQVIR

Query:  DALRDCNVPVIKINGHEADDVVATLAEQVLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPG
        + L+   +P  +++ +EADD++ T+A +  + GF V + S D+D  QL S  V + +    +     YT    + +Y   P   + L+ +MGD+ D +PG
Subjt:  DALRDCNVPVIKINGHEADDVVATLAEQVLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPG

Query:  IQHVAPGFGRKTALKLLKKHGSLENLLSAAAIRTVGRPYAQEALIKYADYLRTNYKVLALRRDVDVQFQDEWLVERDRQNDSTI
        +    PG G KTA+KLLK+ G++EN+L  A+I  +     +E L +Y D    + ++ A+ RD  V+   + +V +    +  +
Subjt:  IQHVAPGFGRKTALKLLKKHGSLENLLSAAAIRTVGRPYAQEALIKYADYLRTNYKVLALRRDVDVQFQDEWLVERDRQNDSTI

Q04957 DNA polymerase I | e value: 2.5e-20 | % identity: 29.61
Query:  LLPSYKAHRIKFTRQSQRFTKGNSRRS-------YQVIRDALRDCNVPVIKINGHEADDVVATLAEQVLQRGFRVVIASPDKDFKQLISEDVQLVMPLPE
        +L ++ A +  F  ++ +  KG  +++       + ++R+ LR   +P  ++  +EADD++ TLA +  Q GF V + S D+D  QL S  V + +    
Subjt:  LLPSYKAHRIKFTRQSQRFTKGNSRRS-------YQVIRDALRDCNVPVIKINGHEADDVVATLAEQVLQRGFRVVIASPDKDFKQLISEDVQLVMPLPE

Query:  LNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTALKLLKKHGSLENLLSAAAIRTVGRPYAQEALIKYADYLRTNYKVLALR
        +     YT      +Y   P   + L+ +MGD+ D +PG+    PG G KTA+KLL++ G++EN+L  A+I  +     +E L ++ +    + K+ A+R
Subjt:  LNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTALKLLKKHGSLENLLSAAAIRTVGRPYAQEALIKYADYLRTNYKVLALR

Query:  RDVDVQ
        RD  V+
Subjt:  RDVDVQ

Q9RLB6 DNA polymerase I | e value: 6.0e-19 | % identity: 29.49
Query:  VIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSQRFTKGNSRRSYQVIRDALRDCNVPVIKINGHEADDVVATLAEQVLQRGFRVVIASPDKDFKQLISED
        V  VFD  GG   R  + P YKA+R             +      ++RD   + N P+++ NG+EADD++AT A +    G  VVI S DKD  QL+SE+
Subjt:  VIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSQRFTKGNSRRSYQVIRDALRDCNVPVIKINGHEADDVVATLAEQVLQRGFRVVIASPDKDFKQLISED

Query:  VQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTALKLLKKHGSLENLLSAAAIRTVGRPYAQEALIKYADYLR
        +++  PL    R  + T    + ++         +  ++GD  D +PG+    P  G KTA  L+ + GS+EN+ +  ++  V     +E L    +   
Subjt:  VQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTALKLLKKHGSLENLLSAAAIRTVGRPYAQEALIKYADYLR

Query:  TNYKVLALRRDVDVQFQ
         +++++ L  +VD+ FQ
Subjt:  TNYKVLALRRDVDVQFQ

Q9S1G2 DNA polymerase I | e value: 1.8e-18 | % identity: 34.44
Query:  YQVIRDALRDCNVPVIKINGHEADDVVATLAEQVLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEV
        + +IR+A R  N+P I+  G EADD++AT A Q    G  V I S DKD  QL+S +V +   + +        +   + ++   P   + L+ + GD V
Subjt:  YQVIRDALRDCNVPVIKINGHEADDVVATLAEQVLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEV

Query:  DGVPGIQHVAPGFGRKTALKLLKKHGSLENLLS-AAAIRTVGRPYAQEALIKYADYLRTNYKVLALRRDVDVQFQDEWLV
        D VPGI    PG G KTA +LL+++G L+ LL  A  I+ V R   +E ++   D  R +  ++ LR DV +    + LV
Subjt:  DGVPGIQHVAPGFGRKTALKLLKKHGSLENLLS-AAAIRTVGRPYAQEALIKYADYLRTNYKVLALRRDVDVQFQDEWLV

Arabidopsis top hits (e value, % identity, alignment)
AT1G34380.1 5'-3' exonuclease family protein | e value: 4.1e-63 | % identity: 58.85
Query:  SSRSTSEAFKQTDGGK-LQPRI-----EADESRKGRVFFLDVNPLCYQGSRPSLHNFGRWVSIFFEEVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIK
        SS S+ E F +T   + LQ  +     E    +  RVFFLDV+PLCY+G++PS   FG W+S+FF +VS +DPVIAV DGE G++ RR LLPSYKAH   
Subjt:  SSRSTSEAFKQTDGGK-LQPRI-----EADESRKGRVFFLDVNPLCYQGSRPSLHNFGRWVSIFFEEVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIK

Query:  FTRQSQRFTKGNSRRSYQVIRDALRDCNVPVIKINGHEADDVVATLAEQVLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNC
          R+S    +  S+R +Q + + LR CNVPV++I GHEADDVVATL EQ +QRG+R VIASPDKDFKQLISE+VQ+V+PL +L RWSFYTL+HY AQYNC
Subjt:  FTRQSQRFTKGNSRRSYQVIRDALRDCNVPVIKINGHEADDVVATLAEQVLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNC

Query:  DPCSDLSLR
        DP SDLS R
Subjt:  DPCSDLSLR

AT1G34380.2 5'-3' exonuclease family protein | e value: 3.7e-112 | % identity: 65.37
Query:  SSRSTSEAFKQTDGGK-LQPRI-----EADESRKGRVFFLDVNPLCYQGSRPSLHNFGRWVSIFFEEVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIK
        SS S+ E F +T   + LQ  +     E    +  RVFFLDV+PLCY+G++PS   FG W+S+FF +VS +DPVIAV DGE G++ RR LLPSYKAH   
Subjt:  SSRSTSEAFKQTDGGK-LQPRI-----EADESRKGRVFFLDVNPLCYQGSRPSLHNFGRWVSIFFEEVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIK

Query:  FTRQSQRFTKGNSRRSYQVIRDALRDCNVPVIKINGHEADDVVATLAEQVLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNC
          R+S    +  S+R +Q + + LR CNVPV++I GHEADDVVATL EQ +QRG+R VIASPDKDFKQLISE+VQ+V+PL +L RWSFYTL+HY AQYNC
Subjt:  FTRQSQRFTKGNSRRSYQVIRDALRDCNVPVIKINGHEADDVVATLAEQVLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNC

Query:  DPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTALKLLKKHGSLENLLSAAAIRTVGRPYAQEALIKYADYLRTNYKVLALRRDVDVQFQDEWLVERDRQ
        DP SDLS RCIMGDEVDGVPGIQH+ P FGRKTA+KL++KHGSLE+LLSAAA+RTVGRPYAQEAL KYADYLR NY+VLAL RDV VQ Q+EWL+ERD  
Subjt:  DPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTALKLLKKHGSLENLLSAAAIRTVGRPYAQEALIKYADYLRTNYKVLALRRDVDVQFQDEWLVERDRQ

Query:  NDSTILSKF
        NDS +LS F
Subjt:  NDSTILSKF

AT3G52050.2 5'-3' exonuclease family protein | e value: 8.7e-21 | % identity: 28.92
Query:  GSEHRRLLLPSYKAHRIKFTRQSQRFTKGNSRRSYQVIRDALRDCNVPVIKINGHEADDVVATLAEQVLQRGFRVVIASPDKDFKQLISEDVQLVMPLPE
        G   R  L P+YK++R          T     +  Q ++ +++  ++ VI++ G EADDV+ TLA + +  GF+V + SPDKDF Q++S  ++L+   P 
Subjt:  GSEHRRLLLPSYKAHRIKFTRQSQRFTKGNSRRSYQVIRDALRDCNVPVIKINGHEADDVVATLAEQVLQRGFRVVIASPDKDFKQLISEDVQLVMPLPE

Query:  LNRWSFYTLRHYLAQY-NCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTALKLLKKHGSLENLLSAAAIRTVGRPYAQEALIKYADYLRTNYKVLAL
         +  + + +  +  ++ N +P   + +  + GD+ D +PG+     G G   A++L+ + G+LENLL   ++  +     +E+LI  AD    + K+  L
Subjt:  LNRWSFYTLRHYLAQY-NCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTALKLLKKHGSLENLLSAAAIRTVGRPYAQEALIKYADYLRTNYKVLAL

Query:  RRDV
        R D+
Subjt:  RRDV

AT3G52050.4 5'-3' exonuclease family protein | e value: 3.2e-23 | % identity: 27.6
Query:  SRKGRVFFLDVNPLCYQGSRPSL---------HNFGR--WVSIFFEEVS--------HSDPVIAVFDGEG-----GSEHRRLLLPSYKAHRIKFTRQSQR
        S  GRV  +D   + Y+     L         H  G   WV   F  +S            V  VFD +G     G   R  L P+YK++R         
Subjt:  SRKGRVFFLDVNPLCYQGSRPSL---------HNFGR--WVSIFFEEVS--------HSDPVIAVFDGEG-----GSEHRRLLLPSYKAHRIKFTRQSQR

Query:  FTKGNSRRSYQVIRDALRDCNVPVIKINGHEADDVVATLAEQVLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQY-NCDPCSDL
         T     +  Q ++ +++  ++ VI++ G EADDV+ TLA + +  GF+V + SPDKDF Q++S  ++L+   P  +  + + +  +  ++ N +P   +
Subjt:  FTKGNSRRSYQVIRDALRDCNVPVIKINGHEADDVVATLAEQVLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQY-NCDPCSDL

Query:  SLRCIMGDEVDGVPGIQHVAPGFGRKTALKLLKKHGSLENLLSAAAIRTVGRPYAQEALIKYADYLRTNYKVLALRRDV
         +  + GD+ D +PG+     G G   A++L+ + G+LENLL   ++  +     +E+LI  AD    + K+  LR D+
Subjt:  SLRCIMGDEVDGVPGIQHVAPGFGRKTALKLLKKHGSLENLLSAAAIRTVGRPYAQEALIKYADYLRTNYKVLALRRDV

AT3G52050.5 5'-3' exonuclease family protein | e value: 5.1e-21 | % identity: 27.73
Query:  SRKGRVFFLDVNPLCYQGSRPSL---------HNFGR--WVSIFFEEVSHSDPV-------IAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSQRFTKGNS
        S  GRV  +D   + Y+     L         H  G   WV   F  +S    V       +AV     G   R  L P+YK++R          T    
Subjt:  SRKGRVFFLDVNPLCYQGSRPSL---------HNFGR--WVSIFFEEVSHSDPV-------IAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSQRFTKGNS

Query:  RRSYQVIRDALRDCNVPVIKINGHEADDVVATLAEQVLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQY-NCDPCSDLSLRCIM
         +  Q ++ +++  ++ VI++ G EADDV+ TLA + +  GF+V + SPDKDF Q++S  ++L+   P  +  + + +  +  ++ N +P   + +  + 
Subjt:  RRSYQVIRDALRDCNVPVIKINGHEADDVVATLAEQVLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQY-NCDPCSDLSLRCIM

Query:  GDEVDGVPGIQHVAPGFGRKTALKLLKKHGSLENLLSA
        GD+ D +PG+     G G   A++L+ + G+LENLL +
Subjt:  GDEVDGVPGIQHVAPGFGRKTALKLLKKHGSLENLLSA
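The alignment blocks above use the usual BLAST-style middle line: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. A minimal sketch of how identity could be tallied from a query line and its middle line follows. The function name `alignment_stats` is hypothetical, and the example uses only the first ten columns of the first GenBank alignment, so it will not reproduce the whole-alignment percentages reported in the tables.

```python
def alignment_stats(query: str, midline: str) -> dict:
    """Count identical and positive columns from a BLAST-style midline.

    Assumes both strings cover the same alignment columns.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    # A column is identical when the midline repeats the query residue.
    identical = sum(1 for q, m in zip(query, midline) if m.isalpha() and m == q)
    # '+' marks a conservative substitution; positives include identities.
    positives = identical + midline.count("+")
    length = len(query)
    return {
        "length": length,
        "identical": identical,
        "positives": positives,
        "pct_identity": round(100.0 * identical / length, 2),
    }


# First ten columns of the alignment against XP_004138652.1.
stats = alignment_stats("MAGASANLGV", "MA ASAN+GV")
print(stats)
```

To approximate a reported % identity, the query and middle lines of every block in an alignment would need to be concatenated before calling the function.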


Sequences
CDS sequence
ATGGCGGGGGCGAGCGCCAACTTAGGCGTTAATATTCCGCCATTTTTTAACTCTGCTTCGCGTACTTCCTTACCGCCGAGAACTCTGAATGCCGAGTCGGGAATAACATC
GAAAGCCAATACATGGGGAACGAAGCCATTGAAGCTAAGTATCTTTGCGGCATCTTCTTCTCGTTCTACTTCTGAGGCTTTTAAGCAAACAGACGGTGGGAAGTTGCAGC
CAAGAATTGAAGCGGATGAGTCAAGAAAGGGAAGGGTCTTTTTCCTTGACGTAAATCCTCTCTGCTATCAAGGTAGCAGACCTAGTTTGCACAATTTTGGTCGCTGGGTT
TCCATCTTCTTCGAGGAAGTTAGCCACAGTGATCCTGTTATTGCCGTTTTTGATGGGGAAGGAGGTAGCGAGCATCGCAGGCTGTTGTTACCCTCATATAAAGCACATCG
GATCAAATTCACGAGACAATCACAAAGATTTACAAAGGGAAATTCTAGAAGGTCATATCAAGTGATAAGAGATGCTCTCAGAGACTGTAATGTGCCAGTTATAAAGATCA
ATGGTCACGAAGCAGATGATGTTGTTGCTACACTTGCGGAACAAGTTTTGCAAAGAGGGTTTCGGGTGGTAATAGCCTCTCCTGATAAAGATTTCAAGCAGTTGATTTCA
GAAGATGTCCAACTCGTGATGCCTTTGCCAGAGCTCAACAGATGGTCCTTTTACACCTTAAGACACTACCTAGCTCAGTATAACTGTGATCCGTGCTCTGACTTGAGTCT
TAGATGCATTATGGGTGATGAGGTAGATGGCGTTCCAGGAATCCAGCATGTTGCTCCTGGATTTGGTCGAAAGACTGCATTGAAGCTCTTAAAGAAACACGGTTCTTTGG
AGAACCTACTCAGTGCTGCTGCAATAAGAACTGTGGGCAGACCGTATGCACAAGAGGCACTTATAAAGTATGCTGATTACCTGCGAACGAACTATAAAGTTCTAGCCTTA
AGAAGAGATGTTGATGTTCAATTTCAAGACGAGTGGTTGGTCGAAAGAGACAGACAAAACGATTCAACTATTTTATCTAAGTTTGTAGAAAACGATGACAGAAACCCACT
TGTTCAACCATCGAAACGAGTCTAA
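One quick consistency check on a CDS is to translate it and compare the result with the listed protein sequence. The sketch below assumes the standard genetic code (NCBI translation table 1), which this record does not name explicitly; the function `translate` and the table construction are illustrative, not the database's own pipeline.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons and last four codons of the CDS listed above.
print(translate("ATGGCGGGGGCGAGCGCCAACTTAGGCGTT"))
print(translate("AAACGAGTCTAA"))
```

The two fragments translate to the start (`MAGASANLGV`) and end (`KRV`) of the protein sequence given in this record; passing the full CDS should yield the whole protein if the standard-code assumption holds.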
mRNA sequence
CATTTTTCCGAATCCATTTTCCTGATAGGACCCGTACAAGAAATGGCGGGGGCGAGCGCCAACTTAGGCGTTAATATTCCGCCATTTTTTAACTCTGCTTCGCGTACTTC
CTTACCGCCGAGAACTCTGAATGCCGAGTCGGGAATAACATCGAAAGCCAATACATGGGGAACGAAGCCATTGAAGCTAAGTATCTTTGCGGCATCTTCTTCTCGTTCTA
CTTCTGAGGCTTTTAAGCAAACAGACGGTGGGAAGTTGCAGCCAAGAATTGAAGCGGATGAGTCAAGAAAGGGAAGGGTCTTTTTCCTTGACGTAAATCCTCTCTGCTAT
CAAGGTAGCAGACCTAGTTTGCACAATTTTGGTCGCTGGGTTTCCATCTTCTTCGAGGAAGTTAGCCACAGTGATCCTGTTATTGCCGTTTTTGATGGGGAAGGAGGTAG
CGAGCATCGCAGGCTGTTGTTACCCTCATATAAAGCACATCGGATCAAATTCACGAGACAATCACAAAGATTTACAAAGGGAAATTCTAGAAGGTCATATCAAGTGATAA
GAGATGCTCTCAGAGACTGTAATGTGCCAGTTATAAAGATCAATGGTCACGAAGCAGATGATGTTGTTGCTACACTTGCGGAACAAGTTTTGCAAAGAGGGTTTCGGGTG
GTAATAGCCTCTCCTGATAAAGATTTCAAGCAGTTGATTTCAGAAGATGTCCAACTCGTGATGCCTTTGCCAGAGCTCAACAGATGGTCCTTTTACACCTTAAGACACTA
CCTAGCTCAGTATAACTGTGATCCGTGCTCTGACTTGAGTCTTAGATGCATTATGGGTGATGAGGTAGATGGCGTTCCAGGAATCCAGCATGTTGCTCCTGGATTTGGTC
GAAAGACTGCATTGAAGCTCTTAAAGAAACACGGTTCTTTGGAGAACCTACTCAGTGCTGCTGCAATAAGAACTGTGGGCAGACCGTATGCACAAGAGGCACTTATAAAG
TATGCTGATTACCTGCGAACGAACTATAAAGTTCTAGCCTTAAGAAGAGATGTTGATGTTCAATTTCAAGACGAGTGGTTGGTCGAAAGAGACAGACAAAACGATTCAAC
TATTTTATCTAAGTTTGTAGAAAACGATGACAGAAACCCACTTGTTCAACCATCGAAACGAGTCTAAATTGCCCTATGGTTGAACACATACAGGTAAAACTAGCTTAGCC
TTTGTTAGATTATGATTTTATGCTTGGAAATATACATACATCCATATTAAAAGTTTGCTGAGGTGAAAAATGTATTATTCCTTTTATCCTCATATACTGTGGAATCAAGC
TAATCTATATGCAATGGCAGAGCTCTACTGACT
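Because the mRNA sequence contains the CDS plus flanking untranslated regions, the UTR lengths can be recovered by locating the CDS inside the mRNA. The sketch below shows one way to do that with a plain substring search. The function name `locate_cds` is hypothetical, and the example uses short fragments taken from the start of the sequences above plus an invented placeholder trailer, not the full records.

```python
def locate_cds(mrna: str, cds: str) -> tuple:
    """Return (5' UTR length, CDS length, 3' UTR length) for a CDS within an mRNA."""
    mrna = "".join(mrna.split()).upper()
    cds = "".join(cds.split()).upper()
    start = mrna.find(cds)
    if start < 0:
        raise ValueError("CDS not found in mRNA (is the transcript spliced differently?)")
    return start, len(cds), len(mrna) - start - len(cds)


# Fragments for illustration only: the mRNA leader from this record, the first
# 18 nt of the CDS, and a made-up 3' trailer (placeholder, not from the record).
leader = "CATTTTTCCGAATCCATTTTCCTGATAGGACCCGTACAAGAA"
cds_fragment = "ATGGCGGGGGCGAGCGCC"
trailer = "TTTT"  # hypothetical placeholder

utr5, cds_len, utr3 = locate_cds(leader + cds_fragment + trailer, cds_fragment)
print(utr5, cds_len, utr3)
```

Running the same function on the full mRNA and CDS strings from this page would give the actual 5' and 3' UTR lengths of the transcript.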
Protein sequence
MAGASANLGVNIPPFFNSASRTSLPPRTLNAESGITSKANTWGTKPLKLSIFAASSSRSTSEAFKQTDGGKLQPRIEADESRKGRVFFLDVNPLCYQGSRPSLHNFGRWV
SIFFEEVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSQRFTKGNSRRSYQVIRDALRDCNVPVIKINGHEADDVVATLAEQVLQRGFRVVIASPDKDFKQLIS
EDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTALKLLKKHGSLENLLSAAAIRTVGRPYAQEALIKYADYLRTNYKVLAL
RRDVDVQFQDEWLVERDRQNDSTILSKFVENDDRNPLVQPSKRV