; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC04g1985 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC04g1985
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description5'-3' exonuclease
Genome locationMC04:26632933..26635646
RNA-Seq ExpressionMC04g1985
SyntenyMC04g1985
Gene Ontology termsGO:0006261 - DNA-dependent DNA replication (biological process)
GO:0006302 - double-strand break repair (biological process)
GO:0071897 - DNA biosynthetic process (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0003677 - DNA binding (molecular function)
GO:0003887 - DNA-directed DNA polymerase activity (molecular function)
GO:0004527 - exonuclease activity (molecular function)
InterPro domainsIPR002298 - DNA polymerase A
IPR002421 - 5'-3' exonuclease
IPR020045 - DNA polymerase I-like, H3TH domain
IPR020046 - 5'-3' exonuclease, alpha-helical arch, N-terminal
IPR029060 - PIN-like domain superfamily
IPR036279 - 5'-3' exonuclease, C-terminal domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7033404.1 hypothetical protein SDJN02_07460 [Cucurbita argyrosperma subsp. argyrosperma]7.91e-23086.25Show/hide
Query:  MAEAGANIGVNIPPFLNSASRSSLPSRTLKP-ESTLTTKSNSWRTKALQLSAFPVASHST---FKQTDGGTLQPIIEADNPRKGRVFFLDVNPLCYEGSR
        MAEA ANIG+NIPPFLNS S +SLPSRTLK  ES  TTK+  WRTK L+LS FP +S ST   FKQ + G L P +EADNPRKGRVFFLDVNPLCY+GSR
Subjt:  MAEAGANIGVNIPPFLNSASRSSLPSRTLKP-ESTLTTKSNSWRTKALQLSAFPVASHST---FKQTDGGTLQPIIEADNPRKGRVFFLDVNPLCYEGSR

Query:  PSLHNFGRWASIFFEQVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVDQ
        PSLHNFGRW SIFFE+VS SDPVIAV DGEGGSEHRRLLLPSYKAHRIKFTRQSSSQR+TKGNS R YQVIRDALR+CNVPV+K+ GHEADDVVATLV+Q
Subjt:  PSLHNFGRWASIFFEQVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVDQ

Query:  VLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS
        VLQRGFR VIASPDKDFKQLISEDVQLVMPLPELNRWSFYTL+HYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS
Subjt:  VLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS

Query:  AAAIRTVGRPYAQDALTKYADYLRTNYKVLALRRDVDVQFQEEWLVERDRQNDSAILSKFVENNDRNSLVQ
        AAA+RTVG+PYAQDALTKYADYLRTNYKVLALRRD+DVQF+EEWLV+RDRQNDS ILSKFVENNDRNSL +
Subjt:  AAAIRTVGRPYAQDALTKYADYLRTNYKVLALRRDVDVQFQEEWLVERDRQNDSAILSKFVENNDRNSLVQ

XP_022152532.1 uncharacterized protein LOC111020233 [Momordica charantia]1.36e-269100Show/hide
Query:  MAEAGANIGVNIPPFLNSASRSSLPSRTLKPESTLTTKSNSWRTKALQLSAFPVASHSTFKQTDGGTLQPIIEADNPRKGRVFFLDVNPLCYEGSRPSLH
        MAEAGANIGVNIPPFLNSASRSSLPSRTLKPESTLTTKSNSWRTKALQLSAFPVASHSTFKQTDGGTLQPIIEADNPRKGRVFFLDVNPLCYEGSRPSLH
Subjt:  MAEAGANIGVNIPPFLNSASRSSLPSRTLKPESTLTTKSNSWRTKALQLSAFPVASHSTFKQTDGGTLQPIIEADNPRKGRVFFLDVNPLCYEGSRPSLH

Query:  NFGRWASIFFEQVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVDQVLQR
        NFGRWASIFFEQVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVDQVLQR
Subjt:  NFGRWASIFFEQVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVDQVLQR

Query:  GFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLSAAAI
        GFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLSAAAI
Subjt:  GFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLSAAAI

Query:  RTVGRPYAQDALTKYADYLRTNYKVLALRRDVDVQFQEEWLVERDRQNDSAILSKFVENNDRNSLVQPSKRV
        RTVGRPYAQDALTKYADYLRTNYKVLALRRDVDVQFQEEWLVERDRQNDSAILSKFVENNDRNSLVQPSKRV
Subjt:  RTVGRPYAQDALTKYADYLRTNYKVLALRRDVDVQFQEEWLVERDRQNDSAILSKFVENNDRNSLVQPSKRV

XP_022961199.1 uncharacterized protein LOC111461781 [Cucurbita moschata]3.22e-22985.33Show/hide
Query:  MAEAGANIGVNIPPFLNSASRSSLPSRTLKP-ESTLTTKSNSWRTKALQLSAFPVASHST---FKQTDGGTLQPIIEADNPRKGRVFFLDVNPLCYEGSR
        MAEA ANIG+NIPPFLNS S +SLPSRTLK  ES  TTK+ SWRTK L+LS F  +S ST   FKQ + G L P +EADNPRKGRVFFLDVNPLCY+GSR
Subjt:  MAEAGANIGVNIPPFLNSASRSSLPSRTLKP-ESTLTTKSNSWRTKALQLSAFPVASHST---FKQTDGGTLQPIIEADNPRKGRVFFLDVNPLCYEGSR

Query:  PSLHNFGRWASIFFEQVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVDQ
        PSLHNFGRW SIFFE+VS SDPVIAV DGEGGSEHRRLLLPSYKAHRIKFTRQSSSQR+TKGNS R YQVIRDALR+CNVPV+K+ GHEADDVVATLV+Q
Subjt:  PSLHNFGRWASIFFEQVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVDQ

Query:  VLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS
        VLQRGFR VIASPDKDFKQLISEDVQLVMPLPELNRWSFYTL+HYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS
Subjt:  VLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS

Query:  AAAIRTVGRPYAQDALTKYADYLRTNYKVLALRRDVDVQFQEEWLVERDRQNDSAILSKFVENNDRNSLVQPSKR
        AAA+RTVG+PYAQDALTKYADYLRTNYKVLALRRD+DVQF+EEWLV+RDRQNDS ILSKFVENNDRNSL +   +
Subjt:  AAAIRTVGRPYAQDALTKYADYLRTNYKVLALRRDVDVQFQEEWLVERDRQNDSAILSKFVENNDRNSLVQPSKR

XP_022990137.1 uncharacterized protein LOC111487120 [Cucurbita maxima]4.78e-23187.6Show/hide
Query:  MAEAGANIGVNIPPFLNSASRSSLPSRTLKP-ESTLTTKSNSWRTKALQLSAFPVASHST---FKQTDGGTLQPIIEADNPRKGRVFFLDVNPLCYEGSR
        MAEA ANIG+NIPPFLNSAS +SLPSRTLK  ES  TTK+ SWRTK L+LS F  AS ST   FKQ + G L P +EADNPRKGRVFFLDVNPLCY+GSR
Subjt:  MAEAGANIGVNIPPFLNSASRSSLPSRTLKP-ESTLTTKSNSWRTKALQLSAFPVASHST---FKQTDGGTLQPIIEADNPRKGRVFFLDVNPLCYEGSR

Query:  PSLHNFGRWASIFFEQVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVDQ
        PSLHNFGRW SIFFE+VSHSDPVIAV DGEGGSEHRRLLLPSYKAHRIKFTRQSSSQR+TKGNS R YQVIRDALR+C+VPV+K+ GHEADDVVATLV+Q
Subjt:  PSLHNFGRWASIFFEQVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVDQ

Query:  VLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS
        VLQRGFR VIASPDKDFKQLISEDVQLVMPLPELNRWSFYTL+HYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS
Subjt:  VLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS

Query:  AAAIRTVGRPYAQDALTKYADYLRTNYKVLALRRDVDVQFQEEWLVERDRQNDSAILSKFVENNDRNSLVQ
        AAAIRTVG+PYAQDALTKYADYLRTNYKVLALRRDVDVQF+EEWLVERDRQNDS ILSKFVENNDRNSL +
Subjt:  AAAIRTVGRPYAQDALTKYADYLRTNYKVLALRRDVDVQFQEEWLVERDRQNDSAILSKFVENNDRNSLVQ

XP_038884032.1 5'-3' exonuclease [Benincasa hispida]7.08e-23086.24Show/hide
Query:  MAEAGANIGV-NIPPFLNSASRSSLPSRTLKPESTLTTK--SNSWRTKALQLSAFPVASHST---FKQTDGGTLQPIIEADNPRKGRVFFLDVNPLCYEG
        MAEA ANIGV N+PPFLNS+SR+SLPSRTLK E+ +T+K  +N+WRTK L+L+ F  +S  T   F QTD G  QP IEADNPR GRVFFLDVNPLCY+G
Subjt:  MAEAGANIGV-NIPPFLNSASRSSLPSRTLKPESTLTTK--SNSWRTKALQLSAFPVASHST---FKQTDGGTLQPIIEADNPRKGRVFFLDVNPLCYEG

Query:  SRPSLHNFGRWASIFFEQVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLV
        +RPSLHNFGRW SIFFE+VSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQ SSQR+TKGNSRR YQVIRDALRNCNVPVVKVDG EADDVVATLV
Subjt:  SRPSLHNFGRWASIFFEQVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLV

Query:  DQVLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENL
        +QVLQRG RVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQY+CDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTA+KLLKKHGSLENL
Subjt:  DQVLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENL

Query:  LSAAAIRTVGRPYAQDALTKYADYLRTNYKVLALRRDVDVQFQEEWLVERDRQNDSAILSKFVENNDRNSLVQPSKRV
        LSAAAIRTVG+PYAQ ALTKYA+YLRTNYKVLALRRDVDVQFQ+EWLVERDRQND AILSKFVEN +RNSL QPSKRV
Subjt:  LSAAAIRTVGRPYAQDALTKYADYLRTNYKVLALRRDVDVQFQEEWLVERDRQNDSAILSKFVENNDRNSLVQPSKRV

TrEMBL top hitse value%identityAlignment
A0A0A0LQN4 53EXOc domain-containing protein2.12e-22684.35Show/hide
Query:  MAEAGANIGVNIPPFLNSASRSSLPSRTLKPESTLTTK--SNSWRTKALQLSAFPVASHST---FKQTDGGTLQPIIEADNPRKGRVFFLDVNPLCYEGS
        MAEA ANIGVN PPFLNS+S + LPSRTLKPE  LT+K   N+WRTK L L+AF  +S  T   F QTD G  QP IEADN R GRVFFLDVNPLCY+GS
Subjt:  MAEAGANIGVNIPPFLNSASRSSLPSRTLKPESTLTTK--SNSWRTKALQLSAFPVASHST---FKQTDGGTLQPIIEADNPRKGRVFFLDVNPLCYEGS

Query:  RPSLHNFGRWASIFFEQVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVD
        +PSL NFGRW SIFFE+VSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTR  SS+R+TKGN R  YQVIRDALR+CNVPVV+V+GHEADDV+ATLV+
Subjt:  RPSLHNFGRWASIFFEQVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVD

Query:  QVLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLL
        QVLQRG RVV+ASPDKDFKQLISED+QLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTA+KLLKKHGSLENLL
Subjt:  QVLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLL

Query:  SAAAIRTVGRPYAQDALTKYADYLRTNYKVLALRRDVDVQFQEEWLVERDRQNDSAILSKFVENNDRNSLVQPSKRV
        SAAAIRTVG+PYAQDALTKYA+YLRTNYKVLALRRDVDVQFQ+EWLVERDR+NDS ILSKFVENNDRN LVQPSK+V
Subjt:  SAAAIRTVGRPYAQDALTKYADYLRTNYKVLALRRDVDVQFQEEWLVERDRQNDSAILSKFVENNDRNSLVQPSKRV

A0A1S3B2X7 5'-3' exonuclease5.88e-22584.53Show/hide
Query:  MAEAGANIGVNIPPFLNSASRSSLPSRTLKPESTLTTKSNSWRTKALQLSAFPVASHST---FKQTDGGTLQPIIEADNPRKGRVFFLDVNPLCYEGSRP
        M EA A IGVN PPFLNS+SR+ LPSRT     T   K N+WRTK L+L+AF  +S  T   F QTD G  QP IEADNPRKGRVFFLDVNPLCY+G++P
Subjt:  MAEAGANIGVNIPPFLNSASRSSLPSRTLKPESTLTTKSNSWRTKALQLSAFPVASHST---FKQTDGGTLQPIIEADNPRKGRVFFLDVNPLCYEGSRP

Query:  SLHNFGRWASIFFEQVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVDQV
        SL NFGRW SIFFE+VSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTR  SSQR+TKGN R  YQVIRDALR+CNVPVVKVDGHEADDVVATLV+QV
Subjt:  SLHNFGRWASIFFEQVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVDQV

Query:  LQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLSA
        LQRG RVV+ASPDKDFKQLISEDVQLVMPLPELNRWSFYT+RHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTA+KLLKKHGSLENLLSA
Subjt:  LQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLSA

Query:  AAIRTVGRPYAQDALTKYADYLRTNYKVLALRRDVDVQFQEEWLVERDRQNDSAILSKFVENNDRNSLVQPSKRV
        AAIRTVG+PYAQDALTKYA+YLRTNYKVLALRRDVDVQFQ+EWLVERDR+NDS ILSKFVENNDRN LVQPSK+V
Subjt:  AAIRTVGRPYAQDALTKYADYLRTNYKVLALRRDVDVQFQEEWLVERDRQNDSAILSKFVENNDRNSLVQPSKRV

A0A6J1DGI1 uncharacterized protein LOC1110202336.56e-270100Show/hide
Query:  MAEAGANIGVNIPPFLNSASRSSLPSRTLKPESTLTTKSNSWRTKALQLSAFPVASHSTFKQTDGGTLQPIIEADNPRKGRVFFLDVNPLCYEGSRPSLH
        MAEAGANIGVNIPPFLNSASRSSLPSRTLKPESTLTTKSNSWRTKALQLSAFPVASHSTFKQTDGGTLQPIIEADNPRKGRVFFLDVNPLCYEGSRPSLH
Subjt:  MAEAGANIGVNIPPFLNSASRSSLPSRTLKPESTLTTKSNSWRTKALQLSAFPVASHSTFKQTDGGTLQPIIEADNPRKGRVFFLDVNPLCYEGSRPSLH

Query:  NFGRWASIFFEQVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVDQVLQR
        NFGRWASIFFEQVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVDQVLQR
Subjt:  NFGRWASIFFEQVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVDQVLQR

Query:  GFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLSAAAI
        GFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLSAAAI
Subjt:  GFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLSAAAI

Query:  RTVGRPYAQDALTKYADYLRTNYKVLALRRDVDVQFQEEWLVERDRQNDSAILSKFVENNDRNSLVQPSKRV
        RTVGRPYAQDALTKYADYLRTNYKVLALRRDVDVQFQEEWLVERDRQNDSAILSKFVENNDRNSLVQPSKRV
Subjt:  RTVGRPYAQDALTKYADYLRTNYKVLALRRDVDVQFQEEWLVERDRQNDSAILSKFVENNDRNSLVQPSKRV

A0A6J1HDC1 uncharacterized protein LOC1114617811.56e-22985.33Show/hide
Query:  MAEAGANIGVNIPPFLNSASRSSLPSRTLKP-ESTLTTKSNSWRTKALQLSAFPVASHST---FKQTDGGTLQPIIEADNPRKGRVFFLDVNPLCYEGSR
        MAEA ANIG+NIPPFLNS S +SLPSRTLK  ES  TTK+ SWRTK L+LS F  +S ST   FKQ + G L P +EADNPRKGRVFFLDVNPLCY+GSR
Subjt:  MAEAGANIGVNIPPFLNSASRSSLPSRTLKP-ESTLTTKSNSWRTKALQLSAFPVASHST---FKQTDGGTLQPIIEADNPRKGRVFFLDVNPLCYEGSR

Query:  PSLHNFGRWASIFFEQVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVDQ
        PSLHNFGRW SIFFE+VS SDPVIAV DGEGGSEHRRLLLPSYKAHRIKFTRQSSSQR+TKGNS R YQVIRDALR+CNVPV+K+ GHEADDVVATLV+Q
Subjt:  PSLHNFGRWASIFFEQVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVDQ

Query:  VLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS
        VLQRGFR VIASPDKDFKQLISEDVQLVMPLPELNRWSFYTL+HYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS
Subjt:  VLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS

Query:  AAAIRTVGRPYAQDALTKYADYLRTNYKVLALRRDVDVQFQEEWLVERDRQNDSAILSKFVENNDRNSLVQPSKR
        AAA+RTVG+PYAQDALTKYADYLRTNYKVLALRRD+DVQF+EEWLV+RDRQNDS ILSKFVENNDRNSL +   +
Subjt:  AAAIRTVGRPYAQDALTKYADYLRTNYKVLALRRDVDVQFQEEWLVERDRQNDSAILSKFVENNDRNSLVQPSKR

A0A6J1JHT6 uncharacterized protein LOC1114871202.32e-23187.6Show/hide
Query:  MAEAGANIGVNIPPFLNSASRSSLPSRTLKP-ESTLTTKSNSWRTKALQLSAFPVASHST---FKQTDGGTLQPIIEADNPRKGRVFFLDVNPLCYEGSR
        MAEA ANIG+NIPPFLNSAS +SLPSRTLK  ES  TTK+ SWRTK L+LS F  AS ST   FKQ + G L P +EADNPRKGRVFFLDVNPLCY+GSR
Subjt:  MAEAGANIGVNIPPFLNSASRSSLPSRTLKP-ESTLTTKSNSWRTKALQLSAFPVASHST---FKQTDGGTLQPIIEADNPRKGRVFFLDVNPLCYEGSR

Query:  PSLHNFGRWASIFFEQVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVDQ
        PSLHNFGRW SIFFE+VSHSDPVIAV DGEGGSEHRRLLLPSYKAHRIKFTRQSSSQR+TKGNS R YQVIRDALR+C+VPV+K+ GHEADDVVATLV+Q
Subjt:  PSLHNFGRWASIFFEQVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVDQ

Query:  VLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS
        VLQRGFR VIASPDKDFKQLISEDVQLVMPLPELNRWSFYTL+HYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS
Subjt:  VLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS

Query:  AAAIRTVGRPYAQDALTKYADYLRTNYKVLALRRDVDVQFQEEWLVERDRQNDSAILSKFVENNDRNSLVQ
        AAAIRTVG+PYAQDALTKYADYLRTNYKVLALRRDVDVQF+EEWLVERDRQNDS ILSKFVENNDRNSL +
Subjt:  AAAIRTVGRPYAQDALTKYADYLRTNYKVLALRRDVDVQFQEEWLVERDRQNDSAILSKFVENNDRNSLVQ

SwissProt top hitse value%identityAlignment
O67550 5'-3' exonuclease8.1e-2429.07Show/hide
Query:  FLDVNPLCYEGSRPSLHNFGRWASIFFEQVSHSDP--VIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKV
        F  + PL      P+   +G +  + F  +    P  ++ VFD    ++ R  +   YK  R K       Q            VI++ L+   +P++++
Subjt:  FLDVNPLCYEGSRPSLHNFGRWASIFFEQVSHSDP--VIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKV

Query:  DGHEADDVVATLVDQVLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTA
         G+EADDV+A L ++  Q+GF+V I SPDKD  QL+SE+V ++ P+ +      +T    + ++  +P        ++GD+VD VPGI+    G G KTA
Subjt:  DGHEADDVVATLVDQVLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTA

Query:  MKLLKKHGSLENLLSAAAIRTVGRPYAQDALTKYADYLRTNYKVLALRRDVDVQFQEE
        + +LKK+GS+EN+L           + ++      + L  +YK++ L  D+D++  EE
Subjt:  MKLLKKHGSLENLLSAAAIRTVGRPYAQDALTKYADYLRTNYKVLALRRDVDVQFQEE

P52026 DNA polymerase I3.8e-2125.61Show/hide
Query:  KGRVFFLDVNPLCYEG--SRPSLHNFGRWASIFFEQVSHSDPVIA---VFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRP------YQVI
        K ++  +D N + Y    + P LHN         ++  H++ V     + +     E    +L ++ A +  F R  + Q Y  G  + P      + ++
Subjt:  KGRVFFLDVNPLCYEG--SRPSLHNFGRWASIFFEQVSHSDPVIA---VFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRP------YQVI

Query:  RDALRNCNVPVVKVDGHEADDVVATLVDQVLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVP
        R+ L+   +P  ++D +EADD++ T+  +  + GF V + S D+D  QL S  V + +    +     YT    + +Y   P   + L+ +MGD+ D +P
Subjt:  RDALRNCNVPVVKVDGHEADDVVATLVDQVLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVP

Query:  GIQHVAPGFGRKTAMKLLKKHGSLENLLSAAAIRTVGRPYAQDALTKYADYLRTNYKVLALRRDVDVQFQEEWLVERDRQNDSAI
        G+    PG G KTA+KLLK+ G++EN+L  A+I  +     ++ L +Y D    + ++ A+ RD  V+   + +V +    +  +
Subjt:  GIQHVAPGFGRKTAMKLLKKHGSLENLLSAAAIRTVGRPYAQDALTKYADYLRTNYKVLALRRDVDVQFQEEWLVERDRQNDSAI

Q04957 DNA polymerase I2.4e-2029.95Show/hide
Query:  LLPSYKAHRIKFTRQSSSQRYTKGNSRRP------YQVIRDALRNCNVPVVKVDGHEADDVVATLVDQVLQRGFRVVIASPDKDFKQLISEDVQLVMPLP
        +L ++ A +  F R  + Q Y  G  + P      + ++R+ LR   +P  +++ +EADD++ TL  +  Q GF V + S D+D  QL S  V + +   
Subjt:  LLPSYKAHRIKFTRQSSSQRYTKGNSRRP------YQVIRDALRNCNVPVVKVDGHEADDVVATLVDQVLQRGFRVVIASPDKDFKQLISEDVQLVMPLP

Query:  ELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLSAAAIRTVGRPYAQDALTKYADYLRTNYKVLAL
         +     YT      +Y   P   + L+ +MGD+ D +PG+    PG G KTA+KLL++ G++EN+L  A+I  +     ++ L ++ +    + K+ A+
Subjt:  ELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLSAAAIRTVGRPYAQDALTKYADYLRTNYKVLAL

Query:  RRDVDVQ
        RRD  V+
Subjt:  RRDVDVQ

Q92GB7 DNA polymerase I3.0e-1827.85Show/hide
Query:  VIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVDQVLQRGFRVVIASPDKDFKQLIS
        V  VFD  GG   R  + P YKA+R         Q            ++RD   N N P+++ +G+EADD++AT   +    G  VVI S DKD  QL++
Subjt:  VIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVDQVLQRGFRVVIASPDKDFKQLIS

Query:  EDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLSAAAIRTVGRPYAQDALTKYADY
        E++++  PL    +  + T    + ++         +  ++GD  D +PG+    P  G KTA  L+ + GS+EN+ +  ++  V     ++ L    + 
Subjt:  EDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLSAAAIRTVGRPYAQDALTKYADY

Query:  LRTNYKVLALRRDVDVQFQ
           +++++ L  +VD+ FQ
Subjt:  LRTNYKVLALRRDVDVQFQ

Q9RLB6 DNA polymerase I1.0e-1828.77Show/hide
Query:  VIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVDQVLQRGFRVVIASPDKDFKQLIS
        V  VFD  GG   R  + P YKA+R         Q            ++RD   N N P+++ +G+EADD++AT   +    G  VVI S DKD  QL+S
Subjt:  VIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVDQVLQRGFRVVIASPDKDFKQLIS

Query:  EDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLSAAAIRTVGRPYAQDALTKYADY
        E++++  PL    R  + T    + ++         +  ++GD  D +PG+    P  G KTA  L+ + GS+EN+ +  ++  V     ++ L    + 
Subjt:  EDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLSAAAIRTVGRPYAQDALTKYADY

Query:  LRTNYKVLALRRDVDVQFQ
           +++++ L  +VD+ FQ
Subjt:  LRTNYKVLALRRDVDVQFQ

Arabidopsis top hitse value%identityAlignment
AT1G34380.1 5'-3' exonuclease family protein5.1e-6656.71Show/hide
Query:  KSNSWRTKALQLSAFPVASHS---TFKQTDGGTLQPIIEAD---------NPRKGRVFFLDVNPLCYEGSRPSLHNFGRWASIFFEQVSHSDPVIAVFDG
        K+   RTK +  S+   +SHS   TF +T  G +Q +++ D           +  RVFFLDV+PLCYEG++PS   FG W S+FF QVS +DPVIAV DG
Subjt:  KSNSWRTKALQLSAFPVASHS---TFKQTDGGTLQPIIEAD---------NPRKGRVFFLDVNPLCYEGSRPSLHNFGRWASIFFEQVSHSDPVIAVFDG

Query:  EGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVDQVLQRGFRVVIASPDKDFKQLISEDVQLVM
        E G++ RR LLPSYKAHR    +  +  RY    S+RP+Q + + LR CNVPVV+++GHEADDVVATL++Q +QRG+R VIASPDKDFKQLISE+VQ+V+
Subjt:  EGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVDQVLQRGFRVVIASPDKDFKQLISEDVQLVM

Query:  PLPELNRWSFYTLRHYLAQYNCDPCSDLSLR
        PL +L RWSFYTL+HY AQYNCDP SDLS R
Subjt:  PLPELNRWSFYTLRHYLAQYNCDPCSDLSLR

AT1G34380.2 5'-3' exonuclease family protein1.6e-11563.75Show/hide
Query:  KSNSWRTKALQLSAFPVASHS---TFKQTDGGTLQPIIEAD---------NPRKGRVFFLDVNPLCYEGSRPSLHNFGRWASIFFEQVSHSDPVIAVFDG
        K+   RTK +  S+   +SHS   TF +T  G +Q +++ D           +  RVFFLDV+PLCYEG++PS   FG W S+FF QVS +DPVIAV DG
Subjt:  KSNSWRTKALQLSAFPVASHS---TFKQTDGGTLQPIIEAD---------NPRKGRVFFLDVNPLCYEGSRPSLHNFGRWASIFFEQVSHSDPVIAVFDG

Query:  EGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVDQVLQRGFRVVIASPDKDFKQLISEDVQLVM
        E G++ RR LLPSYKAHR    +  +  RY    S+RP+Q + + LR CNVPVV+++GHEADDVVATL++Q +QRG+R VIASPDKDFKQLISE+VQ+V+
Subjt:  EGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVDQVLQRGFRVVIASPDKDFKQLISEDVQLVM

Query:  PLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLSAAAIRTVGRPYAQDALTKYADYLRTNYKV
        PL +L RWSFYTL+HY AQYNCDP SDLS RCIMGDEVDGVPGIQH+ P FGRKTAMKL++KHGSLE+LLSAAA+RTVGRPYAQ+ALTKYADYLR NY+V
Subjt:  PLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLSAAAIRTVGRPYAQDALTKYADYLRTNYKV

Query:  LALRRDVDVQFQEEWLVERDRQNDSAILSKF
        LAL RDV VQ Q EWL+ERD  NDS +LS F
Subjt:  LALRRDVDVQFQEEWLVERDRQNDSAILSKF

AT3G52050.1 5'-3' exonuclease family protein8.1e-1927.18Show/hide
Query:  GSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVDQVLQRGFRVVIASPDKDFKQLISEDVQLVMPL
        G   R  L P+YK++R            T     +  Q ++ +++  ++ V++V G EADDV+ TL  + +  GF+V + SPDKDF Q++S  ++L+   
Subjt:  GSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVDQVLQRGFRVVIASPDKDFKQLISEDVQLVMPL

Query:  PELNRWSFYTLRHYLAQY-NCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLSAAAIRTVGRPYAQDALTKYADYLRTNYKVL
        P  +  + + +  +  ++ N +P   + +  + GD+ D +PG+     G G   A++L+ + G+LENLL   ++  +     +++L   AD    + K+ 
Subjt:  PELNRWSFYTLRHYLAQY-NCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLSAAAIRTVGRPYAQDALTKYADYLRTNYKVL

Query:  ALRRDV
         LR D+
Subjt:  ALRRDV

AT3G52050.2 5'-3' exonuclease family protein8.1e-1927.18Show/hide
Query:  GSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVDQVLQRGFRVVIASPDKDFKQLISEDVQLVMPL
        G   R  L P+YK++R            T     +  Q ++ +++  ++ V++V G EADDV+ TL  + +  GF+V + SPDKDF Q++S  ++L+   
Subjt:  GSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVDQVLQRGFRVVIASPDKDFKQLISEDVQLVMPL

Query:  PELNRWSFYTLRHYLAQY-NCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLSAAAIRTVGRPYAQDALTKYADYLRTNYKVL
        P  +  + + +  +  ++ N +P   + +  + GD+ D +PG+     G G   A++L+ + G+LENLL   ++  +     +++L   AD    + K+ 
Subjt:  PELNRWSFYTLRHYLAQY-NCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLSAAAIRTVGRPYAQDALTKYADYLRTNYKVL

Query:  ALRRDV
         LR D+
Subjt:  ALRRDV

AT3G52050.4 5'-3' exonuclease family protein4.3e-2027.73Show/hide
Query:  VIAVFDGEG-----GSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVDQVLQRGFRVVIASPDKDF
        V  VFD +G     G   R  L P+YK++R            T     +  Q ++ +++  ++ V++V G EADDV+ TL  + +  GF+V + SPDKDF
Subjt:  VIAVFDGEG-----GSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVDQVLQRGFRVVIASPDKDF

Query:  KQLISEDVQLVMPLPELNRWSFYTLRHYLAQY-NCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLSAAAIRTVGRPYAQDAL
         Q++S  ++L+   P  +  + + +  +  ++ N +P   + +  + GD+ D +PG+     G G   A++L+ + G+LENLL   ++  +     +++L
Subjt:  KQLISEDVQLVMPLPELNRWSFYTLRHYLAQY-NCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLSAAAIRTVGRPYAQDAL

Query:  TKYADYLRTNYKVLALRRDV
           AD    + K+  LR D+
Subjt:  TKYADYLRTNYKVLALRRDV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTGAGGCGGGCGCCAACATAGGCGTGAATATTCCGCCATTTCTGAACTCTGCATCGCGTAGTTCCTTACCCTCCAGAACTCTGAAACCAGAATCGACATTAACAAC
GAAATCCAATTCATGGAGAACGAAGGCGTTGCAGCTAAGTGCCTTTCCGGTAGCATCTCATTCTACCTTTAAACAAACAGATGGCGGGACGTTGCAGCCAATAATTGAAG
CGGATAATCCAAGAAAGGGAAGGGTCTTTTTCCTGGACGTAAATCCATTATGCTACGAAGGTAGCAGACCTAGTTTGCACAATTTTGGTCGCTGGGCTTCCATATTCTTC
GAGCAAGTTAGCCACAGTGATCCTGTTATTGCTGTTTTTGATGGGGAAGGAGGTAGTGAGCATCGCAGGCTGTTATTACCCTCATATAAAGCACATCGGATCAAATTCAC
GAGACAATCATCTTCACAAAGATATACTAAGGGAAATTCTAGAAGGCCATATCAAGTAATAAGGGATGCTCTCAGAAACTGTAATGTGCCAGTTGTAAAGGTTGACGGTC
ATGAAGCAGATGATGTTGTAGCTACACTTGTGGACCAAGTTTTACAAAGAGGGTTTCGGGTGGTTATAGCCTCTCCTGATAAAGATTTCAAGCAGTTGATTTCAGAAGAT
GTCCAACTCGTGATGCCTTTGCCAGAGCTCAACAGATGGTCCTTTTACACCTTAAGGCACTACCTAGCTCAGTATAACTGCGATCCATGCTCTGACTTGAGTCTGAGATG
TATTATGGGTGATGAGGTAGATGGCGTTCCGGGAATCCAGCATGTTGCTCCTGGATTTGGTCGAAAAACTGCAATGAAGCTCTTGAAGAAACATGGTTCTTTGGAGAATC
TACTCAGTGCTGCTGCAATAAGAACTGTGGGCAGACCGTATGCACAAGATGCACTTACAAAGTATGCCGATTACCTGCGTACGAATTATAAAGTTCTAGCCCTGAGAAGA
GATGTTGATGTTCAATTTCAAGAGGAGTGGTTGGTTGAAAGAGACAGACAAAACGATTCGGCTATTTTATCCAAGTTTGTAGAAAACAATGACAGAAACTCACTTGTTCA
ACCATCAAAACGGGTC
mRNA sequenceShow/hide mRNA sequence
TTCATTTTCAGTATCCTTTTTCCTGACAGGAGGCGTAGACTAGAAATGGCTGAGGCGGGCGCCAACATAGGCGTGAATATTCCGCCATTTCTGAACTCTGCATCGCGTAG
TTCCTTACCCTCCAGAACTCTGAAACCAGAATCGACATTAACAACGAAATCCAATTCATGGAGAACGAAGGCGTTGCAGCTAAGTGCCTTTCCGGTAGCATCTCATTCTA
CCTTTAAACAAACAGATGGCGGGACGTTGCAGCCAATAATTGAAGCGGATAATCCAAGAAAGGGAAGGGTCTTTTTCCTGGACGTAAATCCATTATGCTACGAAGGTAGC
AGACCTAGTTTGCACAATTTTGGTCGCTGGGCTTCCATATTCTTCGAGCAAGTTAGCCACAGTGATCCTGTTATTGCTGTTTTTGATGGGGAAGGAGGTAGTGAGCATCG
CAGGCTGTTATTACCCTCATATAAAGCACATCGGATCAAATTCACGAGACAATCATCTTCACAAAGATATACTAAGGGAAATTCTAGAAGGCCATATCAAGTAATAAGGG
ATGCTCTCAGAAACTGTAATGTGCCAGTTGTAAAGGTTGACGGTCATGAAGCAGATGATGTTGTAGCTACACTTGTGGACCAAGTTTTACAAAGAGGGTTTCGGGTGGTT
ATAGCCTCTCCTGATAAAGATTTCAAGCAGTTGATTTCAGAAGATGTCCAACTCGTGATGCCTTTGCCAGAGCTCAACAGATGGTCCTTTTACACCTTAAGGCACTACCT
AGCTCAGTATAACTGCGATCCATGCTCTGACTTGAGTCTGAGATGTATTATGGGTGATGAGGTAGATGGCGTTCCGGGAATCCAGCATGTTGCTCCTGGATTTGGTCGAA
AAACTGCAATGAAGCTCTTGAAGAAACATGGTTCTTTGGAGAATCTACTCAGTGCTGCTGCAATAAGAACTGTGGGCAGACCGTATGCACAAGATGCACTTACAAAGTAT
GCCGATTACCTGCGTACGAATTATAAAGTTCTAGCCCTGAGAAGAGATGTTGATGTTCAATTTCAAGAGGAGTGGTTGGTTGAAAGAGACAGACAAAACGATTCGGCTAT
TTTATCCAAGTTTGTAGAAAACAATGACAGAAACTCACTTGTTCAACCATCAAAACGGGTC
Protein sequenceShow/hide protein sequence
MAEAGANIGVNIPPFLNSASRSSLPSRTLKPESTLTTKSNSWRTKALQLSAFPVASHSTFKQTDGGTLQPIIEADNPRKGRVFFLDVNPLCYEGSRPSLHNFGRWASIFF
EQVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVDQVLQRGFRVVIASPDKDFKQLISED
VQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLSAAAIRTVGRPYAQDALTKYADYLRTNYKVLALRR
DVDVQFQEEWLVERDRQNDSAILSKFVENNDRNSLVQPSKRV