; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS003542 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS003542
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
Description5'-3' exonuclease
Genome locationscaffold234:3428907..3431574
RNA-Seq ExpressionMS003542
SyntenyMS003542
Gene Ontology termsGO:0006261 - DNA-dependent DNA replication (biological process)
GO:0006302 - double-strand break repair (biological process)
GO:0071897 - DNA biosynthetic process (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0003677 - DNA binding (molecular function)
GO:0003887 - DNA-directed DNA polymerase activity (molecular function)
GO:0004527 - exonuclease activity (molecular function)
InterPro domainsIPR002298 - DNA polymerase A
IPR002421 - 5'-3' exonuclease
IPR020045 - DNA polymerase I-like, H3TH domain
IPR020046 - 5'-3' exonuclease, alpha-helical arch, N-terminal
IPR029060 - PIN-like domain superfamily
IPR036279 - 5'-3' exonuclease, C-terminal domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7033404.1 hypothetical protein SDJN02_07460 [Cucurbita argyrosperma subsp. argyrosperma]4.5e-18185.71Show/hide
Query:  MAEAGANIGVNIPPFLNSASRSSLPSRTLK-PESTLTTKSNSWRTKALQLSAFPVASHST---FKQTDGGTLQPIIEADNPRNGRVFFLDVNPLCYEGSR
        MAEA ANIG+NIPPFLNS S +SLPSRTLK  ES  TTK+  WRTK L+LS FP +S ST   FKQ + G L P +EADNPR GRVFFLDVNPLCY+GSR
Subjt:  MAEAGANIGVNIPPFLNSASRSSLPSRTLK-PESTLTTKSNSWRTKALQLSAFPVASHST---FKQTDGGTLQPIIEADNPRNGRVFFLDVNPLCYEGSR

Query:  PSLHNFGRWASIFFEQVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVDQ
        PSLHNFGRW SIFFE+VS SDPVIAV DGEGGSEHRRLLLPSYKAHRIKFTRQSSSQR+TKGNS R YQVIRDALR+CNVPV+K+ GHEADDVVATLV+Q
Subjt:  PSLHNFGRWASIFFEQVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVDQ

Query:  VLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS
        VLQRGFR VIASPDKDFKQLISEDVQLVMPLPELNRWSFYTL+HYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS
Subjt:  VLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS

Query:  AAAIRTVGRPYAQDALTKYADYLRMNYKVLALRRDVDVQFQEEWLVERDRQNDSAILSKFVENNDRNSLVQ
        AAA+RTVG+PYAQDALTKYADYLR NYKVLALRRD+DVQF+EEWLV+RDRQNDS ILSKFVENNDRNSL +
Subjt:  AAAIRTVGRPYAQDALTKYADYLRMNYKVLALRRDVDVQFQEEWLVERDRQNDSAILSKFVENNDRNSLVQ

XP_022152532.1 uncharacterized protein LOC111020233 [Momordica charantia]3.2e-21199.46Show/hide
Query:  MAEAGANIGVNIPPFLNSASRSSLPSRTLKPESTLTTKSNSWRTKALQLSAFPVASHSTFKQTDGGTLQPIIEADNPRNGRVFFLDVNPLCYEGSRPSLH
        MAEAGANIGVNIPPFLNSASRSSLPSRTLKPESTLTTKSNSWRTKALQLSAFPVASHSTFKQTDGGTLQPIIEADNPR GRVFFLDVNPLCYEGSRPSLH
Subjt:  MAEAGANIGVNIPPFLNSASRSSLPSRTLKPESTLTTKSNSWRTKALQLSAFPVASHSTFKQTDGGTLQPIIEADNPRNGRVFFLDVNPLCYEGSRPSLH

Query:  NFGRWASIFFEQVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVDQVLQR
        NFGRWASIFFEQVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVDQVLQR
Subjt:  NFGRWASIFFEQVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVDQVLQR

Query:  GFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLSAAAI
        GFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLSAAAI
Subjt:  GFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLSAAAI

Query:  RTVGRPYAQDALTKYADYLRMNYKVLALRRDVDVQFQEEWLVERDRQNDSAILSKFVENNDRNSLVQPSKRV
        RTVGRPYAQDALTKYADYLR NYKVLALRRDVDVQFQEEWLVERDRQNDSAILSKFVENNDRNSLVQPSKRV
Subjt:  RTVGRPYAQDALTKYADYLRMNYKVLALRRDVDVQFQEEWLVERDRQNDSAILSKFVENNDRNSLVQPSKRV

XP_022961199.1 uncharacterized protein LOC111461781 [Cucurbita moschata]1.3e-18084.8Show/hide
Query:  MAEAGANIGVNIPPFLNSASRSSLPSRTLK-PESTLTTKSNSWRTKALQLSAFPVASHST---FKQTDGGTLQPIIEADNPRNGRVFFLDVNPLCYEGSR
        MAEA ANIG+NIPPFLNS S +SLPSRTLK  ES  TTK+ SWRTK L+LS F  +S ST   FKQ + G L P +EADNPR GRVFFLDVNPLCY+GSR
Subjt:  MAEAGANIGVNIPPFLNSASRSSLPSRTLK-PESTLTTKSNSWRTKALQLSAFPVASHST---FKQTDGGTLQPIIEADNPRNGRVFFLDVNPLCYEGSR

Query:  PSLHNFGRWASIFFEQVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVDQ
        PSLHNFGRW SIFFE+VS SDPVIAV DGEGGSEHRRLLLPSYKAHRIKFTRQSSSQR+TKGNS R YQVIRDALR+CNVPV+K+ GHEADDVVATLV+Q
Subjt:  PSLHNFGRWASIFFEQVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVDQ

Query:  VLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS
        VLQRGFR VIASPDKDFKQLISEDVQLVMPLPELNRWSFYTL+HYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS
Subjt:  VLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS

Query:  AAAIRTVGRPYAQDALTKYADYLRMNYKVLALRRDVDVQFQEEWLVERDRQNDSAILSKFVENNDRNSLVQPSKR
        AAA+RTVG+PYAQDALTKYADYLR NYKVLALRRD+DVQF+EEWLV+RDRQNDS ILSKFVENNDRNSL +   +
Subjt:  AAAIRTVGRPYAQDALTKYADYLRMNYKVLALRRDVDVQFQEEWLVERDRQNDSAILSKFVENNDRNSLVQPSKR

XP_022990137.1 uncharacterized protein LOC111487120 [Cucurbita maxima]5.3e-18287.06Show/hide
Query:  MAEAGANIGVNIPPFLNSASRSSLPSRTLK-PESTLTTKSNSWRTKALQLSAFPVASHST---FKQTDGGTLQPIIEADNPRNGRVFFLDVNPLCYEGSR
        MAEA ANIG+NIPPFLNSAS +SLPSRTLK  ES  TTK+ SWRTK L+LS F  AS ST   FKQ + G L P +EADNPR GRVFFLDVNPLCY+GSR
Subjt:  MAEAGANIGVNIPPFLNSASRSSLPSRTLK-PESTLTTKSNSWRTKALQLSAFPVASHST---FKQTDGGTLQPIIEADNPRNGRVFFLDVNPLCYEGSR

Query:  PSLHNFGRWASIFFEQVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVDQ
        PSLHNFGRW SIFFE+VSHSDPVIAV DGEGGSEHRRLLLPSYKAHRIKFTRQSSSQR+TKGNS R YQVIRDALR+C+VPV+K+ GHEADDVVATLV+Q
Subjt:  PSLHNFGRWASIFFEQVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVDQ

Query:  VLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS
        VLQRGFR VIASPDKDFKQLISEDVQLVMPLPELNRWSFYTL+HYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS
Subjt:  VLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS

Query:  AAAIRTVGRPYAQDALTKYADYLRMNYKVLALRRDVDVQFQEEWLVERDRQNDSAILSKFVENNDRNSLVQ
        AAAIRTVG+PYAQDALTKYADYLR NYKVLALRRDVDVQF+EEWLVERDRQNDS ILSKFVENNDRNSL +
Subjt:  AAAIRTVGRPYAQDALTKYADYLRMNYKVLALRRDVDVQFQEEWLVERDRQNDSAILSKFVENNDRNSLVQ

XP_038884032.1 5'-3' exonuclease [Benincasa hispida]2.4e-18286.24Show/hide
Query:  MAEAGANIGV-NIPPFLNSASRSSLPSRTLKPESTLTT--KSNSWRTKALQLSAFPVASHST---FKQTDGGTLQPIIEADNPRNGRVFFLDVNPLCYEG
        MAEA ANIGV N+PPFLNS+SR+SLPSRTLK E+ +T+  K+N+WRTK L+L+ F  +S  T   F QTD G  QP IEADNPRNGRVFFLDVNPLCY+G
Subjt:  MAEAGANIGV-NIPPFLNSASRSSLPSRTLKPESTLTT--KSNSWRTKALQLSAFPVASHST---FKQTDGGTLQPIIEADNPRNGRVFFLDVNPLCYEG

Query:  SRPSLHNFGRWASIFFEQVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLV
        +RPSLHNFGRW SIFFE+VSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQ SSQR+TKGNSRR YQVIRDALRNCNVPVVKVDG EADDVVATLV
Subjt:  SRPSLHNFGRWASIFFEQVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLV

Query:  DQVLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENL
        +QVLQRG RVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQY+CDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTA+KLLKKHGSLENL
Subjt:  DQVLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENL

Query:  LSAAAIRTVGRPYAQDALTKYADYLRMNYKVLALRRDVDVQFQEEWLVERDRQNDSAILSKFVENNDRNSLVQPSKRV
        LSAAAIRTVG+PYAQ ALTKYA+YLR NYKVLALRRDVDVQFQ+EWLVERDRQND AILSKFVEN +RNSL QPSKRV
Subjt:  LSAAAIRTVGRPYAQDALTKYADYLRMNYKVLALRRDVDVQFQEEWLVERDRQNDSAILSKFVENNDRNSLVQPSKRV

TrEMBL top hitse value%identityAlignment
A0A0A0LQN4 53EXOc domain-containing protein3.5e-17984.08Show/hide
Query:  MAEAGANIGVNIPPFLNSASRSSLPSRTLKPESTLTT--KSNSWRTKALQLSAFPVASHST---FKQTDGGTLQPIIEADNPRNGRVFFLDVNPLCYEGS
        MAEA ANIGVN PPFLNS+S + LPSRTLKPE  LT+  K N+WRTK L L+AF  +S  T   F QTD G  QP IEADN R GRVFFLDVNPLCY+GS
Subjt:  MAEAGANIGVNIPPFLNSASRSSLPSRTLKPESTLTT--KSNSWRTKALQLSAFPVASHST---FKQTDGGTLQPIIEADNPRNGRVFFLDVNPLCYEGS

Query:  RPSLHNFGRWASIFFEQVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVD
        +PSL NFGRW SIFFE+VSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTR  SS+R+TKGN R  YQVIRDALR+CNVPVV+V+GHEADDV+ATLV+
Subjt:  RPSLHNFGRWASIFFEQVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVD

Query:  QVLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLL
        QVLQRG RVV+ASPDKDFKQLISED+QLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTA+KLLKKHGSLENLL
Subjt:  QVLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLL

Query:  SAAAIRTVGRPYAQDALTKYADYLRMNYKVLALRRDVDVQFQEEWLVERDRQNDSAILSKFVENNDRNSLVQPSKRV
        SAAAIRTVG+PYAQDALTKYA+YLR NYKVLALRRDVDVQFQ+EWLVERDR+NDS ILSKFVENNDRN LVQPSK+V
Subjt:  SAAAIRTVGRPYAQDALTKYADYLRMNYKVLALRRDVDVQFQEEWLVERDRQNDSAILSKFVENNDRNSLVQPSKRV

A0A1S3B2X7 5'-3' exonuclease3.3e-17784Show/hide
Query:  MAEAGANIGVNIPPFLNSASRSSLPSRTLKPESTLTTKSNSWRTKALQLSAFPVASHST---FKQTDGGTLQPIIEADNPRNGRVFFLDVNPLCYEGSRP
        M EA A IGVN PPFLNS+SR+ LPSRT     T   K N+WRTK L+L+AF  +S  T   F QTD G  QP IEADNPR GRVFFLDVNPLCY+G++P
Subjt:  MAEAGANIGVNIPPFLNSASRSSLPSRTLKPESTLTTKSNSWRTKALQLSAFPVASHST---FKQTDGGTLQPIIEADNPRNGRVFFLDVNPLCYEGSRP

Query:  SLHNFGRWASIFFEQVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVDQV
        SL NFGRW SIFFE+VSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTR  SSQR+TKGN R  YQVIRDALR+CNVPVVKVDGHEADDVVATLV+QV
Subjt:  SLHNFGRWASIFFEQVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVDQV

Query:  LQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLSA
        LQRG RVV+ASPDKDFKQLISEDVQLVMPLPELNRWSFYT+RHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTA+KLLKKHGSLENLLSA
Subjt:  LQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLSA

Query:  AAIRTVGRPYAQDALTKYADYLRMNYKVLALRRDVDVQFQEEWLVERDRQNDSAILSKFVENNDRNSLVQPSKRV
        AAIRTVG+PYAQDALTKYA+YLR NYKVLALRRDVDVQFQ+EWLVERDR+NDS ILSKFVENNDRN LVQPSK+V
Subjt:  AAIRTVGRPYAQDALTKYADYLRMNYKVLALRRDVDVQFQEEWLVERDRQNDSAILSKFVENNDRNSLVQPSKRV

A0A6J1DGI1 uncharacterized protein LOC1110202331.6e-21199.46Show/hide
Query:  MAEAGANIGVNIPPFLNSASRSSLPSRTLKPESTLTTKSNSWRTKALQLSAFPVASHSTFKQTDGGTLQPIIEADNPRNGRVFFLDVNPLCYEGSRPSLH
        MAEAGANIGVNIPPFLNSASRSSLPSRTLKPESTLTTKSNSWRTKALQLSAFPVASHSTFKQTDGGTLQPIIEADNPR GRVFFLDVNPLCYEGSRPSLH
Subjt:  MAEAGANIGVNIPPFLNSASRSSLPSRTLKPESTLTTKSNSWRTKALQLSAFPVASHSTFKQTDGGTLQPIIEADNPRNGRVFFLDVNPLCYEGSRPSLH

Query:  NFGRWASIFFEQVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVDQVLQR
        NFGRWASIFFEQVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVDQVLQR
Subjt:  NFGRWASIFFEQVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVDQVLQR

Query:  GFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLSAAAI
        GFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLSAAAI
Subjt:  GFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLSAAAI

Query:  RTVGRPYAQDALTKYADYLRMNYKVLALRRDVDVQFQEEWLVERDRQNDSAILSKFVENNDRNSLVQPSKRV
        RTVGRPYAQDALTKYADYLR NYKVLALRRDVDVQFQEEWLVERDRQNDSAILSKFVENNDRNSLVQPSKRV
Subjt:  RTVGRPYAQDALTKYADYLRMNYKVLALRRDVDVQFQEEWLVERDRQNDSAILSKFVENNDRNSLVQPSKRV

A0A6J1HDC1 uncharacterized protein LOC1114617816.4e-18184.8Show/hide
Query:  MAEAGANIGVNIPPFLNSASRSSLPSRTLK-PESTLTTKSNSWRTKALQLSAFPVASHST---FKQTDGGTLQPIIEADNPRNGRVFFLDVNPLCYEGSR
        MAEA ANIG+NIPPFLNS S +SLPSRTLK  ES  TTK+ SWRTK L+LS F  +S ST   FKQ + G L P +EADNPR GRVFFLDVNPLCY+GSR
Subjt:  MAEAGANIGVNIPPFLNSASRSSLPSRTLK-PESTLTTKSNSWRTKALQLSAFPVASHST---FKQTDGGTLQPIIEADNPRNGRVFFLDVNPLCYEGSR

Query:  PSLHNFGRWASIFFEQVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVDQ
        PSLHNFGRW SIFFE+VS SDPVIAV DGEGGSEHRRLLLPSYKAHRIKFTRQSSSQR+TKGNS R YQVIRDALR+CNVPV+K+ GHEADDVVATLV+Q
Subjt:  PSLHNFGRWASIFFEQVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVDQ

Query:  VLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS
        VLQRGFR VIASPDKDFKQLISEDVQLVMPLPELNRWSFYTL+HYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS
Subjt:  VLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS

Query:  AAAIRTVGRPYAQDALTKYADYLRMNYKVLALRRDVDVQFQEEWLVERDRQNDSAILSKFVENNDRNSLVQPSKR
        AAA+RTVG+PYAQDALTKYADYLR NYKVLALRRD+DVQF+EEWLV+RDRQNDS ILSKFVENNDRNSL +   +
Subjt:  AAAIRTVGRPYAQDALTKYADYLRMNYKVLALRRDVDVQFQEEWLVERDRQNDSAILSKFVENNDRNSLVQPSKR

A0A6J1JHT6 uncharacterized protein LOC1114871202.6e-18287.06Show/hide
Query:  MAEAGANIGVNIPPFLNSASRSSLPSRTLK-PESTLTTKSNSWRTKALQLSAFPVASHST---FKQTDGGTLQPIIEADNPRNGRVFFLDVNPLCYEGSR
        MAEA ANIG+NIPPFLNSAS +SLPSRTLK  ES  TTK+ SWRTK L+LS F  AS ST   FKQ + G L P +EADNPR GRVFFLDVNPLCY+GSR
Subjt:  MAEAGANIGVNIPPFLNSASRSSLPSRTLK-PESTLTTKSNSWRTKALQLSAFPVASHST---FKQTDGGTLQPIIEADNPRNGRVFFLDVNPLCYEGSR

Query:  PSLHNFGRWASIFFEQVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVDQ
        PSLHNFGRW SIFFE+VSHSDPVIAV DGEGGSEHRRLLLPSYKAHRIKFTRQSSSQR+TKGNS R YQVIRDALR+C+VPV+K+ GHEADDVVATLV+Q
Subjt:  PSLHNFGRWASIFFEQVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVDQ

Query:  VLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS
        VLQRGFR VIASPDKDFKQLISEDVQLVMPLPELNRWSFYTL+HYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS
Subjt:  VLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS

Query:  AAAIRTVGRPYAQDALTKYADYLRMNYKVLALRRDVDVQFQEEWLVERDRQNDSAILSKFVENNDRNSLVQ
        AAAIRTVG+PYAQDALTKYADYLR NYKVLALRRDVDVQF+EEWLVERDRQNDS ILSKFVENNDRNSL +
Subjt:  AAAIRTVGRPYAQDALTKYADYLRMNYKVLALRRDVDVQFQEEWLVERDRQNDSAILSKFVENNDRNSLVQ

SwissProt top hitse value%identityAlignment
O67550 5'-3' exonuclease3.6e-2429.07Show/hide
Query:  FLDVNPLCYEGSRPSLHNFGRWASIFFEQVSHSDP--VIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKV
        F  + PL      P+   +G +  + F  +    P  ++ VFD    ++ R  +   YK  R K       Q            VI++ L+   +P++++
Subjt:  FLDVNPLCYEGSRPSLHNFGRWASIFFEQVSHSDP--VIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKV

Query:  DGHEADDVVATLVDQVLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTA
         G+EADDV+A L ++  Q+GF+V I SPDKD  QL+SE+V ++ P+ +      +T    + ++  +P        ++GD+VD VPGI+    G G KTA
Subjt:  DGHEADDVVATLVDQVLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTA

Query:  MKLLKKHGSLENLLSAAAIRTVGRPYAQDALTKYADYLRMNYKVLALRRDVDVQFQEE
        + +LKK+GS+EN+L           + ++      + L ++YK++ L  D+D++  EE
Subjt:  MKLLKKHGSLENLLSAAAIRTVGRPYAQDALTKYADYLRMNYKVLALRRDVDVQFQEE

P52026 DNA polymerase I3.2e-2027.68Show/hide
Query:  LLPSYKAHRIKFTRQSSSQRYTKGNSRRP------YQVIRDALRNCNVPVVKVDGHEADDVVATLVDQVLQRGFRVVIASPDKDFKQLISEDVQLVMPLP
        +L ++ A +  F R  + Q Y  G  + P      + ++R+ L+   +P  ++D +EADD++ T+  +  + GF V + S D+D  QL S  V + +   
Subjt:  LLPSYKAHRIKFTRQSSSQRYTKGNSRRP------YQVIRDALRNCNVPVVKVDGHEADDVVATLVDQVLQRGFRVVIASPDKDFKQLISEDVQLVMPLP

Query:  ELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLSAAAIRTVGRPYAQDALTKYADYLRMNYKVLAL
         +     YT    + +Y   P   + L+ +MGD+ D +PG+    PG G KTA+KLLK+ G++EN+L  A+I  +     ++ L +Y D   ++ ++ A+
Subjt:  ELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLSAAAIRTVGRPYAQDALTKYADYLRMNYKVLAL

Query:  RRDVDVQFQEEWLVERDRQNDSAI
         RD  V+   + +V +    +  +
Subjt:  RRDVDVQFQEEWLVERDRQNDSAI

Q04957 DNA polymerase I1.1e-2029.95Show/hide
Query:  LLPSYKAHRIKFTRQSSSQRYTKGNSRRP------YQVIRDALRNCNVPVVKVDGHEADDVVATLVDQVLQRGFRVVIASPDKDFKQLISEDVQLVMPLP
        +L ++ A +  F R  + Q Y  G  + P      + ++R+ LR   +P  +++ +EADD++ TL  +  Q GF V + S D+D  QL S  V + +   
Subjt:  LLPSYKAHRIKFTRQSSSQRYTKGNSRRP------YQVIRDALRNCNVPVVKVDGHEADDVVATLVDQVLQRGFRVVIASPDKDFKQLISEDVQLVMPLP

Query:  ELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLSAAAIRTVGRPYAQDALTKYADYLRMNYKVLAL
         +     YT      +Y   P   + L+ +MGD+ D +PG+    PG G KTA+KLL++ G++EN+L  A+I  +     ++ L ++ +   ++ K+ A+
Subjt:  ELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLSAAAIRTVGRPYAQDALTKYADYLRMNYKVLAL

Query:  RRDVDVQ
        RRD  V+
Subjt:  RRDVDVQ

Q92GB7 DNA polymerase I1.7e-1827.85Show/hide
Query:  VIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVDQVLQRGFRVVIASPDKDFKQLIS
        V  VFD  GG   R  + P YKA+R         Q            ++RD   N N P+++ +G+EADD++AT   +    G  VVI S DKD  QL++
Subjt:  VIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVDQVLQRGFRVVIASPDKDFKQLIS

Query:  EDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLSAAAIRTVGRPYAQDALTKYADY
        E++++  PL    +  + T    + ++         +  ++GD  D +PG+    P  G KTA  L+ + GS+EN+ +  ++  V     ++ L    + 
Subjt:  EDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLSAAAIRTVGRPYAQDALTKYADY

Query:  LRMNYKVLALRRDVDVQFQ
          ++++++ L  +VD+ FQ
Subjt:  LRMNYKVLALRRDVDVQFQ

Q9RLB6 DNA polymerase I6.0e-1928.77Show/hide
Query:  VIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVDQVLQRGFRVVIASPDKDFKQLIS
        V  VFD  GG   R  + P YKA+R         Q            ++RD   N N P+++ +G+EADD++AT   +    G  VVI S DKD  QL+S
Subjt:  VIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVDQVLQRGFRVVIASPDKDFKQLIS

Query:  EDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLSAAAIRTVGRPYAQDALTKYADY
        E++++  PL    R  + T    + ++         +  ++GD  D +PG+    P  G KTA  L+ + GS+EN+ +  ++  V     ++ L    + 
Subjt:  EDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLSAAAIRTVGRPYAQDALTKYADY

Query:  LRMNYKVLALRRDVDVQFQ
          ++++++ L  +VD+ FQ
Subjt:  LRMNYKVLALRRDVDVQFQ

Arabidopsis top hitse value%identityAlignment
AT1G34380.1 5'-3' exonuclease family protein1.4e-6657.14Show/hide
Query:  KSNSWRTKALQLSAFPVASHS---TFKQTDGGTLQPIIEAD---------NPRNGRVFFLDVNPLCYEGSRPSLHNFGRWASIFFEQVSHSDPVIAVFDG
        K+   RTK +  S+   +SHS   TF +T  G +Q +++ D           +N RVFFLDV+PLCYEG++PS   FG W S+FF QVS +DPVIAV DG
Subjt:  KSNSWRTKALQLSAFPVASHS---TFKQTDGGTLQPIIEAD---------NPRNGRVFFLDVNPLCYEGSRPSLHNFGRWASIFFEQVSHSDPVIAVFDG

Query:  EGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVDQVLQRGFRVVIASPDKDFKQLISEDVQLVM
        E G++ RR LLPSYKAHR    +  +  RY    S+RP+Q + + LR CNVPVV+++GHEADDVVATL++Q +QRG+R VIASPDKDFKQLISE+VQ+V+
Subjt:  EGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVDQVLQRGFRVVIASPDKDFKQLISEDVQLVM

Query:  PLPELNRWSFYTLRHYLAQYNCDPCSDLSLR
        PL +L RWSFYTL+HY AQYNCDP SDLS R
Subjt:  PLPELNRWSFYTLRHYLAQYNCDPCSDLSLR

AT1G34380.2 5'-3' exonuclease family protein3.2e-11664.05Show/hide
Query:  KSNSWRTKALQLSAFPVASHS---TFKQTDGGTLQPIIEAD---------NPRNGRVFFLDVNPLCYEGSRPSLHNFGRWASIFFEQVSHSDPVIAVFDG
        K+   RTK +  S+   +SHS   TF +T  G +Q +++ D           +N RVFFLDV+PLCYEG++PS   FG W S+FF QVS +DPVIAV DG
Subjt:  KSNSWRTKALQLSAFPVASHS---TFKQTDGGTLQPIIEAD---------NPRNGRVFFLDVNPLCYEGSRPSLHNFGRWASIFFEQVSHSDPVIAVFDG

Query:  EGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVDQVLQRGFRVVIASPDKDFKQLISEDVQLVM
        E G++ RR LLPSYKAHR    +  +  RY    S+RP+Q + + LR CNVPVV+++GHEADDVVATL++Q +QRG+R VIASPDKDFKQLISE+VQ+V+
Subjt:  EGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVDQVLQRGFRVVIASPDKDFKQLISEDVQLVM

Query:  PLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLSAAAIRTVGRPYAQDALTKYADYLRMNYKV
        PL +L RWSFYTL+HY AQYNCDP SDLS RCIMGDEVDGVPGIQH+ P FGRKTAMKL++KHGSLE+LLSAAA+RTVGRPYAQ+ALTKYADYLR NY+V
Subjt:  PLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLSAAAIRTVGRPYAQDALTKYADYLRMNYKV

Query:  LALRRDVDVQFQEEWLVERDRQNDSAILSKF
        LAL RDV VQ Q EWL+ERD  NDS +LS F
Subjt:  LALRRDVDVQFQEEWLVERDRQNDSAILSKF

AT3G52050.2 5'-3' exonuclease family protein3.6e-1927.18Show/hide
Query:  GSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVDQVLQRGFRVVIASPDKDFKQLISEDVQLVMPL
        G   R  L P+YK++R            T     +  Q ++ +++  ++ V++V G EADDV+ TL  + +  GF+V + SPDKDF Q++S  ++L+   
Subjt:  GSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVDQVLQRGFRVVIASPDKDFKQLISEDVQLVMPL

Query:  PELNRWSFYTLRHYLAQY-NCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLSAAAIRTVGRPYAQDALTKYADYLRMNYKVL
        P  +  + + +  +  ++ N +P   + +  + GD+ D +PG+     G G   A++L+ + G+LENLL   ++  +     +++L   AD   ++ K+ 
Subjt:  PELNRWSFYTLRHYLAQY-NCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLSAAAIRTVGRPYAQDALTKYADYLRMNYKVL

Query:  ALRRDV
         LR D+
Subjt:  ALRRDV

AT3G52050.4 5'-3' exonuclease family protein1.9e-2027.73Show/hide
Query:  VIAVFDGEG-----GSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVDQVLQRGFRVVIASPDKDF
        V  VFD +G     G   R  L P+YK++R            T     +  Q ++ +++  ++ V++V G EADDV+ TL  + +  GF+V + SPDKDF
Subjt:  VIAVFDGEG-----GSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVDQVLQRGFRVVIASPDKDF

Query:  KQLISEDVQLVMPLPELNRWSFYTLRHYLAQY-NCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLSAAAIRTVGRPYAQDAL
         Q++S  ++L+   P  +  + + +  +  ++ N +P   + +  + GD+ D +PG+     G G   A++L+ + G+LENLL   ++  +     +++L
Subjt:  KQLISEDVQLVMPLPELNRWSFYTLRHYLAQY-NCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLSAAAIRTVGRPYAQDAL

Query:  TKYADYLRMNYKVLALRRDV
           AD   ++ K+  LR D+
Subjt:  TKYADYLRMNYKVLALRRDV

AT3G52050.5 5'-3' exonuclease family protein1.2e-1926.89Show/hide
Query:  NGRVFFLDVNPLCYEGSRPSL---------HNFGR--WASIFFEQVSHSDPV-------IAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNS
        NGRV  +D   + Y      L         H  G   W    F  +S    V       +AV     G   R  L P+YK++R            T    
Subjt:  NGRVFFLDVNPLCYEGSRPSL---------HNFGR--WASIFFEQVSHSDPV-------IAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNS

Query:  RRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVDQVLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQY-NCDPCSDLSLRCIM
         +  Q ++ +++  ++ V++V G EADDV+ TL  + +  GF+V + SPDKDF Q++S  ++L+   P  +  + + +  +  ++ N +P   + +  + 
Subjt:  RRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVDQVLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQY-NCDPCSDLSLRCIM

Query:  GDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLSA
        GD+ D +PG+     G G   A++L+ + G+LENLL +
Subjt:  GDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLSA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTGAGGCGGGCGCCAACATAGGCGTGAATATTCCGCCATTTCTGAACTCTGCATCGCGTAGTTCCTTACCCTCCAGAACTCTGAAACCAGAATCGACATTAACAAC
GAAATCCAATTCATGGAGAACGAAGGCGTTGCAGCTAAGTGCCTTTCCGGTAGCATCTCATTCTACCTTTAAACAAACAGATGGCGGGACGTTGCAGCCAATAATTGAAG
CGGATAATCCAAGAAATGGAAGGGTCTTTTTCCTGGACGTAAATCCATTATGCTACGAAGGTAGCAGACCTAGTTTGCACAATTTTGGTCGCTGGGCTTCCATATTCTTC
GAGCAAGTTAGCCACAGTGATCCTGTTATTGCTGTTTTTGATGGGGAAGGAGGTAGTGAGCATCGCAGGCTGTTATTACCCTCATATAAAGCACATCGGATCAAATTCAC
GAGACAATCATCTTCACAAAGATATACTAAGGGAAATTCTAGAAGGCCATATCAAGTAATAAGAGATGCTCTCAGAAACTGTAATGTGCCAGTTGTAAAGGTTGACGGTC
ATGAAGCAGATGATGTTGTAGCTACACTTGTGGACCAAGTTTTACAAAGAGGGTTTCGGGTGGTTATAGCCTCTCCTGATAAAGATTTCAAGCAGTTGATTTCAGAAGAT
GTCCAACTCGTGATGCCTTTGCCAGAGCTCAACAGATGGTCCTTTTACACCTTAAGGCACTACCTAGCTCAGTATAACTGCGATCCATGCTCTGACTTGAGTCTGAGATG
TATTATGGGTGATGAGGTAGATGGCGTTCCGGGAATCCAGCATGTTGCTCCTGGATTTGGTCGAAAAACTGCAATGAAGCTCTTGAAGAAACATGGTTCTTTGGAGAATC
TACTCAGTGCTGCTGCAATAAGAACTGTGGGCAGACCGTATGCACAAGATGCACTTACAAAGTATGCCGATTACCTGCGTATGAATTATAAAGTTCTAGCCCTGAGAAGA
GATGTTGATGTTCAATTTCAAGAGGAGTGGTTGGTTGAAAGAGACAGACAAAACGATTCGGCTATTTTATCCAAGTTTGTAGAAAACAATGACAGAAACTCACTTGTTCA
ACCATCAAAACGGGTC
mRNA sequenceShow/hide mRNA sequence
ATGGCTGAGGCGGGCGCCAACATAGGCGTGAATATTCCGCCATTTCTGAACTCTGCATCGCGTAGTTCCTTACCCTCCAGAACTCTGAAACCAGAATCGACATTAACAAC
GAAATCCAATTCATGGAGAACGAAGGCGTTGCAGCTAAGTGCCTTTCCGGTAGCATCTCATTCTACCTTTAAACAAACAGATGGCGGGACGTTGCAGCCAATAATTGAAG
CGGATAATCCAAGAAATGGAAGGGTCTTTTTCCTGGACGTAAATCCATTATGCTACGAAGGTAGCAGACCTAGTTTGCACAATTTTGGTCGCTGGGCTTCCATATTCTTC
GAGCAAGTTAGCCACAGTGATCCTGTTATTGCTGTTTTTGATGGGGAAGGAGGTAGTGAGCATCGCAGGCTGTTATTACCCTCATATAAAGCACATCGGATCAAATTCAC
GAGACAATCATCTTCACAAAGATATACTAAGGGAAATTCTAGAAGGCCATATCAAGTAATAAGAGATGCTCTCAGAAACTGTAATGTGCCAGTTGTAAAGGTTGACGGTC
ATGAAGCAGATGATGTTGTAGCTACACTTGTGGACCAAGTTTTACAAAGAGGGTTTCGGGTGGTTATAGCCTCTCCTGATAAAGATTTCAAGCAGTTGATTTCAGAAGAT
GTCCAACTCGTGATGCCTTTGCCAGAGCTCAACAGATGGTCCTTTTACACCTTAAGGCACTACCTAGCTCAGTATAACTGCGATCCATGCTCTGACTTGAGTCTGAGATG
TATTATGGGTGATGAGGTAGATGGCGTTCCGGGAATCCAGCATGTTGCTCCTGGATTTGGTCGAAAAACTGCAATGAAGCTCTTGAAGAAACATGGTTCTTTGGAGAATC
TACTCAGTGCTGCTGCAATAAGAACTGTGGGCAGACCGTATGCACAAGATGCACTTACAAAGTATGCCGATTACCTGCGTATGAATTATAAAGTTCTAGCCCTGAGAAGA
GATGTTGATGTTCAATTTCAAGAGGAGTGGTTGGTTGAAAGAGACAGACAAAACGATTCGGCTATTTTATCCAAGTTTGTAGAAAACAATGACAGAAACTCACTTGTTCA
ACCATCAAAACGGGTC
Protein sequenceShow/hide protein sequence
MAEAGANIGVNIPPFLNSASRSSLPSRTLKPESTLTTKSNSWRTKALQLSAFPVASHSTFKQTDGGTLQPIIEADNPRNGRVFFLDVNPLCYEGSRPSLHNFGRWASIFF
EQVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVDQVLQRGFRVVIASPDKDFKQLISED
VQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLSAAAIRTVGRPYAQDALTKYADYLRMNYKVLALRR
DVDVQFQEEWLVERDRQNDSAILSKFVENNDRNSLVQPSKRV