; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g39680 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g39680
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description5'-3' exonuclease
Genome locationchr4:29435747..29440985
RNA-Seq ExpressionMoc04g39680
SyntenyMoc04g39680
Gene Ontology termsGO:0006261 - DNA-dependent DNA replication (biological process)
GO:0006302 - double-strand break repair (biological process)
GO:0071897 - DNA biosynthetic process (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0003677 - DNA binding (molecular function)
GO:0003887 - DNA-directed DNA polymerase activity (molecular function)
GO:0004527 - exonuclease activity (molecular function)
InterPro domainsIPR002298 - DNA polymerase A
IPR002421 - 5'-3' exonuclease
IPR018790 - Protein of unknown function DUF2358
IPR020045 - DNA polymerase I-like, H3TH domain
IPR020046 - 5'-3' exonuclease, alpha-helical arch, N-terminal
IPR029060 - PIN-like domain superfamily
IPR032710 - NTF2-like domain superfamily
IPR036279 - 5'-3' exonuclease, C-terminal domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6602717.1 hypothetical protein SDJN03_07950, partial [Cucurbita argyrosperma subsp. sororia]1.0e-25071.41Show/hide
Query:  MAEAGANIGVNIPPFLNSASRSSLPSRTLK-PESTLTTKSNSWRTKALQLSAFPVASHST---FKQTDGGTLQPIIEADNPRKGRVFFLDVNPLCYEGSR
        MAEA AN+G+NIPPFLNS S +SLPSRTLK  ES  TTK+  WRTK L+LS F  +S ST   FKQ + G L P +EADNPRKGRVFFLDVNPLCY+GSR
Subjt:  MAEAGANIGVNIPPFLNSASRSSLPSRTLK-PESTLTTKSNSWRTKALQLSAFPVASHST---FKQTDGGTLQPIIEADNPRKGRVFFLDVNPLCYEGSR

Query:  PSLHNFGRWASIFFEQVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVDQ
        PSLHNFGRW SIFFE+VS SDPVIAV DGEGGSEHRRLLLPSYKAHRIKFTRQSSSQR+TKGNS R YQVIRDALR+CNVPV+K+ GHEADDVVATLV+Q
Subjt:  PSLHNFGRWASIFFEQVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVDQ

Query:  VLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS
        VLQRGFR VIASPDKDFKQLISEDVQLVMPLPELNRWSFYTL+HYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS
Subjt:  VLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS

Query:  AAAIRTVGRPYAQDALTKYADYLRTNYKVLALRRDVDVQFQEEWLVERDR----------QNDSAIL----------------------------SKFVE
        AAA+RTVG+PYAQDALTKYADYLRTNYKVLALRRD+D    E  L   +           Q +  ++                            + +  
Subjt:  AAAIRTVGRPYAQDALTKYADYLRTNYKVLALRRDVDVQFQEEWLVERDR----------QNDSAIL----------------------------SKFVE

Query:  NNDRNS------------LVQPSKRAASSAV-----------------------------------------DDITFIDPLNTFSGIERYKLIFWALRFH
        ++  +S            L+Q  + A+ SA                                          DDITFIDPLNTFSGIERYKLIFWALRFH
Subjt:  NNDRNS------------LVQPSKRAASSAV-----------------------------------------DDITFIDPLNTFSGIERYKLIFWALRFH

Query:  ARILFREIGIEVYRIWQPSENVILIRWNLKGVPRVPWEVRGEFQGTSRYKLDRNGKIYEHKVDNLAFNFPQQLKPAASVLDLVTACPASPNPTFLWGTEE
         RILFREIGIEVYRIWQPSENVILIRWNLKGVPRVPWE RGEFQGTSR+K+DRNGKIYEHKVDNLAFNFPQ LKPAASVLDLVTACPASPNPTF WGTEE
Subjt:  ARILFREIGIEVYRIWQPSENVILIRWNLKGVPRVPWEVRGEFQGTSRYKLDRNGKIYEHKVDNLAFNFPQQLKPAASVLDLVTACPASPNPTFLWGTEE

Query:  LHCSSWVELYQAVRTSVGGEGYLITQDGFLTCS
        LHCSSWVELYQAVR SVGGEGYLITQDGFLTCS
Subjt:  LHCSSWVELYQAVRTSVGGEGYLITQDGFLTCS

KAG7033404.1 hypothetical protein SDJN02_07460 [Cucurbita argyrosperma subsp. argyrosperma]1.6e-18284.92Show/hide
Query:  MAEAGANIGVNIPPFLNSASRSSLPSRTLK-PESTLTTKSNSWRTKALQLSAFPVASHST---FKQTDGGTLQPIIEADNPRKGRVFFLDVNPLCYEGSR
        MAEA ANIG+NIPPFLNS S +SLPSRTLK  ES  TTK+  WRTK L+LS FP +S ST   FKQ + G L P +EADNPRKGRVFFLDVNPLCY+GSR
Subjt:  MAEAGANIGVNIPPFLNSASRSSLPSRTLK-PESTLTTKSNSWRTKALQLSAFPVASHST---FKQTDGGTLQPIIEADNPRKGRVFFLDVNPLCYEGSR

Query:  PSLHNFGRWASIFFEQVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVDQ
        PSLHNFGRW SIFFE+VS SDPVIAV DGEGGSEHRRLLLPSYKAHRIKFTRQSSSQR+TKGNS R YQVIRDALR+CNVPV+K+ GHEADDVVATLV+Q
Subjt:  PSLHNFGRWASIFFEQVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVDQ

Query:  VLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS
        VLQRGFR VIASPDKDFKQLISEDVQLVMPLPELNRWSFYTL+HYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS
Subjt:  VLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS

Query:  AAAIRTVGRPYAQDALTKYADYLRTNYKVLALRRDVDVQFQEEWLVERDRQNDSAILSKFVENNDRNSLVQPSKRAAS
        AAA+RTVG+PYAQDALTKYADYLRTNYKVLALRRD+DVQF+EEWLV+RDRQNDS ILSKFVENNDRNSL +   +  S
Subjt:  AAAIRTVGRPYAQDALTKYADYLRTNYKVLALRRDVDVQFQEEWLVERDRQNDSAILSKFVENNDRNSLVQPSKRAAS

XP_022152532.1 uncharacterized protein LOC111020233 [Momordica charantia]7.2e-212100Show/hide
Query:  MAEAGANIGVNIPPFLNSASRSSLPSRTLKPESTLTTKSNSWRTKALQLSAFPVASHSTFKQTDGGTLQPIIEADNPRKGRVFFLDVNPLCYEGSRPSLH
        MAEAGANIGVNIPPFLNSASRSSLPSRTLKPESTLTTKSNSWRTKALQLSAFPVASHSTFKQTDGGTLQPIIEADNPRKGRVFFLDVNPLCYEGSRPSLH
Subjt:  MAEAGANIGVNIPPFLNSASRSSLPSRTLKPESTLTTKSNSWRTKALQLSAFPVASHSTFKQTDGGTLQPIIEADNPRKGRVFFLDVNPLCYEGSRPSLH

Query:  NFGRWASIFFEQVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVDQVLQR
        NFGRWASIFFEQVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVDQVLQR
Subjt:  NFGRWASIFFEQVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVDQVLQR

Query:  GFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLSAAAI
        GFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLSAAAI
Subjt:  GFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLSAAAI

Query:  RTVGRPYAQDALTKYADYLRTNYKVLALRRDVDVQFQEEWLVERDRQNDSAILSKFVENNDRNSLVQPSKR
        RTVGRPYAQDALTKYADYLRTNYKVLALRRDVDVQFQEEWLVERDRQNDSAILSKFVENNDRNSLVQPSKR
Subjt:  RTVGRPYAQDALTKYADYLRTNYKVLALRRDVDVQFQEEWLVERDRQNDSAILSKFVENNDRNSLVQPSKR

XP_022961199.1 uncharacterized protein LOC111461781 [Cucurbita moschata]4.5e-18284.92Show/hide
Query:  MAEAGANIGVNIPPFLNSASRSSLPSRTLK-PESTLTTKSNSWRTKALQLSAFPVASHST---FKQTDGGTLQPIIEADNPRKGRVFFLDVNPLCYEGSR
        MAEA ANIG+NIPPFLNS S +SLPSRTLK  ES  TTK+ SWRTK L+LS F  +S ST   FKQ + G L P +EADNPRKGRVFFLDVNPLCY+GSR
Subjt:  MAEAGANIGVNIPPFLNSASRSSLPSRTLK-PESTLTTKSNSWRTKALQLSAFPVASHST---FKQTDGGTLQPIIEADNPRKGRVFFLDVNPLCYEGSR

Query:  PSLHNFGRWASIFFEQVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVDQ
        PSLHNFGRW SIFFE+VS SDPVIAV DGEGGSEHRRLLLPSYKAHRIKFTRQSSSQR+TKGNS R YQVIRDALR+CNVPV+K+ GHEADDVVATLV+Q
Subjt:  PSLHNFGRWASIFFEQVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVDQ

Query:  VLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS
        VLQRGFR VIASPDKDFKQLISEDVQLVMPLPELNRWSFYTL+HYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS
Subjt:  VLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS

Query:  AAAIRTVGRPYAQDALTKYADYLRTNYKVLALRRDVDVQFQEEWLVERDRQNDSAILSKFVENNDRNSLVQPSKRAAS
        AAA+RTVG+PYAQDALTKYADYLRTNYKVLALRRD+DVQF+EEWLV+RDRQNDS ILSKFVENNDRNSL +   +  S
Subjt:  AAAIRTVGRPYAQDALTKYADYLRTNYKVLALRRDVDVQFQEEWLVERDRQNDSAILSKFVENNDRNSLVQPSKRAAS

XP_022990137.1 uncharacterized protein LOC111487120 [Cucurbita maxima]1.8e-18386.24Show/hide
Query:  MAEAGANIGVNIPPFLNSASRSSLPSRTLK-PESTLTTKSNSWRTKALQLSAFPVASHST---FKQTDGGTLQPIIEADNPRKGRVFFLDVNPLCYEGSR
        MAEA ANIG+NIPPFLNSAS +SLPSRTLK  ES  TTK+ SWRTK L+LS F  AS ST   FKQ + G L P +EADNPRKGRVFFLDVNPLCY+GSR
Subjt:  MAEAGANIGVNIPPFLNSASRSSLPSRTLK-PESTLTTKSNSWRTKALQLSAFPVASHST---FKQTDGGTLQPIIEADNPRKGRVFFLDVNPLCYEGSR

Query:  PSLHNFGRWASIFFEQVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVDQ
        PSLHNFGRW SIFFE+VSHSDPVIAV DGEGGSEHRRLLLPSYKAHRIKFTRQSSSQR+TKGNS R YQVIRDALR+C+VPV+K+ GHEADDVVATLV+Q
Subjt:  PSLHNFGRWASIFFEQVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVDQ

Query:  VLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS
        VLQRGFR VIASPDKDFKQLISEDVQLVMPLPELNRWSFYTL+HYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS
Subjt:  VLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS

Query:  AAAIRTVGRPYAQDALTKYADYLRTNYKVLALRRDVDVQFQEEWLVERDRQNDSAILSKFVENNDRNSLVQPSKRAAS
        AAAIRTVG+PYAQDALTKYADYLRTNYKVLALRRDVDVQF+EEWLVERDRQNDS ILSKFVENNDRNSL +   +  S
Subjt:  AAAIRTVGRPYAQDALTKYADYLRTNYKVLALRRDVDVQFQEEWLVERDRQNDSAILSKFVENNDRNSLVQPSKRAAS

TrEMBL top hitse value%identityAlignment
A0A0A0LQN4 53EXOc domain-containing protein3.9e-17984.31Show/hide
Query:  MAEAGANIGVNIPPFLNSASRSSLPSRTLKPESTLTT--KSNSWRTKALQLSAFPVASHST---FKQTDGGTLQPIIEADNPRKGRVFFLDVNPLCYEGS
        MAEA ANIGVN PPFLNS+S + LPSRTLKPE  LT+  K N+WRTK L L+AF  +S  T   F QTD G  QP IEADN R GRVFFLDVNPLCY+GS
Subjt:  MAEAGANIGVNIPPFLNSASRSSLPSRTLKPESTLTT--KSNSWRTKALQLSAFPVASHST---FKQTDGGTLQPIIEADNPRKGRVFFLDVNPLCYEGS

Query:  RPSLHNFGRWASIFFEQVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVD
        +PSL NFGRW SIFFE+VSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTR  SS+R+TKGN R  YQVIRDALR+CNVPVV+V+GHEADDV+ATLV+
Subjt:  RPSLHNFGRWASIFFEQVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVD

Query:  QVLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLL
        QVLQRG RVV+ASPDKDFKQLISED+QLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTA+KLLKKHGSLENLL
Subjt:  QVLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLL

Query:  SAAAIRTVGRPYAQDALTKYADYLRTNYKVLALRRDVDVQFQEEWLVERDRQNDSAILSKFVENNDRNSLVQPSKR
        SAAAIRTVG+PYAQDALTKYA+YLRTNYKVLALRRDVDVQFQ+EWLVERDR+NDS ILSKFVENNDRN LVQPSK+
Subjt:  SAAAIRTVGRPYAQDALTKYADYLRTNYKVLALRRDVDVQFQEEWLVERDRQNDSAILSKFVENNDRNSLVQPSKR

A0A1S3B2X7 5'-3' exonuclease5.6e-17884.49Show/hide
Query:  MAEAGANIGVNIPPFLNSASRSSLPSRTLKPESTLTTKSNSWRTKALQLSAFPVASHST---FKQTDGGTLQPIIEADNPRKGRVFFLDVNPLCYEGSRP
        M EA A IGVN PPFLNS+SR+ LPSRT     T   K N+WRTK L+L+AF  +S  T   F QTD G  QP IEADNPRKGRVFFLDVNPLCY+G++P
Subjt:  MAEAGANIGVNIPPFLNSASRSSLPSRTLKPESTLTTKSNSWRTKALQLSAFPVASHST---FKQTDGGTLQPIIEADNPRKGRVFFLDVNPLCYEGSRP

Query:  SLHNFGRWASIFFEQVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVDQV
        SL NFGRW SIFFE+VSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTR  SSQR+TKGN R  YQVIRDALR+CNVPVVKVDGHEADDVVATLV+QV
Subjt:  SLHNFGRWASIFFEQVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVDQV

Query:  LQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLSA
        LQRG RVV+ASPDKDFKQLISEDVQLVMPLPELNRWSFYT+RHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTA+KLLKKHGSLENLLSA
Subjt:  LQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLSA

Query:  AAIRTVGRPYAQDALTKYADYLRTNYKVLALRRDVDVQFQEEWLVERDRQNDSAILSKFVENNDRNSLVQPSKR
        AAIRTVG+PYAQDALTKYA+YLRTNYKVLALRRDVDVQFQ+EWLVERDR+NDS ILSKFVENNDRN LVQPSK+
Subjt:  AAIRTVGRPYAQDALTKYADYLRTNYKVLALRRDVDVQFQEEWLVERDRQNDSAILSKFVENNDRNSLVQPSKR

A0A6J1DGI1 uncharacterized protein LOC1110202333.5e-212100Show/hide
Query:  MAEAGANIGVNIPPFLNSASRSSLPSRTLKPESTLTTKSNSWRTKALQLSAFPVASHSTFKQTDGGTLQPIIEADNPRKGRVFFLDVNPLCYEGSRPSLH
        MAEAGANIGVNIPPFLNSASRSSLPSRTLKPESTLTTKSNSWRTKALQLSAFPVASHSTFKQTDGGTLQPIIEADNPRKGRVFFLDVNPLCYEGSRPSLH
Subjt:  MAEAGANIGVNIPPFLNSASRSSLPSRTLKPESTLTTKSNSWRTKALQLSAFPVASHSTFKQTDGGTLQPIIEADNPRKGRVFFLDVNPLCYEGSRPSLH

Query:  NFGRWASIFFEQVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVDQVLQR
        NFGRWASIFFEQVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVDQVLQR
Subjt:  NFGRWASIFFEQVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVDQVLQR

Query:  GFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLSAAAI
        GFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLSAAAI
Subjt:  GFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLSAAAI

Query:  RTVGRPYAQDALTKYADYLRTNYKVLALRRDVDVQFQEEWLVERDRQNDSAILSKFVENNDRNSLVQPSKR
        RTVGRPYAQDALTKYADYLRTNYKVLALRRDVDVQFQEEWLVERDRQNDSAILSKFVENNDRNSLVQPSKR
Subjt:  RTVGRPYAQDALTKYADYLRTNYKVLALRRDVDVQFQEEWLVERDRQNDSAILSKFVENNDRNSLVQPSKR

A0A6J1HDC1 uncharacterized protein LOC1114617812.2e-18284.92Show/hide
Query:  MAEAGANIGVNIPPFLNSASRSSLPSRTLK-PESTLTTKSNSWRTKALQLSAFPVASHST---FKQTDGGTLQPIIEADNPRKGRVFFLDVNPLCYEGSR
        MAEA ANIG+NIPPFLNS S +SLPSRTLK  ES  TTK+ SWRTK L+LS F  +S ST   FKQ + G L P +EADNPRKGRVFFLDVNPLCY+GSR
Subjt:  MAEAGANIGVNIPPFLNSASRSSLPSRTLK-PESTLTTKSNSWRTKALQLSAFPVASHST---FKQTDGGTLQPIIEADNPRKGRVFFLDVNPLCYEGSR

Query:  PSLHNFGRWASIFFEQVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVDQ
        PSLHNFGRW SIFFE+VS SDPVIAV DGEGGSEHRRLLLPSYKAHRIKFTRQSSSQR+TKGNS R YQVIRDALR+CNVPV+K+ GHEADDVVATLV+Q
Subjt:  PSLHNFGRWASIFFEQVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVDQ

Query:  VLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS
        VLQRGFR VIASPDKDFKQLISEDVQLVMPLPELNRWSFYTL+HYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS
Subjt:  VLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS

Query:  AAAIRTVGRPYAQDALTKYADYLRTNYKVLALRRDVDVQFQEEWLVERDRQNDSAILSKFVENNDRNSLVQPSKRAAS
        AAA+RTVG+PYAQDALTKYADYLRTNYKVLALRRD+DVQF+EEWLV+RDRQNDS ILSKFVENNDRNSL +   +  S
Subjt:  AAAIRTVGRPYAQDALTKYADYLRTNYKVLALRRDVDVQFQEEWLVERDRQNDSAILSKFVENNDRNSLVQPSKRAAS

A0A6J1JHT6 uncharacterized protein LOC1114871208.9e-18486.24Show/hide
Query:  MAEAGANIGVNIPPFLNSASRSSLPSRTLK-PESTLTTKSNSWRTKALQLSAFPVASHST---FKQTDGGTLQPIIEADNPRKGRVFFLDVNPLCYEGSR
        MAEA ANIG+NIPPFLNSAS +SLPSRTLK  ES  TTK+ SWRTK L+LS F  AS ST   FKQ + G L P +EADNPRKGRVFFLDVNPLCY+GSR
Subjt:  MAEAGANIGVNIPPFLNSASRSSLPSRTLK-PESTLTTKSNSWRTKALQLSAFPVASHST---FKQTDGGTLQPIIEADNPRKGRVFFLDVNPLCYEGSR

Query:  PSLHNFGRWASIFFEQVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVDQ
        PSLHNFGRW SIFFE+VSHSDPVIAV DGEGGSEHRRLLLPSYKAHRIKFTRQSSSQR+TKGNS R YQVIRDALR+C+VPV+K+ GHEADDVVATLV+Q
Subjt:  PSLHNFGRWASIFFEQVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVDQ

Query:  VLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS
        VLQRGFR VIASPDKDFKQLISEDVQLVMPLPELNRWSFYTL+HYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS
Subjt:  VLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLS

Query:  AAAIRTVGRPYAQDALTKYADYLRTNYKVLALRRDVDVQFQEEWLVERDRQNDSAILSKFVENNDRNSLVQPSKRAAS
        AAAIRTVG+PYAQDALTKYADYLRTNYKVLALRRDVDVQF+EEWLVERDRQNDS ILSKFVENNDRNSL +   +  S
Subjt:  AAAIRTVGRPYAQDALTKYADYLRTNYKVLALRRDVDVQFQEEWLVERDRQNDSAILSKFVENNDRNSLVQPSKRAAS

SwissProt top hitse value%identityAlignment
O67550 5'-3' exonuclease1.2e-2329.07Show/hide
Query:  FLDVNPLCYEGSRPSLHNFGRWASIFFEQVSHSDP--VIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKV
        F  + PL      P+   +G +  + F  +    P  ++ VFD    ++ R  +   YK  R K       Q            VI++ L+   +P++++
Subjt:  FLDVNPLCYEGSRPSLHNFGRWASIFFEQVSHSDP--VIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKV

Query:  DGHEADDVVATLVDQVLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTA
         G+EADDV+A L ++  Q+GF+V I SPDKD  QL+SE+V ++ P+ +      +T    + ++  +P        ++GD+VD VPGI+    G G KTA
Subjt:  DGHEADDVVATLVDQVLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTA

Query:  MKLLKKHGSLENLLSAAAIRTVGRPYAQDALTKYADYLRTNYKVLALRRDVDVQFQEE
        + +LKK+GS+EN+L           + ++      + L  +YK++ L  D+D++  EE
Subjt:  MKLLKKHGSLENLLSAAAIRTVGRPYAQDALTKYADYLRTNYKVLALRRDVDVQFQEE

P52026 DNA polymerase I5.4e-2125.61Show/hide
Query:  KGRVFFLDVNPLCYEG--SRPSLHNFGRWASIFFEQVSHSDPVIA---VFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRP------YQVI
        K ++  +D N + Y    + P LHN         ++  H++ V     + +     E    +L ++ A +  F R  + Q Y  G  + P      + ++
Subjt:  KGRVFFLDVNPLCYEG--SRPSLHNFGRWASIFFEQVSHSDPVIA---VFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRP------YQVI

Query:  RDALRNCNVPVVKVDGHEADDVVATLVDQVLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVP
        R+ L+   +P  ++D +EADD++ T+  +  + GF V + S D+D  QL S  V + +    +     YT    + +Y   P   + L+ +MGD+ D +P
Subjt:  RDALRNCNVPVVKVDGHEADDVVATLVDQVLQRGFRVVIASPDKDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVP

Query:  GIQHVAPGFGRKTAMKLLKKHGSLENLLSAAAIRTVGRPYAQDALTKYADYLRTNYKVLALRRDVDVQFQEEWLVERDRQNDSAI
        G+    PG G KTA+KLLK+ G++EN+L  A+I  +     ++ L +Y D    + ++ A+ RD  V+   + +V +    +  +
Subjt:  GIQHVAPGFGRKTAMKLLKKHGSLENLLSAAAIRTVGRPYAQDALTKYADYLRTNYKVLALRRDVDVQFQEEWLVERDRQNDSAI

Q04957 DNA polymerase I3.5e-2029.95Show/hide
Query:  LLPSYKAHRIKFTRQSSSQRYTKGNSRRP------YQVIRDALRNCNVPVVKVDGHEADDVVATLVDQVLQRGFRVVIASPDKDFKQLISEDVQLVMPLP
        +L ++ A +  F R  + Q Y  G  + P      + ++R+ LR   +P  +++ +EADD++ TL  +  Q GF V + S D+D  QL S  V + +   
Subjt:  LLPSYKAHRIKFTRQSSSQRYTKGNSRRP------YQVIRDALRNCNVPVVKVDGHEADDVVATLVDQVLQRGFRVVIASPDKDFKQLISEDVQLVMPLP

Query:  ELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLSAAAIRTVGRPYAQDALTKYADYLRTNYKVLAL
         +     YT      +Y   P   + L+ +MGD+ D +PG+    PG G KTA+KLL++ G++EN+L  A+I  +     ++ L ++ +    + K+ A+
Subjt:  ELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLSAAAIRTVGRPYAQDALTKYADYLRTNYKVLAL

Query:  RRDVDVQ
        RRD  V+
Subjt:  RRDVDVQ

Q92GB7 DNA polymerase I4.3e-1827.85Show/hide
Query:  VIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVDQVLQRGFRVVIASPDKDFKQLIS
        V  VFD  GG   R  + P YKA+R         Q            ++RD   N N P+++ +G+EADD++AT   +    G  VVI S DKD  QL++
Subjt:  VIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVDQVLQRGFRVVIASPDKDFKQLIS

Query:  EDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLSAAAIRTVGRPYAQDALTKYADY
        E++++  PL    +  + T    + ++         +  ++GD  D +PG+    P  G KTA  L+ + GS+EN+ +  ++  V     ++ L    + 
Subjt:  EDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLSAAAIRTVGRPYAQDALTKYADY

Query:  LRTNYKVLALRRDVDVQFQ
           +++++ L  +VD+ FQ
Subjt:  LRTNYKVLALRRDVDVQFQ

Q9RLB6 DNA polymerase I1.5e-1828.77Show/hide
Query:  VIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVDQVLQRGFRVVIASPDKDFKQLIS
        V  VFD  GG   R  + P YKA+R         Q            ++RD   N N P+++ +G+EADD++AT   +    G  VVI S DKD  QL+S
Subjt:  VIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVDQVLQRGFRVVIASPDKDFKQLIS

Query:  EDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLSAAAIRTVGRPYAQDALTKYADY
        E++++  PL    R  + T    + ++         +  ++GD  D +PG+    P  G KTA  L+ + GS+EN+ +  ++  V     ++ L    + 
Subjt:  EDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLSAAAIRTVGRPYAQDALTKYADY

Query:  LRTNYKVLALRRDVDVQFQ
           +++++ L  +VD+ FQ
Subjt:  LRTNYKVLALRRDVDVQFQ

Arabidopsis top hitse value%identityAlignment
AT1G16320.1 Uncharacterized conserved protein (DUF2358)8.5e-7072.84Show/hide
Query:  DDITFIDPLNTFSGIERYKLIFWALRFHARILFREIGIEVYRIWQPSENVILIRWNLKGVPRVPWEVRGEFQGTSRYKLDRNGKIYEHKVDNLAFNFPQQ
        DDITF+DP+NTF+G++ YK+IFWALRFH +ILFR+I +E++R+WQPSEN+ILIRWNLKGVPRVPWE +GEFQGTSRYKLDRNGKIYEHKVDNLAFNFPQQ
Subjt:  DDITFIDPLNTFSGIERYKLIFWALRFHARILFREIGIEVYRIWQPSENVILIRWNLKGVPRVPWEVRGEFQGTSRYKLDRNGKIYEHKVDNLAFNFPQQ

Query:  LKPAASVLDLVTACPA-SPNPTFLWGTEELHCSSWVELYQAVRTSVGGEGYLITQDGFLTCS
        LKPAASVLDLVTA PA SPNPTF +   + + SSWV+ YQAVR ++  E   +T D  +TCS
Subjt:  LKPAASVLDLVTACPA-SPNPTFLWGTEELHCSSWVELYQAVRTSVGGEGYLITQDGFLTCS

AT1G34380.1 5'-3' exonuclease family protein7.4e-6656.71Show/hide
Query:  KSNSWRTKALQLSAFPVASHS---TFKQTDGGTLQPIIEAD---------NPRKGRVFFLDVNPLCYEGSRPSLHNFGRWASIFFEQVSHSDPVIAVFDG
        K+   RTK +  S+   +SHS   TF +T  G +Q +++ D           +  RVFFLDV+PLCYEG++PS   FG W S+FF QVS +DPVIAV DG
Subjt:  KSNSWRTKALQLSAFPVASHS---TFKQTDGGTLQPIIEAD---------NPRKGRVFFLDVNPLCYEGSRPSLHNFGRWASIFFEQVSHSDPVIAVFDG

Query:  EGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVDQVLQRGFRVVIASPDKDFKQLISEDVQLVM
        E G++ RR LLPSYKAHR    +  +  RY    S+RP+Q + + LR CNVPVV+++GHEADDVVATL++Q +QRG+R VIASPDKDFKQLISE+VQ+V+
Subjt:  EGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVDQVLQRGFRVVIASPDKDFKQLISEDVQLVM

Query:  PLPELNRWSFYTLRHYLAQYNCDPCSDLSLR
        PL +L RWSFYTL+HY AQYNCDP SDLS R
Subjt:  PLPELNRWSFYTLRHYLAQYNCDPCSDLSLR

AT1G34380.2 5'-3' exonuclease family protein2.3e-11563.75Show/hide
Query:  KSNSWRTKALQLSAFPVASHS---TFKQTDGGTLQPIIEAD---------NPRKGRVFFLDVNPLCYEGSRPSLHNFGRWASIFFEQVSHSDPVIAVFDG
        K+   RTK +  S+   +SHS   TF +T  G +Q +++ D           +  RVFFLDV+PLCYEG++PS   FG W S+FF QVS +DPVIAV DG
Subjt:  KSNSWRTKALQLSAFPVASHS---TFKQTDGGTLQPIIEAD---------NPRKGRVFFLDVNPLCYEGSRPSLHNFGRWASIFFEQVSHSDPVIAVFDG

Query:  EGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVDQVLQRGFRVVIASPDKDFKQLISEDVQLVM
        E G++ RR LLPSYKAHR    +  +  RY    S+RP+Q + + LR CNVPVV+++GHEADDVVATL++Q +QRG+R VIASPDKDFKQLISE+VQ+V+
Subjt:  EGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVDQVLQRGFRVVIASPDKDFKQLISEDVQLVM

Query:  PLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLSAAAIRTVGRPYAQDALTKYADYLRTNYKV
        PL +L RWSFYTL+HY AQYNCDP SDLS RCIMGDEVDGVPGIQH+ P FGRKTAMKL++KHGSLE+LLSAAA+RTVGRPYAQ+ALTKYADYLR NY+V
Subjt:  PLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLSAAAIRTVGRPYAQDALTKYADYLRTNYKV

Query:  LALRRDVDVQFQEEWLVERDRQNDSAILSKF
        LAL RDV VQ Q EWL+ERD  NDS +LS F
Subjt:  LALRRDVDVQFQEEWLVERDRQNDSAILSKF

AT1G79510.1 Uncharacterized conserved protein (DUF2358)4.5e-7173.46Show/hide
Query:  DDITFIDPLNTFSGIERYKLIFWALRFHARILFREIGIEVYRIWQPSENVILIRWNLKGVPRVPWEVRGEFQGTSRYKLDRNGKIYEHKVDNLAFNFPQQ
        DDIT +DP+NTFSGI+ YKLIFWALRFH +ILFR+I +E++R+WQPSEN+ILIRWNLKGVPRVPWE +GEFQGTSRYKLDRNGKIYEHKVDNLAFNFP Q
Subjt:  DDITFIDPLNTFSGIERYKLIFWALRFHARILFREIGIEVYRIWQPSENVILIRWNLKGVPRVPWEVRGEFQGTSRYKLDRNGKIYEHKVDNLAFNFPQQ

Query:  LKPAASVLDLVTACPASPNPTFLWGTEELHCSSWVELYQAV-RTSVGGEGYLITQDGFLTCS
        LKPA SVLD+VTACPASPNPTF++G  + + SSW+E Y+AV RT    E  ++ QD F+ CS
Subjt:  LKPAASVLDLVTACPASPNPTFLWGTEELHCSSWVELYQAV-RTSVGGEGYLITQDGFLTCS

AT1G79510.2 Uncharacterized conserved protein (DUF2358)4.5e-7173.46Show/hide
Query:  DDITFIDPLNTFSGIERYKLIFWALRFHARILFREIGIEVYRIWQPSENVILIRWNLKGVPRVPWEVRGEFQGTSRYKLDRNGKIYEHKVDNLAFNFPQQ
        DDIT +DP+NTFSGI+ YKLIFWALRFH +ILFR+I +E++R+WQPSEN+ILIRWNLKGVPRVPWE +GEFQGTSRYKLDRNGKIYEHKVDNLAFNFP Q
Subjt:  DDITFIDPLNTFSGIERYKLIFWALRFHARILFREIGIEVYRIWQPSENVILIRWNLKGVPRVPWEVRGEFQGTSRYKLDRNGKIYEHKVDNLAFNFPQQ

Query:  LKPAASVLDLVTACPASPNPTFLWGTEELHCSSWVELYQAV-RTSVGGEGYLITQDGFLTCS
        LKPA SVLD+VTACPASPNPTF++G  + + SSW+E Y+AV RT    E  ++ QD F+ CS
Subjt:  LKPAASVLDLVTACPASPNPTFLWGTEELHCSSWVELYQAV-RTSVGGEGYLITQDGFLTCS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTGAGGCGGGCGCCAACATAGGCGTGAATATTCCGCCATTTCTGAACTCTGCATCGCGTAGTTCCTTACCCTCCAGAACTCTGAAACCAGAATCGACATTA
ACAACGAAATCCAATTCATGGAGAACGAAGGCGTTGCAGCTAAGTGCCTTTCCGGTAGCATCTCATTCTACCTTTAAACAAACAGATGGCGGGACGTTGCAGCCA
ATAATTGAAGCGGATAATCCAAGAAAGGGAAGGGTCTTTTTCCTGGACGTAAATCCATTATGCTACGAAGGTAGCAGACCTAGTTTGCACAATTTTGGTCGCTGG
GCTTCCATATTCTTCGAGCAAGTTAGCCACAGTGATCCTGTTATTGCTGTTTTTGATGGGGAAGGAGGTAGTGAGCATCGCAGGCTGTTATTACCCTCATATAAA
GCACATCGGATCAAATTCACGAGACAATCATCTTCACAAAGATATACTAAGGGAAATTCTAGAAGGCCATATCAAGTAATAAGGGATGCTCTCAGAAACTGTAAT
GTGCCAGTTGTAAAGGTTGACGGTCATGAAGCAGATGATGTTGTAGCTACACTTGTGGACCAAGTTTTACAAAGAGGGTTTCGGGTGGTTATAGCCTCTCCTGAT
AAAGATTTCAAGCAGTTGATTTCAGAAGATGTCCAACTCGTGATGCCTTTGCCAGAGCTCAACAGATGGTCCTTTTACACCTTAAGGCACTACCTAGCTCAGTAT
AACTGCGATCCATGCTCTGACTTGAGTCTGAGATGTATTATGGGTGATGAGGTAGATGGCGTTCCGGGAATCCAGCATGTTGCTCCTGGATTTGGTCGAAAAACT
GCAATGAAGCTCTTGAAGAAACATGGTTCTTTGGAGAATCTACTCAGTGCTGCTGCAATAAGAACTGTGGGCAGACCGTATGCACAAGATGCACTTACAAAGTAT
GCCGATTACCTGCGTACGAATTATAAAGTTCTAGCCCTGAGAAGAGATGTTGATGTTCAATTTCAAGAGGAGTGGTTGGTTGAAAGAGACAGACAAAACGATTCG
GCTATTTTATCCAAGTTTGTAGAAAACAATGACAGAAACTCACTTGTTCAACCATCAAAACGGGCAGCATCATCTGCTGTGGACGATATCACCTTTATTGATCCT
TTGAACACCTTCAGCGGGATCGAGAGGTACAAATTGATATTCTGGGCGTTGAGATTTCACGCCAGAATTCTCTTCCGGGAGATTGGGATTGAGGTGTACAGGATT
TGGCAGCCTTCGGAGAACGTTATACTGATCCGGTGGAACTTGAAGGGCGTTCCTCGGGTTCCATGGGAGGTGAGGGGCGAGTTTCAGGGCACTTCGCGATATAAA
CTGGATCGAAATGGGAAAATTTACGAACACAAGGTGGACAATTTAGCATTTAATTTCCCTCAGCAACTGAAACCAGCTGCATCAGTGTTGGATTTGGTTACTGCC
TGCCCTGCAAGCCCTAATCCAACATTCTTGTGGGGGACAGAGGAATTACATTGTTCTTCCTGGGTGGAGCTTTATCAGGCAGTGAGGACAAGTGTGGGTGGAGAA
GGCTATTTGATTACACAAGATGGATTTCTTACATGTTCATAG
mRNA sequenceShow/hide mRNA sequence
ATGGCTGAGGCGGGCGCCAACATAGGCGTGAATATTCCGCCATTTCTGAACTCTGCATCGCGTAGTTCCTTACCCTCCAGAACTCTGAAACCAGAATCGACATTA
ACAACGAAATCCAATTCATGGAGAACGAAGGCGTTGCAGCTAAGTGCCTTTCCGGTAGCATCTCATTCTACCTTTAAACAAACAGATGGCGGGACGTTGCAGCCA
ATAATTGAAGCGGATAATCCAAGAAAGGGAAGGGTCTTTTTCCTGGACGTAAATCCATTATGCTACGAAGGTAGCAGACCTAGTTTGCACAATTTTGGTCGCTGG
GCTTCCATATTCTTCGAGCAAGTTAGCCACAGTGATCCTGTTATTGCTGTTTTTGATGGGGAAGGAGGTAGTGAGCATCGCAGGCTGTTATTACCCTCATATAAA
GCACATCGGATCAAATTCACGAGACAATCATCTTCACAAAGATATACTAAGGGAAATTCTAGAAGGCCATATCAAGTAATAAGGGATGCTCTCAGAAACTGTAAT
GTGCCAGTTGTAAAGGTTGACGGTCATGAAGCAGATGATGTTGTAGCTACACTTGTGGACCAAGTTTTACAAAGAGGGTTTCGGGTGGTTATAGCCTCTCCTGAT
AAAGATTTCAAGCAGTTGATTTCAGAAGATGTCCAACTCGTGATGCCTTTGCCAGAGCTCAACAGATGGTCCTTTTACACCTTAAGGCACTACCTAGCTCAGTAT
AACTGCGATCCATGCTCTGACTTGAGTCTGAGATGTATTATGGGTGATGAGGTAGATGGCGTTCCGGGAATCCAGCATGTTGCTCCTGGATTTGGTCGAAAAACT
GCAATGAAGCTCTTGAAGAAACATGGTTCTTTGGAGAATCTACTCAGTGCTGCTGCAATAAGAACTGTGGGCAGACCGTATGCACAAGATGCACTTACAAAGTAT
GCCGATTACCTGCGTACGAATTATAAAGTTCTAGCCCTGAGAAGAGATGTTGATGTTCAATTTCAAGAGGAGTGGTTGGTTGAAAGAGACAGACAAAACGATTCG
GCTATTTTATCCAAGTTTGTAGAAAACAATGACAGAAACTCACTTGTTCAACCATCAAAACGGGCAGCATCATCTGCTGTGGACGATATCACCTTTATTGATCCT
TTGAACACCTTCAGCGGGATCGAGAGGTACAAATTGATATTCTGGGCGTTGAGATTTCACGCCAGAATTCTCTTCCGGGAGATTGGGATTGAGGTGTACAGGATT
TGGCAGCCTTCGGAGAACGTTATACTGATCCGGTGGAACTTGAAGGGCGTTCCTCGGGTTCCATGGGAGGTGAGGGGCGAGTTTCAGGGCACTTCGCGATATAAA
CTGGATCGAAATGGGAAAATTTACGAACACAAGGTGGACAATTTAGCATTTAATTTCCCTCAGCAACTGAAACCAGCTGCATCAGTGTTGGATTTGGTTACTGCC
TGCCCTGCAAGCCCTAATCCAACATTCTTGTGGGGGACAGAGGAATTACATTGTTCTTCCTGGGTGGAGCTTTATCAGGCAGTGAGGACAAGTGTGGGTGGAGAA
GGCTATTTGATTACACAAGATGGATTTCTTACATGTTCATAG
Protein sequenceShow/hide protein sequence
MAEAGANIGVNIPPFLNSASRSSLPSRTLKPESTLTTKSNSWRTKALQLSAFPVASHSTFKQTDGGTLQPIIEADNPRKGRVFFLDVNPLCYEGSRPSLHNFGRW
ASIFFEQVSHSDPVIAVFDGEGGSEHRRLLLPSYKAHRIKFTRQSSSQRYTKGNSRRPYQVIRDALRNCNVPVVKVDGHEADDVVATLVDQVLQRGFRVVIASPD
KDFKQLISEDVQLVMPLPELNRWSFYTLRHYLAQYNCDPCSDLSLRCIMGDEVDGVPGIQHVAPGFGRKTAMKLLKKHGSLENLLSAAAIRTVGRPYAQDALTKY
ADYLRTNYKVLALRRDVDVQFQEEWLVERDRQNDSAILSKFVENNDRNSLVQPSKRAASSAVDDITFIDPLNTFSGIERYKLIFWALRFHARILFREIGIEVYRI
WQPSENVILIRWNLKGVPRVPWEVRGEFQGTSRYKLDRNGKIYEHKVDNLAFNFPQQLKPAASVLDLVTACPASPNPTFLWGTEELHCSSWVELYQAVRTSVGGE
GYLITQDGFLTCS