; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc08g34400 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc08g34400
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionPolynucleotidyl transferase, ribonuclease H-like superfamily protein
Genome locationchr8:25139426..25144902
RNA-Seq ExpressionMoc08g34400
SyntenyMoc08g34400
Gene Ontology termsGO:0006139 - nucleobase-containing compound metabolic process (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008408 - 3'-5' exonuclease activity (molecular function)
GO:0043167 - ion binding (molecular function)
InterPro domainsIPR002562 - 3'-5' exonuclease domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022138041.1 uncharacterized protein LOC111009298 [Momordica charantia]4.0e-10779.85Show/hide
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGCSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP
        MCARKGA GIVKGPTSIKGWVRKWFYASGEWLAKDES              V+IRPVPELTQASFDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGCSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP

Query:  AVRPIESSRPNSELAMVCGFASGVKRKSKGRAHALEAAQSSKPPTPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEVVDAQTEAADAQTEAADAPPL
        AVRPIESSRPNSELAMVCGFAS VKRKSKG+AHALEAAQSSKP TPAVVGPASEDPAPVIELESS GPSREKRPRDQTE VD                PL
Subjt:  AVRPIESSRPNSELAMVCGFASGVKRKSKGRAHALEAAQSSKPPTPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEVVDAQTEAADAQTEAADAPPL

Query:  GEEAREEAPLKRRRKKKKAISPSEVGACRVLPAGWADRVDDPRARMGGTSDVTARFRIEPSSLGVREQ
        GEE REE PLKRRRKKKK  SP EVGA  VLPA +ADRVDDP ARMGGT DVT RFR+EPSS GVR+Q
Subjt:  GEEAREEAPLKRRRKKKKAISPSEVGACRVLPAGWADRVDDPRARMGGTSDVTARFRIEPSSLGVREQ

XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia]8.3e-14594.68Show/hide
Query:  MFEYGLRLPLHPFVQEFLFRTELAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
        MFEYGLRLPLHPFVQEFLFRT LAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTELAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR

Query:  KWFYASGEWLAKDESGCSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS
        KWFYASGEWLAKDESG SFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIE SRPNS LAMVC FAS
Subjt:  KWFYASGEWLAKDESGCSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS

Query:  GVKRKSKGRAHALEAAQSSKPPTPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEVVDAQTEAADAQTEAADAPPLGEEA
        GVKRKSKGRAHALEAAQSSKPPTPAVVGPASEDPAPVIELESSGGPSREKRPRD       QTEA DAQTEAAD PPLGE A
Subjt:  GVKRKSKGRAHALEAAQSSKPPTPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEVVDAQTEAADAQTEAADAPPLGEEA

XP_022150343.1 uncharacterized protein LOC111018538 [Momordica charantia]2.0e-13090.88Show/hide
Query:  GTSDVTARFRIEPSSLGVREQVTRISAASLDRCLRRASKFVSAPGSVLQMTIDYAAEAFVASIQSALVIKAELDGREALAAREKEEFSAALEAASSTMKD
        G   + A+ RIEPSS GVR+QV+RISAASLDRCLRRASKFVSAPGSVLQ TIDYAAEAFVASIQSAL +KAELDGRE LAAREKEEFSAALE ASSTMKD
Subjt:  GTSDVTARFRIEPSSLGVREQVTRISAASLDRCLRRASKFVSAPGSVLQMTIDYAAEAFVASIQSALVIKAELDGREALAAREKEEFSAALEAASSTMKD

Query:  ELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATTELETAKERLSNGVLLEESFRQHPDFD
        ELLKAHSEVETLKAEVESQAELLKKEEDRR+AQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHAT ELETAKERLSNGVLLEE+FRQHPDFD
Subjt:  ELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATTELETAKERLSNGVLLEESFRQHPDFD

Query:  GFAKDFSDAGFKFLMKGIASDMPDLQIDFSGLKRWYAEKWASGPSGTPGPQALVDQYVRVLDSDYSDPEEDQVDSNQEGAPPAGS
        GFAKDFSDAGFKFLMKGIASDMPDLQID SGLKR YAEKWASGP GTPGPQALVDQYVR LDSDYSDPEEDQV S QEGA P GS
Subjt:  GFAKDFSDAGFKFLMKGIASDMPDLQIDFSGLKRWYAEKWASGPSGTPGPQALVDQYVRVLDSDYSDPEEDQVDSNQEGAPPAGS

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]1.7e-19096.6Show/hide
Query:  MSSSISSNLGSDLARRLESELEEIENFRISDDGEDSDASTSGQGLEYPSRIPEHYLRSLRRGFAIPENIFLRLPEEGERADHPPEGWVTLYFKMFEYGLR
        MSSSISSNL SDLARRLES+LEEIEN RISDDGEDSDASTSGQGLEYPSRIPEHYL SLRRGFAIPENI LRLPEEGERAD+PPEGWVTLYFKMFEYGLR
Subjt:  MSSSISSNLGSDLARRLESELEEIENFRISDDGEDSDASTSGQGLEYPSRIPEHYLRSLRRGFAIPENIFLRLPEEGERADHPPEGWVTLYFKMFEYGLR

Query:  LPLHPFVQEFLFRTELAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASG
        LPLHPFVQEFLFRT LAPAQVAPNGWGVIFALAILFWLRARDSEEAEL DVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASG
Subjt:  LPLHPFVQEFLFRTELAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASG

Query:  EWLAKDESGCSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASGVKRKSK
        EWLAKDESG SFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASGVKRKSK
Subjt:  EWLAKDESGCSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASGVKRKSK

Query:  GRAHALEAAQSSKPPTPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEVVD
        GRAHALEAAQSSKP TPAVVGPASEDPA VIELESSGGPSREKRPRDQTE VD
Subjt:  GRAHALEAAQSSKPPTPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEVVD

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]4.8e-18567.86Show/hide
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGCSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP
        MCARKG GGIVKGPTSIKGWV KWF+ASGEWLAKDESG +FFDVPTRFGNLVSI+ +PEL QA+FDTLK+YK+ FPR RK+ TLVTD+LLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGCSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP

Query:  AVRPIESSRPNSELAMVCGFASGVKRKSKGRAHALEAAQSSKPPTPAV--------VGPASEDPAPVIELESSGGPSREKRPRDQTEVVDAQTEAADAQT
         VR IE+SRPNSELAMVCGF   VKRKSKGRAHAL+    ++P TP V         GP+S  P PVIEL+ SGG S EKR R+              ++
Subjt:  AVRPIESSRPNSELAMVCGFASGVKRKSKGRAHALEAAQSSKPPTPAV--------VGPASEDPAPVIELESSGGPSREKRPRDQTEVVDAQTEAADAQT

Query:  EAADAPPLGEEAREEAPLKRRRKKKKAISPSEVGACRVLPAGWADRVDDPRARMGGTSDVTARFRIEPSSLGVREQVTRISAASLDRCLRRASKFVSAPG
        EA D  PL  E R E+PL+RRRKKKK  S SE GA   LP   AD VDDP ARM GTS+V  RF +EPSS GV++QV+RISA  LDR LRRASKFVS PG
Subjt:  EAADAPPLGEEAREEAPLKRRRKKKKAISPSEVGACRVLPAGWADRVDDPRARMGGTSDVTARFRIEPSSLGVREQVTRISAASLDRCLRRASKFVSAPG

Query:  SVLQMTIDYAAEAFVASIQSALVIKAELDGREALAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLE
        SVLQ TID  AEAF+ASI  A+++KAELDGREALAA+E+E   AALEAA +T+K ELLKA  EV+ L+AEV+++ +LLKKE ++ KA LRAAHAIT+GLE
Subjt:  SVLQMTIDYAAEAFVASIQSALVIKAELDGREALAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLE

Query:  REKFQLLKEKDDMLQALEAKDKELEHATTELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDFSGLKRWYAEKWASGPS
        +EKFQLLKEKDD+ Q LE KD  +   TTEL+  KERL+NG LLEESFRQHPDFDGFAKDFSDAGFKFLMKGIA+DMP LQID +GLK+ Y+EKWASGP+
Subjt:  REKFQLLKEKDDMLQALEAKDKELEHATTELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDFSGLKRWYAEKWASGPS

Query:  GTPGPQALVDQYVRVLDSDYSDPEEDQVDSNQ
        GTP PQ+LVD+YVR LDSDYSD EE+   S +
Subjt:  GTPGPQALVDQYVRVLDSDYSDPEEDQVDSNQ

TrEMBL top hitse value%identityAlignment
A0A6J1C8K9 uncharacterized protein LOC1110092981.9e-10779.85Show/hide
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGCSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP
        MCARKGA GIVKGPTSIKGWVRKWFYASGEWLAKDES              V+IRPVPELTQASFDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGCSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP

Query:  AVRPIESSRPNSELAMVCGFASGVKRKSKGRAHALEAAQSSKPPTPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEVVDAQTEAADAQTEAADAPPL
        AVRPIESSRPNSELAMVCGFAS VKRKSKG+AHALEAAQSSKP TPAVVGPASEDPAPVIELESS GPSREKRPRDQTE VD                PL
Subjt:  AVRPIESSRPNSELAMVCGFASGVKRKSKGRAHALEAAQSSKPPTPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEVVDAQTEAADAQTEAADAPPL

Query:  GEEAREEAPLKRRRKKKKAISPSEVGACRVLPAGWADRVDDPRARMGGTSDVTARFRIEPSSLGVREQ
        GEE REE PLKRRRKKKK  SP EVGA  VLPA +ADRVDDP ARMGGT DVT RFR+EPSS GVR+Q
Subjt:  GEEAREEAPLKRRRKKKKAISPSEVGACRVLPAGWADRVDDPRARMGGTSDVTARFRIEPSSLGVREQ

A0A6J1CR42 uncharacterized protein LOC1110138264.0e-14594.68Show/hide
Query:  MFEYGLRLPLHPFVQEFLFRTELAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
        MFEYGLRLPLHPFVQEFLFRT LAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTELAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR

Query:  KWFYASGEWLAKDESGCSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS
        KWFYASGEWLAKDESG SFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIE SRPNS LAMVC FAS
Subjt:  KWFYASGEWLAKDESGCSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS

Query:  GVKRKSKGRAHALEAAQSSKPPTPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEVVDAQTEAADAQTEAADAPPLGEEA
        GVKRKSKGRAHALEAAQSSKPPTPAVVGPASEDPAPVIELESSGGPSREKRPRD       QTEA DAQTEAAD PPLGE A
Subjt:  GVKRKSKGRAHALEAAQSSKPPTPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEVVDAQTEAADAQTEAADAPPLGEEA

A0A6J1D971 uncharacterized protein LOC1110185389.6e-13190.88Show/hide
Query:  GTSDVTARFRIEPSSLGVREQVTRISAASLDRCLRRASKFVSAPGSVLQMTIDYAAEAFVASIQSALVIKAELDGREALAAREKEEFSAALEAASSTMKD
        G   + A+ RIEPSS GVR+QV+RISAASLDRCLRRASKFVSAPGSVLQ TIDYAAEAFVASIQSAL +KAELDGRE LAAREKEEFSAALE ASSTMKD
Subjt:  GTSDVTARFRIEPSSLGVREQVTRISAASLDRCLRRASKFVSAPGSVLQMTIDYAAEAFVASIQSALVIKAELDGREALAAREKEEFSAALEAASSTMKD

Query:  ELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATTELETAKERLSNGVLLEESFRQHPDFD
        ELLKAHSEVETLKAEVESQAELLKKEEDRR+AQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHAT ELETAKERLSNGVLLEE+FRQHPDFD
Subjt:  ELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATTELETAKERLSNGVLLEESFRQHPDFD

Query:  GFAKDFSDAGFKFLMKGIASDMPDLQIDFSGLKRWYAEKWASGPSGTPGPQALVDQYVRVLDSDYSDPEEDQVDSNQEGAPPAGS
        GFAKDFSDAGFKFLMKGIASDMPDLQID SGLKR YAEKWASGP GTPGPQALVDQYVR LDSDYSDPEEDQV S QEGA P GS
Subjt:  GFAKDFSDAGFKFLMKGIASDMPDLQIDFSGLKRWYAEKWASGPSGTPGPQALVDQYVRVLDSDYSDPEEDQVDSNQEGAPPAGS

A0A6J1DXS5 uncharacterized protein LOC1110255028.2e-19196.6Show/hide
Query:  MSSSISSNLGSDLARRLESELEEIENFRISDDGEDSDASTSGQGLEYPSRIPEHYLRSLRRGFAIPENIFLRLPEEGERADHPPEGWVTLYFKMFEYGLR
        MSSSISSNL SDLARRLES+LEEIEN RISDDGEDSDASTSGQGLEYPSRIPEHYL SLRRGFAIPENI LRLPEEGERAD+PPEGWVTLYFKMFEYGLR
Subjt:  MSSSISSNLGSDLARRLESELEEIENFRISDDGEDSDASTSGQGLEYPSRIPEHYLRSLRRGFAIPENIFLRLPEEGERADHPPEGWVTLYFKMFEYGLR

Query:  LPLHPFVQEFLFRTELAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASG
        LPLHPFVQEFLFRT LAPAQVAPNGWGVIFALAILFWLRARDSEEAEL DVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASG
Subjt:  LPLHPFVQEFLFRTELAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASG

Query:  EWLAKDESGCSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASGVKRKSK
        EWLAKDESG SFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASGVKRKSK
Subjt:  EWLAKDESGCSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASGVKRKSK

Query:  GRAHALEAAQSSKPPTPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEVVD
        GRAHALEAAQSSKP TPAVVGPASEDPA VIELESSGGPSREKRPRDQTE VD
Subjt:  GRAHALEAAQSSKPPTPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEVVD

A0A6J1DZB3 uncharacterized protein LOC1110256652.3e-18567.86Show/hide
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGCSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP
        MCARKG GGIVKGPTSIKGWV KWF+ASGEWLAKDESG +FFDVPTRFGNLVSI+ +PEL QA+FDTLK+YK+ FPR RK+ TLVTD+LLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGCSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP

Query:  AVRPIESSRPNSELAMVCGFASGVKRKSKGRAHALEAAQSSKPPTPAV--------VGPASEDPAPVIELESSGGPSREKRPRDQTEVVDAQTEAADAQT
         VR IE+SRPNSELAMVCGF   VKRKSKGRAHAL+    ++P TP V         GP+S  P PVIEL+ SGG S EKR R+              ++
Subjt:  AVRPIESSRPNSELAMVCGFASGVKRKSKGRAHALEAAQSSKPPTPAV--------VGPASEDPAPVIELESSGGPSREKRPRDQTEVVDAQTEAADAQT

Query:  EAADAPPLGEEAREEAPLKRRRKKKKAISPSEVGACRVLPAGWADRVDDPRARMGGTSDVTARFRIEPSSLGVREQVTRISAASLDRCLRRASKFVSAPG
        EA D  PL  E R E+PL+RRRKKKK  S SE GA   LP   AD VDDP ARM GTS+V  RF +EPSS GV++QV+RISA  LDR LRRASKFVS PG
Subjt:  EAADAPPLGEEAREEAPLKRRRKKKKAISPSEVGACRVLPAGWADRVDDPRARMGGTSDVTARFRIEPSSLGVREQVTRISAASLDRCLRRASKFVSAPG

Query:  SVLQMTIDYAAEAFVASIQSALVIKAELDGREALAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLE
        SVLQ TID  AEAF+ASI  A+++KAELDGREALAA+E+E   AALEAA +T+K ELLKA  EV+ L+AEV+++ +LLKKE ++ KA LRAAHAIT+GLE
Subjt:  SVLQMTIDYAAEAFVASIQSALVIKAELDGREALAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLE

Query:  REKFQLLKEKDDMLQALEAKDKELEHATTELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDFSGLKRWYAEKWASGPS
        +EKFQLLKEKDD+ Q LE KD  +   TTEL+  KERL+NG LLEESFRQHPDFDGFAKDFSDAGFKFLMKGIA+DMP LQID +GLK+ Y+EKWASGP+
Subjt:  REKFQLLKEKDDMLQALEAKDKELEHATTELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDFSGLKRWYAEKWASGPS

Query:  GTPGPQALVDQYVRVLDSDYSDPEEDQVDSNQ
        GTP PQ+LVD+YVR LDSDYSD EE+   S +
Subjt:  GTPGPQALVDQYVRVLDSDYSDPEEDQVDSNQ

SwissProt top hitse value%identityAlignment
Q8VEG4 Exonuclease 3'-5' domain-containing protein 23.8e-0727.83Show/hide
Query:  LDTEWRQNPDPGGHQPVAILQL------CVDCRCLVFQFFHADAIPLSLFHLLANTPWTFCGVGVGQDRDKLFEDWGLRVSRTMDVAKMAAKKFREREMK
        +D EW      G   P+++LQ+      C   R L    +    +P +L  +LA+      GVG  +D +KL +D+GL V   +D+  +A K+       
Subjt:  LDTEWRQNPDPGGHQPVAILQL------CVDCRCLVFQFFHADAIPLSLFHLLANTPWTFCGVGVGQDRDKLFEDWGLRVSRTMDVAKMAAKKFREREMK

Query:  RQGLKSLMLFFTDTYMEKPKHITLSQWDAKELSFAQIKYACIDAYASYVLGLKF--YDFFNHIYGKPS---LFWHPHLEPYKNSSGRP-RPRPRRRYPHD
           LKSL     +  ++K   +  S WDA+ L+  Q+ YA  DA  S  L L    Y F    Y + S   + W   LE  +N    P R +   R   +
Subjt:  RQGLKSLMLFFTDTYMEKPKHITLSQWDAKELSFAQIKYACIDAYASYVLGLKF--YDFFNHIYGKPS---LFWHPHLEPYKNSSGRP-RPRPRRRYPHD

Query:  RDGESSSCRVFP
         +GE+   ++ P
Subjt:  RDGESSSCRVFP

Arabidopsis top hitse value%identityAlignment
AT2G36110.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein7.5e-1134.42Show/hide
Query:  LDTEWRQNPDPGGHQPVA-ILQLCVDCRCLVFQFFHADAIPLSLFHLLANTPWTFCGVGVGQDRDKL------FEDWGLRVSRTMDVAKMAAKKFREREM
        LD +W     PGG  P   ILQLCV  RCL+ Q  H   IP  L   L +   TF GV   QD+ KL       E W L   R     ++    F +   
Subjt:  LDTEWRQNPDPGGHQPVA-ILQLCVDCRCLVFQFFHADAIPLSLFHLLANTPWTFCGVGVGQDRDKL------FEDWGLRVSRTMDVAKMAAKKFREREM

Query:  KRQGLKSLMLFFTDTYMEKPKHITLSQWDAKELSFAQIKYACIDAYASYVLGLK
        +  G K          + K K I +S W A+ LS  QI  A  D Y    LG+K
Subjt:  KRQGLKSLMLFFTDTYMEKPKHITLSQWDAKELSFAQIKYACIDAYASYVLGLK

AT3G12410.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein2.2e-1031.85Show/hide
Query:  PVAILQLCVDCRCLVFQFFHADAIPLSLFHLLANTPWTFCGVGVGQDRDKLFE-DWGLRVSRTMDVAKMAAKKFREREMKRQGLKSLM---LFFTDTYME
        P  ILQLCV  RCL+ Q  + D +P +L   LA+   TF GV  GQD  KL      L +   +D+ +     +  R M+R   + ++   + +    ++
Subjt:  PVAILQLCVDCRCLVFQFFHADAIPLSLFHLLANTPWTFCGVGVGQDRDKLFE-DWGLRVSRTMDVAKMAAKKFREREMKRQGLKSLM---LFFTDTYME

Query:  KPKHITLSQWDAKELSFAQIKYACIDAYASYVLGL
            I++S W A +L   QI  A +DAY  + LG+
Subjt:  KPKHITLSQWDAKELSFAQIKYACIDAYASYVLGL

AT3G12430.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein3.5e-0829.32Show/hide
Query:  PVAILQLCVDCRCLVFQFFHADAIPLSLFHLLANTPWTFCGVGVGQDRDKLFED-WGLRVSRTMDVAKMAAKKFREREMKRQGLKSLM-LFFTDTYMEKP
        P   LQLCV  RC++ Q  H + +P  L + LA+  +TF G+   QD  KL      L ++  +D+ K  +     R MKR   + ++      + +   
Subjt:  PVAILQLCVDCRCLVFQFFHADAIPLSLFHLLANTPWTFCGVGVGQDRDKLFED-WGLRVSRTMDVAKMAAKKFREREMKRQGLKSLM-LFFTDTYMEKP

Query:  KHITLSQWDAKELSFAQIKYACIDAYASYVLGL
        + ++ S W   +L + QI  A ID YA   L +
Subjt:  KHITLSQWDAKELSFAQIKYACIDAYASYVLGL

AT3G12440.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein5.9e-0828.38Show/hide
Query:  NPDPGGHQPVAILQLCVDCRCLVFQFFHADAIPLSLFHLLANTPWTFCGVGVGQDRDKLFEDWGLRVSRTMDVAKMAAKKFREREMKRQGLKSL------
        +P      P   LQLCV  RC++ Q F+ + +P  L   L +   TF G    QD  KL      R    +++A++   +    + + +GLK        
Subjt:  NPDPGGHQPVAILQLCVDCRCLVFQFFHADAIPLSLFHLLANTPWTFCGVGVGQDRDKLFEDWGLRVSRTMDVAKMAAKKFREREMKRQGLKSL------

Query:  --MLFFTDTYMEKPKHITLSQWDAKELSFAQIKYACIDAYASYVLGLK
           L +    +EK   I +S W   +LS+ QI  A ID Y    LG +
Subjt:  --MLFFTDTYMEKPKHITLSQWDAKELSFAQIKYACIDAYASYVLGLK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATCTCGGAAACCTACCGACTCCACAGTCGAAAAATCCACAAGCTTGTCGTCACCTCGACACCGAATGGCGCCAAAATCCCGACCCCGGCGGGCACCAACCCGTCGC
GATCCTCCAACTCTGCGTGGATTGCCGCTGCCTTGTTTTCCAATTCTTCCATGCTGATGCCATTCCTCTCTCCCTCTTCCACCTCCTCGCCAACACGCCGTGGACATTCT
GCGGGGTCGGCGTCGGGCAGGACCGGGACAAGCTGTTCGAGGATTGGGGGTTGAGGGTTTCACGTACGATGGATGTCGCGAAGATGGCGGCGAAGAAATTTAGAGAAAGG
GAGATGAAGAGACAAGGGTTAAAGAGTTTGATGCTTTTCTTCACTGATACATACATGGAGAAACCAAAGCATATAACCTTGAGTCAATGGGATGCCAAGGAGCTGAGTTT
TGCACAAATTAAATATGCATGCATTGATGCTTATGCTTCTTATGTTTTGGGTTTGAAGTTTTATGACTTCTTTAACCATATCTATGGCAAGCCCTCGTTGTTTTGGCATC
CACACTTGGAACCCTATAAGAATTCCTCGGGTCGTCCTCGTCCTCGTCCTCGTCGTCGTTATCCTCATGATCGTGATGGAGAATCATCATCTTGTAGAGTATTCCCTTCC
CCAAACATTGGCCCCCTCTCTGTCTGGTCCGATCTCGACCTGGCAGAGAAGTTCATTCGACTTGCTTTGGACACGTGGCGACTTCCTATTCGTGGGAAAATACAACCGTC
GCGGAAGATTTATCGTCGGAATATTCAAATATTCCGACGCTTCAGATCTCAGAGAGGATCCCAGCCGCTCGTTGATTACACGTCTCGAACCCTTGGTAGGTCGGTCTCTT
CCCTCTCTCTTTCGAACATAATTGCCATGTCGTCCTCTATTAGCAGCAACCTAGGATCCGATTTAGCTCGTAGGTTAGAGTCTGAGCTCGAGGAGATAGAAAACTTTAGA
ATCTCCGATGACGGGGAGGATAGCGACGCCTCCACTTCAGGTCAGGGTTTGGAATACCCTTCTAGGATACCTGAGCACTACCTCAGATCCCTTCGTAGGGGGTTCGCTAT
CCCTGAGAACATCTTCCTCAGGCTTCCGGAGGAGGGGGAGAGAGCTGACCATCCTCCAGAGGGATGGGTCACTCTCTACTTCAAAATGTTTGAGTACGGCCTCAGACTTC
CCCTTCACCCTTTTGTCCAAGAATTTCTCTTCCGGACTGAGTTGGCTCCGGCTCAAGTGGCCCCCAATGGGTGGGGTGTCATTTTCGCTTTGGCCATCCTCTTTTGGCTT
CGAGCTCGGGATAGTGAGGAGGCCGAGCTGTTGGACGTAGACCAGCTCCTCGCGTGCTTCGAGGCGAAAAGGATAGCTAAGAAGCCTGGTCGGTTCTATATGTGCGCAAG
GAAAGGCGCAGGCGGTATAGTTAAGGGGCCGACCTCCATCAAGGGATGGGTGAGGAAGTGGTTCTACGCTTCGGGGGAATGGCTCGCAAAGGACGAGTCAGGTTGTTCCT
TCTTTGACGTCCCCACTAGGTTTGGGAACCTAGTTTCAATCCGACCAGTCCCCGAGCTTACGCAGGCCTCCTTCGATACTCTGAAATACTACAAGGAGCGCTTTCCGAGG
GGTAGGAAGGTCGGAACCCTGGTGACTGACGAACTGCTGCTTGAGTCCGGGCTGCTAGATTACAACCCTGCAGTTCGTCCCATCGAATCCTCAAGGCCGAACTCTGAACT
TGCCATGGTTTGCGGATTTGCAAGCGGCGTGAAGCGCAAGTCTAAGGGCCGAGCCCATGCTCTTGAGGCTGCCCAGAGTTCGAAACCTCCCACTCCTGCCGTGGTAGGGC
CTGCCTCGGAAGATCCAGCCCCGGTGATCGAGCTGGAGTCTTCTGGGGGTCCCTCGAGGGAGAAGCGCCCCAGGGATCAGACCGAGGTGGTGGACGCCCAGACCGAGGCG
GCGGACGCCCAGACCGAGGCGGCGGACGCCCCGCCTTTGGGCGAGGAGGCGAGGGAGGAAGCCCCTCTAAAGCGAAGAAGGAAGAAAAAGAAGGCGATCTCTCCCTCGGA
GGTCGGAGCTTGCAGGGTCTTGCCTGCAGGTTGGGCTGATCGGGTGGACGATCCTAGGGCCAGGATGGGCGGGACGTCCGATGTGACGGCGCGGTTCAGAATTGAGCCGT
CAAGTCTCGGGGTGAGGGAGCAGGTGACCCGCATCTCAGCTGCGAGTTTGGACCGCTGCCTAAGGAGGGCGTCCAAATTTGTGAGCGCCCCTGGGTCCGTTCTGCAGATG
ACCATTGACTACGCCGCCGAGGCGTTTGTTGCTTCCATTCAATCGGCTCTGGTTATAAAGGCCGAGCTGGATGGGAGGGAAGCTTTGGCAGCGAGGGAGAAAGAGGAGTT
CTCCGCTGCCTTGGAGGCTGCTTCCTCCACCATGAAGGATGAGCTGCTGAAGGCTCACTCCGAGGTGGAGACTTTGAAGGCCGAGGTGGAGTCTCAGGCCGAGCTACTGA
AGAAGGAGGAGGACAGGCGCAAGGCTCAACTCCGAGCTGCCCACGCCATCACCAGGGGCTTGGAGAGGGAGAAGTTCCAGCTCCTGAAGGAGAAGGATGACATGCTCCAG
GCGCTCGAAGCGAAGGATAAGGAGTTGGAGCATGCGACTACCGAGCTGGAGACGGCGAAGGAGCGCCTCAGCAATGGAGTCCTGCTGGAGGAATCGTTTAGGCAACATCC
TGACTTCGATGGATTTGCCAAAGACTTTTCTGACGCGGGCTTCAAGTTCCTCATGAAGGGCATTGCTTCCGACATGCCCGACCTTCAGATCGATTTCAGTGGTCTGAAAA
GGTGGTATGCCGAGAAGTGGGCGTCTGGGCCTAGCGGCACCCCTGGCCCCCAAGCGTTGGTGGATCAGTATGTCAGGGTTCTGGACTCTGACTACTCCGATCCCGAAGAG
GACCAGGTCGACTCCAATCAGGAGGGCGCTCCCCCAGCAGGCTCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGATCTCGGAAACCTACCGACTCCACAGTCGAAAAATCCACAAGCTTGTCGTCACCTCGACACCGAATGGCGCCAAAATCCCGACCCCGGCGGGCACCAACCCGTCGC
GATCCTCCAACTCTGCGTGGATTGCCGCTGCCTTGTTTTCCAATTCTTCCATGCTGATGCCATTCCTCTCTCCCTCTTCCACCTCCTCGCCAACACGCCGTGGACATTCT
GCGGGGTCGGCGTCGGGCAGGACCGGGACAAGCTGTTCGAGGATTGGGGGTTGAGGGTTTCACGTACGATGGATGTCGCGAAGATGGCGGCGAAGAAATTTAGAGAAAGG
GAGATGAAGAGACAAGGGTTAAAGAGTTTGATGCTTTTCTTCACTGATACATACATGGAGAAACCAAAGCATATAACCTTGAGTCAATGGGATGCCAAGGAGCTGAGTTT
TGCACAAATTAAATATGCATGCATTGATGCTTATGCTTCTTATGTTTTGGGTTTGAAGTTTTATGACTTCTTTAACCATATCTATGGCAAGCCCTCGTTGTTTTGGCATC
CACACTTGGAACCCTATAAGAATTCCTCGGGTCGTCCTCGTCCTCGTCCTCGTCGTCGTTATCCTCATGATCGTGATGGAGAATCATCATCTTGTAGAGTATTCCCTTCC
CCAAACATTGGCCCCCTCTCTGTCTGGTCCGATCTCGACCTGGCAGAGAAGTTCATTCGACTTGCTTTGGACACGTGGCGACTTCCTATTCGTGGGAAAATACAACCGTC
GCGGAAGATTTATCGTCGGAATATTCAAATATTCCGACGCTTCAGATCTCAGAGAGGATCCCAGCCGCTCGTTGATTACACGTCTCGAACCCTTGGTAGGTCGGTCTCTT
CCCTCTCTCTTTCGAACATAATTGCCATGTCGTCCTCTATTAGCAGCAACCTAGGATCCGATTTAGCTCGTAGGTTAGAGTCTGAGCTCGAGGAGATAGAAAACTTTAGA
ATCTCCGATGACGGGGAGGATAGCGACGCCTCCACTTCAGGTCAGGGTTTGGAATACCCTTCTAGGATACCTGAGCACTACCTCAGATCCCTTCGTAGGGGGTTCGCTAT
CCCTGAGAACATCTTCCTCAGGCTTCCGGAGGAGGGGGAGAGAGCTGACCATCCTCCAGAGGGATGGGTCACTCTCTACTTCAAAATGTTTGAGTACGGCCTCAGACTTC
CCCTTCACCCTTTTGTCCAAGAATTTCTCTTCCGGACTGAGTTGGCTCCGGCTCAAGTGGCCCCCAATGGGTGGGGTGTCATTTTCGCTTTGGCCATCCTCTTTTGGCTT
CGAGCTCGGGATAGTGAGGAGGCCGAGCTGTTGGACGTAGACCAGCTCCTCGCGTGCTTCGAGGCGAAAAGGATAGCTAAGAAGCCTGGTCGGTTCTATATGTGCGCAAG
GAAAGGCGCAGGCGGTATAGTTAAGGGGCCGACCTCCATCAAGGGATGGGTGAGGAAGTGGTTCTACGCTTCGGGGGAATGGCTCGCAAAGGACGAGTCAGGTTGTTCCT
TCTTTGACGTCCCCACTAGGTTTGGGAACCTAGTTTCAATCCGACCAGTCCCCGAGCTTACGCAGGCCTCCTTCGATACTCTGAAATACTACAAGGAGCGCTTTCCGAGG
GGTAGGAAGGTCGGAACCCTGGTGACTGACGAACTGCTGCTTGAGTCCGGGCTGCTAGATTACAACCCTGCAGTTCGTCCCATCGAATCCTCAAGGCCGAACTCTGAACT
TGCCATGGTTTGCGGATTTGCAAGCGGCGTGAAGCGCAAGTCTAAGGGCCGAGCCCATGCTCTTGAGGCTGCCCAGAGTTCGAAACCTCCCACTCCTGCCGTGGTAGGGC
CTGCCTCGGAAGATCCAGCCCCGGTGATCGAGCTGGAGTCTTCTGGGGGTCCCTCGAGGGAGAAGCGCCCCAGGGATCAGACCGAGGTGGTGGACGCCCAGACCGAGGCG
GCGGACGCCCAGACCGAGGCGGCGGACGCCCCGCCTTTGGGCGAGGAGGCGAGGGAGGAAGCCCCTCTAAAGCGAAGAAGGAAGAAAAAGAAGGCGATCTCTCCCTCGGA
GGTCGGAGCTTGCAGGGTCTTGCCTGCAGGTTGGGCTGATCGGGTGGACGATCCTAGGGCCAGGATGGGCGGGACGTCCGATGTGACGGCGCGGTTCAGAATTGAGCCGT
CAAGTCTCGGGGTGAGGGAGCAGGTGACCCGCATCTCAGCTGCGAGTTTGGACCGCTGCCTAAGGAGGGCGTCCAAATTTGTGAGCGCCCCTGGGTCCGTTCTGCAGATG
ACCATTGACTACGCCGCCGAGGCGTTTGTTGCTTCCATTCAATCGGCTCTGGTTATAAAGGCCGAGCTGGATGGGAGGGAAGCTTTGGCAGCGAGGGAGAAAGAGGAGTT
CTCCGCTGCCTTGGAGGCTGCTTCCTCCACCATGAAGGATGAGCTGCTGAAGGCTCACTCCGAGGTGGAGACTTTGAAGGCCGAGGTGGAGTCTCAGGCCGAGCTACTGA
AGAAGGAGGAGGACAGGCGCAAGGCTCAACTCCGAGCTGCCCACGCCATCACCAGGGGCTTGGAGAGGGAGAAGTTCCAGCTCCTGAAGGAGAAGGATGACATGCTCCAG
GCGCTCGAAGCGAAGGATAAGGAGTTGGAGCATGCGACTACCGAGCTGGAGACGGCGAAGGAGCGCCTCAGCAATGGAGTCCTGCTGGAGGAATCGTTTAGGCAACATCC
TGACTTCGATGGATTTGCCAAAGACTTTTCTGACGCGGGCTTCAAGTTCCTCATGAAGGGCATTGCTTCCGACATGCCCGACCTTCAGATCGATTTCAGTGGTCTGAAAA
GGTGGTATGCCGAGAAGTGGGCGTCTGGGCCTAGCGGCACCCCTGGCCCCCAAGCGTTGGTGGATCAGTATGTCAGGGTTCTGGACTCTGACTACTCCGATCCCGAAGAG
GACCAGGTCGACTCCAATCAGGAGGGCGCTCCCCCAGCAGGCTCTTAG
Protein sequenceShow/hide protein sequence
MDLGNLPTPQSKNPQACRHLDTEWRQNPDPGGHQPVAILQLCVDCRCLVFQFFHADAIPLSLFHLLANTPWTFCGVGVGQDRDKLFEDWGLRVSRTMDVAKMAAKKFRER
EMKRQGLKSLMLFFTDTYMEKPKHITLSQWDAKELSFAQIKYACIDAYASYVLGLKFYDFFNHIYGKPSLFWHPHLEPYKNSSGRPRPRPRRRYPHDRDGESSSCRVFPS
PNIGPLSVWSDLDLAEKFIRLALDTWRLPIRGKIQPSRKIYRRNIQIFRRFRSQRGSQPLVDYTSRTLGRSVSSLSLSNIIAMSSSISSNLGSDLARRLESELEEIENFR
ISDDGEDSDASTSGQGLEYPSRIPEHYLRSLRRGFAIPENIFLRLPEEGERADHPPEGWVTLYFKMFEYGLRLPLHPFVQEFLFRTELAPAQVAPNGWGVIFALAILFWL
RARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGCSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPR
GRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASGVKRKSKGRAHALEAAQSSKPPTPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEVVDAQTEA
ADAQTEAADAPPLGEEAREEAPLKRRRKKKKAISPSEVGACRVLPAGWADRVDDPRARMGGTSDVTARFRIEPSSLGVREQVTRISAASLDRCLRRASKFVSAPGSVLQM
TIDYAAEAFVASIQSALVIKAELDGREALAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQ
ALEAKDKELEHATTELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDFSGLKRWYAEKWASGPSGTPGPQALVDQYVRVLDSDYSDPEE
DQVDSNQEGAPPAGS