CuGenDBv2

MS019456 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS019456
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: DNA-directed DNA polymerase, family B, conserved site
Genome location: scaffold28:636448..636966
RNA-Seq Expression: MS019456
Synteny: MS019456
Gene Ontology terms: NA
InterPro domains: NA
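
The genome location field can be converted into a span length with a short sketch like the one below. It assumes the coordinates are 1-based and inclusive, which this page does not state, and `parse_location` and `span_length` are hypothetical helper names, not part of any CuGenDB interface.

```python
import re


def parse_location(location):
    """Split a 'scaffold:start..end' string into (scaffold, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    scaffold, start, end = match.groups()
    return scaffold, int(start), int(end)


def span_length(location):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    _, start, end = parse_location(location)
    return abs(end - start) + 1


print(span_length("scaffold28:636448..636966"))  # 519
```

Under that assumption the span is 519 bp, which corresponds to 173 codons.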


Homology
GenBank top hits
XP_022935266.1 uncharacterized protein LOC111442204 [Cucurbita moschata]; e value: 2.3e-34; identity: 55.31%
Query:  MDGHLNKIATALIAVFSLTFLALLAQILYVLRHRSRRDSDSVAVLKVKECGFRFCWRNQSRIEPKEIACVV---HDYSEAEAVDEVEKWRQLCGPSRVLF
        MDG LNKI   LIA+F LTFL L+AQILY+L   SRR     A    ++C + FCWRNQ RIEPKEIAC+    H++  A A   VE+W++L GPSR LF
Subjt:  MDGHLNKIATALIAVFSLTFLALLAQILYVLRHRSRRDSDSVAVLKVKECGFRFCWRNQSRIEPKEIACVV---HDYSEAEAVDEVEKWRQLCGPSRVLF

Query:  TINEAAEEEEREGIESSEPRFPDPDTTPFHTPCASPPYY-TPSP--SPTRRTADFPADDTSPSGHNRTAPFFAIEIRTH
        TI    EEEE EGIESS        T  FHTPC SP YY TPSP  SPTR   +FP D      HNRT PF AIEI +H
Subjt:  TINEAAEEEEREGIESSEPRFPDPDTTPFHTPCASPPYY-TPSP--SPTRRTADFPADDTSPSGHNRTAPFFAIEIRTH

XP_022983133.1 uncharacterized protein LOC111481774 [Cucurbita maxima]; e value: 2.3e-34; identity: 55.68%
Query:  MDGHLNKIATALIAVFSLTFLALLAQILYVLRHRSRRDSDSVAVLKVKECGFRFCWRNQSRIEPKEIACV--VHDYSEAEAVDEVEKWRQLCGPSRVLFT
        MDG LNKI   LIA+F LTFL L+AQILY+L   SRR     A    K+C + FCWRNQSRI+PKEIA +   H   EA AV  VE+W++L GPSR LFT
Subjt:  MDGHLNKIATALIAVFSLTFLALLAQILYVLRHRSRRDSDSVAVLKVKECGFRFCWRNQSRIEPKEIACV--VHDYSEAEAVDEVEKWRQLCGPSRVLFT

Query:  INEAAEEEEREGIESSEPRFPDPDTTPFHTPCASPP-YYTPSPSPTRRTADFPADDTSPSGHNRTAPFFAIEIRTH
        I    EEEE EG ESS        T  FHTPC SP  Y+TPSPSPTR   +FP D      HNRT PF A+EI +H
Subjt:  INEAAEEEEREGIESSEPRFPDPDTTPFHTPCASPP-YYTPSPSPTRRTADFPADDTSPSGHNRTAPFFAIEIRTH

XP_023005969.1 uncharacterized protein LOC111498829 [Cucurbita maxima]; e value: 1.4e-36; identity: 58.66%
Query:  MDGHLNKIATALIAVFSLTFLALLAQILYVLRHRSRRDSDSVAVLKVKECGFRFCWRNQSRIEPKEIACVVHDYS---EAEAVD-EVEKWRQLCGPSRVL
        MDG L KIATALI++F++TFL LLAQIL+     SR           + C + FCWR ++RIEP EIA  VHD S   E  AVD E+EKWR LCGPSR L
Subjt:  MDGHLNKIATALIAVFSLTFLALLAQILYVLRHRSRRDSDSVAVLKVKECGFRFCWRNQSRIEPKEIACVVHDYS---EAEAVD-EVEKWRQLCGPSRVL

Query:  FTINEAAEEEEREGIESSE--PRFPDPDTTPFHTPCASPPYYTPSPSPTRRTADFPADDTSPSGHNRTAPFFAIEIRTH
        FTI E  EEEEREG+ES E  P+ PD DTTPF+TPC SPP++T  PSPT   ADFP    SP G +RTA F AIEI TH
Subjt:  FTINEAAEEEEREGIESSE--PRFPDPDTTPFHTPCASPPYYTPSPSPTRRTADFPADDTSPSGHNRTAPFFAIEIRTH

XP_023528874.1 uncharacterized protein LOC111791670 [Cucurbita pepo subsp. pepo]; e value: 7.8e-35; identity: 56.25%
Query:  MDGHLNKIATALIAVFSLTFLALLAQILYVLRHRSRRDSDSVAVLKVKECGFRFCWRNQSRIEPKEIACV--VHDYSEAEAVDEVEKWRQLCGPSRVLFT
        MDG LNKI   LIA+F LTFL L+AQILY+L  R  +     A    ++C + FCWRNQ RIEPKEIAC+   H   EA AV  VE+W++L GPSR LFT
Subjt:  MDGHLNKIATALIAVFSLTFLALLAQILYVLRHRSRRDSDSVAVLKVKECGFRFCWRNQSRIEPKEIACV--VHDYSEAEAVDEVEKWRQLCGPSRVLFT

Query:  INEAAEEEEREGIESSEPRFPDPDTTPFHTPCASPPYY-TPSPSPTRRTADFPADDTSPSGHNRTAPFFAIEIRTH
        I    EEEE EGIESS        T  FHTPC SP YY TPSPSPTR   +FP D      HNRT PF AIEI +H
Subjt:  INEAAEEEEREGIESSEPRFPDPDTTPFHTPCASPPYY-TPSPSPTRRTADFPADDTSPSGHNRTAPFFAIEIRTH

XP_038905411.1 uncharacterized protein LOC120091450 [Benincasa hispida]; e value: 3.5e-35; identity: 54.74%
Query:  MDGHLNKIATALIAVFSLTFLALLAQILYVL-RHRSRRDSDSVAVLKVKECGFRFCW-RNQSRIEPKEIACVVHDYSEAE---------AVD-EVEKWRQ
        MDG LNK ATA+IAVF LTFL L+AQILY L R R +R  +       ++C + FCW RNQSRIEPKE+  +     + E         AVD E+EKW++
Subjt:  MDGHLNKIATALIAVFSLTFLALLAQILYVL-RHRSRRDSDSVAVLKVKECGFRFCW-RNQSRIEPKEIACVVHDYSEAE---------AVD-EVEKWRQ

Query:  LCGPSRVLFTINEAAEEEEREGIESSEPRFP------DPDTTPFHTPCASPPYYTPSPSPTRRTADFPADDTSPSGHNRTAPFFAIEIRT
        LCGPSR LFTI E  EEEEREGI       P      D DTTPFHTPCASPPY+TPS SPTR + DFPA D   S  N T PF  I+I T
Subjt:  LCGPSRVLFTINEAAEEEEREGIESSEPRFP------DPDTTPFHTPCASPPYYTPSPSPTRRTADFPADDTSPSGHNRTAPFFAIEIRT

TrEMBL top hits
A0A5A7TPT7 Uncharacterized protein; e value: 3.9e-32; identity: 53.4%
Query:  MDGHLNKIATALIAVFSLTFLALLAQILYVLRHRSRRDSDSVAVLKVKECGFRFCWRNQSRIEPKEIACV--VHDYSEAEAV--------DEVEKWRQLC
        MDG LNKIAT LI +F   F  LLAQILY+L +R RR          ++C + F WRNQSRI+PKEI  +  +  +S   AV        DE+ KW++LC
Subjt:  MDGHLNKIATALIAVFSLTFLALLAQILYVLRHRSRRDSDSVAVLKVKECGFRFCWRNQSRIEPKEIACV--VHDYSEAEAV--------DEVEKWRQLC

Query:  GPSRVLFTINEAAEEEEREGI----ESSEPRFP----DPDTTPFHTPCASPPYYTPSPSPTRRTADFPADDTSPSGHNRTAPFFAIEIRTH
        GPSR LFTI    EEEEREGI    E + P FP    D DTTPFHTPCASPPY TPS SP+R   DFPAD   P   N T PF AI+I TH
Subjt:  GPSRVLFTINEAAEEEEREGI----ESSEPRFP----DPDTTPFHTPCASPPYYTPSPSPTRRTADFPADDTSPSGHNRTAPFFAIEIRTH

A0A6J1F429 uncharacterized protein LOC111442204; e value: 1.1e-34; identity: 55.31%
Query:  MDGHLNKIATALIAVFSLTFLALLAQILYVLRHRSRRDSDSVAVLKVKECGFRFCWRNQSRIEPKEIACVV---HDYSEAEAVDEVEKWRQLCGPSRVLF
        MDG LNKI   LIA+F LTFL L+AQILY+L   SRR     A    ++C + FCWRNQ RIEPKEIAC+    H++  A A   VE+W++L GPSR LF
Subjt:  MDGHLNKIATALIAVFSLTFLALLAQILYVLRHRSRRDSDSVAVLKVKECGFRFCWRNQSRIEPKEIACVV---HDYSEAEAVDEVEKWRQLCGPSRVLF

Query:  TINEAAEEEEREGIESSEPRFPDPDTTPFHTPCASPPYY-TPSP--SPTRRTADFPADDTSPSGHNRTAPFFAIEIRTH
        TI    EEEE EGIESS        T  FHTPC SP YY TPSP  SPTR   +FP D      HNRT PF AIEI +H
Subjt:  TINEAAEEEEREGIESSEPRFPDPDTTPFHTPCASPPYY-TPSP--SPTRRTADFPADDTSPSGHNRTAPFFAIEIRTH

A0A6J1FQJ7 uncharacterized protein LOC111447877; e value: 2.1e-33; identity: 56.74%
Query:  MDGHLNKIATALIAVFSLTFLALLAQILYVLRHRSRRDSDSVAVLKVKECGFRFCWRNQSRIEPKEIACVVHDYS---EAEAVD-EVEKWRQLCGPSRVL
        MDG LNKIATALI++F++TFL LLAQIL+     SR           + C + F WR ++RIEP EIA  V+D S   E  AVD E+EKWR LCGPSR L
Subjt:  MDGHLNKIATALIAVFSLTFLALLAQILYVLRHRSRRDSDSVAVLKVKECGFRFCWRNQSRIEPKEIACVVHDYS---EAEAVD-EVEKWRQLCGPSRVL

Query:  FTINEAAEEEEREGIESSE-PRFPDPDTTPFHTPCASPPYYTPSPSPTRRTADFPADDTSPSGHNRTAPFFAIEIRTH
        FTI E  EEEEREGIES E    PD DTTPFHTPC SPP++  SPSPTR   DFP    S  G      F AIEI TH
Subjt:  FTINEAAEEEEREGIESSE-PRFPDPDTTPFHTPCASPPYYTPSPSPTRRTADFPADDTSPSGHNRTAPFFAIEIRTH

A0A6J1J6W6 uncharacterized protein LOC111481774; e value: 1.1e-34; identity: 55.68%
Query:  MDGHLNKIATALIAVFSLTFLALLAQILYVLRHRSRRDSDSVAVLKVKECGFRFCWRNQSRIEPKEIACV--VHDYSEAEAVDEVEKWRQLCGPSRVLFT
        MDG LNKI   LIA+F LTFL L+AQILY+L   SRR     A    K+C + FCWRNQSRI+PKEIA +   H   EA AV  VE+W++L GPSR LFT
Subjt:  MDGHLNKIATALIAVFSLTFLALLAQILYVLRHRSRRDSDSVAVLKVKECGFRFCWRNQSRIEPKEIACV--VHDYSEAEAVDEVEKWRQLCGPSRVLFT

Query:  INEAAEEEEREGIESSEPRFPDPDTTPFHTPCASPP-YYTPSPSPTRRTADFPADDTSPSGHNRTAPFFAIEIRTH
        I    EEEE EG ESS        T  FHTPC SP  Y+TPSPSPTR   +FP D      HNRT PF A+EI +H
Subjt:  INEAAEEEEREGIESSEPRFPDPDTTPFHTPCASPP-YYTPSPSPTRRTADFPADDTSPSGHNRTAPFFAIEIRTH

A0A6J1L0U5 uncharacterized protein LOC111498829; e value: 6.9e-37; identity: 58.66%
Query:  MDGHLNKIATALIAVFSLTFLALLAQILYVLRHRSRRDSDSVAVLKVKECGFRFCWRNQSRIEPKEIACVVHDYS---EAEAVD-EVEKWRQLCGPSRVL
        MDG L KIATALI++F++TFL LLAQIL+     SR           + C + FCWR ++RIEP EIA  VHD S   E  AVD E+EKWR LCGPSR L
Subjt:  MDGHLNKIATALIAVFSLTFLALLAQILYVLRHRSRRDSDSVAVLKVKECGFRFCWRNQSRIEPKEIACVVHDYS---EAEAVD-EVEKWRQLCGPSRVL

Query:  FTINEAAEEEEREGIESSE--PRFPDPDTTPFHTPCASPPYYTPSPSPTRRTADFPADDTSPSGHNRTAPFFAIEIRTH
        FTI E  EEEEREG+ES E  P+ PD DTTPF+TPC SPP++T  PSPT   ADFP    SP G +RTA F AIEI TH
Subjt:  FTINEAAEEEEREGIESSE--PRFPDPDTTPFHTPCASPPYYTPSPSPTRRTADFPADDTSPSGHNRTAPFFAIEIRTH

SwissProt top hits
No hits found

Arabidopsis top hits
AT3G11640.1 unknown protein; e value: 9.6e-07; identity: 34.32%
Query:  LNKIATALIAVFSLTFLALLAQILYVLRHRSRRDSDSVAVLKVKECGFRF---CWRN--QSRIEPKE--IACVVHDYSEAEAVDEVE-----KWRQLCGP
        L      L+AV     L L A++ Y+   R    + S+     KE  F+F   C +N   SRIEP    ++  + +   A AV E E     KWR     
Subjt:  LNKIATALIAVFSLTFLALLAQILYVLRHRSRRDSDSVAVLKVKECGFRF---CWRN--QSRIEPKE--IACVVHDYSEAEAVDEVE-----KWRQLCGP

Query:  SRVLFTINE-------------AAEEEEREGIESSEPRFPDPDTTPFHTPCASPPYYTPSPSPTRRTAD
        SR+LFTI E             +AE +    ++   P     D TPF TPC SPPY+TPSPSP R   D
Subjt:  SRVLFTINE-------------AAEEEEREGIESSEPRFPDPDTTPFHTPCASPPYYTPSPSPTRRTAD

AT3G52480.1 unknown protein; e value: 2.5e-07; identity: 34.69%
Query:  LNKIATALIAVFSLTFLALLAQILYVL------RHR----------SRRDSDSVAV---LKVKECGFRFCWRN-QSRIEPKEIACVVHDYSEAEAVDEV-
        L   AT L+AVF+   +A+ AQ  YVL      R R          S R  D  A     K     F FC  N Q RI     A      + A  V++V 
Subjt:  LNKIATALIAVFSLTFLALLAQILYVL------RHR----------SRRDSDSVAV---LKVKECGFRFCWRN-QSRIEPKEIACVVHDYSEAEAVDEV-

Query:  EKW-----RQLCGPSRVLFTINE--AAEEEEREG---------------------------IESSEPRFPD-PDTTPFHTPCASPPYYTPSPSPTR
         KW       LCGPS  LFTI E   +E + R G                           I   E  F     TTPF TPCASPP+YTPSPSP R
Subjt:  EKW-----RQLCGPSRVLFTINE--AAEEEEREG---------------------------IESSEPRFPD-PDTTPFHTPCASPPYYTPSPSPTR


Sequences
CDS sequence
ATGGACGGGCATTTGAACAAGATCGCCACCGCACTCATCGCCGTCTTTTCGCTCACTTTCCTCGCTCTCCTCGCCCAGATCCTCTACGTTCTCCGCCACCGCAGCCGCCG
TGATTCCGATTCTGTAGCTGTGTTGAAAGTGAAGGAATGCGGGTTCCGTTTCTGCTGGAGGAACCAATCGCGGATCGAACCCAAAGAGATCGCGTGTGTGGTCCACGATT
ATTCGGAGGCCGAGGCGGTGGACGAGGTCGAGAAGTGGCGGCAGCTGTGCGGACCCTCCAGAGTTCTGTTCACTATCAACGAAGCGGCCGAAGAAGAGGAGAGAGAAGGA
ATCGAATCTTCCGAGCCGCGTTTTCCAGATCCCGATACGACACCGTTTCACACCCCCTGCGCTTCTCCTCCCTACTACACGCCCTCGCCTTCGCCCACGCGCCGCACCGC
CGATTTCCCGGCGGACGATACTTCTCCCTCCGGTCACAACCGAACGGCACCGTTTTTCGCTATAGAAATTCGCACTCAC
mRNA sequence
ATGGACGGGCATTTGAACAAGATCGCCACCGCACTCATCGCCGTCTTTTCGCTCACTTTCCTCGCTCTCCTCGCCCAGATCCTCTACGTTCTCCGCCACCGCAGCCGCCG
TGATTCCGATTCTGTAGCTGTGTTGAAAGTGAAGGAATGCGGGTTCCGTTTCTGCTGGAGGAACCAATCGCGGATCGAACCCAAAGAGATCGCGTGTGTGGTCCACGATT
ATTCGGAGGCCGAGGCGGTGGACGAGGTCGAGAAGTGGCGGCAGCTGTGCGGACCCTCCAGAGTTCTGTTCACTATCAACGAAGCGGCCGAAGAAGAGGAGAGAGAAGGA
ATCGAATCTTCCGAGCCGCGTTTTCCAGATCCCGATACGACACCGTTTCACACCCCCTGCGCTTCTCCTCCCTACTACACGCCCTCGCCTTCGCCCACGCGCCGCACCGC
CGATTTCCCGGCGGACGATACTTCTCCCTCCGGTCACAACCGAACGGCACCGTTTTTCGCTATAGAAATTCGCACTCAC
Protein sequence
MDGHLNKIATALIAVFSLTFLALLAQILYVLRHRSRRDSDSVAVLKVKECGFRFCWRNQSRIEPKEIACVVHDYSEAEAVDEVEKWRQLCGPSRVLFTINEAAEEEEREG
IESSEPRFPDPDTTPFHTPCASPPYYTPSPSPTRRTADFPADDTSPSGHNRTAPFFAIEIRTH
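
The relationship between the CDS and protein sequences listed here can be checked with a minimal translation sketch. It assumes the standard genetic code (the page does not say which table was used), and `translate` is a hypothetical helper rather than a CuGenDB tool. Only the first 30 nucleotides of the CDS are used in the example.

```python
from itertools import product

# Standard genetic code, ASSUMED here; codons are enumerated in TCAG order,
# with the first base varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS (any whitespace is ignored) codon by codon."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    # Raises KeyError on ambiguous bases such as N.
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 30 nucleotides of the CDS sequence listed above.
cds_prefix = "ATGGACGGGCATTTGAACAAGATCGCCACC"
print(translate(cds_prefix))  # MDGHLNKIAT
```

The output matches the first ten residues of the protein sequence. Passing the full CDS, with its lines joined, should allow the same comparison over the whole record.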