; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g23940 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g23940
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionDNA-directed DNA polymerase
Genome locationchr4:17308206..17311150
RNA-Seq ExpressionMoc04g23940
SyntenyMoc04g23940
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF7812290.1 uncharacterized protein G2W53_033266 [Senna tora]8.4e-3739.83Show/hide
Query:  LSDKAKDWLNYMPAKSITYWNDTAEKFLMKYFLPNKNAKYQGDINNFHQRHGELVSESWERFKELLRSCP------------------------------
        L DKAK WL  +P +SI+ W + A++FL KYF P K AK + DI +F     E + E+WERFKELLR CP                              
Subjt:  LSDKAKDWLNYMPAKSITYWNDTAEKFLMKYFLPNKNAKYQGDINNFHQRHGELVSESWERFKELLRSCP------------------------------

Query:  ----TMVFQAVLDIFERISANNYHWSDPRAVNDKSTQGATDNEAMLALKDQITNLTNMKLGIGKARPTVVTIQLSDRSITHPEGKIEDVLVQVNKLIFPA
            +    A   + E +S+NN+ W   R V+ +   G  D++   +L  QI  LT                 L+DRSI +P G IEDVLV+V+K IFPA
Subjt:  ----TMVFQAVLDIFERISANNYHWSDPRAVNDKSTQGATDNEAMLALKDQITNLTNMKLGIGKARPTVVTIQLSDRSITHPEGKIEDVLVQVNKLIFPA

Query:  DFIILDYEADKEISIIFGRPSLSTGTTLLDVQNKELTMRVK
        DFI+LDYE D+E+ II GRP L+T  T++DVQ  EL MRV+
Subjt:  DFIILDYEADKEISIIFGRPSLSTGTTLLDVQNKELTMRVK

KAF7832847.1 uncharacterized protein G2W53_015180 [Senna tora]3.4e-3837.81Show/hide
Query:  LSDKAKDWLNYMPAKSITYWNDTAEKFLMKYFLPNKNAKYQGDINNFHQRHGELVSESWERFKELLRSCP------------------------------
        L DKAK WL  +P  SIT W + A++FL KYF P K AK + DI +F     E + E+WERFKELLR CP                              
Subjt:  LSDKAKDWLNYMPAKSITYWNDTAEKFLMKYFLPNKNAKYQGDINNFHQRHGELVSESWERFKELLRSCP------------------------------

Query:  ----TMVFQAVLDIFERISANNYHWSDPRAVNDKSTQGATDNEAMLALKDQITNLTN--MKLGIGKARPTVVTIQLSDRSITHPEGKIEDVLVQVNKLIF
            +    A   + E +S+NN+ W   R V+ +   G  D++   +L  QI  LT     LGI  A        ++DRSI +P G IE+VLV+V+KLI 
Subjt:  ----TMVFQAVLDIFERISANNYHWSDPRAVNDKSTQGATDNEAMLALKDQITNLTN--MKLGIGKARPTVVTIQLSDRSITHPEGKIEDVLVQVNKLIF

Query:  PADFIILDYEADKEISIIFGRPSLSTGTTLLDVQNKELTMRV-------------KFPADMEKCSLLRLAYDLLTEETQTEEL
        PADFI+LDYE DK + II GRP L+TG T++DVQ  ELTM+V             K P + E+C  + +  +++    Q +EL
Subjt:  PADFIILDYEADKEISIIFGRPSLSTGTTLLDVQNKELTMRV-------------KFPADMEKCSLLRLAYDLLTEETQTEEL

KAF7833226.1 uncharacterized protein G2W53_015559 [Senna tora]3.4e-3837.11Show/hide
Query:  LSDKAKDWLNYMPAKSITYWNDTAEKFLMKYFLPNKNAKYQGDINNFHQRHGELVSESWERFKELLRSCP------------------------------
        L DKAK WL  +P  SIT W + A++FL KYF P K AK + DI +F   + E + E+WERFK+LLR CP                              
Subjt:  LSDKAKDWLNYMPAKSITYWNDTAEKFLMKYFLPNKNAKYQGDINNFHQRHGELVSESWERFKELLRSCP------------------------------

Query:  ----TMVFQAVLDIFERISANNYHWSDPRAVNDKSTQGATDNEAMLALKDQITNLTN--MKLGIGKARPTV--------VTIQLSDRSITHPEGKIEDVL
            +    A   + E +S+NN+ W   R V  +   G  D++   +L  QI  LT     LGI  A   V        +  +++DRSI +P G IE+VL
Subjt:  ----TMVFQAVLDIFERISANNYHWSDPRAVNDKSTQGATDNEAMLALKDQITNLTN--MKLGIGKARPTV--------VTIQLSDRSITHPEGKIEDVL

Query:  VQVNKLIFPADFIILDYEADKEISIIFGRPSLSTGTTLLDVQNKELTMRV-------------KFPADMEKCSLLRLAYDLLTEETQTEEL
        V+V+KLI P DFI+LDYE DK + II GRP LSTG T++DVQ  ELTMRV             K P + E+C  + +  +++    Q +EL
Subjt:  VQVNKLIFPADFIILDYEADKEISIIFGRPSLSTGTTLLDVQNKELTMRV-------------KFPADMEKCSLLRLAYDLLTEETQTEEL

KAF7843931.1 uncharacterized protein G2W53_000836 [Senna tora]2.9e-3736.63Show/hide
Query:  LSDKAKDWLNYMPAKSITYWNDTAEKFLMKYFLPNKNAKYQGDINNFHQRHGELVSESWERFKELLRSCP------------------------------
        L DKAK WL  +P  SI+ W + A++FL KYF P K AK + DI +F     E + E+WERFKELLR CP                              
Subjt:  LSDKAKDWLNYMPAKSITYWNDTAEKFLMKYFLPNKNAKYQGDINNFHQRHGELVSESWERFKELLRSCP------------------------------

Query:  ----TMVFQAVLDIFERISANNYHWSDPRAVNDKSTQGATDNEAMLALKDQITNLTNMKLGIG--------------------------------KARPT
            +        + E +S+NN+ W   R V+ +   G  D++   +L  QI  LT     +G                                   PT
Subjt:  ----TMVFQAVLDIFERISANNYHWSDPRAVNDKSTQGATDNEAMLALKDQITNLTNMKLGIG--------------------------------KARPT

Query:  VVTIQLSDRSITHPEGKIEDVLVQVNKLIFPADFIILDYEADKEISIIFGRPSLSTGTTLLDVQNKELTMRVK
         V +QL+DRSI +P G IEDVLV+V+K IFP DFI+LDYE D+E+ II GRP L+TG T++DVQ  EL MRV+
Subjt:  VVTIQLSDRSITHPEGKIEDVLVQVNKLIFPADFIILDYEADKEISIIFGRPSLSTGTTLLDVQNKELTMRVK

XP_022147186.1 uncharacterized protein LOC111016198 [Momordica charantia]3.8e-3769.77Show/hide
Query:  LTNMKLGIGKARPTVVTIQLSDRSITHPEGKIEDVLVQVNKLIFPADFIILDYEADKEISIIFGRPSLSTGTTLLDVQNKELTMRV-------------K
        L   KLGIG+ARPT VT+QL+DRSITHPEGK EDVLVQV+K IFPADFIILDYE +KEI II GRP LSTG  L+DV N ELTMRV             K
Subjt:  LTNMKLGIGKARPTVVTIQLSDRSITHPEGKIEDVLVQVNKLIFPADFIILDYEADKEISIIFGRPSLSTGTTLLDVQNKELTMRV-------------K

Query:  FPADMEKCSLLRLAYDLLTEETQTEELLD
        FPAD+E+CSLLRLA DL +EE QTEELLD
Subjt:  FPADMEKCSLLRLAYDLLTEETQTEELLD

TrEMBL top hitse value%identityAlignment
A0A5B6ULS1 Retrotrans_gag domain-containing protein5.7e-3130.41Show/hide
Query:  DPHLHL-----------SDKAKDWLNYMPAKSITYWNDTAEKFLMKYFLPNKNAKYQGDINNFHQRHGELVSESWERFKELLRSCPTMVFQ---------
        DP  HL            D AK  LN +P++    WND  ++FL++Y LPN NAK + +I +F Q   E + E+WERFK+LLR CP  VFQ         
Subjt:  DPHLHL-----------SDKAKDWLNYMPAKSITYWNDTAEKFLMKYFLPNKNAKYQGDINNFHQRHGELVSESWERFKELLRSCPTMVFQ---------

Query:  -----------------AVLD--------IFERISANNYHWSDPRAVNDKSTQGATDNEAMLALKDQITNLTNM--------------------------
                         A LD        I ERI+ N+Y +   RA   +   G+ +  A+ ++   +++L NM                          
Subjt:  -----------------AVLD--------IFERISANNYHWSDPRAVNDKSTQGATDNEAMLALKDQITNLTNM--------------------------

Query:  ----------------------------------------------------------------------------------KLGIGKARPTVVTIQLSD
                                                                                          KLGIG+ARPT+VT+QL+D
Subjt:  ----------------------------------------------------------------------------------KLGIGKARPTVVTIQLSD

Query:  RSITHPEGKIEDVLVQVNKLIFPADFIILDYEADKEISIIFGRPSLSTGTTLLDVQNKELTMRVK
        RS    EGKIEDVLV+V+K  FP DFI+LD EADK++ II GRP L+TG +++D Q +ELTMRVK
Subjt:  RSITHPEGKIEDVLVQVNKLIFPADFIILDYEADKEISIIFGRPSLSTGTTLLDVQNKELTMRVK

A0A6J1D1L0 uncharacterized protein LOC1110161981.8e-3769.77Show/hide
Query:  LTNMKLGIGKARPTVVTIQLSDRSITHPEGKIEDVLVQVNKLIFPADFIILDYEADKEISIIFGRPSLSTGTTLLDVQNKELTMRV-------------K
        L   KLGIG+ARPT VT+QL+DRSITHPEGK EDVLVQV+K IFPADFIILDYE +KEI II GRP LSTG  L+DV N ELTMRV             K
Subjt:  LTNMKLGIGKARPTVVTIQLSDRSITHPEGKIEDVLVQVNKLIFPADFIILDYEADKEISIIFGRPSLSTGTTLLDVQNKELTMRV-------------K

Query:  FPADMEKCSLLRLAYDLLTEETQTEELLD
        FPAD+E+CSLLRLA DL +EE QTEELLD
Subjt:  FPADMEKCSLLRLAYDLLTEETQTEELLD

A0A6J1DUG5 uncharacterized protein LOC1110244567.0e-2957.6Show/hide
Query:  KLGIGKARPTVVTIQLSDRSITHPEGKIEDVLVQVNKLIFPADFIILDYEADKEISIIFGRPSLSTGTTLLDVQNKELTMRV-------------KFPAD
        KLGIG+ARP  +T++L DRSI HP+GKIEDVLVQV+K IFPADFIILDYE DKE+ II GRP L TG  L+DV   ELTMRV             KFPA+
Subjt:  KLGIGKARPTVVTIQLSDRSITHPEGKIEDVLVQVNKLIFPADFIILDYEADKEISIIFGRPSLSTGTTLLDVQNKELTMRV-------------KFPAD

Query:  MEKCSLLRLAYDLLTEETQTEELLD
         E+C +L+L  + L +E +TE +L+
Subjt:  MEKCSLLRLAYDLLTEETQTEELLD

A0A6J1DV77 uncharacterized protein LOC1110238181.2e-3365.6Show/hide
Query:  KLGIGKARPTVVTIQLSDRSITHPEGKIEDVLVQVNKLIFPADFIILDYEADKEISIIFGRPSLSTGTTLLDVQNKELTMRV-------------KFPAD
        KLGIG+ARP  VT+QL+DRSIT+ EGKIEDVLVQV+K IFPADFIILDYEADKEI II GRP LSTG  L+DV N ELT+RV             K+P D
Subjt:  KLGIGKARPTVVTIQLSDRSITHPEGKIEDVLVQVNKLIFPADFIILDYEADKEISIIFGRPSLSTGTTLLDVQNKELTMRV-------------KFPAD

Query:  MEKCSLLRLAYDLLTEETQTEELLD
        +E+CS LR+A DL+++E QTEELL+
Subjt:  MEKCSLLRLAYDLLTEETQTEELLD

A0A6J1E1F3 uncharacterized protein LOC1110250651.7e-2725.84Show/hide
Query:  LSDKAKDWLNYMPAKSITYWNDTAEKFLMKYFLPNKNAKYQGDINNFHQRHGELVSESWERFKELLRSCPTMVFQAVLDI--FERISANNYHWSDPRAVN
        L D+A+ WL  +P +SIT W+D AEKFLMKYF P+KNAKY+ +INNF Q  GE VSESWE FK LL+SCP       + I  + +   +     DPRAV 
Subjt:  LSDKAKDWLNYMPAKSITYWNDTAEKFLMKYFLPNKNAKYQGDINNFHQRHGELVSESWERFKELLRSCPTMVFQAVLDI--FERISANNYHWSDPRAVN

Query:  DKSTQGATDNEAMLALKDQITNLTNM--------------------------------------------------------------------------
         KS++G  ++E+   L   I NLT +                                                                          
Subjt:  DKSTQGATDNEAMLALKDQITNLTNM--------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------------------------------------------------KLGIGKA
                                                                                                     KLGIG+A
Subjt:  ---------------------------------------------------------------------------------------------KLGIGKA

Query:  RPTVVTIQLSDRSITHPEGKIEDVLVQVNKLIFPADFIILDYEADKEISIIFGRPSLSTGTTLLDVQNKELTMRVK
        RPT+VT+QL+DRSITHPEGKIEDVLV V+K  FPADFIILDY+ADKE+ II GRP L+TG  L+DV   ELTMRV+
Subjt:  RPTVVTIQLSDRSITHPEGKIEDVLVQVNKLIFPADFIILDYEADKEISIIFGRPSLSTGTTLLDVQNKELTMRVK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATCAACTACCAAGACTCTCTAATCCAGTTGAAGGAAATAATGGAGGAGAAGTGGCTATACCAGTAGCTGCTCCCCCTTTTGACGCCATTCTATTGGTAGAT
GATGGAGAAAGGGCTACAGAGCCTATGTTGCGCCTGTACTACATGTTTTTTGAAAATCTATCTGGAGACCCTCATTTGCACTTGAGTGACAAGGCAAAAGACTGG
TTAAACTATATGCCAGCTAAATCCATTACTTATTGGAATGATACTGCTGAAAAGTTTCTAATGAAGTATTTTCTCCCAAACAAGAATGCCAAGTATCAAGGAGAC
ATTAATAATTTCCATCAGAGGCATGGAGAATTGGTGAGTGAGTCGTGGGAAAGGTTCAAGGAACTGCTAAGGAGCTGCCCCACCATGGTTTTCCAAGCGGTACTT
GACATCTTCGAAAGGATTTCTGCTAATAACTACCACTGGTCAGATCCCAGAGCAGTGAATGACAAGAGCACTCAAGGGGCTACTGATAATGAGGCAATGCTTGCG
CTGAAGGATCAGATTACCAACCTAACTAACATGAAATTGGGAATTGGGAAGGCACGCCCCACCGTGGTGACCATACAGTTGTCTGATAGGTCAATAACGCATCCA
GAGGGTAAGATAGAAGATGTTTTAGTGCAGGTGAACAAATTAATCTTCCCAGCTGACTTCATCATATTAGACTACGAAGCTGACAAGGAAATCTCAATCATTTTT
GGAAGGCCTTCCCTCTCCACTGGCACAACTTTACTAGATGTACAAAATAAGGAATTAACGATGAGAGTAAAGTTCCCTGCTGATATGGAGAAATGTTCTCTGTTA
AGGCTTGCATATGACTTGCTTACGGAGGAAACGCAAACAGAGGAGTTGCTAGATTAG
mRNA sequenceShow/hide mRNA sequence
ATGGATCAACTACCAAGACTCTCTAATCCAGTTGAAGGAAATAATGGAGGAGAAGTGGCTATACCAGTAGCTGCTCCCCCTTTTGACGCCATTCTATTGGTAGAT
GATGGAGAAAGGGCTACAGAGCCTATGTTGCGCCTGTACTACATGTTTTTTGAAAATCTATCTGGAGACCCTCATTTGCACTTGAGTGACAAGGCAAAAGACTGG
TTAAACTATATGCCAGCTAAATCCATTACTTATTGGAATGATACTGCTGAAAAGTTTCTAATGAAGTATTTTCTCCCAAACAAGAATGCCAAGTATCAAGGAGAC
ATTAATAATTTCCATCAGAGGCATGGAGAATTGGTGAGTGAGTCGTGGGAAAGGTTCAAGGAACTGCTAAGGAGCTGCCCCACCATGGTTTTCCAAGCGGTACTT
GACATCTTCGAAAGGATTTCTGCTAATAACTACCACTGGTCAGATCCCAGAGCAGTGAATGACAAGAGCACTCAAGGGGCTACTGATAATGAGGCAATGCTTGCG
CTGAAGGATCAGATTACCAACCTAACTAACATGAAATTGGGAATTGGGAAGGCACGCCCCACCGTGGTGACCATACAGTTGTCTGATAGGTCAATAACGCATCCA
GAGGGTAAGATAGAAGATGTTTTAGTGCAGGTGAACAAATTAATCTTCCCAGCTGACTTCATCATATTAGACTACGAAGCTGACAAGGAAATCTCAATCATTTTT
GGAAGGCCTTCCCTCTCCACTGGCACAACTTTACTAGATGTACAAAATAAGGAATTAACGATGAGAGTAAAGTTCCCTGCTGATATGGAGAAATGTTCTCTGTTA
AGGCTTGCATATGACTTGCTTACGGAGGAAACGCAAACAGAGGAGTTGCTAGATTAG
Protein sequenceShow/hide protein sequence
MDQLPRLSNPVEGNNGGEVAIPVAAPPFDAILLVDDGERATEPMLRLYYMFFENLSGDPHLHLSDKAKDWLNYMPAKSITYWNDTAEKFLMKYFLPNKNAKYQGD
INNFHQRHGELVSESWERFKELLRSCPTMVFQAVLDIFERISANNYHWSDPRAVNDKSTQGATDNEAMLALKDQITNLTNMKLGIGKARPTVVTIQLSDRSITHP
EGKIEDVLVQVNKLIFPADFIILDYEADKEISIIFGRPSLSTGTTLLDVQNKELTMRVKFPADMEKCSLLRLAYDLLTEETQTEELLD