; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc03g04930 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc03g04930
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr3:3658709..3659293
RNA-Seq ExpressionMoc03g04930
SyntenyMoc03g04930
Gene Ontology termsGO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR005135 - Endonuclease/exonuclease/phosphatase
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
PKI35403.1 hypothetical protein CRG98_044205, partial [Punica granatum]1.1e-4248Show/hide
Query:  LRDMISAYNPDIMVILEPKISGVVAESVCKSFVNFSCTRVEANSFKGGIWVFWQENRVTLTEICQHNQAFHFRISKGLFSGLFTAVYGSPQRGTRKELWQ
        +++MI  + P+I+VI+EP+ISG  A++VC+ F ++S  RVEA  F GGIWV+WQ N V +T   +H QA H RIS+   +  FTAVY SP    R++LW 
Subjt:  LRDMISAYNPDIMVILEPKISGVVAESVCKSFVNFSCTRVEANSFKGGIWVFWQENRVTLTEICQHNQAFHFRISKGLFSGLFTAVYGSPQRGTRKELWQ

Query:  FLESIAPVLSEPWFLIGDFNEILSEEEKTGGAPFNPSSASLFLDTMNSCQLLDLGSSGPKFTWRGPSLQRFRRIF
         L +I+  ++ PW ++GDFN IL   EK GGAPFNP  A+ F + +++C L+DL SSGP+FTW GP    + R+F
Subjt:  FLESIAPVLSEPWFLIGDFNEILSEEEKTGGAPFNPSSASLFLDTMNSCQLLDLGSSGPKFTWRGPSLQRFRRIF

XP_015934914.2 uncharacterized protein LOC107461000 [Arachis duranensis]4.2e-3741.75Show/hide
Query:  LSWNCRGARSPLFKSVLRDMISAYNPDIMVILEPKISGVVAESVCK--SFVNFSCTRVEANSFKGGIWVFWQENRVTLTEICQHNQAFHFRISK-GLFSG
        +SWN RGA S  F+  L++ +  YNPDI+++LE K+SG  A  + +   F NF     EA  F GGIW+ W++N +++T +  + Q  H R+ +      
Subjt:  LSWNCRGARSPLFKSVLRDMISAYNPDIMVILEPKISGVVAESVCK--SFVNFSCTRVEANSFKGGIWVFWQENRVTLTEICQHNQAFHFRISK-GLFSG

Query:  LFTAVYGSPQRGTRKELWQFLESIAPVLSEPWFLIGDFNEILSEEEKTGGAPFNPSSASLFLDTMNSCQLLDLGSSGPKFTWRGPSLQRFRRIF
          TAVY SPQ   R+ +WQ +E IA  + EPW LIGDFNEI    EK GG P N      F + ++ C L+DLG  G +FTWRGP  + + R+F
Subjt:  LFTAVYGSPQRGTRKELWQFLESIAPVLSEPWFLIGDFNEILSEEEKTGGAPFNPSSASLFLDTMNSCQLLDLGSSGPKFTWRGPSLQRFRRIF

XP_016182610.1 uncharacterized protein LOC107624669 [Arachis ipaensis]4.2e-3740Show/hide
Query:  MIFLSWNCRGARSPLFKSVLRDMISAYNPDIMVILEPKISGVVAE-SVCKSFVNFSCTRVEANSFKGGIWVFWQENRVTLTEICQHNQAFHFRISKGLFS
        MI  SWN RGA SP F  VL++ +S Y P+++ I E + SG VA+ ++ K+  NF+   ++A  F GGIW+ W    +++ EI ++ QA H  +S  L  
Subjt:  MIFLSWNCRGARSPLFKSVLRDMISAYNPDIMVILEPKISGVVAE-SVCKSFVNFSCTRVEANSFKGGIWVFWQENRVTLTEICQHNQAFHFRISKGLFS

Query:  GLFTAVYGSPQRGTRKELWQFLESIAPVLSEPWFLIGDFNEILSEEEKTGGAPFNPSSASLFLDTMNSCQLLDLGSSGPKFTWRGPSLQRFRRIF
           T VY +P    ++EL  ++ ++ P ++ PW L  DFN+I +  EK GGAP N +    F D M+SC  +DLG  G +FTWRGP  Q   R+F
Subjt:  GLFTAVYGSPQRGTRKELWQFLESIAPVLSEPWFLIGDFNEILSEEEKTGGAPFNPSSASLFLDTMNSCQLLDLGSSGPKFTWRGPSLQRFRRIF

XP_022137804.1 uncharacterized protein LOC111009151 [Momordica charantia]6.7e-5158.28Show/hide
Query:  MVILEPKISGVVAESVCKSFVNFSCTRVEANSFKGGIWVFWQENRVTLTEICQHNQAFHFRISKGLFSGLFTAVYGSPQRGTRKELWQFLESIAPVLSEP
        MVI+EPKISG +A+SVC+SF +FS  RVEA+  KGGIWVFW+ +RV+L E+  + QA HFR  +   SG FT VYGSPQR +++ELW FL+S+ P    P
Subjt:  MVILEPKISGVVAESVCKSFVNFSCTRVEANSFKGGIWVFWQENRVTLTEICQHNQAFHFRISKGLFSGLFTAVYGSPQRGTRKELWQFLESIAPVLSEP

Query:  WFLIGDFNEILSEEEKTGGAPFNPSSASLFLDTMNSCQLLDLGSSGPKFTWRGPSLQRFRRIF
        W LIGDFN I S +EK G AP +P  A+ FL T++ CQL+DLGSSGPKFTW+GP +  F R+F
Subjt:  WFLIGDFNEILSEEEKTGGAPFNPSSASLFLDTMNSCQLLDLGSSGPKFTWRGPSLQRFRRIF

XP_031402735.1 uncharacterized protein LOC116212324 [Punica granatum]8.1e-4948.97Show/hide
Query:  MIFLSWNCRGARSPLFKSVLRDMISAYNPDIMVILEPKISGVVAESVCKSFVNFSCTRVEANSFKGGIWVFWQENRVTLTEICQHNQAFHFRISKGLFSG
        M  L WNCRGA S  F  V+++MI  + P+I+VI+EP+ISG  A++VC+ F ++S  RVEA  F GGIWV+WQ N V +T   +H QA H RIS+   + 
Subjt:  MIFLSWNCRGARSPLFKSVLRDMISAYNPDIMVILEPKISGVVAESVCKSFVNFSCTRVEANSFKGGIWVFWQENRVTLTEICQHNQAFHFRISKGLFSG

Query:  LFTAVYGSPQRGTRKELWQFLESIAPVLSEPWFLIGDFNEILSEEEKTGGAPFNPSSASLFLDTMNSCQLLDLGSSGPKFTWRGPSLQRFRRIF
         FTAVY SP    R++LW  L +I+  ++ PW ++GDFN IL   EK GGAPFNP  A+ F + +++C L+DL SSGP+FTW GP    + R+F
Subjt:  LFTAVYGSPQRGTRKELWQFLESIAPVLSEPWFLIGDFNEILSEEEKTGGAPFNPSSASLFLDTMNSCQLLDLGSSGPKFTWRGPSLQRFRRIF

TrEMBL top hitse value%identityAlignment
A0A2I0HUV7 Reverse transcriptase domain-containing protein (Fragment)5.5e-4348Show/hide
Query:  LRDMISAYNPDIMVILEPKISGVVAESVCKSFVNFSCTRVEANSFKGGIWVFWQENRVTLTEICQHNQAFHFRISKGLFSGLFTAVYGSPQRGTRKELWQ
        +++MI  + P+I+VI+EP+ISG  A++VC+ F ++S  RVEA  F GGIWV+WQ N V +T   +H QA H RIS+   +  FTAVY SP    R++LW 
Subjt:  LRDMISAYNPDIMVILEPKISGVVAESVCKSFVNFSCTRVEANSFKGGIWVFWQENRVTLTEICQHNQAFHFRISKGLFSGLFTAVYGSPQRGTRKELWQ

Query:  FLESIAPVLSEPWFLIGDFNEILSEEEKTGGAPFNPSSASLFLDTMNSCQLLDLGSSGPKFTWRGPSLQRFRRIF
         L +I+  ++ PW ++GDFN IL   EK GGAPFNP  A+ F + +++C L+DL SSGP+FTW GP    + R+F
Subjt:  FLESIAPVLSEPWFLIGDFNEILSEEEKTGGAPFNPSSASLFLDTMNSCQLLDLGSSGPKFTWRGPSLQRFRRIF

A0A2N9GX50 Uncharacterized protein1.5e-3540.44Show/hide
Query:  LSWNCRGARSPLFKSVLRDMISAYNPDIMVILEPKISGVVAESVCKSFV--NFSCTRVEANSFKGGIWVFWQENRVTLTEICQHNQAFH--FRISKGLFS
        L+WNCRG  +P F+  L D++   NP I+++ E ++ G  A  + KSF    F C+  +   F GGIW+ W+ N V +  +C   Q  H   ++     S
Subjt:  LSWNCRGARSPLFKSVLRDMISAYNPDIMVILEPKISGVVAESVCKSFV--NFSCTRVEANSFKGGIWVFWQENRVTLTEICQHNQAFH--FRISKGLFS

Query:  GLFTAVYGSPQRGTRKELWQFLESIAPVLSEPWFLIGDFNEILSEEEKTGGAPFNPSSASLFLDTMNSCQLLDLGSSGPKFTW
         L +A+Y SP+R  R+ LWQ L ++A + S PW ++GDFN+I S +EK GG   N S  S + D MN+C ++DLG SGPK+TW
Subjt:  GLFTAVYGSPQRGTRKELWQFLESIAPVLSEPWFLIGDFNEILSEEEKTGGAPFNPSSASLFLDTMNSCQLLDLGSSGPKFTW

A0A6J1C8B2 uncharacterized protein LOC1110091513.2e-5158.28Show/hide
Query:  MVILEPKISGVVAESVCKSFVNFSCTRVEANSFKGGIWVFWQENRVTLTEICQHNQAFHFRISKGLFSGLFTAVYGSPQRGTRKELWQFLESIAPVLSEP
        MVI+EPKISG +A+SVC+SF +FS  RVEA+  KGGIWVFW+ +RV+L E+  + QA HFR  +   SG FT VYGSPQR +++ELW FL+S+ P    P
Subjt:  MVILEPKISGVVAESVCKSFVNFSCTRVEANSFKGGIWVFWQENRVTLTEICQHNQAFHFRISKGLFSGLFTAVYGSPQRGTRKELWQFLESIAPVLSEP

Query:  WFLIGDFNEILSEEEKTGGAPFNPSSASLFLDTMNSCQLLDLGSSGPKFTWRGPSLQRFRRIF
        W LIGDFN I S +EK G AP +P  A+ FL T++ CQL+DLGSSGPKFTW+GP +  F R+F
Subjt:  WFLIGDFNEILSEEEKTGGAPFNPSSASLFLDTMNSCQLLDLGSSGPKFTWRGPSLQRFRRIF

A0A6P4BTE3 uncharacterized protein LOC1074610002.0e-3741.75Show/hide
Query:  LSWNCRGARSPLFKSVLRDMISAYNPDIMVILEPKISGVVAESVCK--SFVNFSCTRVEANSFKGGIWVFWQENRVTLTEICQHNQAFHFRISK-GLFSG
        +SWN RGA S  F+  L++ +  YNPDI+++LE K+SG  A  + +   F NF     EA  F GGIW+ W++N +++T +  + Q  H R+ +      
Subjt:  LSWNCRGARSPLFKSVLRDMISAYNPDIMVILEPKISGVVAESVCK--SFVNFSCTRVEANSFKGGIWVFWQENRVTLTEICQHNQAFHFRISK-GLFSG

Query:  LFTAVYGSPQRGTRKELWQFLESIAPVLSEPWFLIGDFNEILSEEEKTGGAPFNPSSASLFLDTMNSCQLLDLGSSGPKFTWRGPSLQRFRRIF
          TAVY SPQ   R+ +WQ +E IA  + EPW LIGDFNEI    EK GG P N      F + ++ C L+DLG  G +FTWRGP  + + R+F
Subjt:  LFTAVYGSPQRGTRKELWQFLESIAPVLSEPWFLIGDFNEILSEEEKTGGAPFNPSSASLFLDTMNSCQLLDLGSSGPKFTWRGPSLQRFRRIF

A0A6P8E5K3 uncharacterized protein LOC1162123243.9e-4948.97Show/hide
Query:  MIFLSWNCRGARSPLFKSVLRDMISAYNPDIMVILEPKISGVVAESVCKSFVNFSCTRVEANSFKGGIWVFWQENRVTLTEICQHNQAFHFRISKGLFSG
        M  L WNCRGA S  F  V+++MI  + P+I+VI+EP+ISG  A++VC+ F ++S  RVEA  F GGIWV+WQ N V +T   +H QA H RIS+   + 
Subjt:  MIFLSWNCRGARSPLFKSVLRDMISAYNPDIMVILEPKISGVVAESVCKSFVNFSCTRVEANSFKGGIWVFWQENRVTLTEICQHNQAFHFRISKGLFSG

Query:  LFTAVYGSPQRGTRKELWQFLESIAPVLSEPWFLIGDFNEILSEEEKTGGAPFNPSSASLFLDTMNSCQLLDLGSSGPKFTWRGPSLQRFRRIF
         FTAVY SP    R++LW  L +I+  ++ PW ++GDFN IL   EK GGAPFNP  A+ F + +++C L+DL SSGP+FTW GP    + R+F
Subjt:  LFTAVYGSPQRGTRKELWQFLESIAPVLSEPWFLIGDFNEILSEEEKTGGAPFNPSSASLFLDTMNSCQLLDLGSSGPKFTWRGPSLQRFRRIF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G40390.1 DNAse I-like superfamily protein1.3e-0433.78Show/hide
Query:  RKELWQ---FLESIAPVLSEPWFLIGDFNEILSEEEKTGGAPFNPSSASL--FLDTMNSCQLLDLGSSGPKFTW
        R+ LW     L + +P+ + PW ++GDFN+I S  E     P N S   L      M    L+DL   G  +TW
Subjt:  RKELWQ---FLESIAPVLSEPWFLIGDFNEILSEEEKTGGAPFNPSSASL--FLDTMNSCQLLDLGSSGPKFTW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATTTTTCTTTCATGGAATTGTAGGGGGGCTCGAAGCCCTCTCTTTAAGTCGGTGTTGAGAGATATGATATCTGCTTATAATCCTGATATTATGGTTATATTA
GAACCTAAGATTAGTGGTGTGGTAGCTGAGTCTGTTTGTAAAAGTTTTGTTAATTTTTCTTGTACCCGTGTTGAGGCAAATAGTTTTAAAGGAGGGATTTGGGTG
TTTTGGCAAGAAAATAGGGTCACACTCACAGAAATTTGTCAACATAATCAAGCTTTCCACTTTAGGATTTCTAAAGGCTTATTCTCTGGCTTGTTCACGGCCGTC
TATGGTAGTCCTCAGAGAGGTACAAGAAAGGAGCTATGGCAATTTTTAGAGTCTATTGCTCCTGTTCTTTCTGAACCATGGTTTCTCATAGGAGATTTTAATGAG
ATCCTTTCTGAGGAAGAAAAAACAGGAGGTGCCCCGTTCAATCCAAGCTCAGCATCGCTGTTCTTAGACACGATGAATAGTTGTCAGCTACTGGATCTTGGGAGT
TCGGGGCCGAAATTTACGTGGAGGGGACCTTCCCTCCAAAGGTTTAGGAGAATTTTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGATTTTTCTTTCATGGAATTGTAGGGGGGCTCGAAGCCCTCTCTTTAAGTCGGTGTTGAGAGATATGATATCTGCTTATAATCCTGATATTATGGTTATATTA
GAACCTAAGATTAGTGGTGTGGTAGCTGAGTCTGTTTGTAAAAGTTTTGTTAATTTTTCTTGTACCCGTGTTGAGGCAAATAGTTTTAAAGGAGGGATTTGGGTG
TTTTGGCAAGAAAATAGGGTCACACTCACAGAAATTTGTCAACATAATCAAGCTTTCCACTTTAGGATTTCTAAAGGCTTATTCTCTGGCTTGTTCACGGCCGTC
TATGGTAGTCCTCAGAGAGGTACAAGAAAGGAGCTATGGCAATTTTTAGAGTCTATTGCTCCTGTTCTTTCTGAACCATGGTTTCTCATAGGAGATTTTAATGAG
ATCCTTTCTGAGGAAGAAAAAACAGGAGGTGCCCCGTTCAATCCAAGCTCAGCATCGCTGTTCTTAGACACGATGAATAGTTGTCAGCTACTGGATCTTGGGAGT
TCGGGGCCGAAATTTACGTGGAGGGGACCTTCCCTCCAAAGGTTTAGGAGAATTTTTTAG
Protein sequenceShow/hide protein sequence
MIFLSWNCRGARSPLFKSVLRDMISAYNPDIMVILEPKISGVVAESVCKSFVNFSCTRVEANSFKGGIWVFWQENRVTLTEICQHNQAFHFRISKGLFSGLFTAV
YGSPQRGTRKELWQFLESIAPVLSEPWFLIGDFNEILSEEEKTGGAPFNPSSASLFLDTMNSCQLLDLGSSGPKFTWRGPSLQRFRRIF