; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS026227 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS026227
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationscaffold797:519675..520493
RNA-Seq ExpressionMS026227
SyntenyMS026227
Gene Ontology termsGO:0090502 - RNA phosphodiester bond hydrolysis, endonucleolytic (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
GO:0016301 - kinase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022141766.1 uncharacterized protein LOC111012047 [Momordica charantia]7.6e-9188.17Show/hide
Query:  VMGLGVDCNCSFIDRWALWRENITPKELLLAGIICWANWNDKNQRKNGKVVADVHTKSDWITNYAMELLTRRRKTNEAQKNLPCPAIWKHPSKGTVKLNV
        +MGLGVDCNCSFIDRWALWREN+T +ELLLAGIICWANWND+NQR NG  VADVHT+SDWITNYAMELLTRRRKTNEAQKNLPC AIWK PSKGTVKLNV
Subjt:  VMGLGVDCNCSFIDRWALWRENITPKELLLAGIICWANWNDKNQRKNGKVVADVHTKSDWITNYAMELLTRRRKTNEAQKNLPCPAIWKHPSKGTVKLNV

Query:  DAPFDVRNQRGGVGVLVCDDQNSILAALISSHNVNSPLLVEICAIREVVRLAERYIHVGREANKPTHFLMGEALHQNNPVLWLSDF
        DA FDVRNQRGGVGVLV DD+NSILAALISSHNVNSPLL EICAIRE VRLAER IHVGRE NKP HFL GEALHQN+ VLWLSDF
Subjt:  DAPFDVRNQRGGVGVLVCDDQNSILAALISSHNVNSPLLVEICAIREVVRLAERYIHVGREANKPTHFLMGEALHQNNPVLWLSDF

XP_022145060.1 uncharacterized protein LOC111014578 [Momordica charantia]1.9e-1731.18Show/hide
Query:  PVCPICLEEDETTDHALVGCNKAGQIWDISLPMVMGLGVDCNCSFIDRWALWRENITPKELLLAGIICWANWNDKNQRKNGKVVADVHTKSDWITNYAME
        P C +C ++ ETTDHAL  C +A ++W+I LP       D N S  D      E+++  +  L G+  WA WND+N  +  + + D   +SDWI  Y  +
Subjt:  PVCPICLEEDETTDHALVGCNKAGQIWDISLPMVMGLGVDCNCSFIDRWALWRENITPKELLLAGIICWANWNDKNQRKNGKVVADVHTKSDWITNYAME

Query:  LLTR--------RRKTNEAQKNLPCPAIWKHPSKGTVKLNVDAPFDVRNQRGGVGVLVCDDQNSILAALISSHNVNSPLLVEICAI
           R        R + + +  +      W  P  G +K+NVDA       R G+G++  +++  ILAA  +SH+   PL+ E  A+
Subjt:  LLTR--------RRKTNEAQKNLPCPAIWKHPSKGTVKLNVDAPFDVRNQRGGVGVLVCDDQNSILAALISSHNVNSPLLVEICAI

XP_022155289.1 uncharacterized protein LOC111022426 [Momordica charantia]8.1e-1637.2Show/hide
Query:  QIWDISLPMVMGLGVDCNCSFIDRWALWRENITPKELLLAGIICWANWNDKNQRKNGKVVADVHTKSDWITNYAMELLTRRRKTNEAQKNLPCPAIWKHP
        QIW   LP    LG   N  +++RW+ W +N++   L LA I CWA WND N   N K + +   K                      + +P    W+ P
Subjt:  QIWDISLPMVMGLGVDCNCSFIDRWALWRENITPKELLLAGIICWANWNDKNQRKNGKVVADVHTKSDWITNYAMELLTRRRKTNEAQKNLPCPAIWKHP

Query:  SKG-TVKLNVDAPFDVRNQRGGVGVLVCDDQNSILAALISSHNVNSPLLVEICAIREVVRLAER
          G  VK+N DA   V  +  GVGVL+ D    I+AA+I  H V +PLL EI AIRE +RLA R
Subjt:  SKG-TVKLNVDAPFDVRNQRGGVGVLVCDDQNSILAALISSHNVNSPLLVEICAIREVVRLAER

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]2.3e-1832.98Show/hide
Query:  LPVCPICLEEDETTDHALVGCNKAGQIWDISLPMVMGLGVDCNCSFIDRWALWRENITPKELLLAGIICWANWNDKNQRKNGKVVADVHTKSDWITNYAM
        LP C IC +  E+  HA   C +A QIW    P +  L  + N SF++ W+   E + PK+L LA I  W  WND+N   +GK V+ V  K +W+T + +
Subjt:  LPVCPICLEEDETTDHALVGCNKAGQIWDISLPMVMGLGVDCNCSFIDRWALWRENITPKELLLAGIICWANWNDKNQRKNGKVVADVHTKSDWITNYAM

Query:  ELLTRRRKTNEAQK----NLPCPAIWKHPSKGTVKLNVDAPFDVRNQRGGVGVLVCDDQNSILAAL-ISSHNVNSPLLVEICAIREVVRLA
        +  ++ + +N + +    + P    W+  S  ++KLN DA    R      G ++ D   S++AA  I      SPLL EI  I E ++ A
Subjt:  ELLTRRRKTNEAQK----NLPCPAIWKHPSKGTVKLNVDAPFDVRNQRGGVGVLVCDDQNSILAAL-ISSHNVNSPLLVEICAIREVVRLA

XP_022158489.1 uncharacterized protein LOC111024968 [Momordica charantia]3.6e-1638.81Show/hide
Query:  ENITPKELLLAGIICWANWNDKNQRKNGKVVADVHTKSDWITNYAMELLTRRRKTNEAQKNLPCPAIWKHPSKGTVKLNVDAPFDVRNQRGGVGVLVCDD
        +N++  EL L GI CWA WND++   N K + +   K +WI  YA E+  R +  N  ++ +P    W+ P  G VK+N DA   V  +  GVGVL+ + 
Subjt:  ENITPKELLLAGIICWANWNDKNQRKNGKVVADVHTKSDWITNYAMELLTRRRKTNEAQKNLPCPAIWKHPSKGTVKLNVDAPFDVRNQRGGVGVLVCDD

Query:  QNSILAALISSHNVNSPLLVEICAIREVVRLAER
           I+ A++  H V +PLL +I AIRE + LA R
Subjt:  QNSILAALISSHNVNSPLLVEICAIREVVRLAER

TrEMBL top hitse value%identityAlignment
A0A6J1CK80 uncharacterized protein LOC1110120473.7e-9188.17Show/hide
Query:  VMGLGVDCNCSFIDRWALWRENITPKELLLAGIICWANWNDKNQRKNGKVVADVHTKSDWITNYAMELLTRRRKTNEAQKNLPCPAIWKHPSKGTVKLNV
        +MGLGVDCNCSFIDRWALWREN+T +ELLLAGIICWANWND+NQR NG  VADVHT+SDWITNYAMELLTRRRKTNEAQKNLPC AIWK PSKGTVKLNV
Subjt:  VMGLGVDCNCSFIDRWALWRENITPKELLLAGIICWANWNDKNQRKNGKVVADVHTKSDWITNYAMELLTRRRKTNEAQKNLPCPAIWKHPSKGTVKLNV

Query:  DAPFDVRNQRGGVGVLVCDDQNSILAALISSHNVNSPLLVEICAIREVVRLAERYIHVGREANKPTHFLMGEALHQNNPVLWLSDF
        DA FDVRNQRGGVGVLV DD+NSILAALISSHNVNSPLL EICAIRE VRLAER IHVGRE NKP HFL GEALHQN+ VLWLSDF
Subjt:  DAPFDVRNQRGGVGVLVCDDQNSILAALISSHNVNSPLLVEICAIREVVRLAERYIHVGREANKPTHFLMGEALHQNNPVLWLSDF

A0A6J1CTE3 uncharacterized protein LOC1110145789.3e-1831.18Show/hide
Query:  PVCPICLEEDETTDHALVGCNKAGQIWDISLPMVMGLGVDCNCSFIDRWALWRENITPKELLLAGIICWANWNDKNQRKNGKVVADVHTKSDWITNYAME
        P C +C ++ ETTDHAL  C +A ++W+I LP       D N S  D      E+++  +  L G+  WA WND+N  +  + + D   +SDWI  Y  +
Subjt:  PVCPICLEEDETTDHALVGCNKAGQIWDISLPMVMGLGVDCNCSFIDRWALWRENITPKELLLAGIICWANWNDKNQRKNGKVVADVHTKSDWITNYAME

Query:  LLTR--------RRKTNEAQKNLPCPAIWKHPSKGTVKLNVDAPFDVRNQRGGVGVLVCDDQNSILAALISSHNVNSPLLVEICAI
           R        R + + +  +      W  P  G +K+NVDA       R G+G++  +++  ILAA  +SH+   PL+ E  A+
Subjt:  LLTR--------RRKTNEAQKNLPCPAIWKHPSKGTVKLNVDAPFDVRNQRGGVGVLVCDDQNSILAALISSHNVNSPLLVEICAI

A0A6J1DPU1 uncharacterized protein LOC1110224263.9e-1637.2Show/hide
Query:  QIWDISLPMVMGLGVDCNCSFIDRWALWRENITPKELLLAGIICWANWNDKNQRKNGKVVADVHTKSDWITNYAMELLTRRRKTNEAQKNLPCPAIWKHP
        QIW   LP    LG   N  +++RW+ W +N++   L LA I CWA WND N   N K + +   K                      + +P    W+ P
Subjt:  QIWDISLPMVMGLGVDCNCSFIDRWALWRENITPKELLLAGIICWANWNDKNQRKNGKVVADVHTKSDWITNYAMELLTRRRKTNEAQKNLPCPAIWKHP

Query:  SKG-TVKLNVDAPFDVRNQRGGVGVLVCDDQNSILAALISSHNVNSPLLVEICAIREVVRLAER
          G  VK+N DA   V  +  GVGVL+ D    I+AA+I  H V +PLL EI AIRE +RLA R
Subjt:  SKG-TVKLNVDAPFDVRNQRGGVGVLVCDDQNSILAALISSHNVNSPLLVEICAIREVVRLAER

A0A6J1DX30 uncharacterized protein LOC1110248741.1e-1832.98Show/hide
Query:  LPVCPICLEEDETTDHALVGCNKAGQIWDISLPMVMGLGVDCNCSFIDRWALWRENITPKELLLAGIICWANWNDKNQRKNGKVVADVHTKSDWITNYAM
        LP C IC +  E+  HA   C +A QIW    P +  L  + N SF++ W+   E + PK+L LA I  W  WND+N   +GK V+ V  K +W+T + +
Subjt:  LPVCPICLEEDETTDHALVGCNKAGQIWDISLPMVMGLGVDCNCSFIDRWALWRENITPKELLLAGIICWANWNDKNQRKNGKVVADVHTKSDWITNYAM

Query:  ELLTRRRKTNEAQK----NLPCPAIWKHPSKGTVKLNVDAPFDVRNQRGGVGVLVCDDQNSILAAL-ISSHNVNSPLLVEICAIREVVRLA
        +  ++ + +N + +    + P    W+  S  ++KLN DA    R      G ++ D   S++AA  I      SPLL EI  I E ++ A
Subjt:  ELLTRRRKTNEAQK----NLPCPAIWKHPSKGTVKLNVDAPFDVRNQRGGVGVLVCDDQNSILAAL-ISSHNVNSPLLVEICAIREVVRLA

A0A6J1DZK3 uncharacterized protein LOC1110249681.8e-1638.81Show/hide
Query:  ENITPKELLLAGIICWANWNDKNQRKNGKVVADVHTKSDWITNYAMELLTRRRKTNEAQKNLPCPAIWKHPSKGTVKLNVDAPFDVRNQRGGVGVLVCDD
        +N++  EL L GI CWA WND++   N K + +   K +WI  YA E+  R +  N  ++ +P    W+ P  G VK+N DA   V  +  GVGVL+ + 
Subjt:  ENITPKELLLAGIICWANWNDKNQRKNGKVVADVHTKSDWITNYAMELLTRRRKTNEAQKNLPCPAIWKHPSKGTVKLNVDAPFDVRNQRGGVGVLVCDD

Query:  QNSILAALISSHNVNSPLLVEICAIREVVRLAER
           I+ A++  H V +PLL +I AIRE + LA R
Subjt:  QNSILAALISSHNVNSPLLVEICAIREVVRLAER

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G34320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein1.8e-0526.5Show/hide
Query:  CPICLEEDETTDHALVGCNKAGQIWDISLPMVMGLGVDCNCSFID-RWALWRENITPKELLLAGIICWANWNDKNQRKNGKVVADVHTKSDWITNYAM--
        C  C +  ET +H L  C  A  +W IS       G   +  + +  W L  E   PK   +  ++ W  W     R N  +       +  +   AM  
Subjt:  CPICLEEDETTDHALVGCNKAGQIWDISLPMVMGLGVDCNCSFID-RWALWRENITPKELLLAGIICWANWNDKNQRKNGKVVADVHTKSDWITNYAM--

Query:  --ELLTRRRKTNEA-----QKNLPCPAIWKHPSKGTVKLNVDAPFDVRNQRGGVGVLVCDDQNSIL----AALISSHNVNSPLLVEICAIREVVRLAERY
          E  TRR    +A     ++NL     WK P    VK N DA + + N R G+G ++ ++   +L     AL  + NV   L  E+ A+R  V    R+
Subjt:  --ELLTRRRKTNEA-----QKNLPCPAIWKHPSKGTVKLNVDAPFDVRNQRGGVGVLVCDDQNSIL----AALISSHNVNSPLLVEICAIREVVRLAERY

AT3G25270.1 Ribonuclease H-like superfamily protein3.6e-0624.26Show/hide
Query:  PVCPICLEEDETTDHALVGCNKAGQIWDIS-LP----MVMGLGVDCNCSFIDRWALWRENITPKELLLAGIICWANWNDKN-----------QRKNGKVV
        P C  C +EDET+ H    C  A Q+W  S +P       G+ ++     +    L   N  P+   LA  I W  W  +N           Q    +  
Subjt:  PVCPICLEEDETTDHALVGCNKAGQIWDIS-LP----MVMGLGVDCNCSFIDRWALWRENITPKELLLAGIICWANWNDKN-----------QRKNGKVV

Query:  ADVHTKSDWITNYAMELLTRRRKTNEAQKNLPCPAIWKHPSKGTVKLNVDAPFDVRNQRGGVGVLVCDD
         DV    D  TN  ++ L ++  ++  Q+       W+ P    +K N D  F+ + +    G L+ D+
Subjt:  ADVHTKSDWITNYAMELLTRRRKTNEAQKNLPCPAIWKHPSKGTVKLNVDAPFDVRNQRGGVGVLVCDD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATGTCTTACCTGTTTGTCCTATTTGCCTTGAAGAGGATGAGACAACGGACCACGCCTTAGTGGGTTGCAATAAGGCGGGCCAGATTTGGGACATTTCCCTTCCTAT
GGTGATGGGCCTTGGTGTCGATTGCAACTGCAGTTTCATTGATAGATGGGCGCTTTGGAGAGAAAACATAACACCAAAGGAATTGCTTTTAGCGGGCATCATTTGTTGGG
CGAATTGGAATGACAAAAACCAAAGAAAAAATGGTAAAGTCGTTGCGGATGTTCACACAAAATCTGATTGGATCACAAATTATGCTATGGAGCTTTTAACTCGGAGAAGA
AAGACTAACGAGGCCCAGAAGAATTTACCTTGTCCGGCGATTTGGAAACATCCTTCGAAAGGAACAGTCAAGCTCAATGTGGACGCGCCTTTCGATGTCCGAAATCAAAG
GGGAGGGGTTGGGGTTTTGGTTTGTGATGATCAGAACTCCATCCTAGCAGCCTTAATTTCATCTCACAATGTGAATTCCCCTTTGTTGGTCGAAATTTGTGCTATTCGTG
AAGTAGTTCGCCTAGCTGAGAGATATATACATGTCGGAAGAGAGGCTAACAAACCGACTCATTTCCTTATGGGTGAGGCTCTGCATCAGAATAATCCAGTATTGTGGCTT
TCTGATTTCCCTCCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGAATGTCTTACCTGTTTGTCCTATTTGCCTTGAAGAGGATGAGACAACGGACCACGCCTTAGTGGGTTGCAATAAGGCGGGCCAGATTTGGGACATTTCCCTTCCTAT
GGTGATGGGCCTTGGTGTCGATTGCAACTGCAGTTTCATTGATAGATGGGCGCTTTGGAGAGAAAACATAACACCAAAGGAATTGCTTTTAGCGGGCATCATTTGTTGGG
CGAATTGGAATGACAAAAACCAAAGAAAAAATGGTAAAGTCGTTGCGGATGTTCACACAAAATCTGATTGGATCACAAATTATGCTATGGAGCTTTTAACTCGGAGAAGA
AAGACTAACGAGGCCCAGAAGAATTTACCTTGTCCGGCGATTTGGAAACATCCTTCGAAAGGAACAGTCAAGCTCAATGTGGACGCGCCTTTCGATGTCCGAAATCAAAG
GGGAGGGGTTGGGGTTTTGGTTTGTGATGATCAGAACTCCATCCTAGCAGCCTTAATTTCATCTCACAATGTGAATTCCCCTTTGTTGGTCGAAATTTGTGCTATTCGTG
AAGTAGTTCGCCTAGCTGAGAGATATATACATGTCGGAAGAGAGGCTAACAAACCGACTCATTTCCTTATGGGTGAGGCTCTGCATCAGAATAATCCAGTATTGTGGCTT
TCTGATTTCCCTCCTTGA
Protein sequenceShow/hide protein sequence
MNVLPVCPICLEEDETTDHALVGCNKAGQIWDISLPMVMGLGVDCNCSFIDRWALWRENITPKELLLAGIICWANWNDKNQRKNGKVVADVHTKSDWITNYAMELLTRRR
KTNEAQKNLPCPAIWKHPSKGTVKLNVDAPFDVRNQRGGVGVLVCDDQNSILAALISSHNVNSPLLVEICAIREVVRLAERYIHVGREANKPTHFLMGEALHQNNPVLWL
SDFPP