CuGenDBv2

Moc01g16090 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc01g16090
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Reverse transcriptase
Genome location: chr1:10381294..10382918
RNA-Seq Expression: Moc01g16090
Synteny: Moc01g16090
Gene Ontology terms: NA
InterPro domains: IPR021109 - Aspartic peptidase domain superfamily


Homology
GenBank top hits (hit; e-value; % identity; alignment)
WP_217833150.1 reverse transcriptase family protein [Synechococcus sp. PCC 7002]; e-value 6.2e-34; identity 46.63%
Query:  KNDPKESEKEAPLTLEAKKPTSST-LPTFPINLPFLQRFQKKSFDAQFKKFLDIFKKLNINIPFADAL-----------------------------EDC
        +++PK  EKE  +        SS  LP  P  +P+ QRF+KK  D+QF KFL++FKKL+INIPFA+AL                             E+C
Subjt:  KNDPKESEKEAPLTLEAKKPTSST-LPTFPINLPFLQRFQKKSFDAQFKKFLDIFKKLNINIPFADAL-----------------------------EDC

Query:  SARIQRKLPPKLKDPGSFSIPRNLGSYGF-KALRDFCASINLIPLSLCKKLNIGEIKPTTVMIQLVDRSTTDPYEIIENELIKVGKFILPIEF
        SA +Q+KLP KLKDPGSF+IP  +GS    +AL D  ASINL+PLS+ +KLNIGE++PTT+ +QL DRS   P  I+E+ L+KV KFI P +F
Subjt:  SARIQRKLPPKLKDPGSFSIPRNLGSYGF-KALRDFCASINLIPLSLCKKLNIGEIKPTTVMIQLVDRSTTDPYEIIENELIKVGKFILPIEF

XP_022157217.1 uncharacterized protein LOC111023979 [Momordica charantia]; e-value 5.6e-43; identity 56.92%
Query:  CTEEKNDPKESEKEAPLTLEAKKPTSSTLPTFPINLPFLQRFQKKSFDAQFKKFLDIFKKLNINIPFADAL-----------------------------
        C E+K++ KE+ KEAP TLEA+KP    L  F    P  Q FQKK+ DAQFKKFLDIFKKLNINI FA+AL                             
Subjt:  CTEEKNDPKESEKEAPLTLEAKKPTSSTLPTFPINLPFLQRFQKKSFDAQFKKFLDIFKKLNINIPFADAL-----------------------------

Query:  EDCSARIQRKLPPKLKDPGSFSIPRNLGSYGFKALRDFCASINLIPLSLCKKLNIGEIKPTTVMIQLVDRSTTDPYEIIENELIKVGKFILPIEF
        E+ + RIQRKLP KLKD   FSIP NLGSYGF+ L D   +IN  PLSLC+KLNIGEIK T++MIQLVDRST  PY +IEN LIKVGKFILP++F
Subjt:  EDCSARIQRKLPPKLKDPGSFSIPRNLGSYGFKALRDFCASINLIPLSLCKKLNIGEIKPTTVMIQLVDRSTTDPYEIIENELIKVGKFILPIEF

XP_024021985.1 uncharacterized protein LOC112091773 [Morus notabilis]; e-value 4.7e-34; identity 42.92%
Query:  FFSQRRSNARLLKAFQLRVSVVLPSLAPLDRLFIGSSIASCTEEKNDPKESEKEAPLTLEAKKPTSSTLPTFPIN----LPFLQRFQKKSFDAQFKKFLD
        F S    N R  K   LR    L S +  + +     I S  + K+  KE   EA       +P S T P  P      LPF QRFQKK+ D QF+KFL+
Subjt:  FFSQRRSNARLLKAFQLRVSVVLPSLAPLDRLFIGSSIASCTEEKNDPKESEKEAPLTLEAKKPTSSTLPTFPIN----LPFLQRFQKKSFDAQFKKFLD

Query:  IFKKLNINIPFADAL-----------------------------EDCSARIQRKLPPKLKDPGSFSIPRNLGSYGF-KALRDFCASINLIPLSLCKKLNI
        IFK+++INIPF DAL                             E+CSA I+R+LP KLKDPGSF+IP  +G     KAL D  ASINL+PLS+ +KL++
Subjt:  IFKKLNINIPFADAL-----------------------------EDCSARIQRKLPPKLKDPGSFSIPRNLGSYGF-KALRDFCASINLIPLSLCKKLNI

Query:  GEIKPTTVMIQLVDRSTTDPYEIIENELIKVGKFILPIEF
        GEI PTT+ +QL DRS T P  IIE+ L+K+ KFI P +F
Subjt:  GEIKPTTVMIQLVDRSTTDPYEIIENELIKVGKFILPIEF

XP_024463338.1 uncharacterized protein LOC112328498 [Populus trichocarpa]; e-value 9.5e-35; identity 52.22%
Query:  KNDPKESEKEAPLTLEAKKPTSSTL-PTFPI--NLPFLQRFQKKSFDAQFKKFLDIFKKLNINIPFADAL--------------EDCSARIQRKLPPKLK
        K+  +E E++    L+  K +   L PT  I   +PF QR +K   D QF KFLD+FKKL INIPFADAL              E+CSA +Q+KLPPKLK
Subjt:  KNDPKESEKEAPLTLEAKKPTSSTL-PTFPI--NLPFLQRFQKKSFDAQFKKFLDIFKKLNINIPFADAL--------------EDCSARIQRKLPPKLK

Query:  DPGSFSIPRNLGSYGF-KALRDFCASINLIPLSLCKKLNIGEIKPTTVMIQLVDRSTTDPYEIIENELIKVGKFILPIEF
        DPGSF+IP ++G+  F KAL D  ASINL+PLS+  KL +GE KPTTV +QL DRS   P  IIE+ L+KVGKFI P +F
Subjt:  DPGSFSIPRNLGSYGF-KALRDFCASINLIPLSLCKKLNIGEIKPTTVMIQLVDRSTTDPYEIIENELIKVGKFILPIEF

XP_034899370.1 LOW QUALITY PROTEIN: uncharacterized protein LOC118037487 [Populus alba]; e-value 2.8e-34; identity 51.43%
Query:  KESEK--EAPLTLEAKKPTSSTLPTFPINLPFLQRFQKKSFDAQFKKFLDIFKKLNINIPFADAL--------------EDCSARIQRKLPPKLKDPGSF
        KE E+  E    ++   P    +      +PF QR +K   D QF KFLD+FKKL INIPFADAL              E+CSA +Q+KLPPKLKDPGSF
Subjt:  KESEK--EAPLTLEAKKPTSSTLPTFPINLPFLQRFQKKSFDAQFKKFLDIFKKLNINIPFADAL--------------EDCSARIQRKLPPKLKDPGSF

Query:  SIPRNLGSYGF-KALRDFCASINLIPLSLCKKLNIGEIKPTTVMIQLVDRSTTDPYEIIENELIKVGKFILPIEF
        +IP ++G+  F KAL D  ASINL+PLS+ KKL +GE +PTTV +QL DRS   P  IIE+ L+KVGKFI P +F
Subjt:  SIPRNLGSYGF-KALRDFCASINLIPLSLCKKLNIGEIKPTTVMIQLVDRSTTDPYEIIENELIKVGKFILPIEF

TrEMBL top hits (hit; e-value; % identity; alignment)
A0A1S3EH80 uncharacterized protein LOC105851134; e-value 5.6e-33; identity 47.83%
Query:  EKEAPLTLEAKKPTSSTLPTFPINLPFLQRFQKKSFDAQFKKFLDIFKKLNINIPFADAL----------------------------EDCSARIQRKLP
        EKE  L ++  +P    LP   I LPF QR +++  + QF KFLDIFKKL INIPFA+AL                            E+CSA +QRKLP
Subjt:  EKEAPLTLEAKKPTSSTLPTFPINLPFLQRFQKKSFDAQFKKFLDIFKKLNINIPFADAL----------------------------EDCSARIQRKLP

Query:  PKLKDPGSFSIPRNLGSYGF-KALRDFCASINLIPLSLCKKLNIGEIKPTTVMIQLVDRSTTDPYEIIENELIKVGKFILPIEF
        PKLKDPGSFSIP  +G+  F KAL D  AS++L+PLS+ KKL IG++K T +M+Q  DRS   PY ++E+ L+KV KFI P++F
Subjt:  PKLKDPGSFSIPRNLGSYGF-KALRDFCASINLIPLSLCKKLNIGEIKPTTVMIQLVDRSTTDPYEIIENELIKVGKFILPIEF

A0A2G9GK35 Reverse transcriptase; e-value 1.1e-31; identity 45.74%
Query:  KESEKEAPLTLEAKKPTSSTLPTFPINLPFLQRFQKKSFDAQFKKFLDIFKKLNINIPFADAL-----------------------------EDCSARIQ
        +E EKE    LE  KPT+       +  PF QR QK+  + QF KFL++FKKL+INIPFA+AL                             E+CSA IQ
Subjt:  KESEKEAPLTLEAKKPTSSTLPTFPINLPFLQRFQKKSFDAQFKKFLDIFKKLNINIPFADAL-----------------------------EDCSARIQ

Query:  RKLPPKLKDPGSFSIPRNLGS-YGFKALRDFCASINLIPLSLCKKLNIGEIKPTTVMIQLVDRSTTDPYEIIENELIKVGKFILPIEF
         KLPPKLKDPGSF+IP  +G+ +  +AL D  ASINL+P S+ + L +GE KPT++ +QL DRS T P  +IE+ L+KV KFI P +F
Subjt:  RKLPPKLKDPGSFSIPRNLGS-YGFKALRDFCASINLIPLSLCKKLNIGEIKPTTVMIQLVDRSTTDPYEIIENELIKVGKFILPIEF

A0A2G9HYA0 Reverse transcriptase; e-value 1.1e-31; identity 45.74%
Query:  KESEKEAPLTLEAKKPTSSTLPTFPINLPFLQRFQKKSFDAQFKKFLDIFKKLNINIPFADAL-----------------------------EDCSARIQ
        +E EKE    LE  KPT+       +  PF QR QK+  + QF KFL++FKKL+INIPFA+AL                             E+CSA IQ
Subjt:  KESEKEAPLTLEAKKPTSSTLPTFPINLPFLQRFQKKSFDAQFKKFLDIFKKLNINIPFADAL-----------------------------EDCSARIQ

Query:  RKLPPKLKDPGSFSIPRNLGS-YGFKALRDFCASINLIPLSLCKKLNIGEIKPTTVMIQLVDRSTTDPYEIIENELIKVGKFILPIEF
         KLPPKLKDPGSF+IP  +G+ +  +AL D  ASINL+P S+ + L +GE KPT++ +QL DRS T P  +IE+ L+KV KFI P +F
Subjt:  RKLPPKLKDPGSFSIPRNLGS-YGFKALRDFCASINLIPLSLCKKLNIGEIKPTTVMIQLVDRSTTDPYEIIENELIKVGKFILPIEF

A0A2P5F9V5 Uncharacterized protein; e-value 5.6e-33; identity 48.17%
Query:  ESEKEAPLTLEAKKPTSSTLPTFPIN----LPFLQRFQKKSFDAQFKKFLDIFKKLNINIPFADAL-----------------------------EDCSA
        E  +E     E KKP S   P  P      +P+ QRFQK+  D  F KFLD+FKKL+INIPFADAL                             E+CSA
Subjt:  ESEKEAPLTLEAKKPTSSTLPTFPIN----LPFLQRFQKKSFDAQFKKFLDIFKKLNINIPFADAL-----------------------------EDCSA

Query:  RIQRKLPPKLKDPGSFSIPRNLGSYGF-KALRDFCASINLIPLSLCKKLNIGEIKPTTVMIQLVDRSTTDPYEIIENELIKVGKFILPIEF
         +Q+KLPPKLKDPGSF+IP  +G+  F KAL D  ASINL+PLS+ KKL +GE KPTTV +QL DRS   P   IE+ L+KV +FI P +F
Subjt:  RIQRKLPPKLKDPGSFSIPRNLGSYGF-KALRDFCASINLIPLSLCKKLNIGEIKPTTVMIQLVDRSTTDPYEIIENELIKVGKFILPIEF

A0A6J1DTZ8 uncharacterized protein LOC111023979; e-value 2.7e-43; identity 56.92%
Query:  CTEEKNDPKESEKEAPLTLEAKKPTSSTLPTFPINLPFLQRFQKKSFDAQFKKFLDIFKKLNINIPFADAL-----------------------------
        C E+K++ KE+ KEAP TLEA+KP    L  F    P  Q FQKK+ DAQFKKFLDIFKKLNINI FA+AL                             
Subjt:  CTEEKNDPKESEKEAPLTLEAKKPTSSTLPTFPINLPFLQRFQKKSFDAQFKKFLDIFKKLNINIPFADAL-----------------------------

Query:  EDCSARIQRKLPPKLKDPGSFSIPRNLGSYGFKALRDFCASINLIPLSLCKKLNIGEIKPTTVMIQLVDRSTTDPYEIIENELIKVGKFILPIEF
        E+ + RIQRKLP KLKD   FSIP NLGSYGF+ L D   +IN  PLSLC+KLNIGEIK T++MIQLVDRST  PY +IEN LIKVGKFILP++F
Subjt:  EDCSARIQRKLPPKLKDPGSFSIPRNLGSYGFKALRDFCASINLIPLSLCKKLNIGEIKPTTVMIQLVDRSTTDPYEIIENELIKVGKFILPIEF

SwissProt top hits (hit; e-value; % identity; alignment)
No hits found
Arabidopsis top hits (hit; e-value; % identity; alignment)
No hits found

Sequences
CDS sequence
ATGGCTACTCGCAGAAGCGCTTCCTTTGGCGACGGTCGCTCGGTGGTGGTACACATTTCATCGACTACGACGGAGGCAGACGTGGAGAAGATCAGGATGACTTACCGTGT
TCTCGACTACGTTCTTATACATTTGCCCTTGGAGGACAAGAGGATCGACTCCCCCTCCTGGGATGGCGAAGGCCAACAATTGACCTTGGGCTTGCGGAGCGCCCTTATGA
TGTTGAAGACAATTGCAGGCTTCCTGGGGTTCTATAGCTTTAACACTCATCATTGGGAGATATTGAAGCCTCCCATCTCCAACAAACATTTGAGTAACCGATGGTTTTTC
GTTGGTGGGACGTGGTTGGCTACTGGCAAGCCAATTTGTGGTCGGGTTCCCAGCCACTTCGGGGAGAACAGTTCGGGCTGCGATCCTTATCCTGACCAGGCAGGTCCCTT
GACTAGTGAGCATCCCGACCCCTTTTTCTCCCAGCGACGGAGTAATGCTCGACTTCTCAAAGCTTTTCAACTTCGAGTGTCAGTGGTGCTTCCTTCCCTGGCTCCTTTGG
ATCGTCTTTTTATTGGTTCCTCAATTGCTTCCTGCACTGAGGAAAAAAACGATCCAAAGGAGTCAGAGAAGGAAGCACCACTGACACTCGAGGCTAAAAAGCCTACTAGT
TCTACTCTTCCTACTTTTCCTATTAATTTACCTTTTCTTCAGCGTTTTCAAAAGAAAAGCTTTGATGCTCAATTTAAGAAATTCTTGGACATATTTAAAAAGCTTAATAT
TAATATTCCTTTTGCAGATGCACTTGAGGATTGCAGTGCAAGAATTCAACGAAAACTACCACCAAAACTCAAGGATCCAGGGAGTTTTTCTATTCCACGTAATCTTGGTA
GTTATGGTTTTAAAGCTTTACGTGATTTTTGTGCCAGCATTAATTTAATTCCTTTATCTTTGTGCAAAAAATTAAATATTGGAGAAATTAAGCCGACCACTGTAATGATC
CAATTAGTTGATAGATCGACTACAGATCCTTACGAAATCATTGAAAACGAGTTGATAAAAGTTGGCAAATTCATCCTTCCGATAGAGTTTTAA
mRNA sequence
ATGGCTACTCGCAGAAGCGCTTCCTTTGGCGACGGTCGCTCGGTGGTGGTACACATTTCATCGACTACGACGGAGGCAGACGTGGAGAAGATCAGGATGACTTACCGTGT
TCTCGACTACGTTCTTATACATTTGCCCTTGGAGGACAAGAGGATCGACTCCCCCTCCTGGGATGGCGAAGGCCAACAATTGACCTTGGGCTTGCGGAGCGCCCTTATGA
TGTTGAAGACAATTGCAGGCTTCCTGGGGTTCTATAGCTTTAACACTCATCATTGGGAGATATTGAAGCCTCCCATCTCCAACAAACATTTGAGTAACCGATGGTTTTTC
GTTGGTGGGACGTGGTTGGCTACTGGCAAGCCAATTTGTGGTCGGGTTCCCAGCCACTTCGGGGAGAACAGTTCGGGCTGCGATCCTTATCCTGACCAGGCAGGTCCCTT
GACTAGTGAGCATCCCGACCCCTTTTTCTCCCAGCGACGGAGTAATGCTCGACTTCTCAAAGCTTTTCAACTTCGAGTGTCAGTGGTGCTTCCTTCCCTGGCTCCTTTGG
ATCGTCTTTTTATTGGTTCCTCAATTGCTTCCTGCACTGAGGAAAAAAACGATCCAAAGGAGTCAGAGAAGGAAGCACCACTGACACTCGAGGCTAAAAAGCCTACTAGT
TCTACTCTTCCTACTTTTCCTATTAATTTACCTTTTCTTCAGCGTTTTCAAAAGAAAAGCTTTGATGCTCAATTTAAGAAATTCTTGGACATATTTAAAAAGCTTAATAT
TAATATTCCTTTTGCAGATGCACTTGAGGATTGCAGTGCAAGAATTCAACGAAAACTACCACCAAAACTCAAGGATCCAGGGAGTTTTTCTATTCCACGTAATCTTGGTA
GTTATGGTTTTAAAGCTTTACGTGATTTTTGTGCCAGCATTAATTTAATTCCTTTATCTTTGTGCAAAAAATTAAATATTGGAGAAATTAAGCCGACCACTGTAATGATC
CAATTAGTTGATAGATCGACTACAGATCCTTACGAAATCATTGAAAACGAGTTGATAAAAGTTGGCAAATTCATCCTTCCGATAGAGTTTTAA
Protein sequence
MATRRSASFGDGRSVVVHISSTTTEADVEKIRMTYRVLDYVLIHLPLEDKRIDSPSWDGEGQQLTLGLRSALMMLKTIAGFLGFYSFNTHHWEILKPPISNKHLSNRWFF
VGGTWLATGKPICGRVPSHFGENSSGCDPYPDQAGPLTSEHPDPFFSQRRSNARLLKAFQLRVSVVLPSLAPLDRLFIGSSIASCTEEKNDPKESEKEAPLTLEAKKPTS
STLPTFPINLPFLQRFQKKSFDAQFKKFLDIFKKLNINIPFADALEDCSARIQRKLPPKLKDPGSFSIPRNLGSYGFKALRDFCASINLIPLSLCKKLNIGEIKPTTVMI
QLVDRSTTDPYEIIENELIKVGKFILPIEF