; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc06g20640 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc06g20640
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr6:16075699..16076439
RNA-Seq ExpressionMoc06g20640
SyntenyMoc06g20640
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0059143.1 uncharacterized protein E6C27_scaffold430G00550 [Cucumis melo var. makuwa]6.8e-8564.34Show/hide
Query:  MSATKQLGKSHVDRLVEIEEQLLFLREIPDNLRYVESRLDEISTKADGIDVVNARIDGLAIRELMLRVETLED--KVKRTDNFECGESSSSSITHMEERV
        MS++   GK+  DRLVEIEEQ+L+L E+PD++RY+ESR+DEIS KA+ ID V  R++GL I+EL+ RV+ LE+    +RT N+E GESSS    HMEERV
Subjt:  MSATKQLGKSHVDRLVEIEEQLLFLREIPDNLRYVESRLDEISTKADGIDVVNARIDGLAIRELMLRVETLED--KVKRTDNFECGESSSSSITHMEERV

Query:  EEIDISQKTIVQMVSELTDDFKGTIDEMRAEIAELGTKVNLTMRAVGNQAPAGGPIQFNKVKVPEPKPFCGARDAKALENFIFDLEQYFKATSTVTEESK
         E+D +QKT+++M++ +++DFK T+D +R EIA++  +++LTMRA+ NQAPAGG I  +KVKVPEPKPFCGARDAKALEN+IFDLEQYFKAT+TV EE+K
Subjt:  EEIDISQKTIVQMVSELTDDFKGTIDEMRAEIAELGTKVNLTMRAVGNQAPAGGPIQFNKVKVPEPKPFCGARDAKALENFIFDLEQYFKATSTVTEESK

Query:  VTLATMHLADDAKLWWRARYVDVQEGKCTIDTWEKLKQELRSQF
        VTLATMHL++DAKLWWR+RYVD+QEG+CT+DTW+ LK+ELRSQF
Subjt:  VTLATMHLADDAKLWWRARYVDVQEGKCTIDTWEKLKQELRSQF

TYK03044.1 uncharacterized protein E5676_scaffold46G001390 [Cucumis melo var. makuwa]5.2e-8563.93Show/hide
Query:  MSATKQLGKSHVDRLVEIEEQLLFLREIPDNLRYVESRLDEISTKADGIDVVNARIDGLAIRELMLRVETLED--KVKRTDNFECGESSSSSITHMEERV
        MS++   GK+  DRLVEIEEQ+L+L E+PD++RY+ESR+DEIS KA+ ID V  R++GL I+EL+ RV+ LE+    +RT N+E GESSS    HMEERV
Subjt:  MSATKQLGKSHVDRLVEIEEQLLFLREIPDNLRYVESRLDEISTKADGIDVVNARIDGLAIRELMLRVETLED--KVKRTDNFECGESSSSSITHMEERV

Query:  EEIDISQKTIVQMVSELTDDFKGTIDEMRAEIAELGTKVNLTMRAVGNQAPAGGPIQFNKVKVPEPKPFCGARDAKALENFIFDLEQYFKATSTVTEESK
         E+D +QKT+++M++ +++DFK T+D +R EIA++  +++LTMRA+ NQAPAGG I  +KVKVPEPKPFCGARDAKALEN+IFDLEQYFKAT+T+ EE+K
Subjt:  EEIDISQKTIVQMVSELTDDFKGTIDEMRAEIAELGTKVNLTMRAVGNQAPAGGPIQFNKVKVPEPKPFCGARDAKALENFIFDLEQYFKATSTVTEESK

Query:  VTLATMHLADDAKLWWRARYVDVQEGKCTIDTWEKLKQELRSQF
        VTLATMHL++DAKLWWR+RYVD+QEG+CT+DTW+ LK+ELRSQF
Subjt:  VTLATMHLADDAKLWWRARYVDVQEGKCTIDTWEKLKQELRSQF

TYK03099.1 reverse transcriptase [Cucumis melo var. makuwa]1.2e-8462.7Show/hide
Query:  MSATKQLGKSHVDRLVEIEEQLLFLREIPDNLRYVESRLDEISTKADGIDVVNARIDGLAIRELMLRVETLEDKVK--RTDNFECGESSSSSITHMEERV
        MS++  LGK+  DRLVE+EEQ+L+L E+PD++RY+ESRL+EIS K + ID V  R++G  I+ELM RV+ LE  V   RT N+E G+SS+ S+ H+EERV
Subjt:  MSATKQLGKSHVDRLVEIEEQLLFLREIPDNLRYVESRLDEISTKADGIDVVNARIDGLAIRELMLRVETLEDKVK--RTDNFECGESSSSSITHMEERV

Query:  EEIDISQKTIVQMVSELTDDFKGTIDEMRAEIAELGTKVNLTMRAVGNQAPAGGPIQFNKVKVPEPKPFCGARDAKALENFIFDLEQYFKATSTVTEESK
        +E+D SQKT+++M++ +++DF+ T+D +R EIA++  +++LTMRA+ NQAPAGG I  ++VK+PEPKPFCGARDAKALEN+IFDLEQYF+AT+TVTEE+K
Subjt:  EEIDISQKTIVQMVSELTDDFKGTIDEMRAEIAELGTKVNLTMRAVGNQAPAGGPIQFNKVKVPEPKPFCGARDAKALENFIFDLEQYFKATSTVTEESK

Query:  VTLATMHLADDAKLWWRARYVDVQEGKCTIDTWEKLKQELRSQF
        VTLATMHL++DAKLWWR+R+VD+QEG+CTIDTW+ LK+ELRSQF
Subjt:  VTLATMHLADDAKLWWRARYVDVQEGKCTIDTWEKLKQELRSQF

TYK31632.1 uncharacterized protein E5676_scaffold340G00230 [Cucumis melo var. makuwa]1.4e-8563.93Show/hide
Query:  MSATKQLGKSHVDRLVEIEEQLLFLREIPDNLRYVESRLDEISTKADGIDVVNARIDGLAIRELMLRVETLED--KVKRTDNFECGESSSSSITHMEERV
        MS++   GK+  DRLVEIEEQ+L+L E+PD++RY+ESR+DEIS KA+ ID V  R++GL I+EL+ RV+ LE+    +RT N+ECGESSS    HMEERV
Subjt:  MSATKQLGKSHVDRLVEIEEQLLFLREIPDNLRYVESRLDEISTKADGIDVVNARIDGLAIRELMLRVETLED--KVKRTDNFECGESSSSSITHMEERV

Query:  EEIDISQKTIVQMVSELTDDFKGTIDEMRAEIAELGTKVNLTMRAVGNQAPAGGPIQFNKVKVPEPKPFCGARDAKALENFIFDLEQYFKATSTVTEESK
         E+D +QKT+++M++ +++DF+ T+D +R EIA++  +++LTMRA+ NQAPAGG I  +KVKVPEPKPFCGARDAKALEN+IFDLEQYFKAT+T+ EE+K
Subjt:  EEIDISQKTIVQMVSELTDDFKGTIDEMRAEIAELGTKVNLTMRAVGNQAPAGGPIQFNKVKVPEPKPFCGARDAKALENFIFDLEQYFKATSTVTEESK

Query:  VTLATMHLADDAKLWWRARYVDVQEGKCTIDTWEKLKQELRSQF
        VTLATMHL++DAKLWWR+RYVD+QEG+CT+DTW+ LK+ELRSQF
Subjt:  VTLATMHLADDAKLWWRARYVDVQEGKCTIDTWEKLKQELRSQF

XP_022155185.1 uncharacterized protein LOC111022320 [Momordica charantia]1.1e-10184.23Show/hide
Query:  MSATKQLGKSHVDRLVEIEEQLLFLREIPDNLRYVESRLDEISTKADGIDVVNARIDGLAIRELMLRVETLEDKVKRTDNFECGESSSSSITHMEERVEE
        MS TKQLGKSH+DRLVEIEE+LLFLREIPDNLRYVESRLDEISTKADGIDVVNARIDGLAIRELMLRVETLEDKVKRTDN E GESSSSSI HMEERVEE
Subjt:  MSATKQLGKSHVDRLVEIEEQLLFLREIPDNLRYVESRLDEISTKADGIDVVNARIDGLAIRELMLRVETLEDKVKRTDNFECGESSSSSITHMEERVEE

Query:  IDISQKTIVQMVSELTDDFKGTIDEMRAEIAELGTKVNLTMRAVGNQAPAGGPIQFNKVKVPEPKPFCGARDAKALENFIFDLEQYFKATSTVTEESKVT
        IDIS+KTIVQMVSELTDDFK T+DEMRAEIAELGT                             KPFCGARDAKALENFIFDLEQYFKATSTVTEESKVT
Subjt:  IDISQKTIVQMVSELTDDFKGTIDEMRAEIAELGTKVNLTMRAVGNQAPAGGPIQFNKVKVPEPKPFCGARDAKALENFIFDLEQYFKATSTVTEESKVT

Query:  LATMHLADDAKLWWRARYVDVQEGKCTIDTWEKLKQELRSQ
        LATMHLADDAKLWWRARYVDVQEGKCTIDTWEKLKQELRSQ
Subjt:  LATMHLADDAKLWWRARYVDVQEGKCTIDTWEKLKQELRSQ

TrEMBL top hitse value%identityAlignment
A0A5A7UT87 Retrotrans_gag domain-containing protein3.3e-8564.34Show/hide
Query:  MSATKQLGKSHVDRLVEIEEQLLFLREIPDNLRYVESRLDEISTKADGIDVVNARIDGLAIRELMLRVETLED--KVKRTDNFECGESSSSSITHMEERV
        MS++   GK+  DRLVEIEEQ+L+L E+PD++RY+ESR+DEIS KA+ ID V  R++GL I+EL+ RV+ LE+    +RT N+E GESSS    HMEERV
Subjt:  MSATKQLGKSHVDRLVEIEEQLLFLREIPDNLRYVESRLDEISTKADGIDVVNARIDGLAIRELMLRVETLED--KVKRTDNFECGESSSSSITHMEERV

Query:  EEIDISQKTIVQMVSELTDDFKGTIDEMRAEIAELGTKVNLTMRAVGNQAPAGGPIQFNKVKVPEPKPFCGARDAKALENFIFDLEQYFKATSTVTEESK
         E+D +QKT+++M++ +++DFK T+D +R EIA++  +++LTMRA+ NQAPAGG I  +KVKVPEPKPFCGARDAKALEN+IFDLEQYFKAT+TV EE+K
Subjt:  EEIDISQKTIVQMVSELTDDFKGTIDEMRAEIAELGTKVNLTMRAVGNQAPAGGPIQFNKVKVPEPKPFCGARDAKALENFIFDLEQYFKATSTVTEESK

Query:  VTLATMHLADDAKLWWRARYVDVQEGKCTIDTWEKLKQELRSQF
        VTLATMHL++DAKLWWR+RYVD+QEG+CT+DTW+ LK+ELRSQF
Subjt:  VTLATMHLADDAKLWWRARYVDVQEGKCTIDTWEKLKQELRSQF

A0A5D3BV48 Retrotrans_gag domain-containing protein2.5e-8563.93Show/hide
Query:  MSATKQLGKSHVDRLVEIEEQLLFLREIPDNLRYVESRLDEISTKADGIDVVNARIDGLAIRELMLRVETLED--KVKRTDNFECGESSSSSITHMEERV
        MS++   GK+  DRLVEIEEQ+L+L E+PD++RY+ESR+DEIS KA+ ID V  R++GL I+EL+ RV+ LE+    +RT N+E GESSS    HMEERV
Subjt:  MSATKQLGKSHVDRLVEIEEQLLFLREIPDNLRYVESRLDEISTKADGIDVVNARIDGLAIRELMLRVETLED--KVKRTDNFECGESSSSSITHMEERV

Query:  EEIDISQKTIVQMVSELTDDFKGTIDEMRAEIAELGTKVNLTMRAVGNQAPAGGPIQFNKVKVPEPKPFCGARDAKALENFIFDLEQYFKATSTVTEESK
         E+D +QKT+++M++ +++DFK T+D +R EIA++  +++LTMRA+ NQAPAGG I  +KVKVPEPKPFCGARDAKALEN+IFDLEQYFKAT+T+ EE+K
Subjt:  EEIDISQKTIVQMVSELTDDFKGTIDEMRAEIAELGTKVNLTMRAVGNQAPAGGPIQFNKVKVPEPKPFCGARDAKALENFIFDLEQYFKATSTVTEESK

Query:  VTLATMHLADDAKLWWRARYVDVQEGKCTIDTWEKLKQELRSQF
        VTLATMHL++DAKLWWR+RYVD+QEG+CT+DTW+ LK+ELRSQF
Subjt:  VTLATMHLADDAKLWWRARYVDVQEGKCTIDTWEKLKQELRSQF

A0A5D3BYE6 Reverse transcriptase5.7e-8562.7Show/hide
Query:  MSATKQLGKSHVDRLVEIEEQLLFLREIPDNLRYVESRLDEISTKADGIDVVNARIDGLAIRELMLRVETLEDKVK--RTDNFECGESSSSSITHMEERV
        MS++  LGK+  DRLVE+EEQ+L+L E+PD++RY+ESRL+EIS K + ID V  R++G  I+ELM RV+ LE  V   RT N+E G+SS+ S+ H+EERV
Subjt:  MSATKQLGKSHVDRLVEIEEQLLFLREIPDNLRYVESRLDEISTKADGIDVVNARIDGLAIRELMLRVETLEDKVK--RTDNFECGESSSSSITHMEERV

Query:  EEIDISQKTIVQMVSELTDDFKGTIDEMRAEIAELGTKVNLTMRAVGNQAPAGGPIQFNKVKVPEPKPFCGARDAKALENFIFDLEQYFKATSTVTEESK
        +E+D SQKT+++M++ +++DF+ T+D +R EIA++  +++LTMRA+ NQAPAGG I  ++VK+PEPKPFCGARDAKALEN+IFDLEQYF+AT+TVTEE+K
Subjt:  EEIDISQKTIVQMVSELTDDFKGTIDEMRAEIAELGTKVNLTMRAVGNQAPAGGPIQFNKVKVPEPKPFCGARDAKALENFIFDLEQYFKATSTVTEESK

Query:  VTLATMHLADDAKLWWRARYVDVQEGKCTIDTWEKLKQELRSQF
        VTLATMHL++DAKLWWR+R+VD+QEG+CTIDTW+ LK+ELRSQF
Subjt:  VTLATMHLADDAKLWWRARYVDVQEGKCTIDTWEKLKQELRSQF

A0A5D3E6S8 Retrotrans_gag domain-containing protein6.7e-8663.93Show/hide
Query:  MSATKQLGKSHVDRLVEIEEQLLFLREIPDNLRYVESRLDEISTKADGIDVVNARIDGLAIRELMLRVETLED--KVKRTDNFECGESSSSSITHMEERV
        MS++   GK+  DRLVEIEEQ+L+L E+PD++RY+ESR+DEIS KA+ ID V  R++GL I+EL+ RV+ LE+    +RT N+ECGESSS    HMEERV
Subjt:  MSATKQLGKSHVDRLVEIEEQLLFLREIPDNLRYVESRLDEISTKADGIDVVNARIDGLAIRELMLRVETLED--KVKRTDNFECGESSSSSITHMEERV

Query:  EEIDISQKTIVQMVSELTDDFKGTIDEMRAEIAELGTKVNLTMRAVGNQAPAGGPIQFNKVKVPEPKPFCGARDAKALENFIFDLEQYFKATSTVTEESK
         E+D +QKT+++M++ +++DF+ T+D +R EIA++  +++LTMRA+ NQAPAGG I  +KVKVPEPKPFCGARDAKALEN+IFDLEQYFKAT+T+ EE+K
Subjt:  EEIDISQKTIVQMVSELTDDFKGTIDEMRAEIAELGTKVNLTMRAVGNQAPAGGPIQFNKVKVPEPKPFCGARDAKALENFIFDLEQYFKATSTVTEESK

Query:  VTLATMHLADDAKLWWRARYVDVQEGKCTIDTWEKLKQELRSQF
        VTLATMHL++DAKLWWR+RYVD+QEG+CT+DTW+ LK+ELRSQF
Subjt:  VTLATMHLADDAKLWWRARYVDVQEGKCTIDTWEKLKQELRSQF

A0A6J1DLQ6 uncharacterized protein LOC1110223205.1e-10284.23Show/hide
Query:  MSATKQLGKSHVDRLVEIEEQLLFLREIPDNLRYVESRLDEISTKADGIDVVNARIDGLAIRELMLRVETLEDKVKRTDNFECGESSSSSITHMEERVEE
        MS TKQLGKSH+DRLVEIEE+LLFLREIPDNLRYVESRLDEISTKADGIDVVNARIDGLAIRELMLRVETLEDKVKRTDN E GESSSSSI HMEERVEE
Subjt:  MSATKQLGKSHVDRLVEIEEQLLFLREIPDNLRYVESRLDEISTKADGIDVVNARIDGLAIRELMLRVETLEDKVKRTDNFECGESSSSSITHMEERVEE

Query:  IDISQKTIVQMVSELTDDFKGTIDEMRAEIAELGTKVNLTMRAVGNQAPAGGPIQFNKVKVPEPKPFCGARDAKALENFIFDLEQYFKATSTVTEESKVT
        IDIS+KTIVQMVSELTDDFK T+DEMRAEIAELGT                             KPFCGARDAKALENFIFDLEQYFKATSTVTEESKVT
Subjt:  IDISQKTIVQMVSELTDDFKGTIDEMRAEIAELGTKVNLTMRAVGNQAPAGGPIQFNKVKVPEPKPFCGARDAKALENFIFDLEQYFKATSTVTEESKVT

Query:  LATMHLADDAKLWWRARYVDVQEGKCTIDTWEKLKQELRSQ
        LATMHLADDAKLWWRARYVDVQEGKCTIDTWEKLKQELRSQ
Subjt:  LATMHLADDAKLWWRARYVDVQEGKCTIDTWEKLKQELRSQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCCGCGACAAAGCAGTTGGGTAAGTCCCACGTCGACAGACTCGTCGAGATCGAAGAACAGCTGTTGTTCTTGAGGGAAATCCCTGACAACCTTAGATATGTGGAATC
TCGGCTGGATGAGATCTCCACCAAAGCTGACGGAATTGACGTCGTGAATGCCCGCATAGACGGGCTTGCTATACGTGAGTTAATGCTTCGGGTTGAGACCCTTGAAGACA
AGGTTAAGCGTACTGATAACTTTGAGTGTGGCGAAAGCTCATCGAGCTCAATCACCCACATGGAGGAGCGTGTCGAAGAAATAGACATCTCCCAAAAGACTATTGTGCAG
ATGGTCAGTGAGCTGACCGACGACTTCAAAGGCACCATCGACGAAATGAGGGCGGAGATTGCCGAATTAGGCACCAAAGTAAATCTCACCATGAGAGCGGTGGGAAACCA
GGCCCCAGCTGGGGGACCGATTCAGTTCAACAAGGTGAAAGTTCCCGAACCCAAGCCCTTCTGTGGGGCGCGAGATGCTAAAGCTCTTGAGAACTTCATCTTCGACCTTG
AACAGTACTTCAAGGCGACAAGCACGGTGACAGAGGAATCAAAAGTCACACTAGCCACAATGCATCTTGCTGACGATGCGAAGTTGTGGTGGAGAGCGAGATACGTAGAT
GTACAAGAGGGAAAATGTACCATCGACACGTGGGAAAAACTAAAGCAGGAACTCAGATCCCAATTTTCCCGGAGAATGTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCCGCGACAAAGCAGTTGGGTAAGTCCCACGTCGACAGACTCGTCGAGATCGAAGAACAGCTGTTGTTCTTGAGGGAAATCCCTGACAACCTTAGATATGTGGAATC
TCGGCTGGATGAGATCTCCACCAAAGCTGACGGAATTGACGTCGTGAATGCCCGCATAGACGGGCTTGCTATACGTGAGTTAATGCTTCGGGTTGAGACCCTTGAAGACA
AGGTTAAGCGTACTGATAACTTTGAGTGTGGCGAAAGCTCATCGAGCTCAATCACCCACATGGAGGAGCGTGTCGAAGAAATAGACATCTCCCAAAAGACTATTGTGCAG
ATGGTCAGTGAGCTGACCGACGACTTCAAAGGCACCATCGACGAAATGAGGGCGGAGATTGCCGAATTAGGCACCAAAGTAAATCTCACCATGAGAGCGGTGGGAAACCA
GGCCCCAGCTGGGGGACCGATTCAGTTCAACAAGGTGAAAGTTCCCGAACCCAAGCCCTTCTGTGGGGCGCGAGATGCTAAAGCTCTTGAGAACTTCATCTTCGACCTTG
AACAGTACTTCAAGGCGACAAGCACGGTGACAGAGGAATCAAAAGTCACACTAGCCACAATGCATCTTGCTGACGATGCGAAGTTGTGGTGGAGAGCGAGATACGTAGAT
GTACAAGAGGGAAAATGTACCATCGACACGTGGGAAAAACTAAAGCAGGAACTCAGATCCCAATTTTCCCGGAGAATGTAG
Protein sequenceShow/hide protein sequence
MSATKQLGKSHVDRLVEIEEQLLFLREIPDNLRYVESRLDEISTKADGIDVVNARIDGLAIRELMLRVETLEDKVKRTDNFECGESSSSSITHMEERVEEIDISQKTIVQ
MVSELTDDFKGTIDEMRAEIAELGTKVNLTMRAVGNQAPAGGPIQFNKVKVPEPKPFCGARDAKALENFIFDLEQYFKATSTVTEESKVTLATMHLADDAKLWWRARYVD
VQEGKCTIDTWEKLKQELRSQFSRRM