; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc03g04510 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc03g04510
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr3:3354273..3359559
RNA-Seq ExpressionMoc03g04510
SyntenyMoc03g04510
Gene Ontology termsGO:0005488 - binding (molecular function)
GO:0016740 - transferase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7561662.1 Zinc finger CCHC-type superfamily [Arabidopsis thaliana x Arabidopsis arenosa]2.8e-1840.76Show/hide
Query:  VAKETTAKELLKILQDRYEKPSANKKILLWTKYFNIHMDDGTSVNFHINEIIDILNKLEGMSVKIEEKVKAMRLLTSLPDSSETMKTAVSNSLGENSLKI
        VAKE T + L+K+L D YEKPSAN K+ L  K F++ M++G  V  H+NE   I+N+L  + ++ +++V+A+ L+ SLP+S E M+ AVSNS+G   LK 
Subjt:  VAKETTAKELLKILQDRYEKPSANKKILLWTKYFNIHMDDGTSVNFHINEIIDILNKLEGMSVKIEEKVKAMRLLTSLPDSSETMKTAVSNSLGENSLKI

Query:  SAICDTVLSEETQRKLGKMSASTSGAEN----GVESALVAQNKEKAKMNYNGKQQQR
          + D +L EE  R++     STS A N    G +     QN+ ++K + NGK Q +
Subjt:  SAICDTVLSEETQRKLGKMSASTSGAEN----GVESALVAQNKEKAKMNYNGKQQQR

KAG7584790.1 Zinc finger CCHC-type superfamily [Arabidopsis thaliana x Arabidopsis arenosa]3.6e-1841.67Show/hide
Query:  VAKETTAKELLKILQDRYEKPSANKKILLWTKYFNIHMDDGTSVNFHINEIIDILNKLEGMSVKIEEKVKAMRLLTSLPDSSETMKTAVSNSLGENSLKI
        VAKE T + L+K+L D YEKPSAN K+ L  K F++ M++G  V  H+NE   I+N+L  + ++ +++V+A+ LL SLP+S E M+ AVSNS+G   LK 
Subjt:  VAKETTAKELLKILQDRYEKPSANKKILLWTKYFNIHMDDGTSVNFHINEIIDILNKLEGMSVKIEEKVKAMRLLTSLPDSSETMKTAVSNSLGENSLKI

Query:  SAICDTVLSEETQR-KLGKMSASTS-GAEN-GVESALVAQNKEKAKMNYNGKQQQR
          + D +L EE +R   G+ S S++   EN G +     QN+ ++K + NGK Q +
Subjt:  SAICDTVLSEETQR-KLGKMSASTS-GAEN-GVESALVAQNKEKAKMNYNGKQQQR

KAG7593230.1 Pentatricopeptide repeat [Arabidopsis thaliana x Arabidopsis arenosa]1.6e-1841.4Show/hide
Query:  VAKETTAKELLKILQDRYEKPSANKKILLWTKYFNIHMDDGTSVNFHINEIIDILNKLEGMSVKIEEKVKAMRLLTSLPDSSETMKTAVSNSLGENSLKI
        VAKE T + L+K+L D YEKPSAN K+ L  K F++ M++G  V  H+NE   I+N+L  + ++ +++V+A+ LL SLP+S E M+ AVSNS+G   LK 
Subjt:  VAKETTAKELLKILQDRYEKPSANKKILLWTKYFNIHMDDGTSVNFHINEIIDILNKLEGMSVKIEEKVKAMRLLTSLPDSSETMKTAVSNSLGENSLKI

Query:  SAICDTVLSEETQRKLGKMSASTSGAEN----GVESALVAQNKEKAKMNYNGKQQQR
          + D +L EE  R++     STS A N    G +     QN+ ++K + NGK Q +
Subjt:  SAICDTVLSEETQRKLGKMSASTSGAEN----GVESALVAQNKEKAKMNYNGKQQQR

KHN02029.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Glycine soja]8.9e-1750Show/hide
Query:  VAKETTAKELLKILQDRYEKPSANKKILLWTKYFNIHMDDGTSVNFHINEIIDILNKLEGMSVKIEEKVKAMRLLTSLPDSSETMKTAVSNSLGENSLKI
        +  E T   L+K L D YEKPSA  K+ L  ++FN+ M +G SV  HINE   IL +LE + +K E++VKA+ LL+SLPDS     TAVS+S  EN+LK+
Subjt:  VAKETTAKELLKILQDRYEKPSANKKILLWTKYFNIHMDDGTSVNFHINEIIDILNKLEGMSVKIEEKVKAMRLLTSLPDSSETMKTAVSNSLGENSLKI

Query:  SAICDTVLSE
        S I D +LSE
Subjt:  SAICDTVLSE

VDD56318.1 unnamed protein product [Brassica oleracea]9.5e-1940.91Show/hide
Query:  VAKETTAKELLKILQDRYEKPSANKKILLWTKYFNIHMDDGTSVNFHINEIIDILNKLEGMSVKIEEKVKAMRLLTSLPDSSETMKTAVSNSLGENSLKI
        VAKE T + L+K+L D YEKPSAN K+ L  K F++ M++G  V  H+NE   I+N+L  + ++ E++V+A+ LL SLP+S E M+ AVSN +G   LK 
Subjt:  VAKETTAKELLKILQDRYEKPSANKKILLWTKYFNIHMDDGTSVNFHINEIIDILNKLEGMSVKIEEKVKAMRLLTSLPDSSETMKTAVSNSLGENSLKI

Query:  SAICDTVLSEETQRKLGKMSASTSGAENGVESALVAQNKEKAK-MNYNGKQQQR
        + + D +L+EE  R++    ASTS A N         N+   K  + NG+ Q +
Subjt:  SAICDTVLSEETQRKLGKMSASTSGAENGVESALVAQNKEKAK-MNYNGKQQQR

TrEMBL top hitse value%identityAlignment
A0A0D3AEM1 CCHC-type domain-containing protein1.6e-1946.09Show/hide
Query:  VAKETTAKELLKILQDRYEKPSANKKILLWTKYFNIHMDDGTSVNFHINEIIDILNKLEGMSVKIEEKVKAMRLLTSLPDSSETMKTAVSNSLGENSLKI
        V KE T + L+K+L D YEKPSAN K+ L  K F++ M++G  V  HINE   I+N+L  + ++ E++V+A+ LL SLP+S E+M+ AVSNS+G   LK 
Subjt:  VAKETTAKELLKILQDRYEKPSANKKILLWTKYFNIHMDDGTSVNFHINEIIDILNKLEGMSVKIEEKVKAMRLLTSLPDSSETMKTAVSNSLGENSLKI

Query:  SAICDTVLSEETQRKLGKMSASTSGAEN
        + + D +L+EE  R++    ASTS A N
Subjt:  SAICDTVLSEETQRKLGKMSASTSGAEN

A0A0D3BM55 Uncharacterized protein4.6e-1945.31Show/hide
Query:  VAKETTAKELLKILQDRYEKPSANKKILLWTKYFNIHMDDGTSVNFHINEIIDILNKLEGMSVKIEEKVKAMRLLTSLPDSSETMKTAVSNSLGENSLKI
        VAKE T + L+K+L D YEKPSAN K+ L  K F++ M++G  V  H+NE   I+N+L  + ++ +++V+A+ LL SLP+S E M+ AVSNS+G   LK 
Subjt:  VAKETTAKELLKILQDRYEKPSANKKILLWTKYFNIHMDDGTSVNFHINEIIDILNKLEGMSVKIEEKVKAMRLLTSLPDSSETMKTAVSNSLGENSLKI

Query:  SAICDTVLSEETQRKLGKMSASTSGAEN
        + + D +L+EE  R++    ASTS A N
Subjt:  SAICDTVLSEETQRKLGKMSASTSGAEN

A0A0D3CS45 Uncharacterized protein1.0e-1845.31Show/hide
Query:  VAKETTAKELLKILQDRYEKPSANKKILLWTKYFNIHMDDGTSVNFHINEIIDILNKLEGMSVKIEEKVKAMRLLTSLPDSSETMKTAVSNSLGENSLKI
        VAKE   + L+K+L D YEKPSAN K+ L  K F++ M++G  V  H+NE   I+N+L  + ++ E++V+A+ LL SLP+S E M+ AVSNS+G   LK 
Subjt:  VAKETTAKELLKILQDRYEKPSANKKILLWTKYFNIHMDDGTSVNFHINEIIDILNKLEGMSVKIEEKVKAMRLLTSLPDSSETMKTAVSNSLGENSLKI

Query:  SAICDTVLSEETQRKLGKMSASTSGAEN
        + + D +L+EE  R++    ASTS A N
Subjt:  SAICDTVLSEETQRKLGKMSASTSGAEN

A0A0D3DQC2 Abhydrolase_3 domain-containing protein6.0e-1940.26Show/hide
Query:  VAKETTAKELLKILQDRYEKPSANKKILLWTKYFNIHMDDGTSVNFHINEIIDILNKLEGMSVKIEEKVKAMRLLTSLPDSSETMKTAVSNSLGENSLKI
        VAKE TA+ L+K+L D YEKPSAN K+ L  K F++ M++G  V  H+NE   I+N+L  + ++ +++V+A+ LL SLP+S E M+ AVSNS+G   LK 
Subjt:  VAKETTAKELLKILQDRYEKPSANKKILLWTKYFNIHMDDGTSVNFHINEIIDILNKLEGMSVKIEEKVKAMRLLTSLPDSSETMKTAVSNSLGENSLKI

Query:  SAICDTVLSEETQRKLGKMSASTSGAENGVESALVAQNKEKAK-MNYNGKQQQR
        + + D  L+E+  R++    ASTS A N         N+   +  + NG+ Q +
Subjt:  SAICDTVLSEETQRKLGKMSASTSGAENGVESALVAQNKEKAK-MNYNGKQQQR

A0A3P6FTI2 Abhydrolase_3 domain-containing protein4.6e-1940.91Show/hide
Query:  VAKETTAKELLKILQDRYEKPSANKKILLWTKYFNIHMDDGTSVNFHINEIIDILNKLEGMSVKIEEKVKAMRLLTSLPDSSETMKTAVSNSLGENSLKI
        VAKE T + L+K+L D YEKPSAN K+ L  K F++ M++G  V  H+NE   I+N+L  + ++ E++V+A+ LL SLP+S E M+ AVSN +G   LK 
Subjt:  VAKETTAKELLKILQDRYEKPSANKKILLWTKYFNIHMDDGTSVNFHINEIIDILNKLEGMSVKIEEKVKAMRLLTSLPDSSETMKTAVSNSLGENSLKI

Query:  SAICDTVLSEETQRKLGKMSASTSGAENGVESALVAQNKEKAK-MNYNGKQQQR
        + + D +L+EE  R++    ASTS A N         N+   K  + NG+ Q +
Subjt:  SAICDTVLSEETQRKLGKMSASTSGAENGVESALVAQNKEKAK-MNYNGKQQQR

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-945.1e-0728.7Show/hide
Query:  VAKETTAKELLKILQDRYEKPSANKKILLWTKYFNIHMDDGTSVNFHINEIIDILNKLEGMSVKIEEKVKAMRLLTSLPDSSETMKTAVSNSLGENSLKI
        +  E TA+ +   L+  Y   +   K+ L  + + +HM +GT+   H+N    ++ +L  + VKIEE+ KA+ LL SLP S + + T + +  G+ ++++
Subjt:  VAKETTAKELLKILQDRYEKPSANKKILLWTKYFNIHMDDGTSVNFHINEIIDILNKLEGMSVKIEEKVKAMRLLTSLPDSSETMKTAVSNSLGENSLKI

Query:  SAICDTVLSEETQRK
          +   +L  E  RK
Subjt:  SAICDTVLSEETQRK

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGAGCACCGCCGCCGTTGGTGGAAGGATTGTGGGTCTGCCCCAATTGGTCGTGCTGTGCGTGGGTGAGAAGTCTAAGGAGAGGGAGGAGGGGGTGTGCGTTAGGAT
TGAGAAGTGTCGAAGTGGGGATGAAAAACAAGGAAGCAGGACACTGCAGGCCACTTTGAGCCGTTTTTCAGATGGTTTCATGGCAGTTTCAAGGGAGAGGCTCGGGCATT
GGCAGAGTCTGGTGGCGAAAGAGACTACAGCAAAGGAATTGTTGAAGATCTTGCAAGACAGGTATGAAAAACCTTCTGCCAATAAAAAAATACTTCTATGGACAAAGTAT
TTTAATATCCACATGGATGATGGAACCTCGGTGAATTTCCATATTAATGAGATCATTGATATCTTGAACAAATTAGAAGGGATGAGTGTCAAGATTGAAGAGAAGGTGAA
AGCTATGAGGCTGTTGACGTCTTTGCCTGACAGTTCGGAGACGATGAAGACCGCAGTGTCAAATTCGCTAGGGGAAAATAGCTTGAAAATTTCAGCTATTTGTGATACCG
TCTTATCTGAAGAAACTCAAAGAAAATTAGGGAAAATGTCTGCGTCTACTTCGGGGGCAGAAAACGGGGTTGAATCAGCTTTGGTAGCTCAGAACAAAGAGAAGGCAAAG
ATGAATTACAATGGGAAGCAGCAGCAGAGATTAACAGGGATAGTGGGAGTTCTAGTGGGGAAGTGGAGTGTTATTATTGCCACAAGAAGGGACACGTTAAACATTTTTGC
AGGAAGTTCAAAGAATATTTTGAGAAGGGGAACTAACCTGCAAATGTTGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAGAGCACCGCCGCCGTTGGTGGAAGGATTGTGGGTCTGCCCCAATTGGTCGTGCTGTGCGTGGGTGAGAAGTCTAAGGAGAGGGAGGAGGGGGTGTGCGTTAGGAT
TGAGAAGTGTCGAAGTGGGGATGAAAAACAAGGAAGCAGGACACTGCAGGCCACTTTGAGCCGTTTTTCAGATGGTTTCATGGCAGTTTCAAGGGAGAGGCTCGGGCATT
GGCAGAGTCTGGTGGCGAAAGAGACTACAGCAAAGGAATTGTTGAAGATCTTGCAAGACAGGTATGAAAAACCTTCTGCCAATAAAAAAATACTTCTATGGACAAAGTAT
TTTAATATCCACATGGATGATGGAACCTCGGTGAATTTCCATATTAATGAGATCATTGATATCTTGAACAAATTAGAAGGGATGAGTGTCAAGATTGAAGAGAAGGTGAA
AGCTATGAGGCTGTTGACGTCTTTGCCTGACAGTTCGGAGACGATGAAGACCGCAGTGTCAAATTCGCTAGGGGAAAATAGCTTGAAAATTTCAGCTATTTGTGATACCG
TCTTATCTGAAGAAACTCAAAGAAAATTAGGGAAAATGTCTGCGTCTACTTCGGGGGCAGAAAACGGGGTTGAATCAGCTTTGGTAGCTCAGAACAAAGAGAAGGCAAAG
ATGAATTACAATGGGAAGCAGCAGCAGAGATTAACAGGGATAGTGGGAGTTCTAGTGGGGAAGTGGAGTGTTATTATTGCCACAAGAAGGGACACGTTAAACATTTTTGC
AGGAAGTTCAAAGAATATTTTGAGAAGGGGAACTAACCTGCAAATGTTGTAA
Protein sequenceShow/hide protein sequence
MESTAAVGGRIVGLPQLVVLCVGEKSKEREEGVCVRIEKCRSGDEKQGSRTLQATLSRFSDGFMAVSRERLGHWQSLVAKETTAKELLKILQDRYEKPSANKKILLWTKY
FNIHMDDGTSVNFHINEIIDILNKLEGMSVKIEEKVKAMRLLTSLPDSSETMKTAVSNSLGENSLKISAICDTVLSEETQRKLGKMSASTSGAENGVESALVAQNKEKAK
MNYNGKQQQRLTGIVGVLVGKWSVIIATRRDTLNIFAGSSKNILRRGTNLQML