; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g22880 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g22880
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationchr4:16621350..16628163
RNA-Seq ExpressionMoc04g22880
SyntenyMoc04g22880
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
RVW36054.1 Retrovirus-related Pol polyprotein from transposon RE2 [Vitis vinifera]3.0e-1434Show/hide
Query:  DEHDEKNFELGTFPNPEIALNVCLKASKKSKWYLDNGCSRHMTGDQSKFVTLSKNDGGFVTFGDNKK--------ERNFGDLLVSDK------SKDIGSS
        +E +EK  E   F    +A++   + SK+ KW+LD+GCSRHMTGD+SKF  L+K  GG+VTFGDN K        E + G L + DK       +D    
Subjt:  DEHDEKNFELGTFPNPEIALNVCLKASKKSKWYLDNGCSRHMTGDQSKFVTLSKNDGGFVTFGDNKK--------ERNFGDLLVSDK------SKDIGSS

Query:  KQEVSI-DENKVDGFSS--MLKEWKYAPSHPKDLILGDSEQDDTYTNTEEELEGRPPKRRRLHWTHQQILT--IFKANEKLIR--VKRKSGEEERLSEWP
        K  +++    +V G SS  + K+WK+  +HP+D I+G+     + +     L     + + L WT  Q L    F    K I+  +KR + EE ++ + P
Subjt:  KQEVSI-DENKVDGFSS--MLKEWKYAPSHPKDLILGDSEQDDTYTNTEEELEGRPPKRRRLHWTHQQILT--IFKANEKLIR--VKRKSGEEERLSEWP

XP_022155998.1 uncharacterized protein LOC111022973 [Momordica charantia]2.9e-1759.77Show/hide
Query:  MAHGDKEDEHDEKNFELGTFPNPEIALNVCLKASKKSKWYLDNGCSRHMTGDQSKFVTLSKNDGGFVTFGDNKKERNFGDLLVSDKS
        MAH DKEDE D++               VCLKASKKSKWYLD+GCSRHMTGDQSKFVT SK DG FVTF DNKK +  G   + ++S
Subjt:  MAHGDKEDEHDEKNFELGTFPNPEIALNVCLKASKKSKWYLDNGCSRHMTGDQSKFVTLSKNDGGFVTFGDNKKERNFGDLLVSDKS

XP_022156978.1 uncharacterized protein LOC111023806 [Momordica charantia]9.5e-1671.43Show/hide
Query:  IALNVCLKASKKSKWYLDNGCSRHMTGDQSKFVTLSKNDGGFVTFGDNKKERNFGDLLVSDKS
        +A  VCLKASKKSKWYLD+ CSRHMTGDQSKFVT SK DGGFVTFGDNKK +  G   + ++S
Subjt:  IALNVCLKASKKSKWYLDNGCSRHMTGDQSKFVTLSKNDGGFVTFGDNKKERNFGDLLVSDKS

XP_022158792.1 uncharacterized protein LOC111025259 [Momordica charantia]6.3e-4489.32Show/hide
Query:  VCLKASKKSKWYLDNGCSRHMTGDQSKFVTLSKNDGGFVTFGDNKKERNFGDLLVSDKSKDIGSSKQEVSIDENKVDGFSSMLKEWKYAPSHPKDLILGD
        VCLKASKK KWYLD+GCSR+MTGDQSKFVT SK DGGFVTFGD+KKERNFGDLLVSDKSK+I SSKQEVSI+ENKVDGFSSM KEWKYAPSHPKDLILGD
Subjt:  VCLKASKKSKWYLDNGCSRHMTGDQSKFVTLSKNDGGFVTFGDNKKERNFGDLLVSDKSKDIGSSKQEVSIDENKVDGFSSMLKEWKYAPSHPKDLILGD

Query:  SEQ
         EQ
Subjt:  SEQ

XP_022950378.1 uncharacterized protein LOC111453493 [Cucurbita moschata]9.5e-1656.32Show/hide
Query:  MAHGDKEDEHDEKNFELGTFPNPEIALNVCLKASKKSKWYLDNGCSRHMTGDQSKFVTLSKNDGGFVTFGDNKKERNFGDLLVSDKS
        MAH DKEDE +++               VCLKASKK+KWYLD+GCSRHMTG+ SKFV LSK DGG VTFGDNKK +  G   + + S
Subjt:  MAHGDKEDEHDEKNFELGTFPNPEIALNVCLKASKKSKWYLDNGCSRHMTGDQSKFVTLSKNDGGFVTFGDNKKERNFGDLLVSDKS

TrEMBL top hitse value%identityAlignment
A0A438DKP4 Retrovirus-related Pol polyprotein from transposon RE21.5e-1434Show/hide
Query:  DEHDEKNFELGTFPNPEIALNVCLKASKKSKWYLDNGCSRHMTGDQSKFVTLSKNDGGFVTFGDNKK--------ERNFGDLLVSDK------SKDIGSS
        +E +EK  E   F    +A++   + SK+ KW+LD+GCSRHMTGD+SKF  L+K  GG+VTFGDN K        E + G L + DK       +D    
Subjt:  DEHDEKNFELGTFPNPEIALNVCLKASKKSKWYLDNGCSRHMTGDQSKFVTLSKNDGGFVTFGDNKK--------ERNFGDLLVSDK------SKDIGSS

Query:  KQEVSI-DENKVDGFSS--MLKEWKYAPSHPKDLILGDSEQDDTYTNTEEELEGRPPKRRRLHWTHQQILT--IFKANEKLIR--VKRKSGEEERLSEWP
        K  +++    +V G SS  + K+WK+  +HP+D I+G+     + +     L     + + L WT  Q L    F    K I+  +KR + EE ++ + P
Subjt:  KQEVSI-DENKVDGFSS--MLKEWKYAPSHPKDLILGDSEQDDTYTNTEEELEGRPPKRRRLHWTHQQILT--IFKANEKLIR--VKRKSGEEERLSEWP

A0A6J1DPE4 uncharacterized protein LOC1110229731.4e-1759.77Show/hide
Query:  MAHGDKEDEHDEKNFELGTFPNPEIALNVCLKASKKSKWYLDNGCSRHMTGDQSKFVTLSKNDGGFVTFGDNKKERNFGDLLVSDKS
        MAH DKEDE D++               VCLKASKKSKWYLD+GCSRHMTGDQSKFVT SK DG FVTF DNKK +  G   + ++S
Subjt:  MAHGDKEDEHDEKNFELGTFPNPEIALNVCLKASKKSKWYLDNGCSRHMTGDQSKFVTLSKNDGGFVTFGDNKKERNFGDLLVSDKS

A0A6J1DS74 uncharacterized protein LOC1110238064.6e-1671.43Show/hide
Query:  IALNVCLKASKKSKWYLDNGCSRHMTGDQSKFVTLSKNDGGFVTFGDNKKERNFGDLLVSDKS
        +A  VCLKASKKSKWYLD+ CSRHMTGDQSKFVT SK DGGFVTFGDNKK +  G   + ++S
Subjt:  IALNVCLKASKKSKWYLDNGCSRHMTGDQSKFVTLSKNDGGFVTFGDNKKERNFGDLLVSDKS

A0A6J1DY46 uncharacterized protein LOC1110252593.1e-4489.32Show/hide
Query:  VCLKASKKSKWYLDNGCSRHMTGDQSKFVTLSKNDGGFVTFGDNKKERNFGDLLVSDKSKDIGSSKQEVSIDENKVDGFSSMLKEWKYAPSHPKDLILGD
        VCLKASKK KWYLD+GCSR+MTGDQSKFVT SK DGGFVTFGD+KKERNFGDLLVSDKSK+I SSKQEVSI+ENKVDGFSSM KEWKYAPSHPKDLILGD
Subjt:  VCLKASKKSKWYLDNGCSRHMTGDQSKFVTLSKNDGGFVTFGDNKKERNFGDLLVSDKSKDIGSSKQEVSIDENKVDGFSSMLKEWKYAPSHPKDLILGD

Query:  SEQ
         EQ
Subjt:  SEQ

A0A6J1GER0 uncharacterized protein LOC1114534934.6e-1656.32Show/hide
Query:  MAHGDKEDEHDEKNFELGTFPNPEIALNVCLKASKKSKWYLDNGCSRHMTGDQSKFVTLSKNDGGFVTFGDNKKERNFGDLLVSDKS
        MAH DKEDE +++               VCLKASKK+KWYLD+GCSRHMTG+ SKFV LSK DGG VTFGDNKK +  G   + + S
Subjt:  MAHGDKEDEHDEKNFELGTFPNPEIALNVCLKASKKSKWYLDNGCSRHMTGDQSKFVTLSKNDGGFVTFGDNKKERNFGDLLVSDKS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTCATGGTGACAAGGAGGATGAACATGATGAGAAAAATTTCGAGCTAGGCACCTTCCCCAATCCCGAGATTGCCCTTAATGTTTGTTTGAAAGCTTCCAAG
AAAAGTAAGTGGTACTTGGATAATGGTTGCTCGAGGCACATGACGGGAGACCAATCCAAGTTTGTTACTCTCTCCAAAAATGATGGAGGTTTTGTAACATTTGGT
GACAACAAGAAAGAAAGAAATTTTGGAGATTTACTTGTTAGTGACAAAAGCAAAGACATTGGTTCAAGTAAGCAAGAAGTGAGCATCGACGAAAATAAGGTGGAC
GGTTTTTCATCCATGCTAAAGGAGTGGAAGTATGCTCCATCCCATCCTAAGGATTTAATCCTTGGTGATTCCGAACAAGATGACACCTACACCAACACGGAAGAA
GAGCTTGAGGGAAGACCCCCCAAACGAAGGAGACTTCATTGGACACATCAACAAATTTTGACTATTTTCAAAGCCAATGAAAAACTCATTAGGGTCAAGAGGAAG
AGTGGAGAAGAAGAGAGGTTAAGCGAATGGCCCCATTTCAAGGACAAGCCTATCGAGAGACCACTCACACGCAAGCACCGCCAGACCCACGCCTCTGCGCATGCG
CACCCCTGCCATGCACGCCAACATGCCGCTAACCTCACAGCACTGATGCCATTCACGCGCTGTCATCCTCCCACGCCTGTGCCAGCCGTCTCCGCCCGCGCGCTA
ACGCCCCTGCACGCGCAAGCGTCCGCGCGCCATGCCGTTACCAACACCCATACCTGTGCGCATCGCACGCTCATCAGCACCCCTGCTGCCCATACCGCTCGCCCA
GCGCCCGCACACATCCGCAACCATCCTACCGCTCAGCGCCCACGCCGCCCGCACCATGCCACTCACCAAGCACCAGCGCGCCCTCATGCCAGCGCCCTTATCGCC
CGCCAGCATGCCCTGCCCCAACGTCCACGCGCGCGTCCCCGTGCCAGCGCGTGCCCCAACGCCCAGACATCCGCCAGTGCGCCCCTGGCCCAGCGCTCATGCGCG
TGCGTCCATCGTGCCAATGAAGAACCCACTATGGTCAAAAGAAAGCGTGGAGAAGAAGAGCGGTCAAGCAAGTGGCTCCATTTCAAGGACAAGCCTATTGAAAGG
CCACTTGCACGCAAGCACCGCCAGTCGCACGCCCTACGCGTGCGCACCCCTGCCGCTAACCTCACAACGCCCATGCCTGCGCACGCTGCCGCCCGCCCACTCCTG
TGCCAGACGCCTCTGCCCGCGTGCCAACACTCCCTGCACGCCCAAGCACCCGAGCATCCTGTCACCACTCACGCGCCCTACCGCCACTCGCGCGCCTATGCCGCC
ACCAGCGCCCATGCCCGCGCACGCCACACGTTCAAAACGTCCCTACCGCACATCACCAACGCCTCTGCCGCCCAGCGCCAGCGCCCTGCCGCGCATCGCCAGCAC
CCCTACCGCGCATCGTTAGCACCTGTCCCAGTGCCCATGCGCACGCCAAGAGCGCCTGCCCCAGCGCCCAACGCCCATACCGCACGCCAAGCGCGCCTGCCCCAA
CGCCCATACCGCATGCCAAGCGCGCCTGCCCCAACGCCCAGAGCCCATGCTGCACGCCCAGTGCGCCTGCCCCAGCGCCCAGCGACCATGCCGCACGCCCAGCGT
GCCCCACTCGACGTCTATGAGCGCTCGCCACCCTTGCCAGCCATGCCTTGCCGCCCACTCCTCCCAAATAAGGAATGTCTCTCCTTATGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTCATGGTGACAAGGAGGATGAACATGATGAGAAAAATTTCGAGCTAGGCACCTTCCCCAATCCCGAGATTGCCCTTAATGTTTGTTTGAAAGCTTCCAAG
AAAAGTAAGTGGTACTTGGATAATGGTTGCTCGAGGCACATGACGGGAGACCAATCCAAGTTTGTTACTCTCTCCAAAAATGATGGAGGTTTTGTAACATTTGGT
GACAACAAGAAAGAAAGAAATTTTGGAGATTTACTTGTTAGTGACAAAAGCAAAGACATTGGTTCAAGTAAGCAAGAAGTGAGCATCGACGAAAATAAGGTGGAC
GGTTTTTCATCCATGCTAAAGGAGTGGAAGTATGCTCCATCCCATCCTAAGGATTTAATCCTTGGTGATTCCGAACAAGATGACACCTACACCAACACGGAAGAA
GAGCTTGAGGGAAGACCCCCCAAACGAAGGAGACTTCATTGGACACATCAACAAATTTTGACTATTTTCAAAGCCAATGAAAAACTCATTAGGGTCAAGAGGAAG
AGTGGAGAAGAAGAGAGGTTAAGCGAATGGCCCCATTTCAAGGACAAGCCTATCGAGAGACCACTCACACGCAAGCACCGCCAGACCCACGCCTCTGCGCATGCG
CACCCCTGCCATGCACGCCAACATGCCGCTAACCTCACAGCACTGATGCCATTCACGCGCTGTCATCCTCCCACGCCTGTGCCAGCCGTCTCCGCCCGCGCGCTA
ACGCCCCTGCACGCGCAAGCGTCCGCGCGCCATGCCGTTACCAACACCCATACCTGTGCGCATCGCACGCTCATCAGCACCCCTGCTGCCCATACCGCTCGCCCA
GCGCCCGCACACATCCGCAACCATCCTACCGCTCAGCGCCCACGCCGCCCGCACCATGCCACTCACCAAGCACCAGCGCGCCCTCATGCCAGCGCCCTTATCGCC
CGCCAGCATGCCCTGCCCCAACGTCCACGCGCGCGTCCCCGTGCCAGCGCGTGCCCCAACGCCCAGACATCCGCCAGTGCGCCCCTGGCCCAGCGCTCATGCGCG
TGCGTCCATCGTGCCAATGAAGAACCCACTATGGTCAAAAGAAAGCGTGGAGAAGAAGAGCGGTCAAGCAAGTGGCTCCATTTCAAGGACAAGCCTATTGAAAGG
CCACTTGCACGCAAGCACCGCCAGTCGCACGCCCTACGCGTGCGCACCCCTGCCGCTAACCTCACAACGCCCATGCCTGCGCACGCTGCCGCCCGCCCACTCCTG
TGCCAGACGCCTCTGCCCGCGTGCCAACACTCCCTGCACGCCCAAGCACCCGAGCATCCTGTCACCACTCACGCGCCCTACCGCCACTCGCGCGCCTATGCCGCC
ACCAGCGCCCATGCCCGCGCACGCCACACGTTCAAAACGTCCCTACCGCACATCACCAACGCCTCTGCCGCCCAGCGCCAGCGCCCTGCCGCGCATCGCCAGCAC
CCCTACCGCGCATCGTTAGCACCTGTCCCAGTGCCCATGCGCACGCCAAGAGCGCCTGCCCCAGCGCCCAACGCCCATACCGCACGCCAAGCGCGCCTGCCCCAA
CGCCCATACCGCATGCCAAGCGCGCCTGCCCCAACGCCCAGAGCCCATGCTGCACGCCCAGTGCGCCTGCCCCAGCGCCCAGCGACCATGCCGCACGCCCAGCGT
GCCCCACTCGACGTCTATGAGCGCTCGCCACCCTTGCCAGCCATGCCTTGCCGCCCACTCCTCCCAAATAAGGAATGTCTCTCCTTATGA
Protein sequenceShow/hide protein sequence
MAHGDKEDEHDEKNFELGTFPNPEIALNVCLKASKKSKWYLDNGCSRHMTGDQSKFVTLSKNDGGFVTFGDNKKERNFGDLLVSDKSKDIGSSKQEVSIDENKVD
GFSSMLKEWKYAPSHPKDLILGDSEQDDTYTNTEEELEGRPPKRRRLHWTHQQILTIFKANEKLIRVKRKSGEEERLSEWPHFKDKPIERPLTRKHRQTHASAHA
HPCHARQHAANLTALMPFTRCHPPTPVPAVSARALTPLHAQASARHAVTNTHTCAHRTLISTPAAHTARPAPAHIRNHPTAQRPRRPHHATHQAPARPHASALIA
RQHALPQRPRARPRASACPNAQTSASAPLAQRSCACVHRANEEPTMVKRKRGEEERSSKWLHFKDKPIERPLARKHRQSHALRVRTPAANLTTPMPAHAAARPLL
CQTPLPACQHSLHAQAPEHPVTTHAPYRHSRAYAATSAHARARHTFKTSLPHITNASAAQRQRPAAHRQHPYRASLAPVPVPMRTPRAPAPAPNAHTARQARLPQ
RPYRMPSAPAPTPRAHAARPVRLPQRPATMPHAQRAPLDVYERSPPLPAMPCRPLLPNKECLSL