; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g17040 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g17040
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionGag/pol protein
Genome locationchr4:12574018..12580086
RNA-Seq ExpressionMoc04g17040
SyntenyMoc04g17040
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0026233.1 gag/pol protein [Cucumis melo var. makuwa]3.5e-4040.88Show/hide
Query:  VMATSIIALLVAQKLNSENYKQWKSNLNTILVIDDLRFVLQEDCPQAHMSNAT-----------------------------------------------
        +++++ + +L A KLN  NY  WK+ +N +L+IDDLRFVL EDCPQ   +NAT                                               
Subjt:  VMATSIIALLVAQKLNSENYKQWKSNLNTILVIDDLRFVLQEDCPQAHMSNAT-----------------------------------------------

Query:  ---------------------------------------------------VLSQVNFILESLPKSFLPFHNNAVMNKLEYTVTTLLNELQTYQSLMKSK
                                                             SQV+FILESLP+SFL F +NAVMNK+ YT+TTLLNELQT++SLMK K
Subjt:  ---------------------------------------------------VLSQVNFILESLPKSFLPFHNNAVMNKLEYTVTTLLNELQTYQSLMKSK

Query:  GQEGEANVATS-KWFNRGSSSGTKSAPSFSGSKTFKKKKAAGKGSKPDSAVAAAKKGKAKAADKGKCFPCNLDGHWKRNCPKELVPRGSLTSEKMT
        GQ+GEANVATS + F+RGS+SGTKS PS SG+K +KKKK  G+G+K + A A   K KAKAA KG CF CN +GHWKRNCPK L  +      KMT
Subjt:  GQEGEANVATS-KWFNRGSSSGTKSAPSFSGSKTFKKKKAAGKGSKPDSAVAAAKKGKAKAADKGKCFPCNLDGHWKRNCPKELVPRGSLTSEKMT

KAA0032020.1 gag/pol protein [Cucumis melo var. makuwa]2.1e-4041.69Show/hide
Query:  MATSIIALLVAQKLNSENYKQWKSNLNTILVIDDLRFVLQEDCPQAHMSNAT------------------------------------------------
        M ++ + +L A KLN  NY  WK+ +NT+L+IDDLRFVL E+CPQ    NAT                                                
Subjt:  MATSIIALLVAQKLNSENYKQWKSNLNTILVIDDLRFVLQEDCPQAHMSNAT------------------------------------------------

Query:  --------------------------------------------------VLSQVNFILESLPKSFLPFHNNAVMNKLEYTVTTLLNELQTYQSLMKSKG
                                                            SQV+FILESLP+SFL F +NAVMNK+ YT+TTLLNELQT++SLMK KG
Subjt:  --------------------------------------------------VLSQVNFILESLPKSFLPFHNNAVMNKLEYTVTTLLNELQTYQSLMKSKG

Query:  QEGEANVATS-KWFNRGSSSGTKSAPSFSGSKTFKKKKAAGKGSKPDSAVAAAKKGKAKAADKGKCFPCNLDGHWKRNCPKELVPRGSLTSEKMT
        Q+GEANVATS + F+RGS+SGTKS PS SGSK +KKKK  G+G+K + A A   K KAKAA KG CF CN +GHWKRNCPK L         KMT
Subjt:  QEGEANVATS-KWFNRGSSSGTKSAPSFSGSKTFKKKKAAGKGSKPDSAVAAAKKGKAKAADKGKCFPCNLDGHWKRNCPKELVPRGSLTSEKMT

KAA0044955.1 gag/pol protein [Cucumis melo var. makuwa]7.9e-4040.94Show/hide
Query:  MATSIIALLVAQKLNSENYKQWKSNLNTILVIDDLRFVLQEDCPQAHMSNAT------------------------------------------------
        M ++ + +L A KLN  NY  WK+ +NT+L+IDDLRFVL E+CPQ   +NAT                                                
Subjt:  MATSIIALLVAQKLNSENYKQWKSNLNTILVIDDLRFVLQEDCPQAHMSNAT------------------------------------------------

Query:  --------------------------------------------------VLSQVNFILESLPKSFLPFHNNAVMNKLEYTVTTLLNELQTYQSLMKSKG
                                                            SQV+FILESLP+SFL F +NAVMNK+ YT+TTLLNELQT++SLMK KG
Subjt:  --------------------------------------------------VLSQVNFILESLPKSFLPFHNNAVMNKLEYTVTTLLNELQTYQSLMKSKG

Query:  QEGEANVATS-KWFNRGSSSGTKSAPSFSGSKTFKKKKAAGKGSKPDSAVAAAKKGKAKAADKGKCFPCNLDGHWKRNCPKELVPRGSLTSEKMTLKV
        Q+GEANVATS + F+RGS+SGTKS PS SG+K +KKKK  G+G+K + A A   K KAKAA KG CF CN +GHWKRNCPK L  +      K  L V
Subjt:  QEGEANVATS-KWFNRGSSSGTKSAPSFSGSKTFKKKKAAGKGSKPDSAVAAAKKGKAKAADKGKCFPCNLDGHWKRNCPKELVPRGSLTSEKMTLKV

KAA0051952.1 gag/pol protein [Cucumis melo var. makuwa]1.6e-4041.36Show/hide
Query:  MATSIIALLVAQKLNSENYKQWKSNLNTILVIDDLRFVLQEDCPQAHMSNAT------------------------------------------------
        M ++ + +L A KLN  NY  WK+ +NT+L+IDDLRFVL E+CPQ   +NAT                                                
Subjt:  MATSIIALLVAQKLNSENYKQWKSNLNTILVIDDLRFVLQEDCPQAHMSNAT------------------------------------------------

Query:  --------------------------------------------------VLSQVNFILESLPKSFLPFHNNAVMNKLEYTVTTLLNELQTYQSLMKSKG
                                                            SQV+FILESLP+SFL F +NAVMNK+ YT+TTLLNELQT++SLMK KG
Subjt:  --------------------------------------------------VLSQVNFILESLPKSFLPFHNNAVMNKLEYTVTTLLNELQTYQSLMKSKG

Query:  QEGEANVATS-KWFNRGSSSGTKSAPSFSGSKTFKKKKAAGKGSKPDSAVAAAKKGKAKAADKGKCFPCNLDGHWKRNCPKELVPRGSLTSEKMT
        Q+GEANVATS + F+RGS+SGTKS PS SG+K +KKKK  G+G+K + A A   K KAKAA KG CF CN +GHWKRNCPK L  +      KMT
Subjt:  QEGEANVATS-KWFNRGSSSGTKSAPSFSGSKTFKKKKAAGKGSKPDSAVAAAKKGKAKAADKGKCFPCNLDGHWKRNCPKELVPRGSLTSEKMT

KAA0055498.1 gag-pol fusion protein [Cucumis melo var. makuwa]2.0e-4346.67Show/hide
Query:  MATSIIALLVAQKLNSENYKQWKSNLNTILVIDDLRFVLQEDCPQAHMSNAT------------------------------------------------
        M ++ + +L A KLN  NY  WK+ +NT+L+IDDLRFVL E+CPQ   +NAT                                                
Subjt:  MATSIIALLVAQKLNSENYKQWKSNLNTILVIDDLRFVLQEDCPQAHMSNAT------------------------------------------------

Query:  ----------VLSQVNFILESLPKSFLPFHNNAVMNKLEYTVTTLLNELQTYQSLMKSKGQEGEANVATS-KWFNRGSSSGTKSAPSFSGSKTFKKKKAA
                    SQV+FILESLP+SFL F +NAVMNK+ YT+TTLLNELQT++SLMK KGQ+GE NVATS + F+R S+SGTKS PS SG+K +KKK   
Subjt:  ----------VLSQVNFILESLPKSFLPFHNNAVMNKLEYTVTTLLNELQTYQSLMKSKGQEGEANVATS-KWFNRGSSSGTKSAPSFSGSKTFKKKKAA

Query:  GKGSKPDSAVAAAKKGKAKAADKGKCFPCNLDGHWKRNCPKELVPRGSLTSEKMT
        G+G+K + A A   K KAKAA KG CF CN +GHWKRNCPK L  +  +   KMT
Subjt:  GKGSKPDSAVAAAKKGKAKAADKGKCFPCNLDGHWKRNCPKELVPRGSLTSEKMT

TrEMBL top hitse value%identityAlignment
A0A5A7SMH8 Gag/pol protein3.8e-4040.94Show/hide
Query:  MATSIIALLVAQKLNSENYKQWKSNLNTILVIDDLRFVLQEDCPQAHMSNAT------------------------------------------------
        M ++ + +L A KLN  NY  WK+ +NT+L+IDDLRFVL E+CPQ   +NAT                                                
Subjt:  MATSIIALLVAQKLNSENYKQWKSNLNTILVIDDLRFVLQEDCPQAHMSNAT------------------------------------------------

Query:  --------------------------------------------------VLSQVNFILESLPKSFLPFHNNAVMNKLEYTVTTLLNELQTYQSLMKSKG
                                                            SQV+FILESLP+SFL F +NAVMNK+ YT+TTLLNELQT++SLMK KG
Subjt:  --------------------------------------------------VLSQVNFILESLPKSFLPFHNNAVMNKLEYTVTTLLNELQTYQSLMKSKG

Query:  QEGEANVATS-KWFNRGSSSGTKSAPSFSGSKTFKKKKAAGKGSKPDSAVAAAKKGKAKAADKGKCFPCNLDGHWKRNCPKELVPRGSLTSEKMTLKV
        Q+GEANVATS + F+RGS+SGTKS PS SG+K +KKKK  G+G+K + A A   K KAKAA KG CF CN +GHWKRNCPK L  +      K  L V
Subjt:  QEGEANVATS-KWFNRGSSSGTKSAPSFSGSKTFKKKKAAGKGSKPDSAVAAAKKGKAKAADKGKCFPCNLDGHWKRNCPKELVPRGSLTSEKMTLKV

A0A5A7SMN4 Gag/pol protein1.0e-4041.69Show/hide
Query:  MATSIIALLVAQKLNSENYKQWKSNLNTILVIDDLRFVLQEDCPQAHMSNAT------------------------------------------------
        M ++ + +L A KLN  NY  WK+ +NT+L+IDDLRFVL E+CPQ    NAT                                                
Subjt:  MATSIIALLVAQKLNSENYKQWKSNLNTILVIDDLRFVLQEDCPQAHMSNAT------------------------------------------------

Query:  --------------------------------------------------VLSQVNFILESLPKSFLPFHNNAVMNKLEYTVTTLLNELQTYQSLMKSKG
                                                            SQV+FILESLP+SFL F +NAVMNK+ YT+TTLLNELQT++SLMK KG
Subjt:  --------------------------------------------------VLSQVNFILESLPKSFLPFHNNAVMNKLEYTVTTLLNELQTYQSLMKSKG

Query:  QEGEANVATS-KWFNRGSSSGTKSAPSFSGSKTFKKKKAAGKGSKPDSAVAAAKKGKAKAADKGKCFPCNLDGHWKRNCPKELVPRGSLTSEKMT
        Q+GEANVATS + F+RGS+SGTKS PS SGSK +KKKK  G+G+K + A A   K KAKAA KG CF CN +GHWKRNCPK L         KMT
Subjt:  QEGEANVATS-KWFNRGSSSGTKSAPSFSGSKTFKKKKAAGKGSKPDSAVAAAKKGKAKAADKGKCFPCNLDGHWKRNCPKELVPRGSLTSEKMT

A0A5A7SNP8 Gag/pol protein1.7e-4040.88Show/hide
Query:  VMATSIIALLVAQKLNSENYKQWKSNLNTILVIDDLRFVLQEDCPQAHMSNAT-----------------------------------------------
        +++++ + +L A KLN  NY  WK+ +N +L+IDDLRFVL EDCPQ   +NAT                                               
Subjt:  VMATSIIALLVAQKLNSENYKQWKSNLNTILVIDDLRFVLQEDCPQAHMSNAT-----------------------------------------------

Query:  ---------------------------------------------------VLSQVNFILESLPKSFLPFHNNAVMNKLEYTVTTLLNELQTYQSLMKSK
                                                             SQV+FILESLP+SFL F +NAVMNK+ YT+TTLLNELQT++SLMK K
Subjt:  ---------------------------------------------------VLSQVNFILESLPKSFLPFHNNAVMNKLEYTVTTLLNELQTYQSLMKSK

Query:  GQEGEANVATS-KWFNRGSSSGTKSAPSFSGSKTFKKKKAAGKGSKPDSAVAAAKKGKAKAADKGKCFPCNLDGHWKRNCPKELVPRGSLTSEKMT
        GQ+GEANVATS + F+RGS+SGTKS PS SG+K +KKKK  G+G+K + A A   K KAKAA KG CF CN +GHWKRNCPK L  +      KMT
Subjt:  GQEGEANVATS-KWFNRGSSSGTKSAPSFSGSKTFKKKKAAGKGSKPDSAVAAAKKGKAKAADKGKCFPCNLDGHWKRNCPKELVPRGSLTSEKMT

A0A5A7U869 Gag/pol protein7.7e-4141.36Show/hide
Query:  MATSIIALLVAQKLNSENYKQWKSNLNTILVIDDLRFVLQEDCPQAHMSNAT------------------------------------------------
        M ++ + +L A KLN  NY  WK+ +NT+L+IDDLRFVL E+CPQ   +NAT                                                
Subjt:  MATSIIALLVAQKLNSENYKQWKSNLNTILVIDDLRFVLQEDCPQAHMSNAT------------------------------------------------

Query:  --------------------------------------------------VLSQVNFILESLPKSFLPFHNNAVMNKLEYTVTTLLNELQTYQSLMKSKG
                                                            SQV+FILESLP+SFL F +NAVMNK+ YT+TTLLNELQT++SLMK KG
Subjt:  --------------------------------------------------VLSQVNFILESLPKSFLPFHNNAVMNKLEYTVTTLLNELQTYQSLMKSKG

Query:  QEGEANVATS-KWFNRGSSSGTKSAPSFSGSKTFKKKKAAGKGSKPDSAVAAAKKGKAKAADKGKCFPCNLDGHWKRNCPKELVPRGSLTSEKMT
        Q+GEANVATS + F+RGS+SGTKS PS SG+K +KKKK  G+G+K + A A   K KAKAA KG CF CN +GHWKRNCPK L  +      KMT
Subjt:  QEGEANVATS-KWFNRGSSSGTKSAPSFSGSKTFKKKKAAGKGSKPDSAVAAAKKGKAKAADKGKCFPCNLDGHWKRNCPKELVPRGSLTSEKMT

A0A5A7UKD1 Gag-pol fusion protein9.7e-4446.67Show/hide
Query:  MATSIIALLVAQKLNSENYKQWKSNLNTILVIDDLRFVLQEDCPQAHMSNAT------------------------------------------------
        M ++ + +L A KLN  NY  WK+ +NT+L+IDDLRFVL E+CPQ   +NAT                                                
Subjt:  MATSIIALLVAQKLNSENYKQWKSNLNTILVIDDLRFVLQEDCPQAHMSNAT------------------------------------------------

Query:  ----------VLSQVNFILESLPKSFLPFHNNAVMNKLEYTVTTLLNELQTYQSLMKSKGQEGEANVATS-KWFNRGSSSGTKSAPSFSGSKTFKKKKAA
                    SQV+FILESLP+SFL F +NAVMNK+ YT+TTLLNELQT++SLMK KGQ+GE NVATS + F+R S+SGTKS PS SG+K +KKK   
Subjt:  ----------VLSQVNFILESLPKSFLPFHNNAVMNKLEYTVTTLLNELQTYQSLMKSKGQEGEANVATS-KWFNRGSSSGTKSAPSFSGSKTFKKKKAA

Query:  GKGSKPDSAVAAAKKGKAKAADKGKCFPCNLDGHWKRNCPKELVPRGSLTSEKMT
        G+G+K + A A   K KAKAA KG CF CN +GHWKRNCPK L  +  +   KMT
Subjt:  GKGSKPDSAVAAAKKGKAKAADKGKCFPCNLDGHWKRNCPKELVPRGSLTSEKMT

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGAAAAATACCCGAGACAATGCCTCGACACCTAAGGGCAGCGCCACGGCACTACTGCAGCAAAAAGCTGCAGCAACAATGCCGCGGCGCTACAATGCAACGTCGTG
GCACTGTGTGCTGTCCCTGCAGCGCCATGATGTTATCTGCGTGGCCTGCTTCTTCTGCAAAAATAGGACAGGGAATAACAAAGAAGGTGAACTCCCAACATCTCTTTCAA
GCAATCTCTTTCAATTTCCCTTGTCGTTCCAAAGAAACGCTCCCACAAGCACGATTTTGGTACCAGAGGATAGCTCGGAAGATCGTTTGGTGGTGTTAGGGAATTCTTTT
GAAGAATTGTTCAACAAAGTCTGTAAGTCTGGGTCGCTCGAAAGCTGTTTGTGGCGCCGGATTGGGCGAAAAGTTGCAGAAAACAGCGAAGAAGACGAAGCAGACCGTGC
AGACAGCGCCATGGCACTGCAAGGACAGCACACAATGCCACAGCGCTACATTGTAGTGTCGTGGCGCTGTGCAGCACCATGGCGCCATGCTATAGCGCCGCGACGCTGCC
CTCAGGCGCCAAGGCGCTGTCCCGTCATGGCTACTTCTATTATTGCACTCCTAGTCGCGCAAAAACTTAACAGCGAGAATTACAAACAATGGAAATCGAATCTAAACACT
ATTCTCGTGATAGATGATCTTAGGTTCGTCTTGCAAGAGGATTGTCCTCAAGCTCATATGTCTAACGCCACTGTGCTGAGTCAGGTCAACTTCATTCTGGAATCTCTTCC
GAAGAGTTTCTTGCCATTCCACAACAATGCGGTTATGAATAAGCTGGAGTACACTGTTACCACGCTCTTAAACGAGTTGCAGACCTACCAGTCTCTCATGAAAAGTAAGG
GACAAGAAGGGGAGGCAAATGTTGCCACCTCAAAGTGGTTCAACCGAGGTTCGTCCTCTGGAACCAAGTCTGCACCCTCTTTTTCTGGAAGTAAGACTTTCAAGAAGAAG
AAGGCTGCTGGTAAGGGGTCTAAACCTGACTCCGCTGTTGCCGCTGCCAAGAAAGGCAAGGCCAAGGCTGCAGACAAAGGAAAATGTTTCCCTTGCAACCTGGACGGGCA
TTGGAAGCGCAATTGCCCGAAGGAATTAGTTCCTAGAGGCAGCTTGACATCCGAAAAGATGACTCTCAAGGTCGGAACGAGAGAGGTCGTCTCAACTGCGGCATCGCCAA
CCACCTTCTCTTCCTCTCTTCCTCCGGCGACCGGCGGTACCAGCGACAACTCGACCACACGCGGCCGCAGGCTCGACGACGGGTCTTCTCGTTTCGGTTGCTTCGAGCAG
ACCTGTGAACGCCGTGTTGCGGCAGCCCTACGACGATCCTCGGTGACGTTCGTCTATCCCCGACAATCCCATAGCAGCGGCGGCCGGCGACCCACGACGTTCCAGCGGCT
GCGAATCCAAGTGGAGAGTATATGGACTAGAGATCGGGTGGAGAGTATATGA
mRNA sequenceShow/hide mRNA sequence
ATGTCGAAAAATACCCGAGACAATGCCTCGACACCTAAGGGCAGCGCCACGGCACTACTGCAGCAAAAAGCTGCAGCAACAATGCCGCGGCGCTACAATGCAACGTCGTG
GCACTGTGTGCTGTCCCTGCAGCGCCATGATGTTATCTGCGTGGCCTGCTTCTTCTGCAAAAATAGGACAGGGAATAACAAAGAAGGTGAACTCCCAACATCTCTTTCAA
GCAATCTCTTTCAATTTCCCTTGTCGTTCCAAAGAAACGCTCCCACAAGCACGATTTTGGTACCAGAGGATAGCTCGGAAGATCGTTTGGTGGTGTTAGGGAATTCTTTT
GAAGAATTGTTCAACAAAGTCTGTAAGTCTGGGTCGCTCGAAAGCTGTTTGTGGCGCCGGATTGGGCGAAAAGTTGCAGAAAACAGCGAAGAAGACGAAGCAGACCGTGC
AGACAGCGCCATGGCACTGCAAGGACAGCACACAATGCCACAGCGCTACATTGTAGTGTCGTGGCGCTGTGCAGCACCATGGCGCCATGCTATAGCGCCGCGACGCTGCC
CTCAGGCGCCAAGGCGCTGTCCCGTCATGGCTACTTCTATTATTGCACTCCTAGTCGCGCAAAAACTTAACAGCGAGAATTACAAACAATGGAAATCGAATCTAAACACT
ATTCTCGTGATAGATGATCTTAGGTTCGTCTTGCAAGAGGATTGTCCTCAAGCTCATATGTCTAACGCCACTGTGCTGAGTCAGGTCAACTTCATTCTGGAATCTCTTCC
GAAGAGTTTCTTGCCATTCCACAACAATGCGGTTATGAATAAGCTGGAGTACACTGTTACCACGCTCTTAAACGAGTTGCAGACCTACCAGTCTCTCATGAAAAGTAAGG
GACAAGAAGGGGAGGCAAATGTTGCCACCTCAAAGTGGTTCAACCGAGGTTCGTCCTCTGGAACCAAGTCTGCACCCTCTTTTTCTGGAAGTAAGACTTTCAAGAAGAAG
AAGGCTGCTGGTAAGGGGTCTAAACCTGACTCCGCTGTTGCCGCTGCCAAGAAAGGCAAGGCCAAGGCTGCAGACAAAGGAAAATGTTTCCCTTGCAACCTGGACGGGCA
TTGGAAGCGCAATTGCCCGAAGGAATTAGTTCCTAGAGGCAGCTTGACATCCGAAAAGATGACTCTCAAGGTCGGAACGAGAGAGGTCGTCTCAACTGCGGCATCGCCAA
CCACCTTCTCTTCCTCTCTTCCTCCGGCGACCGGCGGTACCAGCGACAACTCGACCACACGCGGCCGCAGGCTCGACGACGGGTCTTCTCGTTTCGGTTGCTTCGAGCAG
ACCTGTGAACGCCGTGTTGCGGCAGCCCTACGACGATCCTCGGTGACGTTCGTCTATCCCCGACAATCCCATAGCAGCGGCGGCCGGCGACCCACGACGTTCCAGCGGCT
GCGAATCCAAGTGGAGAGTATATGGACTAGAGATCGGGTGGAGAGTATATGA
Protein sequenceShow/hide protein sequence
MSKNTRDNASTPKGSATALLQQKAAATMPRRYNATSWHCVLSLQRHDVICVACFFCKNRTGNNKEGELPTSLSSNLFQFPLSFQRNAPTSTILVPEDSSEDRLVVLGNSF
EELFNKVCKSGSLESCLWRRIGRKVAENSEEDEADRADSAMALQGQHTMPQRYIVVSWRCAAPWRHAIAPRRCPQAPRRCPVMATSIIALLVAQKLNSENYKQWKSNLNT
ILVIDDLRFVLQEDCPQAHMSNATVLSQVNFILESLPKSFLPFHNNAVMNKLEYTVTTLLNELQTYQSLMKSKGQEGEANVATSKWFNRGSSSGTKSAPSFSGSKTFKKK
KAAGKGSKPDSAVAAAKKGKAKAADKGKCFPCNLDGHWKRNCPKELVPRGSLTSEKMTLKVGTREVVSTAASPTTFSSSLPPATGGTSDNSTTRGRRLDDGSSRFGCFEQ
TCERRVAAALRRSSVTFVYPRQSHSSGGRRPTTFQRLRIQVESIWTRDRVESI