; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc01g02080 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc01g02080
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionGag/pol protein
Genome locationchr1:1394343..1399321
RNA-Seq ExpressionMoc01g02080
SyntenyMoc01g02080
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ADJ18449.1 gag/pol protein, partial [Bryonia dioica]3.2e-4046.49Show/hide
Query:  LSEGSSVREHVLNMMVHFNVAEVN--------------------------NAVMNKI-------------------------------SQKKFHKGSSSG
        + EG+SVREHVL+MM+HFN+AEVN                          NA +NKI                               +++KF +GSSS 
Subjt:  LSEGSSVREHVLNMMVHFNVAEVN--------------------------NAVMNKI-------------------------------SQKKFHKGSSSG

Query:  SKSGPSSQKKGIQKKKKDNGKGKASADAKDKEKIAKGKCFHCNENGHWKRNCPKYLTEKRAEKEKQGKFDLLVIETCIVENDDTTLILDLGAINHVCSSI
        +K GPS      Q KKK  GKGKA   +K K+   KGKCFHCN++GHWKRNCPKYL EK+AEK  QGK+DLLV+ETC+VE D +T ILD GA NH+C S 
Subjt:  SKSGPSSQKKGIQKKKKDNGKGKASADAKDKEKIAKGKCFHCNENGHWKRNCPKYLTEKRAEKEKQGKFDLLVIETCIVENDDTTLILDLGAINHVCSSI

Query:  KETCSWRQLEEDEVTIWVGSGELISAKA
        +ET SW++L+E E+T+ VG+GE++SA+A
Subjt:  KETCSWRQLEEDEVTIWVGSGELISAKA

KAA0041922.1 gag/pol protein [Cucumis melo var. makuwa]3.8e-4150Show/hide
Query:  LSEGSSVREHVLNMMVHFNVAEVNNAVMNKISQ---------------------KKFHKGSSSGSKSGPSSQKKGIQKKKKDNGKGKASADAKDKEKIAK
        + EG+SVREHVL+MM+HFN+AEVN   +++ +Q                     +KF +GSSS  K GPS   + I+KK    GK K S   K K+   K
Subjt:  LSEGSSVREHVLNMMVHFNVAEVNNAVMNKISQ---------------------KKFHKGSSSGSKSGPSSQKKGIQKKKKDNGKGKASADAKDKEKIAK

Query:  GKCFHCNENGHWKRNCPKYLTEKRAEKEKQGKFDLLVIETCIVENDDTTLILDLGAINHVCSSIKETCSWRQLEEDEVTIWVGSGELISAKA
        GKC+HC E+GHW RNC KY+ +K+AEKE QGKFDLLV+ETC+VEN+++T ILD GAINH+C S +E  SW++L E ++T+ VG+ E+ SAKA
Subjt:  GKCFHCNENGHWKRNCPKYLTEKRAEKEKQGKFDLLVIETCIVENDDTTLILDLGAINHVCSSIKETCSWRQLEEDEVTIWVGSGELISAKA

KAA0048404.1 gag/pol protein [Cucumis melo var. makuwa]7.2e-4046.53Show/hide
Query:  LYNSKMAGLHLQDLLSEGSSVREHVLNMMVHFNVAEVN--------------------------NAVMNKI-----------------------------
        +YN++M         +EG+SVREHVLNMMVHFNVAE+N                          NAVMNKI                             
Subjt:  LYNSKMAGLHLQDLLSEGSSVREHVLNMMVHFNVAEVN--------------------------NAVMNKI-----------------------------

Query:  --SQKKFHKGSSSGSKSGPSSQKKGIQKKKK---DNGKGKASADAKDKEKIAKGKCFHCNENGHWKRNCPKYLTEKRAEKEKQGKFDLLVIETCIVENDD
          S +KFH+GS+SG+KS PSS      KKKK    N    A+A    K K AKG CFHCN+ GHWKRNCPKYL EK+  K KQGK+DLLV+ETC+VENDD
Subjt:  --SQKKFHKGSSSGSKSGPSSQKKGIQKKKK---DNGKGKASADAKDKEKIAKGKCFHCNENGHWKRNCPKYLTEKRAEKEKQGKFDLLVIETCIVENDD

Query:  TTLILDLGAINHVCSSIKETCSWRQLEEDEVTIWVGSGELISAKA
        +  I+D GA NHVCSS +   SWRQLE  E+T+ VG+G ++SA A
Subjt:  TTLILDLGAINHVCSSIKETCSWRQLEEDEVTIWVGSGELISAKA

KAA0056663.1 gag/pol protein [Cucumis melo var. makuwa]1.1e-4049.21Show/hide
Query:  LSEGSSVREHVLNMMVHFNVAEVNNAVMNKISQ------------------KKFHKGSSSGSKSGPSSQKKGIQKKKKDNGKGKASADAKDKEKIAKGKC
        + EG+SVREHVL+MM+H N+AEVN  V+++ +Q                  +KF +GSSS +K GPS  K  I+KK+    KG+     K K+   KGKC
Subjt:  LSEGSSVREHVLNMMVHFNVAEVNNAVMNKISQ------------------KKFHKGSSSGSKSGPSSQKKGIQKKKKDNGKGKASADAKDKEKIAKGKC

Query:  FHCNENGHWKRNCPKYLTEKRAEKEKQGKFDLLVIETCIVENDDTTLILDLGAINHVCSSIKETCSWRQLEEDEVTIWVGSGELISAKA
        +HC++ GHW RNCPKYL +K+AEKE QGK+DLLV+ETC+VE +++T ILDLGA  H+C S +E  SW++L E E+T+ VG+ E++SA+A
Subjt:  FHCNENGHWKRNCPKYLTEKRAEKEKQGKFDLLVIETCIVENDDTTLILDLGAINHVCSSIKETCSWRQLEEDEVTIWVGSGELISAKA

TYK04622.1 gag/pol protein [Cucumis melo var. makuwa]1.1e-4049.21Show/hide
Query:  LSEGSSVREHVLNMMVHFNVAEVNNAVMNKISQ------------------KKFHKGSSSGSKSGPSSQKKGIQKKKKDNGKGKASADAKDKEKIAKGKC
        + EG+SVREHVL+MM+H N+AEVN  V+++ +Q                  +KF +GSSS +K GPS  K  I+KK+    KG+     K K+   KGKC
Subjt:  LSEGSSVREHVLNMMVHFNVAEVNNAVMNKISQ------------------KKFHKGSSSGSKSGPSSQKKGIQKKKKDNGKGKASADAKDKEKIAKGKC

Query:  FHCNENGHWKRNCPKYLTEKRAEKEKQGKFDLLVIETCIVENDDTTLILDLGAINHVCSSIKETCSWRQLEEDEVTIWVGSGELISAKA
        +HC++ GHW RNCPKYL +K+AEKE QGK+DLLV+ETC+VE +++T ILDLGA  H+C S +E  SW++L E E+T+ VG+ E++SA+A
Subjt:  FHCNENGHWKRNCPKYLTEKRAEKEKQGKFDLLVIETCIVENDDTTLILDLGAINHVCSSIKETCSWRQLEEDEVTIWVGSGELISAKA

TrEMBL top hitse value%identityAlignment
A0A5A7SMH8 Gag/pol protein3.5e-4046.53Show/hide
Query:  LYNSKMAGLHLQDLLSEGSSVREHVLNMMVHFNVAEVN--------------------------NAVMNKI-----------------------------
        +YN++M         +EG+SVREHVLNMMVHFNVAE+N                          NAVMNKI                             
Subjt:  LYNSKMAGLHLQDLLSEGSSVREHVLNMMVHFNVAEVN--------------------------NAVMNKI-----------------------------

Query:  --SQKKFHKGSSSGSKSGPSSQKKGIQKKKK---DNGKGKASADAKDKEKIAKGKCFHCNENGHWKRNCPKYLTEKRAEKEKQGKFDLLVIETCIVENDD
          S +KFH+GS+SG+KS PSS      KKKK    N    A+A    K K AKG CFHCN+ GHWKRNCPKYL EK+  K KQGK+DLLV+ETC+VENDD
Subjt:  --SQKKFHKGSSSGSKSGPSSQKKGIQKKKK---DNGKGKASADAKDKEKIAKGKCFHCNENGHWKRNCPKYLTEKRAEKEKQGKFDLLVIETCIVENDD

Query:  TTLILDLGAINHVCSSIKETCSWRQLEEDEVTIWVGSGELISAKA
        +  I+D GA NHVCSS +   SWRQLE  E+T+ VG+G ++SA A
Subjt:  TTLILDLGAINHVCSSIKETCSWRQLEEDEVTIWVGSGELISAKA

A0A5A7UL81 Gag/pol protein5.3e-4149.21Show/hide
Query:  LSEGSSVREHVLNMMVHFNVAEVNNAVMNKISQ------------------KKFHKGSSSGSKSGPSSQKKGIQKKKKDNGKGKASADAKDKEKIAKGKC
        + EG+SVREHVL+MM+H N+AEVN  V+++ +Q                  +KF +GSSS +K GPS  K  I+KK+    KG+     K K+   KGKC
Subjt:  LSEGSSVREHVLNMMVHFNVAEVNNAVMNKISQ------------------KKFHKGSSSGSKSGPSSQKKGIQKKKKDNGKGKASADAKDKEKIAKGKC

Query:  FHCNENGHWKRNCPKYLTEKRAEKEKQGKFDLLVIETCIVENDDTTLILDLGAINHVCSSIKETCSWRQLEEDEVTIWVGSGELISAKA
        +HC++ GHW RNCPKYL +K+AEKE QGK+DLLV+ETC+VE +++T ILDLGA  H+C S +E  SW++L E E+T+ VG+ E++SA+A
Subjt:  FHCNENGHWKRNCPKYLTEKRAEKEKQGKFDLLVIETCIVENDDTTLILDLGAINHVCSSIKETCSWRQLEEDEVTIWVGSGELISAKA

A0A5D3C0C2 Gag/pol protein5.3e-4149.21Show/hide
Query:  LSEGSSVREHVLNMMVHFNVAEVNNAVMNKISQ------------------KKFHKGSSSGSKSGPSSQKKGIQKKKKDNGKGKASADAKDKEKIAKGKC
        + EG+SVREHVL+MM+H N+AEVN  V+++ +Q                  +KF +GSSS +K GPS  K  I+KK+    KG+     K K+   KGKC
Subjt:  LSEGSSVREHVLNMMVHFNVAEVNNAVMNKISQ------------------KKFHKGSSSGSKSGPSSQKKGIQKKKKDNGKGKASADAKDKEKIAKGKC

Query:  FHCNENGHWKRNCPKYLTEKRAEKEKQGKFDLLVIETCIVENDDTTLILDLGAINHVCSSIKETCSWRQLEEDEVTIWVGSGELISAKA
        +HC++ GHW RNCPKYL +K+AEKE QGK+DLLV+ETC+VE +++T ILDLGA  H+C S +E  SW++L E E+T+ VG+ E++SA+A
Subjt:  FHCNENGHWKRNCPKYLTEKRAEKEKQGKFDLLVIETCIVENDDTTLILDLGAINHVCSSIKETCSWRQLEEDEVTIWVGSGELISAKA

A0A5D3D345 Gag/pol protein1.8e-4150Show/hide
Query:  LSEGSSVREHVLNMMVHFNVAEVNNAVMNKISQ---------------------KKFHKGSSSGSKSGPSSQKKGIQKKKKDNGKGKASADAKDKEKIAK
        + EG+SVREHVL+MM+HFN+AEVN   +++ +Q                     +KF +GSSS  K GPS   + I+KK    GK K S   K K+   K
Subjt:  LSEGSSVREHVLNMMVHFNVAEVNNAVMNKISQ---------------------KKFHKGSSSGSKSGPSSQKKGIQKKKKDNGKGKASADAKDKEKIAK

Query:  GKCFHCNENGHWKRNCPKYLTEKRAEKEKQGKFDLLVIETCIVENDDTTLILDLGAINHVCSSIKETCSWRQLEEDEVTIWVGSGELISAKA
        GKC+HC E+GHW RNC KY+ +K+AEKE QGKFDLLV+ETC+VEN+++T ILD GAINH+C S +E  SW++L E ++T+ VG+ E+ SAKA
Subjt:  GKCFHCNENGHWKRNCPKYLTEKRAEKEKQGKFDLLVIETCIVENDDTTLILDLGAINHVCSSIKETCSWRQLEEDEVTIWVGSGELISAKA

E2GK51 Gag/pol protein (Fragment)1.6e-4046.49Show/hide
Query:  LSEGSSVREHVLNMMVHFNVAEVN--------------------------NAVMNKI-------------------------------SQKKFHKGSSSG
        + EG+SVREHVL+MM+HFN+AEVN                          NA +NKI                               +++KF +GSSS 
Subjt:  LSEGSSVREHVLNMMVHFNVAEVN--------------------------NAVMNKI-------------------------------SQKKFHKGSSSG

Query:  SKSGPSSQKKGIQKKKKDNGKGKASADAKDKEKIAKGKCFHCNENGHWKRNCPKYLTEKRAEKEKQGKFDLLVIETCIVENDDTTLILDLGAINHVCSSI
        +K GPS      Q KKK  GKGKA   +K K+   KGKCFHCN++GHWKRNCPKYL EK+AEK  QGK+DLLV+ETC+VE D +T ILD GA NH+C S 
Subjt:  SKSGPSSQKKGIQKKKKDNGKGKASADAKDKEKIAKGKCFHCNENGHWKRNCPKYLTEKRAEKEKQGKFDLLVIETCIVENDDTTLILDLGAINHVCSSI

Query:  KETCSWRQLEEDEVTIWVGSGELISAKA
        +ET SW++L+E E+T+ VG+GE++SA+A
Subjt:  KETCSWRQLEEDEVTIWVGSGELISAKA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTCTCTTCCATTAGGTCACACCGTGAGATTCTTATATTGCACCTGCTTGGCGTTTATAAGCGCCTATCCCTTTGGAGGGTTGTAACAGAGGATTCGGAACAC
CGTGAACTTCAGAAATGGATAGGGTTACTTAGGTCCGTTTTCAGCAATTGTCTCTCCTTCGAGGGCTTGTTTGATTCGGATCTCTATAATTCGAAAATGGCGGGT
CTACACTTACAAGACTTGCTAAGTGAAGGATCTTCCGTAAGGGAGCATGTTTTGAACATGATGGTTCACTTCAACGTTGCAGAAGTGAACAATGCAGTCATGAAC
AAAATAAGTCAGAAGAAATTCCACAAGGGATCTTCCTCTGGGAGTAAATCTGGACCTTCTTCTCAGAAGAAAGGAATTCAGAAGAAGAAGAAGGACAATGGGAAG
GGGAAGGCTTCGGCTGATGCAAAAGACAAGGAAAAGATTGCAAAAGGAAAATGTTTCCATTGCAATGAAAATGGGCACTGGAAAAGAAATTGCCCCAAATACCTC
ACCGAGAAAAGAGCTGAGAAGGAAAAGCAAGGTAAATTCGATTTACTAGTCATTGAAACATGTATAGTGGAGAATGATGATACTACCTTGATACTAGATTTAGGA
GCCATTAATCATGTTTGTTCTTCTATTAAGGAAACTTGTTCCTGGAGACAACTTGAAGAAGATGAGGTCACTATCTGGGTTGGATCAGGGGAGCTCATCTCAGCA
AAAGCAGGGAAGGCATGGTGTCAGCTCTCCACCACCCCTGGGCCCTGTGGCCTGAGTGAGCTACCCCATGGCACTTGCGTGTGTCATGGCGTGGGCGAGCGTTGC
AAGACCAACTTGGGTGTCTTCATTCGAGAATGGGAATCTCAAGGAGTGTGTCGGACGGCATGGGTGTGCCGGCATTGCACCTCGTCCCATGGGTTGGAGGCTGAA
CATAGGGGTCGTTATGCGGTACGGTATGCTCGAACGACACGGCACGGGCACAAGAGTCGTCGTGCCTCGATGGTGTCCGGATGGCTGGATGTCCGAAAGGCCTCG
ATCATAAATACAAGTCCATGGGTTGGTTCTCGATATGCGTGCCAAAGTGAGCAGTTCAATGACATCCTGATGCTCGGGGTGGCATCCTACAGTACCGGTGGGCAC
TCGGTAAGAATCATCCTGGCAATACCTGGTAAGGCGTTTAGCCCGTTATTGACTAGTTTCTCGGCAGGAATAGTCTTAGAGGCGTCCGATGGACTCGATAAGCTA
GTTTTTACATGGTTAAGCGGTAGGAATCTCCCTGTTGTACCCGATTGTCGTTTTAAGCTCGTTCTGACCCCGTTATGGGTTAGAACTTATCCCGGTAAGGTCCTG
GCTCATCATAGGAGCGGAACGACCGAGTGTCTGAGAACCAAGATGATTGGAAATGAGTTGCGACGTCAGAATTCCACTGCAAGTGAAGCTCGATCGAGGAAGAAA
GAACCTCGCCTAAGGTCTGACGAAGCCACAAAGGGCACGAAAAGTGGATTGGGGCTAGATTTTCCTATTGAAGACTCCGCAATTATGTCAAGAGTGCTAGTTCGG
CAGGGCGGTTACGTTAGTCAGTATCAGAGGTTGCATGACGAGCTTAGAGTGGACGACTTCGAAGAGAAGTTAAGAGGCCAGTGTTTGCTCAACAAGGAGCGTCAG
CCTGGTGCGATTGTGTTCACAAGAAGCTGCAGGCAGAGAAGAGTCGTGCAGCGTATTCGCCACATGAAGGTGGCGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTCTCTTCCATTAGGTCACACCGTGAGATTCTTATATTGCACCTGCTTGGCGTTTATAAGCGCCTATCCCTTTGGAGGGTTGTAACAGAGGATTCGGAACAC
CGTGAACTTCAGAAATGGATAGGGTTACTTAGGTCCGTTTTCAGCAATTGTCTCTCCTTCGAGGGCTTGTTTGATTCGGATCTCTATAATTCGAAAATGGCGGGT
CTACACTTACAAGACTTGCTAAGTGAAGGATCTTCCGTAAGGGAGCATGTTTTGAACATGATGGTTCACTTCAACGTTGCAGAAGTGAACAATGCAGTCATGAAC
AAAATAAGTCAGAAGAAATTCCACAAGGGATCTTCCTCTGGGAGTAAATCTGGACCTTCTTCTCAGAAGAAAGGAATTCAGAAGAAGAAGAAGGACAATGGGAAG
GGGAAGGCTTCGGCTGATGCAAAAGACAAGGAAAAGATTGCAAAAGGAAAATGTTTCCATTGCAATGAAAATGGGCACTGGAAAAGAAATTGCCCCAAATACCTC
ACCGAGAAAAGAGCTGAGAAGGAAAAGCAAGGTAAATTCGATTTACTAGTCATTGAAACATGTATAGTGGAGAATGATGATACTACCTTGATACTAGATTTAGGA
GCCATTAATCATGTTTGTTCTTCTATTAAGGAAACTTGTTCCTGGAGACAACTTGAAGAAGATGAGGTCACTATCTGGGTTGGATCAGGGGAGCTCATCTCAGCA
AAAGCAGGGAAGGCATGGTGTCAGCTCTCCACCACCCCTGGGCCCTGTGGCCTGAGTGAGCTACCCCATGGCACTTGCGTGTGTCATGGCGTGGGCGAGCGTTGC
AAGACCAACTTGGGTGTCTTCATTCGAGAATGGGAATCTCAAGGAGTGTGTCGGACGGCATGGGTGTGCCGGCATTGCACCTCGTCCCATGGGTTGGAGGCTGAA
CATAGGGGTCGTTATGCGGTACGGTATGCTCGAACGACACGGCACGGGCACAAGAGTCGTCGTGCCTCGATGGTGTCCGGATGGCTGGATGTCCGAAAGGCCTCG
ATCATAAATACAAGTCCATGGGTTGGTTCTCGATATGCGTGCCAAAGTGAGCAGTTCAATGACATCCTGATGCTCGGGGTGGCATCCTACAGTACCGGTGGGCAC
TCGGTAAGAATCATCCTGGCAATACCTGGTAAGGCGTTTAGCCCGTTATTGACTAGTTTCTCGGCAGGAATAGTCTTAGAGGCGTCCGATGGACTCGATAAGCTA
GTTTTTACATGGTTAAGCGGTAGGAATCTCCCTGTTGTACCCGATTGTCGTTTTAAGCTCGTTCTGACCCCGTTATGGGTTAGAACTTATCCCGGTAAGGTCCTG
GCTCATCATAGGAGCGGAACGACCGAGTGTCTGAGAACCAAGATGATTGGAAATGAGTTGCGACGTCAGAATTCCACTGCAAGTGAAGCTCGATCGAGGAAGAAA
GAACCTCGCCTAAGGTCTGACGAAGCCACAAAGGGCACGAAAAGTGGATTGGGGCTAGATTTTCCTATTGAAGACTCCGCAATTATGTCAAGAGTGCTAGTTCGG
CAGGGCGGTTACGTTAGTCAGTATCAGAGGTTGCATGACGAGCTTAGAGTGGACGACTTCGAAGAGAAGTTAAGAGGCCAGTGTTTGCTCAACAAGGAGCGTCAG
CCTGGTGCGATTGTGTTCACAAGAAGCTGCAGGCAGAGAAGAGTCGTGCAGCGTATTCGCCACATGAAGGTGGCGTGA
Protein sequenceShow/hide protein sequence
MVSSIRSHREILILHLLGVYKRLSLWRVVTEDSEHRELQKWIGLLRSVFSNCLSFEGLFDSDLYNSKMAGLHLQDLLSEGSSVREHVLNMMVHFNVAEVNNAVMN
KISQKKFHKGSSSGSKSGPSSQKKGIQKKKKDNGKGKASADAKDKEKIAKGKCFHCNENGHWKRNCPKYLTEKRAEKEKQGKFDLLVIETCIVENDDTTLILDLG
AINHVCSSIKETCSWRQLEEDEVTIWVGSGELISAKAGKAWCQLSTTPGPCGLSELPHGTCVCHGVGERCKTNLGVFIREWESQGVCRTAWVCRHCTSSHGLEAE
HRGRYAVRYARTTRHGHKSRRASMVSGWLDVRKASIINTSPWVGSRYACQSEQFNDILMLGVASYSTGGHSVRIILAIPGKAFSPLLTSFSAGIVLEASDGLDKL
VFTWLSGRNLPVVPDCRFKLVLTPLWVRTYPGKVLAHHRSGTTECLRTKMIGNELRRQNSTASEARSRKKEPRLRSDEATKGTKSGLGLDFPIEDSAIMSRVLVR
QGGYVSQYQRLHDELRVDDFEEKLRGQCLLNKERQPGAIVFTRSCRQRRVVQRIRHMKVA