; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc06g17860 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc06g17860
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionGag/pol protein
Genome locationchr6:13966992..13972632
RNA-Seq ExpressionMoc06g17860
SyntenyMoc06g17860
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022156835.1 uncharacterized protein LOC111023669 [Momordica charantia]4.5e-7263.83Show/hide
Query:  LKIETYYNGLDDTTRLVIDAPTNDTLLAKPYAEACNILERISSNNHSWSDPRVIQGKG---------------------------------VGTSTGKVN
        ++IE YY GLDD TRLVIDA TN  LL KPYAEA NILERISSNNHSWSDPR IQG+G                                 VG S GK N
Subjt:  LKIETYYNGLDDTTRLVIDAPTNDTLLAKPYAEACNILERISSNNHSWSDPRVIQGKG---------------------------------VGTSTGKVN

Query:  VGHIQEISCLFFEGDHHYNNCPGNPESVYYLGNLQNNRNNTYSNTYNPGWRNQPNFSWSGNQGGNNVGTSNGSAYQPKGKYTPGFANQGQVAAQKPAEGT
        V HIQ ISC F EG+HHYNN P NPESVYYLGN QNN  N+YSNTYNPGWRN PNFSWSGNQGGNN GTSN  AYQ K  Y P F+NQGQV  Q  +EG+
Subjt:  VGHIQEISCLFFEGDHHYNNCPGNPESVYYLGNLQNNRNNTYSNTYNPGWRNQPNFSWSGNQGGNNVGTSNGSAYQPKGKYTPGFANQGQVAAQKPAEGT

Query:  IASLEKSMKQYMANSDATVQSQAASLRNLELQVGQ
         ASLE  MK+ M  +D TVQSQAASLRNLE+QVGQ
Subjt:  IASLEKSMKQYMANSDATVQSQAASLRNLELQVGQ

XP_022158598.1 uncharacterized protein LOC111025053 [Momordica charantia]1.0e-5258.26Show/hide
Query:  SLKIETYYNGLDDTTRLVIDAPTNDTLLAKPYAEACNILERISSNNHSWSDPRVIQGKGVGTSTGKVNVGHIQEISCLFFEGDHHYNNCPGNPESVYYLG
        S++IETYY GLD+ TRLVIDA  N  LL KPYA+A NILERISS+NHSWSD R I+GK                 S         Y       E +  L 
Subjt:  SLKIETYYNGLDDTTRLVIDAPTNDTLLAKPYAEACNILERISSNNHSWSDPRVIQGKGVGTSTGKVNVGHIQEISCLFFEGDHHYNNCPGNPESVYYLG

Query:  NLQNNRNNTYSNTYNPGWRNQPNFSWSGNQGGNNVGTSNGSAYQPKGKYTPGFANQGQVAAQKPAEGTIASLEKSMKQYMANSDATVQSQAASLRNLELQ
           NNRN +YSNTYNP  RN PNF WSGNQGG+N G SN   +Q K  Y PGFA QGQ+     ++G+I SLE  MKQYMAN+DATVQSQAASLRNLELQ
Subjt:  NLQNNRNNTYSNTYNPGWRNQPNFSWSGNQGGNNVGTSNGSAYQPKGKYTPGFANQGQVAAQKPAEGTIASLEKSMKQYMANSDATVQSQAASLRNLELQ

Query:  VGQLATDLKSLPYGALPS
        VGQLA DLKS P GALPS
Subjt:  VGQLATDLKSLPYGALPS

XP_022158611.1 uncharacterized protein LOC111025065 [Momordica charantia]2.3e-6061.4Show/hide
Query:  LKIETYYNGLDDTTRL----VIDAPTNDTLLAKPYAEACNILERISSNNHSWSDPRVIQGKGVGTSTGKVNVGHIQEISCLFFEGDHHYNNCPGNPESVY
        ++IETYY  L+D TRL     +   ++  L+    +E+   L     N        ++Q   VG  TG  NV  IQ ISC F EGDHHYNNCPGNPESVY
Subjt:  LKIETYYNGLDDTTRL----VIDAPTNDTLLAKPYAEACNILERISSNNHSWSDPRVIQGKGVGTSTGKVNVGHIQEISCLFFEGDHHYNNCPGNPESVY

Query:  YLGNLQNNRNNTYSNTYNPGWRNQPNFSWSGNQGGNNVGTSNGSAYQPKGKYTPGFANQGQVAAQKPAEGTIASLEKSMKQYMANSDATVQSQAASLRNL
        YLGN QNNRNN YSNTYNPGWRN PNFSWSG+QGG+N GTS+  A+Q K  Y PGF NQGQ+ A++ +EG+IASLEK MKQYMAN+DATVQSQA SLRNL
Subjt:  YLGNLQNNRNNTYSNTYNPGWRNQPNFSWSGNQGGNNVGTSNGSAYQPKGKYTPGFANQGQVAAQKPAEGTIASLEKSMKQYMANSDATVQSQAASLRNL

Query:  ELQVGQLATDLKSLP
        +LQVGQLATDLKS P
Subjt:  ELQVGQLATDLKSLP

XP_022158740.1 uncharacterized protein LOC111025203 [Momordica charantia]3.3e-5168.94Show/hide
Query:  KGVGTSTGKVNVGHIQEISCLFFEGDHHYNNCPGNPESVYYLGNLQNNRNNTYSNTYNPGWRNQPNFSWSGNQGGNNVGTSNGSAYQPKGKYTPGFANQG
        K    +  K NV HIQ IS  F EG+HHYN+CP NP+SVYYLGN  NN NN YSNTYN GW + PNFSWS NQG N+VGTSN  AYQ KG Y P  ANQG
Subjt:  KGVGTSTGKVNVGHIQEISCLFFEGDHHYNNCPGNPESVYYLGNLQNNRNNTYSNTYNPGWRNQPNFSWSGNQGGNNVGTSNGSAYQPKGKYTPGFANQG

Query:  QVAAQKPAEGTIASLEKSMKQYMANSDATVQSQAASLRNLELQVGQLATDLKSLPYGALPS
        Q A QKP +G+ ASLE  MKQYM  ++ TVQS AASLRNLELQVGQLATDLKS PYGALPS
Subjt:  QVAAQKPAEGTIASLEKSMKQYMANSDATVQSQAASLRNLELQVGQLATDLKSLPYGALPS

XP_022159060.1 uncharacterized protein LOC111025500 [Momordica charantia]4.2e-4647.66Show/hide
Query:  LKIETYYNGLDDTTRLVIDAPTNDTLLAKPYAEACNILERISSNNHSWSDPRVIQGKG---------------------------------VGTSTGKVN
        ++I+TYYNGLDD TRLVIDA  N  LLAKPYAEA NILERISSNN SWSDPR I GKG                                 VG S GK N
Subjt:  LKIETYYNGLDDTTRLVIDAPTNDTLLAKPYAEACNILERISSNNHSWSDPRVIQGKG---------------------------------VGTSTGKVN

Query:  VGHIQEISCLFFEGDHHYNNCPGNPESVYYLGNLQNNRNNTYSNTYNPGWRNQPNFSWSGNQGGNNVGTSNGSAYQPKGKYTPGFANQGQVAAQKPAEGT
        V HIQ ISC F  G++ YNNCPGNPESV+YLGN QNN NN YS             +W+G                                      GT
Subjt:  VGHIQEISCLFFEGDHHYNNCPGNPESVYYLGNLQNNRNNTYSNTYNPGWRNQPNFSWSGNQGGNNVGTSNGSAYQPKGKYTPGFANQGQVAAQKPAEGT

Query:  IASL------EKSMKQYMANSDATVQSQAASLRNLELQVGQLATDLKSLPYGALPS
        I +L      +++M +YM N+D TVQSQA SLRNLE+QVGQLATDLKS P G LPS
Subjt:  IASL------EKSMKQYMANSDATVQSQAASLRNLELQVGQLATDLKSLPYGALPS

TrEMBL top hitse value%identityAlignment
A0A6J1DRG1 uncharacterized protein LOC1110236692.2e-7263.83Show/hide
Query:  LKIETYYNGLDDTTRLVIDAPTNDTLLAKPYAEACNILERISSNNHSWSDPRVIQGKG---------------------------------VGTSTGKVN
        ++IE YY GLDD TRLVIDA TN  LL KPYAEA NILERISSNNHSWSDPR IQG+G                                 VG S GK N
Subjt:  LKIETYYNGLDDTTRLVIDAPTNDTLLAKPYAEACNILERISSNNHSWSDPRVIQGKG---------------------------------VGTSTGKVN

Query:  VGHIQEISCLFFEGDHHYNNCPGNPESVYYLGNLQNNRNNTYSNTYNPGWRNQPNFSWSGNQGGNNVGTSNGSAYQPKGKYTPGFANQGQVAAQKPAEGT
        V HIQ ISC F EG+HHYNN P NPESVYYLGN QNN  N+YSNTYNPGWRN PNFSWSGNQGGNN GTSN  AYQ K  Y P F+NQGQV  Q  +EG+
Subjt:  VGHIQEISCLFFEGDHHYNNCPGNPESVYYLGNLQNNRNNTYSNTYNPGWRNQPNFSWSGNQGGNNVGTSNGSAYQPKGKYTPGFANQGQVAAQKPAEGT

Query:  IASLEKSMKQYMANSDATVQSQAASLRNLELQVGQ
         ASLE  MK+ M  +D TVQSQAASLRNLE+QVGQ
Subjt:  IASLEKSMKQYMANSDATVQSQAASLRNLELQVGQ

A0A6J1DWK1 uncharacterized protein LOC1110250535.0e-5358.26Show/hide
Query:  SLKIETYYNGLDDTTRLVIDAPTNDTLLAKPYAEACNILERISSNNHSWSDPRVIQGKGVGTSTGKVNVGHIQEISCLFFEGDHHYNNCPGNPESVYYLG
        S++IETYY GLD+ TRLVIDA  N  LL KPYA+A NILERISS+NHSWSD R I+GK                 S         Y       E +  L 
Subjt:  SLKIETYYNGLDDTTRLVIDAPTNDTLLAKPYAEACNILERISSNNHSWSDPRVIQGKGVGTSTGKVNVGHIQEISCLFFEGDHHYNNCPGNPESVYYLG

Query:  NLQNNRNNTYSNTYNPGWRNQPNFSWSGNQGGNNVGTSNGSAYQPKGKYTPGFANQGQVAAQKPAEGTIASLEKSMKQYMANSDATVQSQAASLRNLELQ
           NNRN +YSNTYNP  RN PNF WSGNQGG+N G SN   +Q K  Y PGFA QGQ+     ++G+I SLE  MKQYMAN+DATVQSQAASLRNLELQ
Subjt:  NLQNNRNNTYSNTYNPGWRNQPNFSWSGNQGGNNVGTSNGSAYQPKGKYTPGFANQGQVAAQKPAEGTIASLEKSMKQYMANSDATVQSQAASLRNLELQ

Query:  VGQLATDLKSLPYGALPS
        VGQLA DLKS P GALPS
Subjt:  VGQLATDLKSLPYGALPS

A0A6J1DWN2 uncharacterized protein LOC1110252031.6e-5168.94Show/hide
Query:  KGVGTSTGKVNVGHIQEISCLFFEGDHHYNNCPGNPESVYYLGNLQNNRNNTYSNTYNPGWRNQPNFSWSGNQGGNNVGTSNGSAYQPKGKYTPGFANQG
        K    +  K NV HIQ IS  F EG+HHYN+CP NP+SVYYLGN  NN NN YSNTYN GW + PNFSWS NQG N+VGTSN  AYQ KG Y P  ANQG
Subjt:  KGVGTSTGKVNVGHIQEISCLFFEGDHHYNNCPGNPESVYYLGNLQNNRNNTYSNTYNPGWRNQPNFSWSGNQGGNNVGTSNGSAYQPKGKYTPGFANQG

Query:  QVAAQKPAEGTIASLEKSMKQYMANSDATVQSQAASLRNLELQVGQLATDLKSLPYGALPS
        Q A QKP +G+ ASLE  MKQYM  ++ TVQS AASLRNLELQVGQLATDLKS PYGALPS
Subjt:  QVAAQKPAEGTIASLEKSMKQYMANSDATVQSQAASLRNLELQVGQLATDLKSLPYGALPS

A0A6J1DXK5 uncharacterized protein LOC1110255002.1e-4647.66Show/hide
Query:  LKIETYYNGLDDTTRLVIDAPTNDTLLAKPYAEACNILERISSNNHSWSDPRVIQGKG---------------------------------VGTSTGKVN
        ++I+TYYNGLDD TRLVIDA  N  LLAKPYAEA NILERISSNN SWSDPR I GKG                                 VG S GK N
Subjt:  LKIETYYNGLDDTTRLVIDAPTNDTLLAKPYAEACNILERISSNNHSWSDPRVIQGKG---------------------------------VGTSTGKVN

Query:  VGHIQEISCLFFEGDHHYNNCPGNPESVYYLGNLQNNRNNTYSNTYNPGWRNQPNFSWSGNQGGNNVGTSNGSAYQPKGKYTPGFANQGQVAAQKPAEGT
        V HIQ ISC F  G++ YNNCPGNPESV+YLGN QNN NN YS             +W+G                                      GT
Subjt:  VGHIQEISCLFFEGDHHYNNCPGNPESVYYLGNLQNNRNNTYSNTYNPGWRNQPNFSWSGNQGGNNVGTSNGSAYQPKGKYTPGFANQGQVAAQKPAEGT

Query:  IASL------EKSMKQYMANSDATVQSQAASLRNLELQVGQLATDLKSLPYGALPS
        I +L      +++M +YM N+D TVQSQA SLRNLE+QVGQLATDLKS P G LPS
Subjt:  IASL------EKSMKQYMANSDATVQSQAASLRNLELQVGQLATDLKSLPYGALPS

A0A6J1E1F3 uncharacterized protein LOC1110250651.1e-6061.4Show/hide
Query:  LKIETYYNGLDDTTRL----VIDAPTNDTLLAKPYAEACNILERISSNNHSWSDPRVIQGKGVGTSTGKVNVGHIQEISCLFFEGDHHYNNCPGNPESVY
        ++IETYY  L+D TRL     +   ++  L+    +E+   L     N        ++Q   VG  TG  NV  IQ ISC F EGDHHYNNCPGNPESVY
Subjt:  LKIETYYNGLDDTTRL----VIDAPTNDTLLAKPYAEACNILERISSNNHSWSDPRVIQGKGVGTSTGKVNVGHIQEISCLFFEGDHHYNNCPGNPESVY

Query:  YLGNLQNNRNNTYSNTYNPGWRNQPNFSWSGNQGGNNVGTSNGSAYQPKGKYTPGFANQGQVAAQKPAEGTIASLEKSMKQYMANSDATVQSQAASLRNL
        YLGN QNNRNN YSNTYNPGWRN PNFSWSG+QGG+N GTS+  A+Q K  Y PGF NQGQ+ A++ +EG+IASLEK MKQYMAN+DATVQSQA SLRNL
Subjt:  YLGNLQNNRNNTYSNTYNPGWRNQPNFSWSGNQGGNNVGTSNGSAYQPKGKYTPGFANQGQVAAQKPAEGTIASLEKSMKQYMANSDATVQSQAASLRNL

Query:  ELQVGQLATDLKSLP
        +LQVGQLATDLKS P
Subjt:  ELQVGQLATDLKSLP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATAGGCTGGAGTACACTCTTACCACGCTCTTAAACGAGCTGCAGACCTACCAGTCTTTTATGAAAAGTAAGGGACAAGAAGGGGAGGCAAACATTGCCACCTCAAA
AAGGTTCAACCGAGGTTCGTCCTCTGGAACCAAGTCTGCGCCCTCTTCTTCTAGAAGTAAGACTTTTAAGAAGAAGGCTGCTGGTAAGGGGTCTAAACCTGACTCCGCTG
TTGCCGCTGCCAAGAAAGGCAAGGTGAAGGTTGCAGAGAAAGGAAAGTGTTTCCACTGCAATATGGACGGGCATTGGAAGCACAACTGCCTAAAGTACTTGGCCGAAAAG
AAGAAAGCCAACGAAGGTAAATATGATTTACTTGTTCTGGAAACATGTTTAGTGGAGAACGATGACTCCGCCTGGATATTGGATTCAGGAGCCATTAATCACGTAGTGAT
TAACGAGATTTCTGAAGAGGCTACAAACACATCAACAAGAGTTGTTGATCAAACTGGCACTACAACAAGAGTTGTTGATGAAGCCAGTACATCACGTCAGTCACATCCAC
CTCAAGTGTTGAGGGTGCCTCGACGTAGTGGGAGAGTTGTGTCACAACCTGACCGCTATGTGGAAGTGTTCTGCAACAGAAAAAAGGCCTCAACGAAGAACTTCTGTGAT
TTCTCTCAATCAAGCTCTCTCCCTCAACTCTCCCTGACATTCCAAACAAAACGCTCCCACAAGCGTGTTCTCGAAACCCAAGAGGATAGCAAGGAAGACTCGGTGGTGCT
GTTCGGGTGGAAACCGTGGAAGAAAAGTTCTTTAAAGATTGAAACATACTACAATGGATTGGATGACACTACACGTTTGGTCATTGATGCCCCAACAAATGACACATTGC
TAGCAAAACCTTATGCTGAAGCTTGCAATATCTTGGAGAGGATATCATCGAACAATCATTCATGGTCAGACCCTAGAGTCATTCAAGGTAAAGGAGTGGGAACATCAACT
GGTAAGGTAAACGTCGGCCACATCCAGGAGATTTCTTGCTTATTCTTCGAGGGAGATCATCATTATAACAATTGCCCTGGCAATCCAGAGTCGGTTTACTATCTAGGGAA
TCTACAGAACAATAGAAACAACACATATTCCAACACATATAACCCCGGCTGGAGAAATCAACCCAATTTCAGTTGGAGTGGTAATCAGGGAGGAAATAATGTTGGCACCT
CCAATGGTTCAGCGTACCAGCCGAAAGGGAAATATACCCCAGGATTTGCGAATCAAGGTCAGGTAGCAGCACAGAAGCCCGCAGAAGGAACAATTGCGTCATTGGAAAAG
TCGATGAAGCAATATATGGCCAATAGCGATGCTACTGTGCAAAGCCAAGCCGCATCACTAAGAAATCTAGAACTGCAAGTAGGACAGTTAGCAACCGATTTGAAGAGCTT
ACCTTATGGAGCATTGCCAAGCTAG
mRNA sequenceShow/hide mRNA sequence
ATGAATAGGCTGGAGTACACTCTTACCACGCTCTTAAACGAGCTGCAGACCTACCAGTCTTTTATGAAAAGTAAGGGACAAGAAGGGGAGGCAAACATTGCCACCTCAAA
AAGGTTCAACCGAGGTTCGTCCTCTGGAACCAAGTCTGCGCCCTCTTCTTCTAGAAGTAAGACTTTTAAGAAGAAGGCTGCTGGTAAGGGGTCTAAACCTGACTCCGCTG
TTGCCGCTGCCAAGAAAGGCAAGGTGAAGGTTGCAGAGAAAGGAAAGTGTTTCCACTGCAATATGGACGGGCATTGGAAGCACAACTGCCTAAAGTACTTGGCCGAAAAG
AAGAAAGCCAACGAAGGTAAATATGATTTACTTGTTCTGGAAACATGTTTAGTGGAGAACGATGACTCCGCCTGGATATTGGATTCAGGAGCCATTAATCACGTAGTGAT
TAACGAGATTTCTGAAGAGGCTACAAACACATCAACAAGAGTTGTTGATCAAACTGGCACTACAACAAGAGTTGTTGATGAAGCCAGTACATCACGTCAGTCACATCCAC
CTCAAGTGTTGAGGGTGCCTCGACGTAGTGGGAGAGTTGTGTCACAACCTGACCGCTATGTGGAAGTGTTCTGCAACAGAAAAAAGGCCTCAACGAAGAACTTCTGTGAT
TTCTCTCAATCAAGCTCTCTCCCTCAACTCTCCCTGACATTCCAAACAAAACGCTCCCACAAGCGTGTTCTCGAAACCCAAGAGGATAGCAAGGAAGACTCGGTGGTGCT
GTTCGGGTGGAAACCGTGGAAGAAAAGTTCTTTAAAGATTGAAACATACTACAATGGATTGGATGACACTACACGTTTGGTCATTGATGCCCCAACAAATGACACATTGC
TAGCAAAACCTTATGCTGAAGCTTGCAATATCTTGGAGAGGATATCATCGAACAATCATTCATGGTCAGACCCTAGAGTCATTCAAGGTAAAGGAGTGGGAACATCAACT
GGTAAGGTAAACGTCGGCCACATCCAGGAGATTTCTTGCTTATTCTTCGAGGGAGATCATCATTATAACAATTGCCCTGGCAATCCAGAGTCGGTTTACTATCTAGGGAA
TCTACAGAACAATAGAAACAACACATATTCCAACACATATAACCCCGGCTGGAGAAATCAACCCAATTTCAGTTGGAGTGGTAATCAGGGAGGAAATAATGTTGGCACCT
CCAATGGTTCAGCGTACCAGCCGAAAGGGAAATATACCCCAGGATTTGCGAATCAAGGTCAGGTAGCAGCACAGAAGCCCGCAGAAGGAACAATTGCGTCATTGGAAAAG
TCGATGAAGCAATATATGGCCAATAGCGATGCTACTGTGCAAAGCCAAGCCGCATCACTAAGAAATCTAGAACTGCAAGTAGGACAGTTAGCAACCGATTTGAAGAGCTT
ACCTTATGGAGCATTGCCAAGCTAG
Protein sequenceShow/hide protein sequence
MNRLEYTLTTLLNELQTYQSFMKSKGQEGEANIATSKRFNRGSSSGTKSAPSSSRSKTFKKKAAGKGSKPDSAVAAAKKGKVKVAEKGKCFHCNMDGHWKHNCLKYLAEK
KKANEGKYDLLVLETCLVENDDSAWILDSGAINHVVINEISEEATNTSTRVVDQTGTTTRVVDEASTSRQSHPPQVLRVPRRSGRVVSQPDRYVEVFCNRKKASTKNFCD
FSQSSSLPQLSLTFQTKRSHKRVLETQEDSKEDSVVLFGWKPWKKSSLKIETYYNGLDDTTRLVIDAPTNDTLLAKPYAEACNILERISSNNHSWSDPRVIQGKGVGTST
GKVNVGHIQEISCLFFEGDHHYNNCPGNPESVYYLGNLQNNRNNTYSNTYNPGWRNQPNFSWSGNQGGNNVGTSNGSAYQPKGKYTPGFANQGQVAAQKPAEGTIASLEK
SMKQYMANSDATVQSQAASLRNLELQVGQLATDLKSLPYGALPS