; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc03g00760 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc03g00760
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionIntegrase catalytic domain-containing protein
Genome locationchr3:534708..535544
RNA-Seq ExpressionMoc03g00760
SyntenyMoc03g00760
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
RVX06074.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]7.6e-3236.65Show/hide
Query:  MEPPPYINKAFSLVNQEEQQRLIHATQSVNSASSAFALAVPQSATSGKPVSKPNFNYNSKTRSERPFCTHCGLPGHTIDQCYTLHGYPLG--FKNSGQRS
        M+P P +NK FSLV QEE+ R +  + S +  S         +A +G        +  SKTR +R  C++CG  GH  D+CY L GYP G  FKN G  S
Subjt:  MEPPPYINKAFSLVNQEEQQRLIHATQSVNSASSAFALAVPQSATSGKPVSKPNFNYNSKTRSERPFCTHCGLPGHTIDQCYTLHGYPLG--FKNSGQRS

Query:  SSVTKSVGSSSSSASQPEPTESQAMAVSSTNMDMAAQCQQLIQLLQSQMLHSKASSNE---TAPSYLAANHDRFLPVNKQWIFDSGASALICCSAILFQS
        S    S+ ++S          S+++  S T M    QCQQLIQLL +Q+  + ++S E   T PS      ++    NK WI DSGA+  +C    LF S
Subjt:  SSVTKSVGSSSSSASQPEPTESQAMAVSSTNMDMAAQCQQLIQLLQSQMLHSKASSNE---TAPSYLAANHDRFLPVNKQWIFDSGASALICCSAILFQS

Query:  MMPISSVVVNLPNKSSFTVEFAGSIGISSDISLTNVFYIPDFHYNLVSISALRKEFLSIQVLFANDSCLLQDKSTFRTIGK
         + + +V V LP   +  ++  GS+ +S D+ L NV ++P F YNL+S+SA   + LS+ ++F  D+C++Q  S  + IGK
Subjt:  MMPISSVVVNLPNKSSFTVEFAGSIGISSDISLTNVFYIPDFHYNLVSISALRKEFLSIQVLFANDSCLLQDKSTFRTIGK

XP_012836458.1 PREDICTED: uncharacterized protein LOC105957084 [Erythranthe guttata]2.6e-3237.83Show/hide
Query:  MEPPPYINKAFSLVNQEEQQRLIHATQSVNSASSAFALAVPQSATSGKPVSKPNFNYNSKTRSERPFCTHCGLPGHTIDQCYTLHGYPLGFK--------
        M+P P INK F+LV+QEE+QR IH++ +    S AF++   Q+ T     ++  FN ++  R ER FCTHC + GHTID+CY LHGYP G+K        
Subjt:  MEPPPYINKAFSLVNQEEQQRLIHATQSVNSASSAFALAVPQSATSGKPVSKPNFNYNSKTRSERPFCTHCGLPGHTIDQCYTLHGYPLGFK--------

Query:  -----NSGQRSSSVTKSVGSSSSSASQPEPTESQAMAVSSTNMDMAAQCQQLIQLLQSQMLHSK------ASSNETAPSYL---------AANHDRFLPV
             +  Q S+  +    +SS S SQP    S  +  + +    AAQCQQLI    +QM   K      +  +E   +++         A+ H  F P 
Subjt:  -----NSGQRSSSVTKSVGSSSSSASQPEPTESQAMAVSSTNMDMAAQCQQLIQLLQSQMLHSK------ASSNETAPSYL---------AANHDRFLPV

Query:  NKQWIFDSGASALICCSAILFQSMMPISSVVVNLPNKSSFTVEFAGSIGISSDISLTNVFYIPDFHYNLVSISALRKEFLSIQVLFANDSCLLQDKSTFR
           WI DSGAS  IC    LF S+  ++   V LP+ S   VE+ G + +S D+ L NVFY+P F +NL+S+SAL  + L   V+F ++S L+QDK   +
Subjt:  NKQWIFDSGASALICCSAILFQSMMPISSVVVNLPNKSSFTVEFAGSIGISSDISLTNVFYIPDFHYNLVSISALRKEFLSIQVLFANDSCLLQDKSTFR

Query:  TIGK
         IGK
Subjt:  TIGK

XP_012856897.1 PREDICTED: uncharacterized protein LOC105976150 [Erythranthe guttata]1.2e-3239.93Show/hide
Query:  MEPPPYINKAFSLVNQEEQQRLIHATQSVNSASSAFALAVPQSATSGKPVSKPNFNYNS-KTRSERPFCTHCGLPGHTIDQCYTLHGYPLGFKNSGQRSS
        M+P P INK F+LV+QEE+QR IH++ +    S AF++   QS       S  N  Y S   R ER FCTHC + GHTID+CY LHGYP G+K +  R S
Subjt:  MEPPPYINKAFSLVNQEEQQRLIHATQSVNSASSAFALAVPQSATSGKPVSKPNFNYNS-KTRSERPFCTHCGLPGHTIDQCYTLHGYPLGFKNSGQRSS

Query:  SVTKSVGSSSSSASQPEPTE-------SQAMAVSS----TNMDMAAQCQQLIQLLQSQMLHSKASSNETAPS---------------YLAANHDRFLPVN
        S+ +S  S +  A+   P +       SQ   VSS     NM  AAQCQQL+    +QM   K  S + +                   A+ H+ F P  
Subjt:  SVTKSVGSSSSSASQPEPTE-------SQAMAVSS----TNMDMAAQCQQLIQLLQSQMLHSKASSNETAPS---------------YLAANHDRFLPVN

Query:  KQWIFDSGASALICCSAILFQSMMPISSVVVNLPNKSSFTVEFAGSIGISSDISLTNVFYIPDFHYNLVSISALRKEFLSIQVLFANDSCLLQDKSTFRT
          WI DSGAS  IC    LF S+  ++   V LP+ S   VE+ G + +S D+ L NVFY+P F +NL+S+SAL  + L   V+F + S L+QDK   + 
Subjt:  KQWIFDSGASALICCSAILFQSMMPISSVVVNLPNKSSFTVEFAGSIGISSDISLTNVFYIPDFHYNLVSISALRKEFLSIQVLFANDSCLLQDKSTFRT

Query:  IGK
        IGK
Subjt:  IGK

XP_022143573.1 uncharacterized protein LOC111013441 [Momordica charantia]9.9e-3239.93Show/hide
Query:  MEPPPYINKAFSLVNQEEQQRLIHATQSVNSASSAFALAVPQSATSGKPVSK-PNFNYNSKTRSERPFCTHCGLPGHTIDQCYTLHGYPLGFKNSGQRSS
        M+PPP +NKA SLV Q+EQQR I  + ++ +A S FAL    SA+  KP SK  N+  N K                       LHGYP G++ SGQR S
Subjt:  MEPPPYINKAFSLVNQEEQQRLIHATQSVNSASSAFALAVPQSATSGKPVSK-PNFNYNSKTRSERPFCTHCGLPGHTIDQCYTLHGYPLGFKNSGQRSS

Query:  SVTKSVGSSSSSASQPEPTESQAMAVSSTNMDMAAQCQQLIQLLQSQMLHSK-ASSNETAPSYLAANHDRFLPVNKQWIFDSGASALICCSAILFQSMMP
          +++ G+S SS       E  A+A S  N    +  QQL QLLQSQ+   K  +  +T  SY      + L      I D GASA IC    LF  +  
Subjt:  SVTKSVGSSSSSASQPEPTESQAMAVSSTNMDMAAQCQQLIQLLQSQMLHSK-ASSNETAPSYLAANHDRFLPVNKQWIFDSGASALICCSAILFQSMMP

Query:  ISSVVVNLPNKSSFTVEFAGSIGISSDISLTNVFYIPDFHYNLVSISALRKEFLSIQVLFANDSCLLQDKSTFRTIGK
        IS V VNLPNK  F VE++G + +S  +S+  V YIP+F++NL+S++ L ++  S+ V F ND+C++QDKS  +TI K
Subjt:  ISSVVVNLPNKSSFTVEFAGSIGISSDISLTNVFYIPDFHYNLVSISALRKEFLSIQVLFANDSCLLQDKSTFRTIGK

XP_022871010.1 uncharacterized protein LOC111390234 [Olea europaea var. sylvestris]5.6e-2730.21Show/hide
Query:  MEPPPYINKAFSLVNQEEQQRLIHATQSVNSASSAFALAV---PQSATSGKPVSKPNFNYNSKTRSERPFCTHCGLPGHTIDQCYTLHGYPLGFK---NS
        M+P P INK F+LV Q++ QR ++A   +++ + + A  V   P+ +T+G+ +       N   ++ERP CT+CG  GH +D+CY LHG+P G+K    +
Subjt:  MEPPPYINKAFSLVNQEEQQRLIHATQSVNSASSAFALAV---PQSATSGKPVSKPNFNYNSKTRSERPFCTHCGLPGHTIDQCYTLHGYPLGFK---NS

Query:  GQRSSSVTKSVGSSSSSASQPEPTESQAMAVSSTNMDMAAQCQQLIQLLQSQMLHSKASSNETA------PSYLAANHDRFLPVNKQWIFDSGASALICC
        G+ S+     V    S   Q +      M   +     AAQ Q L+ +L   +  +K  + E A       + L+ +    L   + W+ DSGA++ I  
Subjt:  GQRSSSVTKSVGSSSSSASQPEPTESQAMAVSSTNMDMAAQCQQLIQLLQSQMLHSKASSNETA------PSYLAANHDRFLPVNKQWIFDSGASALICC

Query:  SAILFQSMMPISSVVVNLPNKSSFTVEFAGSIGISSDISLTNVFYIPDFHYNLVSISALRKEFLSIQVLFANDSCLLQDKSTFRTIGK
        S   F +M P+ +  V LPN     +++  ++ +S +  L +V ++P F +NL+S+S+L ++  S+ + F  D+C++Q   T + IGK
Subjt:  SAILFQSMMPISSVVVNLPNKSSFTVEFAGSIGISSDISLTNVFYIPDFHYNLVSISALRKEFLSIQVLFANDSCLLQDKSTFRTIGK

TrEMBL top hitse value%identityAlignment
A0A151T1V6 Retrovirus-related Pol polyprotein from transposon TNT 1-942.7e-2738.87Show/hide
Query:  EPPPYINKAFSLVNQEEQQRLIHATQSVNSASSAFALAVPQSATSGKPVSKPNFNYNSKTRSERPFCTHCGLPGHTIDQCYTLHGYPLG-FKNSGQRSSS
        +P P I   FSL+ QEE Q+ I  T S N  S   A  V Q A      +K  F     T+ ERP C HC + GHT D+CY L GYP   FKN   R   
Subjt:  EPPPYINKAFSLVNQEEQQRLIHATQSVNSASSAFALAVPQSATSGKPVSKPNFNYNSKTRSERPFCTHCGLPGHTIDQCYTLHGYPLG-FKNSGQRSSS

Query:  VTKSVGSSSSSASQPEPTESQAMAVSSTNMDMAAQCQQLIQLLQSQMLHSKASSNETAPSYLAAN------HDRFLPVNKQWIFDSGASALICCSAILFQ
        V   V +S+ S           + ++S++    AQCQQLI  L +QM   ++ +N  A   LA N      +  F   +  WI DSGA++ ICCS  ++ 
Subjt:  VTKSVGSSSSSASQPEPTESQAMAVSSTNMDMAAQCQQLIQLLQSQMLHSKASSNETAPSYLAAN------HDRFLPVNKQWIFDSGASALICCSAILFQ

Query:  SMMPISSVVVNLPNKSSFTVEFAGSIGISSDISLTNVFYIPDFHYNLVSISALRKEFLSIQVLFANDSCLLQDKSTFRTIGKV
        S  P+ +  V LPN +   VE  GSI ++ DI L NV +IP+F +NL+S+  L  E     VL  N SC+LQD  T R IG V
Subjt:  SMMPISSVVVNLPNKSSFTVEFAGSIGISSDISLTNVFYIPDFHYNLVSISALRKEFLSIQVLFANDSCLLQDKSTFRTIGKV

A0A2Z7AT15 Cysteine-rich RLK (Receptor-like protein kinase) 82.7e-2734.15Show/hide
Query:  MEPPPYINKAFSLVNQEEQQRLIHATQSVNSASSAFALAVPQSATSGKPVSKPNFNYNSK-TRSERPFCTHCGLPGHTIDQCYTLHGYPLGFKNSGQRSS
        +EP P I K F+LV QEE+QR IH    V+ A    +  +    +S    +    + NSK  R +R  C+HC    HT+D+CY LHGYP G      + S
Subjt:  MEPPPYINKAFSLVNQEEQQRLIHATQSVNSASSAFALAVPQSATSGKPVSKPNFNYNSK-TRSERPFCTHCGLPGHTIDQCYTLHGYPLGFKNSGQRSS

Query:  SVTKSVGSSSSSASQPEPTESQAMAVSSTNMDMAAQCQQLIQLLQSQ--------MLHSKASSNETAPSYLAANHDRFLPVNKQWIFDSGASALICCSAI
          +     +SSS+     T  +   +  ++    +QC+QLI+ L S+        M H   ++        +A         K WI D+GA+  ICCS  
Subjt:  SVTKSVGSSSSSASQPEPTESQAMAVSSTNMDMAAQCQQLIQLLQSQ--------MLHSKASSNETAPSYLAANHDRFLPVNKQWIFDSGASALICCSAI

Query:  LFQSMMPISSVVVNLPNKSSFTVEFAGSIGISSDISLTNVFYIPDFHYNLVSISALRKEFLSIQVLFANDSCLLQDKSTFRTIG
        +F+S   I S VV LPN  +  V  AG++ ++S++ L NV Y+P F +NL+S+S+L     +  V F +DSC +QD S  R IG
Subjt:  LFQSMMPISSVVVNLPNKSSFTVEFAGSIGISSDISLTNVFYIPDFHYNLVSISALRKEFLSIQVLFANDSCLLQDKSTFRTIG

A0A2Z7CMI0 Uncharacterized protein2.7e-2734.15Show/hide
Query:  MEPPPYINKAFSLVNQEEQQRLIHATQSVNSASSAFALAVPQSATSGKPVSKPNFNYNSK-TRSERPFCTHCGLPGHTIDQCYTLHGYPLGFKNSGQRSS
        +EP P I K F+LV QEE+QR IH    V+ A    +  +    +S    +    + NSK  R +R  C+HC    HT+D+CY LHGYP G      + S
Subjt:  MEPPPYINKAFSLVNQEEQQRLIHATQSVNSASSAFALAVPQSATSGKPVSKPNFNYNSK-TRSERPFCTHCGLPGHTIDQCYTLHGYPLGFKNSGQRSS

Query:  SVTKSVGSSSSSASQPEPTESQAMAVSSTNMDMAAQCQQLIQLLQSQ--------MLHSKASSNETAPSYLAANHDRFLPVNKQWIFDSGASALICCSAI
          +     +SSS+     T  +   +  ++    +QC+QLI+ L S+        M H   ++        +A         K WI D+GA+  ICCS  
Subjt:  SVTKSVGSSSSSASQPEPTESQAMAVSSTNMDMAAQCQQLIQLLQSQ--------MLHSKASSNETAPSYLAANHDRFLPVNKQWIFDSGASALICCSAI

Query:  LFQSMMPISSVVVNLPNKSSFTVEFAGSIGISSDISLTNVFYIPDFHYNLVSISALRKEFLSIQVLFANDSCLLQDKSTFRTIG
        +F+S   I S VV LPN  +  V  AG++ ++S++ L NV Y+P F +NL+S+S+L     +  V F +DSC +QD S  R IG
Subjt:  LFQSMMPISSVVVNLPNKSSFTVEFAGSIGISSDISLTNVFYIPDFHYNLVSISALRKEFLSIQVLFANDSCLLQDKSTFRTIG

A0A438JAT7 Retrovirus-related Pol polyprotein from transposon TNT 1-943.7e-3236.65Show/hide
Query:  MEPPPYINKAFSLVNQEEQQRLIHATQSVNSASSAFALAVPQSATSGKPVSKPNFNYNSKTRSERPFCTHCGLPGHTIDQCYTLHGYPLG--FKNSGQRS
        M+P P +NK FSLV QEE+ R +  + S +  S         +A +G        +  SKTR +R  C++CG  GH  D+CY L GYP G  FKN G  S
Subjt:  MEPPPYINKAFSLVNQEEQQRLIHATQSVNSASSAFALAVPQSATSGKPVSKPNFNYNSKTRSERPFCTHCGLPGHTIDQCYTLHGYPLG--FKNSGQRS

Query:  SSVTKSVGSSSSSASQPEPTESQAMAVSSTNMDMAAQCQQLIQLLQSQMLHSKASSNE---TAPSYLAANHDRFLPVNKQWIFDSGASALICCSAILFQS
        S    S+ ++S          S+++  S T M    QCQQLIQLL +Q+  + ++S E   T PS      ++    NK WI DSGA+  +C    LF S
Subjt:  SSVTKSVGSSSSSASQPEPTESQAMAVSSTNMDMAAQCQQLIQLLQSQMLHSKASSNE---TAPSYLAANHDRFLPVNKQWIFDSGASALICCSAILFQS

Query:  MMPISSVVVNLPNKSSFTVEFAGSIGISSDISLTNVFYIPDFHYNLVSISALRKEFLSIQVLFANDSCLLQDKSTFRTIGK
         + + +V V LP   +  ++  GS+ +S D+ L NV ++P F YNL+S+SA   + LS+ ++F  D+C++Q  S  + IGK
Subjt:  MMPISSVVVNLPNKSSFTVEFAGSIGISSDISLTNVFYIPDFHYNLVSISALRKEFLSIQVLFANDSCLLQDKSTFRTIGK

A0A6J1CR17 uncharacterized protein LOC1110134414.8e-3239.93Show/hide
Query:  MEPPPYINKAFSLVNQEEQQRLIHATQSVNSASSAFALAVPQSATSGKPVSK-PNFNYNSKTRSERPFCTHCGLPGHTIDQCYTLHGYPLGFKNSGQRSS
        M+PPP +NKA SLV Q+EQQR I  + ++ +A S FAL    SA+  KP SK  N+  N K                       LHGYP G++ SGQR S
Subjt:  MEPPPYINKAFSLVNQEEQQRLIHATQSVNSASSAFALAVPQSATSGKPVSK-PNFNYNSKTRSERPFCTHCGLPGHTIDQCYTLHGYPLGFKNSGQRSS

Query:  SVTKSVGSSSSSASQPEPTESQAMAVSSTNMDMAAQCQQLIQLLQSQMLHSK-ASSNETAPSYLAANHDRFLPVNKQWIFDSGASALICCSAILFQSMMP
          +++ G+S SS       E  A+A S  N    +  QQL QLLQSQ+   K  +  +T  SY      + L      I D GASA IC    LF  +  
Subjt:  SVTKSVGSSSSSASQPEPTESQAMAVSSTNMDMAAQCQQLIQLLQSQMLHSK-ASSNETAPSYLAANHDRFLPVNKQWIFDSGASALICCSAILFQSMMP

Query:  ISSVVVNLPNKSSFTVEFAGSIGISSDISLTNVFYIPDFHYNLVSISALRKEFLSIQVLFANDSCLLQDKSTFRTIGK
        IS V VNLPNK  F VE++G + +S  +S+  V YIP+F++NL+S++ L ++  S+ V F ND+C++QDKS  +TI K
Subjt:  ISSVVVNLPNKSSFTVEFAGSIGISSDISLTNVFYIPDFHYNLVSISALRKEFLSIQVLFANDSCLLQDKSTFRTIGK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGCCACCACCGTATATAAATAAGGCTTTTTCTCTTGTAAATCAAGAGGAACAACAACGTTTGATCCACGCAACGCAGTCGGTGAATTCTGCCTCTTCTGCTTTCGC
GCTTGCTGTTCCGCAGTCGGCGACTTCTGGAAAGCCAGTATCAAAACCGAATTTCAATTACAATTCGAAGACCAGGAGTGAACGACCATTTTGTACACACTGTGGCCTTC
CAGGTCATACCATTGACCAATGCTATACGTTGCACGGGTATCCTCTTGGTTTTAAGAATTCCGGACAGCGTTCTTCTTCAGTAACCAAATCAGTTGGTTCTTCATCGTCT
TCTGCTTCACAACCTGAGCCCACTGAATCTCAGGCTATGGCTGTTTCTTCCACTAACATGGACATGGCTGCTCAATGTCAGCAACTCATCCAGCTTCTTCAATCTCAGAT
GCTGCACTCTAAGGCCTCTTCTAATGAGACTGCACCTTCATATTTGGCAGCTAATCATGATCGATTTTTACCTGTCAATAAGCAATGGATTTTTGATTCAGGCGCTTCTG
CACTCATTTGTTGTTCTGCCATTCTTTTTCAGTCTATGATGCCGATTTCCTCTGTGGTTGTGAATTTACCAAATAAATCAAGTTTCACTGTGGAGTTTGCCGGATCTATT
GGGATATCATCAGACATTTCTCTAACTAATGTCTTTTATATTCCTGATTTCCACTACAACTTGGTTTCGATCAGTGCGTTAAGAAAGGAGTTTCTGAGTATACAAGTATT
ATTTGCTAATGACTCTTGTCTTCTTCAGGATAAGTCCACTTTCAGGACGATTGGCAAGGTGATATGA
mRNA sequenceShow/hide mRNA sequence
ATGGAGCCACCACCGTATATAAATAAGGCTTTTTCTCTTGTAAATCAAGAGGAACAACAACGTTTGATCCACGCAACGCAGTCGGTGAATTCTGCCTCTTCTGCTTTCGC
GCTTGCTGTTCCGCAGTCGGCGACTTCTGGAAAGCCAGTATCAAAACCGAATTTCAATTACAATTCGAAGACCAGGAGTGAACGACCATTTTGTACACACTGTGGCCTTC
CAGGTCATACCATTGACCAATGCTATACGTTGCACGGGTATCCTCTTGGTTTTAAGAATTCCGGACAGCGTTCTTCTTCAGTAACCAAATCAGTTGGTTCTTCATCGTCT
TCTGCTTCACAACCTGAGCCCACTGAATCTCAGGCTATGGCTGTTTCTTCCACTAACATGGACATGGCTGCTCAATGTCAGCAACTCATCCAGCTTCTTCAATCTCAGAT
GCTGCACTCTAAGGCCTCTTCTAATGAGACTGCACCTTCATATTTGGCAGCTAATCATGATCGATTTTTACCTGTCAATAAGCAATGGATTTTTGATTCAGGCGCTTCTG
CACTCATTTGTTGTTCTGCCATTCTTTTTCAGTCTATGATGCCGATTTCCTCTGTGGTTGTGAATTTACCAAATAAATCAAGTTTCACTGTGGAGTTTGCCGGATCTATT
GGGATATCATCAGACATTTCTCTAACTAATGTCTTTTATATTCCTGATTTCCACTACAACTTGGTTTCGATCAGTGCGTTAAGAAAGGAGTTTCTGAGTATACAAGTATT
ATTTGCTAATGACTCTTGTCTTCTTCAGGATAAGTCCACTTTCAGGACGATTGGCAAGGTGATATGA
Protein sequenceShow/hide protein sequence
MEPPPYINKAFSLVNQEEQQRLIHATQSVNSASSAFALAVPQSATSGKPVSKPNFNYNSKTRSERPFCTHCGLPGHTIDQCYTLHGYPLGFKNSGQRSSSVTKSVGSSSS
SASQPEPTESQAMAVSSTNMDMAAQCQQLIQLLQSQMLHSKASSNETAPSYLAANHDRFLPVNKQWIFDSGASALICCSAILFQSMMPISSVVVNLPNKSSFTVEFAGSI
GISSDISLTNVFYIPDFHYNLVSISALRKEFLSIQVLFANDSCLLQDKSTFRTIGKVI