; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc10g10550 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc10g10550
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionIntegrase catalytic domain-containing protein
Genome locationchr10:8090487..8091377
RNA-Seq ExpressionMoc10g10550
SyntenyMoc10g10550
Gene Ontology termsGO:0110165 - cellular anatomical structure (cellular component)
GO:0097159 - organic cyclic compound binding (molecular function)
GO:1901363 - heterocyclic compound binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAA7035581.1 unnamed protein product [Microthlaspi erraticum]4.6e-4340.53Show/hide
Query:  MDYDVMRESRSYDLIQNMWDALKLIFGGTSVTKVRQLTIKFDTYKKRPNHNMRQHMREMSNMISELKEAGHVLTDEQQVQAAIRSLSQNWEHMKVHLTHN
        M  D+M E   Y+  + MW+ALK  FG TS TK+R+L  +F  YKKRPN +MRQH+R MSNMI ELK AG +++DEQQVQA +RSL ++W+HM++ +THN
Subjt:  MDYDVMRESRSYDLIQNMWDALKLIFGGTSVTKVRQLTIKFDTYKKRPNHNMRQHMREMSNMISELKEAGHVLTDEQQVQAAIRSLSQNWEHMKVHLTHN

Query:  ERILTLTDVSRHLELEEDRIEATGPYNTEAYVAGPSSHENKR----HKRKHQGGFKGKGDHKNKKPKQGQNDK---------------------------
          + T  D+SRHLELE++R+EA  P+N   Y    S  + KR    +K   Q   K +  +  +  + G+ DK                           
Subjt:  ERILTLTDVSRHLELEEDRIEATGPYNTEAYVAGPSSHENKR----HKRKHQGGFKGKGDHKNKKPKQGQNDK---------------------------

Query:  ------------------------DSGATDHVAKDRGVYVEFRRIPSGARWLYVGNNARVPVKG
                                DSGATDHVA+ R  +VE+RR+P G RWLYVGNN++V V+G
Subjt:  ------------------------DSGATDHVAKDRGVYVEFRRIPSGARWLYVGNNARVPVKG

KAG7595359.1 Reverse transcriptase RNA-dependent DNA polymerase [Arabidopsis thaliana x Arabidopsis arenosa]1.3e-4542.8Show/hide
Query:  MDYDVMRESRSYDLIQNMWDALKLIFGGTSVTKVRQLTIKFDTYKKRPNHNMRQHMREMSNMISELKEAGHVLTDEQQVQAAIRSLSQNWEHMKVHLTHN
        M  D+M E   Y+  + MW+ALK  FG TS TK+R+L  KF+ +KKRPN +MRQH+R MSNMI ELK AG +++DEQQ+QA +RSL ++W+HM+V +THN
Subjt:  MDYDVMRESRSYDLIQNMWDALKLIFGGTSVTKVRQLTIKFDTYKKRPNHNMRQHMREMSNMISELKEAGHVLTDEQQVQAAIRSLSQNWEHMKVHLTHN

Query:  ERILTLTDVSRHLELEEDRIEATGPYNTEAYVAGPSS-----HENKRHKRKHQGGFKGKGDHKNKKPKQGQNDK--------------------------
          + T  D+SRHLELE++R+EA  P+N EA VA   S       NK +K  +Q G K +     +  + G+ DK                          
Subjt:  ERILTLTDVSRHLELEEDRIEATGPYNTEAYVAGPSS-----HENKRHKRKHQGGFKGKGDHKNKKPKQGQNDK--------------------------

Query:  ------------------------DSGATDHVAKDRGVYVEFRRIPSGARWLYVGNNARVPVKG
                                DSGATDHVA+DR  +VE+RRIP G+RWLYVGNN++V V G
Subjt:  ------------------------DSGATDHVAKDRGVYVEFRRIPSGARWLYVGNNARVPVKG

KYP30908.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan]8.9e-4745.62Show/hide
Query:  DVMRESRSYDLIQNMWDALKLIFGGTSVTKVRQLTIKFDTYKKRPNHNMRQHMREMSNMISELKEAGHVLTDEQQVQAAIRSLSQNWEHMKVHLTHNERI
        D+M E  S+   Q MW ALK  FGGTS +K+R+LTIKFDTYK +PN  ++QH+REM+NMI ELKEAGH L+DEQQVQA IRSL  +WEHMKV+LTHNE I
Subjt:  DVMRESRSYDLIQNMWDALKLIFGGTSVTKVRQLTIKFDTYKKRPNHNMRQHMREMSNMISELKEAGHVLTDEQQVQAAIRSLSQNWEHMKVHLTHNERI

Query:  LTLTDVSRHLELEEDRIEA----TGPYNTEA---YVAGPSSHENKRHKRKHQGG----------FKGKGDHKNKK----------------------PKQ
         T +DV++HLELE++R+EA       Y TEA     A P+S + K  +RK + G          FK KG+   KK                       KQ
Subjt:  LTLTDVSRHLELEEDRIEA----TGPYNTEA---YVAGPSSHENKRHKRKHQGG----------FKGKGDHKNKK----------------------PKQ

Query:  GQNDK-----------------------------DSGATDHVAKDRGVYVEFRRIPSGARWLYVGNNARVPVKG
         +++K                             DSG+TDHV +DRG ++E+RR+P+ +RWLYVGNNAR+ VKG
Subjt:  GQNDK-----------------------------DSGATDHVAKDRGVYVEFRRIPSGARWLYVGNNARVPVKG

TXG60589.1 hypothetical protein EZV62_015162 [Acer yangbiense]2.8e-4854.11Show/hide
Query:  VMRESRSYDLIQNMWDALKLIFGGTSVTKVRQLTIKFDTYKKRPNHNMRQHMREMSNMISELKEAGHVLTDEQQVQAAIRSLSQNWEHMKVHLTHNERIL
        +M E   +D  Q+MW  LK  FGGTS TK+R+LTIKFDTY+KR NH+MRQH+REMSNMI ELK AGH L DEQQ+QA IRSL  +WE+MK+++THN  I 
Subjt:  VMRESRSYDLIQNMWDALKLIFGGTSVTKVRQLTIKFDTYKKRPNHNMRQHMREMSNMISELKEAGHVLTDEQQVQAAIRSLSQNWEHMKVHLTHNERIL

Query:  TLTDVSRHLELEEDRIEATGPYNTEAYVAGPSSHENKRHKRKHQGGFKGKGDHKNKKPKQGQND--KDSGATDHVAKDRGVYVEFRRIPSGARWLYVGNN
        T  D+SRH+ELE++R+EA    + + Y+A  S  + K  K   + G   KG+ +  K K    +  K+SGATDHVA+DR  +VE+ RI  G RW+YV NN
Subjt:  TLTDVSRHLELEEDRIEATGPYNTEAYVAGPSSHENKRHKRKHQGGFKGKGDHKNKKPKQGQND--KDSGATDHVAKDRGVYVEFRRIPSGARWLYVGNN

Query:  ARVPVKG
        +RV VKG
Subjt:  ARVPVKG

XP_031252128.1 uncharacterized protein LOC116110025 [Pistacia vera]2.3e-4751.2Show/hide
Query:  MDYDVMRESRSYDLIQNMWDALKLIFGGTSVTKVRQLTIKFDTYKKRPNHNMRQHMREMSNMISELKEAGHVLTDEQQVQAAIRSLSQNWEHMKVHLTHN
        MD D+M++   Y L ++MW      FGG S+TK+R LTIKFDTYKK+P HNMR H+R +SNMISEL +AGHVLTDEQQVQA IRSL QNWEHMK+HLTHN
Subjt:  MDYDVMRESRSYDLIQNMWDALKLIFGGTSVTKVRQLTIKFDTYKKRPNHNMRQHMREMSNMISELKEAGHVLTDEQQVQAAIRSLSQNWEHMKVHLTHN

Query:  ERILTLTDVSRHLELEEDRIEATGPYNTEAYVAGPSSHENKRHKRKHQGGFKGKGDHKNKKPKQGQNDKDSGATDHVAKDRGVYVEFRRIPSGARWLYVG
        E I+ L DV  HLELEEDR+ A+   NT+ Y+AG SSH++ +                                DH  KDR V        +G +W+YVG
Subjt:  ERILTLTDVSRHLELEEDRIEATGPYNTEAYVAGPSSHENKRHKRKHQGGFKGKGDHKNKKPKQGQNDKDSGATDHVAKDRGVYVEFRRIPSGARWLYVG

Query:  NNARVPVKG
        NN +V VKG
Subjt:  NNARVPVKG

TrEMBL top hitse value%identityAlignment
A0A151QKT8 Retrovirus-related Pol polyprotein from transposon TNT 1-944.3e-4745.62Show/hide
Query:  DVMRESRSYDLIQNMWDALKLIFGGTSVTKVRQLTIKFDTYKKRPNHNMRQHMREMSNMISELKEAGHVLTDEQQVQAAIRSLSQNWEHMKVHLTHNERI
        D+M E  S+   Q MW ALK  FGGTS +K+R+LTIKFDTYK +PN  ++QH+REM+NMI ELKEAGH L+DEQQVQA IRSL  +WEHMKV+LTHNE I
Subjt:  DVMRESRSYDLIQNMWDALKLIFGGTSVTKVRQLTIKFDTYKKRPNHNMRQHMREMSNMISELKEAGHVLTDEQQVQAAIRSLSQNWEHMKVHLTHNERI

Query:  LTLTDVSRHLELEEDRIEA----TGPYNTEA---YVAGPSSHENKRHKRKHQGG----------FKGKGDHKNKK----------------------PKQ
         T +DV++HLELE++R+EA       Y TEA     A P+S + K  +RK + G          FK KG+   KK                       KQ
Subjt:  LTLTDVSRHLELEEDRIEA----TGPYNTEA---YVAGPSSHENKRHKRKHQGG----------FKGKGDHKNKK----------------------PKQ

Query:  GQNDK-----------------------------DSGATDHVAKDRGVYVEFRRIPSGARWLYVGNNARVPVKG
         +++K                             DSG+TDHV +DRG ++E+RR+P+ +RWLYVGNNAR+ VKG
Subjt:  GQNDK-----------------------------DSGATDHVAKDRGVYVEFRRIPSGARWLYVGNNARVPVKG

A0A1R3GLW6 Zinc finger, CCHC-type1.3e-4046.22Show/hide
Query:  MDYDVMRESRSYDLIQNMWDALKLIFGGTSVTKVRQLTIKFDTYKKRPNHNMRQHMREMSNMISELKEAGHVLTDEQQVQAAIRSLSQNWEHMKVHLTHN
        M  +++ E   Y   Q MW+ LK  FG  S TK++QL IK+D YKKRP +NMRQH++EMSNM+ ELK  GHVL D QQVQA +RSL + WE+M   LTHN
Subjt:  MDYDVMRESRSYDLIQNMWDALKLIFGGTSVTKVRQLTIKFDTYKKRPNHNMRQHMREMSNMISELKEAGHVLTDEQQVQAAIRSLSQNWEHMKVHLTHN

Query:  ERILTLTDVSRHLELEEDRIEATGPYNTEAYVAGPSSHENKRHKRKHQGGFKGKGDH---------KNKKPKQGQ-----NDKD---------------S
        ++  T  DV RHLELEE+R+ A   +  E  +A  S      HKRK +GG   K D          K  K K+GQ      DK                S
Subjt:  ERILTLTDVSRHLELEEDRIEATGPYNTEAYVAGPSSHENKRHKRKHQGGFKGKGDH---------KNKKPKQGQ-----NDKD---------------S

Query:  GATDHVAKDRGVYVEFRRIPSGARWLYVGNNARVPVKG
         ATDHVA+ R  YVE+RRIP+G RW+Y+G    V VKG
Subjt:  GATDHVAKDRGVYVEFRRIPSGARWLYVGNNARVPVKG

A0A5C7HUX5 Uncharacterized protein1.3e-4854.11Show/hide
Query:  VMRESRSYDLIQNMWDALKLIFGGTSVTKVRQLTIKFDTYKKRPNHNMRQHMREMSNMISELKEAGHVLTDEQQVQAAIRSLSQNWEHMKVHLTHNERIL
        +M E   +D  Q+MW  LK  FGGTS TK+R+LTIKFDTY+KR NH+MRQH+REMSNMI ELK AGH L DEQQ+QA IRSL  +WE+MK+++THN  I 
Subjt:  VMRESRSYDLIQNMWDALKLIFGGTSVTKVRQLTIKFDTYKKRPNHNMRQHMREMSNMISELKEAGHVLTDEQQVQAAIRSLSQNWEHMKVHLTHNERIL

Query:  TLTDVSRHLELEEDRIEATGPYNTEAYVAGPSSHENKRHKRKHQGGFKGKGDHKNKKPKQGQND--KDSGATDHVAKDRGVYVEFRRIPSGARWLYVGNN
        T  D+SRH+ELE++R+EA    + + Y+A  S  + K  K   + G   KG+ +  K K    +  K+SGATDHVA+DR  +VE+ RI  G RW+YV NN
Subjt:  TLTDVSRHLELEEDRIEATGPYNTEAYVAGPSSHENKRHKRKHQGGFKGKGDHKNKKPKQGQND--KDSGATDHVAKDRGVYVEFRRIPSGARWLYVGNN

Query:  ARVPVKG
        +RV VKG
Subjt:  ARVPVKG

A0A6D2J7K8 Uncharacterized protein2.2e-4340.53Show/hide
Query:  MDYDVMRESRSYDLIQNMWDALKLIFGGTSVTKVRQLTIKFDTYKKRPNHNMRQHMREMSNMISELKEAGHVLTDEQQVQAAIRSLSQNWEHMKVHLTHN
        M  D+M E   Y+  + MW+ALK  FG TS TK+R+L  +F  YKKRPN +MRQH+R MSNMI ELK AG +++DEQQVQA +RSL ++W+HM++ +THN
Subjt:  MDYDVMRESRSYDLIQNMWDALKLIFGGTSVTKVRQLTIKFDTYKKRPNHNMRQHMREMSNMISELKEAGHVLTDEQQVQAAIRSLSQNWEHMKVHLTHN

Query:  ERILTLTDVSRHLELEEDRIEATGPYNTEAYVAGPSSHENKR----HKRKHQGGFKGKGDHKNKKPKQGQNDK---------------------------
          + T  D+SRHLELE++R+EA  P+N   Y    S  + KR    +K   Q   K +  +  +  + G+ DK                           
Subjt:  ERILTLTDVSRHLELEEDRIEATGPYNTEAYVAGPSSHENKR----HKRKHQGGFKGKGDHKNKKPKQGQNDK---------------------------

Query:  ------------------------DSGATDHVAKDRGVYVEFRRIPSGARWLYVGNNARVPVKG
                                DSGATDHVA+ R  +VE+RR+P G RWLYVGNN++V V+G
Subjt:  ------------------------DSGATDHVAKDRGVYVEFRRIPSGARWLYVGNNARVPVKG

A0A6P5EGH5 uncharacterized protein LOC1097042092.7e-4149.76Show/hide
Query:  DVMRESRSYDLIQNMWDALKLIFGGTSVTKVRQLTIKFDTYKKRPNHNMRQHMREMSNMISELKEAGHVLTDEQQVQAAIRSLSQNWEHMKVHLTHNERI
        D++ E         MW  L   FG T+V K++QLTIKFDTYKK PNH ++QH+REMSNMI ELK AGH LTD QQ QA IRSL  +WEHMKV+LT+N+ I
Subjt:  DVMRESRSYDLIQNMWDALKLIFGGTSVTKVRQLTIKFDTYKKRPNHNMRQHMREMSNMISELKEAGHVLTDEQQVQAAIRSLSQNWEHMKVHLTHNERI

Query:  LTLTDVSRHLELEEDRIEATGPYNTEAYVAGPSSHENKRHKRKHQGGFKGKGDHKNKKPKQGQNDKDSGATDHVAKDRGVYVEFRRIPSGARWLYVGNNA
         T  DV+R++ELE++R+ AT      AY+   SSH+             G G              DSGAT H+A DRG +VE+RR+ +G +W+YVGNNA
Subjt:  LTLTDVSRHLELEEDRIEATGPYNTEAYVAGPSSHENKRHKRKHQGGFKGKGDHKNKKPKQGQNDKDSGATDHVAKDRGVYVEFRRIPSGARWLYVGNNA

Query:  RVPVK
        +V VK
Subjt:  RVPVK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACTACGATGTTATGCGTGAGTCTCGCTCTTATGACCTTATCCAGAACATGTGGGATGCGTTAAAGTTGATATTTGGTGGGACCTCTGTCACTAAAGTTAGG
CAACTTACTATTAAGTTTGATACTTATAAAAAACGTCCTAATCATAATATGAGACAACACATGAGAGAGATGTCTAATATGATTAGTGAGCTGAAAGAAGCTGGT
CATGTCCTAACTGATGAACAACAAGTTCAAGCTGCTATTCGATCTCTTTCCCAAAATTGGGAACACATGAAGGTGCATCTTACCCATAATGAGAGAATTTTAACT
CTTACTGATGTTTCACGTCATCTTGAACTGGAAGAAGACCGTATAGAGGCAACAGGGCCTTACAACACTGAAGCCTATGTGGCTGGCCCTAGCTCACATGAGAAC
AAAAGGCACAAGCGTAAACATCAGGGAGGTTTTAAAGGGAAAGGGGATCACAAGAATAAGAAACCCAAGCAAGGTCAGAATGACAAGGACTCAGGAGCAACCGAC
CACGTAGCTAAAGATCGTGGGGTGTATGTGGAATTTCGTCGAATTCCATCAGGAGCTAGGTGGCTGTATGTAGGAAACAATGCCCGTGTGCCTGTAAAAGGGCAT
TGGTTCCTACAAGTTAAACATGAGGAATGGACGCACTCAACTTCTACATGA
mRNA sequenceShow/hide mRNA sequence
ATGGACTACGATGTTATGCGTGAGTCTCGCTCTTATGACCTTATCCAGAACATGTGGGATGCGTTAAAGTTGATATTTGGTGGGACCTCTGTCACTAAAGTTAGG
CAACTTACTATTAAGTTTGATACTTATAAAAAACGTCCTAATCATAATATGAGACAACACATGAGAGAGATGTCTAATATGATTAGTGAGCTGAAAGAAGCTGGT
CATGTCCTAACTGATGAACAACAAGTTCAAGCTGCTATTCGATCTCTTTCCCAAAATTGGGAACACATGAAGGTGCATCTTACCCATAATGAGAGAATTTTAACT
CTTACTGATGTTTCACGTCATCTTGAACTGGAAGAAGACCGTATAGAGGCAACAGGGCCTTACAACACTGAAGCCTATGTGGCTGGCCCTAGCTCACATGAGAAC
AAAAGGCACAAGCGTAAACATCAGGGAGGTTTTAAAGGGAAAGGGGATCACAAGAATAAGAAACCCAAGCAAGGTCAGAATGACAAGGACTCAGGAGCAACCGAC
CACGTAGCTAAAGATCGTGGGGTGTATGTGGAATTTCGTCGAATTCCATCAGGAGCTAGGTGGCTGTATGTAGGAAACAATGCCCGTGTGCCTGTAAAAGGGCAT
TGGTTCCTACAAGTTAAACATGAGGAATGGACGCACTCAACTTCTACATGA
Protein sequenceShow/hide protein sequence
MDYDVMRESRSYDLIQNMWDALKLIFGGTSVTKVRQLTIKFDTYKKRPNHNMRQHMREMSNMISELKEAGHVLTDEQQVQAAIRSLSQNWEHMKVHLTHNERILT
LTDVSRHLELEEDRIEATGPYNTEAYVAGPSSHENKRHKRKHQGGFKGKGDHKNKKPKQGQNDKDSGATDHVAKDRGVYVEFRRIPSGARWLYVGNNARVPVKGH
WFLQVKHEEWTHSTST