; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc01g03570 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc01g03570
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionReverse transcriptase
Genome locationchr1:2296480..2302667
RNA-Seq ExpressionMoc01g03570
SyntenyMoc01g03570
Gene Ontology termsGO:0016787 - hydrolase activity (molecular function)
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022156662.1 uncharacterized protein LOC111023512 [Momordica charantia]5.7e-2458.87Show/hide
Query:  SAMRSINVSEGLITWLQFKYAFFLQYYLAITRFRKQAKILNLKQGNRSVEEYERH--------------EAKKTKRFIMGLKDDIQGFVAALPLADYATA
        S  R I+VS G +TWLQFK AFF QYY AIT +RKQ + LNLKQ NRSVEEY+R               EA K +RFI+ LKD+ +GFVA L   DYATA
Subjt:  SAMRSINVSEGLITWLQFKYAFFLQYYLAITRFRKQAKILNLKQGNRSVEEYERH--------------EAKKTKRFIMGLKDDIQGFVAALPLADYATA

Query:  LRAAALIDNHSANEPQVPSGLGSS
        LR AALIDN SA+  QVP G GSS
Subjt:  LRAAALIDNHSANEPQVPSGLGSS

XP_038880159.1 uncharacterized protein LOC120071839 [Benincasa hispida]3.2e-1439.35Show/hide
Query:  AMRSINVSEGLITWLQFKYAFFLQYYLAITRFRKQAKILNLKQGNRSVEEYERH--------------EAKKTKRFIMGLKDDIQGFVAALPLADYATAL
        A R I  S G  TW QFK  F+ +Y+ A  R+ KQA+ +NLKQG  +VEEYE                EAK+ +RF+ GL+D+++G V AL   +YATA 
Subjt:  AMRSINVSEGLITWLQFKYAFFLQYYLAITRFRKQAKILNLKQGNRSVEEYERH--------------EAKKTKRFIMGLKDDIQGFVAALPLADYATAL

Query:  RAAALIDNHSANEPQVPSGLGSSSGQKRKFDQESSNPQQLQDNFSTLNQAQGQRQ
        RAAA +   S  E         +SGQKRK +Q++  P Q      + +++QG  Q
Subjt:  RAAALIDNHSANEPQVPSGLGSSSGQKRKFDQESSNPQQLQDNFSTLNQAQGQRQ

XP_038880466.1 uncharacterized protein LOC120072126 [Benincasa hispida]9.2e-1442.55Show/hide
Query:  AMRSINVSEGLITWLQFKYAFFLQYYLAITRFRKQAKILNLKQGNRSVEEYERH--------------EAKKTKRFIMGLKDDIQGFVAALPLADYATAL
        A + I+ S  L TW QFK  F+ +Y+ A TR+ KQ + LNLKQG   V+EYE+               E  +T+RFI GL+ +++G V AL L  YATAL
Subjt:  AMRSINVSEGLITWLQFKYAFFLQYYLAITRFRKQAKILNLKQGNRSVEEYERH--------------EAKKTKRFIMGLKDDIQGFVAALPLADYATAL

Query:  RAAALI--DNHSANEPQVPSGLGSSSGQKRKFDQESSNPQQ
        RA   I  D+    E    S + +S+GQKRK DQ++S  QQ
Subjt:  RAAALI--DNHSANEPQVPSGLGSSSGQKRKFDQESSNPQQ

XP_038884794.1 LOW QUALITY PROTEIN: uncharacterized protein LOC120075457 [Benincasa hispida]1.0e-1235.98Show/hide
Query:  SAMRSINVSEGLITWLQFKYAFFLQYYLAITRFRKQAKILNLKQGNRSVEEYERH--------------EAKKTKRFIMGLKDDIQGFVAALPLADYATA
        SA R + V    +TW QFK  F+ +Y+ A  R+ KQ + L L+QG+RSVE+Y++               EA + +RF+ GLKD IQG V A     +  A
Subjt:  SAMRSINVSEGLITWLQFKYAFFLQYYLAITRFRKQAKILNLKQGNRSVEEYERH--------------EAKKTKRFIMGLKDDIQGFVAALPLADYATA

Query:  LRAAALIDNHSANEPQVPSGLGSSSGQKRKFDQESSNPQQLQDNFSTLNQAQGQRQPPNRQQCP
        LR    +D  S +E     G+G S GQKRK DQ++  P   Q    T      +R+  + ++CP
Subjt:  LRAAALIDNHSANEPQVPSGLGSSSGQKRKFDQESSNPQQLQDNFSTLNQAQGQRQPPNRQQCP

XP_038891712.1 uncharacterized protein LOC120081110 [Benincasa hispida]3.2e-1440.43Show/hide
Query:  AMRSINVSEGLITWLQFKYAFFLQYYLAITRFRKQAKILNLKQGNRSVEEYERH--------------EAKKTKRFIMGLKDDIQGFVAALPLADYATAL
        A R + V    +TW QFK  F+ +Y+ A  R+ KQ + L L+QG+RSVEEY++               EA + +RFI GLK+ I+G V A     +  AL
Subjt:  AMRSINVSEGLITWLQFKYAFFLQYYLAITRFRKQAKILNLKQGNRSVEEYERH--------------EAKKTKRFIMGLKDDIQGFVAALPLADYATAL

Query:  RAAALIDNHSANEPQVPSGLGSSSGQKRKFDQESSNPQQLQ
        R AA +D  S +E  +  G G SSGQKRK DQ+   P   Q
Subjt:  RAAALIDNHSANEPQVPSGLGSSSGQKRKFDQESSNPQQLQ

TrEMBL top hitse value%identityAlignment
A0A5A7SMY9 Gag-protease polyprotein1.3e-1035.86Show/hide
Query:  VEPTSAMRSINVSEGLITWLQFKYAFFLQYYLAITRFRKQAKILNLKQGNRSVEEYE--------------RHEAKKTKRFIMGLKDDIQGFVAALPLAD
        +E     R +  +   ITW QFK  F+ +++ A  +  KQ + LNL+QGN +VE+Y+              R EA +T++F+ GL  D Q  V AL  A 
Subjt:  VEPTSAMRSINVSEGLITWLQFKYAFFLQYYLAITRFRKQAKILNLKQGNRSVEEYE--------------RHEAKKTKRFIMGLKDDIQGFVAALPLAD

Query:  YATALRAAALIDNHSANEPQVPSGLGSSSGQKRKFD-QESSNPQQ
        +A ALR A  +  H   +P   +G GSS GQK K + Q +  PQ+
Subjt:  YATALRAAALIDNHSANEPQVPSGLGSSSGQKRKFD-QESSNPQQ

A0A5A7SPG1 Pol protein1.0e-1036.88Show/hide
Query:  SAMRSINVSEGLITWLQFKYAFFLQYYLAITRFRKQAKILNLKQGNRSVEEYE--------------RHEAKKTKRFIMGLKDDIQGFVAALPLADYATA
        +A R +    G ITW QFK +F+ +++ A  +  KQ K +NLKQ + +VE+Y+              R EA +T++F+ GL+ D+QG V AL  A +A A
Subjt:  SAMRSINVSEGLITWLQFKYAFFLQYYLAITRFRKQAKILNLKQGNRSVEEYE--------------RHEAKKTKRFIMGLKDDIQGFVAALPLADYATA

Query:  LRAAALIDNHSANEPQVPSGLGSSSGQKRKFD-QESSNPQQ
        LR A  +  H   +    +G GS+ GQKRK + Q    PQ+
Subjt:  LRAAALIDNHSANEPQVPSGLGSSSGQKRKFD-QESSNPQQ

A0A5D3BCJ4 Pol protein1.0e-1036.88Show/hide
Query:  SAMRSINVSEGLITWLQFKYAFFLQYYLAITRFRKQAKILNLKQGNRSVEEYE--------------RHEAKKTKRFIMGLKDDIQGFVAALPLADYATA
        +A R +    G ITW QFK +F+ +++ A  +  KQ K +NLKQ + +VE+Y+              R EA +T++F+ GL+ D+QG V AL  A +A A
Subjt:  SAMRSINVSEGLITWLQFKYAFFLQYYLAITRFRKQAKILNLKQGNRSVEEYE--------------RHEAKKTKRFIMGLKDDIQGFVAALPLADYATA

Query:  LRAAALIDNHSANEPQVPSGLGSSSGQKRKFD-QESSNPQQ
        LR A  +  H   +    +G GS+ GQKRK + Q    PQ+
Subjt:  LRAAALIDNHSANEPQVPSGLGSSSGQKRKFD-QESSNPQQ

A0A6J1DSJ6 uncharacterized protein LOC1110235122.8e-2458.87Show/hide
Query:  SAMRSINVSEGLITWLQFKYAFFLQYYLAITRFRKQAKILNLKQGNRSVEEYERH--------------EAKKTKRFIMGLKDDIQGFVAALPLADYATA
        S  R I+VS G +TWLQFK AFF QYY AIT +RKQ + LNLKQ NRSVEEY+R               EA K +RFI+ LKD+ +GFVA L   DYATA
Subjt:  SAMRSINVSEGLITWLQFKYAFFLQYYLAITRFRKQAKILNLKQGNRSVEEYERH--------------EAKKTKRFIMGLKDDIQGFVAALPLADYATA

Query:  LRAAALIDNHSANEPQVPSGLGSS
        LR AALIDN SA+  QVP G GSS
Subjt:  LRAAALIDNHSANEPQVPSGLGSS

A0A6J1FB78 uncharacterized protein LOC1114438451.4e-1231.39Show/hide
Query:  SAMRSINVSEGLITWLQFKYAFFLQYYLAITRFRKQAKILNLKQGNRSVEEYERH--------------EAKKTKRFIMGLKDDIQGFVAALPLADYATA
        S  R+I   E  ITW QF+ AF  + + ++ R++KQ + L+++QGNRSVEEYER               E  K + F+MGL+ DI+G V      DYATA
Subjt:  SAMRSINVSEGLITWLQFKYAFFLQYYLAITRFRKQAKILNLKQGNRSVEEYERH--------------EAKKTKRFIMGLKDDIQGFVAALPLADYATA

Query:  LRAAALIDNHSANEPQVPSGLGSSSGQKRKFDQESSNPQQLQDNFSTLNQAQGQRQ--------PPNRQQCPN-----------------NERPEGQIAI
        L+ A  +D          +   +S  QKRK +Q S    ++  + +  N  Q  RQ        P NR +C N                 N   EG +  
Subjt:  LRAAALIDNHSANEPQVPSGLGSSSGQKRKFDQESSNPQQLQDNFSTLNQAQGQRQ--------PPNRQQCPN-----------------NERPEGQIAI

Query:  NCHEGNTPVDAKKPCNVNQSPLR
        +C +G      + P N   S LR
Subjt:  NCHEGNTPVDAKKPCNVNQSPLR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGAGCTGACGACAGAGAGGAGCTGCTACTGCCATTTCATGCTACTGTCTTCCATCTGCGTATCTCAGAAGGGAGTCGTGTCGAGTTGAGTAATAGCAGTCGTTTTCA
TGCTGGAGTAGAGATTCCAGGAGGAAGCCACGTGTCCTTCAGACAAGTAGGTTCGCCCGAGAACATGAAAACCTGCAACAGAAAACTCGACGTCGGTGCAGTGTCCCGTG
AAGGCCTTCCATATATAGCCTCTGGCGACCGACCAGGAAGGATGACGGTTGAGGGTGGCAAGGGCATGCCGCGCGCAGGGAGACGTGCCACCTGTACTGGAAATGTGTTG
TCCCTTGCTAGCAATTCGCACGGTGGATATGCGTCCTGCTTGGTTTTCACTCTCAAGAGTCAGCTTTCTTACGAGGTAAGTGATTCTGCTAGGTTGGAAATTTTATATGA
TGTTTTGAATGACTATACGATGGATGCTGTTTGTGAATTGAATATTATGTGGAGTTGTTGTGCTGCATTACAGAAAGATTGGGTTAATTGTTCACATTGGCCAAAAGAAA
AGGAAAAGGAAAAGGAAAAGAATTTTACTCATAAAGTTTTAGGGAATAACAGGTTTGTGACGAAGATATCTGCAAGCTGTGCCATAGGCATTAATGTATCACCATGGCCA
GTTACACAAATATCTTGTCATCGCCAGATTATGGCTCCACGTAGAAGGGTTTTTGTGAGACGGGGTGGGCTAGGTAAAGGAGCAAACCCTGAGATAGTAGAGCCAACTTC
TGCAATGAGGTCTATCAATGTTAGTGAAGGCCTAATCACATGGTTGCAGTTTAAATATGCTTTCTTCCTACAGTATTACCTAGCGATCACCCGATTCAGGAAACAGGCAA
AAATTCTAAACCTGAAGCAAGGTAACAGATCAGTGGAGGAATATGAGAGACACGAGGCCAAGAAGACCAAACGATTCATCATGGGCCTGAAGGATGATATTCAAGGCTTT
GTGGCAGCTCTCCCCCTAGCAGATTATGCTACAGCACTTCGAGCAGCTGCATTGATTGATAATCATTCAGCAAATGAGCCCCAAGTGCCTTCGGGACTAGGTTCTTCCTC
AGGACAGAAAAGAAAGTTTGACCAAGAATCTAGTAATCCACAGCAACTTCAAGATAATTTTTCCACGCTGAACCAAGCTCAAGGTCAGAGACAGCCGCCTAATCGACAAC
AATGTCCTAACAATGAAAGACCAGAAGGACAGATTGCTATAAATTGTCATGAGGGCAACACTCCAGTTGATGCAAAAAAACCTTGTAATGTGAACCAGTCACCCTTACGA
TCGCTCAAGGTTCCAATGCTGCCCTTATGTCTAGATCGGCATCATGGCATCTGCTCGGCCATGGGCAAGCCTATGGAGACCTGA
mRNA sequenceShow/hide mRNA sequence
ATGAGAGCTGACGACAGAGAGGAGCTGCTACTGCCATTTCATGCTACTGTCTTCCATCTGCGTATCTCAGAAGGGAGTCGTGTCGAGTTGAGTAATAGCAGTCGTTTTCA
TGCTGGAGTAGAGATTCCAGGAGGAAGCCACGTGTCCTTCAGACAAGTAGGTTCGCCCGAGAACATGAAAACCTGCAACAGAAAACTCGACGTCGGTGCAGTGTCCCGTG
AAGGCCTTCCATATATAGCCTCTGGCGACCGACCAGGAAGGATGACGGTTGAGGGTGGCAAGGGCATGCCGCGCGCAGGGAGACGTGCCACCTGTACTGGAAATGTGTTG
TCCCTTGCTAGCAATTCGCACGGTGGATATGCGTCCTGCTTGGTTTTCACTCTCAAGAGTCAGCTTTCTTACGAGGTAAGTGATTCTGCTAGGTTGGAAATTTTATATGA
TGTTTTGAATGACTATACGATGGATGCTGTTTGTGAATTGAATATTATGTGGAGTTGTTGTGCTGCATTACAGAAAGATTGGGTTAATTGTTCACATTGGCCAAAAGAAA
AGGAAAAGGAAAAGGAAAAGAATTTTACTCATAAAGTTTTAGGGAATAACAGGTTTGTGACGAAGATATCTGCAAGCTGTGCCATAGGCATTAATGTATCACCATGGCCA
GTTACACAAATATCTTGTCATCGCCAGATTATGGCTCCACGTAGAAGGGTTTTTGTGAGACGGGGTGGGCTAGGTAAAGGAGCAAACCCTGAGATAGTAGAGCCAACTTC
TGCAATGAGGTCTATCAATGTTAGTGAAGGCCTAATCACATGGTTGCAGTTTAAATATGCTTTCTTCCTACAGTATTACCTAGCGATCACCCGATTCAGGAAACAGGCAA
AAATTCTAAACCTGAAGCAAGGTAACAGATCAGTGGAGGAATATGAGAGACACGAGGCCAAGAAGACCAAACGATTCATCATGGGCCTGAAGGATGATATTCAAGGCTTT
GTGGCAGCTCTCCCCCTAGCAGATTATGCTACAGCACTTCGAGCAGCTGCATTGATTGATAATCATTCAGCAAATGAGCCCCAAGTGCCTTCGGGACTAGGTTCTTCCTC
AGGACAGAAAAGAAAGTTTGACCAAGAATCTAGTAATCCACAGCAACTTCAAGATAATTTTTCCACGCTGAACCAAGCTCAAGGTCAGAGACAGCCGCCTAATCGACAAC
AATGTCCTAACAATGAAAGACCAGAAGGACAGATTGCTATAAATTGTCATGAGGGCAACACTCCAGTTGATGCAAAAAAACCTTGTAATGTGAACCAGTCACCCTTACGA
TCGCTCAAGGTTCCAATGCTGCCCTTATGTCTAGATCGGCATCATGGCATCTGCTCGGCCATGGGCAAGCCTATGGAGACCTGA
Protein sequenceShow/hide protein sequence
MRADDREELLLPFHATVFHLRISEGSRVELSNSSRFHAGVEIPGGSHVSFRQVGSPENMKTCNRKLDVGAVSREGLPYIASGDRPGRMTVEGGKGMPRAGRRATCTGNVL
SLASNSHGGYASCLVFTLKSQLSYEVSDSARLEILYDVLNDYTMDAVCELNIMWSCCAALQKDWVNCSHWPKEKEKEKEKNFTHKVLGNNRFVTKISASCAIGINVSPWP
VTQISCHRQIMAPRRRVFVRRGGLGKGANPEIVEPTSAMRSINVSEGLITWLQFKYAFFLQYYLAITRFRKQAKILNLKQGNRSVEEYERHEAKKTKRFIMGLKDDIQGF
VAALPLADYATALRAAALIDNHSANEPQVPSGLGSSSGQKRKFDQESSNPQQLQDNFSTLNQAQGQRQPPNRQQCPNNERPEGQIAINCHEGNTPVDAKKPCNVNQSPLR
SLKVPMLPLCLDRHHGICSAMGKPMET