CuGenDBv2

Moc06g26810 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc06g26810
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Reverse transcriptase
Genome location: chr6:20220645..20226371
RNA-Seq Expression: Moc06g26810
Synteny: Moc06g26810
Gene Ontology terms:
  GO:0006508 - proteolysis (biological process)
  GO:0090304 - nucleic acid metabolic process (biological process)
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0004190 - aspartic-type endopeptidase activity (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
  GO:0016740 - transferase activity (molecular function)
InterPro domains:
  IPR001969 - Aspartic peptidase, active site
  IPR021109 - Aspartic peptidase domain superfamily


Homology
GenBank top hits (e value, % identity, alignment)
XP_022154299.1 uncharacterized protein LOC111021593 [Momordica charantia]  e value: 5.1e-69  % identity: 72.16
Query:  MSAANTQRLGQRIPPPVSTQGNNQKARVFALTRKEAADAETVVIGTVLVHDVPAYVLFDSGSSHTFISSTFVRQATLELEPLGILLSVSTPSGSILIASQ
        M+ +N QRLGQ+ P  V TQ  NQKARVFALTR+E  +AE VV GTVLV + PAYVLFDSGSS TFIS+ FVRQ  LEL PLG LL VSTPSGS++I+SQ
Subjt:  MSAANTQRLGQRIPPPVSTQGNNQKARVFALTRKEAADAETVVIGTVLVHDVPAYVLFDSGSSHTFISSTFVRQATLELEPLGILLSVSTPSGSILIASQ

Query:  KVKAGELSFDNQTLRARLIQLDMQDFDVIVGMDWLATNQANINCSRREVSFQLPLGRSFTFKGVTSRVPRTVSALKARRLLQNGAWGYLANVVD
         VK G LSFD Q L ARLIQLD++DFDVI+GMDWLATNQA+INCS++EVSFQLP G SF FKGVT  VPR VSAL+AR LLQ GAWG+LA+VVD
Subjt:  KVKAGELSFDNQTLRARLIQLDMQDFDVIVGMDWLATNQANINCSRREVSFQLPLGRSFTFKGVTSRVPRTVSALKARRLLQNGAWGYLANVVD

XP_022154844.1 uncharacterized protein LOC111022005 [Momordica charantia]  e value: 2.1e-70  % identity: 77.9
Query:  MSAANTQRLGQRIPPPVSTQGNNQKARVFALTRKEAADAETVVIGTVLVHDVPAYVLFDSGSSHTFISSTFVRQATLELEPLGILLSVSTPSGSILIASQ
        MS ANTQRLGQR P  + TQG N++ARVFALTRKEAADAET+V G VLVH+VP Y LFDS SSHTFIS+ FVRQATL++E LGILLSVSTPSG+++IASQ
Subjt:  MSAANTQRLGQRIPPPVSTQGNNQKARVFALTRKEAADAETVVIGTVLVHDVPAYVLFDSGSSHTFISSTFVRQATLELEPLGILLSVSTPSGSILIASQ

Query:  KVKAGELSFDNQTLRARLIQLDMQDFDVIVGMDWLATNQANINCSRREVSFQLPLGRSFTFKGVTSRVPRTVSALKARRLL
         V+A +LSFDNQTL+ARLIQLD++DFDVI+GMDWLATNQANINC RREVSFQLP GRSFTFKGVT  VP+ VS LKARR L
Subjt:  KVKAGELSFDNQTLRARLIQLDMQDFDVIVGMDWLATNQANINCSRREVSFQLPLGRSFTFKGVTSRVPRTVSALKARRLL

XP_022156992.1 uncharacterized protein LOC111023821 [Momordica charantia]  e value: 3.1e-66  % identity: 69.07
Query:  MSAANTQRLGQRIPPPVSTQGNNQKARVFALTRKEAADAETVVIGTVLVHDVPAYVLFDSGSSHTFISSTFVRQATLELEPLGILLSVSTPSGSILIASQ
        M+  NTQ LGQRIP   + QG   +ARVFALTR + A AE VV+GTVLV  +PAY LFDS SSH+FI+STFVR A LELE LG LLSVSTPSGS+L+ SQ
Subjt:  MSAANTQRLGQRIPPPVSTQGNNQKARVFALTRKEAADAETVVIGTVLVHDVPAYVLFDSGSSHTFISSTFVRQATLELEPLGILLSVSTPSGSILIASQ

Query:  KVKAGELSFDNQTLRARLIQLDMQDFDVIVGMDWLATNQANINCSRREVSFQLPLGRSFTFKGVTSRVPRTVSALKARRLLQNGAWGYLANVVD
         VK G+LSFD QTL  +LIQLDMQDFDVI+GMDWLA NQANI+CS++E SF+LP  ++FTFKGV +RVPR VSALKA   LQ GAW YLA+VVD
Subjt:  KVKAGELSFDNQTLRARLIQLDMQDFDVIVGMDWLATNQANINCSRREVSFQLPLGRSFTFKGVTSRVPRTVSALKARRLLQNGAWGYLANVVD

XP_022158750.1 uncharacterized protein LOC111025215 [Momordica charantia]  e value: 1.7e-64  % identity: 67.53
Query:  MSAANTQRLGQRIPPPVSTQGNNQKARVFALTRKEAADAETVVIGTVLVHDVPAYVLFDSGSSHTFISSTFVRQATLELEPLGILLSVSTPSGSILIASQ
        M+ +NTQ LGQRIP   + QG   +ARVFALTR +   AE VV  TVLV  +PAY LFDSGSSH+FI+STFV  A LELE LG LLSVSTPSGS+L+ SQ
Subjt:  MSAANTQRLGQRIPPPVSTQGNNQKARVFALTRKEAADAETVVIGTVLVHDVPAYVLFDSGSSHTFISSTFVRQATLELEPLGILLSVSTPSGSILIASQ

Query:  KVKAGELSFDNQTLRARLIQLDMQDFDVIVGMDWLATNQANINCSRREVSFQLPLGRSFTFKGVTSRVPRTVSALKARRLLQNGAWGYLANVVD
         VK G+LSFD QTL  +LIQLDMQDFDVI+GMDWLA N+ANI+CS+++VSF+LP G++FTFKGV + VPR V ALKA  LLQ GAW YLA+VVD
Subjt:  KVKAGELSFDNQTLRARLIQLDMQDFDVIVGMDWLATNQANINCSRREVSFQLPLGRSFTFKGVTSRVPRTVSALKARRLLQNGAWGYLANVVD

XP_022159077.1 uncharacterized protein LOC111025517 [Momordica charantia]  e value: 6.0e-70  % identity: 69.86
Query:  MSAANTQRLGQRIPPPVSTQGNNQKARVFALTRKEAADAETVVIGTVLVHDVPAYVLFDSGSSHTFISSTFVRQATLELEPLGILLSVSTPSGSILIASQ
        M+AANTQRLGQR  P VSTQG                       GT LVH+VPAYVLFD GSSHTFIS+ FVRQATLELEPLG LLSVSTPSGS+LIASQ
Subjt:  MSAANTQRLGQRIPPPVSTQGNNQKARVFALTRKEAADAETVVIGTVLVHDVPAYVLFDSGSSHTFISSTFVRQATLELEPLGILLSVSTPSGSILIASQ

Query:  KVKAGELSFDNQTLRARLIQLDMQDFDVIVGMDWLATNQANINCSRREVSFQLPLGRSFTFKGVTSRVPRTVSALKARRLLQNGAWGYLANVVDI--IPP
         V+AGELSFDNQTL ARLIQLDM+DFDVI+GMDWLATNQANINCS+REVSFQLP GRSFTFKGV+  VPR VSALKARRLL NGAW YLA+VVDI   PP
Subjt:  KVKAGELSFDNQTLRARLIQLDMQDFDVIVGMDWLATNQANINCSRREVSFQLPLGRSFTFKGVTSRVPRTVSALKARRLLQNGAWGYLANVVDI--IPP

Query:  GQVLDELDRSEVELAVEDV
              +D + V     DV
Subjt:  GQVLDELDRSEVELAVEDV

TrEMBL top hits (e value, % identity, alignment)
A0A6J1DLN2 uncharacterized protein LOC111021593  e value: 2.5e-69  % identity: 72.16
Query:  MSAANTQRLGQRIPPPVSTQGNNQKARVFALTRKEAADAETVVIGTVLVHDVPAYVLFDSGSSHTFISSTFVRQATLELEPLGILLSVSTPSGSILIASQ
        M+ +N QRLGQ+ P  V TQ  NQKARVFALTR+E  +AE VV GTVLV + PAYVLFDSGSS TFIS+ FVRQ  LEL PLG LL VSTPSGS++I+SQ
Subjt:  MSAANTQRLGQRIPPPVSTQGNNQKARVFALTRKEAADAETVVIGTVLVHDVPAYVLFDSGSSHTFISSTFVRQATLELEPLGILLSVSTPSGSILIASQ

Query:  KVKAGELSFDNQTLRARLIQLDMQDFDVIVGMDWLATNQANINCSRREVSFQLPLGRSFTFKGVTSRVPRTVSALKARRLLQNGAWGYLANVVD
         VK G LSFD Q L ARLIQLD++DFDVI+GMDWLATNQA+INCS++EVSFQLP G SF FKGVT  VPR VSAL+AR LLQ GAWG+LA+VVD
Subjt:  KVKAGELSFDNQTLRARLIQLDMQDFDVIVGMDWLATNQANINCSRREVSFQLPLGRSFTFKGVTSRVPRTVSALKARRLLQNGAWGYLANVVD

A0A6J1DNG3 uncharacterized protein LOC111022005  e value: 1.0e-70  % identity: 77.9
Query:  MSAANTQRLGQRIPPPVSTQGNNQKARVFALTRKEAADAETVVIGTVLVHDVPAYVLFDSGSSHTFISSTFVRQATLELEPLGILLSVSTPSGSILIASQ
        MS ANTQRLGQR P  + TQG N++ARVFALTRKEAADAET+V G VLVH+VP Y LFDS SSHTFIS+ FVRQATL++E LGILLSVSTPSG+++IASQ
Subjt:  MSAANTQRLGQRIPPPVSTQGNNQKARVFALTRKEAADAETVVIGTVLVHDVPAYVLFDSGSSHTFISSTFVRQATLELEPLGILLSVSTPSGSILIASQ

Query:  KVKAGELSFDNQTLRARLIQLDMQDFDVIVGMDWLATNQANINCSRREVSFQLPLGRSFTFKGVTSRVPRTVSALKARRLL
         V+A +LSFDNQTL+ARLIQLD++DFDVI+GMDWLATNQANINC RREVSFQLP GRSFTFKGVT  VP+ VS LKARR L
Subjt:  KVKAGELSFDNQTLRARLIQLDMQDFDVIVGMDWLATNQANINCSRREVSFQLPLGRSFTFKGVTSRVPRTVSALKARRLL

A0A6J1DTE5 uncharacterized protein LOC111023821  e value: 1.5e-66  % identity: 69.07
Query:  MSAANTQRLGQRIPPPVSTQGNNQKARVFALTRKEAADAETVVIGTVLVHDVPAYVLFDSGSSHTFISSTFVRQATLELEPLGILLSVSTPSGSILIASQ
        M+  NTQ LGQRIP   + QG   +ARVFALTR + A AE VV+GTVLV  +PAY LFDS SSH+FI+STFVR A LELE LG LLSVSTPSGS+L+ SQ
Subjt:  MSAANTQRLGQRIPPPVSTQGNNQKARVFALTRKEAADAETVVIGTVLVHDVPAYVLFDSGSSHTFISSTFVRQATLELEPLGILLSVSTPSGSILIASQ

Query:  KVKAGELSFDNQTLRARLIQLDMQDFDVIVGMDWLATNQANINCSRREVSFQLPLGRSFTFKGVTSRVPRTVSALKARRLLQNGAWGYLANVVD
         VK G+LSFD QTL  +LIQLDMQDFDVI+GMDWLA NQANI+CS++E SF+LP  ++FTFKGV +RVPR VSALKA   LQ GAW YLA+VVD
Subjt:  KVKAGELSFDNQTLRARLIQLDMQDFDVIVGMDWLATNQANINCSRREVSFQLPLGRSFTFKGVTSRVPRTVSALKARRLLQNGAWGYLANVVD

A0A6J1DWP4 uncharacterized protein LOC111025215  e value: 8.2e-65  % identity: 67.53
Query:  MSAANTQRLGQRIPPPVSTQGNNQKARVFALTRKEAADAETVVIGTVLVHDVPAYVLFDSGSSHTFISSTFVRQATLELEPLGILLSVSTPSGSILIASQ
        M+ +NTQ LGQRIP   + QG   +ARVFALTR +   AE VV  TVLV  +PAY LFDSGSSH+FI+STFV  A LELE LG LLSVSTPSGS+L+ SQ
Subjt:  MSAANTQRLGQRIPPPVSTQGNNQKARVFALTRKEAADAETVVIGTVLVHDVPAYVLFDSGSSHTFISSTFVRQATLELEPLGILLSVSTPSGSILIASQ

Query:  KVKAGELSFDNQTLRARLIQLDMQDFDVIVGMDWLATNQANINCSRREVSFQLPLGRSFTFKGVTSRVPRTVSALKARRLLQNGAWGYLANVVD
         VK G+LSFD QTL  +LIQLDMQDFDVI+GMDWLA N+ANI+CS+++VSF+LP G++FTFKGV + VPR V ALKA  LLQ GAW YLA+VVD
Subjt:  KVKAGELSFDNQTLRARLIQLDMQDFDVIVGMDWLATNQANINCSRREVSFQLPLGRSFTFKGVTSRVPRTVSALKARRLLQNGAWGYLANVVD

A0A6J1DYU5 uncharacterized protein LOC111025517  e value: 2.9e-70  % identity: 69.86
Query:  MSAANTQRLGQRIPPPVSTQGNNQKARVFALTRKEAADAETVVIGTVLVHDVPAYVLFDSGSSHTFISSTFVRQATLELEPLGILLSVSTPSGSILIASQ
        M+AANTQRLGQR  P VSTQG                       GT LVH+VPAYVLFD GSSHTFIS+ FVRQATLELEPLG LLSVSTPSGS+LIASQ
Subjt:  MSAANTQRLGQRIPPPVSTQGNNQKARVFALTRKEAADAETVVIGTVLVHDVPAYVLFDSGSSHTFISSTFVRQATLELEPLGILLSVSTPSGSILIASQ

Query:  KVKAGELSFDNQTLRARLIQLDMQDFDVIVGMDWLATNQANINCSRREVSFQLPLGRSFTFKGVTSRVPRTVSALKARRLLQNGAWGYLANVVDI--IPP
         V+AGELSFDNQTL ARLIQLDM+DFDVI+GMDWLATNQANINCS+REVSFQLP GRSFTFKGV+  VPR VSALKARRLL NGAW YLA+VVDI   PP
Subjt:  KVKAGELSFDNQTLRARLIQLDMQDFDVIVGMDWLATNQANINCSRREVSFQLPLGRSFTFKGVTSRVPRTVSALKARRLLQNGAWGYLANVVDI--IPP

Query:  GQVLDELDRSEVELAVEDV
              +D + V     DV
Subjt:  GQVLDELDRSEVELAVEDV

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGTCGGCCGCAAATACGCAGAGGTTGGGTCAGAGGATTCCACCACCAGTTTCGACGCAGGGAAATAACCAAAAGGCTCGTGTCTTCGCACTTACTCGCAAGGAAGCGGC
GGATGCCGAAACAGTAGTCATAGGTACTGTTTTAGTCCATGATGTGCCTGCGTATGTATTGTTTGATTCGGGGTCGAGCCACACCTTCATCTCATCTACGTTCGTTCGTC
AGGCAACCCTCGAATTAGAGCCGTTAGGGATTTTGTTGTCGGTTTCTACACCATCAGGGTCGATTTTGATCGCTAGTCAAAAGGTGAAGGCAGGTGAGTTGTCTTTTGAT
AATCAGACTCTAAGGGCAAGGCTGATCCAGCTGGACATGCAAGATTTTGACGTTATTGTGGGCATGGATTGGCTAGCTACCAACCAAGCCAACATTAATTGCTCGAGAAG
AGAAGTCTCCTTTCAACTACCTTTGGGTCGGAGCTTTACGTTTAAAGGGGTTACGAGTAGAGTCCCAAGGACAGTATCAGCGTTGAAGGCAAGACGCCTGTTGCAGAATG
GAGCTTGGGGATATTTGGCCAACGTTGTCGACATTATACCTCCTGGACAAGTGTTAGATGAGTTGGACCGTTCTGAGGTGGAGCTAGCGGTAGAAGATGTGTCAGCAGTG
CTAGCTCAACTCTCGGTCAAACCCACCCTAAGACAACGAATCATCGCTGCACAAAAGGGAGACTCCAGTCTGAGCAAGGGTTTCGTGGATGAAACATTGTGCTATAAAGA
AGTACCCGTTGGGATCGTAGTAAGAGAGACCAAAGTGCTGCAGAACCGGGTGATTGATTTGGTGAAGGTCTTGTGGAGGAACCACCAAATAGAAGAGGCCACCTGGGAGC
GAGAAGACGAATTCAGGGCCCAGTATCCTGAATTGATCGAGCAACGAACTTTCGAGGACGAAACTGCAGGCGGCATCGATACACGCGGGGCTGTGTTCGCGGCGTCCTTT
CTCCGATTCAACAAGCCTAACGACCTCGGAGTTAGATTTGGAGTACCCACACCCAAACGAAATCGATTTAGAATACCCACACCTAAGCGGGGTTGTTTTGTAGCAAGGGA
CATTGAAACGAAGCCATTGGAGATCGTATTGGACGCTTTTCGCTGCTGCAAAAACGTGGACAGCAGCGTATTGGTGGTGTTCGGCGATTATCTACATCCGTTTGAAACCC
GATTTACGCTACCCACGTCTTGGCAAGCTAGATCTAATTTACCCACACCTATACGAAGTCGTTTTGCGTGTGAGGCCGACGCAAACTTAAACACGTGGTTGAGACCAATA
CGCTGGGAAGTCATCGGTACCTTGGGAATAAACGGCAAGGACCGGTGCACAGTTCAGGCCTTGGGAATAAATGGCAAGGCCGAACGTCAAGTTTCTGGAGAGGAGTCGGA
CATCAAGTACTGGAGAAGGAGGAGGTACGGTGACTTGGGAATAAATGTCAAGAGCCGTCATGCCTCGAGAAGTTACGCGGGGAGTGATAAAGGGGGGTGTTGTGGAAGGT
TTAGTACTATGAAACAAGGGCTAGCGATTCATGGGTTGTGGTATTGGTAG
mRNA sequence
ATGTCGGCCGCAAATACGCAGAGGTTGGGTCAGAGGATTCCACCACCAGTTTCGACGCAGGGAAATAACCAAAAGGCTCGTGTCTTCGCACTTACTCGCAAGGAAGCGGC
GGATGCCGAAACAGTAGTCATAGGTACTGTTTTAGTCCATGATGTGCCTGCGTATGTATTGTTTGATTCGGGGTCGAGCCACACCTTCATCTCATCTACGTTCGTTCGTC
AGGCAACCCTCGAATTAGAGCCGTTAGGGATTTTGTTGTCGGTTTCTACACCATCAGGGTCGATTTTGATCGCTAGTCAAAAGGTGAAGGCAGGTGAGTTGTCTTTTGAT
AATCAGACTCTAAGGGCAAGGCTGATCCAGCTGGACATGCAAGATTTTGACGTTATTGTGGGCATGGATTGGCTAGCTACCAACCAAGCCAACATTAATTGCTCGAGAAG
AGAAGTCTCCTTTCAACTACCTTTGGGTCGGAGCTTTACGTTTAAAGGGGTTACGAGTAGAGTCCCAAGGACAGTATCAGCGTTGAAGGCAAGACGCCTGTTGCAGAATG
GAGCTTGGGGATATTTGGCCAACGTTGTCGACATTATACCTCCTGGACAAGTGTTAGATGAGTTGGACCGTTCTGAGGTGGAGCTAGCGGTAGAAGATGTGTCAGCAGTG
CTAGCTCAACTCTCGGTCAAACCCACCCTAAGACAACGAATCATCGCTGCACAAAAGGGAGACTCCAGTCTGAGCAAGGGTTTCGTGGATGAAACATTGTGCTATAAAGA
AGTACCCGTTGGGATCGTAGTAAGAGAGACCAAAGTGCTGCAGAACCGGGTGATTGATTTGGTGAAGGTCTTGTGGAGGAACCACCAAATAGAAGAGGCCACCTGGGAGC
GAGAAGACGAATTCAGGGCCCAGTATCCTGAATTGATCGAGCAACGAACTTTCGAGGACGAAACTGCAGGCGGCATCGATACACGCGGGGCTGTGTTCGCGGCGTCCTTT
CTCCGATTCAACAAGCCTAACGACCTCGGAGTTAGATTTGGAGTACCCACACCCAAACGAAATCGATTTAGAATACCCACACCTAAGCGGGGTTGTTTTGTAGCAAGGGA
CATTGAAACGAAGCCATTGGAGATCGTATTGGACGCTTTTCGCTGCTGCAAAAACGTGGACAGCAGCGTATTGGTGGTGTTCGGCGATTATCTACATCCGTTTGAAACCC
GATTTACGCTACCCACGTCTTGGCAAGCTAGATCTAATTTACCCACACCTATACGAAGTCGTTTTGCGTGTGAGGCCGACGCAAACTTAAACACGTGGTTGAGACCAATA
CGCTGGGAAGTCATCGGTACCTTGGGAATAAACGGCAAGGACCGGTGCACAGTTCAGGCCTTGGGAATAAATGGCAAGGCCGAACGTCAAGTTTCTGGAGAGGAGTCGGA
CATCAAGTACTGGAGAAGGAGGAGGTACGGTGACTTGGGAATAAATGTCAAGAGCCGTCATGCCTCGAGAAGTTACGCGGGGAGTGATAAAGGGGGGTGTTGTGGAAGGT
TTAGTACTATGAAACAAGGGCTAGCGATTCATGGGTTGTGGTATTGGTAG
Protein sequence
MSAANTQRLGQRIPPPVSTQGNNQKARVFALTRKEAADAETVVIGTVLVHDVPAYVLFDSGSSHTFISSTFVRQATLELEPLGILLSVSTPSGSILIASQKVKAGELSFD
NQTLRARLIQLDMQDFDVIVGMDWLATNQANINCSRREVSFQLPLGRSFTFKGVTSRVPRTVSALKARRLLQNGAWGYLANVVDIIPPGQVLDELDRSEVELAVEDVSAV
LAQLSVKPTLRQRIIAAQKGDSSLSKGFVDETLCYKEVPVGIVVRETKVLQNRVIDLVKVLWRNHQIEEATWEREDEFRAQYPELIEQRTFEDETAGGIDTRGAVFAASF
LRFNKPNDLGVRFGVPTPKRNRFRIPTPKRGCFVARDIETKPLEIVLDAFRCCKNVDSSVLVVFGDYLHPFETRFTLPTSWQARSNLPTPIRSRFACEADANLNTWLRPI
RWEVIGTLGINGKDRCTVQALGINGKAERQVSGEESDIKYWRRRRYGDLGINVKSRHASRSYAGSDKGGCCGRFSTMKQGLAIHGLWYW