; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc03g14100 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc03g14100
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr3:9493164..9498516
RNA-Seq ExpressionMoc03g14100
SyntenyMoc03g14100
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022149381.1 uncharacterized protein LOC111017811 [Momordica charantia]3.3e-2743.27Show/hide
Query:  EVQELAAFLGFPVALLPVRYLGVPLFSRQLSYSDCKPQLERIVSRIRNWLACVF------RLLEGGL------------GVRHLAYLEFYHYNDAALASY
        EV++LAAF  F V   PVRYLGVPLFS +LS+ DC+P LE+IVSR+ +W A +F      +L++  L                L +LEF HYN+AAL + 
Subjt:  EVQELAAFLGFPVALLPVRYLGVPLFSRQLSYSDCKPQLERIVSRIRNWLACVF------RLLEGGL------------GVRHLAYLEFYHYNDAALASY

Query:  YVGWISL-----DGLILFLSGMLFGLYSAYK-DMVAYFLFPDGSWHWPQGSVELRQFVSETSSVSVFVWRDDLPIWVPSLSGLFSVWSAKGVLRPTRPPV
        YVGW+ +      GLI FLS MLF L+   + +MV+  LF           + L  ++S+ S      WR+D+ +W+P  SGLFSV SA  VLRPT+P V
Subjt:  YVGWISL-----DGLILFLSGMLFGLYSAYK-DMVAYFLFPDGSWHWPQGSVELRQFVSETSSVSVFVWRDDLPIWVPSLSGLFSVWSAKGVLRPTRPPV

Query:  LCF--LWF
          F  LWF
Subjt:  LCF--LWF

XP_022157428.1 uncharacterized protein LOC111024128 [Momordica charantia]5.5e-2247.69Show/hide
Query:  VREWGVSDHCPLIFSIGAIVSRRQLSFRFLDYWVADPHFLSSVWATWKSYPQVSPI-----------------------------ETRVRMIVAQALLLQ
        VREWGVSDHCPL+F +G +V R + SFRF DYW  DP F S V  TW+ + QVSP+                             E R  M+ AQALLL 
Subjt:  VREWGVSDHCPLIFSIGAIVSRRQLSFRFLDYWVADPHFLSSVWATWKSYPQVSPI-----------------------------ETRVRMIVAQALLLQ

Query:  DPSSESAQEEERFTTRDLWVWAGKEEESLR
        DPSS  AQEEER  +RD W W   EE SLR
Subjt:  DPSSESAQEEERFTTRDLWVWAGKEEESLR

XP_022158198.1 uncharacterized protein LOC111024735 [Momordica charantia]1.8e-1253.85Show/hide
Query:  RILLNSMCRGPSLLGLISSWEMVREWGVSDHCPLIFSIGAIVSRRQLSFRFLDYWVADPHFLSSVWATWKSYPQVSPI
        R+L+NS   G  L+  +     VREWGVSDHCPL+F +G  V R + SFRF DYW  DP FLS V  TWK + QVSP+
Subjt:  RILLNSMCRGPSLLGLISSWEMVREWGVSDHCPLIFSIGAIVSRRQLSFRFLDYWVADPHFLSSVWATWKSYPQVSPI

XP_022158199.1 uncharacterized protein LOC111024737 [Momordica charantia]1.1e-2244.44Show/hide
Query:  RILLNSMCRGPSLLGLISSWEM-VREWGVSDHCPLIFSIGAIVSRRQLSFRFLDYWVADPHFLSSVWATWKSYPQVSPI---------------------
        R+L+NS     +  G +  +E+ VREWGVSDHCPL+F +G +V + + SF+F DYW +DP FLS V  TW+ + QVSP+                     
Subjt:  RILLNSMCRGPSLLGLISSWEM-VREWGVSDHCPLIFSIGAIVSRRQLSFRFLDYWVADPHFLSSVWATWKSYPQVSPI---------------------

Query:  --------ETRVRMIVAQALLLQDPSSESAQEEERFTTRDLWVWAGKEEESLR
                E R RM+  QALLL  PSS  AQEEER  TRD W WA  EE SLR
Subjt:  --------ETRVRMIVAQALLLQDPSSESAQEEERFTTRDLWVWAGKEEESLR

XP_022159081.1 uncharacterized protein LOC111025522 [Momordica charantia]1.8e-2540.49Show/hide
Query:  SIRHSSEVLGRRP------DFSETALPVLL----------RFCFKRILLNSMCRGPSLLGLISSW--------EMVREWGVSDHCPLIFSIGAIVSRRQL
        +IRHSSEVLG  P      DF +  L   L           +  K +    + +G   + + S+W          VREWGVSDHCPL+F +GA+V R + 
Subjt:  SIRHSSEVLGRRP------DFSETALPVLL----------RFCFKRILLNSMCRGPSLLGLISSW--------EMVREWGVSDHCPLIFSIGAIVSRRQL

Query:  SFRFLDYWVADPHFLSSVWATWKSYPQVSPI-----------------------------ETRVRMIVAQALLLQDPSSESAQEEERFTTRDLWVWAGKE
        SFRF DYW AD   LS V  TWK + QVSP+                             E R RM+ AQALLL DPSS  AQEEER  +RD W WA  E
Subjt:  SFRFLDYWVADPHFLSSVWATWKSYPQVSPI-----------------------------ETRVRMIVAQALLLQDPSSESAQEEERFTTRDLWVWAGKE

Query:  EESLR
        + SLR
Subjt:  EESLR

TrEMBL top hitse value%identityAlignment
A0A6J1D875 uncharacterized protein LOC1110178111.6e-2743.27Show/hide
Query:  EVQELAAFLGFPVALLPVRYLGVPLFSRQLSYSDCKPQLERIVSRIRNWLACVF------RLLEGGL------------GVRHLAYLEFYHYNDAALASY
        EV++LAAF  F V   PVRYLGVPLFS +LS+ DC+P LE+IVSR+ +W A +F      +L++  L                L +LEF HYN+AAL + 
Subjt:  EVQELAAFLGFPVALLPVRYLGVPLFSRQLSYSDCKPQLERIVSRIRNWLACVF------RLLEGGL------------GVRHLAYLEFYHYNDAALASY

Query:  YVGWISL-----DGLILFLSGMLFGLYSAYK-DMVAYFLFPDGSWHWPQGSVELRQFVSETSSVSVFVWRDDLPIWVPSLSGLFSVWSAKGVLRPTRPPV
        YVGW+ +      GLI FLS MLF L+   + +MV+  LF           + L  ++S+ S      WR+D+ +W+P  SGLFSV SA  VLRPT+P V
Subjt:  YVGWISL-----DGLILFLSGMLFGLYSAYK-DMVAYFLFPDGSWHWPQGSVELRQFVSETSSVSVFVWRDDLPIWVPSLSGLFSVWSAKGVLRPTRPPV

Query:  LCF--LWF
          F  LWF
Subjt:  LCF--LWF

A0A6J1DTC3 uncharacterized protein LOC1110241282.6e-2247.69Show/hide
Query:  VREWGVSDHCPLIFSIGAIVSRRQLSFRFLDYWVADPHFLSSVWATWKSYPQVSPI-----------------------------ETRVRMIVAQALLLQ
        VREWGVSDHCPL+F +G +V R + SFRF DYW  DP F S V  TW+ + QVSP+                             E R  M+ AQALLL 
Subjt:  VREWGVSDHCPLIFSIGAIVSRRQLSFRFLDYWVADPHFLSSVWATWKSYPQVSPI-----------------------------ETRVRMIVAQALLLQ

Query:  DPSSESAQEEERFTTRDLWVWAGKEEESLR
        DPSS  AQEEER  +RD W W   EE SLR
Subjt:  DPSSESAQEEERFTTRDLWVWAGKEEESLR

A0A6J1DYP6 uncharacterized protein LOC1110247375.3e-2344.44Show/hide
Query:  RILLNSMCRGPSLLGLISSWEM-VREWGVSDHCPLIFSIGAIVSRRQLSFRFLDYWVADPHFLSSVWATWKSYPQVSPI---------------------
        R+L+NS     +  G +  +E+ VREWGVSDHCPL+F +G +V + + SF+F DYW +DP FLS V  TW+ + QVSP+                     
Subjt:  RILLNSMCRGPSLLGLISSWEM-VREWGVSDHCPLIFSIGAIVSRRQLSFRFLDYWVADPHFLSSVWATWKSYPQVSPI---------------------

Query:  --------ETRVRMIVAQALLLQDPSSESAQEEERFTTRDLWVWAGKEEESLR
                E R RM+  QALLL  PSS  AQEEER  TRD W WA  EE SLR
Subjt:  --------ETRVRMIVAQALLLQDPSSESAQEEERFTTRDLWVWAGKEEESLR

A0A6J1E271 uncharacterized protein LOC1110253248.5e-1347.13Show/hide
Query:  MLFGLYSAYKDMVAYFLFPDGSWHWPQGSVELRQFVSETSSVSVFVWRDDLPIWVPSLSGLFSVWSAKGVLRPTRPPVLCF--LWFG
        +++ + S+    V  FL PDGSW WP+ SV+L + + E  SV   V ++D  +W P++SGLFSV S  G+LRP RPPV  F  LWFG
Subjt:  MLFGLYSAYKDMVAYFLFPDGSWHWPQGSVELRQFVSETSSVSVFVWRDDLPIWVPSLSGLFSVWSAKGVLRPTRPPVLCF--LWFG

A0A6J1E2U5 uncharacterized protein LOC1110255228.8e-2640.49Show/hide
Query:  SIRHSSEVLGRRP------DFSETALPVLL----------RFCFKRILLNSMCRGPSLLGLISSW--------EMVREWGVSDHCPLIFSIGAIVSRRQL
        +IRHSSEVLG  P      DF +  L   L           +  K +    + +G   + + S+W          VREWGVSDHCPL+F +GA+V R + 
Subjt:  SIRHSSEVLGRRP------DFSETALPVLL----------RFCFKRILLNSMCRGPSLLGLISSW--------EMVREWGVSDHCPLIFSIGAIVSRRQL

Query:  SFRFLDYWVADPHFLSSVWATWKSYPQVSPI-----------------------------ETRVRMIVAQALLLQDPSSESAQEEERFTTRDLWVWAGKE
        SFRF DYW AD   LS V  TWK + QVSP+                             E R RM+ AQALLL DPSS  AQEEER  +RD W WA  E
Subjt:  SFRFLDYWVADPHFLSSVWATWKSYPQVSPI-----------------------------ETRVRMIVAQALLLQDPSSESAQEEERFTTRDLWVWAGKE

Query:  EESLR
        + SLR
Subjt:  EESLR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G24255.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein9.3e-0440Show/hide
Query:  IKEVQELAAFLGFPVA--LLPVRYLGVPLFSRQLSYSDCKPQLERIVSRIRNWLA
        +K+  +      FP A   LPVRYLG+PL +++++ SD  P +E+I  RI  W A
Subjt:  IKEVQELAAFLGFPVA--LLPVRYLGVPLFSRQLSYSDCKPQLERIVSRIRNWLA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGCGATTGCGCACGGCAAGCGAACACTTGGCATGGGTGCTGCCTTGTGAGTTTGTCCATTCGTCATTCTTCTGAGGTGCTCGGTCGTAGACCAGATTTTTCT
GAAACGGCCCTGCCTGTTTTGTTGAGATTTTGCTTCAAGCGAATCTTGTTGAACAGCATGTGTCGAGGCCCTAGTTTACTTGGACTAATAAGCAGTTGGGAGATG
GTCCGAGAGTGGGGGGTTTCTGATCATTGCCCTCTTATTTTTTCTATTGGTGCCATTGTTTCGAGGCGTCAGCTTTCTTTTCGGTTTCTTGATTACTGGGTTGCC
GATCCACACTTTTTATCGTCCGTTTGGGCTACGTGGAAATCGTATCCTCAGGTTTCTCCTATTGAGACGCGTGTCCGGATGATTGTTGCCCAAGCTTTGCTCTTA
CAGGATCCATCTTCTGAGTCCGCTCAGGAGGAGGAGAGGTTCACCACTCGTGATTTATGGGTTTGGGCTGGGAAGGAGGAGGAGTCACTACGCCTTTCTATTAAG
GAGGTGCAGGAGTTGGCTGCCTTTTTAGGGTTTCCGGTGGCTTTGCTACCAGTACGTTATCTTGGGGTCCCGCTCTTTTCGCGTCAGCTGTCCTATAGTGATTGT
AAGCCCCAGCTGGAGAGAATTGTCTCTCGTATTCGAAATTGGTTGGCTTGTGTTTTTCGTTTGCTGGAGGGTGGTTTGGGTGTGCGTCATTTGGCCTATTTGGAA
TTCTACCACTATAATGATGCTGCTTTGGCTTCTTATTACGTGGGCTGGATCTCTTTGGATGGCTTGATATTATTTCTGTCTGGGATGCTTTTTGGCCTCTACTCG
GCTTACAAGGACATGGTTGCTTATTTTCTATTTCCCGATGGTTCGTGGCACTGGCCTCAAGGGTCGGTTGAGCTTCGACAGTTCGTCTCTGAGACTTCATCTGTG
TCGGTTTTTGTTTGGAGGGATGATTTGCCCATTTGGGTTCCGTCACTGTCTGGTCTCTTCTCGGTGTGGAGTGCAAAGGGTGTGTTGCGGCCGACCCGACCTCCT
GTTCTTTGTTTTCTTTGGTTTGGTTTGATGGGAGTATTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGCGATTGCGCACGGCAAGCGAACACTTGGCATGGGTGCTGCCTTGTGAGTTTGTCCATTCGTCATTCTTCTGAGGTGCTCGGTCGTAGACCAGATTTTTCT
GAAACGGCCCTGCCTGTTTTGTTGAGATTTTGCTTCAAGCGAATCTTGTTGAACAGCATGTGTCGAGGCCCTAGTTTACTTGGACTAATAAGCAGTTGGGAGATG
GTCCGAGAGTGGGGGGTTTCTGATCATTGCCCTCTTATTTTTTCTATTGGTGCCATTGTTTCGAGGCGTCAGCTTTCTTTTCGGTTTCTTGATTACTGGGTTGCC
GATCCACACTTTTTATCGTCCGTTTGGGCTACGTGGAAATCGTATCCTCAGGTTTCTCCTATTGAGACGCGTGTCCGGATGATTGTTGCCCAAGCTTTGCTCTTA
CAGGATCCATCTTCTGAGTCCGCTCAGGAGGAGGAGAGGTTCACCACTCGTGATTTATGGGTTTGGGCTGGGAAGGAGGAGGAGTCACTACGCCTTTCTATTAAG
GAGGTGCAGGAGTTGGCTGCCTTTTTAGGGTTTCCGGTGGCTTTGCTACCAGTACGTTATCTTGGGGTCCCGCTCTTTTCGCGTCAGCTGTCCTATAGTGATTGT
AAGCCCCAGCTGGAGAGAATTGTCTCTCGTATTCGAAATTGGTTGGCTTGTGTTTTTCGTTTGCTGGAGGGTGGTTTGGGTGTGCGTCATTTGGCCTATTTGGAA
TTCTACCACTATAATGATGCTGCTTTGGCTTCTTATTACGTGGGCTGGATCTCTTTGGATGGCTTGATATTATTTCTGTCTGGGATGCTTTTTGGCCTCTACTCG
GCTTACAAGGACATGGTTGCTTATTTTCTATTTCCCGATGGTTCGTGGCACTGGCCTCAAGGGTCGGTTGAGCTTCGACAGTTCGTCTCTGAGACTTCATCTGTG
TCGGTTTTTGTTTGGAGGGATGATTTGCCCATTTGGGTTCCGTCACTGTCTGGTCTCTTCTCGGTGTGGAGTGCAAAGGGTGTGTTGCGGCCGACCCGACCTCCT
GTTCTTTGTTTTCTTTGGTTTGGTTTGATGGGAGTATTTTGA
Protein sequenceShow/hide protein sequence
MGDCARQANTWHGCCLVSLSIRHSSEVLGRRPDFSETALPVLLRFCFKRILLNSMCRGPSLLGLISSWEMVREWGVSDHCPLIFSIGAIVSRRQLSFRFLDYWVA
DPHFLSSVWATWKSYPQVSPIETRVRMIVAQALLLQDPSSESAQEEERFTTRDLWVWAGKEEESLRLSIKEVQELAAFLGFPVALLPVRYLGVPLFSRQLSYSDC
KPQLERIVSRIRNWLACVFRLLEGGLGVRHLAYLEFYHYNDAALASYYVGWISLDGLILFLSGMLFGLYSAYKDMVAYFLFPDGSWHWPQGSVELRQFVSETSSV
SVFVWRDDLPIWVPSLSGLFSVWSAKGVLRPTRPPVLCFLWFGLMGVF