; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS017939 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS017939
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionReplicase polyprotein 1ab
Genome locationscaffold373:3959683..3968642
RNA-Seq ExpressionMS017939
SyntenyMS017939
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008467324.1 PREDICTED: uncharacterized protein LOC103504702 [Cucumis melo]4.1e-11989.8Show/hide
Query:  MIASTALPPWQPPLQAPLRLRR-RP-SIPLRR-VGFVQAYRR--GNSDDFGEAWNKVWRGANDGFEKFVFEARKTAERLDRRYSVSRRVSSVAQSAAERA
        MIAST LPPWQPPL APLRLRR RP  IP R  +GFVQAYRR  GN+D FGEAWNKVWRGANDGFEKFVFEARKTAERLDRRYSVSRRVSS AQS A+RA
Subjt:  MIASTALPPWQPPLQAPLRLRR-RP-SIPLRR-VGFVQAYRR--GNSDDFGEAWNKVWRGANDGFEKFVFEARKTAERLDRRYSVSRRVSSVAQSAAERA

Query:  REIDREFGIGLRWRNFTLDFSRNWPRYRRQLNEFMDTPLGKSFVTIFFLWFALSGWLFRFLIFATWILPFAGPLLIGTFANSLIIKGTCPACNREFAGYK
        REIDREF IGLRWRNFTLDFSRNWPRYRRQLNEF+DTPLGKSFVTIFFLWFALSGWLFRFLIF TWILPFAGP+L+GTFANSL+IKG CPACNREFAGYK
Subjt:  REIDREFGIGLRWRNFTLDFSRNWPRYRRQLNEFMDTPLGKSFVTIFFLWFALSGWLFRFLIFATWILPFAGPLLIGTFANSLIIKGTCPACNREFAGYK

Query:  NQIISCSGCGNIVWQPKGQGEYNSRKGNSGSKSQPNVIDVEFEEK
        NQIISC+GCGNIVWQPKGQGEYNSRKG+SGSKSQPNVIDVEFEEK
Subjt:  NQIISCSGCGNIVWQPKGQGEYNSRKGNSGSKSQPNVIDVEFEEK

XP_022159671.1 uncharacterized protein LOC111026015 [Momordica charantia]1.6e-134100Show/hide
Query:  MIASTALPPWQPPLQAPLRLRRRPSIPLRRVGFVQAYRRGNSDDFGEAWNKVWRGANDGFEKFVFEARKTAERLDRRYSVSRRVSSVAQSAAERAREIDR
        MIASTALPPWQPPLQAPLRLRRRPSIPLRRVGFVQAYRRGNSDDFGEAWNKVWRGANDGFEKFVFEARKTAERLDRRYSVSRRVSSVAQSAAERAREIDR
Subjt:  MIASTALPPWQPPLQAPLRLRRRPSIPLRRVGFVQAYRRGNSDDFGEAWNKVWRGANDGFEKFVFEARKTAERLDRRYSVSRRVSSVAQSAAERAREIDR

Query:  EFGIGLRWRNFTLDFSRNWPRYRRQLNEFMDTPLGKSFVTIFFLWFALSGWLFRFLIFATWILPFAGPLLIGTFANSLIIKGTCPACNREFAGYKNQIIS
        EFGIGLRWRNFTLDFSRNWPRYRRQLNEFMDTPLGKSFVTIFFLWFALSGWLFRFLIFATWILPFAGPLLIGTFANSLIIKGTCPACNREFAGYKNQIIS
Subjt:  EFGIGLRWRNFTLDFSRNWPRYRRQLNEFMDTPLGKSFVTIFFLWFALSGWLFRFLIFATWILPFAGPLLIGTFANSLIIKGTCPACNREFAGYKNQIIS

Query:  CSGCGNIVWQPKGQGEYNSRKGNSGSKSQPNVIDVEFEEK
        CSGCGNIVWQPKGQGEYNSRKGNSGSKSQPNVIDVEFEEK
Subjt:  CSGCGNIVWQPKGQGEYNSRKGNSGSKSQPNVIDVEFEEK

XP_022957921.1 uncharacterized protein LOC111459308 [Cucurbita moschata]2.7e-11891.43Show/hide
Query:  MIASTALPPWQPPLQAPLRL-RRRP-SIPLRR-VGFVQAYRR--GNSDDFGEAWNKVWRGANDGFEKFVFEARKTAERLDRRYSVSRRVSSVAQSAAERA
        MIAST LPPWQPPL+APLRL R RP  IPLRR VGFVQAYRR  GN+D FGE W+KVWRGANDGFEKFVFEARKTAERLDRRYSVSRRVSSVAQSAA+RA
Subjt:  MIASTALPPWQPPLQAPLRL-RRRP-SIPLRR-VGFVQAYRR--GNSDDFGEAWNKVWRGANDGFEKFVFEARKTAERLDRRYSVSRRVSSVAQSAAERA

Query:  REIDREFGIGLRWRNFTLDFSRNWPRYRRQLNEFMDTPLGKSFVTIFFLWFALSGWLFRFLIFATWILPFAGPLLIGTFANSLIIKGTCPACNREFAGYK
        REIDREFGIGLRWRNFTLDFSRNWPRYRRQLNEFMDTPLGK FVTIFFLWFALSGWLFR LIFATWILPFAGPLLIGTFANSLIIKGTCPACNREFAGYK
Subjt:  REIDREFGIGLRWRNFTLDFSRNWPRYRRQLNEFMDTPLGKSFVTIFFLWFALSGWLFRFLIFATWILPFAGPLLIGTFANSLIIKGTCPACNREFAGYK

Query:  NQIISCSGCGNIVWQPKGQGEYNSRKGNSGSKSQPNVIDVEFEEK
        NQIISC+GCGNIVWQPKGQGE  +RKG SGSKSQPNVIDVEFEEK
Subjt:  NQIISCSGCGNIVWQPKGQGEYNSRKGNSGSKSQPNVIDVEFEEK

XP_023533650.1 uncharacterized protein LOC111795447 [Cucurbita pepo subsp. pepo]2.3e-11791.02Show/hide
Query:  MIASTALPPWQPPLQAPLRL-RRRP-SIPLRR-VGFVQAYRR--GNSDDFGEAWNKVWRGANDGFEKFVFEARKTAERLDRRYSVSRRVSSVAQSAAERA
        MIAST LPPWQP L+APLRL R RP  IPLRR VGFVQAYRR  GN+D FGE W+KVWRGANDGFEKFVFEARKTAERLDRRYSVSRRVSSVAQSAA+RA
Subjt:  MIASTALPPWQPPLQAPLRL-RRRP-SIPLRR-VGFVQAYRR--GNSDDFGEAWNKVWRGANDGFEKFVFEARKTAERLDRRYSVSRRVSSVAQSAAERA

Query:  REIDREFGIGLRWRNFTLDFSRNWPRYRRQLNEFMDTPLGKSFVTIFFLWFALSGWLFRFLIFATWILPFAGPLLIGTFANSLIIKGTCPACNREFAGYK
        REIDREFGIGLRWRNFTLDFSRNWPRYRRQLNEFMDTPLGK FVTIFFLWFALSGWLFR LIFATWILPFAGPLLIGTFANSLIIKGTCPACNREFAGYK
Subjt:  REIDREFGIGLRWRNFTLDFSRNWPRYRRQLNEFMDTPLGKSFVTIFFLWFALSGWLFRFLIFATWILPFAGPLLIGTFANSLIIKGTCPACNREFAGYK

Query:  NQIISCSGCGNIVWQPKGQGEYNSRKGNSGSKSQPNVIDVEFEEK
        NQIISC+GCGNIVWQPKGQGE  +RKG SGSKSQPNVIDVEFEEK
Subjt:  NQIISCSGCGNIVWQPKGQGEYNSRKGNSGSKSQPNVIDVEFEEK

XP_038874339.1 uncharacterized protein LOC120067037 [Benincasa hispida]4.3e-12493.06Show/hide
Query:  MIASTALPPWQPPLQAPLRLRRRPS--IPLRR-VGFVQAYRR--GNSDDFGEAWNKVWRGANDGFEKFVFEARKTAERLDRRYSVSRRVSSVAQSAAERA
        MIAST LPPWQPP+QAPLRLRR  S  IP RR +GFVQAYRR  GNSD FGEAWNKVWRGANDGFEKFVFEARKTAERLDRRYSVSRRVSSVAQSAA+RA
Subjt:  MIASTALPPWQPPLQAPLRLRRRPS--IPLRR-VGFVQAYRR--GNSDDFGEAWNKVWRGANDGFEKFVFEARKTAERLDRRYSVSRRVSSVAQSAAERA

Query:  REIDREFGIGLRWRNFTLDFSRNWPRYRRQLNEFMDTPLGKSFVTIFFLWFALSGWLFRFLIFATWILPFAGPLLIGTFANSLIIKGTCPACNREFAGYK
        REIDREFGIGLRWRNFTLDFSRNWPRYRRQLNEFMDTPLGKSFVTIFFLWFALSGWLFRFLIFATWILPFAGPLLIG+FANSL+IKGTCPACNREFAGYK
Subjt:  REIDREFGIGLRWRNFTLDFSRNWPRYRRQLNEFMDTPLGKSFVTIFFLWFALSGWLFRFLIFATWILPFAGPLLIGTFANSLIIKGTCPACNREFAGYK

Query:  NQIISCSGCGNIVWQPKGQGEYNSRKGNSGSKSQPNVIDVEFEEK
        NQIISC+GCGNIVWQPKGQGEYNSRKG+SGSKSQPNVIDVEFEEK
Subjt:  NQIISCSGCGNIVWQPKGQGEYNSRKGNSGSKSQPNVIDVEFEEK

TrEMBL top hitse value%identityAlignment
A0A0A0KPH9 Uncharacterized protein2.1e-11686.64Show/hide
Query:  MIASTALPPWQPPLQAPLRLRR-RP-SIPLRR-VGFVQAYRR----GNSDDFGEAWNKVWRGANDGFEKFVFEARKTAERLDRRYSVSRRVSSVAQSAAE
        MIAST LPPWQPPLQAP RLRR RP  IP R  +GFVQAYRR    GN+D FG+AWNKVWRGANDGFEKFVFEARKTAERLDRRYSVSRRV S AQS A+
Subjt:  MIASTALPPWQPPLQAPLRLRR-RP-SIPLRR-VGFVQAYRR----GNSDDFGEAWNKVWRGANDGFEKFVFEARKTAERLDRRYSVSRRVSSVAQSAAE

Query:  RAREIDREFGIGLRWRNFTLDFSRNWPRYRRQLNEFMDTPLGKSFVTIFFLWFALSGWLFRFLIFATWILPFAGPLLIGTFANSLIIKGTCPACNREFAG
        RAREIDREF IG+RWRNFTLDFSRNWPRYRRQLNEF+DTPLGKS VTIFFLWFALSGWLFRFLIF TWILPFAGP+LIGTFANSL+IKG CPACNREFAG
Subjt:  RAREIDREFGIGLRWRNFTLDFSRNWPRYRRQLNEFMDTPLGKSFVTIFFLWFALSGWLFRFLIFATWILPFAGPLLIGTFANSLIIKGTCPACNREFAG

Query:  YKNQIISCSGCGNIVWQPKGQGEYNSRKGNSGSKSQPNVIDVEFEEK
        YKNQIISC+GCGN+VWQPK  GEYNSRKG+SGSKSQPNVIDVEFEEK
Subjt:  YKNQIISCSGCGNIVWQPKGQGEYNSRKGNSGSKSQPNVIDVEFEEK

A0A1S3CT98 uncharacterized protein LOC1035047022.0e-11989.8Show/hide
Query:  MIASTALPPWQPPLQAPLRLRR-RP-SIPLRR-VGFVQAYRR--GNSDDFGEAWNKVWRGANDGFEKFVFEARKTAERLDRRYSVSRRVSSVAQSAAERA
        MIAST LPPWQPPL APLRLRR RP  IP R  +GFVQAYRR  GN+D FGEAWNKVWRGANDGFEKFVFEARKTAERLDRRYSVSRRVSS AQS A+RA
Subjt:  MIASTALPPWQPPLQAPLRLRR-RP-SIPLRR-VGFVQAYRR--GNSDDFGEAWNKVWRGANDGFEKFVFEARKTAERLDRRYSVSRRVSSVAQSAAERA

Query:  REIDREFGIGLRWRNFTLDFSRNWPRYRRQLNEFMDTPLGKSFVTIFFLWFALSGWLFRFLIFATWILPFAGPLLIGTFANSLIIKGTCPACNREFAGYK
        REIDREF IGLRWRNFTLDFSRNWPRYRRQLNEF+DTPLGKSFVTIFFLWFALSGWLFRFLIF TWILPFAGP+L+GTFANSL+IKG CPACNREFAGYK
Subjt:  REIDREFGIGLRWRNFTLDFSRNWPRYRRQLNEFMDTPLGKSFVTIFFLWFALSGWLFRFLIFATWILPFAGPLLIGTFANSLIIKGTCPACNREFAGYK

Query:  NQIISCSGCGNIVWQPKGQGEYNSRKGNSGSKSQPNVIDVEFEEK
        NQIISC+GCGNIVWQPKGQGEYNSRKG+SGSKSQPNVIDVEFEEK
Subjt:  NQIISCSGCGNIVWQPKGQGEYNSRKGNSGSKSQPNVIDVEFEEK

A0A6J1E4L9 uncharacterized protein LOC1110260157.6e-135100Show/hide
Query:  MIASTALPPWQPPLQAPLRLRRRPSIPLRRVGFVQAYRRGNSDDFGEAWNKVWRGANDGFEKFVFEARKTAERLDRRYSVSRRVSSVAQSAAERAREIDR
        MIASTALPPWQPPLQAPLRLRRRPSIPLRRVGFVQAYRRGNSDDFGEAWNKVWRGANDGFEKFVFEARKTAERLDRRYSVSRRVSSVAQSAAERAREIDR
Subjt:  MIASTALPPWQPPLQAPLRLRRRPSIPLRRVGFVQAYRRGNSDDFGEAWNKVWRGANDGFEKFVFEARKTAERLDRRYSVSRRVSSVAQSAAERAREIDR

Query:  EFGIGLRWRNFTLDFSRNWPRYRRQLNEFMDTPLGKSFVTIFFLWFALSGWLFRFLIFATWILPFAGPLLIGTFANSLIIKGTCPACNREFAGYKNQIIS
        EFGIGLRWRNFTLDFSRNWPRYRRQLNEFMDTPLGKSFVTIFFLWFALSGWLFRFLIFATWILPFAGPLLIGTFANSLIIKGTCPACNREFAGYKNQIIS
Subjt:  EFGIGLRWRNFTLDFSRNWPRYRRQLNEFMDTPLGKSFVTIFFLWFALSGWLFRFLIFATWILPFAGPLLIGTFANSLIIKGTCPACNREFAGYKNQIIS

Query:  CSGCGNIVWQPKGQGEYNSRKGNSGSKSQPNVIDVEFEEK
        CSGCGNIVWQPKGQGEYNSRKGNSGSKSQPNVIDVEFEEK
Subjt:  CSGCGNIVWQPKGQGEYNSRKGNSGSKSQPNVIDVEFEEK

A0A6J1H0K1 uncharacterized protein LOC1114593081.3e-11891.43Show/hide
Query:  MIASTALPPWQPPLQAPLRL-RRRP-SIPLRR-VGFVQAYRR--GNSDDFGEAWNKVWRGANDGFEKFVFEARKTAERLDRRYSVSRRVSSVAQSAAERA
        MIAST LPPWQPPL+APLRL R RP  IPLRR VGFVQAYRR  GN+D FGE W+KVWRGANDGFEKFVFEARKTAERLDRRYSVSRRVSSVAQSAA+RA
Subjt:  MIASTALPPWQPPLQAPLRL-RRRP-SIPLRR-VGFVQAYRR--GNSDDFGEAWNKVWRGANDGFEKFVFEARKTAERLDRRYSVSRRVSSVAQSAAERA

Query:  REIDREFGIGLRWRNFTLDFSRNWPRYRRQLNEFMDTPLGKSFVTIFFLWFALSGWLFRFLIFATWILPFAGPLLIGTFANSLIIKGTCPACNREFAGYK
        REIDREFGIGLRWRNFTLDFSRNWPRYRRQLNEFMDTPLGK FVTIFFLWFALSGWLFR LIFATWILPFAGPLLIGTFANSLIIKGTCPACNREFAGYK
Subjt:  REIDREFGIGLRWRNFTLDFSRNWPRYRRQLNEFMDTPLGKSFVTIFFLWFALSGWLFRFLIFATWILPFAGPLLIGTFANSLIIKGTCPACNREFAGYK

Query:  NQIISCSGCGNIVWQPKGQGEYNSRKGNSGSKSQPNVIDVEFEEK
        NQIISC+GCGNIVWQPKGQGE  +RKG SGSKSQPNVIDVEFEEK
Subjt:  NQIISCSGCGNIVWQPKGQGEYNSRKGNSGSKSQPNVIDVEFEEK

A0A6J1K7K5 uncharacterized protein LOC1114913572.5e-11790.61Show/hide
Query:  MIASTALPPWQPPLQAPLRL-RRRP-SIPLRR-VGFVQAYRR--GNSDDFGEAWNKVWRGANDGFEKFVFEARKTAERLDRRYSVSRRVSSVAQSAAERA
        MIAST LPPWQP L+APLRL R RP  IPLRR VGFVQAYRR  GN+D FGE W+KVWRGANDGFEKFVFEARKTAERLDRRYSVSRRVSSVAQSAA+RA
Subjt:  MIASTALPPWQPPLQAPLRL-RRRP-SIPLRR-VGFVQAYRR--GNSDDFGEAWNKVWRGANDGFEKFVFEARKTAERLDRRYSVSRRVSSVAQSAAERA

Query:  REIDREFGIGLRWRNFTLDFSRNWPRYRRQLNEFMDTPLGKSFVTIFFLWFALSGWLFRFLIFATWILPFAGPLLIGTFANSLIIKGTCPACNREFAGYK
        REIDREFGIGLRWRNFTLDFSRNWPRYRRQLN+FMDTPLGK FVTIFFLWFALSGWLFR LIFATWILPFAGPLLIGTFANSLIIKGTCPACNREFAGYK
Subjt:  REIDREFGIGLRWRNFTLDFSRNWPRYRRQLNEFMDTPLGKSFVTIFFLWFALSGWLFRFLIFATWILPFAGPLLIGTFANSLIIKGTCPACNREFAGYK

Query:  NQIISCSGCGNIVWQPKGQGEYNSRKGNSGSKSQPNVIDVEFEEK
        NQIISC+GCGNIVWQPKGQGE  +RKG SGSKSQPNVIDVEFEEK
Subjt:  NQIISCSGCGNIVWQPKGQGEYNSRKGNSGSKSQPNVIDVEFEEK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G44870.1 unknown protein4.3e-7461.71Show/hide
Query:  VQAYRRGNSDDF------GEAWNKVWRGANDGFEKFVFEARKTAERLDRRYSVSRRVSSVAQSAAERAREIDREFGIGLRWRNFTLDFSRNWPRYRRQLN
        V+A++RG+ D        G+AW   WR ANDGFE+FVFEA+KTAER+DR+Y+VSRR SS A SAA+RAREIDREFGI  R R  + DFSRN+P+YR+Q +
Subjt:  VQAYRRGNSDDF------GEAWNKVWRGANDGFEKFVFEARKTAERLDRRYSVSRRVSSVAQSAAERAREIDREFGIGLRWRNFTLDFSRNWPRYRRQLN

Query:  EFMDTPLGKSFVTIFFLWFALSGWLFRFLIFATWILPFAGPLLIGTFANSLIIKGTCPACNREFAGYKNQIISCSGCGNIVWQPKG---------QGEYN
         F++TPLG SF TIFFLWFALSGWLFR +I ATW+LP AGPLLIG  AN+ +IKG CPAC R+F GYKNQII C GCGNIVWQP+G             N
Subjt:  EFMDTPLGKSFVTIFFLWFALSGWLFRFLIFATWILPFAGPLLIGTFANSLIIKGTCPACNREFAGYKNQIISCSGCGNIVWQPKG---------QGEYN

Query:  SRKGNSGSKSQPNVIDVEFEEK
        + KGNS    +  +IDV+FEEK
Subjt:  SRKGNSGSKSQPNVIDVEFEEK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATAGCTTCAACAGCTCTGCCGCCATGGCAGCCACCGCTTCAAGCTCCGTTAAGGCTGAGGAGGAGGCCCTCAATCCCTCTGCGGAGGGTTGGCTTCGTTCAGGCGTA
CCGTCGCGGGAACAGCGATGATTTTGGCGAGGCCTGGAATAAAGTGTGGCGAGGCGCCAACGATGGTTTCGAGAAATTCGTATTCGAGGCGAGGAAAACCGCGGAGCGCC
TCGACAGGCGCTACTCTGTATCGCGCCGTGTTAGTTCTGTTGCCCAATCAGCGGCCGAGCGGGCGCGCGAGATTGATAGGGAGTTCGGGATTGGATTGCGTTGGCGTAAT
TTTACATTGGATTTTAGCAGAAATTGGCCAAGGTATAGGAGGCAACTCAATGAATTTATGGACACGCCATTAGGGAAAAGTTTTGTGACAATATTCTTCCTTTGGTTTGC
ATTATCTGGATGGCTTTTCCGATTCTTAATATTTGCAACGTGGATACTACCTTTTGCCGGTCCACTTCTCATTGGGACTTTTGCCAATAGCCTTATAATAAAGGGTACTT
GTCCTGCCTGTAATAGGGAATTTGCTGGGTACAAGAACCAAATTATTTCCTGCAGTGGCTGTGGAAACATAGTGTGGCAGCCTAAAGGCCAAGGGGAATACAATTCAAGA
AAAGGTAATTCTGGTTCTAAGTCACAACCCAATGTTATTGACGTGGAGTTTGAGGAGAAA
mRNA sequenceShow/hide mRNA sequence
ATGATAGCTTCAACAGCTCTGCCGCCATGGCAGCCACCGCTTCAAGCTCCGTTAAGGCTGAGGAGGAGGCCCTCAATCCCTCTGCGGAGGGTTGGCTTCGTTCAGGCGTA
CCGTCGCGGGAACAGCGATGATTTTGGCGAGGCCTGGAATAAAGTGTGGCGAGGCGCCAACGATGGTTTCGAGAAATTCGTATTCGAGGCGAGGAAAACCGCGGAGCGCC
TCGACAGGCGCTACTCTGTATCGCGCCGTGTTAGTTCTGTTGCCCAATCAGCGGCCGAGCGGGCGCGCGAGATTGATAGGGAGTTCGGGATTGGATTGCGTTGGCGTAAT
TTTACATTGGATTTTAGCAGAAATTGGCCAAGGTATAGGAGGCAACTCAATGAATTTATGGACACGCCATTAGGGAAAAGTTTTGTGACAATATTCTTCCTTTGGTTTGC
ATTATCTGGATGGCTTTTCCGATTCTTAATATTTGCAACGTGGATACTACCTTTTGCCGGTCCACTTCTCATTGGGACTTTTGCCAATAGCCTTATAATAAAGGGTACTT
GTCCTGCCTGTAATAGGGAATTTGCTGGGTACAAGAACCAAATTATTTCCTGCAGTGGCTGTGGAAACATAGTGTGGCAGCCTAAAGGCCAAGGGGAATACAATTCAAGA
AAAGGTAATTCTGGTTCTAAGTCACAACCCAATGTTATTGACGTGGAGTTTGAGGAGAAA
Protein sequenceShow/hide protein sequence
MIASTALPPWQPPLQAPLRLRRRPSIPLRRVGFVQAYRRGNSDDFGEAWNKVWRGANDGFEKFVFEARKTAERLDRRYSVSRRVSSVAQSAAERAREIDREFGIGLRWRN
FTLDFSRNWPRYRRQLNEFMDTPLGKSFVTIFFLWFALSGWLFRFLIFATWILPFAGPLLIGTFANSLIIKGTCPACNREFAGYKNQIISCSGCGNIVWQPKGQGEYNSR
KGNSGSKSQPNVIDVEFEEK