; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc01g04150 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc01g04150
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrotran_gag_3 domain-containing protein
Genome locationchr1:2700448..2701053
RNA-Seq ExpressionMoc01g04150
SyntenyMoc01g04150
Gene Ontology termsNA
InterPro domainsIPR029472 - Retrotransposon Copia-like, N-terminal


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022154608.1 uncharacterized protein LOC111021831 [Momordica charantia]1.8e-3856.55Show/hide
Query:  MTTVPPVDYVLPPSNPIGSSSSELLQPVSDSSSQ------SPYYLYHSDKSNLVLVTELLTDDNYVFWSRSMLIALSIKNKLGFIDGTLPRPIDALLSAW
        M T PP   V  P +PI     +   P S SS        +PYYL+H+D + LVLVT+ LT++NY  WSRSMLIALSIKNKLGFIDG++ RPI  LL AW
Subjt:  MTTVPPVDYVLPPSNPIGSSSSELLQPVSDSSSQ------SPYYLYHSDKSNLVLVTELLTDDNYVFWSRSMLIALSIKNKLGFIDGTLPRPIDALLSAW

Query:  TRNNHVVIAWIINSIS--ISASLIFSDSTRDIWFDFKERFQHKNGLRIFQLNRELATIIDNLYQCISL
          NNHVVIAWI+NS+S  IS+S++FS+S RDIW D KERF+  NG RIFQL R+LA +  N  Q +SL
Subjt:  TRNNHVVIAWIINSIS--ISASLIFSDSTRDIWFDFKERFQHKNGLRIFQLNRELATIIDNLYQCISL

XP_022156861.1 uncharacterized protein LOC111023702 [Momordica charantia]5.3e-3554.3Show/hide
Query:  DYVLPPSNPIGSSSSELLQPVS------DSSSQSPYYLYHSDKSNLVLVTELLTDDNYVFWSRSMLIALSIKNKLGFIDGTLPRPIDALLSAWTRNNHVV
        D +  P  P G S      PVS      D+S  +PYYL+H+D + LV V +LLT+DNY  WSRSM+I LS+KNKL FIDG +PRP   LL AW  NNH+V
Subjt:  DYVLPPSNPIGSSSSELLQPVS------DSSSQSPYYLYHSDKSNLVLVTELLTDDNYVFWSRSMLIALSIKNKLGFIDGTLPRPIDALLSAWTRNNHVV

Query:  IAWIINSIS--ISASLIFSDSTRDIWFDFKERFQHKNGLRIFQLNRELATI
        IAWI+NS+S  ISAS++FS+S RDIW D  ERF+  N   I+QL R LAT+
Subjt:  IAWIINSIS--ISASLIFSDSTRDIWFDFKERFQHKNGLRIFQLNRELATI

XP_031736904.1 uncharacterized protein LOC105434586 isoform X1 [Cucumis sativus]5.3e-3549.07Show/hide
Query:  NPIGSSSSELLQPVSDSSS--QSPYYLYHSDKSNLVLVTELLTDDNYVFWSRSMLIALSIKNKLGFIDGTLPRPIDALLSAWTRNNHVVIAWIINSIS--
        NP  + S    Q   D +    +PY+L+H+D +NLVLVTE LT++NYV WSR+M I LS+KNK+GF+DGT+ RP   LL  W RNN++VI+WI+NS+S  
Subjt:  NPIGSSSSELLQPVSDSSS--QSPYYLYHSDKSNLVLVTELLTDDNYVFWSRSMLIALSIKNKLGFIDGTLPRPIDALLSAWTRNNHVVIAWIINSIS--

Query:  ISASLIFSDSTRDIWFDFKERFQHKNGLRIFQLNRELATIIDNLYQCISLALRVFGTNTLH
        ISA+++FSD  R IW + KERFQ KN  RIFQL R LAT+  N     + +   +G+++++
Subjt:  ISASLIFSDSTRDIWFDFKERFQHKNGLRIFQLNRELATIIDNLYQCISLALRVFGTNTLH

XP_031736905.1 uncharacterized protein LOC105434586 isoform X2 [Cucumis sativus]5.3e-3549.07Show/hide
Query:  NPIGSSSSELLQPVSDSSS--QSPYYLYHSDKSNLVLVTELLTDDNYVFWSRSMLIALSIKNKLGFIDGTLPRPIDALLSAWTRNNHVVIAWIINSIS--
        NP  + S    Q   D +    +PY+L+H+D +NLVLVTE LT++NYV WSR+M I LS+KNK+GF+DGT+ RP   LL  W RNN++VI+WI+NS+S  
Subjt:  NPIGSSSSELLQPVSDSSS--QSPYYLYHSDKSNLVLVTELLTDDNYVFWSRSMLIALSIKNKLGFIDGTLPRPIDALLSAWTRNNHVVIAWIINSIS--

Query:  ISASLIFSDSTRDIWFDFKERFQHKNGLRIFQLNRELATIIDNLYQCISLALRVFGTNTLH
        ISA+++FSD  R IW + KERFQ KN  RIFQL R LAT+  N     + +   +G+++++
Subjt:  ISASLIFSDSTRDIWFDFKERFQHKNGLRIFQLNRELATIIDNLYQCISLALRVFGTNTLH

XP_031736906.1 uncharacterized protein LOC105434586 isoform X3 [Cucumis sativus]5.3e-3549.07Show/hide
Query:  NPIGSSSSELLQPVSDSSS--QSPYYLYHSDKSNLVLVTELLTDDNYVFWSRSMLIALSIKNKLGFIDGTLPRPIDALLSAWTRNNHVVIAWIINSIS--
        NP  + S    Q   D +    +PY+L+H+D +NLVLVTE LT++NYV WSR+M I LS+KNK+GF+DGT+ RP   LL  W RNN++VI+WI+NS+S  
Subjt:  NPIGSSSSELLQPVSDSSS--QSPYYLYHSDKSNLVLVTELLTDDNYVFWSRSMLIALSIKNKLGFIDGTLPRPIDALLSAWTRNNHVVIAWIINSIS--

Query:  ISASLIFSDSTRDIWFDFKERFQHKNGLRIFQLNRELATIIDNLYQCISLALRVFGTNTLH
        ISA+++FSD  R IW + KERFQ KN  RIFQL R LAT+  N     + +   +G+++++
Subjt:  ISASLIFSDSTRDIWFDFKERFQHKNGLRIFQLNRELATIIDNLYQCISLALRVFGTNTLH

TrEMBL top hitse value%identityAlignment
A0A6J1CMF8 uncharacterized protein LOC1110124681.1e-3349.69Show/hide
Query:  GSSSSELLQPVSDSS----SQSPYYLYHSDKSNLVLVTELLTDDNYVFWSRSMLIALSIKNKLGFIDGTLPRPIDALLSAWTRNNHVVIAWIINSIS--I
        G SSS     V+ +S      +PYYL+HSD ++LVLV++LL ++NY  WSRSM+IAL++KNK+GF+DG++ RP  A +++W   N+VVIAW++NS+S  I
Subjt:  GSSSSELLQPVSDSS----SQSPYYLYHSDKSNLVLVTELLTDDNYVFWSRSMLIALSIKNKLGFIDGTLPRPIDALLSAWTRNNHVVIAWIINSIS--I

Query:  SASLIFSDSTRDIWFDFKERFQHKNGLRIFQLNRELATIIDNLYQCISLALRVFGTNTL
        SAS++FSDS RDIW D +ER+Q KN  RIFQL REL+ +  +      L+L  + T T+
Subjt:  SASLIFSDSTRDIWFDFKERFQHKNGLRIFQLNRELATIIDNLYQCISLALRVFGTNTL

A0A6J1DKR8 uncharacterized protein LOC1110218318.5e-3956.55Show/hide
Query:  MTTVPPVDYVLPPSNPIGSSSSELLQPVSDSSSQ------SPYYLYHSDKSNLVLVTELLTDDNYVFWSRSMLIALSIKNKLGFIDGTLPRPIDALLSAW
        M T PP   V  P +PI     +   P S SS        +PYYL+H+D + LVLVT+ LT++NY  WSRSMLIALSIKNKLGFIDG++ RPI  LL AW
Subjt:  MTTVPPVDYVLPPSNPIGSSSSELLQPVSDSSSQ------SPYYLYHSDKSNLVLVTELLTDDNYVFWSRSMLIALSIKNKLGFIDGTLPRPIDALLSAW

Query:  TRNNHVVIAWIINSIS--ISASLIFSDSTRDIWFDFKERFQHKNGLRIFQLNRELATIIDNLYQCISL
          NNHVVIAWI+NS+S  IS+S++FS+S RDIW D KERF+  NG RIFQL R+LA +  N  Q +SL
Subjt:  TRNNHVVIAWIINSIS--ISASLIFSDSTRDIWFDFKERFQHKNGLRIFQLNRELATIIDNLYQCISL

A0A6J1DLQ9 uncharacterized protein LOC1110221172.4e-3364.91Show/hide
Query:  LYHSDKSNLVLVTELLTDDNYVFWSRSMLIALSIKNKLGFIDGTLPRPIDALLSAWTRNNHVVIAWIINSIS--ISASLIFSDSTRDIWFDFKERFQHKN
        ++H+D SNLVLV++ LT+ NYV WSRSM IALSIKNKLGFI+G+LP+P   LL  W RN HVVIAW +NS+S  ISASLIF++ST +IW D K+RFQ +N
Subjt:  LYHSDKSNLVLVTELLTDDNYVFWSRSMLIALSIKNKLGFIDGTLPRPIDALLSAWTRNNHVVIAWIINSIS--ISASLIFSDSTRDIWFDFKERFQHKN

Query:  GLRIFQLNRELATI
        G +IFQL R+LAT+
Subjt:  GLRIFQLNRELATI

A0A6J1DNP7 uncharacterized protein LOC1110220654.1e-3354.69Show/hide
Query:  LQPVSDSSSQSPYYLYHSDKSNLVLVTELLTDDNYVFWSRSMLIALSIKNKLGFIDGTLPRPIDALLSAWTRNNHVVIAWIINSIS--ISASLIFSDSTR
        + P+      +PY+L+HSD ++LVLV++LLTD+NY  WSRS++IAL++KNK+GF+DG++ RP D  L +W   N+VVI+WI NS+S  ISAS++FSDS  
Subjt:  LQPVSDSSSQSPYYLYHSDKSNLVLVTELLTDDNYVFWSRSMLIALSIKNKLGFIDGTLPRPIDALLSAWTRNNHVVIAWIINSIS--ISASLIFSDSTR

Query:  DIWFDFKERFQHKNGLRIFQLNRELATI
        +IW D KERFQ +N  RIFQL REL+ +
Subjt:  DIWFDFKERFQHKNGLRIFQLNRELATI

A0A6J1DW89 uncharacterized protein LOC1110237022.6e-3554.3Show/hide
Query:  DYVLPPSNPIGSSSSELLQPVS------DSSSQSPYYLYHSDKSNLVLVTELLTDDNYVFWSRSMLIALSIKNKLGFIDGTLPRPIDALLSAWTRNNHVV
        D +  P  P G S      PVS      D+S  +PYYL+H+D + LV V +LLT+DNY  WSRSM+I LS+KNKL FIDG +PRP   LL AW  NNH+V
Subjt:  DYVLPPSNPIGSSSSELLQPVS------DSSSQSPYYLYHSDKSNLVLVTELLTDDNYVFWSRSMLIALSIKNKLGFIDGTLPRPIDALLSAWTRNNHVV

Query:  IAWIINSIS--ISASLIFSDSTRDIWFDFKERFQHKNGLRIFQLNRELATI
        IAWI+NS+S  ISAS++FS+S RDIW D  ERF+  N   I+QL R LAT+
Subjt:  IAWIINSIS--ISASLIFSDSTRDIWFDFKERFQHKNGLRIFQLNRELATI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).1.2e-1332.61Show/hide
Query:  SELLQPVSDSSS-QSPYYL----YHSDKSNLVLVTELLTDDNYVFWSRSMLIALSIKNKLGFIDGTLPR--PIDALLSAWTRNNHVVIAWIINSIS--IS
        +E ++ VS +S   SPYYL    +H    ++  +++   +DNYV W       L +  K GFIDGTLP+  P   L   W + N +V+ W++NS++  + 
Subjt:  SELLQPVSDSSS-QSPYYL----YHSDKSNLVLVTELLTDDNYVFWSRSMLIALSIKNKLGFIDGTLPR--PIDALLSAWTRNNHVVIAWIINSIS--IS

Query:  ASLIFSDSTRDIWFDFKERFQHKNGLRIFQLNRELATI
         S++++++   +W D +  F     L+I+QL R LAT+
Subjt:  ASLIFSDSTRDIWFDFKERFQHKNGLRIFQLNRELATI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACCACTGTCCCACCAGTTGATTATGTTCTTCCTCCGTCTAATCCGATTGGATCTTCTTCATCGGAATTACTTCAACCTGTTTCTGATTCTTCTTCTCAAAGTCCTTA
TTACCTTTATCACAGTGATAAATCTAATTTGGTTCTCGTTACGGAATTGCTCACCGATGACAATTACGTTTTTTGGAGTCGATCCATGCTCATTGCACTTTCCATCAAGA
ATAAGTTAGGGTTTATCGACGGTACGTTGCCACGACCTATTGATGCTCTTCTATCTGCCTGGACTCGCAATAATCATGTGGTTATTGCTTGGATTATTAATTCTATTTCG
ATTTCTGCAAGTCTCATTTTCTCTGATTCGACGCGCGATATTTGGTTTGATTTCAAGGAGCGATTTCAACACAAGAACGGCCTTAGAATTTTTCAACTCAATCGTGAATT
GGCTACCATTATTGACAATCTATATCAATGTATTTCACTTGCCTTAAGAGTGTTTGGGACGAATACATTACATATCGCCCTGCCTGTTCCTGTGGAAAGTGTTCATGTGA
TGGAGTTAAATCTATGGAGGAATTTGTTCAATTTGAATACCTCATGTGTTTCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGACCACTGTCCCACCAGTTGATTATGTTCTTCCTCCGTCTAATCCGATTGGATCTTCTTCATCGGAATTACTTCAACCTGTTTCTGATTCTTCTTCTCAAAGTCCTTA
TTACCTTTATCACAGTGATAAATCTAATTTGGTTCTCGTTACGGAATTGCTCACCGATGACAATTACGTTTTTTGGAGTCGATCCATGCTCATTGCACTTTCCATCAAGA
ATAAGTTAGGGTTTATCGACGGTACGTTGCCACGACCTATTGATGCTCTTCTATCTGCCTGGACTCGCAATAATCATGTGGTTATTGCTTGGATTATTAATTCTATTTCG
ATTTCTGCAAGTCTCATTTTCTCTGATTCGACGCGCGATATTTGGTTTGATTTCAAGGAGCGATTTCAACACAAGAACGGCCTTAGAATTTTTCAACTCAATCGTGAATT
GGCTACCATTATTGACAATCTATATCAATGTATTTCACTTGCCTTAAGAGTGTTTGGGACGAATACATTACATATCGCCCTGCCTGTTCCTGTGGAAAGTGTTCATGTGA
TGGAGTTAAATCTATGGAGGAATTTGTTCAATTTGAATACCTCATGTGTTTCTTGA
Protein sequenceShow/hide protein sequence
MTTVPPVDYVLPPSNPIGSSSSELLQPVSDSSSQSPYYLYHSDKSNLVLVTELLTDDNYVFWSRSMLIALSIKNKLGFIDGTLPRPIDALLSAWTRNNHVVIAWIINSIS
ISASLIFSDSTRDIWFDFKERFQHKNGLRIFQLNRELATIIDNLYQCISLALRVFGTNTLHIALPVPVESVHVMELNLWRNLFNLNTSCVS