; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc03g04550 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc03g04550
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrotran_gag_3 domain-containing protein
Genome locationchr3:3402291..3402939
RNA-Seq ExpressionMoc03g04550
SyntenyMoc03g04550
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8645659.1 hypothetical protein Csa_020439 [Cucumis sativus]6.1e-3242.13Show/hide
Query:  SSKDLASPIFLVSNICNMVSIHLDSSNFVLCKFQLISILKDHKLFGFIDGCVPKPKPFSSARPTTYSSSVSASDPIYEDWIDK-----------------
        + KD  SPIFL+SNICN++S+ LDS+NFVL KFQL +ILK HKLFGF+DG  P P+      P+T S+    ++P+YEDWI K                 
Subjt:  SSKDLASPIFLVSNICNMVSIHLDSSNFVLCKFQLISILKDHKLFGFIDGCVPKPKPFSSARPTTYSSSVSASDPIYEDWIDK-----------------

Query:  -------------------YCKPKDQSIV-------CVQEMP------YVQQVKEIKDKLANISVLVEDEDLIIYILNGLSVEYNSFRTSMHTRSLSVSF
                           Y      ++V        + + P      Y++++KEIKDKLAN+S  + +EDL+IY LNGL  EYN+FRTSM TRS  V+F
Subjt:  -------------------YCKPKDQSIV-------CVQEMP------YVQQVKEIKDKLANISVLVEDEDLIIYILNGLSVEYNSFRTSMHTRSLSVSF

Query:  NELRVLLVSEEAVIDK
         EL VLL +EE+ + K
Subjt:  NELRVLLVSEEAVIDK

TYK17989.1 uncharacterized protein E5676_scaffold306G002980 [Cucumis melo var. makuwa]6.1e-3241.15Show/hide
Query:  SSKDLASPIFLVSNICNMVSIHLDSSNFVLCKFQLISILKDHKLFGFIDGCVPKPKPFSSARPTTYSSSVSA----------SDPIYEDWIDK-------
        +SK+L+SP+FL++NICN++SI LDS+N+ L KFQ   +LK HKL+GFID  +P P P + + PTT SSS +           S+P YEDW  K       
Subjt:  SSKDLASPIFLVSNICNMVSIHLDSSNFVLCKFQLISILKDHKLFGFIDGCVPKPKPFSSARPTTYSSSVSA----------SDPIYEDWIDK-------

Query:  --------------YCKPKDQ----------------------SIVCVQEMP------YVQQVKEIKDKLANISVLVEDEDLIIYILNGLSVEYNSFRTS
                       CK   Q                       +  + + P      Y++++KE+KDKLAN + +VEDEDL+IY LNGL  EYN+FRTS
Subjt:  --------------YCKPKDQ----------------------SIVCVQEMP------YVQQVKEIKDKLANISVLVEDEDLIIYILNGLSVEYNSFRTS

Query:  MHTRSLSVSFNELRVLLVSEEAVIDK
        M TRS  VSF+EL +LL SEE+ ++K
Subjt:  MHTRSLSVSFNELRVLLVSEEAVIDK

XP_011658579.1 uncharacterized protein LOC105436058 [Cucumis sativus]6.1e-3242.13Show/hide
Query:  SSKDLASPIFLVSNICNMVSIHLDSSNFVLCKFQLISILKDHKLFGFIDGCVPKPKPFSSARPTTYSSSVSASDPIYEDWIDK-----------------
        + KD  SPIFL+SNICN++S+ LDS+NFVL KFQL +ILK HKLFGF+DG  P P+      P+T S+    ++P+YEDWI K                 
Subjt:  SSKDLASPIFLVSNICNMVSIHLDSSNFVLCKFQLISILKDHKLFGFIDGCVPKPKPFSSARPTTYSSSVSASDPIYEDWIDK-----------------

Query:  -------------------YCKPKDQSIV-------CVQEMP------YVQQVKEIKDKLANISVLVEDEDLIIYILNGLSVEYNSFRTSMHTRSLSVSF
                           Y      ++V        + + P      Y++++KEIKDKLAN+S  + +EDL+IY LNGL  EYN+FRTSM TRS  V+F
Subjt:  -------------------YCKPKDQSIV-------CVQEMP------YVQQVKEIKDKLANISVLVEDEDLIIYILNGLSVEYNSFRTSMHTRSLSVSF

Query:  NELRVLLVSEEAVIDK
         EL VLL +EE+ + K
Subjt:  NELRVLLVSEEAVIDK

XP_022150845.1 uncharacterized protein LOC111018892 [Momordica charantia]4.7e-3243.69Show/hide
Query:  SSKDLASPIFLVSNICNMVSIHLDSSNFVLCKFQLISILKDHKLFGFIDGCVPKPKPF------SSARPTTYSSSVSASDPIYEDWIDK-----------
        + KDL SPIFL+SNICN+VSI LDS++F+L KFQL +ILK HKLFGFIDG V  P  F      + ++PTT ++S+   +P +EDWI K           
Subjt:  SSKDLASPIFLVSNICNMVSIHLDSSNFVLCKFQLISILKDHKLFGFIDGCVPKPKPF------SSARPTTYSSSVSASDPIYEDWIDK-----------

Query:  -----------------------------------YCKPKDQSIVCVQEM---PYVQQVKEIKDKLANISVLVEDEDLIIYILNGLSVEYNSFRTSMHTR
                                             K   QSIV   E     YV+++KEIKDK AN+S+ + DE L+IY LNGLS EYN+  TSM TR
Subjt:  -----------------------------------YCKPKDQSIVCVQEM---PYVQQVKEIKDKLANISVLVEDEDLIIYILNGLSVEYNSFRTSMHTR

Query:  SLSVSFNELRVLLVSEEAVIDK
        + SVSF EL V + SEE+ I+K
Subjt:  SLSVSFNELRVLLVSEEAVIDK

XP_022158689.1 uncharacterized protein LOC111025150 [Momordica charantia]1.1e-3647.2Show/hide
Query:  KDLASPIFLVSNICNMVSIHLDSSNFVLCKFQLISILKDHKLFGFIDGCVPKPKPFSSARPTTYSSSVSASDPIYEDWIDK-------------------
        KDL+SPIFL+SNICN+VS+ LDSSNFVL KFQL +ILK HKL+GFIDG  PKP  F  +     SS   A++P + +WI K                   
Subjt:  KDLASPIFLVSNICNMVSIHLDSSNFVLCKFQLISILKDHKLFGFIDGCVPKPKPFSSARPTTYSSSVSASDPIYEDWIDK-------------------

Query:  -----------------YCKPKDQSIV-------CVQEMP------YVQQVKEIKDKLANISVLVEDEDLIIYILNGLSVEYNSFRTSMHTRSLSVSFNE
                         Y      ++V        + + P      YVQ++KE+KDKLAN+ VLV++EDL+IY LN L  E+N FRTSM TRS SVSF E
Subjt:  -----------------YCKPKDQSIV-------CVQEMP------YVQQVKEIKDKLANISVLVEDEDLIIYILNGLSVEYNSFRTSMHTRSLSVSFNE

Query:  LRVLLVSEEAVIDK
        L VLLVSEEA IDK
Subjt:  LRVLLVSEEAVIDK

TrEMBL top hitse value%identityAlignment
A0A1S3BI58 uncharacterized protein LOC103490319 isoform X27.3e-3142.13Show/hide
Query:  SSKDLASPIFLVSNICNMVSIHLDSSNFVLCKFQLISILKDHKLFGFIDGCVPKPKPFSSARPTTYSSSVSASDPIYEDWIDK-----------------
        + KD  SPIFL+SNICN++S+ LDS+NFVL KFQL +ILK HKL+GFIDG  P P   +++  T  S+    S+P YEDWI K                 
Subjt:  SSKDLASPIFLVSNICNMVSIHLDSSNFVLCKFQLISILKDHKLFGFIDGCVPKPKPFSSARPTTYSSSVSASDPIYEDWIDK-----------------

Query:  -------------------YCKPKDQSIV-------CVQEMP------YVQQVKEIKDKLANISVLVEDEDLIIYILNGLSVEYNSFRTSMHTRSLSVSF
                           Y      ++V        + + P      Y++++KEIKDKLAN+S  + +EDL+IY LNGL  EYN+FRTSM TRS  V+F
Subjt:  -------------------YCKPKDQSIV-------CVQEMP------YVQQVKEIKDKLANISVLVEDEDLIIYILNGLSVEYNSFRTSMHTRSLSVSF

Query:  NELRVLLVSEEAVIDK
         EL VLL +EE+ + K
Subjt:  NELRVLLVSEEAVIDK

A0A5D3CLI6 T4.57.3e-3142.13Show/hide
Query:  SSKDLASPIFLVSNICNMVSIHLDSSNFVLCKFQLISILKDHKLFGFIDGCVPKPKPFSSARPTTYSSSVSASDPIYEDWIDK-----------------
        + KD  SPIFL+SNICN++S+ LDS+NFVL KFQL +ILK HKL+GFIDG  P P   +++  T  S+    S+P YEDWI K                 
Subjt:  SSKDLASPIFLVSNICNMVSIHLDSSNFVLCKFQLISILKDHKLFGFIDGCVPKPKPFSSARPTTYSSSVSASDPIYEDWIDK-----------------

Query:  -------------------YCKPKDQSIV-------CVQEMP------YVQQVKEIKDKLANISVLVEDEDLIIYILNGLSVEYNSFRTSMHTRSLSVSF
                           Y      ++V        + + P      Y++++KEIKDKLAN+S  + +EDL+IY LNGL  EYN+FRTSM TRS  V+F
Subjt:  -------------------YCKPKDQSIV-------CVQEMP------YVQQVKEIKDKLANISVLVEDEDLIIYILNGLSVEYNSFRTSMHTRSLSVSF

Query:  NELRVLLVSEEAVIDK
         EL VLL +EE+ + K
Subjt:  NELRVLLVSEEAVIDK

A0A5D3D3T6 Retrotran_gag_3 domain-containing protein3.0e-3241.15Show/hide
Query:  SSKDLASPIFLVSNICNMVSIHLDSSNFVLCKFQLISILKDHKLFGFIDGCVPKPKPFSSARPTTYSSSVSA----------SDPIYEDWIDK-------
        +SK+L+SP+FL++NICN++SI LDS+N+ L KFQ   +LK HKL+GFID  +P P P + + PTT SSS +           S+P YEDW  K       
Subjt:  SSKDLASPIFLVSNICNMVSIHLDSSNFVLCKFQLISILKDHKLFGFIDGCVPKPKPFSSARPTTYSSSVSA----------SDPIYEDWIDK-------

Query:  --------------YCKPKDQ----------------------SIVCVQEMP------YVQQVKEIKDKLANISVLVEDEDLIIYILNGLSVEYNSFRTS
                       CK   Q                       +  + + P      Y++++KE+KDKLAN + +VEDEDL+IY LNGL  EYN+FRTS
Subjt:  --------------YCKPKDQ----------------------SIVCVQEMP------YVQQVKEIKDKLANISVLVEDEDLIIYILNGLSVEYNSFRTS

Query:  MHTRSLSVSFNELRVLLVSEEAVIDK
        M TRS  VSF+EL +LL SEE+ ++K
Subjt:  MHTRSLSVSFNELRVLLVSEEAVIDK

A0A6J1D9L6 uncharacterized protein LOC1110188922.3e-3243.69Show/hide
Query:  SSKDLASPIFLVSNICNMVSIHLDSSNFVLCKFQLISILKDHKLFGFIDGCVPKPKPF------SSARPTTYSSSVSASDPIYEDWIDK-----------
        + KDL SPIFL+SNICN+VSI LDS++F+L KFQL +ILK HKLFGFIDG V  P  F      + ++PTT ++S+   +P +EDWI K           
Subjt:  SSKDLASPIFLVSNICNMVSIHLDSSNFVLCKFQLISILKDHKLFGFIDGCVPKPKPF------SSARPTTYSSSVSASDPIYEDWIDK-----------

Query:  -----------------------------------YCKPKDQSIVCVQEM---PYVQQVKEIKDKLANISVLVEDEDLIIYILNGLSVEYNSFRTSMHTR
                                             K   QSIV   E     YV+++KEIKDK AN+S+ + DE L+IY LNGLS EYN+  TSM TR
Subjt:  -----------------------------------YCKPKDQSIVCVQEM---PYVQQVKEIKDKLANISVLVEDEDLIIYILNGLSVEYNSFRTSMHTR

Query:  SLSVSFNELRVLLVSEEAVIDK
        + SVSF EL V + SEE+ I+K
Subjt:  SLSVSFNELRVLLVSEEAVIDK

A0A6J1E049 uncharacterized protein LOC1110251505.2e-3747.2Show/hide
Query:  KDLASPIFLVSNICNMVSIHLDSSNFVLCKFQLISILKDHKLFGFIDGCVPKPKPFSSARPTTYSSSVSASDPIYEDWIDK-------------------
        KDL+SPIFL+SNICN+VS+ LDSSNFVL KFQL +ILK HKL+GFIDG  PKP  F  +     SS   A++P + +WI K                   
Subjt:  KDLASPIFLVSNICNMVSIHLDSSNFVLCKFQLISILKDHKLFGFIDGCVPKPKPFSSARPTTYSSSVSASDPIYEDWIDK-------------------

Query:  -----------------YCKPKDQSIV-------CVQEMP------YVQQVKEIKDKLANISVLVEDEDLIIYILNGLSVEYNSFRTSMHTRSLSVSFNE
                         Y      ++V        + + P      YVQ++KE+KDKLAN+ VLV++EDL+IY LN L  E+N FRTSM TRS SVSF E
Subjt:  -----------------YCKPKDQSIV-------CVQEMP------YVQQVKEIKDKLANISVLVEDEDLIIYILNGLSVEYNSFRTSMHTRSLSVSFNE

Query:  LRVLLVSEEAVIDK
        L VLLVSEEA IDK
Subjt:  LRVLLVSEEAVIDK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).7.5e-0432.14Show/hide
Query:  SKDLASPIFLVSNI-----CNMVSIHLDSSNFVLCKFQLISILKDHKLFGFIDGCVPKPKPFSSARPTTYSSSVSASDPIYEDW
        + D  SP +L  +I      ++  +  D  N+V  K +  S L+  K FGFIDG +PKP PFS               P+Y+ W
Subjt:  SKDLASPIFLVSNI-----CNMVSIHLDSSNFVLCKFQLISILKDHKLFGFIDGCVPKPKPFSSARPTTYSSSVSASDPIYEDW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGGACTTTATCTCCTCGAAAGATCTTGCTTCACCTATCTTTCTCGTGTCAAATATATGTAATATGGTATCAATCCACCTTGATTCTTCAAATTTTGTTCTCTGTAA
GTTTCAACTCATATCCATCTTGAAAGATCACAAGTTATTTGGTTTCATTGATGGTTGTGTGCCTAAGCCGAAGCCTTTTTCGTCTGCCCGGCCTACAACTTATTCATCTT
CTGTATCCGCATCTGATCCGATCTATGAAGATTGGATTGACAAATATTGTAAACCTAAAGACCAATCTATAGTCTGTGTCCAAGAAATGCCGTATGTTCAGCAAGTTAAA
GAAATTAAAGATAAATTGGCGAATATCAGTGTTCTTGTTGAAGACGAGGATCTTATCATCTACATTTTAAACGGACTTTCTGTTGAATACAATAGCTTTCGAACATCAAT
GCACACTCGGTCCTTATCGGTTTCGTTTAATGAACTTCGTGTTCTTCTAGTTTCAGAGGAAGCTGTGATTGACAAATAG
mRNA sequenceShow/hide mRNA sequence
ATGGTGGACTTTATCTCCTCGAAAGATCTTGCTTCACCTATCTTTCTCGTGTCAAATATATGTAATATGGTATCAATCCACCTTGATTCTTCAAATTTTGTTCTCTGTAA
GTTTCAACTCATATCCATCTTGAAAGATCACAAGTTATTTGGTTTCATTGATGGTTGTGTGCCTAAGCCGAAGCCTTTTTCGTCTGCCCGGCCTACAACTTATTCATCTT
CTGTATCCGCATCTGATCCGATCTATGAAGATTGGATTGACAAATATTGTAAACCTAAAGACCAATCTATAGTCTGTGTCCAAGAAATGCCGTATGTTCAGCAAGTTAAA
GAAATTAAAGATAAATTGGCGAATATCAGTGTTCTTGTTGAAGACGAGGATCTTATCATCTACATTTTAAACGGACTTTCTGTTGAATACAATAGCTTTCGAACATCAAT
GCACACTCGGTCCTTATCGGTTTCGTTTAATGAACTTCGTGTTCTTCTAGTTTCAGAGGAAGCTGTGATTGACAAATAG
Protein sequenceShow/hide protein sequence
MVDFISSKDLASPIFLVSNICNMVSIHLDSSNFVLCKFQLISILKDHKLFGFIDGCVPKPKPFSSARPTTYSSSVSASDPIYEDWIDKYCKPKDQSIVCVQEMPYVQQVK
EIKDKLANISVLVEDEDLIIYILNGLSVEYNSFRTSMHTRSLSVSFNELRVLLVSEEAVIDK