CuGenDBv2

Moc01g00890 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc01g00890
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Retrotran_gag_3 domain-containing protein
Genome location: chr1:549696..552691
RNA-Seq Expression: Moc01g00890
Synteny: Moc01g00890
Gene Ontology terms: NA
InterPro domains: NA
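The genome location listed above is written as `chromosome:start..end`. As a minimal sketch of how such a string could be split into its parts, the following assumes 1-based, inclusive coordinates (an assumption; the page does not state its coordinate convention). The function names `parse_location` and `span_length` are hypothetical helpers, not part of any CuGenDB tooling.

```python
import re


def parse_location(location):
    """Split a string like 'chr1:549696..552691' into (chromosome, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    """Length of the genomic span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr1:549696..552691")
print(chrom, start, end, span_length(start, end))
```

Under that inclusive-coordinate assumption, the gene spans 2996 bases on chr1.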


Homology
GenBank top hits (e value, % identity, alignment)
KAA8515344.1 hypothetical protein F0562_018426 [Nyssa sinensis] | e value: 3.0e-15 | identity: 31.5%
Query:  EALITLINATLSRSAIGHVVG-----------------TTSSNIV---------------------------DKLTAVSVQIDDEKILIH-SHGLTSDCN
        +AL+TL+NATLS++ + HV+G                 +T SNI+                           D L +VSV I+DE ILI+  +GL  + N
Subjt:  EALITLINATLSRSAIGHVVG-----------------TTSSNIV---------------------------DKLTAVSVQIDDEKILIH-SHGLTSDCN

Query:  AFCTSIRTRKDYLTLEELHVMLQFEEKTLAQQAK------------------------GEVILNP--LSMGEMVVDSNGGR-------------------
        AF TSIRTR + +TLEE++VML+ EE+T+    K                        G  + N      G   +++ GGR                   
Subjt:  AFCTSIRTRKDYLTLEELHVMLQFEEKTLAQQAK------------------------GEVILNP--LSMGEMVVDSNGGR-------------------

Query:  -----------------VFCQICNKPDHTALDYYNRMNFSYQGCRPSAQLVAMA
                         V CQICNK DH+ALD Y+R++FSYQG  PS QL AM+
Subjt:  -----------------VFCQICNKPDHTALDYYNRMNFSYQGCRPSAQLVAMA

KAA8518236.1 hypothetical protein F0562_015710 [Nyssa sinensis] | e value: 1.2e-14 | identity: 31.1%
Query:  EALITLINATLSRSAIGHVVG-----------------TTSSNIV---------------------------DKLTAVSVQIDDEKILIH-SHGLTSDCN
        +AL+TL+NATLS++A+ H++G                 +T SNI+                           D L +VSV I+DE ILI+  +GL  + N
Subjt:  EALITLINATLSRSAIGHVVG-----------------TTSSNIV---------------------------DKLTAVSVQIDDEKILIH-SHGLTSDCN

Query:  AFCTSIRTRKDYLTLEELHVMLQFEEKTL------------------------------------AQQAKGEVIL--------------------NPLSM
        AF TSIRTR +++TLEE++ ML+ EE+T+                                    + + +G   L                    NPL  
Subjt:  AFCTSIRTRKDYLTLEELHVMLQFEEKTL------------------------------------AQQAKGEVIL--------------------NPLSM

Query:  GEMVVDS-----NGGRVFCQICNKPDHTALDYYNRMNFSYQGCRPSAQLVAMAV
         +    S     N   V CQICNK  H+ALD Y+R++FSYQG  PS QL AM V
Subjt:  GEMVVDS-----NGGRVFCQICNKPDHTALDYYNRMNFSYQGCRPSAQLVAMAV

XP_008448007.1 PREDICTED: uncharacterized protein LOC103490319 isoform X2 [Cucumis melo] | e value: 2.0e-14 | identity: 31.65%
Query:  EALITLINATLSRSAIGHVVGTTSS---------------------------------------------NIVDKLTAVSVQIDDEKILIHS-HGLTSDC
        +AL+T+INATLS  A+ +VVG+TSS                                              I DKL  VS  I++E +LI++ +GL ++ 
Subjt:  EALITLINATLSRSAIGHVVGTTSS---------------------------------------------NIVDKLTAVSVQIDDEKILIHS-HGLTSDC

Query:  NAFCTSIRTRKDYLTLEELHVMLQFEEKTLAQQAKGEVILNP----LSMGEMV-----------VDSNG------------------------------G
        N F TS+RTR   +T EELHV+L+ EE  LA+Q+KG+   N     LS  + +           V  NG                               
Subjt:  NAFCTSIRTRKDYLTLEELHVMLQFEEKTLAQQAKGEVILNP----LSMGEMV-----------VDSNG------------------------------G

Query:  RVFCQICNKPDHTALDYYNRMNFSYQGCRPSAQLVAM
           CQIC++  HTALD +NRMN+++QG  P  QL AM
Subjt:  RVFCQICNKPDHTALDYYNRMNFSYQGCRPSAQLVAM

XP_008448008.1 PREDICTED: uncharacterized protein LOC103490319 isoform X3 [Cucumis melo] | e value: 2.0e-14 | identity: 31.65%
Query:  EALITLINATLSRSAIGHVVGTTSS---------------------------------------------NIVDKLTAVSVQIDDEKILIHS-HGLTSDC
        +AL+T+INATLS  A+ +VVG+TSS                                              I DKL  VS  I++E +LI++ +GL ++ 
Subjt:  EALITLINATLSRSAIGHVVGTTSS---------------------------------------------NIVDKLTAVSVQIDDEKILIHS-HGLTSDC

Query:  NAFCTSIRTRKDYLTLEELHVMLQFEEKTLAQQAKGEVILNP----LSMGEMV-----------VDSNG------------------------------G
        N F TS+RTR   +T EELHV+L+ EE  LA+Q+KG+   N     LS  + +           V  NG                               
Subjt:  NAFCTSIRTRKDYLTLEELHVMLQFEEKTLAQQAKGEVILNP----LSMGEMV-----------VDSNG------------------------------G

Query:  RVFCQICNKPDHTALDYYNRMNFSYQGCRPSAQLVAM
           CQIC++  HTALD +NRMN+++QG  P  QL AM
Subjt:  RVFCQICNKPDHTALDYYNRMNFSYQGCRPSAQLVAM

XP_022158378.1 uncharacterized protein LOC111024876 [Momordica charantia] | e value: 1.4e-15 | identity: 36.56%
Query:  ALITLINATLSRSAIGHVVGTTSSNIV-------------DKLTAVSVQIDDEKILIHS-HGLTSDCNAFCTSIRTRKDYLTLEELHVMLQFEEKTLAQQ
        +L+TLINATLS +A+ +VVG  SS  V             DKL  VSV +DDE ++I++ +GL S+ N F TS+RTR   ++  ELHV+L  E   + +Q
Subjt:  ALITLINATLSRSAIGHVVGTTSSNIV-------------DKLTAVSVQIDDEKILIHS-HGLTSDCNAFCTSIRTRKDYLTLEELHVMLQFEEKTLAQQ

Query:  AKGE--------VILNPLSMGEMVVD-----------SNGG-------------RVFCQICNKPDHTALDYYNRMNFSYQGCRPSA
        +K +        +++N  S  ++              SNGG             R+ CQIC K   TA+D YNRMN+++QG  P A
Subjt:  AKGE--------VILNPLSMGEMVVD-----------SNGG-------------RVFCQICNKPDHTALDYYNRMNFSYQGCRPSA

TrEMBL top hits (e value, % identity, alignment)
A0A2N9EYE4 Uncharacterized protein | e value: 1.1e-15 | identity: 38.62%
Query:  DKLTAVSVQIDDEKIL-IHSHGLTSDCNAFCTSIRTRKDYLTLEELHVMLQFEEKT---------------LAQQAKGEVILNPLSM----------GEM
        DKL+AV V++DDE++L +   GL S+ +AFC+++RTR   ++ EELHV+L  EE++               +A  A       PL +          G+ 
Subjt:  DKLTAVSVQIDDEKIL-IHSHGLTSDCNAFCTSIRTRKDYLTLEELHVMLQFEEKT---------------LAQQAKGEVILNPLSM----------GEM

Query:  VVDSNGGRVFCQICNKPDHTALDYYNRMNFSYQGCRPSAQLVAMA
           S   R  CQIC KP H ALD ++RMNF+YQG  P A+L A+A
Subjt:  VVDSNGGRVFCQICNKPDHTALDYYNRMNFSYQGCRPSAQLVAMA

A0A5B7C9B1 Retrotran_gag_3 domain-containing protein | e value: 4.3e-15 | identity: 31.42%
Query:  EALITLINATLSRSAIGHVVGTTSSNIV--------------------------------------------DKLTAVSVQIDDEKILIHS-HGLTSDCN
        +AL+TLINATLS SA+ +V+G ++S  V                                            D L AVSV I+DE ILIH+ +GL  D N
Subjt:  EALITLINATLSRSAIGHVVGTTSSNIV--------------------------------------------DKLTAVSVQIDDEKILIHS-HGLTSDCN

Query:  AFCTSIRTRKDYLTLEELHVMLQFEEKTL-------------------AQQAKGEVILNPLSMGE-----------------------------------
        AF TSI TR   +TLEELH +L+ EE+TL                   A Q +     N    G                                    
Subjt:  AFCTSIRTRKDYLTLEELHVMLQFEEKTL-------------------AQQAKGEVILNPLSMGE-----------------------------------

Query:  ---------------MVVDSNGGRVFCQICNKPDHTALDYYNRMNFSYQGCRPSAQLVAMA
                       M   SN  ++ CQICNKP H ALD Y+RM++SYQG  P  QL AMA
Subjt:  ---------------MVVDSNGGRVFCQICNKPDHTALDYYNRMNFSYQGCRPSAQLVAMA

A0A5J4ZC67 Retrotran_gag_3 domain-containing protein | e value: 1.5e-15 | identity: 31.5%
Query:  EALITLINATLSRSAIGHVVG-----------------TTSSNIV---------------------------DKLTAVSVQIDDEKILIH-SHGLTSDCN
        +AL+TL+NATLS++ + HV+G                 +T SNI+                           D L +VSV I+DE ILI+  +GL  + N
Subjt:  EALITLINATLSRSAIGHVVG-----------------TTSSNIV---------------------------DKLTAVSVQIDDEKILIH-SHGLTSDCN

Query:  AFCTSIRTRKDYLTLEELHVMLQFEEKTLAQQAK------------------------GEVILNP--LSMGEMVVDSNGGR-------------------
        AF TSIRTR + +TLEE++VML+ EE+T+    K                        G  + N      G   +++ GGR                   
Subjt:  AFCTSIRTRKDYLTLEELHVMLQFEEKTLAQQAK------------------------GEVILNP--LSMGEMVVDSNGGR-------------------

Query:  -----------------VFCQICNKPDHTALDYYNRMNFSYQGCRPSAQLVAMA
                         V CQICNK DH+ALD Y+R++FSYQG  PS QL AM+
Subjt:  -----------------VFCQICNKPDHTALDYYNRMNFSYQGCRPSAQLVAMA

A0A5J4ZKU4 Retrotran_gag_3 domain-containing protein | e value: 5.6e-15 | identity: 31.1%
Query:  EALITLINATLSRSAIGHVVG-----------------TTSSNIV---------------------------DKLTAVSVQIDDEKILIH-SHGLTSDCN
        +AL+TL+NATLS++A+ H++G                 +T SNI+                           D L +VSV I+DE ILI+  +GL  + N
Subjt:  EALITLINATLSRSAIGHVVG-----------------TTSSNIV---------------------------DKLTAVSVQIDDEKILIH-SHGLTSDCN

Query:  AFCTSIRTRKDYLTLEELHVMLQFEEKTL------------------------------------AQQAKGEVIL--------------------NPLSM
        AF TSIRTR +++TLEE++ ML+ EE+T+                                    + + +G   L                    NPL  
Subjt:  AFCTSIRTRKDYLTLEELHVMLQFEEKTL------------------------------------AQQAKGEVIL--------------------NPLSM

Query:  GEMVVDS-----NGGRVFCQICNKPDHTALDYYNRMNFSYQGCRPSAQLVAMAV
         +    S     N   V CQICNK  H+ALD Y+R++FSYQG  PS QL AM V
Subjt:  GEMVVDS-----NGGRVFCQICNKPDHTALDYYNRMNFSYQGCRPSAQLVAMAV

A0A6J1DVX4 uncharacterized protein LOC111024876 | e value: 6.6e-16 | identity: 36.56%
Query:  ALITLINATLSRSAIGHVVGTTSSNIV-------------DKLTAVSVQIDDEKILIHS-HGLTSDCNAFCTSIRTRKDYLTLEELHVMLQFEEKTLAQQ
        +L+TLINATLS +A+ +VVG  SS  V             DKL  VSV +DDE ++I++ +GL S+ N F TS+RTR   ++  ELHV+L  E   + +Q
Subjt:  ALITLINATLSRSAIGHVVGTTSSNIV-------------DKLTAVSVQIDDEKILIHS-HGLTSDCNAFCTSIRTRKDYLTLEELHVMLQFEEKTLAQQ

Query:  AKGE--------VILNPLSMGEMVVD-----------SNGG-------------RVFCQICNKPDHTALDYYNRMNFSYQGCRPSA
        +K +        +++N  S  ++              SNGG             R+ CQIC K   TA+D YNRMN+++QG  P A
Subjt:  AKGE--------VILNPLSMGEMVVD-----------SNGG-------------RVFCQICNKPDHTALDYYNRMNFSYQGCRPSA

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGGAGGCTCTTATCACTTTGATAAATGCTACGCTTTCTCGTTCAGCCATTGGGCATGTCGTTGGAACAACTTCATCTAATATTGTTGATAAACTTACTGCCGTCTCAGT
TCAAATTGATGATGAGAAAATCTTGATCCACTCTCATGGGCTTACATCTGATTGTAATGCTTTTTGCACATCAATTCGCACTCGGAAGGATTATCTCACTCTTGAAGAAC
TTCATGTGATGCTACAGTTTGAAGAAAAAACTCTTGCTCAACAGGCTAAGGGGGAGGTAATTCTCAACCCTTTATCTATGGGGGAAATGGTCGTGGATTCTAATGGGGGA
CGTGTTTTCTGTCAAATTTGTAACAAGCCCGATCACACCGCTCTTGATTATTATAACAGGATGAACTTCTCCTATCAAGGCTGTCGTCCTTCGGCTCAATTGGTTGCAAT
GGCAGTACATGCTATGCCTATTGCTCGAACTCGGTTTCCGGACCAATCTGAACACTTGTGCGGACCTGCACAAAAAGGTGAGCACTCCGACGATCAAGTCAGAATAGTTG
CTCACATCGGACCCGCCGAGTTTCCCGGTAGATCGGACCTTGACCAGGTCGCACCTCGACCCTCATACTTAGCACCTGTCAACGCTAGTGGTGGTAATCTCGGCGGTTCG
AGCTGGGCATGA
mRNA sequence
ATGGAGGCTCTTATCACTTTGATAAATGCTACGCTTTCTCGTTCAGCCATTGGGCATGTCGTTGGAACAACTTCATCTAATATTGTTGATAAACTTACTGCCGTCTCAGT
TCAAATTGATGATGAGAAAATCTTGATCCACTCTCATGGGCTTACATCTGATTGTAATGCTTTTTGCACATCAATTCGCACTCGGAAGGATTATCTCACTCTTGAAGAAC
TTCATGTGATGCTACAGTTTGAAGAAAAAACTCTTGCTCAACAGGCTAAGGGGGAGGTAATTCTCAACCCTTTATCTATGGGGGAAATGGTCGTGGATTCTAATGGGGGA
CGTGTTTTCTGTCAAATTTGTAACAAGCCCGATCACACCGCTCTTGATTATTATAACAGGATGAACTTCTCCTATCAAGGCTGTCGTCCTTCGGCTCAATTGGTTGCAAT
GGCAGTACATGCTATGCCTATTGCTCGAACTCGGTTTCCGGACCAATCTGAACACTTGTGCGGACCTGCACAAAAAGGTGAGCACTCCGACGATCAAGTCAGAATAGTTG
CTCACATCGGACCCGCCGAGTTTCCCGGTAGATCGGACCTTGACCAGGTCGCACCTCGACCCTCATACTTAGCACCTGTCAACGCTAGTGGTGGTAATCTCGGCGGTTCG
AGCTGGGCATGA
Protein sequence
MEALITLINATLSRSAIGHVVGTTSSNIVDKLTAVSVQIDDEKILIHSHGLTSDCNAFCTSIRTRKDYLTLEELHVMLQFEEKTLAQQAKGEVILNPLSMGEMVVDSNGG
RVFCQICNKPDHTALDYYNRMNFSYQGCRPSAQLVAMAVHAMPIARTRFPDQSEHLCGPAQKGEHSDDQVRIVAHIGPAEFPGRSDLDQVAPRPSYLAPVNASGGNLGGS
SWA
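
The CDS and protein sequences above can be cross-checked by translating the CDS codon by codon. The sketch below assumes the standard genetic code (NCBI translation table 1); the page does not say which table or tool produced its protein sequence, and `translate` is a hypothetical helper. Only the first and last codons of the CDS are used here to keep the example short.

```python
# Build the standard genetic code (NCBI translation table 1, assumed here)
# from the conventional TCAG ordering of codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "").replace(" ", "")
    protein = []
    # Step through complete codons only; a trailing partial codon is ignored.
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 21 and last 12 nucleotides of the CDS shown on this page.
print(translate("ATGGAGGCTCTTATCACTTTG"))
print(translate("AGCTGGGCATGA"))
```

The two fragments translate to `MEALITL` and `SWA`, matching the start and end of the protein sequence listed above. Passing the full CDS, with its line breaks, to `translate` would allow the same comparison over the whole sequence.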