; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc02g17660 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc02g17660
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr2:13221848..13224343
RNA-Seq ExpressionMoc02g17660
SyntenyMoc02g17660
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA8523936.1 hypothetical protein F0562_010359 [Nyssa sinensis]2.0e-1636.26Show/hide
Query:  MGDFLQIEYLMSFLRGLNDSFSRAHVQLLLIDPPLSVNRAFSLLLQEEQQRTINVLPSTES----------VALVVSTSPVSMSSSTNTRNSRGVQGQYC
        + D  Q+EY+MSFL GL+DSFS+   QLLL+DP   +NR FSL++QEEQQR  N  PS++S          V + V+ S  S   ++   NS   + Q  
Subjt:  MGDFLQIEYLMSFLRGLNDSFSRAHVQLLLIDPPLSVNRAFSLLLQEEQQRTINVLPSTES----------VALVVSTSPVSMSSSTNTRNSRGVQGQYC

Query:  ERSLCSHC-----VIHRCYKLHGYPPGYRHCNGHISSHQNSSTASSAANSLSLAKNDFGSSSHVAVTEKDGI--SDPFHGFV
        ++  C+HC      + RCYK+HGYPPGY+                        + N+F +++H   T  D +  S+ F GFV
Subjt:  ERSLCSHC-----VIHRCYKLHGYPPGYRHCNGHISSHQNSSTASSAANSLSLAKNDFGSSSHVAVTEKDGI--SDPFHGFV

KAA8543184.1 hypothetical protein F0562_021321 [Nyssa sinensis]7.0e-1739.34Show/hide
Query:  MGDFLQIEYLMSFLRGLNDSFSRAHVQLLLIDPPLSVNRAFSLLLQEEQQRTINVLP----STESVALVVST----SPVSMSSSTNTRNSRGVQGQYCER
        + D  Q+EY+MSFL GL+DSFS+   QLLL+DP   +NR FSL++QEEQQR  N       ST ++A  V T    S  S S ++   NS   + Q  +R
Subjt:  MGDFLQIEYLMSFLRGLNDSFSRAHVQLLLIDPPLSVNRAFSLLLQEEQQRTINVLP----STESVALVVST----SPVSMSSSTNTRNSRGVQGQYCER

Query:  SLCSHC-----VIHRCYKLHGYPPGYR---HCNGHISSHQNSSTASSAANSLSLAKNDFGSSSHVAVTEKDGISDPFHGFVSP
          C HC      + RCYK+HGYPPGY+   + N + ++HQ S++   +  S     N FG          D ++DPF   V P
Subjt:  SLCSHC-----VIHRCYKLHGYPPGYR---HCNGHISSHQNSSTASSAANSLSLAKNDFGSSSHVAVTEKDGISDPFHGFVSP

XP_022148562.1 uncharacterized protein LOC111017196 [Momordica charantia]2.7e-1639.05Show/hide
Query:  DFLQIEYLMSFLRGLNDSFSRAHVQLLLIDPPLSVNRAFSLLLQEEQQRTINVLPSTESVA-----LVVSTSPVSMS----SSTNTRNSRGVQGQYCERS
        ++ Q EY+M FL GLNDSFS+    LLL+ PP ++N AF L+ QE QQR I+ +    S A      V  T   + S    SST++  S   QG+  E+S
Subjt:  DFLQIEYLMSFLRGLNDSFSRAHVQLLLIDPPLSVNRAFSLLLQEEQQRTINVLPSTESVA-----LVVSTSPVSMS----SSTNTRNSRGVQGQYCERS

Query:  LCSHC-----VIHRCYKLHGYPPGYRHCNGHISSHQNSSTASSAANSLSLAKNDFGSSSHVAVTEKDGI
        +C+HC      + RCYKLHGYPPGYR+ N    +H + + A SA  +         S SH+   +  G+
Subjt:  LCSHC-----VIHRCYKLHGYPPGYRHCNGHISSHQNSSTASSAANSLSLAKNDFGSSSHVAVTEKDGI

XP_022150855.1 uncharacterized protein LOC111018899 [Momordica charantia]1.7e-11199.08Show/hide
Query:  VTEKDGISDPFHGFVSPKVLHCGSDLSHLPTNSSVVQSDGCLRTFYGHTASPLVCKDPATVPLPTLSPTADTAWPLAPDLISLAVDISDPLVVAAPTPLA
        VTEKDGISDPFHGFVSPKVLHCGSDLSHLPTNSSVVQSDGCLRTFYGHTASPLVCKDPATVPLPTLSPTADTAWPLAPDLISLAVDISDPLVVAAPTPLA
Subjt:  VTEKDGISDPFHGFVSPKVLHCGSDLSHLPTNSSVVQSDGCLRTFYGHTASPLVCKDPATVPLPTLSPTADTAWPLAPDLISLAVDISDPLVVAAPTPLA

Query:  NLFDSSPALAHPSSMVPVLPSDPSAAATPTPPVVTASDPPTITTVPDPLPVAILAPPTLLVLGSSTVPILRSPTIAAPDLPVTPVPEQSPVRLVVVQPPD
        NLFDSSPALAHPSSMVPVLPSDPSAAATPTPPVVTASDPPTITTVPDPLPVAILAPPTLLVLGSSTVPILRSPTIAAPDLPVTPVPEQSPVRLVVVQPPD
Subjt:  NLFDSSPALAHPSSMVPVLPSDPSAAATPTPPVVTASDPPTITTVPDPLPVAILAPPTLLVLGSSTVPILRSPTIAAPDLPVTPVPEQSPVRLVVVQPPD

Query:  MPILHRILRATTPHNEVW
        MPILHRILRATTPHNE +
Subjt:  MPILHRILRATTPHNEVW

XP_022158788.1 uncharacterized protein LOC111025254 [Momordica charantia]6.0e-1638.04Show/hide
Query:  QIEYLMSFLRGLNDSFSRAHVQLLLIDPPLSVNRAFSLLLQEEQQRTINVLPSTESVALVVSTSPVSMSSSTNTRNSRGVQGQYCERSLCSHCVI-----
        Q E +M FL GLNDSFS+   QLLL++P  S+NR  SL+ QE QQR I  L       L+V  + V  S S  +  S G Q    ++ +C+HC I     
Subjt:  QIEYLMSFLRGLNDSFSRAHVQLLLIDPPLSVNRAFSLLLQEEQQRTINVLPSTESVALVVSTSPVSMSSSTNTRNSRGVQGQYCERSLCSHCVI-----

Query:  HRCYKLHGYPPGYRHCNGHISSHQNSSTASSAANSL----SLAKNDFGSSSHVAVTEKDGISDPFHGFVSPKVLHCGSDLSHLP
         +CY+LHGYPPG+R   G  SS  NSS + S + S+    SL+ N   S +     +  G+       +S    H  +D SH P
Subjt:  HRCYKLHGYPPGYRHCNGHISSHQNSSTASSAANSL----SLAKNDFGSSSHVAVTEKDGISDPFHGFVSPKVLHCGSDLSHLP

TrEMBL top hitse value%identityAlignment
A0A2N9EZ36 Uncharacterized protein1.2e-1744.85Show/hide
Query:  DFLQIEYLMSFLRGLNDSFSRAHVQLLLIDPPLSVNRAFSLLLQEEQQRTINV---LPSTESVALVVSTSPVSMSSSTNTRNSRGVQGQYC--ERSLCSH
        D  Q EY+M FL GLNDSFS    Q+L+ DP  ++ +AF+L++QEE+QR IN+    P+ +SVAL         +     RN  G +GQ+   ER LCSH
Subjt:  DFLQIEYLMSFLRGLNDSFSRAHVQLLLIDPPLSVNRAFSLLLQEEQQRTINV---LPSTESVALVVSTSPVSMSSSTNTRNSRGVQGQYC--ERSLCSH

Query:  C-----VIHRCYKLHGYPPGYRHCNGHISSHQNSST
        C      + +CYKLHGYPPGY+  N   S++Q S+T
Subjt:  C-----VIHRCYKLHGYPPGYRHCNGHISSHQNSST

A0A2N9HKX8 Integrase catalytic domain-containing protein2.6e-1744.12Show/hide
Query:  DFLQIEYLMSFLRGLNDSFSRAHVQLLLIDPPLSVNRAFSLLLQEEQQRTINV---LPSTESVALVVSTSPVSMSSSTNTRNSRGVQGQYC--ERSLCSH
        D  Q EY+M FL GLNDSF+    Q+L+ DP  ++ +AF+L++QEE+QR IN+    P+ +SVAL         +     RN  G +GQ+   ER LCSH
Subjt:  DFLQIEYLMSFLRGLNDSFSRAHVQLLLIDPPLSVNRAFSLLLQEEQQRTINV---LPSTESVALVVSTSPVSMSSSTNTRNSRGVQGQYC--ERSLCSH

Query:  C-----VIHRCYKLHGYPPGYRHCNGHISSHQNSST
        C      + +CYKLHGYPPGY+  N   S++Q S+T
Subjt:  C-----VIHRCYKLHGYPPGYRHCNGHISSHQNSST

A0A2N9I4I6 Integrase catalytic domain-containing protein9.0e-1843.06Show/hide
Query:  DFLQIEYLMSFLRGLNDSFSRAHVQLLLIDPPLSVNRAFSLLLQEEQQRTINV---LPSTESVALVVSTSPVSMSSSTNTRNSRGVQGQYC--ERSLCSH
        D  Q EY+M FL GLNDSFS    Q+L+ DP  ++ +AF+L++QEE+QR IN+    P+ +SVAL         +     RN  G +GQ+   ER LCSH
Subjt:  DFLQIEYLMSFLRGLNDSFSRAHVQLLLIDPPLSVNRAFSLLLQEEQQRTINV---LPSTESVALVVSTSPVSMSSSTNTRNSRGVQGQYC--ERSLCSH

Query:  C-----VIHRCYKLHGYPPGYRHCNGHISSHQNSSTASSAANSL
        C      + +CYKLHGYPPGY+  N   S++Q S+T     ++L
Subjt:  C-----VIHRCYKLHGYPPGYRHCNGHISSHQNSSTASSAANSL

A0A2N9IDW9 Uncharacterized protein2.0e-1744.12Show/hide
Query:  DFLQIEYLMSFLRGLNDSFSRAHVQLLLIDPPLSVNRAFSLLLQEEQQRTINV---LPSTESVALVVSTSPVSMSSSTNTRNSRGVQGQYC--ERSLCSH
        D  Q EY+M FL GLNDSF+    Q+L+ DP  ++ +AF+L++QEE+QR+IN+    P+ +SVAL         +     RN  G +GQ+   ER LCSH
Subjt:  DFLQIEYLMSFLRGLNDSFSRAHVQLLLIDPPLSVNRAFSLLLQEEQQRTINV---LPSTESVALVVSTSPVSMSSSTNTRNSRGVQGQYC--ERSLCSH

Query:  C-----VIHRCYKLHGYPPGYRHCNGHISSHQNSST
        C      + +CYKLHGYPPGY+  N   S++Q S+T
Subjt:  C-----VIHRCYKLHGYPPGYRHCNGHISSHQNSST

A0A6J1D9M2 uncharacterized protein LOC1110188998.2e-11299.08Show/hide
Query:  VTEKDGISDPFHGFVSPKVLHCGSDLSHLPTNSSVVQSDGCLRTFYGHTASPLVCKDPATVPLPTLSPTADTAWPLAPDLISLAVDISDPLVVAAPTPLA
        VTEKDGISDPFHGFVSPKVLHCGSDLSHLPTNSSVVQSDGCLRTFYGHTASPLVCKDPATVPLPTLSPTADTAWPLAPDLISLAVDISDPLVVAAPTPLA
Subjt:  VTEKDGISDPFHGFVSPKVLHCGSDLSHLPTNSSVVQSDGCLRTFYGHTASPLVCKDPATVPLPTLSPTADTAWPLAPDLISLAVDISDPLVVAAPTPLA

Query:  NLFDSSPALAHPSSMVPVLPSDPSAAATPTPPVVTASDPPTITTVPDPLPVAILAPPTLLVLGSSTVPILRSPTIAAPDLPVTPVPEQSPVRLVVVQPPD
        NLFDSSPALAHPSSMVPVLPSDPSAAATPTPPVVTASDPPTITTVPDPLPVAILAPPTLLVLGSSTVPILRSPTIAAPDLPVTPVPEQSPVRLVVVQPPD
Subjt:  NLFDSSPALAHPSSMVPVLPSDPSAAATPTPPVVTASDPPTITTVPDPLPVAILAPPTLLVLGSSTVPILRSPTIAAPDLPVTPVPEQSPVRLVVVQPPD

Query:  MPILHRILRATTPHNEVW
        MPILHRILRATTPHNE +
Subjt:  MPILHRILRATTPHNEVW

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTGATTTTCTCCAAATTGAATATTTGATGAGTTTCCTCAGGGGACTGAACGACTCTTTTTCCCGTGCTCATGTCCAGCTACTGCTCATAGATCCACCTCTTTCTGT
CAATCGTGCATTTTCTCTTCTTCTTCAAGAAGAACAACAACGCACTATAAATGTGCTTCCCTCAACTGAAAGTGTTGCACTTGTTGTATCCACTTCTCCTGTATCTATGA
GTTCATCCACTAATACTCGGAATTCTCGTGGAGTTCAAGGTCAGTATTGTGAGCGTTCTCTATGCTCTCATTGTGTCATCCATCGTTGTTATAAGCTCCATGGTTACCCA
CCGGGTTATCGTCATTGCAATGGTCATATTTCATCTCACCAGAATTCTTCCACGGCCTCCTCTGCTGCCAATTCTCTCTCTCTTGCTAAGAATGATTTTGGATCCTCTTC
TCATGTTGCAGTTACTGAGAAAGATGGCATTTCTGACCCATTTCATGGTTTTGTTTCACCCAAGGTTCTTCATTGTGGCAGTGATTTGTCCCATCTTCCCACTAACTCTA
GTGTTGTGCAATCTGATGGATGTTTGAGGACATTTTATGGACATACGGCTTCACCTCTTGTGTGTAAAGACCCTGCCACTGTTCCTCTTCCGACGCTGTCGCCGACCGCT
GACACGGCCTGGCCGCTTGCTCCCGATTTGATCTCGCTGGCCGTTGACATATCCGATCCATTGGTAGTCGCAGCTCCAACTCCGCTGGCCAACTTGTTTGATTCTTCGCC
GGCTCTTGCACATCCTTCCTCGATGGTGCCTGTACTTCCGTCGGACCCATCGGCCGCTGCCACCCCAACTCCGCCAGTCGTAACAGCATCGGACCCACCAACAATCACAA
CGGTACCTGACCCATTGCCTGTCGCCATTCTAGCTCCACCAACTCTTCTTGTTCTAGGTTCGTCGACCGTTCCCATTCTCAGGTCACCGACCATTGCCGCTCCCGATTTG
CCGGTCACACCCGTTCCAGAACAGTCGCCTGTTCGTTTAGTTGTTGTCCAACCGCCTGACATGCCTATTTTACATCGTATTCTAAGAGCTACAACACCACATAATGAGGT
TTGGACCCTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGTGATTTTCTCCAAATTGAATATTTGATGAGTTTCCTCAGGGGACTGAACGACTCTTTTTCCCGTGCTCATGTCCAGCTACTGCTCATAGATCCACCTCTTTCTGT
CAATCGTGCATTTTCTCTTCTTCTTCAAGAAGAACAACAACGCACTATAAATGTGCTTCCCTCAACTGAAAGTGTTGCACTTGTTGTATCCACTTCTCCTGTATCTATGA
GTTCATCCACTAATACTCGGAATTCTCGTGGAGTTCAAGGTCAGTATTGTGAGCGTTCTCTATGCTCTCATTGTGTCATCCATCGTTGTTATAAGCTCCATGGTTACCCA
CCGGGTTATCGTCATTGCAATGGTCATATTTCATCTCACCAGAATTCTTCCACGGCCTCCTCTGCTGCCAATTCTCTCTCTCTTGCTAAGAATGATTTTGGATCCTCTTC
TCATGTTGCAGTTACTGAGAAAGATGGCATTTCTGACCCATTTCATGGTTTTGTTTCACCCAAGGTTCTTCATTGTGGCAGTGATTTGTCCCATCTTCCCACTAACTCTA
GTGTTGTGCAATCTGATGGATGTTTGAGGACATTTTATGGACATACGGCTTCACCTCTTGTGTGTAAAGACCCTGCCACTGTTCCTCTTCCGACGCTGTCGCCGACCGCT
GACACGGCCTGGCCGCTTGCTCCCGATTTGATCTCGCTGGCCGTTGACATATCCGATCCATTGGTAGTCGCAGCTCCAACTCCGCTGGCCAACTTGTTTGATTCTTCGCC
GGCTCTTGCACATCCTTCCTCGATGGTGCCTGTACTTCCGTCGGACCCATCGGCCGCTGCCACCCCAACTCCGCCAGTCGTAACAGCATCGGACCCACCAACAATCACAA
CGGTACCTGACCCATTGCCTGTCGCCATTCTAGCTCCACCAACTCTTCTTGTTCTAGGTTCGTCGACCGTTCCCATTCTCAGGTCACCGACCATTGCCGCTCCCGATTTG
CCGGTCACACCCGTTCCAGAACAGTCGCCTGTTCGTTTAGTTGTTGTCCAACCGCCTGACATGCCTATTTTACATCGTATTCTAAGAGCTACAACACCACATAATGAGGT
TTGGACCCTTTGA
Protein sequenceShow/hide protein sequence
MGDFLQIEYLMSFLRGLNDSFSRAHVQLLLIDPPLSVNRAFSLLLQEEQQRTINVLPSTESVALVVSTSPVSMSSSTNTRNSRGVQGQYCERSLCSHCVIHRCYKLHGYP
PGYRHCNGHISSHQNSSTASSAANSLSLAKNDFGSSSHVAVTEKDGISDPFHGFVSPKVLHCGSDLSHLPTNSSVVQSDGCLRTFYGHTASPLVCKDPATVPLPTLSPTA
DTAWPLAPDLISLAVDISDPLVVAAPTPLANLFDSSPALAHPSSMVPVLPSDPSAAATPTPPVVTASDPPTITTVPDPLPVAILAPPTLLVLGSSTVPILRSPTIAAPDL
PVTPVPEQSPVRLVVVQPPDMPILHRILRATTPHNEVWTL