CuGenDBv2

Lag0026824 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0026824
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrotran_gag_3 domain-containing protein
Genome location: chr10:42326544..42329114
RNA-Seq Expression: Lag0026824
Synteny: Lag0026824
Gene Ontology terms: GO:0005488 - binding (molecular function)
InterPro domains: NA


Homology
GenBank top hits (e-value, % identity, alignment)
KAA0049700.1 T4.5 [Cucumis melo var. makuwa]  e-value: 6.2e-58  % identity: 67.72
Query:  QAHKLFGFIDGSTNCPPKTVPSSTTLSSTSTETATEAPPAPVSSQINPLYEEWVAKDQALMTLINATLSPAALVYVVGCTSSKQAWEVLEKHYSSSSRTN
        +AHKL+GFIDG+  CPP+T  S    SSTST      PP     Q NP YE+W+AKDQALMT+INATLSP AL YVVG TSSKQ W+VL K YSS SR+N
Subjt:  QAHKLFGFIDGSTNCPPKTVPSSTTLSSTSTETATEAPPAPVSSQINPLYEEWVAKDQALMTLINATLSPAALVYVVGCTSSKQAWEVLEKHYSSSSRTN

Query:  IVNLKSDLQSITKKSGESIDDYVKRIKELKDKLANVSVIVDEEDILIYALNGLPSIFNTFRTSMRTRSQSVTLDELHVLLKVEEAAIEK
        +VNLKSDLQ+I KK  ESID Y+KRIKE+KDKLANVS  ++EED+LIYALNGLP+ +NTFRTSMRTRSQ VT +ELHVLL+ EE+A+ K
Subjt:  IVNLKSDLQSITKKSGESIDDYVKRIKELKDKLANVSVIVDEEDILIYALNGLPSIFNTFRTSMRTRSQSVTLDELHVLLKVEEAAIEK

KAE8645659.1 hypothetical protein Csa_020439 [Cucumis sativus]  e-value: 1.8e-57  % identity: 65.61
Query:  QAHKLFGFIDGSTNCPPKTVPSSTTLSSTSTETATEAPPAPVSSQINPLYEEWVAKDQALMTLINATLSPAALVYVVGCTSSKQAWEVLEKHYSSSSRTN
        +AHKLFGF+DG+  CP            TS  T +  PP     Q NPLYE+W+AKDQALMT+INATLSP AL YVVG TSSKQ W+VL K YSS SR+N
Subjt:  QAHKLFGFIDGSTNCPPKTVPSSTTLSSTSTETATEAPPAPVSSQINPLYEEWVAKDQALMTLINATLSPAALVYVVGCTSSKQAWEVLEKHYSSSSRTN

Query:  IVNLKSDLQSITKKSGESIDDYVKRIKELKDKLANVSVIVDEEDILIYALNGLPSIFNTFRTSMRTRSQSVTLDELHVLLKVEEAAIEK
        +VNLKSDLQ+I KK  ESID Y+KRIKE+KDKLANVS  ++EED+LIYALNGLP+ +NTFRTSMRTRSQ VT +ELHVLL+ EE+A+ K
Subjt:  IVNLKSDLQSITKKSGESIDDYVKRIKELKDKLANVSVIVDEEDILIYALNGLPSIFNTFRTSMRTRSQSVTLDELHVLLKVEEAAIEK

XP_008448007.1 PREDICTED: uncharacterized protein LOC103490319 isoform X2 [Cucumis melo]  e-value: 6.2e-58  % identity: 67.72
Query:  QAHKLFGFIDGSTNCPPKTVPSSTTLSSTSTETATEAPPAPVSSQINPLYEEWVAKDQALMTLINATLSPAALVYVVGCTSSKQAWEVLEKHYSSSSRTN
        +AHKL+GFIDG+  CPP+T  S    SSTST      PP     Q NP YE+W+AKDQALMT+INATLSP AL YVVG TSSKQ W+VL K YSS SR+N
Subjt:  QAHKLFGFIDGSTNCPPKTVPSSTTLSSTSTETATEAPPAPVSSQINPLYEEWVAKDQALMTLINATLSPAALVYVVGCTSSKQAWEVLEKHYSSSSRTN

Query:  IVNLKSDLQSITKKSGESIDDYVKRIKELKDKLANVSVIVDEEDILIYALNGLPSIFNTFRTSMRTRSQSVTLDELHVLLKVEEAAIEK
        +VNLKSDLQ+I KK  ESID Y+KRIKE+KDKLANVS  ++EED+LIYALNGLP+ +NTFRTSMRTRSQ VT +ELHVLL+ EE+A+ K
Subjt:  IVNLKSDLQSITKKSGESIDDYVKRIKELKDKLANVSVIVDEEDILIYALNGLPSIFNTFRTSMRTRSQSVTLDELHVLLKVEEAAIEK

XP_008448008.1 PREDICTED: uncharacterized protein LOC103490319 isoform X3 [Cucumis melo]  e-value: 6.2e-58  % identity: 67.72
Query:  QAHKLFGFIDGSTNCPPKTVPSSTTLSSTSTETATEAPPAPVSSQINPLYEEWVAKDQALMTLINATLSPAALVYVVGCTSSKQAWEVLEKHYSSSSRTN
        +AHKL+GFIDG+  CPP+T  S    SSTST      PP     Q NP YE+W+AKDQALMT+INATLSP AL YVVG TSSKQ W+VL K YSS SR+N
Subjt:  QAHKLFGFIDGSTNCPPKTVPSSTTLSSTSTETATEAPPAPVSSQINPLYEEWVAKDQALMTLINATLSPAALVYVVGCTSSKQAWEVLEKHYSSSSRTN

Query:  IVNLKSDLQSITKKSGESIDDYVKRIKELKDKLANVSVIVDEEDILIYALNGLPSIFNTFRTSMRTRSQSVTLDELHVLLKVEEAAIEK
        +VNLKSDLQ+I KK  ESID Y+KRIKE+KDKLANVS  ++EED+LIYALNGLP+ +NTFRTSMRTRSQ VT +ELHVLL+ EE+A+ K
Subjt:  IVNLKSDLQSITKKSGESIDDYVKRIKELKDKLANVSVIVDEEDILIYALNGLPSIFNTFRTSMRTRSQSVTLDELHVLLKVEEAAIEK

XP_016900446.1 PREDICTED: uncharacterized protein LOC103490319 isoform X1 [Cucumis melo]  e-value: 6.2e-58  % identity: 67.72
Query:  QAHKLFGFIDGSTNCPPKTVPSSTTLSSTSTETATEAPPAPVSSQINPLYEEWVAKDQALMTLINATLSPAALVYVVGCTSSKQAWEVLEKHYSSSSRTN
        +AHKL+GFIDG+  CPP+T  S    SSTST      PP     Q NP YE+W+AKDQALMT+INATLSP AL YVVG TSSKQ W+VL K YSS SR+N
Subjt:  QAHKLFGFIDGSTNCPPKTVPSSTTLSSTSTETATEAPPAPVSSQINPLYEEWVAKDQALMTLINATLSPAALVYVVGCTSSKQAWEVLEKHYSSSSRTN

Query:  IVNLKSDLQSITKKSGESIDDYVKRIKELKDKLANVSVIVDEEDILIYALNGLPSIFNTFRTSMRTRSQSVTLDELHVLLKVEEAAIEK
        +VNLKSDLQ+I KK  ESID Y+KRIKE+KDKLANVS  ++EED+LIYALNGLP+ +NTFRTSMRTRSQ VT +ELHVLL+ EE+A+ K
Subjt:  IVNLKSDLQSITKKSGESIDDYVKRIKELKDKLANVSVIVDEEDILIYALNGLPSIFNTFRTSMRTRSQSVTLDELHVLLKVEEAAIEK

TrEMBL top hits (e-value, % identity, alignment)
A0A1S3BI58 uncharacterized protein LOC103490319 isoform X2  e-value: 3.0e-58  % identity: 67.72
Query:  QAHKLFGFIDGSTNCPPKTVPSSTTLSSTSTETATEAPPAPVSSQINPLYEEWVAKDQALMTLINATLSPAALVYVVGCTSSKQAWEVLEKHYSSSSRTN
        +AHKL+GFIDG+  CPP+T  S    SSTST      PP     Q NP YE+W+AKDQALMT+INATLSP AL YVVG TSSKQ W+VL K YSS SR+N
Subjt:  QAHKLFGFIDGSTNCPPKTVPSSTTLSSTSTETATEAPPAPVSSQINPLYEEWVAKDQALMTLINATLSPAALVYVVGCTSSKQAWEVLEKHYSSSSRTN

Query:  IVNLKSDLQSITKKSGESIDDYVKRIKELKDKLANVSVIVDEEDILIYALNGLPSIFNTFRTSMRTRSQSVTLDELHVLLKVEEAAIEK
        +VNLKSDLQ+I KK  ESID Y+KRIKE+KDKLANVS  ++EED+LIYALNGLP+ +NTFRTSMRTRSQ VT +ELHVLL+ EE+A+ K
Subjt:  IVNLKSDLQSITKKSGESIDDYVKRIKELKDKLANVSVIVDEEDILIYALNGLPSIFNTFRTSMRTRSQSVTLDELHVLLKVEEAAIEK

A0A1S3BIR3 uncharacterized protein LOC103490319 isoform X3  e-value: 3.0e-58  % identity: 67.72
Query:  QAHKLFGFIDGSTNCPPKTVPSSTTLSSTSTETATEAPPAPVSSQINPLYEEWVAKDQALMTLINATLSPAALVYVVGCTSSKQAWEVLEKHYSSSSRTN
        +AHKL+GFIDG+  CPP+T  S    SSTST      PP     Q NP YE+W+AKDQALMT+INATLSP AL YVVG TSSKQ W+VL K YSS SR+N
Subjt:  QAHKLFGFIDGSTNCPPKTVPSSTTLSSTSTETATEAPPAPVSSQINPLYEEWVAKDQALMTLINATLSPAALVYVVGCTSSKQAWEVLEKHYSSSSRTN

Query:  IVNLKSDLQSITKKSGESIDDYVKRIKELKDKLANVSVIVDEEDILIYALNGLPSIFNTFRTSMRTRSQSVTLDELHVLLKVEEAAIEK
        +VNLKSDLQ+I KK  ESID Y+KRIKE+KDKLANVS  ++EED+LIYALNGLP+ +NTFRTSMRTRSQ VT +ELHVLL+ EE+A+ K
Subjt:  IVNLKSDLQSITKKSGESIDDYVKRIKELKDKLANVSVIVDEEDILIYALNGLPSIFNTFRTSMRTRSQSVTLDELHVLLKVEEAAIEK

A0A1S4DWT9 uncharacterized protein LOC103490319 isoform X1  e-value: 3.0e-58  % identity: 67.72
Query:  QAHKLFGFIDGSTNCPPKTVPSSTTLSSTSTETATEAPPAPVSSQINPLYEEWVAKDQALMTLINATLSPAALVYVVGCTSSKQAWEVLEKHYSSSSRTN
        +AHKL+GFIDG+  CPP+T  S    SSTST      PP     Q NP YE+W+AKDQALMT+INATLSP AL YVVG TSSKQ W+VL K YSS SR+N
Subjt:  QAHKLFGFIDGSTNCPPKTVPSSTTLSSTSTETATEAPPAPVSSQINPLYEEWVAKDQALMTLINATLSPAALVYVVGCTSSKQAWEVLEKHYSSSSRTN

Query:  IVNLKSDLQSITKKSGESIDDYVKRIKELKDKLANVSVIVDEEDILIYALNGLPSIFNTFRTSMRTRSQSVTLDELHVLLKVEEAAIEK
        +VNLKSDLQ+I KK  ESID Y+KRIKE+KDKLANVS  ++EED+LIYALNGLP+ +NTFRTSMRTRSQ VT +ELHVLL+ EE+A+ K
Subjt:  IVNLKSDLQSITKKSGESIDDYVKRIKELKDKLANVSVIVDEEDILIYALNGLPSIFNTFRTSMRTRSQSVTLDELHVLLKVEEAAIEK

A0A5D3CLI6 T4.5  e-value: 3.0e-58  % identity: 67.72
Query:  QAHKLFGFIDGSTNCPPKTVPSSTTLSSTSTETATEAPPAPVSSQINPLYEEWVAKDQALMTLINATLSPAALVYVVGCTSSKQAWEVLEKHYSSSSRTN
        +AHKL+GFIDG+  CPP+T  S    SSTST      PP     Q NP YE+W+AKDQALMT+INATLSP AL YVVG TSSKQ W+VL K YSS SR+N
Subjt:  QAHKLFGFIDGSTNCPPKTVPSSTTLSSTSTETATEAPPAPVSSQINPLYEEWVAKDQALMTLINATLSPAALVYVVGCTSSKQAWEVLEKHYSSSSRTN

Query:  IVNLKSDLQSITKKSGESIDDYVKRIKELKDKLANVSVIVDEEDILIYALNGLPSIFNTFRTSMRTRSQSVTLDELHVLLKVEEAAIEK
        +VNLKSDLQ+I KK  ESID Y+KRIKE+KDKLANVS  ++EED+LIYALNGLP+ +NTFRTSMRTRSQ VT +ELHVLL+ EE+A+ K
Subjt:  IVNLKSDLQSITKKSGESIDDYVKRIKELKDKLANVSVIVDEEDILIYALNGLPSIFNTFRTSMRTRSQSVTLDELHVLLKVEEAAIEK

A0A6J1DYF1 uncharacterized protein LOC111025709  e-value: 9.0e-55  % identity: 65.79
Query:  QAHKLFGFIDGSTNCPPK-TVPSSTTLSSTSTETATEAPPAPVSSQINPLYEEWVAKDQALMTLINATLSPAALVYVVGCTSSKQAWEVLEKHYSSSSRT
        +AHKL+GFIDGST  P K  V SS +LSSTS          P+   +NP + +W+AKD ALMTL+NATLSP+AL Y+VGC SS+Q W+ L K+YSSSSRT
Subjt:  QAHKLFGFIDGSTNCPPK-TVPSSTTLSSTSTETATEAPPAPVSSQINPLYEEWVAKDQALMTLINATLSPAALVYVVGCTSSKQAWEVLEKHYSSSSRT

Query:  NIVNLKSDLQSITKKSGESIDDYVKRIKELKDKLANVSVIVDEEDILIYALNGLPSIFNTFRTSMRTRSQSVTLDELHVLLKVEEAAIEK
        N+VNLKS+LQSI+KK GESID Y++RIKELKDKLANVSV+VD ED+LIY LNGLP  FN F TSM TRSQSV+ +EL+VLL  EEAAI+K
Subjt:  NIVNLKSDLQSITKKSGESIDDYVKRIKELKDKLANVSVIVDEEDILIYALNGLPSIFNTFRTSMRTRSQSVTLDELHVLLKVEEAAIEK

SwissProt top hits (e-value, % identity, alignment)
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94  e-value: 1.3e-05  % identity: 26.24
Query:  EEWVAKDQALMTLINATLSPAALVYVVGCTSSKQAWEVLEKHYSSSSRTNIVNLKSDLQSITKKSGESIDDYVKRIKELKDKLANVSVIVDEEDILIYAL
        E+W   D+   + I   LS   +  ++   +++  W  LE  Y S + TN + LK  L ++    G +   ++     L  +LAN+ V ++EED  I  L
Subjt:  EEWVAKDQALMTLINATLSPAALVYVVGCTSSKQAWEVLEKHYSSSSRTNIVNLKSDLQSITKKSGESIDDYVKRIKELKDKLANVSVIVDEEDILIYAL

Query:  NGLPSIFNTFRTSMRTRSQSVTLDELHVLLKVEEAAIEKIE
        N LPS ++   T++     ++ L ++   L + E   +K E
Subjt:  NGLPSIFNTFRTSMRTRSQSVTLDELHVLLKVEEAAIEKIE

Arabidopsis top hits (e-value, % identity, alignment)
AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)  e-value: 3.4e-06  % identity: 25.18
Query:  WVAKDQALMTLINATLSPAALV-YVVGCTSSKQAWEVLEKHYSSSSRTNIVNLKSDLQSITKKSGE-SIDDYVKRIKELKDKLANVSVIVDEEDILIYAL
        W  +D  +   +  TL+P       V  ++S+  W  ++  + ++     + L S+L+  TK  G+  + DY +++K+L D L NV V V + ++++Y L
Subjt:  WVAKDQALMTLINATLSPAALV-YVVGCTSSKQAWEVLEKHYSSSSRTNIVNLKSDLQSITKKSGE-SIDDYVKRIKELKDKLANVSVIVDEEDILIYAL

Query:  NGLPSIFNTFRTSMRTRSQSVTLDELHVLLKVEEAAIEK
        NGL   F+     ++ R    + D+   +L+ EE  +++
Subjt:  NGLPSIFNTFRTSMRTRSQSVTLDELHVLLKVEEAAIEK


Sequences
CDS sequence
ATGCACAAGGAGTTAACGAGGACAACCAGGCAGAAATCAGACTGGAAGATGGACCTAAGAGGTGAAACCGACAAGTGGGACGGGCCAAGACCGAAGGGGTCGTCTGGTCT
GGCTCCGTATGCCCTCCTACTTCCGTTTTCCGACTTAAGCATCGGAGGCGGTGTGGCTAGCACCACACCATTCTGCAGGTTTACAGTTTTGCAGGCCACGTCTTCCCCCT
CATCTAAAAATTTACCGTTGGTGGCACGTGAAGGTCAGGCTCACAAGTTGTTTGGATTTATCGATGGCTCCACTAATTGCCCTCCAAAGACGGTTCCATCATCCACCACA
TTGTCCTCTACATCGACGGAGACTGCAACTGAAGCACCTCCTGCTCCAGTATCTTCTCAGATTAATCCACTTTATGAGGAGTGGGTTGCCAAAGATCAAGCTCTAATGAC
ACTGATCAATGCCACTCTGTCGCCGGCAGCCTTAGTCTATGTTGTTGGTTGCACATCATCCAAACAAGCCTGGGAGGTCCTTGAAAAGCACTACTCTTCGAGCTCAAGAA
CCAACATCGTCAATTTGAAGTCAGATCTTCAATCAATAACCAAGAAGTCGGGTGAGTCAATCGATGATTATGTTAAACGAATCAAAGAACTCAAGGATAAATTAGCTAAT
GTTTCTGTTATTGTGGATGAAGAGGATATTCTTATCTATGCCTTAAATGGCTTACCTTCTATTTTTAACACTTTTCGCACATCTATGAGAACACGGTCACAATCGGTTAC
ACTCGATGAGCTCCACGTCCTTCTAAAAGTAGAGGAGGCTGCTATTGAAAAAATCGAAGCAAGATGA
mRNA sequence
ATGCACAAGGAGTTAACGAGGACAACCAGGCAGAAATCAGACTGGAAGATGGACCTAAGAGGTGAAACCGACAAGTGGGACGGGCCAAGACCGAAGGGGTCGTCTGGTCT
GGCTCCGTATGCCCTCCTACTTCCGTTTTCCGACTTAAGCATCGGAGGCGGTGTGGCTAGCACCACACCATTCTGCAGGTTTACAGTTTTGCAGGCCACGTCTTCCCCCT
CATCTAAAAATTTACCGTTGGTGGCACGTGAAGGTCAGGCTCACAAGTTGTTTGGATTTATCGATGGCTCCACTAATTGCCCTCCAAAGACGGTTCCATCATCCACCACA
TTGTCCTCTACATCGACGGAGACTGCAACTGAAGCACCTCCTGCTCCAGTATCTTCTCAGATTAATCCACTTTATGAGGAGTGGGTTGCCAAAGATCAAGCTCTAATGAC
ACTGATCAATGCCACTCTGTCGCCGGCAGCCTTAGTCTATGTTGTTGGTTGCACATCATCCAAACAAGCCTGGGAGGTCCTTGAAAAGCACTACTCTTCGAGCTCAAGAA
CCAACATCGTCAATTTGAAGTCAGATCTTCAATCAATAACCAAGAAGTCGGGTGAGTCAATCGATGATTATGTTAAACGAATCAAAGAACTCAAGGATAAATTAGCTAAT
GTTTCTGTTATTGTGGATGAAGAGGATATTCTTATCTATGCCTTAAATGGCTTACCTTCTATTTTTAACACTTTTCGCACATCTATGAGAACACGGTCACAATCGGTTAC
ACTCGATGAGCTCCACGTCCTTCTAAAAGTAGAGGAGGCTGCTATTGAAAAAATCGAAGCAAGATGA
Protein sequence
MHKELTRTTRQKSDWKMDLRGETDKWDGPRPKGSSGLAPYALLLPFSDLSIGGGVASTTPFCRFTVLQATSSPSSKNLPLVAREGQAHKLFGFIDGSTNCPPKTVPSSTT
LSSTSTETATEAPPAPVSSQINPLYEEWVAKDQALMTLINATLSPAALVYVVGCTSSKQAWEVLEKHYSSSSRTNIVNLKSDLQSITKKSGESIDDYVKRIKELKDKLAN
VSVIVDEEDILIYALNGLPSIFNTFRTSMRTRSQSVTLDELHVLLKVEEAAIEKIEAR