; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0011452 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0011452
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Descriptiongag_pre-integrs domain-containing protein
Genome locationchr1:25139115..25142160
RNA-Seq ExpressionLag0011452
SyntenyLag0011452
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF7129225.1 hypothetical protein RHSIM_Rhsim10G0050800 [Rhododendron simsii]2.2e-1637.18Show/hide
Query:  NNAASVHIASDRSLFTSFTEGHHDLVRMGNGRTSRTSGIGDVSLKTKCGDKLVLRDVRFVPNIKMNLISTGKLVDDGYMCEFGNRQCKLKLGSQKSLRRV
        ++ AS H+ S R  F S+T G    VRMGN   S+  G+GDV L+T  G KL+L+DVR VPNI++NLIS GKL D+GY  +FG+ + KL  GS    R  
Subjt:  NNAASVHIASDRSLFTSFTEGHHDLVRMGNGRTSRTSGIGDVSLKTKCGDKLVLRDVRFVPNIKMNLISTGKLVDDGYMCEFGNRQCKLKLGSQKSLRRV

Query:  EASKWKAIAVAKVRGLVSSLVTCLNRGFKSFFFGNNRSGLEEDNRCHHLAEVGVRV
        ++S    +     +G V+++             G+  S      R  H++E G+++
Subjt:  EASKWKAIAVAKVRGLVSSLVTCLNRGFKSFFFGNNRSGLEEDNRCHHLAEVGVRV

KAF7129546.1 hypothetical protein RHSIM_Rhsim10G0154200 [Rhododendron simsii]3.8e-1637.18Show/hide
Query:  NNAASVHIASDRSLFTSFTEGHHDLVRMGNGRTSRTSGIGDVSLKTKCGDKLVLRDVRFVPNIKMNLISTGKLVDDGYMCEFGNRQCKLKLGSQKSLRRV
        ++ AS H+ S R  F S+T G    VRMGN   S+  G+GDV L+T  G KL+L+DVR VP+I++NLIS GKL D+GY  +FG+ + KL  GS    R  
Subjt:  NNAASVHIASDRSLFTSFTEGHHDLVRMGNGRTSRTSGIGDVSLKTKCGDKLVLRDVRFVPNIKMNLISTGKLVDDGYMCEFGNRQCKLKLGSQKSLRRV

Query:  EASKWKAIAVAKVRGLVSSLVTCLNRGFKSFFFGNNRSGLEEDNRCHHLAEVGVRV
        ++S    + V   +G V+++             G+  S      R  H++E G+++
Subjt:  EASKWKAIAVAKVRGLVSSLVTCLNRGFKSFFFGNNRSGLEEDNRCHHLAEVGVRV

KAG5549868.1 hypothetical protein RHGRI_014986 [Rhododendron griersonianum]1.0e-1637.5Show/hide
Query:  LSMDVASLVAHEKTAVKLMEALTN----NAASVHIASDRSLFTSFTEGHHDLVRMGNGRTSRTSGIGDVSLKTKCGDKLVLRDVRFVPNIKMNLISTGKL
        +S D   +V  +++ V L+   TN    + AS H+ S    FTS+ EG    VRMGN   S+  G+G++ L+T  G KLVL+DVR VP+I++NLISTGKL
Subjt:  LSMDVASLVAHEKTAVKLMEALTN----NAASVHIASDRSLFTSFTEGHHDLVRMGNGRTSRTSGIGDVSLKTKCGDKLVLRDVRFVPNIKMNLISTGKL

Query:  VDDGYMCEFGNRQCKLKLGSQKSLRRVEASKWKAIAVAKVRGLVS-SLVTCLNRGFKSFFFGNNRSGLEEDNRCHHLAEVGVRV
         D+GY   FGN + KL  GS      V A   K  ++  ++G +S  LV  L+ G  +  +           R  H++E G+ V
Subjt:  VDDGYMCEFGNRQCKLKLGSQKSLRRVEASKWKAIAVAKVRGLVS-SLVTCLNRGFKSFFFGNNRSGLEEDNRCHHLAEVGVRV

KAG8367017.1 hypothetical protein BUALT_Bualt16G0028600 [Buddleja alternifolia]5.0e-1638.26Show/hide
Query:  RKVHNAMNERPKGMTDEDWEALNEEVVASIRMCLSMDVASLVAHEKTAVKLMEALTNNAASVHIASDRSLFTSFTEGHHDLVRMGNGRTSRTSGIGDVSL
        R  +N  NE  K   D    A + +VV    +C    V+S       + +  + + ++ AS HI   R +FTS+T G+   VRM N   +   G+G+++L
Subjt:  RKVHNAMNERPKGMTDEDWEALNEEVVASIRMCLSMDVASLVAHEKTAVKLMEALTNNAASVHIASDRSLFTSFTEGHHDLVRMGNGRTSRTSGIGDVSL

Query:  KTKCGDKLVLRDVRFVPNIKMNLISTGKLVDDGYMCEFGNRQCKLKLGS
        +T  G +L+LRDVR++PNI++N+ISTGKL DDGY+  FG  + KL  GS
Subjt:  KTKCGDKLVLRDVRFVPNIKMNLISTGKLVDDGYMCEFGNRQCKLKLGS

NEY13661.1 hypothetical protein [Bifidobacterium pseudocatenulatum]3.8e-1649.47Show/hide
Query:  LTNNAASVHIASDRSLFTSFTEGHHDLVRMGNGRTSRTSGIGDVSLKTKCGDKLVLRDVRFVPNIKMNLISTGKLVDDGYMCEFGNRQCKLKLGS
        + +  AS H+   R  F+S+T G +  VRMGNG++ +  GIGDV L+T+ G KL+L+ VR VP I++NLISTG+L D+GY  EF N + KL  GS
Subjt:  LTNNAASVHIASDRSLFTSFTEGHHDLVRMGNGRTSRTSGIGDVSLKTKCGDKLVLRDVRFVPNIKMNLISTGKLVDDGYMCEFGNRQCKLKLGS

TrEMBL top hitse value%identityAlignment
A0A484KV97 Reverse transcriptase domain-containing protein7.0e-1649.46Show/hide
Query:  NNAASVHIASDRSLFTSFTEGHHDLVRMGNGRTSRTSGIGDVSLKTKCGDKLVLRDVRFVPNIKMNLISTGKLVDDGYMCEFGNRQCKLKLGS
        ++ A+ H+ S +  FTS+T G + +++MGN   S+  GIG V L+TK G KLVL++VR  P+I++NLIST  L D+GYM  FG+ QCKL  GS
Subjt:  NNAASVHIASDRSLFTSFTEGHHDLVRMGNGRTSRTSGIGDVSLKTKCGDKLVLRDVRFVPNIKMNLISTGKLVDDGYMCEFGNRQCKLKLGS

A0A484LZ49 gag_pre-integrs domain-containing protein7.0e-1637.84Show/hide
Query:  KVHNAMNERPKGMTDEDWEALNEEVVASIRMCLSMDVASLVAHEKTAVKLMEALTNNAASVHIASDRSLFTSFTEGHHDLVRMGNGRTSRTSGIGDVSLK
        K+ N    +P+G   E+   +       + +    D+ ++ + E + V     + + AAS H+ S +  FTS+T G + +++MGN   S+  GIG V L+
Subjt:  KVHNAMNERPKGMTDEDWEALNEEVVASIRMCLSMDVASLVAHEKTAVKLMEALTNNAASVHIASDRSLFTSFTEGHHDLVRMGNGRTSRTSGIGDVSLK

Query:  TKCGDKLVLRDVRFVPNIKMNLISTGKLVDDGYMCEFGNRQCKLKLGS
        TK G KLVL++VR  P+I++NLIST  L D+GYM  FG+ QCKL  GS
Subjt:  TKCGDKLVLRDVRFVPNIKMNLISTGKLVDDGYMCEFGNRQCKLKLGS

A0A484LZS2 gag_pre-integrs domain-containing protein9.2e-1635.81Show/hide
Query:  KVHNAMNERPKGMTDEDWEALNEEVVASIRMCLSMDVASLVAHEKTAVKLMEALTNNAASVHIASDRSLFTSFTEGHHDLVRMGNGRTSRTSGIGDVSLK
        K+ N    +P+G   E+   +     + + +    D+ ++ + E + V       ++ A+ H+ S +  FTS+T G + +++MGN   S+  GIG V L+
Subjt:  KVHNAMNERPKGMTDEDWEALNEEVVASIRMCLSMDVASLVAHEKTAVKLMEALTNNAASVHIASDRSLFTSFTEGHHDLVRMGNGRTSRTSGIGDVSLK

Query:  TKCGDKLVLRDVRFVPNIKMNLISTGKLVDDGYMCEFGNRQCKLKLGS
        TK G KLVL++VR  P+I++NLIST  L D+GY+  FG+ QCKL  GS
Subjt:  TKCGDKLVLRDVRFVPNIKMNLISTGKLVDDGYMCEFGNRQCKLKLGS

A0A484M452 gag_pre-integrs domain-containing protein3.2e-1636.49Show/hide
Query:  KVHNAMNERPKGMTDEDWEALNEEVVASIRMCLSMDVASLVAHEKTAVKLMEALTNNAASVHIASDRSLFTSFTEGHHDLVRMGNGRTSRTSGIGDVSLK
        K+ N    +P+G   E+   +       + +    D+ ++ + E + V       ++ A+ H+ S +  FTS+T G + +++MGN   S+  GIG V L+
Subjt:  KVHNAMNERPKGMTDEDWEALNEEVVASIRMCLSMDVASLVAHEKTAVKLMEALTNNAASVHIASDRSLFTSFTEGHHDLVRMGNGRTSRTSGIGDVSLK

Query:  TKCGDKLVLRDVRFVPNIKMNLISTGKLVDDGYMCEFGNRQCKLKLGS
        TK G KLVL++VR  P+I++NLIST  L D+GYM  FG+ QCKL  GS
Subjt:  TKCGDKLVLRDVRFVPNIKMNLISTGKLVDDGYMCEFGNRQCKLKLGS

A0A803N0B7 Uncharacterized protein3.2e-1642.86Show/hide
Query:  LVAHEKTAVKLMEALT----NNAASVHIASDRSLFTSFTEGHHDLVRMGNGRTSRTSGIGDVSLKTKCGDKLVLRDVRFVPNIKMNLISTGKLVDDGYMC
        ++ H++++V L    T    ++ AS+H+ S +  FTS+T G   +++MGN   ++  G+GDV L T  G KLVL++VR VP+I++NLISTGKL D+ Y  
Subjt:  LVAHEKTAVKLMEALT----NNAASVHIASDRSLFTSFTEGHHDLVRMGNGRTSRTSGIGDVSLKTKCGDKLVLRDVRFVPNIKMNLISTGKLVDDGYMC

Query:  EFGNRQCKLKLGSQKSLRR
         F + QCKL  GS    RR
Subjt:  EFGNRQCKLKLGSQKSLRR

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.1e-1344.33Show/hide
Query:  EALTNNAASVHIASDRSLFTSFTEGHHDLVRMGNGRTSRTSGIGDVSLKTKCGDKLVLRDVRFVPNIKMNLISTGKLVDDGYMCEFGNRQCKLKLGS
        E + + AAS H    R LF  +  G    V+MGN   S+ +GIGD+ +KT  G  LVL+DVR VP+++MNLIS   L  DGY   F N++ +L  GS
Subjt:  EALTNNAASVHIASDRSLFTSFTEGHHDLVRMGNGRTSRTSGIGDVSLKTKCGDKLVLRDVRFVPNIKMNLISTGKLVDDGYMCEFGNRQCKLKLGS

Arabidopsis top hitse value%identityAlignment
AT3G21000.1 Gag-Pol-related retrotransposon family protein8.0e-0428.24Show/hide
Query:  LTNNAASVHIASDRSLFTSFTEGHHDLVRMGNGRTSRTSGIGDVSLKTKCGDKLVLRDVRFVPNIKMNLISTGKLVDDGYMCEFG
        + +  A +++      FT+        V   +G      G GDV ++ K G K  +R+V FVP +  N++S GK+V   Y    G
Subjt:  LTNNAASVHIASDRSLFTSFTEGHHDLVRMGNGRTSRTSGIGDVSLKTKCGDKLVLRDVRFVPNIKMNLISTGKLVDDGYMCEFG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATGACAGCTACAGCCATATCAATCCATTCGACATGGACCTAAGGAAAGTGCATAACGCAATGAATGAGAGACCGAAAGGGATGACCGACGAAGATTGGGAAGCTCT
GAATGAAGAGGTAGTTGCAAGCATAAGGATGTGTTTATCAATGGATGTGGCAAGTCTAGTGGCCCATGAGAAAACTGCGGTTAAATTAATGGAAGCACTTACAAACAATG
CAGCGTCTGTACACATAGCTTCAGATAGGAGTTTGTTCACATCATTCACAGAAGGACATCATGATCTAGTAAGGATGGGGAATGGTAGAACCTCCAGAACTAGTGGGATT
GGAGATGTTAGTTTGAAGACAAAATGTGGAGATAAATTAGTACTGCGAGATGTCAGGTTTGTGCCTAATATCAAGATGAATCTTATTTCTACTGGAAAGTTGGTAGATGA
TGGTTACATGTGTGAGTTTGGTAATCGCCAGTGTAAACTCAAGTTAGGATCCCAGAAGTCACTTAGGCGAGTTGAGGCATCAAAGTGGAAGGCCATAGCAGTTGCTAAGG
TCAGAGGTCTGGTCTCTAGCTTGGTAACATGTTTGAATAGAGGATTCAAGTCATTCTTCTTCGGGAACAATCGTTCAGGGTTGGAAGAAGATAACAGGTGTCACCACTTA
GCTGAAGTGGGAGTACGTGTTTGTCTTTTTCGTCTGGGAGATAGGTCCACATAG
mRNA sequenceShow/hide mRNA sequence
ATGGATGACAGCTACAGCCATATCAATCCATTCGACATGGACCTAAGGAAAGTGCATAACGCAATGAATGAGAGACCGAAAGGGATGACCGACGAAGATTGGGAAGCTCT
GAATGAAGAGGTAGTTGCAAGCATAAGGATGTGTTTATCAATGGATGTGGCAAGTCTAGTGGCCCATGAGAAAACTGCGGTTAAATTAATGGAAGCACTTACAAACAATG
CAGCGTCTGTACACATAGCTTCAGATAGGAGTTTGTTCACATCATTCACAGAAGGACATCATGATCTAGTAAGGATGGGGAATGGTAGAACCTCCAGAACTAGTGGGATT
GGAGATGTTAGTTTGAAGACAAAATGTGGAGATAAATTAGTACTGCGAGATGTCAGGTTTGTGCCTAATATCAAGATGAATCTTATTTCTACTGGAAAGTTGGTAGATGA
TGGTTACATGTGTGAGTTTGGTAATCGCCAGTGTAAACTCAAGTTAGGATCCCAGAAGTCACTTAGGCGAGTTGAGGCATCAAAGTGGAAGGCCATAGCAGTTGCTAAGG
TCAGAGGTCTGGTCTCTAGCTTGGTAACATGTTTGAATAGAGGATTCAAGTCATTCTTCTTCGGGAACAATCGTTCAGGGTTGGAAGAAGATAACAGGTGTCACCACTTA
GCTGAAGTGGGAGTACGTGTTTGTCTTTTTCGTCTGGGAGATAGGTCCACATAG
Protein sequenceShow/hide protein sequence
MDDSYSHINPFDMDLRKVHNAMNERPKGMTDEDWEALNEEVVASIRMCLSMDVASLVAHEKTAVKLMEALTNNAASVHIASDRSLFTSFTEGHHDLVRMGNGRTSRTSGI
GDVSLKTKCGDKLVLRDVRFVPNIKMNLISTGKLVDDGYMCEFGNRQCKLKLGSQKSLRRVEASKWKAIAVAKVRGLVSSLVTCLNRGFKSFFFGNNRSGLEEDNRCHHL
AEVGVRVCLFRLGDRST