; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0039449 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0039449
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Descriptiongag_pre-integrs domain-containing protein
Genome locationchr2:43945655..43947365
RNA-Seq ExpressionLag0039449
SyntenyLag0039449
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6390526.1 hypothetical protein SASPL_148264 [Salvia splendens]1.6e-1045.45Show/hide
Query:  GHHGLVRMENGRTSKTSGIGDVGLKTECGDKLVLRDVRFVPNIKMNLISTVKLADDGYMCEFGSLRCKLKLRSQWMLVKTAYDSCRGK
        G  G VRM N   ++ +GIGD+ L T+ G KLVLRDVR VP+I++N+IST KL DDGY+  FG          +W L+K +  + +G+
Subjt:  GHHGLVRMENGRTSKTSGIGDVGLKTECGDKLVLRDVRFVPNIKMNLISTVKLADDGYMCEFGSLRCKLKLRSQWMLVKTAYDSCRGK

KAG6409956.1 hypothetical protein SASPL_128000 [Salvia splendens]2.4e-1137.76Show/hide
Query:  GHHGLVRMENGRTSKTSGIGDVGLKTECGDKLVLRDVRFVPNIKMNLISTVKLADDGYMCEFGSLRCKLKLRSQWMLVKTAYDSCRGKVELAAKI-ANFD
        G  G VRM N   ++ +GIGD+ L T+ G KLVLRDVR VP+I++N+IST KL DDGY+  FG          +W L+K +  + + K E  +++  N D
Subjt:  GHHGLVRMENGRTSKTSGIGDVGLKTECGDKLVLRDVRFVPNIKMNLISTVKLADDGYMCEFGSLRCKLKLRSQWMLVKTAYDSCRGKVELAAKI-ANFD

Query:  EF------DHDPSIQKQLGSP-------GEKVDGYRESPVVRR
                D+   IQ + G P        E      E P+VRR
Subjt:  EF------DHDPSIQKQLGSP-------GEKVDGYRESPVVRR

KAG6437371.1 hypothetical protein SASPL_102286 [Salvia splendens]1.9e-1137.76Show/hide
Query:  GHHGLVRMENGRTSKTSGIGDVGLKTECGDKLVLRDVRFVPNIKMNLISTVKLADDGYMCEFGSLRCKLKLRSQWMLVKTAYDSCRGKVELAAKI-ANFD
        G  G VRM N   ++ +GIGD+ L T+ G KLVLRDVR VP+I++N+IST KL DDGY+  FG          +W L+K +  + + K E  +++  N D
Subjt:  GHHGLVRMENGRTSKTSGIGDVGLKTECGDKLVLRDVRFVPNIKMNLISTVKLADDGYMCEFGSLRCKLKLRSQWMLVKTAYDSCRGKVELAAKI-ANFD

Query:  EF------DHDPSIQKQLGSP-------GEKVDGYRESPVVRR
                D+   IQ + G P        E      E P+VRR
Subjt:  EF------DHDPSIQKQLGSP-------GEKVDGYRESPVVRR

KAG8364690.1 hypothetical protein BUALT_Bualt18G0024700 [Buddleja alternifolia]1.2e-1039.22Show/hide
Query:  GHHGLVRMENGRTSKTSGIGDVGLKTECGDKLVLRDVRFVPNIKMNLISTVKLADDGYMCEFGSLRCKLKLRSQWMLVKTAYDSCRGKVE-----LAAKI
        G+ G VRM N   ++  G+G++ L+T+ G +L+LRDVR +PNI++N+IST KL DDGY+  FG          +W L K +  + +GK +     + AK+
Subjt:  GHHGLVRMENGRTSKTSGIGDVGLKTECGDKLVLRDVRFVPNIKMNLISTVKLADDGYMCEFGSLRCKLKLRSQWMLVKTAYDSCRGKVE-----LAAKI

Query:  AN
        +N
Subjt:  AN

KAG8376937.1 hypothetical protein BUALT_Bualt09G0116000 [Buddleja alternifolia]7.1e-1139.22Show/hide
Query:  GHHGLVRMENGRTSKTSGIGDVGLKTECGDKLVLRDVRFVPNIKMNLISTVKLADDGYMCEFGSLRCKLKLRSQWMLVKTAYDSCRGKVE-----LAAKI
        G+ G VRM N   ++  G+G++ L+T+ G +L+LRDVR +PNI++N+IST KL DDGY+  FG          +W L K +  + +GK +     + AK+
Subjt:  GHHGLVRMENGRTSKTSGIGDVGLKTECGDKLVLRDVRFVPNIKMNLISTVKLADDGYMCEFGSLRCKLKLRSQWMLVKTAYDSCRGKVE-----LAAKI

Query:  AN
        +N
Subjt:  AN

TrEMBL top hitse value%identityAlignment
A0A2I0HZM2 Uncharacterized protein (Fragment)2.5e-0955.07Show/hide
Query:  GHHGLVRMENGRTSKTSGIGDVGLKTECGDKLVLRDVRFVPNIKMNLISTVKLADDGYMCEFGSLRCKL
        G +G VRM NG++ K  GIGDV L+TE G KL+L+ VR VP I++NLIST +L D+G+  EF + R KL
Subjt:  GHHGLVRMENGRTSKTSGIGDVGLKTECGDKLVLRDVRFVPNIKMNLISTVKLADDGYMCEFGSLRCKL

A0A2I0IPH9 Integrase catalytic domain-containing protein5.0e-1056.52Show/hide
Query:  GHHGLVRMENGRTSKTSGIGDVGLKTECGDKLVLRDVRFVPNIKMNLISTVKLADDGYMCEFGSLRCKL
        G +G VRM NG++ K  GIGDV L+TE G KL+L+ VR VP I++NLIST +L D+GY  EF + R KL
Subjt:  GHHGLVRMENGRTSKTSGIGDVGLKTECGDKLVLRDVRFVPNIKMNLISTVKLADDGYMCEFGSLRCKL

A0A2I0IXX9 Integrase catalytic domain-containing protein2.5e-0955.07Show/hide
Query:  GHHGLVRMENGRTSKTSGIGDVGLKTECGDKLVLRDVRFVPNIKMNLISTVKLADDGYMCEFGSLRCKL
        G +G VRM NG++ K  GIGDV L+TE G KL+L+ VR VP I++NLIS  +L D+GY  EF + R KL
Subjt:  GHHGLVRMENGRTSKTSGIGDVGLKTECGDKLVLRDVRFVPNIKMNLISTVKLADDGYMCEFGSLRCKL

A0A4D8YHC2 CCHC-type domain-containing protein7.6e-1145.45Show/hide
Query:  GHHGLVRMENGRTSKTSGIGDVGLKTECGDKLVLRDVRFVPNIKMNLISTVKLADDGYMCEFGSLRCKLKLRSQWMLVKTAYDSCRGK
        G  G VRM N   ++ +GIGD+ L T+ G KLVLRDVR VP+I++N+IST KL DDGY+  FG          +W L+K +  + +G+
Subjt:  GHHGLVRMENGRTSKTSGIGDVGLKTECGDKLVLRDVRFVPNIKMNLISTVKLADDGYMCEFGSLRCKLKLRSQWMLVKTAYDSCRGK

A0A4D9AG59 CCHC-type domain-containing protein1.4e-0944.71Show/hide
Query:  GLVRMENGRTSKTSGIGDVGLKTECGDKLVLRDVRFVPNIKMNLISTVKLADDGYMCEFGSLRCKLKLRSQWMLVKTAYDSCRGK
        G VRM N   ++ +GIG + L T+ G KLVLRDVR VP+I++N+IST KL DDGY+  FG          +W L+K +  + +G+
Subjt:  GLVRMENGRTSKTSGIGDVGLKTECGDKLVLRDVRFVPNIKMNLISTVKLADDGYMCEFGSLRCKLKLRSQWMLVKTAYDSCRGK

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.3e-1034.88Show/hide
Query:  RRCEGHHGLVRMENGRTSKTSGIGDVGLKTECGDKLVLRDVRFVPNIKMNLISTVKLADDGYMCEFGSLRCKL--------KLRSQWMLVKTAYDSCRGK
        R   G  G V+M N   SK +GIGD+ +KT  G  LVL+DVR VP+++MNLIS + L  DGY   F + + +L        K  ++  L +T  + C+G+
Subjt:  RRCEGHHGLVRMENGRTSKTSGIGDVGLKTECGDKLVLRDVRFVPNIKMNLISTVKLADDGYMCEFGSLRCKL--------KLRSQWMLVKTAYDSCRGK

Query:  VELAAKIANFDEFDHDPSIQKQLGSPGEK
        +  A    + D +       K++G   EK
Subjt:  VELAAKIANFDEFDHDPSIQKQLGSPGEK

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGGGTCAAAGTTACTCCTGGTTCTTCGGATGGTTGGAGTTCGCGGTGGCGACAGATATCAGATCGGGATTGGATTAGTGGCGGAGTTCGTCGGAGTGGCGGCGGCGT
AGGTCTTCAACTGCGGCGACGATGTGAAGGACATCATGGTCTAGTGAGGATGGAGAATGGTAGAACCTCCAAAACTAGTGGAATTGGAGATGTTGGTCTGAAGACAGAAT
GTGGAGATAAATTAGTACTGCGAGATGTCAGGTTTGTGCCTAATATCAAGATGAATCTTATTTCTACTGTCAAGTTGGCAGATGATGGTTACATGTGTGAGTTTGGTAGT
CTCCGGTGTAAACTCAAGTTAAGATCCCAGTGGATGTTGGTTAAAACTGCATATGATAGTTGTAGAGGTAAAGTGGAGCTAGCAGCAAAGATAGCCAATTTCGATGAGTT
CGATCACGATCCTTCAATTCAGAAACAATTGGGAAGTCCAGGAGAGAAAGTTGATGGCTATCGTGAATCCCCAGTTGTCAGACGCTTGAATGAATTGAAGAGGTCGCTTA
GGCGAGTTGATGCATCAAAGTGGAATGCCAGAGCAGTTGCTAATGGAGCCGAAAAATATGAATAA
mRNA sequenceShow/hide mRNA sequence
ATGTGGGTCAAAGTTACTCCTGGTTCTTCGGATGGTTGGAGTTCGCGGTGGCGACAGATATCAGATCGGGATTGGATTAGTGGCGGAGTTCGTCGGAGTGGCGGCGGCGT
AGGTCTTCAACTGCGGCGACGATGTGAAGGACATCATGGTCTAGTGAGGATGGAGAATGGTAGAACCTCCAAAACTAGTGGAATTGGAGATGTTGGTCTGAAGACAGAAT
GTGGAGATAAATTAGTACTGCGAGATGTCAGGTTTGTGCCTAATATCAAGATGAATCTTATTTCTACTGTCAAGTTGGCAGATGATGGTTACATGTGTGAGTTTGGTAGT
CTCCGGTGTAAACTCAAGTTAAGATCCCAGTGGATGTTGGTTAAAACTGCATATGATAGTTGTAGAGGTAAAGTGGAGCTAGCAGCAAAGATAGCCAATTTCGATGAGTT
CGATCACGATCCTTCAATTCAGAAACAATTGGGAAGTCCAGGAGAGAAAGTTGATGGCTATCGTGAATCCCCAGTTGTCAGACGCTTGAATGAATTGAAGAGGTCGCTTA
GGCGAGTTGATGCATCAAAGTGGAATGCCAGAGCAGTTGCTAATGGAGCCGAAAAATATGAATAA
Protein sequenceShow/hide protein sequence
MWVKVTPGSSDGWSSRWRQISDRDWISGGVRRSGGGVGLQLRRRCEGHHGLVRMENGRTSKTSGIGDVGLKTECGDKLVLRDVRFVPNIKMNLISTVKLADDGYMCEFGS
LRCKLKLRSQWMLVKTAYDSCRGKVELAAKIANFDEFDHDPSIQKQLGSPGEKVDGYRESPVVRRLNELKRSLRRVDASKWNARAVANGAEKYE