; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0026314 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0026314
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionIntegrase catalytic domain-containing protein
Genome locationchr10:34450430..34453416
RNA-Seq ExpressionLag0026314
SyntenyLag0026314
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0016310 - phosphorylation (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0016301 - kinase activity (molecular function)
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR029472 - Retrotransposon Copia-like, N-terminal


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0065480.1 Cysteine-rich RLK (receptor-like protein kinase) 8 [Cucumis melo var. makuwa]1.1e-7065.53Show/hide
Query:  SSMIDAQLNPFHLHHSYSSLAVLVTQPLLGARNYNSWSKAMLIALLGKNKEGFIDGTIKKP-EGRKSAAWKCNNDIITSWILNSVSKEIVASVNYTGSAK
        +S  DAQLNP+ +HHS    A +VTQPL GA NY SWS+AML+A+ G+NK GFI G I+KP +G    AW CNNDI+ SWILNSVSKEI AS+ Y GS K
Subjt:  SSMIDAQLNPFHLHHSYSSLAVLVTQPLLGARNYNSWSKAMLIALLGKNKEGFIDGTIKKP-EGRKSAAWKCNNDIITSWILNSVSKEIVASVNYTGSAK

Query:  AVWDELHGRFKQSNGPHIYQLRKELVTAAQGNMSVESYYTKLKTIWQDLCEYRPVLACSCGGIKSFIEHLDSGFVMIFLMGLSDSFASVRAQILLMAPLP
         +WDEL  RFKQSNGP IYQLRKE VT  QGN+++E+YYTKLKTIWQ+L EYR    C+CGG+K FI+HL+S ++M FLMGL+DS+A+VRAQILLM PLP
Subjt:  AVWDELHGRFKQSNGPHIYQLRKELVTAAQGNMSVESYYTKLKTIWQDLCEYRPVLACSCGGIKSFIEHLDSGFVMIFLMGLSDSFASVRAQILLMAPLP

Query:  SITEVF
        SI  VF
Subjt:  SITEVF

KAB5512286.1 hypothetical protein DKX38_029314 [Salix brachista]9.5e-5146.85Show/hide
Query:  IADKDASNNDSSAIDDSSMIDAQLNPFHLHHSYSSLAVLVTQPLLGARNYNSWSKAMLIALLGKNKEGFIDGTIKKPEGRKSA---AWKCNNDIITSWIL
        +A   AS N SS +   S ID   +P++LHH  S   VL++QPL G  NYN+WS++M +AL  KNK  F+DG+++KP         AW  +N+++ SW+L
Subjt:  IADKDASNNDSSAIDDSSMIDAQLNPFHLHHSYSSLAVLVTQPLLGARNYNSWSKAMLIALLGKNKEGFIDGTIKKPEGRKSA---AWKCNNDIITSWIL

Query:  NSVSKEIVASVNYTGSAKAVWDELHGRFKQSNGPHIYQLRKELVTAAQGNMSVESYYTKLKTIWQDLCEYRPVLACSCGGIKSFIEHLDSGFVMIFLMGL
        NS+SKEI +SV Y  SA  +W++L  RF Q NGP I+QL+K +   +QG+M+V +YYT+LK +W +L  YRP+  CSCG +K+ ++H    ++  FLMGL
Subjt:  NSVSKEIVASVNYTGSAKAVWDELHGRFKQSNGPHIYQLRKELVTAAQGNMSVESYYTKLKTIWQDLCEYRPVLACSCGGIKSFIEHLDSGFVMIFLMGL

Query:  SDSFASVRAQILLMAPLPSITE
        +DSFA +R QILL+ PLPSI +
Subjt:  SDSFASVRAQILLMAPLPSITE

KYP75905.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan]2.3e-5248.62Show/hide
Query:  ASNNDSSAIDDSSMIDAQLNPFHLHHSYSSLAVLVTQPLLGARNYNSWSKAMLIALLGKNKEGFIDGTIKK--PEGRKSAAWKCNNDIITSWILNSVSKE
        AS N+SS++      D   NP+ LH S +  A +V+QPL G  NYNSWS+A+L+AL  KNK GF+DGTI K  P  +   +W+ NN+I+ SW+LN +SK+
Subjt:  ASNNDSSAIDDSSMIDAQLNPFHLHHSYSSLAVLVTQPLLGARNYNSWSKAMLIALLGKNKEGFIDGTIKK--PEGRKSAAWKCNNDIITSWILNSVSKE

Query:  IVASVNYTGSAKAVWDELHGRFKQSNGPHIYQLRKELVTAAQGNMSVESYYTKLKTIWQDLCEYRPVLACSCGGIKSFIEHLDSGFVMIFLMGLSDSFAS
        + ASV Y+ SA A+W++L  RF+Q NGP ++QLR++LVT  QG++++  Y+TK+K +W++L EY+P  AC+CGGIK +I+H  S + M+FLMGL++ ++ 
Subjt:  IVASVNYTGSAKAVWDELHGRFKQSNGPHIYQLRKELVTAAQGNMSVESYYTKLKTIWQDLCEYRPVLACSCGGIKSFIEHLDSGFVMIFLMGLSDSFAS

Query:  VRAQILLMAPLPSITEVF
        +R QILLM P+P I + F
Subjt:  VRAQILLMAPLPSITEVF

XP_020221438.1 uncharacterized protein LOC109804079 [Cajanus cajan]2.3e-5248.62Show/hide
Query:  ASNNDSSAIDDSSMIDAQLNPFHLHHSYSSLAVLVTQPLLGARNYNSWSKAMLIALLGKNKEGFIDGTIKK--PEGRKSAAWKCNNDIITSWILNSVSKE
        AS N+SS++      D   NP+ LH S +  A +V+QPL G  NYNSWS+A+L+AL  KNK GF+DGTI K  P  +   +W+ NN+I+ SW+LN +SK+
Subjt:  ASNNDSSAIDDSSMIDAQLNPFHLHHSYSSLAVLVTQPLLGARNYNSWSKAMLIALLGKNKEGFIDGTIKK--PEGRKSAAWKCNNDIITSWILNSVSKE

Query:  IVASVNYTGSAKAVWDELHGRFKQSNGPHIYQLRKELVTAAQGNMSVESYYTKLKTIWQDLCEYRPVLACSCGGIKSFIEHLDSGFVMIFLMGLSDSFAS
        + ASV Y+ SA A+W++L  RF+Q NGP ++QLR++LVT  QG++++  Y+TK+K +W++L EY+P  AC+CGGIK +I+H  S + M+FLMGL++ ++ 
Subjt:  IVASVNYTGSAKAVWDELHGRFKQSNGPHIYQLRKELVTAAQGNMSVESYYTKLKTIWQDLCEYRPVLACSCGGIKSFIEHLDSGFVMIFLMGLSDSFAS

Query:  VRAQILLMAPLPSITEVF
        +R QILLM P+P I + F
Subjt:  VRAQILLMAPLPSITEVF

XP_022145891.1 uncharacterized protein LOC111015239 [Momordica charantia]2.6e-7261.26Show/hide
Query:  IADKD-ASNNDSSAIDDSSMIDAQLNPFHLHHSYSSLAVLVTQPLLGARNYNSWSKAMLIALLGKNKEGFIDGTIKKPEGRKSAAWKCNNDIITSWILNS
        +AD++  S+  +S+   ++ I++QLNP+ +HHS +   +LVTQ LLGA NYNSW ++MLIAL GKNK GFIDGTIKKP G   AAWKCNNDIITSWI+NS
Subjt:  IADKD-ASNNDSSAIDDSSMIDAQLNPFHLHHSYSSLAVLVTQPLLGARNYNSWSKAMLIALLGKNKEGFIDGTIKKPEGRKSAAWKCNNDIITSWILNS

Query:  VSKEIVASVNYTGSAKAVWDELHGRFKQSNGPHIYQLRKELVTAAQGNMSVESYYTKLKTIWQDLCEYRPVLACSCGGIKSFIEHLDSGFVMIFLMGLSD
        VSKEI AS+ YTGSAK +WDEL  RF+QS+ P I+QLRKELVT  QG +S+E+YYTKLKT+WQ+L +YRP + C+C G+KS  E   S +VM FLMGL++
Subjt:  VSKEIVASVNYTGSAKAVWDELHGRFKQSNGPHIYQLRKELVTAAQGNMSVESYYTKLKTIWQDLCEYRPVLACSCGGIKSFIEHLDSGFVMIFLMGLSD

Query:  SFASVRAQILLMAPLPSITEVF
        S+A +RAQILLM P+P + +VF
Subjt:  SFASVRAQILLMAPLPSITEVF

TrEMBL top hitse value%identityAlignment
A0A151RKK2 Uncharacterized protein1.0e-5048.17Show/hide
Query:  ASNNDSSAIDDSSMIDAQLNPFHLHHSYSSLAVLVTQPLLGARNYNSWSKAMLIALLGKNKEGFIDGTIKKPE--GRKSAAWKCNNDIITSWILNSVSKE
        A  N SS  +D        N F LH S +  A +V+QPL G  NYNSWS  +L+AL GKNK GF+DGTI KP+   +   +W+ NN+I+ SW+LNS+SK+
Subjt:  ASNNDSSAIDDSSMIDAQLNPFHLHHSYSSLAVLVTQPLLGARNYNSWSKAMLIALLGKNKEGFIDGTIKKPE--GRKSAAWKCNNDIITSWILNSVSKE

Query:  IVASVNYTGSAKAVWDELHGRFKQSNGPHIYQLRKELVTAAQGNMSVESYYTKLKTIWQDLCEYRPVLACSCGGIKSFIEHLDSGFVMIFLMGLSDSFAS
        + ASV Y+ SA A+W++L  RF+Q NGP ++QLR++L+T  Q + S+  Y+TK+K  W++L EY+P+  C CGGIK +I++    +VM+FLMGL++ ++ 
Subjt:  IVASVNYTGSAKAVWDELHGRFKQSNGPHIYQLRKELVTAAQGNMSVESYYTKLKTIWQDLCEYRPVLACSCGGIKSFIEHLDSGFVMIFLMGLSDSFAS

Query:  VRAQILLMAPLPSITEVF
        +R QILLM P+PSI +VF
Subjt:  VRAQILLMAPLPSITEVF

A0A151U9A5 Retrovirus-related Pol polyprotein from transposon TNT 1-941.1e-5248.62Show/hide
Query:  ASNNDSSAIDDSSMIDAQLNPFHLHHSYSSLAVLVTQPLLGARNYNSWSKAMLIALLGKNKEGFIDGTIKK--PEGRKSAAWKCNNDIITSWILNSVSKE
        AS N+SS++      D   NP+ LH S +  A +V+QPL G  NYNSWS+A+L+AL  KNK GF+DGTI K  P  +   +W+ NN+I+ SW+LN +SK+
Subjt:  ASNNDSSAIDDSSMIDAQLNPFHLHHSYSSLAVLVTQPLLGARNYNSWSKAMLIALLGKNKEGFIDGTIKK--PEGRKSAAWKCNNDIITSWILNSVSKE

Query:  IVASVNYTGSAKAVWDELHGRFKQSNGPHIYQLRKELVTAAQGNMSVESYYTKLKTIWQDLCEYRPVLACSCGGIKSFIEHLDSGFVMIFLMGLSDSFAS
        + ASV Y+ SA A+W++L  RF+Q NGP ++QLR++LVT  QG++++  Y+TK+K +W++L EY+P  AC+CGGIK +I+H  S + M+FLMGL++ ++ 
Subjt:  IVASVNYTGSAKAVWDELHGRFKQSNGPHIYQLRKELVTAAQGNMSVESYYTKLKTIWQDLCEYRPVLACSCGGIKSFIEHLDSGFVMIFLMGLSDSFAS

Query:  VRAQILLMAPLPSITEVF
        +R QILLM P+P I + F
Subjt:  VRAQILLMAPLPSITEVF

A0A5A7VE66 Cysteine-rich RLK (Receptor-like protein kinase) 85.3e-7165.53Show/hide
Query:  SSMIDAQLNPFHLHHSYSSLAVLVTQPLLGARNYNSWSKAMLIALLGKNKEGFIDGTIKKP-EGRKSAAWKCNNDIITSWILNSVSKEIVASVNYTGSAK
        +S  DAQLNP+ +HHS    A +VTQPL GA NY SWS+AML+A+ G+NK GFI G I+KP +G    AW CNNDI+ SWILNSVSKEI AS+ Y GS K
Subjt:  SSMIDAQLNPFHLHHSYSSLAVLVTQPLLGARNYNSWSKAMLIALLGKNKEGFIDGTIKKP-EGRKSAAWKCNNDIITSWILNSVSKEIVASVNYTGSAK

Query:  AVWDELHGRFKQSNGPHIYQLRKELVTAAQGNMSVESYYTKLKTIWQDLCEYRPVLACSCGGIKSFIEHLDSGFVMIFLMGLSDSFASVRAQILLMAPLP
         +WDEL  RFKQSNGP IYQLRKE VT  QGN+++E+YYTKLKTIWQ+L EYR    C+CGG+K FI+HL+S ++M FLMGL+DS+A+VRAQILLM PLP
Subjt:  AVWDELHGRFKQSNGPHIYQLRKELVTAAQGNMSVESYYTKLKTIWQDLCEYRPVLACSCGGIKSFIEHLDSGFVMIFLMGLSDSFASVRAQILLMAPLP

Query:  SITEVF
        SI  VF
Subjt:  SITEVF

A0A5N5J1A3 Uncharacterized protein4.6e-5146.85Show/hide
Query:  IADKDASNNDSSAIDDSSMIDAQLNPFHLHHSYSSLAVLVTQPLLGARNYNSWSKAMLIALLGKNKEGFIDGTIKKPEGRKSA---AWKCNNDIITSWIL
        +A   AS N SS +   S ID   +P++LHH  S   VL++QPL G  NYN+WS++M +AL  KNK  F+DG+++KP         AW  +N+++ SW+L
Subjt:  IADKDASNNDSSAIDDSSMIDAQLNPFHLHHSYSSLAVLVTQPLLGARNYNSWSKAMLIALLGKNKEGFIDGTIKKPEGRKSA---AWKCNNDIITSWIL

Query:  NSVSKEIVASVNYTGSAKAVWDELHGRFKQSNGPHIYQLRKELVTAAQGNMSVESYYTKLKTIWQDLCEYRPVLACSCGGIKSFIEHLDSGFVMIFLMGL
        NS+SKEI +SV Y  SA  +W++L  RF Q NGP I+QL+K +   +QG+M+V +YYT+LK +W +L  YRP+  CSCG +K+ ++H    ++  FLMGL
Subjt:  NSVSKEIVASVNYTGSAKAVWDELHGRFKQSNGPHIYQLRKELVTAAQGNMSVESYYTKLKTIWQDLCEYRPVLACSCGGIKSFIEHLDSGFVMIFLMGL

Query:  SDSFASVRAQILLMAPLPSITE
        +DSFA +R QILL+ PLPSI +
Subjt:  SDSFASVRAQILLMAPLPSITE

A0A6J1CXR2 uncharacterized protein LOC1110152391.2e-7261.26Show/hide
Query:  IADKD-ASNNDSSAIDDSSMIDAQLNPFHLHHSYSSLAVLVTQPLLGARNYNSWSKAMLIALLGKNKEGFIDGTIKKPEGRKSAAWKCNNDIITSWILNS
        +AD++  S+  +S+   ++ I++QLNP+ +HHS +   +LVTQ LLGA NYNSW ++MLIAL GKNK GFIDGTIKKP G   AAWKCNNDIITSWI+NS
Subjt:  IADKD-ASNNDSSAIDDSSMIDAQLNPFHLHHSYSSLAVLVTQPLLGARNYNSWSKAMLIALLGKNKEGFIDGTIKKPEGRKSAAWKCNNDIITSWILNS

Query:  VSKEIVASVNYTGSAKAVWDELHGRFKQSNGPHIYQLRKELVTAAQGNMSVESYYTKLKTIWQDLCEYRPVLACSCGGIKSFIEHLDSGFVMIFLMGLSD
        VSKEI AS+ YTGSAK +WDEL  RF+QS+ P I+QLRKELVT  QG +S+E+YYTKLKT+WQ+L +YRP + C+C G+KS  E   S +VM FLMGL++
Subjt:  VSKEIVASVNYTGSAKAVWDELHGRFKQSNGPHIYQLRKELVTAAQGNMSVESYYTKLKTIWQDLCEYRPVLACSCGGIKSFIEHLDSGFVMIFLMGLSD

Query:  SFASVRAQILLMAPLPSITEVF
        S+A +RAQILLM P+P + +VF
Subjt:  SFASVRAQILLMAPLPSITEVF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).1.8e-2335.16Show/hide
Query:  NYNSWSKAMLIALLGKNKEGFIDGTIKKPE--GRKSAAWKCNNDIITSWILNSVSKEIVASVNYTGSAKAVWDELHGRFKQSNGPHIYQLRKELVTAAQG
        NY +W       L    K GFIDGT+ KP+        W+  N ++  W++NS++ +++ SV Y  +A  +W++L   F       IYQLR+ L T  QG
Subjt:  NYNSWSKAMLIALLGKNKEGFIDGTIKKPE--GRKSAAWKCNNDIITSWILNSVSKEIVASVNYTGSAKAVWDELHGRFKQSNGPHIYQLRKELVTAAQG

Query:  NMSVESYYTKLKTIWQDLCEYRPVLACSCGG-----IKSFIEHLDSGFVMIFLMG--LSDSFASVRAQILLMAPLPSITEVF
          SVE Y+ KL  +W +L EY P+  C CGG      K   E  +      FLMG  L+  F +V  +I+   P PS+ E F
Subjt:  NMSVESYYTKLKTIWQDLCEYRPVLACSCGG-----IKSFIEHLDSGFVMIFLMG--LSDSFASVRAQILLMAPLPSITEVF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTGAAGCGGCGGCTCCTTGCAACAAGCGGCGGCTTCTTGCAACAAGCGGCGGCGGTGGCAAAGACGGCCGGTGACGACGCCGGTGGTGAGAGAGAGAGAAGAAGACG
ATCAATGGAAGTCGTGGAGAAGAAGAAAACGATCAAAAAGCGGCTATTATTCGATTCCATCGCGGACAAGGATGCTTCTAACAACGATTCTTCAGCGATCGATGATTCCT
CTATGATCGACGCTCAATTGAATCCCTTCCACCTTCATCATTCCTATAGTTCTTTGGCCGTATTGGTTACACAGCCCCTGCTTGGTGCAAGAAATTATAATTCCTGGAGC
AAAGCAATGTTGATCGCACTCTTAGGAAAGAACAAAGAAGGCTTCATTGACGGCACAATCAAGAAACCAGAGGGAAGAAAATCCGCTGCATGGAAATGCAACAATGATAT
CATAACCTCCTGGATTCTTAATTCTGTGTCGAAGGAAATTGTAGCCAGTGTCAACTATACAGGCTCCGCCAAGGCAGTGTGGGACGAACTTCACGGACGCTTCAAGCAAA
GCAACGGTCCACACATCTATCAGCTTCGCAAGGAATTAGTTACTGCTGCTCAAGGTAACATGTCCGTCGAAAGCTATTACACCAAATTGAAGACAATTTGGCAAGATTTG
TGTGAATATCGACCTGTTCTTGCATGTTCTTGTGGAGGTATCAAGTCGTTTATCGAGCACCTGGATTCTGGATTTGTTATGATATTTCTAATGGGATTGAGTGATTCTTT
CGCAAGTGTCCGTGCTCAGATTCTCCTGATGGCTCCATTACCGTCGATTACTGAGGTTTTTCTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGTTGAAGCGGCGGCTCCTTGCAACAAGCGGCGGCTTCTTGCAACAAGCGGCGGCGGTGGCAAAGACGGCCGGTGACGACGCCGGTGGTGAGAGAGAGAGAAGAAGACG
ATCAATGGAAGTCGTGGAGAAGAAGAAAACGATCAAAAAGCGGCTATTATTCGATTCCATCGCGGACAAGGATGCTTCTAACAACGATTCTTCAGCGATCGATGATTCCT
CTATGATCGACGCTCAATTGAATCCCTTCCACCTTCATCATTCCTATAGTTCTTTGGCCGTATTGGTTACACAGCCCCTGCTTGGTGCAAGAAATTATAATTCCTGGAGC
AAAGCAATGTTGATCGCACTCTTAGGAAAGAACAAAGAAGGCTTCATTGACGGCACAATCAAGAAACCAGAGGGAAGAAAATCCGCTGCATGGAAATGCAACAATGATAT
CATAACCTCCTGGATTCTTAATTCTGTGTCGAAGGAAATTGTAGCCAGTGTCAACTATACAGGCTCCGCCAAGGCAGTGTGGGACGAACTTCACGGACGCTTCAAGCAAA
GCAACGGTCCACACATCTATCAGCTTCGCAAGGAATTAGTTACTGCTGCTCAAGGTAACATGTCCGTCGAAAGCTATTACACCAAATTGAAGACAATTTGGCAAGATTTG
TGTGAATATCGACCTGTTCTTGCATGTTCTTGTGGAGGTATCAAGTCGTTTATCGAGCACCTGGATTCTGGATTTGTTATGATATTTCTAATGGGATTGAGTGATTCTTT
CGCAAGTGTCCGTGCTCAGATTCTCCTGATGGCTCCATTACCGTCGATTACTGAGGTTTTTCTTTGA
Protein sequenceShow/hide protein sequence
MLKRRLLATSGGFLQQAAAVAKTAGDDAGGERERRRRSMEVVEKKKTIKKRLLFDSIADKDASNNDSSAIDDSSMIDAQLNPFHLHHSYSSLAVLVTQPLLGARNYNSWS
KAMLIALLGKNKEGFIDGTIKKPEGRKSAAWKCNNDIITSWILNSVSKEIVASVNYTGSAKAVWDELHGRFKQSNGPHIYQLRKELVTAAQGNMSVESYYTKLKTIWQDL
CEYRPVLACSCGGIKSFIEHLDSGFVMIFLMGLSDSFASVRAQILLMAPLPSITEVFL