; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0034978 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0034978
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionIntegrase core domain containing protein
Genome locationchr3:13194686..13196267
RNA-Seq ExpressionLag0034978
SyntenyLag0034978
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0036141.1 retrotransposon protein, putative, Ty1-copia subclass [Cucumis melo var. makuwa]5.0e-3137.84Show/hide
Query:  INPLYESWVVVDQLLLGWLYNSMTPEVATQVMGHDNAKNLWAAVQELFGVQSRAEEDYLRQVFQQIRKGSSKMTDYLRVKKTHADNLRQAGSLVQGDKSK
        +N  ++ WV  D LLLGW+YNSMT EVA Q+MG + AK+LW A+Q+LFGVQSR EED+LR  FQ  RKG+SKM DYLR+ KT+ +NL Q       +K  
Subjt:  INPLYESWVVVDQLLLGWLYNSMTPEVATQVMGHDNAKNLWAAVQELFGVQSRAEEDYLRQVFQQIRKGSSKMTDYLRVKKTHADNLRQAGSLVQGDKSK

Query:  INRLMEVGLASQIVKIREEEMEARIVEEEGGKAIITRSVRGSNQNVRSDGSNMGSFIGGNSTLHVFAAGQNSNLFIANPKTVVDPN-----------CKD
        I+ L               +M++ ++       I  + +   N N +S G    +    N  L  F    NSN F+  P+TV+D N             D
Subjt:  INRLMEVGLASQIVKIREEEMEARIVEEEGGKAIITRSVRGSNQNVRSDGSNMGSFIGGNSTLHVFAAGQNSNLFIANPKTVVDPN-----------CKD

Query:  LVSVSK-------------LAQDNDVYLEFHANSYLVKDTHTDKVLLKGVLKDCLYKLK
          ++S               AQDN+VYLEFH +   V +  T + +++GVLKD LY L+
Subjt:  LVSVSK-------------LAQDNDVYLEFHANSYLVKDTHTDKVLLKGVLKDCLYKLK

KAA0060208.1 Integrase, catalytic core [Cucumis melo var. makuwa]1.0e-3135.45Show/hide
Query:  VAIGTDVAGASSSTPAMETTINPLYESWVVVDQLLLGWLYNSMTPEVATQVMGHDNAKNLWAAVQELFGVQSRAEEDYLRQVFQQIRKGSSKMTDYLRVK
        +A  TD  GASS +   + T+NP ++ WV  D LLLGW+YNSMT EVA Q+MG + AK+L  A+Q+LFGVQSR EED+LR  FQ  RKG+SKM DYLR+ 
Subjt:  VAIGTDVAGASSSTPAMETTINPLYESWVVVDQLLLGWLYNSMTPEVATQVMGHDNAKNLWAAVQELFGVQSRAEEDYLRQVFQQIRKGSSKMTDYLRVK

Query:  KTHADNLRQAGS-----------------------------LVQGD-----KSKIN-----------RLMEVGLA-------SQIVKIREEEM--EARIV
        KT+A+NL QAGS                             +VQ       ++++N           +++ V +        + +V   EEE+   A+ V
Subjt:  KTHADNLRQAGS-----------------------------LVQGD-----KSKIN-----------RLMEVGLA-------SQIVKIREEEM--EARIV

Query:  EEEGGKAIITRSVRGSNQNVRSDGSNMGS----------FIGGNSTLHVFAAGQN------SNLFIANPKTVVDPNCKDLVSVSKLAQDNDVYLEFHANS
        E      +I     G+  +V +D SN+ +           +G  + L +   G +      S++ + N   V D   K+L+SVSKL QDN+V LEF+ + 
Subjt:  EEEGGKAIITRSVRGSNQNVRSDGSNMGS----------FIGGNSTLHVFAAGQN------SNLFIANPKTVVDPNCKDLVSVSKLAQDNDVYLEFHANS

Query:  YLVKDTHTDKVLLKGVLKDCLYKLKINGAV
          VKD  T + +++GVL+D LY L + G +
Subjt:  YLVKDTHTDKVLLKGVLKDCLYKLKINGAV

XP_022148963.1 uncharacterized protein LOC111017501 [Momordica charantia]2.5e-3858.68Show/hide
Query:  TDVAGASSSTPAMETTINPLYESWVVVDQLLLGWLYNSMTPEVATQVMGHDNAKNLWAAVQELFGVQSRAEEDYLRQVFQQIRKGSSKMTDYLRVKKTHA
        T+++  SSS+ A E  INPLYESWV  DQLLLGWLYNSMTPEVATQVMG++NA +LWAA+QELFGVQS+AEEDYLRQVFQQ RKGS KMTD+LRV K+HA
Subjt:  TDVAGASSSTPAMETTINPLYESWVVVDQLLLGWLYNSMTPEVATQVMGHDNAKNLWAAVQELFGVQSRAEEDYLRQVFQQIRKGSSKMTDYLRVKKTHA

Query:  DNLRQAGSLVQGDKSKINRLMEVGLASQIVKIREEEMEARIVEEEGGKAII-------TRSVRGSNQ
        DNL QAGS V              L SQ++   +EE    +   +G + I         RSV G NQ
Subjt:  DNLRQAGSLVQGDKSKINRLMEVGLASQIVKIREEEMEARIVEEEGGKAII-------TRSVRGSNQ

XP_038905161.1 uncharacterized protein LOC120091275 isoform X1 [Benincasa hispida]1.6e-2965.14Show/hide
Query:  GTDVAGASSSTPAMETTINPLYESWVVVDQLLLGWLYNSMTPEVATQVMGHDNAKNLWAAVQELFGVQSRAEEDYLRQVFQQIRKGSSKMTDYLRVKKTH
        G+  +GASSS  A+E  +NP YESW+ VDQLLLGWLYNSMTPEVA QVMG + AK+LW ++ +LFGVQSR EEDYLR VFQ  RKG+ KM +YL+  K +
Subjt:  GTDVAGASSSTPAMETTINPLYESWVVVDQLLLGWLYNSMTPEVATQVMGHDNAKNLWAAVQELFGVQSRAEEDYLRQVFQQIRKGSSKMTDYLRVKKTH

Query:  ADNLRQAGS
         DNL QAGS
Subjt:  ADNLRQAGS

XP_038905164.1 uncharacterized protein LOC120091275 isoform X4 [Benincasa hispida]1.6e-2965.14Show/hide
Query:  GTDVAGASSSTPAMETTINPLYESWVVVDQLLLGWLYNSMTPEVATQVMGHDNAKNLWAAVQELFGVQSRAEEDYLRQVFQQIRKGSSKMTDYLRVKKTH
        G+  +GASSS  A+E  +NP YESW+ VDQLLLGWLYNSMTPEVA QVMG + AK+LW ++ +LFGVQSR EEDYLR VFQ  RKG+ KM +YL+  K +
Subjt:  GTDVAGASSSTPAMETTINPLYESWVVVDQLLLGWLYNSMTPEVATQVMGHDNAKNLWAAVQELFGVQSRAEEDYLRQVFQQIRKGSSKMTDYLRVKKTH

Query:  ADNLRQAGS
         DNL QAGS
Subjt:  ADNLRQAGS

TrEMBL top hitse value%identityAlignment
A0A5A7SIT7 Uncharacterized protein2.4e-2659.68Show/hide
Query:  ANASNTTVA-IGTDVA-GASSS-TPAMETTINPLYESWVVVDQLLLGWLYNSMTPEVATQVMGHDNAKNLWAAVQELFGVQSRAEEDYLRQVFQQIRKGS
        A++SNTTV   G D   GASSS TP +   +N L+E WV  D LLLGWLYNSMTP+VA Q+MG  N ++LW A Q+ FGVQSRAEED+LRQ+ Q  RKG+
Subjt:  ANASNTTVA-IGTDVA-GASSS-TPAMETTINPLYESWVVVDQLLLGWLYNSMTPEVATQVMGHDNAKNLWAAVQELFGVQSRAEEDYLRQVFQQIRKGS

Query:  SKMTDYLRVKKTHADNLRQAGSLV
        +KM +YL V KT+ DNL Q GS V
Subjt:  SKMTDYLRVKKTHADNLRQAGSLV

A0A5A7UY76 Integrase, catalytic core4.9e-3235.45Show/hide
Query:  VAIGTDVAGASSSTPAMETTINPLYESWVVVDQLLLGWLYNSMTPEVATQVMGHDNAKNLWAAVQELFGVQSRAEEDYLRQVFQQIRKGSSKMTDYLRVK
        +A  TD  GASS +   + T+NP ++ WV  D LLLGW+YNSMT EVA Q+MG + AK+L  A+Q+LFGVQSR EED+LR  FQ  RKG+SKM DYLR+ 
Subjt:  VAIGTDVAGASSSTPAMETTINPLYESWVVVDQLLLGWLYNSMTPEVATQVMGHDNAKNLWAAVQELFGVQSRAEEDYLRQVFQQIRKGSSKMTDYLRVK

Query:  KTHADNLRQAGS-----------------------------LVQGD-----KSKIN-----------RLMEVGLA-------SQIVKIREEEM--EARIV
        KT+A+NL QAGS                             +VQ       ++++N           +++ V +        + +V   EEE+   A+ V
Subjt:  KTHADNLRQAGS-----------------------------LVQGD-----KSKIN-----------RLMEVGLA-------SQIVKIREEEM--EARIV

Query:  EEEGGKAIITRSVRGSNQNVRSDGSNMGS----------FIGGNSTLHVFAAGQN------SNLFIANPKTVVDPNCKDLVSVSKLAQDNDVYLEFHANS
        E      +I     G+  +V +D SN+ +           +G  + L +   G +      S++ + N   V D   K+L+SVSKL QDN+V LEF+ + 
Subjt:  EEEGGKAIITRSVRGSNQNVRSDGSNMGS----------FIGGNSTLHVFAAGQN------SNLFIANPKTVVDPNCKDLVSVSKLAQDNDVYLEFHANS

Query:  YLVKDTHTDKVLLKGVLKDCLYKLKINGAV
          VKD  T + +++GVL+D LY L + G +
Subjt:  YLVKDTHTDKVLLKGVLKDCLYKLKINGAV

A0A5D3CPY2 Retrotransposon protein, putative, Ty1-copia subclass2.4e-3137.84Show/hide
Query:  INPLYESWVVVDQLLLGWLYNSMTPEVATQVMGHDNAKNLWAAVQELFGVQSRAEEDYLRQVFQQIRKGSSKMTDYLRVKKTHADNLRQAGSLVQGDKSK
        +N  ++ WV  D LLLGW+YNSMT EVA Q+MG + AK+LW A+Q+LFGVQSR EED+LR  FQ  RKG+SKM DYLR+ KT+ +NL Q       +K  
Subjt:  INPLYESWVVVDQLLLGWLYNSMTPEVATQVMGHDNAKNLWAAVQELFGVQSRAEEDYLRQVFQQIRKGSSKMTDYLRVKKTHADNLRQAGSLVQGDKSK

Query:  INRLMEVGLASQIVKIREEEMEARIVEEEGGKAIITRSVRGSNQNVRSDGSNMGSFIGGNSTLHVFAAGQNSNLFIANPKTVVDPN-----------CKD
        I+ L               +M++ ++       I  + +   N N +S G    +    N  L  F    NSN F+  P+TV+D N             D
Subjt:  INRLMEVGLASQIVKIREEEMEARIVEEEGGKAIITRSVRGSNQNVRSDGSNMGSFIGGNSTLHVFAAGQNSNLFIANPKTVVDPN-----------CKD

Query:  LVSVSK-------------LAQDNDVYLEFHANSYLVKDTHTDKVLLKGVLKDCLYKLK
          ++S               AQDN+VYLEFH +   V +  T + +++GVLKD LY L+
Subjt:  LVSVSK-------------LAQDNDVYLEFHANSYLVKDTHTDKVLLKGVLKDCLYKLK

A0A6J1D5J0 uncharacterized protein LOC1110175011.2e-3858.68Show/hide
Query:  TDVAGASSSTPAMETTINPLYESWVVVDQLLLGWLYNSMTPEVATQVMGHDNAKNLWAAVQELFGVQSRAEEDYLRQVFQQIRKGSSKMTDYLRVKKTHA
        T+++  SSS+ A E  INPLYESWV  DQLLLGWLYNSMTPEVATQVMG++NA +LWAA+QELFGVQS+AEEDYLRQVFQQ RKGS KMTD+LRV K+HA
Subjt:  TDVAGASSSTPAMETTINPLYESWVVVDQLLLGWLYNSMTPEVATQVMGHDNAKNLWAAVQELFGVQSRAEEDYLRQVFQQIRKGSSKMTDYLRVKKTHA

Query:  DNLRQAGSLVQGDKSKINRLMEVGLASQIVKIREEEMEARIVEEEGGKAII-------TRSVRGSNQ
        DNL QAGS V              L SQ++   +EE    +   +G + I         RSV G NQ
Subjt:  DNLRQAGSLVQGDKSKINRLMEVGLASQIVKIREEEMEARIVEEEGGKAII-------TRSVRGSNQ

A0A6J1DCW4 uncharacterized protein LOC1110195987.3e-2851.37Show/hide
Query:  TDVAGASSSTPAMETTINPLYESWVVVDQLLLGWLYNSMTPEVATQVMGHDNAKNLWAAVQELFGVQSRAEEDYLRQVFQQIRKGSSKMTDYLRVKKTHA
        T++ G++SS  +   T+NP YE+W+VVD+LLLGWLYNSM  +VA QVMG   ++ LW AVQELFGVQSRAE DYL+QVFQQ  KGS +M +YL++ K+HA
Subjt:  TDVAGASSSTPAMETTINPLYESWVVVDQLLLGWLYNSMTPEVATQVMGHDNAKNLWAAVQELFGVQSRAEEDYLRQVFQQIRKGSSKMTDYLRVKKTHA

Query:  DNLRQAGSLVQGDKSKINRLMEVGLASQIVKIREEEMEARIVEEEG
        DNL  AGS V              L SQ++   +EE    +V  +G
Subjt:  DNLRQAGSLVQGDKSKINRLMEVGLASQIVKIREEEMEARIVEEEG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCAACGCCTCAAATACTACTGTCGCCATTGGAACCGATGTTGCTGGAGCGTCGAGTTCAACACCTGCCATGGAAACCACGATAAACCCACTATATGAATCATGGGT
CGTTGTTGATCAGCTTCTTCTTGGCTGGTTGTACAACTCCATGACCCCAGAGGTTGCAACGCAAGTGATGGGGCATGACAATGCCAAGAATTTGTGGGCAGCCGTTCAAG
AACTCTTTGGGGTTCAGTCACGGGCAGAAGAAGATTACCTTCGGCAAGTCTTTCAACAAATCCGAAAAGGGTCCTCCAAGATGACAGACTATTTGAGAGTCAAGAAGACT
CATGCGGACAACCTTAGACAAGCTGGAAGTCTGGTTCAAGGGGACAAAAGCAAAATCAACCGTTTAATGGAAGTCGGTTTGGCTTCCCAAATAGTCAAAATCAGAGAGGA
GGAAATGGAGGCCAGAATCGTGGAAGAGGAAGGTGGCAAGGCAATAATCACTCGATCTGTCAGAGGTTCTAATCAGAATGTGAGAAGTGATGGTAGTAATATGGGTTCAT
TTATTGGAGGAAATTCAACTCTTCATGTGTTTGCAGCTGGTCAAAATTCCAATCTGTTTATAGCAAATCCAAAAACTGTGGTAGATCCAAACTGCAAAGATTTAGTGAGT
GTCTCAAAACTTGCTCAGGACAATGATGTTTATCTTGAATTTCATGCAAATTCTTATCTTGTAAAGGACACTCATACGGACAAGGTGTTGCTGAAGGGGGTTCTTAAAGA
TTGCCTTTACAAACTTAAGATTAATGGAGCAGTTACTGGTAGTGCTTCGAGATGTGTAGAAAAGTTCGGAGTTGGCAAATAA
mRNA sequenceShow/hide mRNA sequence
ATGGCCAACGCCTCAAATACTACTGTCGCCATTGGAACCGATGTTGCTGGAGCGTCGAGTTCAACACCTGCCATGGAAACCACGATAAACCCACTATATGAATCATGGGT
CGTTGTTGATCAGCTTCTTCTTGGCTGGTTGTACAACTCCATGACCCCAGAGGTTGCAACGCAAGTGATGGGGCATGACAATGCCAAGAATTTGTGGGCAGCCGTTCAAG
AACTCTTTGGGGTTCAGTCACGGGCAGAAGAAGATTACCTTCGGCAAGTCTTTCAACAAATCCGAAAAGGGTCCTCCAAGATGACAGACTATTTGAGAGTCAAGAAGACT
CATGCGGACAACCTTAGACAAGCTGGAAGTCTGGTTCAAGGGGACAAAAGCAAAATCAACCGTTTAATGGAAGTCGGTTTGGCTTCCCAAATAGTCAAAATCAGAGAGGA
GGAAATGGAGGCCAGAATCGTGGAAGAGGAAGGTGGCAAGGCAATAATCACTCGATCTGTCAGAGGTTCTAATCAGAATGTGAGAAGTGATGGTAGTAATATGGGTTCAT
TTATTGGAGGAAATTCAACTCTTCATGTGTTTGCAGCTGGTCAAAATTCCAATCTGTTTATAGCAAATCCAAAAACTGTGGTAGATCCAAACTGCAAAGATTTAGTGAGT
GTCTCAAAACTTGCTCAGGACAATGATGTTTATCTTGAATTTCATGCAAATTCTTATCTTGTAAAGGACACTCATACGGACAAGGTGTTGCTGAAGGGGGTTCTTAAAGA
TTGCCTTTACAAACTTAAGATTAATGGAGCAGTTACTGGTAGTGCTTCGAGATGTGTAGAAAAGTTCGGAGTTGGCAAATAA
Protein sequenceShow/hide protein sequence
MANASNTTVAIGTDVAGASSSTPAMETTINPLYESWVVVDQLLLGWLYNSMTPEVATQVMGHDNAKNLWAAVQELFGVQSRAEEDYLRQVFQQIRKGSSKMTDYLRVKKT
HADNLRQAGSLVQGDKSKINRLMEVGLASQIVKIREEEMEARIVEEEGGKAIITRSVRGSNQNVRSDGSNMGSFIGGNSTLHVFAAGQNSNLFIANPKTVVDPNCKDLVS
VSKLAQDNDVYLEFHANSYLVKDTHTDKVLLKGVLKDCLYKLKINGAVTGSASRCVEKFGVGK