CuGenDBv2

Lag0017636 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0017636
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Integrase catalytic domain-containing protein
Genome location: chr5:6237945..6239495
RNA-Seq Expression: Lag0017636
Synteny: Lag0017636
Gene Ontology terms: NA
InterPro domains: NA
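
The genome location above follows a `chromosome:start..end` pattern. As a minimal sketch, the snippet below splits such a string into its parts and computes the span, assuming the coordinates are 1-based and inclusive (the record does not state the convention). The helper names `parse_location` and `span_length` are hypothetical and not part of any CuGenDB tooling.

```python
import re
from typing import Tuple


def parse_location(location: str) -> Tuple[str, int, int]:
    """Split a 'chromosome:start..end' string into (chromosome, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start must not exceed end")
    return chrom, start, end


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr5:6237945..6239495")
print(chrom, start, end, span_length(start, end))
```

Under that assumption the gene spans 1551 bp, which is longer than the 1041 nt CDS listed under Sequences.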


Homology
GenBank top hits | e value | % identity
CAN71391.1 hypothetical protein VITISV_038145 [Vitis vinifera] | 9.2e-27 | 31.43
Query:  NGVMKFDGKNFGYWKMQVKDYLTCRKVHKALKER---PKGMTDEDWEALDEEAVASIRMCLLMDVASLVAHETTAVKL---MEALTNRQSSNKESTVGSA
        +G++K    N+  WK ++KD L C+++ + ++ R   P    +++W+ L+ + V  IR  +   V   VA E  A  L   +E+L  R+++  ++ +   
Subjt:  NGVMKFDGKNFGYWKMQVKDYLTCRKVHKALKER---PKGMTDEDWEALDEEAVASIRMCLLMDVASLVAHETTAVKL---MEALTNRQSSNKESTVGSA

Query:  LVITKGKDKVDEENEPSSSRKKWKNRNE------------------------CSN---HSSDWILDSAASVHIASDRSLFTSFTGGHHGLVRMGNGRTSK
        LV  K KD   +       R+ +K + E                        C N     S W++D+A S HI + R  F+S+T G  G V MGN    +
Subjt:  LVITKGKDKVDEENEPSSSRKKWKNRNE------------------------CSN---HSSDWILDSAASVHIASDRSLFTSFTGGHHGLVRMGNGRTSK

Query:  TSGIGDVSLKTECGDKLVLRDVRFMPNIKMNLISTGKLEDDGYMCEFGSRRCKLKLGSQVVAVGHRKSTLYKCQLDVAKG
          G+GDV L+T  G KLVL+DVR +P ++ +LIS GKL+D+GY    G  + K+  GS V+  G + +TLYK ++ + KG
Subjt:  TSGIGDVSLKTECGDKLVLRDVRFMPNIKMNLISTGKLEDDGYMCEFGSRRCKLKLGSQVVAVGHRKSTLYKCQLDVAKG

KAF7129225.1 hypothetical protein RHSIM_Rhsim10G0050800 [Rhododendron simsii] | 1.7e-25 | 53.39
Query:  WILDSAASVHIASDRSLFTSFTGGHHGLVRMGNGRTSKTSGIGDVSLKTECGDKLVLRDVRFMPNIKMNLISTGKLEDDGYMCEFGSRRCKLKLGSQVVA
        W++DS AS H+ S R  F S+T G  G VRMGN   SK  G+GDV L+T  G KL+L+DVR +PNI++NLIS GKL+D+GY  +FG  + KL  GS VVA
Subjt:  WILDSAASVHIASDRSLFTSFTGGHHGLVRMGNGRTSKTSGIGDVSLKTECGDKLVLRDVRFMPNIKMNLISTGKLEDDGYMCEFGSRRCKLKLGSQVVA

Query:  VGHRKSTLYKCQLDVAKG
         G + STLY  Q  ++KG
Subjt:  VGHRKSTLYKCQLDVAKG

KAF7129546.1 hypothetical protein RHSIM_Rhsim10G0154200 [Rhododendron simsii] | 5.0e-25 | 51.69
Query:  WILDSAASVHIASDRSLFTSFTGGHHGLVRMGNGRTSKTSGIGDVSLKTECGDKLVLRDVRFMPNIKMNLISTGKLEDDGYMCEFGSRRCKLKLGSQVVA
        W++DS AS H+ S R  F S+T G  G VRMGN   SK  G+GDV L+T  G KL+L+DVR +P+I++NLIS GKL+D+GY  +FG  + KL  GS +VA
Subjt:  WILDSAASVHIASDRSLFTSFTGGHHGLVRMGNGRTSKTSGIGDVSLKTECGDKLVLRDVRFMPNIKMNLISTGKLEDDGYMCEFGSRRCKLKLGSQVVA

Query:  VGHRKSTLYKCQLDVAKG
         G + STLY  Q+ ++KG
Subjt:  VGHRKSTLYKCQLDVAKG

KAG8364690.1 hypothetical protein BUALT_Bualt18G0024700 [Buddleja alternifolia] | 2.9e-25 | 47.58
Query:  SNHSSDWILDSAASVHIASDRSLFTSFTGGHHGLVRMGNGRTSKTSGIGDVSLKTECGDKLVLRDVRFMPNIKMNLISTGKLEDDGYMCEFGSRRCKLKL
        S   +DWI+DS AS HI   R +FTS+T G+ G VRM N   ++  G+G+++L+T+ G +L+LRDVR +PNI++N+ISTGKL+DDGY+  FG  + KL  
Subjt:  SNHSSDWILDSAASVHIASDRSLFTSFTGGHHGLVRMGNGRTSKTSGIGDVSLKTECGDKLVLRDVRFMPNIKMNLISTGKLEDDGYMCEFGSRRCKLKL

Query:  GSQVVAVGHRKSTLYKCQLDVAKG
        GS + A G +K++LY  Q  ++ G
Subjt:  GSQVVAVGHRKSTLYKCQLDVAKG

KAG8367017.1 hypothetical protein BUALT_Bualt16G0028600 [Buddleja alternifolia] | 5.0e-25 | 46.77
Query:  SNHSSDWILDSAASVHIASDRSLFTSFTGGHHGLVRMGNGRTSKTSGIGDVSLKTECGDKLVLRDVRFMPNIKMNLISTGKLEDDGYMCEFGSRRCKLKL
        S   +DWI+DS AS HI   R +FTS+T G+ G VRM N   ++  G+G+++L+T+ G +L+LRDVR++PNI++N+ISTGKL+DDGY+  FG  + KL  
Subjt:  SNHSSDWILDSAASVHIASDRSLFTSFTGGHHGLVRMGNGRTSKTSGIGDVSLKTECGDKLVLRDVRFMPNIKMNLISTGKLEDDGYMCEFGSRRCKLKL

Query:  GSQVVAVGHRKSTLYKCQLDVAKG
        GS + A   +K++LY  Q  ++ G
Subjt:  GSQVVAVGHRKSTLYKCQLDVAKG

TrEMBL top hits | e value | % identity
A0A2I0KAA2 gag_pre-integrs domain-containing protein | 3.2e-25 | 33.33
Query:  VMKFDGKNFGYWKMQVKDYLTCRKVHKALKERPKGMTDEDWEALDEEAVASIRMCLLMDVASLVAHE-TTAVKLMEALTNRQSSNKESTVGSALVITKGK
        + KFDG ++G+W+MQ++DYL  + + + L+++  GMT+ +W+ LD +A++ IR+ L  +VA   A E TT  ++ +AL    SS  ES            
Subjt:  VMKFDGKNFGYWKMQVKDYLTCRKVHKALKERPKGMTDEDWEALDEEAVASIRMCLLMDVASLVAHE-TTAVKLMEALTNRQSSNKESTVGSALVITKGK

Query:  DKVDEENEPSSSRKKWKNRNECSNHSSDWILDSAASVHIASDRSLFTSFTGGHHGLVRMGNGRTSKTSGIGDVSLKTECGDKLVLRDVRFMPNIKMNLIS
                                    W+LDS AS H   +  +  ++T G+ G V + +    + +G GDV +KT  G    LR VR +P +K NLIS
Subjt:  DKVDEENEPSSSRKKWKNRNECSNHSSDWILDSAASVHIASDRSLFTSFTGGHHGLVRMGNGRTSKTSGIGDVSLKTECGDKLVLRDVRFMPNIKMNLIS

Query:  TGKLEDDGYMCEFGSRRCKLKLGSQVVAVGHRKSTLY
         G L+D+GY   FGS   K+  G+ VVA G +  TLY
Subjt:  TGKLEDDGYMCEFGSRRCKLKLGSQVVAVGHRKSTLY

A0A2N9GCZ3 Uncharacterized protein | 1.1e-25 | 32.98
Query:  NGVMKFDGKNFGYWKMQVKDYLTCRKVH-KALKERPKGMTDEDWEALD-----------------EEAVASIRMCLLMDVASLVAHETTAV----KLMEA
        +G+ KFDG NFGYWK+Q++DYL  +K+H   L ++P  M D +W  LD                 E+ +A+ ++ L+  + +L   E  AV         
Subjt:  NGVMKFDGKNFGYWKMQVKDYLTCRKVH-KALKERPKGMTDEDWEALD-----------------EEAVASIRMCLLMDVASLVAHETTAV----KLMEA

Query:  LTNRQSSNK---ESTVGSALVIT---KGKDK-----------VDEENEPSSSRKKWKNRNECSNHSSDWILDSAASVHIASDRSLFTSFTGGHHGLVRMG
        +TN+ SS +   +  + + +V+    K KD            V++E     S +K KN +   N+  +W++DSA + H+   + LFT++     G V MG
Subjt:  LTNRQSSNK---ESTVGSALVIT---KGKDK-----------VDEENEPSSSRKKWKNRNECSNHSSDWILDSAASVHIASDRSLFTSFTGGHHGLVRMG

Query:  NGRTSKTSGIGDVSLKTECGDKLVLRDVRFMPNIKMNLISTGKLEDDGYMCEFGSRRCKLKLGSQVVAVGHRKSTLYKCQLDVAK
        N   SK  GIGDV +KT  G  ++L++VR +PN+  NLIST  ++  GY    G+ R KL  G  VVA G     LYK ++   K
Subjt:  NGRTSKTSGIGDVSLKTECGDKLVLRDVRFMPNIKMNLISTGKLEDDGYMCEFGSRRCKLKLGSQVVAVGHRKSTLYKCQLDVAK

A0A5N5N166 Uncharacterized protein | 7.1e-25 | 30.5
Query:  GVMKFDGKNFGYWKMQVKDYLTCRKVH-KALKERPKGMTDEDWEALDEEAVASIRMCLLMDVASLVAHETTAVKLMEALT--------------------
        G+ K DG++FGYWKMQ++DYL  +K+H   L ++P+ M DE+W +LD + +  IR+ L   VA  V  E T   LM  L+                    
Subjt:  GVMKFDGKNFGYWKMQVKDYLTCRKVH-KALKERPKGMTDEDWEALDEEAVASIRMCLLMDVASLVAHETTAVKLMEALT--------------------

Query:  -------------NRQSSNKESTVGSAL-VITKGK-----DKVDEENEPSSSRKKWKNRNEC-----SNHSSDWILDSAASVHIASDRSLFTSFTGGHHG
                      R+ + + ST GS L V T+G       K+ +E++  S+    +  +E      S+    W+LDS AS H    +++  ++  G +G
Subjt:  -------------NRQSSNKESTVGSAL-VITKGK-----DKVDEENEPSSSRKKWKNRNEC-----SNHSSDWILDSAASVHIASDRSLFTSFTGGHHG

Query:  LVRMGNGRTSKTSGIGDVSLKTECGDKLVLRDVRFMPNIKMNLISTGKLEDDGYMCEFGSRRCKLKLGSQVVAVGHRKSTLY
        +V + +G   +  GIGDV +K        L+ VR +P +K NLIS G+L++ G+   F     K+  G  ++A G +  TLY
Subjt:  LVRMGNGRTSKTSGIGDVSLKTECGDKLVLRDVRFMPNIKMNLISTGKLEDDGYMCEFGSRRCKLKLGSQVVAVGHRKSTLY

A0A803LL22 Uncharacterized protein | 5.8e-27 | 32.3
Query:  VMKFDGKNFGYWKMQVKDYLTCRKVHKALKERPKGMTDEDWEALDEEAVASIRMCLLMDVASLVAHETTAVKLMEALTN---------------------
        + KFDGK+FG+WKMQ++DYL  + ++  L  +P GM +E+W+ LD +A+  IR+ L   VA  V   TT V +++AL+N                     
Subjt:  VMKFDGKNFGYWKMQVKDYLTCRKVHKALKERPKGMTDEDWEALDEEAVASIRMCLLMDVASLVAHETTAVKLMEALTN---------------------

Query:  ----RQSSNKESTVGSALVITKGKDKVDE---ENEPSSSRKK----WKNRNECSNH----SSD----------------WILDSAASVHIASDRSLFTSF
            R+ ++ ES+ G+     +G+ +      + E  + +KK     KN ++ S+     SSD                WILDS AS H +  + +F +F
Subjt:  ----RQSSNKESTVGSALVITKGKDKVDE---ENEPSSSRKK----WKNRNECSNH----SSD----------------WILDSAASVHIASDRSLFTSF

Query:  TGGHHGLVRMGNGRTSKTSGIGDVSLKTECGDKLVLRDVRFMPNIKMNLISTGKLEDDGYMCEFGSRRCKLKLGSQVVAVGHRKST--LYK
          G  G V + +    +  G GDV +KT  G    L DVR++P ++ NLIS G+L+  GY   FG    K+  G+ VVA G +  T  LYK
Subjt:  TGGHHGLVRMGNGRTSKTSGIGDVSLKTECGDKLVLRDVRFMPNIKMNLISTGKLEDDGYMCEFGSRRCKLKLGSQVVAVGHRKST--LYK

A5AJI7 Integrase catalytic domain-containing protein | 4.4e-27 | 31.43
Query:  NGVMKFDGKNFGYWKMQVKDYLTCRKVHKALKER---PKGMTDEDWEALDEEAVASIRMCLLMDVASLVAHETTAVKL---MEALTNRQSSNKESTVGSA
        +G++K    N+  WK ++KD L C+++ + ++ R   P    +++W+ L+ + V  IR  +   V   VA E  A  L   +E+L  R+++  ++ +   
Subjt:  NGVMKFDGKNFGYWKMQVKDYLTCRKVHKALKER---PKGMTDEDWEALDEEAVASIRMCLLMDVASLVAHETTAVKL---MEALTNRQSSNKESTVGSA

Query:  LVITKGKDKVDEENEPSSSRKKWKNRNE------------------------CSN---HSSDWILDSAASVHIASDRSLFTSFTGGHHGLVRMGNGRTSK
        LV  K KD   +       R+ +K + E                        C N     S W++D+A S HI + R  F+S+T G  G V MGN    +
Subjt:  LVITKGKDKVDEENEPSSSRKKWKNRNE------------------------CSN---HSSDWILDSAASVHIASDRSLFTSFTGGHHGLVRMGNGRTSK

Query:  TSGIGDVSLKTECGDKLVLRDVRFMPNIKMNLISTGKLEDDGYMCEFGSRRCKLKLGSQVVAVGHRKSTLYKCQLDVAKG
          G+GDV L+T  G KLVL+DVR +P ++ +LIS GKL+D+GY    G  + K+  GS V+  G + +TLYK ++ + KG
Subjt:  TSGIGDVSLKTECGDKLVLRDVRFMPNIKMNLISTGKLEDDGYMCEFGSRRCKLKLGSQVVAVGHRKSTLYKCQLDVAKG

SwissProt top hits | e value | % identity
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 1.0e-20 | 37.65
Query:  KGKDKVD-EENEPSSSRKKWKNRN---------EC---SNHSSDWILDSAASVHIASDRSLFTSFTGGHHGLVRMGNGRTSKTSGIGDVSLKTECGDKLV
        KGK +   ++N+ +++     N N         EC   S   S+W++D+AAS H    R LF  +  G  G V+MGN   SK +GIGD+ +KT  G  LV
Subjt:  KGKDKVD-EENEPSSSRKKWKNRN---------EC---SNHSSDWILDSAASVHIASDRSLFTSFTGGHHGLVRMGNGRTSKTSGIGDVSLKTECGDKLV

Query:  LRDVRFMPNIKMNLISTGKLEDDGYMCEFGSRRCKLKLGSQVVAVGHRKSTLYKCQLDVAKG
        L+DVR +P+++MNLIS   L+ DGY   F +++ +L  GS V+A G  + TLY+   ++ +G
Subjt:  LRDVRFMPNIKMNLISTGKLEDDGYMCEFGSRRCKLKLGSQVVAVGHRKSTLYKCQLDVAKG

P25601 Putative transposon Ty5-1 protein YCL075W | 3.9e-04 | 36
Query:  SSDWILDSAASVHIASDRSLFTSFTGGHHGLVRMGNGRTSKTSGIGDVSLKTECGDKLVLRDVRFMPNIKMNLIS
        SS+WI D+  + H+  DRS+F+SFT         G G +    G G V++ T     + L DV ++P++ +NLIS
Subjt:  SSDWILDSAASVHIASDRSLFTSFTGGHHGLVRMGNGRTSKTSGIGDVSLKTECGDKLVLRDVRFMPNIKMNLIS

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | 6.6e-04 | 35.09
Query:  EPSSSRKKWKNRNECS----NHSSDWILDSAASVHIASD---RSLFTSFTGGHHGLVRMGNGRTSKTSGIGDVSLKTECGDKLVLRDVRFMPNIKMNLIS
        +P S    W+ R   +      S++W+LDS A+ HI SD    SL   +TGG    V + +G T   S  G  SL T+    L L ++ ++PNI  NLIS
Subjt:  EPSSSRKKWKNRNECS----NHSSDWILDSAASVHIASD---RSLFTSFTGGHHGLVRMGNGRTSKTSGIGDVSLKTECGDKLVLRDVRFMPNIKMNLIS

Query:  TGKL-EDDGYMCEF
          +L   +G   EF
Subjt:  TGKL-EDDGYMCEF

Arabidopsis top hits | e value | % identity
AT3G29785.1 unknown protein | 1.2e-05 | 29.87
Query:  KFDGKNFGYWKMQVKDYLTCRKVHKALKERPKGMTDEDWEALDEEAVASIRMCLLMDVASLVAHETTAVKLMEALTN
        K DG ++ + +M+++DYL  +K+H+ L ++ + M+ +DW  L  + +  IR+ +  ++A  VA E +   LM+ L++
Subjt:  KFDGKNFGYWKMQVKDYLTCRKVHKALKERPKGMTDEDWEALDEEAVASIRMCLLMDVASLVAHETTAVKLMEALTN


Sequences
CDS sequence
ATGGGTTTTGTAGAGCCAAGAAATTTCAATGGAGTCATGAAGTTCGACGGAAAAAATTTTGGATATTGGAAGATGCAAGTCAAAGATTATTTAACTTGCAGGAAAGTGCA
TAAGGCATTGAAGGAGAGACCGAAAGGGATGACAGACGAAGATTGGGAAGCTCTGGATGAAGAGGCAGTTGCAAGCATAAGGATGTGTTTATTAATGGATGTGGCAAGTC
TAGTGGCCCATGAGACAACTGCGGTTAAATTGATGGAAGCACTTACAAACAGGCAGAGTAGTAATAAAGAGTCTACAGTAGGGTCAGCTTTGGTTATAACTAAGGGTAAG
GATAAGGTTGATGAAGAAAATGAACCGAGTAGCAGTAGGAAAAAGTGGAAAAATAGGAATGAGTGTAGTAACCACTCATCAGATTGGATATTAGATAGTGCAGCGTCTGT
ACATATCGCTTCAGATAGGAGTTTGTTCACATCATTCACAGGAGGACATCATGGTCTAGTGAGGATGGGGAATGGTAGAACCTCCAAAACTAGTGGGATTGGAGATGTTA
GTCTGAAGACAGAGTGTGGAGATAAATTAGTATTGCGAGATGTCAGGTTTATGCCTAATATCAAGATGAATCTTATTTCCACTGGCAAGTTGGAAGATGATGGTTACATG
TGTGAGTTTGGTAGTCGCCGGTGTAAACTCAAGTTAGGATCCCAGGTAGTGGCAGTTGGTCACAGGAAATCTACATTGTACAAATGTCAGTTGGATGTTGCCAAAGGATC
AAAGAGACTATGGATGCCGGTTACAGCTGCAGATGATAGTTGTAGAGAAGAGAAAGTTGATGGCTATCGTGAATCCCCAGTTGTCAGGCACTCGAATGAATTGATGAAGT
CGCTTAGGCGAGTTGAGACATCAAAGTGGAAGGCCAAAGCAGTTGCTAAGGTCAAAGGTCAGGTCTCTAGCTCGGTAACAGGTTTGAATAGAGGATTCAAGCCATTCTTC
TTTGGAAACAGTCGTTCAAGTTGGAAGAAGATAACATGTGTCACCACTTAG
mRNA sequence
ATGGGTTTTGTAGAGCCAAGAAATTTCAATGGAGTCATGAAGTTCGACGGAAAAAATTTTGGATATTGGAAGATGCAAGTCAAAGATTATTTAACTTGCAGGAAAGTGCA
TAAGGCATTGAAGGAGAGACCGAAAGGGATGACAGACGAAGATTGGGAAGCTCTGGATGAAGAGGCAGTTGCAAGCATAAGGATGTGTTTATTAATGGATGTGGCAAGTC
TAGTGGCCCATGAGACAACTGCGGTTAAATTGATGGAAGCACTTACAAACAGGCAGAGTAGTAATAAAGAGTCTACAGTAGGGTCAGCTTTGGTTATAACTAAGGGTAAG
GATAAGGTTGATGAAGAAAATGAACCGAGTAGCAGTAGGAAAAAGTGGAAAAATAGGAATGAGTGTAGTAACCACTCATCAGATTGGATATTAGATAGTGCAGCGTCTGT
ACATATCGCTTCAGATAGGAGTTTGTTCACATCATTCACAGGAGGACATCATGGTCTAGTGAGGATGGGGAATGGTAGAACCTCCAAAACTAGTGGGATTGGAGATGTTA
GTCTGAAGACAGAGTGTGGAGATAAATTAGTATTGCGAGATGTCAGGTTTATGCCTAATATCAAGATGAATCTTATTTCCACTGGCAAGTTGGAAGATGATGGTTACATG
TGTGAGTTTGGTAGTCGCCGGTGTAAACTCAAGTTAGGATCCCAGGTAGTGGCAGTTGGTCACAGGAAATCTACATTGTACAAATGTCAGTTGGATGTTGCCAAAGGATC
AAAGAGACTATGGATGCCGGTTACAGCTGCAGATGATAGTTGTAGAGAAGAGAAAGTTGATGGCTATCGTGAATCCCCAGTTGTCAGGCACTCGAATGAATTGATGAAGT
CGCTTAGGCGAGTTGAGACATCAAAGTGGAAGGCCAAAGCAGTTGCTAAGGTCAAAGGTCAGGTCTCTAGCTCGGTAACAGGTTTGAATAGAGGATTCAAGCCATTCTTC
TTTGGAAACAGTCGTTCAAGTTGGAAGAAGATAACATGTGTCACCACTTAG
Protein sequence
MGFVEPRNFNGVMKFDGKNFGYWKMQVKDYLTCRKVHKALKERPKGMTDEDWEALDEEAVASIRMCLLMDVASLVAHETTAVKLMEALTNRQSSNKESTVGSALVITKGK
DKVDEENEPSSSRKKWKNRNECSNHSSDWILDSAASVHIASDRSLFTSFTGGHHGLVRMGNGRTSKTSGIGDVSLKTECGDKLVLRDVRFMPNIKMNLISTGKLEDDGYM
CEFGSRRCKLKLGSQVVAVGHRKSTLYKCQLDVAKGSKRLWMPVTAADDSCREEKVDGYRESPVVRHSNELMKSLRRVETSKWKAKAVAKVKGQVSSSVTGLNRGFKPFF
FGNSRSSWKKITCVTT
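
The protein sequence above appears to be the direct translation of the CDS under the standard genetic code: the first ten codons (ATG GGT TTT GTA GAG CCA AGA AAT TTC AAT) give MGFVEPRNFN, and the final CDS line ends in the stop codon TAG. A minimal sketch of that translation follows, assuming the standard nuclear code (NCBI translation table 1) applies. The `translate` helper is hypothetical and written only for illustration; it is not how the database itself derives the protein.

```python
# Standard genetic code (NCBI translation table 1), with codons ordered
# by first, second and third base in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate a coding sequence in frame 1.

    Whitespace and line breaks are ignored, so the wrapped sequence can be
    pasted as-is. Codons containing anything other than A/C/G/T become 'X'.
    With to_stop=True, translation ends before the first stop codon.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for pos in range(0, len(cds), 3):
        amino_acid = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if amino_acid == "*" and to_stop:
            break
        protein.append(amino_acid)
    return "".join(protein)


# Last line of the CDS listing; it starts on a codon boundary.
print(translate("TTTGGAAACAGTCGTTCAAGTTGGAAGAAGATAACATGTGTCACCACTTAG"))
```

Run on the last CDS line, this prints FGNSRSSWKKITCVTT, which matches the final line of the protein sequence.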