CuGenDBv2

Lag0008228 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0008228
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Integrase catalytic domain-containing protein
Genome location: chr9:15296828..15297885
RNA-Seq Expression: Lag0008228
Synteny: Lag0008228
Gene Ontology terms: NA
InterPro domains: NA
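
The genome location field packs chromosome, start, and end into one string. The following is a minimal sketch for splitting it and computing the span, assuming (this page does not confirm it) that the coordinates are 1-based and inclusive. `parse_location` is a hypothetical helper name, not part of any CuGenDB tooling.

```python
import re

def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location format: {location!r}")
    chrom, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    return chrom, start, end

def span_length(start, end):
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1

chrom, start, end = parse_location("chr9:15296828..15297885")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption the gene spans 1058 bp on chr9.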


Homology
GenBank top hits (e value, % identity, alignment)
EXB49850.1 hypothetical protein L484_000844 [Morus notabilis] (e value: 4.0e-19; % identity: 27.36)
Query:  RDFLNEKGF----SKRAGALSEFISRVIVQYKWQEFCAHLQETVVPLVREFYAGLREESISMAVGVQDQGTLE----------PKRERHDQKPFSQANEG
        R+ + EKGF    S   G    FIS VI+   WQ FC H  + +VPLV+EFYA L+ +  +     +   T            P ++    +  + A E 
Subjt:  RDFLNEKGF----SKRAGALSEFISRVIVQYKWQEFCAHLQETVVPLVREFYAGLREESISMAVGVQDQGTLE----------PKRERHDQKPFSQANEG

Query:  SIKACGQQGGSMKSQTKVK-----SLVPRDLKQESVVWLHFIKNRLMPTTHDNTISVDRVMLLYCLMKGLEINVGSIIRDEILACGRKRADKLFFGSLIT
         +K   +    + +Q  +      +    +L+  + VW HF+ +RL+ +TH  TIS +R +LLY ++ G  INVG +I D+I AC  K    L+F SLI+
Subjt:  SIKACGQQGGSMKSQTKVK-----SLVPRDLKQESVVWLHFIKNRLMPTTHDNTISVDRVMLLYCLMKGLEINVGSIIRDEILACGRKRADKLFFGSLIT

Query:  QLCQRVKIVPDKDEERHFFKSTIDLSLIGKLKQNNIQRKDKASTSQATPQ-SRPNVASPSQHTPFTGPSPASEALGM-----------VHHQLDRIRENL
        +LC +  +  +  E R      +DL  I ++     ++ +K    +   + SRP+    + HT     + + E L             +   L + +E L
Subjt:  QLCQRVKIVPDKDEERHFFKSTIDLSLIGKLKQNNIQRKDKASTSQATPQ-SRPNVASPSQHTPFTGPSPASEALGM-----------VHHQLDRIRENL

Query:  KTYWTYAKERDEAIREFY
          +W Y+++RD A+++ +
Subjt:  KTYWTYAKERDEAIREFY

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii] (e value: 3.9e-22; % identity: 31.6)
Query:  MKKRDFLNEKGF----SKRAGALSEFISRVIVQYKWQEFCAHLQETVVPLVREFYAGL-----------------REESISMAVG----VQDQGTLEPKR
        ++ R    EKGF    S+  G L  FI++VI Q+ W++FCAH ++ +VPLVREFYA L                  EE+I+   G    V +        
Subjt:  MKKRDFLNEKGF----SKRAGALSEFISRVIVQYKWQEFCAHLQETVVPLVREFYAGL-----------------REESISMAVG----VQDQGTLEPKR

Query:  ERHDQKPFSQANEGSIKACGQQGGSMKSQTKVKSLVPRDLKQESVVWLHFIKNRLMPTTHDNTISVDRVMLLYCLMKGLEINVGSIIRDEILACGRKRAD
          HD    +     ++        +  + T ++S     L   + VW HF+K+ L+PTTH  T+S DR++LL+ ++ G  INVG +I  EI AC  ++  
Subjt:  ERHDQKPFSQANEGSIKACGQQGGSMKSQTKVKSLVPRDLKQESVVWLHFIKNRLMPTTHDNTISVDRVMLLYCLMKGLEINVGSIIRDEILACGRKRAD

Query:  KLFFGSLITQLCQRVKIVPDKDEERHFFKSTIDLSLIGKLKQNNIQRKDKASTSQATPQSRPNVASPSQ
         LFF SLIT+LC+  +     +EE+      ID   + ++ Q     +    ++Q    SRP  AS S+
Subjt:  KLFFGSLITQLCQRVKIVPDKDEERHFFKSTIDLSLIGKLKQNNIQRKDKASTSQATPQSRPNVASPSQ

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii] (e value: 4.0e-27; % identity: 30.48)
Query:  MKKRDFLNEKGF----SKRAGALSEFISRVIVQYKWQEFCAHLQETVVPLVREFYAGLR--EESISMAVGVQDQ----------GTLEPKRERHD--QKP
        ++ R    EKGF    S+  G L  FI++VI Q+ W++FCAH ++ +VPLVREFYA L   EE+     GVQ            G  +P  E  +  Q  
Subjt:  MKKRDFLNEKGF----SKRAGALSEFISRVIVQYKWQEFCAHLQETVVPLVREFYAGLR--EESISMAVGVQDQ----------GTLEPKRERHD--QKP

Query:  FSQANEGSIKACGQQGGSMK-SQTKVKSLVPRDLKQESVVWLHFIKNRLMPTTHDNTISVDRVMLLYCLMKGLEINVGSIIRDEILACGRKRADKLFFGS
          Q     ++     G     S     + +   L   + VW HF+K+RL+PTTH  T+S DR++LL+ ++ G  INVG +I  EI AC  ++   LFF S
Subjt:  FSQANEGSIKACGQQGGSMK-SQTKVKSLVPRDLKQESVVWLHFIKNRLMPTTHDNTISVDRVMLLYCLMKGLEINVGSIIRDEILACGRKRADKLFFGS

Query:  LITQLCQRVKIVPDKDEERHFFKSTIDLSLIGKLKQNNIQRKDKASTSQATPQSRPNVASPS-------QHTPFTGPSPASEALGMVHHQ--LDRIRENL
        LIT+LC+  +     +EE+      ID   + ++ Q     +    ++Q    SRP  AS +       Q         + + +   H    L    +  
Subjt:  LITQLCQRVKIVPDKDEERHFFKSTIDLSLIGKLKQNNIQRKDKASTSQATPQSRPNVASPS-------QHTPFTGPSPASEALGMVHHQ--LDRIRENL

Query:  KTYWTYAKERDEAIREFYLSIAPSIAPVFQNFPQSLLP----QEEEDSDEE
        + +W Y+KERD A+++   +      P F  FPQ +L     + E +SD++
Subjt:  KTYWTYAKERDEAIREFYLSIAPSIAPVFQNFPQSLLP----QEEEDSDEE

PON59596.1 hypothetical protein PanWU01x14_158080 [Parasponia andersonii] (e value: 9.6e-21; % identity: 33.49)
Query:  VWLHFIKNRLMPTTHDNTISVDRVMLLYCLMKGLEINVGSIIRDEILACGRKRADKLFFGSLITQLCQRVKIVPDKDEERHFFKSTIDLSLIGKLKQNNI
        VW HF+K+RL+PTTH  T+S DR++LLY ++ G  INVG +I  EI AC  +++  LFF SLITQLC+  +     +EE+      ID   + ++ Q   
Subjt:  VWLHFIKNRLMPTTHDNTISVDRVMLLYCLMKGLEINVGSIIRDEILACGRKRADKLFFGSLITQLCQRVKIVPDKDEERHFFKSTIDLSLIGKLKQNNI

Query:  QRKDKASTSQATPQSRPNVASPS-------QHTPFTGPSPASEALGMVHHQ--LDRIRENLKTYWTYAKERDEAIREFYLSIAPSIAPVFQNFPQSLLP-
          + +   +Q    SRP VAS S       Q         + + +   H    L    +  + +W Y+KERD A+++   +      P F  FPQ LL  
Subjt:  QRKDKASTSQATPQSRPNVASPS-------QHTPFTGPSPASEALGMVHHQ--LDRIRENLKTYWTYAKERDEAIREFYLSIAPSIAPVFQNFPQSLLP-

Query:  ---QEEEDSDEE
           + E +SD++
Subjt:  ---QEEEDSDEE

PON70375.1 hypothetical protein PanWU01x14_080440 [Parasponia andersonii] (e value: 4.0e-19; % identity: 33.84)
Query:  FISRVIVQYKWQEFCAHLQETVVPLVREFYAGLRE--------ESISMAVGVQDQGTL----EPKRERHD-----QKPFSQANEGSIKACGQQG--GSMK
        FI+ VI+Q+ WQ FCAH ++ +VPLVREFY  +            + + + V+   T+    +P  E  +      KP       ++   G +    S  
Subjt:  FISRVIVQYKWQEFCAHLQETVVPLVREFYAGLRE--------ESISMAVGVQDQGTL----EPKRERHD-----QKPFSQANEGSIKACGQQG--GSMK

Query:  SQTKVKSLVPRDLKQESVVWLHFIKNRLMPTTHDNTISVDRVMLLYCLMKGLEINVGSIIRDEILACGRKRADKLFFGSLITQLCQRVKIVPDKDEER
        + T ++S     L   + VW HF+K+RL+PTTH  T+S + V LLY ++ G  INVG +I  EI AC  +++  LFF SLIT +C+  +     +EE+
Subjt:  SQTKVKSLVPRDLKQESVVWLHFIKNRLMPTTHDNTISVDRVMLLYCLMKGLEINVGSIIRDEILACGRKRADKLFFGSLITQLCQRVKIVPDKDEER

TrEMBL top hits (e value, % identity, alignment)
A0A2P5AGA5 Uncharacterized protein (Fragment) (e value: 1.9e-22; % identity: 31.6)
Query:  MKKRDFLNEKGF----SKRAGALSEFISRVIVQYKWQEFCAHLQETVVPLVREFYAGL-----------------REESISMAVG----VQDQGTLEPKR
        ++ R    EKGF    S+  G L  FI++VI Q+ W++FCAH ++ +VPLVREFYA L                  EE+I+   G    V +        
Subjt:  MKKRDFLNEKGF----SKRAGALSEFISRVIVQYKWQEFCAHLQETVVPLVREFYAGL-----------------REESISMAVG----VQDQGTLEPKR

Query:  ERHDQKPFSQANEGSIKACGQQGGSMKSQTKVKSLVPRDLKQESVVWLHFIKNRLMPTTHDNTISVDRVMLLYCLMKGLEINVGSIIRDEILACGRKRAD
          HD    +     ++        +  + T ++S     L   + VW HF+K+ L+PTTH  T+S DR++LL+ ++ G  INVG +I  EI AC  ++  
Subjt:  ERHDQKPFSQANEGSIKACGQQGGSMKSQTKVKSLVPRDLKQESVVWLHFIKNRLMPTTHDNTISVDRVMLLYCLMKGLEINVGSIIRDEILACGRKRAD

Query:  KLFFGSLITQLCQRVKIVPDKDEERHFFKSTIDLSLIGKLKQNNIQRKDKASTSQATPQSRPNVASPSQ
         LFF SLIT+LC+  +     +EE+      ID   + ++ Q     +    ++Q    SRP  AS S+
Subjt:  KLFFGSLITQLCQRVKIVPDKDEERHFFKSTIDLSLIGKLKQNNIQRKDKASTSQATPQSRPNVASPSQ

A0A2P5BCG4 Uncharacterized protein (Fragment) (e value: 1.9e-27; % identity: 30.48)
Query:  MKKRDFLNEKGF----SKRAGALSEFISRVIVQYKWQEFCAHLQETVVPLVREFYAGLR--EESISMAVGVQDQ----------GTLEPKRERHD--QKP
        ++ R    EKGF    S+  G L  FI++VI Q+ W++FCAH ++ +VPLVREFYA L   EE+     GVQ            G  +P  E  +  Q  
Subjt:  MKKRDFLNEKGF----SKRAGALSEFISRVIVQYKWQEFCAHLQETVVPLVREFYAGLR--EESISMAVGVQDQ----------GTLEPKRERHD--QKP

Query:  FSQANEGSIKACGQQGGSMK-SQTKVKSLVPRDLKQESVVWLHFIKNRLMPTTHDNTISVDRVMLLYCLMKGLEINVGSIIRDEILACGRKRADKLFFGS
          Q     ++     G     S     + +   L   + VW HF+K+RL+PTTH  T+S DR++LL+ ++ G  INVG +I  EI AC  ++   LFF S
Subjt:  FSQANEGSIKACGQQGGSMK-SQTKVKSLVPRDLKQESVVWLHFIKNRLMPTTHDNTISVDRVMLLYCLMKGLEINVGSIIRDEILACGRKRADKLFFGS

Query:  LITQLCQRVKIVPDKDEERHFFKSTIDLSLIGKLKQNNIQRKDKASTSQATPQSRPNVASPS-------QHTPFTGPSPASEALGMVHHQ--LDRIRENL
        LIT+LC+  +     +EE+      ID   + ++ Q     +    ++Q    SRP  AS +       Q         + + +   H    L    +  
Subjt:  LITQLCQRVKIVPDKDEERHFFKSTIDLSLIGKLKQNNIQRKDKASTSQATPQSRPNVASPS-------QHTPFTGPSPASEALGMVHHQ--LDRIRENL

Query:  KTYWTYAKERDEAIREFYLSIAPSIAPVFQNFPQSLLP----QEEEDSDEE
        + +W Y+KERD A+++   +      P F  FPQ +L     + E +SD++
Subjt:  KTYWTYAKERDEAIREFYLSIAPSIAPVFQNFPQSLLP----QEEEDSDEE

A0A2P5CEY2 Uncharacterized protein (e value: 4.6e-21; % identity: 33.49)
Query:  VWLHFIKNRLMPTTHDNTISVDRVMLLYCLMKGLEINVGSIIRDEILACGRKRADKLFFGSLITQLCQRVKIVPDKDEERHFFKSTIDLSLIGKLKQNNI
        VW HF+K+RL+PTTH  T+S DR++LLY ++ G  INVG +I  EI AC  +++  LFF SLITQLC+  +     +EE+      ID   + ++ Q   
Subjt:  VWLHFIKNRLMPTTHDNTISVDRVMLLYCLMKGLEINVGSIIRDEILACGRKRADKLFFGSLITQLCQRVKIVPDKDEERHFFKSTIDLSLIGKLKQNNI

Query:  QRKDKASTSQATPQSRPNVASPS-------QHTPFTGPSPASEALGMVHHQ--LDRIRENLKTYWTYAKERDEAIREFYLSIAPSIAPVFQNFPQSLLP-
          + +   +Q    SRP VAS S       Q         + + +   H    L    +  + +W Y+KERD A+++   +      P F  FPQ LL  
Subjt:  QRKDKASTSQATPQSRPNVASPS-------QHTPFTGPSPASEALGMVHHQ--LDRIRENLKTYWTYAKERDEAIREFYLSIAPSIAPVFQNFPQSLLP-

Query:  ---QEEEDSDEE
           + E +SD++
Subjt:  ---QEEEDSDEE

A0A2P5DAQ2 Uncharacterized protein (e value: 1.9e-19; % identity: 33.84)
Query:  FISRVIVQYKWQEFCAHLQETVVPLVREFYAGLRE--------ESISMAVGVQDQGTL----EPKRERHD-----QKPFSQANEGSIKACGQQG--GSMK
        FI+ VI+Q+ WQ FCAH ++ +VPLVREFY  +            + + + V+   T+    +P  E  +      KP       ++   G +    S  
Subjt:  FISRVIVQYKWQEFCAHLQETVVPLVREFYAGLRE--------ESISMAVGVQDQGTL----EPKRERHD-----QKPFSQANEGSIKACGQQG--GSMK

Query:  SQTKVKSLVPRDLKQESVVWLHFIKNRLMPTTHDNTISVDRVMLLYCLMKGLEINVGSIIRDEILACGRKRADKLFFGSLITQLCQRVKIVPDKDEER
        + T ++S     L   + VW HF+K+RL+PTTH  T+S + V LLY ++ G  INVG +I  EI AC  +++  LFF SLIT +C+  +     +EE+
Subjt:  SQTKVKSLVPRDLKQESVVWLHFIKNRLMPTTHDNTISVDRVMLLYCLMKGLEINVGSIIRDEILACGRKRADKLFFGSLITQLCQRVKIVPDKDEER

W9RBS1 Uncharacterized protein (e value: 1.9e-19; % identity: 27.36)
Query:  RDFLNEKGF----SKRAGALSEFISRVIVQYKWQEFCAHLQETVVPLVREFYAGLREESISMAVGVQDQGTLE----------PKRERHDQKPFSQANEG
        R+ + EKGF    S   G    FIS VI+   WQ FC H  + +VPLV+EFYA L+ +  +     +   T            P ++    +  + A E 
Subjt:  RDFLNEKGF----SKRAGALSEFISRVIVQYKWQEFCAHLQETVVPLVREFYAGLREESISMAVGVQDQGTLE----------PKRERHDQKPFSQANEG

Query:  SIKACGQQGGSMKSQTKVK-----SLVPRDLKQESVVWLHFIKNRLMPTTHDNTISVDRVMLLYCLMKGLEINVGSIIRDEILACGRKRADKLFFGSLIT
         +K   +    + +Q  +      +    +L+  + VW HF+ +RL+ +TH  TIS +R +LLY ++ G  INVG +I D+I AC  K    L+F SLI+
Subjt:  SIKACGQQGGSMKSQTKVK-----SLVPRDLKQESVVWLHFIKNRLMPTTHDNTISVDRVMLLYCLMKGLEINVGSIIRDEILACGRKRADKLFFGSLIT

Query:  QLCQRVKIVPDKDEERHFFKSTIDLSLIGKLKQNNIQRKDKASTSQATPQ-SRPNVASPSQHTPFTGPSPASEALGM-----------VHHQLDRIRENL
        +LC +  +  +  E R      +DL  I ++     ++ +K    +   + SRP+    + HT     + + E L             +   L + +E L
Subjt:  QLCQRVKIVPDKDEERHFFKSTIDLSLIGKLKQNNIQRKDKASTSQATPQ-SRPNVASPSQHTPFTGPSPASEALGM-----------VHHQLDRIRENL

Query:  KTYWTYAKERDEAIREFY
          +W Y+++RD A+++ +
Subjt:  KTYWTYAKERDEAIREFY

SwissProt top hits
No hits found

Arabidopsis top hits
No hits found

Sequences
CDS sequence
ATGAAGAAAAGAGATTTCCTCAATGAGAAGGGATTCTCTAAACGAGCAGGAGCACTGTCAGAGTTCATAAGCAGAGTTATCGTCCAGTACAAATGGCAGGAGTTCTGTGC
TCACCTTCAGGAGACTGTTGTGCCTTTAGTTCGTGAATTCTACGCCGGCTTGAGGGAGGAAAGTATTAGCATGGCGGTGGGTGTACAGGATCAAGGCACCCTTGAACCCA
AGAGGGAACGACATGATCAGAAACCCTTCAGCCAAGCAAATGAAGGAAGCATTAAAGCTTGTGGCCAACAAGGGGGTTCAATGAAATCACAGACGAAAGTGAAGTCTTTA
GTGCCAAGGGACTTAAAGCAAGAATCGGTAGTGTGGCTTCACTTTATCAAGAACCGTTTGATGCCAACCACCCATGACAACACAATTTCAGTGGATAGAGTGATGCTACT
CTATTGCCTTATGAAGGGGTTGGAGATCAATGTGGGGAGCATTATTAGGGATGAAATCTTAGCATGTGGACGGAAAAGGGCAGACAAGCTTTTCTTTGGCTCACTCATCA
CCCAACTCTGTCAGAGGGTGAAGATTGTGCCAGACAAGGACGAAGAGCGCCATTTCTTTAAATCAACCATTGACTTGTCCTTGATAGGAAAACTTAAACAGAACAACATC
CAGAGGAAGGATAAAGCCTCTACATCACAGGCCACACCTCAATCAAGGCCGAATGTAGCCTCTCCATCCCAACACACTCCTTTTACAGGGCCTTCACCAGCATCAGAAGC
ACTAGGCATGGTCCACCACCAGCTTGATCGAATCAGGGAGAACCTGAAGACTTACTGGACATATGCAAAGGAGCGGGATGAAGCTATTAGAGAGTTCTATCTCTCCATTG
CCCCTAGTATTGCTCCGGTCTTTCAAAATTTCCCTCAGTCGCTGCTGCCACAAGAAGAAGAGGATTCTGATGAAGAGCAAATAATGATGAAGATGATAAAGATGATGAAG
AGAGAGAGAGAGTTCCTCAGATGA
mRNA sequence
ATGAAGAAAAGAGATTTCCTCAATGAGAAGGGATTCTCTAAACGAGCAGGAGCACTGTCAGAGTTCATAAGCAGAGTTATCGTCCAGTACAAATGGCAGGAGTTCTGTGC
TCACCTTCAGGAGACTGTTGTGCCTTTAGTTCGTGAATTCTACGCCGGCTTGAGGGAGGAAAGTATTAGCATGGCGGTGGGTGTACAGGATCAAGGCACCCTTGAACCCA
AGAGGGAACGACATGATCAGAAACCCTTCAGCCAAGCAAATGAAGGAAGCATTAAAGCTTGTGGCCAACAAGGGGGTTCAATGAAATCACAGACGAAAGTGAAGTCTTTA
GTGCCAAGGGACTTAAAGCAAGAATCGGTAGTGTGGCTTCACTTTATCAAGAACCGTTTGATGCCAACCACCCATGACAACACAATTTCAGTGGATAGAGTGATGCTACT
CTATTGCCTTATGAAGGGGTTGGAGATCAATGTGGGGAGCATTATTAGGGATGAAATCTTAGCATGTGGACGGAAAAGGGCAGACAAGCTTTTCTTTGGCTCACTCATCA
CCCAACTCTGTCAGAGGGTGAAGATTGTGCCAGACAAGGACGAAGAGCGCCATTTCTTTAAATCAACCATTGACTTGTCCTTGATAGGAAAACTTAAACAGAACAACATC
CAGAGGAAGGATAAAGCCTCTACATCACAGGCCACACCTCAATCAAGGCCGAATGTAGCCTCTCCATCCCAACACACTCCTTTTACAGGGCCTTCACCAGCATCAGAAGC
ACTAGGCATGGTCCACCACCAGCTTGATCGAATCAGGGAGAACCTGAAGACTTACTGGACATATGCAAAGGAGCGGGATGAAGCTATTAGAGAGTTCTATCTCTCCATTG
CCCCTAGTATTGCTCCGGTCTTTCAAAATTTCCCTCAGTCGCTGCTGCCACAAGAAGAAGAGGATTCTGATGAAGAGCAAATAATGATGAAGATGATAAAGATGATGAAG
AGAGAGAGAGAGTTCCTCAGATGA
Protein sequence
MKKRDFLNEKGFSKRAGALSEFISRVIVQYKWQEFCAHLQETVVPLVREFYAGLREESISMAVGVQDQGTLEPKRERHDQKPFSQANEGSIKACGQQGGSMKSQTKVKSL
VPRDLKQESVVWLHFIKNRLMPTTHDNTISVDRVMLLYCLMKGLEINVGSIIRDEILACGRKRADKLFFGSLITQLCQRVKIVPDKDEERHFFKSTIDLSLIGKLKQNNI
QRKDKASTSQATPQSRPNVASPSQHTPFTGPSPASEALGMVHHQLDRIRENLKTYWTYAKERDEAIREFYLSIAPSIAPVFQNFPQSLLPQEEEDSDEEQIMMKMIKMMK
REREFLR
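
The protein sequence above can be cross-checked against the CDS by translating it codon by codon. The following is a minimal sketch that assumes the standard nuclear genetic code applies to this gene. `translate` and `CODON_TABLE` are illustrative names, and only the first and last codons of the CDS are used here as sample input.

```python
# Standard genetic code, built in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((a, b, c) for a in BASES for b in BASES for c in BASES), AMINO_ACIDS
    )
}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "").replace(" ", "")
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unknown codon
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)

# First seven codons and last eight codons of the CDS listed above.
print(translate("ATGAAGAAAAGAGATTTCCTC"))
print(translate("AGAGAGAGAGAGTTCCTCAGATGA"))
```

Joining the CDS lines into one string and passing it to `translate` should give the full protein for comparison with the listed sequence.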