; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0018201 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0018201
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionIntegrase catalytic domain-containing protein
Genome locationchr5:18768937..18771405
RNA-Seq ExpressionLag0018201
SyntenyLag0018201
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8694437.1 hypothetical protein F3Y22_tig00110783pilonHSYRG00231 [Hibiscus syriacus]1.2e-0527.47Show/hide
Query:  MPTTHDNTISVDRVMLLYCLMKGLEINLGSIIRDDILACGRKRADKFFFASLITQLYQRRKDK-----------ASTSQATPP--TGSNVASPSQH---T
        +PT+H+ T+S  R++LL+ ++ G  I++G II +    C +++A    F +LIT L Q++K +           A  ++A  P   G   A   +H   T
Subjt:  MPTTHDNTISVDRVMLLYCLMKGLEINLGSIIRDDILACGRKRADKFFFASLITQLYQRRKDK-----------ASTSQATPP--TGSNVASPSQH---T

Query:  SFTGPLPASEALG-----MVHR---QLDQIRENLKTYWTYAKEQDEAIREFYLSIAPSMALVFPNFPQPLLPQEEEDSDEEE----DEEKEEKESSLDEE
        S   P P S A        VHR    + Q+ E L  Y+ YAK +D  +    +   P      P FP  L+P  E+     E    D    +  SS D  
Subjt:  SFTGPLPASEALG-----MVHR---QLDQIRENLKTYWTYAKEQDEAIREFYLSIAPSMALVFPNFPQPLLPQEEEDSDEEE----DEEKEEKESSLDEE

Query:  YGRAFMTPM----VCHCMKGLLREKNRGEAGIP
               P+    +      + RE NR  A +P
Subjt:  YGRAFMTPM----VCHCMKGLLREKNRGEAGIP

KAF4375842.1 hypothetical protein G4B88_026421 [Cannabis sativa]1.5e-0830.53Show/hide
Query:  MPTTHDNTISVDRVMLLYCLMKGLEINLGSIIRDDILACGRKRADKFFFASLITQLYQRRK----------DKASTSQATPPTGSNVASPSQHTSFTGPL
        +PT+HD+T+S +R+ +LYC++KG +IN+G +I  +I  C  +   K FF+ LIT+  +              +       P       +PS  T+ T   
Subjt:  MPTTHDNTISVDRVMLLYCLMKGLEINLGSIIRDDILACGRKRADKFFFASLITQLYQRRK----------DKASTSQATPPTGSNVASPSQHTSFTGPL

Query:  PASEALGMVHRQLDQIRENLKTYWTYAKEQD
        P  E L        Q+ E L+T+W Y +E+D
Subjt:  PASEALGMVHRQLDQIRENLKTYWTYAKEQD

PON59596.1 hypothetical protein PanWU01x14_158080 [Parasponia andersonii]1.6e-0726.77Show/hide
Query:  MPTTHDNTISVDRVMLLYCLMKGLEINLGSIIRDDILACGRKRADKFFFASLITQLYQRRK----------------DKASTSQATPPTGSNVASPSQHT
        +PTTH  T+S DR++LLY ++ G  IN+G +I  +I AC  +++   FF SLITQL +  +                D  + ++ T   G    +    T
Subjt:  MPTTHDNTISVDRVMLLYCLMKGLEINLGSIIRDDILACGRKRADKFFFASLITQLYQRRK----------------DKASTSQATPPTGSNVASPSQHT

Query:  SFTGPLPASEALGMVHRQLDQIRENL---------------------KTYWTYAKEQDEAIREFYLSIAPSMALVFPNFPQPLLPQEEEDSDEEEDEE
        S      +S A G + +QL  + + L                     + +W Y+KE+D A+++   +        FP FPQ LL   + + + E D++
Subjt:  SFTGPLPASEALGMVHRQLDQIRENL---------------------KTYWTYAKEQDEAIREFYLSIAPSMALVFPNFPQPLLPQEEEDSDEEEDEE

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii]2.8e-0726.09Show/hide
Query:  MPTTHDNTISVDRVMLLYCLMKGLEINLGSIIRDDILACGRKRADKFFFASLITQLYQ-----------------------RRKDKASTSQATPPTGSNV
        +PTTH   +S DR++LL+ ++ G  IN+G +I  +I AC  ++    FF SLIT+L +                       R   +  T     P+ S  
Subjt:  MPTTHDNTISVDRVMLLYCLMKGLEINLGSIIRDDILACGRKRADKFFFASLITQLYQ-----------------------RRKDKASTSQATPPTGSNV

Query:  ASPSQHTSFTGPLPASEALGMVHRQLDQIRENLKTYWTYAKEQDEAIREFYLSIAPSMALVFPNFPQPLLPQEEEDSDEEEDEE
        A+ S   +    L   +AL     Q +   +  + +W Y+KE+D A+++   +        FP FPQ +L   + + + E D++
Subjt:  ASPSQHTSFTGPLPASEALGMVHRQLDQIRENLKTYWTYAKEQDEAIREFYLSIAPSMALVFPNFPQPLLPQEEEDSDEEEDEE

XP_024971944.1 uncharacterized protein LOC112510826 [Cynara cardunculus var. scolymus]1.8e-0649.09Show/hide
Query:  PTTHDNTISVDRVMLLYCLMKGLEINLGSIIRDDILACGRKRADKFFFASLITQL
        PTTHD++ISV++++LLYC++ G  IN+G ++   IL C ++R  K FF SLI +L
Subjt:  PTTHDNTISVDRVMLLYCLMKGLEINLGSIIRDDILACGRKRADKFFFASLITQL

TrEMBL top hitse value%identityAlignment
A0A2P5BCG4 Uncharacterized protein (Fragment)7.4e-0625.38Show/hide
Query:  MPTTHDNTISVDRVMLLYCLMKGLEINLGSIIRDDILACGRKRADKFFFASLITQLYQ-------------------------RRKDKASTSQATPPTGS
        +PTTH  T+S DR++LL+ ++ G  IN+G +I  +I AC  ++    FF SLIT+L +                         R   +  T     P+ S
Subjt:  MPTTHDNTISVDRVMLLYCLMKGLEINLGSIIRDDILACGRKRADKFFFASLITQLYQ-------------------------RRKDKASTSQATPPTGS

Query:  NVASPSQHTSFTGPLPASEALGMVHRQ-----------LDQIRENLKTYWTYAKEQDEAIREFYLSIAPSMALVFPNFPQPLLPQEEEDSDEEEDEE
          A+ S + +    L   +AL     Q           L    +  + +W Y+KE+D A+++   +        FP FPQ +L   + + + E D++
Subjt:  NVASPSQHTSFTGPLPASEALGMVHRQ-----------LDQIRENLKTYWTYAKEQDEAIREFYLSIAPSMALVFPNFPQPLLPQEEEDSDEEEDEE

A0A2P5CEY2 Uncharacterized protein7.9e-0826.77Show/hide
Query:  MPTTHDNTISVDRVMLLYCLMKGLEINLGSIIRDDILACGRKRADKFFFASLITQLYQRRK----------------DKASTSQATPPTGSNVASPSQHT
        +PTTH  T+S DR++LLY ++ G  IN+G +I  +I AC  +++   FF SLITQL +  +                D  + ++ T   G    +    T
Subjt:  MPTTHDNTISVDRVMLLYCLMKGLEINLGSIIRDDILACGRKRADKFFFASLITQLYQRRK----------------DKASTSQATPPTGSNVASPSQHT

Query:  SFTGPLPASEALGMVHRQLDQIRENL---------------------KTYWTYAKEQDEAIREFYLSIAPSMALVFPNFPQPLLPQEEEDSDEEEDEE
        S      +S A G + +QL  + + L                     + +W Y+KE+D A+++   +        FP FPQ LL   + + + E D++
Subjt:  SFTGPLPASEALGMVHRQLDQIRENL---------------------KTYWTYAKEQDEAIREFYLSIAPSMALVFPNFPQPLLPQEEEDSDEEEDEE

A0A2P5DXM3 Uncharacterized protein1.4e-0726.09Show/hide
Query:  MPTTHDNTISVDRVMLLYCLMKGLEINLGSIIRDDILACGRKRADKFFFASLITQLYQ-----------------------RRKDKASTSQATPPTGSNV
        +PTTH   +S DR++LL+ ++ G  IN+G +I  +I AC  ++    FF SLIT+L +                       R   +  T     P+ S  
Subjt:  MPTTHDNTISVDRVMLLYCLMKGLEINLGSIIRDDILACGRKRADKFFFASLITQLYQ-----------------------RRKDKASTSQATPPTGSNV

Query:  ASPSQHTSFTGPLPASEALGMVHRQLDQIRENLKTYWTYAKEQDEAIREFYLSIAPSMALVFPNFPQPLLPQEEEDSDEEEDEE
        A+ S   +    L   +AL     Q +   +  + +W Y+KE+D A+++   +        FP FPQ +L   + + + E D++
Subjt:  ASPSQHTSFTGPLPASEALGMVHRQLDQIRENLKTYWTYAKEQDEAIREFYLSIAPSMALVFPNFPQPLLPQEEEDSDEEEDEE

A0A6A2ZSK6 Integrase catalytic domain-containing protein5.7e-0627.47Show/hide
Query:  MPTTHDNTISVDRVMLLYCLMKGLEINLGSIIRDDILACGRKRADKFFFASLITQLYQRRKDK-----------ASTSQATPP--TGSNVASPSQH---T
        +PT+H+ T+S  R++LL+ ++ G  I++G II +    C +++A    F +LIT L Q++K +           A  ++A  P   G   A   +H   T
Subjt:  MPTTHDNTISVDRVMLLYCLMKGLEINLGSIIRDDILACGRKRADKFFFASLITQLYQRRKDK-----------ASTSQATPP--TGSNVASPSQH---T

Query:  SFTGPLPASEALG-----MVHR---QLDQIRENLKTYWTYAKEQDEAIREFYLSIAPSMALVFPNFPQPLLPQEEEDSDEEE----DEEKEEKESSLDEE
        S   P P S A        VHR    + Q+ E L  Y+ YAK +D  +    +   P      P FP  L+P  E+     E    D    +  SS D  
Subjt:  SFTGPLPASEALG-----MVHR---QLDQIRENLKTYWTYAKEQDEAIREFYLSIAPSMALVFPNFPQPLLPQEEEDSDEEE----DEEKEEKESSLDEE

Query:  YGRAFMTPM----VCHCMKGLLREKNRGEAGIP
               P+    +      + RE NR  A +P
Subjt:  YGRAFMTPM----VCHCMKGLLREKNRGEAGIP

A0A7J6FZ22 Uncharacterized protein7.2e-0930.53Show/hide
Query:  MPTTHDNTISVDRVMLLYCLMKGLEINLGSIIRDDILACGRKRADKFFFASLITQLYQRRK----------DKASTSQATPPTGSNVASPSQHTSFTGPL
        +PT+HD+T+S +R+ +LYC++KG +IN+G +I  +I  C  +   K FF+ LIT+  +              +       P       +PS  T+ T   
Subjt:  MPTTHDNTISVDRVMLLYCLMKGLEINLGSIIRDDILACGRKRADKFFFASLITQLYQRRK----------DKASTSQATPPTGSNVASPSQHTSFTGPL

Query:  PASEALGMVHRQLDQIRENLKTYWTYAKEQD
        P  E L        Q+ E L+T+W Y +E+D
Subjt:  PASEALGMVHRQLDQIRENLKTYWTYAKEQD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCAACCACCCATGACAACACAATTTCAGTGGATAGAGTTATGCTACTCTATTGCCTTATGAAGGGGTTGGAAATCAACTTGGGGAGCATTATTAGGGATGATATCTT
AGCTTGTGGACGGAAAAGGGCAGACAAGTTTTTCTTCGCCTCACTCATCACCCAACTCTATCAGAGGAGGAAGGATAAAGCCTCCACATCACAGGCTACTCCACCTACAG
GGTCGAATGTAGCTTCTCCATCCCAGCACACTTCTTTTACAGGGCCTTTACCAGCATCAGAAGCCCTAGGCATGGTCCACCGCCAGTTAGATCAAATCAGGGAGAACCTG
AAGACTTATTGGACGTATGCAAAGGAGCAGGATGAAGCTATTAGAGAGTTCTATCTCTCTATTGCCCCTAGTATGGCCCTGGTCTTTCCAAATTTCCCTCAGCCGCTGCT
GCCCCAAGAAGAAGAGGATTCTGATGAAGAGGAAGATGAAGAGAAAGAAGAGAAAGAGAGTTCCTTGGACGAGGAATATGGGAGGGCATTCATGACACCCATGGTTTGCC
ACTGCATGAAAGGTTTACTTAGGGAAAAAAACAGAGGAGAAGCTGGAATTCCCCATAAATGCGTCCGCATTTCTGGGAAGGCAAAATTGAAATGCGACCGCATTTCTGGA
AAAACAGTGGCCGTTCCGAGTCATCCGCGGGTCGTTGTTGACGAGTCTTCTTCGCACCTAACCGGTTGCTTTGACCCCGAAACTGACTCGACAACCTTCCAACACCTATT
TCACAGCTCCTCAACCAATTTCTACACTATATAA
mRNA sequenceShow/hide mRNA sequence
ATGCCAACCACCCATGACAACACAATTTCAGTGGATAGAGTTATGCTACTCTATTGCCTTATGAAGGGGTTGGAAATCAACTTGGGGAGCATTATTAGGGATGATATCTT
AGCTTGTGGACGGAAAAGGGCAGACAAGTTTTTCTTCGCCTCACTCATCACCCAACTCTATCAGAGGAGGAAGGATAAAGCCTCCACATCACAGGCTACTCCACCTACAG
GGTCGAATGTAGCTTCTCCATCCCAGCACACTTCTTTTACAGGGCCTTTACCAGCATCAGAAGCCCTAGGCATGGTCCACCGCCAGTTAGATCAAATCAGGGAGAACCTG
AAGACTTATTGGACGTATGCAAAGGAGCAGGATGAAGCTATTAGAGAGTTCTATCTCTCTATTGCCCCTAGTATGGCCCTGGTCTTTCCAAATTTCCCTCAGCCGCTGCT
GCCCCAAGAAGAAGAGGATTCTGATGAAGAGGAAGATGAAGAGAAAGAAGAGAAAGAGAGTTCCTTGGACGAGGAATATGGGAGGGCATTCATGACACCCATGGTTTGCC
ACTGCATGAAAGGTTTACTTAGGGAAAAAAACAGAGGAGAAGCTGGAATTCCCCATAAATGCGTCCGCATTTCTGGGAAGGCAAAATTGAAATGCGACCGCATTTCTGGA
AAAACAGTGGCCGTTCCGAGTCATCCGCGGGTCGTTGTTGACGAGTCTTCTTCGCACCTAACCGGTTGCTTTGACCCCGAAACTGACTCGACAACCTTCCAACACCTATT
TCACAGCTCCTCAACCAATTTCTACACTATATAA
Protein sequenceShow/hide protein sequence
MPTTHDNTISVDRVMLLYCLMKGLEINLGSIIRDDILACGRKRADKFFFASLITQLYQRRKDKASTSQATPPTGSNVASPSQHTSFTGPLPASEALGMVHRQLDQIRENL
KTYWTYAKEQDEAIREFYLSIAPSMALVFPNFPQPLLPQEEEDSDEEEDEEKEEKESSLDEEYGRAFMTPMVCHCMKGLLREKNRGEAGIPHKCVRISGKAKLKCDRISG
KTVAVPSHPRVVVDESSSHLTGCFDPETDSTTFQHLFHSSSTNFYTI