CuGenDBv2

Lag0005672 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0005672
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Integrase catalytic domain-containing protein
Genome location: chr6:25813909..25816867
RNA-Seq Expression: Lag0005672
Synteny: Lag0005672
Gene Ontology terms: GO:0016740 - transferase activity (molecular function)
InterPro domains: IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase


Homology
GenBank top hits (e-value, % identity, alignment)
KAA0040737.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa] | e-value: 2.5e-29 | identity: 90.28%
Query:  TDEAGEEGGYASWCIFVGYDQQRL---RCIDPTTKSVSPHVVFDEASSWWSSENVVLPDSGMLDFRMRAQAP
        TDEAGEEGGYASWCIFVGYDQ      RCIDPTTKSVSPH+VFDEASSWWSSENVVLPDSGMLDFRMRAQAP
Subjt:  TDEAGEEGGYASWCIFVGYDQQRL---RCIDPTTKSVSPHVVFDEASSWWSSENVVLPDSGMLDFRMRAQAP

KAG5524056.1 hypothetical protein RHGRI_030902 [Rhododendron griersonianum] | e-value: 3.1e-11 | identity: 48.84%
Query:  GFEVLGHEDKVCKLRKLSLYDLKLSSRQWYLKLNQAISDMGFRMNFDDHCIETRIMDGEIRILSNYVYDILIAGNSLFAINRTQNW
        GF+V G   KVCKL++ S+Y LK SSRQWYL+ ++AI   GF M  +DHC+  +       ILS YV DIL+AGN +  I  T+ W
Subjt:  GFEVLGHEDKVCKLRKLSLYDLKLSSRQWYLKLNQAISDMGFRMNFDDHCIETRIMDGEIRILSNYVYDILIAGNSLFAINRTQNW

OMO58188.1 Integrase, catalytic core [Corchorus capsularis] | e-value: 9.7e-13 | identity: 51.16%
Query:  GFEVLGHEDKVCKLRKLSLYDLKLSSRQWYLKLNQAISDMGFRMNFDDHCIETRIMDGEIRILSNYVYDILIAGNSLFAINRTQNW
        GF   G E KVCKL++ S+Y LK SSRQWYL+ +QAI   GF M  +DHC+  +   G   ILS YV DIL+AGN +  I  T+ W
Subjt:  GFEVLGHEDKVCKLRKLSLYDLKLSSRQWYLKLNQAISDMGFRMNFDDHCIETRIMDGEIRILSNYVYDILIAGNSLFAINRTQNW

PIA41056.1 hypothetical protein AQUCO_02300087v1 [Aquilegia coerulea] | e-value: 6.3e-12 | identity: 81.4%
Query:  MRYGEWNLLVPGKGNACYFHFFTSHPTLLCCLVDSKNEVPVVL
        MRYGEWNLLVPGKGN CYFHFFTSHPT LCCLVDSK    ++L
Subjt:  MRYGEWNLLVPGKGNACYFHFFTSHPTLLCCLVDSKNEVPVVL

VAH70504.1 unnamed protein product [Triticum turgidum subsp. durum] | e-value: 3.1e-11 | identity: 42.06%
Query:  SGMLDFRMRAQAPSNKSLILPGFEVLGHEDKVCKLRKLSLYDLKLSSRQWYLKLNQAISDMGFRMNFDDHCIETRIMDGEIRILSNYVYDILIAGNSLFA
        +GML+  +    P        GF   G+E KVCKL K S+Y LK +SRQWY+  + AI+  GF M   DHCI  +IM  +  +LS YV DILIA N    
Subjt:  SGMLDFRMRAQAPSNKSLILPGFEVLGHEDKVCKLRKLSLYDLKLSSRQWYLKLNQAISDMGFRMNFDDHCIETRIMDGEIRILSNYVYDILIAGNSLFA

Query:  INRTQNW
        +   + W
Subjt:  INRTQNW

TrEMBL top hits (e-value, % identity, alignment)
A0A2N9EKE5 Uncharacterized protein | e-value: 2.5e-14 | identity: 51.16%
Query:  GFEVLGHEDKVCKLRKLSLYDLKLSSRQWYLKLNQAISDMGFRMNFDDHCIETRIMDGEIRILSNYVYDILIAGNSLFAINRTQNW
        GFEV GHE KVC+L++ S+Y LK SSRQWYL+ + +I+  GF M  +DHC+  +     I ILS YV DIL+AGN + +I  T+ W
Subjt:  GFEVLGHEDKVCKLRKLSLYDLKLSSRQWYLKLNQAISDMGFRMNFDDHCIETRIMDGEIRILSNYVYDILIAGNSLFAINRTQNW

A0A2N9FNH3 Integrase catalytic domain-containing protein | e-value: 2.5e-14 | identity: 51.16%
Query:  GFEVLGHEDKVCKLRKLSLYDLKLSSRQWYLKLNQAISDMGFRMNFDDHCIETRIMDGEIRILSNYVYDILIAGNSLFAINRTQNW
        GFEV GHE KVC+L++ S+Y LK SSRQWYL+ + +I+  GF M  +DHC+  +     I ILS YV DIL+AGN + +I  T+ W
Subjt:  GFEVLGHEDKVCKLRKLSLYDLKLSSRQWYLKLNQAISDMGFRMNFDDHCIETRIMDGEIRILSNYVYDILIAGNSLFAINRTQNW

A0A2N9GVL2 Integrase catalytic domain-containing protein | e-value: 2.9e-15 | identity: 51.14%
Query:  LPGFEVLGHEDKVCKLRKLSLYDLKLSSRQWYLKLNQAISDMGFRMNFDDHCIETRIMDGEIRILSNYVYDILIAGNSLFAINRTQNW
        L GFEV GHE KVC+L++ S+Y LK SSRQWYL+ + +I+  GF MN +DHC+  +     I ILS YV DIL+ GN + +I  T+ W
Subjt:  LPGFEVLGHEDKVCKLRKLSLYDLKLSSRQWYLKLNQAISDMGFRMNFDDHCIETRIMDGEIRILSNYVYDILIAGNSLFAINRTQNW

A0A2N9HWA3 Integrase catalytic domain-containing protein | e-value: 2.9e-15 | identity: 51.14%
Query:  LPGFEVLGHEDKVCKLRKLSLYDLKLSSRQWYLKLNQAISDMGFRMNFDDHCIETRIMDGEIRILSNYVYDILIAGNSLFAINRTQNW
        L GFEV GHE KVC+L++ S+Y LK SSRQWYL+ + +I+  GF MN +DHC+  +     I ILS YV DIL+ GN + +I  T+ W
Subjt:  LPGFEVLGHEDKVCKLRKLSLYDLKLSSRQWYLKLNQAISDMGFRMNFDDHCIETRIMDGEIRILSNYVYDILIAGNSLFAINRTQNW

A0A5A7TGM3 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e-value: 1.2e-29 | identity: 90.28%
Query:  TDEAGEEGGYASWCIFVGYDQQRL---RCIDPTTKSVSPHVVFDEASSWWSSENVVLPDSGMLDFRMRAQAP
        TDEAGEEGGYASWCIFVGYDQ      RCIDPTTKSVSPH+VFDEASSWWSSENVVLPDSGMLDFRMRAQAP
Subjt:  TDEAGEEGGYASWCIFVGYDQQRL---RCIDPTTKSVSPHVVFDEASSWWSSENVVLPDSGMLDFRMRAQAP

SwissProt top hits (e-value, % identity, alignment)
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e-value: 1.8e-06 | identity: 41.33%
Query:  GFEVLGHEDKVCKLRKLSLYDLKLSSRQWYLKLNQAISDMGFRMNFDDHCIE-TRIMDGEIRILSNYVYDILIAG
        GFEV G +  VCKL K SLY LK + RQWY+K +  +    +   + D C+   R  +    IL  YV D+LI G
Subjt:  GFEVLGHEDKVCKLRKLSLYDLKLSSRQWYLKLNQAISDMGFRMNFDDHCIE-TRIMDGEIRILSNYVYDILIAG

Arabidopsis top hits (e-value, % identity, alignment)
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8 | e-value: 1.6e-05 | identity: 33.67%
Query:  VCKLRKLSLYDLKLSSRQWYLKLNQAISDMGFRMNFDDHCIETRIMDGEIRILSNYVYDILIAGNSLFAIN----------RTQNWGWVKLLLCLEIA
        VC L+K S+Y LK +SRQW+LK +  +   GF  +  DH    +I       +  YV DI+I  N+  A++          + ++ G +K  L LEIA
Subjt:  VCKLRKLSLYDLKLSSRQWYLKLNQAISDMGFRMNFDDHCIETRIMDGEIRILSNYVYDILIAGNSLFAIN----------RTQNWGWVKLLLCLEIA
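The % identity values listed with each hit can be cross-checked against the alignment midlines. The sketch below assumes that identity is the number of letter positions in the midline (exact matches) divided by the number of alignment columns, gaps included. That definition is an assumption, not something this page states, and `percent_identity` is a hypothetical helper name. The strings are copied from the TNT 1-94 hit (KAA0040737.1 / A0A5A7TGM3).

```python
def percent_identity(midline: str, aligned_length: int) -> float:
    """Percent identity from a BLAST-style midline.

    Assumption: letters in the midline mark exact matches, while '+'
    (conservative substitution) and spaces (mismatch or gap) do not count.
    """
    identities = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identities / aligned_length, 2)


# Query row and midline of the TNT 1-94 alignment, leading label removed.
query = "TDEAGEEGGYASWCIFVGYDQQRL---RCIDPTTKSVSPHVVFDEASSWWSSENVVLPDSGMLDFRMRAQAP"
midline = "TDEAGEEGGYASWCIFVGYDQ      RCIDPTTKSVSPH+VFDEASSWWSSENVVLPDSGMLDFRMRAQAP"

# The denominator is the number of alignment columns (72 here).
identity = percent_identity(midline, len(query))
print(identity)  # 90.28
```

Under this assumption the alignment has 65 matching positions over 72 columns, which reproduces the 90.28 reported for that hit.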


Sequences
CDS sequence
ATGAACTCCCAAATTCTCGGCGTCGAGAGAGATTTGAGATTCCTTACGGACGAAGCTGGAGAAGAAGGCGGCTATGCTAGTTGGTGCATCTTTGTCGGCTATGATCAGCA
GAGGTTGAGGTGCATTGATCCGACTACAAAATCTGTATCTCCACATGTCGTTTTCGATGAGGCATCATCATGGTGGTCCTCCGAAAATGTAGTACTGCCGGATTCCGGGA
TGCTAGACTTCCGTATGCGGGCACAAGCACCTAGCAACAAGAGCCTGATATTGCCAGGTTTTGAGGTTCTTGGGCATGAAGATAAGGTTTGCAAGCTAAGAAAGCTTTCT
TTATATGACTTGAAACTGTCTTCCCGACAGTGGTACCTTAAGTTGAATCAAGCTATTAGCGATATGGGATTTCGAATGAATTTCGACGATCATTGTATTGAAACTCGAAT
AATGGATGGTGAGATTAGAATATTGTCAAACTATGTTTATGATATCCTTATTGCCGGAAATTCCTTGTTTGCCATAAATCGAACTCAAAATTGGGGATGGGTGAAGCTTC
TTCTGTGTTTGGAAATAGCGGGTGGCTGTGTTATCCGCCCGGACAACAAGTCAAGTTGGCGGGGCTTTTTGGGGACTAGCCCGCTTCCCATTACTCGACGAAGGCTTCCC
ATTGTCCGGTTCTTTCTGGAATTGGCTTCAATCCGAGTCCGGGTCCCTGGGCTGGCTGCATCGGGGAGGGGTAGAAGCAATGGATCGAGTGTATGGATGAAATTGATTCT
CGTCCTGGGGATCATCTCTCCTAGTCTTGATCTAGCCGGTCCCCTCGGAATGATCTCGAGATGGAAGCGGGAGCGGAGAGCCCCAGCCCAGCTTGAAGGGCTAGGGATGC
GATACGGGGAATGGAACCTCTTAGTACCGGGCAAGGGGAATGCATGCTATTTCCATTTCTTCACATCCCACCCAACCCTGCTATGTTGTCTCGTAGATAGCAAGAATGAA
GTCCCCGTAGTCCTGGCTATGTGGATACTCATTCTAAACCACATGGGAGCTTCTGTAGCCAGGTCAACTCGAGATAGCAAGAATATACGGGAGCCATTAATATGGACAAC
TTTTCTCCCTACACGGGAAGCCACCACGAGAAAGAAACCTTCCGGGCCAAAGATGATTGATGTTTTCATTCGAGAAAGAAACTCCTGGCTTAACTCAAAAATAGGTTGCT
CATTTGCTCCTATGGCCTATCTTGCGGCAAGGCCAGGCCCCCTATGGAAAGTCCGGGTTTCAAGCCCTAAACGGCCGTCCACCCGCCTACTCAAGAAATCCGAGACCTGT
TGTAGGAGCGAAGCCGCTTGA
mRNA sequence
ATGAACTCCCAAATTCTCGGCGTCGAGAGAGATTTGAGATTCCTTACGGACGAAGCTGGAGAAGAAGGCGGCTATGCTAGTTGGTGCATCTTTGTCGGCTATGATCAGCA
GAGGTTGAGGTGCATTGATCCGACTACAAAATCTGTATCTCCACATGTCGTTTTCGATGAGGCATCATCATGGTGGTCCTCCGAAAATGTAGTACTGCCGGATTCCGGGA
TGCTAGACTTCCGTATGCGGGCACAAGCACCTAGCAACAAGAGCCTGATATTGCCAGGTTTTGAGGTTCTTGGGCATGAAGATAAGGTTTGCAAGCTAAGAAAGCTTTCT
TTATATGACTTGAAACTGTCTTCCCGACAGTGGTACCTTAAGTTGAATCAAGCTATTAGCGATATGGGATTTCGAATGAATTTCGACGATCATTGTATTGAAACTCGAAT
AATGGATGGTGAGATTAGAATATTGTCAAACTATGTTTATGATATCCTTATTGCCGGAAATTCCTTGTTTGCCATAAATCGAACTCAAAATTGGGGATGGGTGAAGCTTC
TTCTGTGTTTGGAAATAGCGGGTGGCTGTGTTATCCGCCCGGACAACAAGTCAAGTTGGCGGGGCTTTTTGGGGACTAGCCCGCTTCCCATTACTCGACGAAGGCTTCCC
ATTGTCCGGTTCTTTCTGGAATTGGCTTCAATCCGAGTCCGGGTCCCTGGGCTGGCTGCATCGGGGAGGGGTAGAAGCAATGGATCGAGTGTATGGATGAAATTGATTCT
CGTCCTGGGGATCATCTCTCCTAGTCTTGATCTAGCCGGTCCCCTCGGAATGATCTCGAGATGGAAGCGGGAGCGGAGAGCCCCAGCCCAGCTTGAAGGGCTAGGGATGC
GATACGGGGAATGGAACCTCTTAGTACCGGGCAAGGGGAATGCATGCTATTTCCATTTCTTCACATCCCACCCAACCCTGCTATGTTGTCTCGTAGATAGCAAGAATGAA
GTCCCCGTAGTCCTGGCTATGTGGATACTCATTCTAAACCACATGGGAGCTTCTGTAGCCAGGTCAACTCGAGATAGCAAGAATATACGGGAGCCATTAATATGGACAAC
TTTTCTCCCTACACGGGAAGCCACCACGAGAAAGAAACCTTCCGGGCCAAAGATGATTGATGTTTTCATTCGAGAAAGAAACTCCTGGCTTAACTCAAAAATAGGTTGCT
CATTTGCTCCTATGGCCTATCTTGCGGCAAGGCCAGGCCCCCTATGGAAAGTCCGGGTTTCAAGCCCTAAACGGCCGTCCACCCGCCTACTCAAGAAATCCGAGACCTGT
TGTAGGAGCGAAGCCGCTTGA
Protein sequence
MNSQILGVERDLRFLTDEAGEEGGYASWCIFVGYDQQRLRCIDPTTKSVSPHVVFDEASSWWSSENVVLPDSGMLDFRMRAQAPSNKSLILPGFEVLGHEDKVCKLRKLS
LYDLKLSSRQWYLKLNQAISDMGFRMNFDDHCIETRIMDGEIRILSNYVYDILIAGNSLFAINRTQNWGWVKLLLCLEIAGGCVIRPDNKSSWRGFLGTSPLPITRRRLP
IVRFFLELASIRVRVPGLAASGRGRSNGSSVWMKLILVLGIISPSLDLAGPLGMISRWKRERRAPAQLEGLGMRYGEWNLLVPGKGNACYFHFFTSHPTLLCCLVDSKNE
VPVVLAMWILILNHMGASVARSTRDSKNIREPLIWTTFLPTREATTRKKPSGPKMIDVFIRERNSWLNSKIGCSFAPMAYLAARPGPLWKVRVSSPKRPSTRLLKKSETC
CRSEAA