CuGenDBv2

Lag0007494 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0007494
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: rRNA intron-encoded homing endonuclease
Genome location: chr9:91044..91998
RNA-Seq Expression: Lag0007494
Synteny: Lag0007494
Gene Ontology terms: NA
InterPro domains: NA
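
The genome location field follows a `seqid:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state the convention), the field can be split and the genomic span measured as below. `parse_location` and `span_length` are hypothetical helper names used for illustration, not part of CuGenDB.

```python
import re

# Hypothetical helper pattern for a "seqid:start..end" location string.
LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text):
    """Split a location string into (seqid, start, end)."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return match["seqid"], int(match["start"]), int(match["end"])


def span_length(start, end):
    """Length of the span, assuming 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("chr9:91044..91998")
print(seqid, start, end, span_length(start, end))  # chr9 91044 91998 955
```

If the coordinates were instead 0-based and half-open, the length would be `end - start`, so the convention should be confirmed before the value is used downstream.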


Homology
GenBank top hits (e value, % identity, alignment)
CAA3026779.1 Hypothetical predicted protein [Olea europaea subsp. europaea] (e value: 3.8e-45; % identity: 54.5)
Query:  EAITVSASHQLALTTSLPFVHTARRSYRLNGPVKCSDRGDVGGCCPRRREKSTEPY--HLEEGEVVTSIVLHACTLPVPQQ----------NPGAGRAKE
        E   VSASHQLALTTSLPFVHTARRSYRLNGPVKCSDRGDVGG  P    ++ EP   H  E +            P+ ++          NPGA   KE
Subjt:  EAITVSASHQLALTTSLPFVHTARRSYRLNGPVKCSDRGDVGGCCPRRREKSTEPY--HLEEGEVVTSIVLHACTLPVPQQ----------NPGAGRAKE

Query:  LKRIRPP-PAPHRCAEGGAFLSYYSQRLSATDISALASMKNVAKCDTWCELQDPANHRVFERKLRPEPSGRGHVCLGVTH--RCPPRNPPSGWLRRCGHT
          R R P  AP      G   +  S+RLSATDISALASMKN AKCDTWCELQ+P NHRVFERKLRP PS RGHVCLGVTH     PR   +G     G  
Subjt:  LKRIRPP-PAPHRCAEGGAFLSYYSQRLSATDISALASMKNVAKCDTWCELQDPANHRVFERKLRPEPSGRGHVCLGVTH--RCPPRNPPSGWLRRCGHT

Query:  LASRAHRRADGLNSSPRRLSSR
         AS A  R    ++S R L+SR
Subjt:  LASRAHRRADGLNSSPRRLSSR

GEV87049.1 hypothetical protein [Tanacetum cinerariifolium] (e value: 1.0e-29; % identity: 61.42)
Query:  CPRRREKSTEPYHLEEGEVVTSIVLHACTLPVPQQNPGAGRAKELKRIRPPPAPHRCAEGGAFLSYYSQRLSATDISALASMKNVAKCDTWCELQDPANH
        C RRR+ STEPYHLEEGEVVT                   R  +LK+     A  R A  G +L Y  +RLSATDISA ASMKNVAKCDTWCELQ+P NH
Subjt:  CPRRREKSTEPYHLEEGEVVTSIVLHACTLPVPQQNPGAGRAKELKRIRPPPAPHRCAEGGAFLSYYSQRLSATDISALASMKNVAKCDTWCELQDPANH

Query:  RVFERKLRPEPSGRGHVCLGVTHRCPP
        RVFERKLRP P GRGHVCLGVTHR  P
Subjt:  RVFERKLRPEPSGRGHVCLGVTHRCPP

KAG5568993.1 hypothetical protein H5410_064037 [Solanum commersonii] (e value: 6.6e-45; % identity: 54.67)
Query:  EAITVSASHQLALTTSLPFVHTARRSYRLNGPVKCSDRGDVGGCCP-RRREKSTEPYHLEEGEVVTSIVLHACTLPVPQQNPGAGRAKELKRIRPPPAPH
        E   VSASHQLALTTSLPFVHTARRSYRLN PVKCSDRGDVGG  P   RE +    H           L     P  +     GR   + R+   P P 
Subjt:  EAITVSASHQLALTTSLPFVHTARRSYRLNGPVKCSDRGDVGGCCP-RRREKSTEPYHLEEGEVVTSIVLHACTLPVPQQNPGAGRAKELKRIRPPPAPH

Query:  RCAE---------------------GGAFLSYYSQRLSATDISALASMKNVAKCDTWCELQDPANHRVFERKLRPEPSGRGHVCLGVTHRCPPRNPPSGW
        R                        GG  L +  +RLSATDISALASMKNVAKCDTWCELQ+P NHRVFERKLRP+P GRGHVCLGVTHR  PR  P G 
Subjt:  RCAE---------------------GGAFLSYYSQRLSATDISALASMKNVAKCDTWCELQDPANHRVFERKLRPEPSGRGHVCLGVTHRCPPRNPPSGW

Query:  LRRCGHTLASRAHR
         R  G  LASRA R
Subjt:  LRRCGHTLASRAHR

KAG7527853.1 hypothetical protein ISN44_Un253g000010, partial [Arabidopsis suecica] (e value: 1.2e-27; % identity: 41)
Query:  WPLRPRKFEAITVSASHQLALTTSLPFVHTARRSYRLNGPVKCSDRGDVGG---------CCPRRREKSTEPYHLEEGEVVTSIVLHACTLPVPQQNPGA
        WP  P       VSASHQLALTTSLPFVHTARRSYRLN PVKCSDRGDVGG          CP++ +  T  +H      +T  VL    +         
Subjt:  WPLRPRKFEAITVSASHQLALTTSLPFVHTARRSYRLNGPVKCSDRGDVGG---------CCPRRREKSTEPYHLEEGEVVTSIVLHACTLPVPQQNPGA

Query:  GRAKELKRIRPPPAPHRCAEGGAFLSYYSQRLSATDIS------ALASMKNVAK---------CDTWCELQDPANHRVFERKLRPEPSGRGHVCLGVTHR
                        R A  G+F+S  S  L   DI+         + K  A           D     ++P NHRVFERKLRP+PSGRGHVCLGVT+R
Subjt:  GRAKELKRIRPPPAPHRCAEGGAFLSYYSQRLSATDIS------ALASMKNVAK---------CDTWCELQDPANHRVFERKLRPEPSGRGHVCLGVTHR

Query:  CPPRNPPSGWLRRCGHTLASRAHRRADGLNSSPRRLSSR
         P  +         G  L SR    A GLN S  RL  R
Subjt:  CPPRNPPSGWLRRCGHTLASRAHRRADGLNSSPRRLSSR

KAG9438841.1 hypothetical protein H6P81_021246 [Aristolochia fimbriata] (e value: 3.9e-29; % identity: 42.92)
Query:  EAITVSASHQLALTTSLPFVHTARRSYRLNGPVKCSDRGDVGGCCPRRREKSTEPYHLEEGEVVTSIVLHACTLPVPQQNPG----------AGRAKELK
        E   VSASHQLALTTSLPFVHTARRSYRLNGPV    R D          ++ E + +  G+   +     C  PV                +G  +   
Subjt:  EAITVSASHQLALTTSLPFVHTARRSYRLNGPVKCSDRGDVGGCCPRRREKSTEPYHLEEGEVVTSIVLHACTLPVPQQNPG----------AGRAKELK

Query:  RIRPPPAPHRCAE----------GGAFLSYYSQ----RLSATDISALASMKNVAKCDTWCELQDPANHRVFERKLRPEPSGRGHVCLGVT------HRCP
        R    P    CA+          G A  S        RLSATDI ALASMKNVAKCDTWCELQ+PANHRVFERKL P P G+G+ CLGVT        CP
Subjt:  RIRPPPAPHRCAE----------GGAFLSYYSQ----RLSATDISALASMKNVAKCDTWCELQDPANHRVFERKLRPEPSGRGHVCLGVT------HRCP

Query:  PRNPPSGWLRRCGHTLASRAHRRADGLNSSP-RRLSSRHY
           PP+    R G         RAD  +  P  RL+ +H+
Subjt:  PRNPPSGWLRRCGHTLASRAHRRADGLNSSP-RRLSSRHY

TrEMBL top hits (e value, % identity, alignment)
A0A0D3BAS1 Uncharacterized protein (e value: 8.7e-27; % identity: 45.45)
Query:  VSASHQLALTTSLPFVHTARRSYRLNGPVKCSDRGDVGG---CCPRRREKSTEPYHLEEGEVVTSIVLHACTLPVPQQNPGAGRAKELKRIRPPPAPHRC
        VSASHQLALTTSLPFVHTARRSYRLN PVKCSDRGDVGG      R  +  T  +H     ++   V       V + +P   R       R  P P   
Subjt:  VSASHQLALTTSLPFVHTARRSYRLNGPVKCSDRGDVGG---CCPRRREKSTEPYHLEEGEVVTSIVLHACTLPVPQQNPGAGRAKELKRIRPPPAPHRC

Query:  AEGGAFLSYYSQRLSATDISALASMKNVAKCDTWCELQDPANHRVFERKLRPEPSGRGHVCLGVTHRCPP---RNPPSGWLRRCGHT
                  S +  +T   A A+ + V      C   +P NHRVFERKLRP+PSGRGHVCLGVT+R PP   R     W   C  T
Subjt:  AEGGAFLSYYSQRLSATDISALASMKNVAKCDTWCELQDPANHRVFERKLRPEPSGRGHVCLGVTHRCPP---RNPPSGWLRRCGHT

A0A2P5E587 Uncharacterized protein (Fragment) (e value: 4.3e-26; % identity: 66.04)
Query:  LPVPQQNPGAGRAKELKRIR-PPPAPHRCAEGGAFLSY-YSQRLSATDISALASMKNVAKCDTWCELQDPANHRVFERKLRPEPSGRGHVCLGVTHRCPP
        +P    NPGA  A+E KR   P   P   A     LS+  S+RLSATDISALASMKNVAKCDTWCELQ+  NHRVFERKLRP+P GRGHVCLGVT RCPP
Subjt:  LPVPQQNPGAGRAKELKRIR-PPPAPHRCAEGGAFLSY-YSQRLSATDISALASMKNVAKCDTWCELQDPANHRVFERKLRPEPSGRGHVCLGVTHRCPP

Query:  RNPPSG
           PSG
Subjt:  RNPPSG

A0A6D2L4R4 Uncharacterized protein (e value: 3.3e-26; % identity: 81.82)
Query:  LEGLWPLRPRKFEAITVSASHQLALTTSLPFVHTARRSYRLNGPVKCSDRGDVGGCCPRRREKSTEPYHLEEGEVVT
        L G   +RPRKFEAITVSASHQLALTTSLPFVHTARRSYRLN PVKCSDRGDVGG      +KST+PYHLEEGEVVT
Subjt:  LEGLWPLRPRKFEAITVSASHQLALTTSLPFVHTARRSYRLNGPVKCSDRGDVGGCCPRRREKSTEPYHLEEGEVVT
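
The % identity column can be related to the alignment text itself. In BLAST-style output, the middle line of each block shows a letter where query and subject are identical, `+` for a conservative substitution, and a space for a mismatch or gap. The sketch below counts letters in the midline of the A0A6D2L4R4 alignment above and divides by the alignment length. It assumes that the 8-character label column has been removed, that interior and trailing spaces are preserved, and that the reported value covers this single alignment block. `identity_from_midline` is a hypothetical helper name.

```python
def identity_from_midline(midline):
    """Return (identities, alignment_length, percent_identity).

    A letter in the midline marks an identical residue; "+" marks a
    conservative substitution; a space marks a mismatch or a gap.
    """
    identities = sum(1 for ch in midline if ch.isalpha())
    length = len(midline)
    return identities, length, round(100.0 * identities / length, 2)


# Midline of the A0A6D2L4R4 alignment, with runs of spaces written
# explicitly so that the column count is unambiguous.
midline = (
    "L G" + " " * 3
    + "+RPRKFEAITVSASHQLALTTSLPFVHTARRSYRLN PVKCSDRGDVGG"
    + " " * 6
    + "+KST+PYHLEEGEVVT"
)

print(identity_from_midline(midline))  # (63, 77, 81.82)
```

For this hit, 63 identical positions over 77 aligned columns reproduces the listed 81.82. Multi-block alignments would need their midlines concatenated before counting.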

A0A6P5WYZ9 uncharacterized protein LOC111278348 (e value: 3.9e-27; % identity: 64.42)
Query:  QQNPGAGRAKELKRIRP------PPAPHRCAEGGAFLSYYSQRLSATDISALASMKNVAKCDTWCELQDPANHRVFERKLRPEPSGRGHVCLGVTHRCPP
        Q+ PGA RAKE KR R       P A        +F    ++RLSATDISALASMKNVAKCDTWCELQ+P NHRVFERKLRP+P GRGH CLGV HR PP
Subjt:  QQNPGAGRAKELKRIRP------PPAPHRCAEGGAFLSYYSQRLSATDISALASMKNVAKCDTWCELQDPANHRVFERKLRPEPSGRGHVCLGVTHRCPP

Query:  RNPP
         +PP
Subjt:  RNPP

V4UD02 Uncharacterized protein (e value: 4.3e-26; % identity: 76.54)
Query:  YSQRLSATDISALASMKNVAKCDTWCELQDPANHRVFERKLRPEPSGRGHVCLGVTHRCPPRNPPS-GWLRRCGHTLASRA
        Y +RLSATDISALASMKNVAKCDTWCELQ+P NHRVFERKLRP+P GRGHVCLGVTHRCP   P +      CG  LASRA
Subjt:  YSQRLSATDISALASMKNVAKCDTWCELQDPANHRVFERKLRPEPSGRGHVCLGVTHRCPPRNPPS-GWLRRCGHTLASRA

SwissProt top hits
No hits found

Arabidopsis top hits
No hits found

Sequences
CDS sequence
ATGCGGAGGAATCCCTCCGTGGCCAGCTTCTTAGAGGGACTATGGCCGCTTAGGCCAAGGAAGTTTGAGGCAATAACAGTAAGCGCGAGTCATCAGCTCGCGTTGACTAC
GTCCCTGCCCTTTGTACACACCGCCCGTCGCTCCTACCGATTGAATGGTCCGGTGAAGTGTTCGGATCGCGGCGACGTGGGCGGTTGCTGCCCGCGACGTCGCGAGAAGT
CCACTGAACCTTATCATTTAGAGGAAGGAGAAGTCGTAACAAGCATCGTCTTGCACGCATGCACCCTCCCGGTGCCTCAACAAAACCCCGGCGCAGGTCGCGCCAAGGAA
CTCAAACGAATTCGCCCGCCCCCCGCCCCGCATCGGTGTGCGGAGGGCGGAGCATTCTTGTCGTATTATTCACAACGACTCTCGGCAACGGATATCTCGGCTCTCGCATC
GATGAAGAACGTAGCGAAATGCGATACTTGGTGTGAATTGCAGGATCCCGCGAACCACCGAGTCTTTGAACGCAAGTTGCGCCCGGAGCCTTCTGGCCGAGGGCACGTCT
GCCTGGGCGTCACGCATCGCTGCCCCCCACGCAACCCCCCTTCGGGTTGGTTGCGCAGGTGCGGGCACACGCTGGCCTCCCGTGCGCACCGTCGTGCGGATGGCTTAAAT
TCGAGTCCTCGGCGCCTGTCGTCGCGACACTACGGTGGTTGA
mRNA sequence
ATGCGGAGGAATCCCTCCGTGGCCAGCTTCTTAGAGGGACTATGGCCGCTTAGGCCAAGGAAGTTTGAGGCAATAACAGTAAGCGCGAGTCATCAGCTCGCGTTGACTAC
GTCCCTGCCCTTTGTACACACCGCCCGTCGCTCCTACCGATTGAATGGTCCGGTGAAGTGTTCGGATCGCGGCGACGTGGGCGGTTGCTGCCCGCGACGTCGCGAGAAGT
CCACTGAACCTTATCATTTAGAGGAAGGAGAAGTCGTAACAAGCATCGTCTTGCACGCATGCACCCTCCCGGTGCCTCAACAAAACCCCGGCGCAGGTCGCGCCAAGGAA
CTCAAACGAATTCGCCCGCCCCCCGCCCCGCATCGGTGTGCGGAGGGCGGAGCATTCTTGTCGTATTATTCACAACGACTCTCGGCAACGGATATCTCGGCTCTCGCATC
GATGAAGAACGTAGCGAAATGCGATACTTGGTGTGAATTGCAGGATCCCGCGAACCACCGAGTCTTTGAACGCAAGTTGCGCCCGGAGCCTTCTGGCCGAGGGCACGTCT
GCCTGGGCGTCACGCATCGCTGCCCCCCACGCAACCCCCCTTCGGGTTGGTTGCGCAGGTGCGGGCACACGCTGGCCTCCCGTGCGCACCGTCGTGCGGATGGCTTAAAT
TCGAGTCCTCGGCGCCTGTCGTCGCGACACTACGGTGGTTGA
Protein sequence
MRRNPSVASFLEGLWPLRPRKFEAITVSASHQLALTTSLPFVHTARRSYRLNGPVKCSDRGDVGGCCPRRREKSTEPYHLEEGEVVTSIVLHACTLPVPQQNPGAGRAKE
LKRIRPPPAPHRCAEGGAFLSYYSQRLSATDISALASMKNVAKCDTWCELQDPANHRVFERKLRPEPSGRGHVCLGVTHRCPPRNPPSGWLRRCGHTLASRAHRRADGLN
SSPRRLSSRHYGG
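
The protein sequence can be checked against the CDS by translating it codon by codon. The following is a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1); the page does not state which table was used. `translate` is a hypothetical helper, and only the first 30 and last 18 nucleotides of the CDS are shown to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 nt of the CDS: matches the start of the protein sequence.
print(translate("ATGCGGAGGAATCCCTCCGTGGCCAGCTTC"))  # MRRNPSVASF
# Last 18 nt of the CDS, ending in the TGA stop codon.
print(translate("CGACACTACGGTGGTTGA"))  # RHYGG
```

Joining the wrapped CDS lines into one string and passing it to `translate` allows the same comparison over the full sequence.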