CuGenDBv2

Lag0007495 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0007495
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: rRNA intron-encoded homing endonuclease
Genome location: chr9:101339..102248
RNA-Seq Expression: Lag0007495
Synteny: Lag0007495
Gene Ontology terms: NA
InterPro domains: NA
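
The genome location string above can be split into a sequence name and a coordinate pair. The following is a minimal Python sketch, not part of the database record: the names `Location` and `parse_location` are hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which the page itself does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed 'seqid:start..end' genome location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive on both ends.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a location string such as 'chr9:101339..102248'."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError(f"start {start} is greater than end {end}")
    return Location(seqid, start, end)


loc = parse_location("chr9:101339..102248")
print(loc.seqid, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption the locus spans 910 bp on chr9.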


Homology
GenBank top hits
CAF2086620.1 unnamed protein product [Brassica napus] (e value: 2.1e-28; % identity: 48.59)
Query:  MKNVAKCDTWCELQDPANHRVFERKLRPEPSGRGHVCLGVTHRCPPRNPLGLVAQVRAHAGLPCAPSCGWLKFESSAPVVATLRWLIQPRYRVATSVRAT
        MKNVAKCDTWCELQ+P NHRVFERKLRP+PSGRGH CLGVT+R  P++   ++             +CG  +F +S  +V + R                
Subjt:  MKNVAKCDTWCELQDPANHRVFERKLRPEPSGRGHVCLGVTHRCPPRNPLGLVAQVRAHAGLPCAPSCGWLKFESSAPVVATLRWLIQPRYRVATSVRAT

Query:  PPASERGLLCRPSERRPKDDALDATPVWRSVLSGGPGTSPLEGGAGEGESPVALGPCRTTRRCQRVGLFGNAAPIGR
             + LL   S +R  D +L+  P  R+      G   LE GA EGESPV  GPCRTTRRC RVGLFGNAAPIGR
Subjt:  PPASERGLLCRPSERRPKDDALDATPVWRSVLSGGPGTSPLEGGAGEGESPVALGPCRTTRRCQRVGLFGNAAPIGR

GEV87049.1 hypothetical protein [Tanacetum cinerariifolium] (e value: 5.0e-46; % identity: 54.63)
Query:  GAFLSYYSQRLSATDISALASMKNVAKCDTWCELQDPANHRVFERKLRPEPSGRGHVCLGVTHRCPPRNPLGLVAQVRAHAGLPCAPSCGWLKFESSAPV
        G +L Y  +RLSATDISA ASMKNVAKCDTWCELQ+P NHRVFERKLRP P GRGHVCLGVTHR  P   L     + A  GLPCA  CGW        +
Subjt:  GAFLSYYSQRLSATDISALASMKNVAKCDTWCELQDPANHRVFERKLRPEPSGRGHVCLGVTHRCPPRNPLGLVAQVRAHAGLPCAPSCGWLKFESSAPV

Query:  VATLRWLIQPRYRVATSVRATPPASERGLLCRPSERRPKDDALDATPVWRSV--------------LSG----GPGTSPLEGGAGEGESPVALGPCRTTR
           LRW  +    V T V  +   + +     P+   P DDA  ATP    +              L G    GPG SPLEGGAGEGESPV  GPCRTTR
Subjt:  VATLRWLIQPRYRVATSVRATPPASERGLLCRPSERRPKDDALDATPVWRSV--------------LSG----GPGTSPLEGGAGEGESPVALGPCRTTR

Query:  RCQRVGLFGNAAPIGR
        RC  VGLFGNAAPIGR
Subjt:  RCQRVGLFGNAAPIGR

KAF9662358.1 hypothetical protein SADUNF_Sadunf18G0044900 [Salix dunnii] (e value: 4.7e-28; % identity: 57.35)
Query:  PGASTKPRRRSRQGTQTNSPAPRPAR---CAEGGAFLSYYSQRLSATDISALASMKNVAKCDTWCELQDPANHRVFERKLRPEPSGRGHVCLGVTHRCPP
        P  + + R R+R G+ TN P  R A+     E     S   +RLSATDISALASMKNVAKCDTWCELQ+P NHRVFERKLRP P GRGHVCLGV HR  P
Subjt:  PGASTKPRRRSRQGTQTNSPAPRPAR---CAEGGAFLSYYSQRLSATDISALASMKNVAKCDTWCELQDPANHRVFERKLRPEPSGRGHVCLGVTHRCPP

Query:  RNPLGLVAQVRAHAGLPCAPSCGWLKFESSAPVVAT
        R+P   + +  A  GLP AP+ GW K ES A    T
Subjt:  RNPLGLVAQVRAHAGLPCAPSCGWLKFESSAPVVAT

KAG5129004.1 hypothetical protein JHK84_035401 [Glycine max] (e value: 6.7e-43; % identity: 45.36)
Query:  GGEHRLARMHPPGASTK---PRRRSRQGTQTNSPAP-RPARCAEGGAFLSYYSQRLSATDISALASMKNVAKCDTWCELQDPANHRVFERKLRPEPSGRG
        G +H      PP    K   P  R+ + +   S  P  P   + G    +        TDISALASMKNVAKCDTWCELQ+P NHRVFERKLRP+P GRG
Subjt:  GGEHRLARMHPPGASTK---PRRRSRQGTQTNSPAP-RPARCAEGGAFLSYYSQRLSATDISALASMKNVAKCDTWCELQDPANHRVFERKLRPEPSGRG

Query:  HVCLGVTHRCPPRNPLGLVAQVRAHAGLPCAPSCGWLKFESSAPVVATLRWLIQPRYRVAT--SVRATP---PASERGLLCRPSERRPKDDALDAT----
        H CLGVTHR P      + A  R +A LP AP+  WL        +   + + +PR R  T   V + P   P   R L  RP  RR    +L  +    
Subjt:  HVCLGVTHRCPPRNPLGLVAQVRAHAGLPCAPSCGWLKFESSAPVVATLRWLIQPRYRVAT--SVRATP---PASERGLLCRPSERRPKDDALDAT----

Query:  ----------------------------PVWRSVLSGGPGTSPLEGGAGEGESPVALGPCRTTRRCQRVGLFGNAAPIGR
                                     VWRSVLSGGPG SPLEGGA EGESPV  GPCRTTRRC+RVGLFGNAAPIGR
Subjt:  ----------------------------PVWRSVLSGGPGTSPLEGGAGEGESPVALGPCRTTRRCQRVGLFGNAAPIGR

TKY53825.1 hypothetical protein E2542_SST25368 [Spatholobus suberectus] (e value: 1.8e-43; % identity: 57.07)
Query:  MKNVAKCDTWCELQDPANHRVFERKLRPEPSGRGHVCLGVTHRCPPRNPLGLVAQVRAHAGLPCAPSCGWLKFESSAPVVATLRWLIQPRYR-------V
        MKNVAKCDTWCELQ+P NHRVFERKLRP+P GRGH CLGVTHRCPP + +              A    WL      P  A  R L + R R       +
Subjt:  MKNVAKCDTWCELQDPANHRVFERKLRPEPSGRGHVCLGVTHRCPPRNPLGLVAQVRAHAGLPCAPSCGWLKFESSAPVVATLRWLIQPRYR-------V

Query:  ATSVRATPPASERGLLCRPSERRPKDDALDATPVWRSVLSGGPGTSPLEGGAGEGESPVALGPCRTTRRCQRVGLFGNAAPIGR
            R+ P    R   C  S+     DAL  T VWRSVLSGGPG SPLEGGA EGESPV  GPCRTTRRC+RVGLFGNAAPIGR
Subjt:  ATSVRATPPASERGLLCRPSERRPKDDALDATPVWRSVLSGGPGTSPLEGGAGEGESPVALGPCRTTRRCQRVGLFGNAAPIGR

TrEMBL top hits
A0A2N9IL11 Uncharacterized protein (e value: 2.9e-31; % identity: 49.46)
Query:  DPANHRVFERKLRPEPSGRGHVCLGVTHRC----PPRNPLGLVAQVRAHAGLPCAPSCGWLKFESSAPVVATLRWLIQPRYRVATSVRATPPASERGLLC
        +P NHRVFERKLRP P GRGHVCLGVTHRC    PPR   G+     +  G    P     + +SS     T R  + P    AT  +A  PA  + +  
Subjt:  DPANHRVFERKLRPEPSGRGHVCLGVTHRC----PPRNPLGLVAQVRAHAGLPCAPSCGWLKFESSAPVVATLRWLIQPRYRVATSVRATPPASERGLLC

Query:  RPSERRPKDDALDATP-----------------VWRSVLSGGPGTSPLEGGAGEGESPVALGPCRTTRRCQRVGLFGNAAPIGR
        R  ++  +   + A+                  VWRSVLSGGPG SPLEGGAGEGESPV  GPCRTTRRC+RVGLFGNAAPIGR
Subjt:  RPSERRPKDDALDATP-----------------VWRSVLSGGPGTSPLEGGAGEGESPVALGPCRTTRRCQRVGLFGNAAPIGR

A0A3N7FVM0 Uncharacterized protein (e value: 6.6e-28; % identity: 71.88)
Query:  RLSATDISALASMKNVAKCDTWCELQDPANHRVFERKLRPEPSGRGHVCLGVTHRCP--PRNPLGLVAQVRAHAGLPCAPSCGWLKFESSAPVVAT
        RLSATDISALASMKNVAKCDTWCELQ+P NHRVFERKLRP P GRGHVCLGVTHR P  PR     + +  A  GLP AP+ GW K ESSA V  T
Subjt:  RLSATDISALASMKNVAKCDTWCELQDPANHRVFERKLRPEPSGRGHVCLGVTHRCP--PRNPLGLVAQVRAHAGLPCAPSCGWLKFESSAPVVAT

A0A6J5UZ50 Uncharacterized protein (e value: 1.5e-27; % identity: 65)
Query:  SQRLSATDISALASMKNVAKCDTWCELQDPANHRVFERKLRPEPSGRGHVCLGVTHRCPPRNPLGLVAQVRAHAGLPCAPSCGWLKFESSAPVVATLRWL
        S+RLSATDISALASMKNVAKCDTWCELQ+P NHRVFERKLRP+P GRGH CLGVT RCPP     + +   A  GLPCA S GW K++         +WL
Subjt:  SQRLSATDISALASMKNVAKCDTWCELQDPANHRVFERKLRPEPSGRGHVCLGVTHRCPPRNPLGLVAQVRAHAGLPCAPSCGWLKFESSAPVVATLRWL

M1D9E9 Uncharacterized protein (e value: 1.9e-38; % identity: 50.95)
Query:  MKNVAKCDTWCELQDPANHRVFERKLRPEPSGRGHVCLGVTHRCPPRNPLG-----LVAQVRAHAGLPCAPSCGW-----LKFESSAPVVATLR------
        MKNVAKCDTWCELQ+P NHRVFERKLRP+P GRGHVCLGVTHR  PR   G      V   RA AGL  +P         ++ + S  V AT R      
Subjt:  MKNVAKCDTWCELQDPANHRVFERKLRPEPSGRGHVCLGVTHRCPPRNPLG-----LVAQVRAHAGLPCAPSCGW-----LKFESSAPVVATLR------

Query:  -----WLIQPR------YRVATSVRATPPASERGLLCRPSERRPKDDALDA------TPVWRSVLSGGPGTSPLEGGAGEGESPVALGPCRTTRRCQRVG
               ++PR      Y ++ S+  +             ER     AL++        VWRSVLSGGPG SPLEGGA EGESPV  GPCRTTRRC RVG
Subjt:  -----WLIQPR------YRVATSVRATPPASERGLLCRPSERRPKDDALDA------TPVWRSVLSGGPGTSPLEGGAGEGESPVALGPCRTTRRCQRVG

Query:  LFGNAAPIGR
        LFGNAA IGR
Subjt:  LFGNAAPIGR

V4UD02 Uncharacterized protein (e value: 1.2e-26; % identity: 52.67)
Query:  YSQRLSATDISALASMKNVAKCDTWCELQDPANHRVFERKLRPEPSGRGHVCLGVTHRCPPRNPLGLVAQVRAHAGLPCAPSC------------GWLKF
        Y +RLSATDISALASMKNVAKCDTWCELQ+P NHRVFERKLRP+P GRGHVCLGVTHRCP   P       +  AG   AP C            GW K 
Subjt:  YSQRLSATDISALASMKNVAKCDTWCELQDPANHRVFERKLRPEPSGRGHVCLGVTHRCPPRNPLGLVAQVRAHAGLPCAPSC------------GWLKF

Query:  ESSAPVVATLRWLIQPRYRVATSVRATPPASERGLLCRPSERRPKDDALD
        ESSA   A +    +     A+   A  P SE G LC P   R +    D
Subjt:  ESSAPVVATLRWLIQPRYRVATSVRATPPASERGLLCRPSERRPKDDALD

SwissProt top hits
No hits found
Arabidopsis top hits
No hits found

Sequences
CDS sequence
ATGCCTAAACATCAAACGACCCGCGAACGCGTTCACGAACTTTCTGTGCCTGGGGGGGAGCATCGTCTTGCACGCATGCACCCTCCCGGTGCCTCAACAAAACCCCGGCG
CAGGTCGCGCCAAGGAACTCAAACGAATTCGCCCGCCCCCCGCCCCGCTCGGTGTGCGGAGGGCGGAGCATTCTTGTCGTATTATTCACAACGACTCTCGGCAACGGATA
TCTCGGCTCTCGCATCGATGAAGAACGTAGCGAAATGCGATACTTGGTGTGAATTGCAGGATCCCGCGAACCACCGAGTCTTTGAACGCAAGTTGCGCCCGGAGCCTTCT
GGCCGAGGGCACGTCTGCCTGGGCGTCACGCATCGCTGCCCCCCACGCAACCCCCTTGGGTTGGTTGCGCAGGTGCGGGCACACGCTGGCCTCCCGTGCGCACCGTCGTG
CGGATGGCTTAAATTCGAGTCCTCGGCGCCTGTCGTCGCGACACTACGGTGGTTGATCCAACCTCGGTACCGCGTCGCGACCTCAGTCCGCGCAACTCCTCCTGCGAGCG
AGCGAGGACTTCTATGTCGACCCTCTGAACGTCGTCCCAAAGACGATGCTCTCGACGCGACCCCAGTCTGGAGAAGCGTCCTCAGCGGCGGACCGGGCACAAGTCCCCTG
GAAGGGGGCGCCGGAGAGGGTGAGAGCCCCGTTGCGCTCGGACCCTGTCGCACCACGAGGCGCTGTCAACGAGTCGGGTTGTTTGGGAATGCAGCCCCAATCGGGCGGTA
A
mRNA sequence
ATGCCTAAACATCAAACGACCCGCGAACGCGTTCACGAACTTTCTGTGCCTGGGGGGGAGCATCGTCTTGCACGCATGCACCCTCCCGGTGCCTCAACAAAACCCCGGCG
CAGGTCGCGCCAAGGAACTCAAACGAATTCGCCCGCCCCCCGCCCCGCTCGGTGTGCGGAGGGCGGAGCATTCTTGTCGTATTATTCACAACGACTCTCGGCAACGGATA
TCTCGGCTCTCGCATCGATGAAGAACGTAGCGAAATGCGATACTTGGTGTGAATTGCAGGATCCCGCGAACCACCGAGTCTTTGAACGCAAGTTGCGCCCGGAGCCTTCT
GGCCGAGGGCACGTCTGCCTGGGCGTCACGCATCGCTGCCCCCCACGCAACCCCCTTGGGTTGGTTGCGCAGGTGCGGGCACACGCTGGCCTCCCGTGCGCACCGTCGTG
CGGATGGCTTAAATTCGAGTCCTCGGCGCCTGTCGTCGCGACACTACGGTGGTTGATCCAACCTCGGTACCGCGTCGCGACCTCAGTCCGCGCAACTCCTCCTGCGAGCG
AGCGAGGACTTCTATGTCGACCCTCTGAACGTCGTCCCAAAGACGATGCTCTCGACGCGACCCCAGTCTGGAGAAGCGTCCTCAGCGGCGGACCGGGCACAAGTCCCCTG
GAAGGGGGCGCCGGAGAGGGTGAGAGCCCCGTTGCGCTCGGACCCTGTCGCACCACGAGGCGCTGTCAACGAGTCGGGTTGTTTGGGAATGCAGCCCCAATCGGGCGGTA
A
Protein sequence
MPKHQTTRERVHELSVPGGEHRLARMHPPGASTKPRRRSRQGTQTNSPAPRPARCAEGGAFLSYYSQRLSATDISALASMKNVAKCDTWCELQDPANHRVFERKLRPEPS
GRGHVCLGVTHRCPPRNPLGLVAQVRAHAGLPCAPSCGWLKFESSAPVVATLRWLIQPRYRVATSVRATPPASERGLLCRPSERRPKDDALDATPVWRSVLSGGPGTSPL
EGGAGEGESPVALGPCRTTRRCQRVGLFGNAAPIGR
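
The CDS and protein sequences listed above can be checked against each other by translating the CDS. The sketch below is illustrative only and is not taken from the database: the `translate` function is a hypothetical helper, and it assumes the standard genetic code applies to this gene. It translates the first 21 nucleotides of the CDS, which should reproduce the first seven residues of the protein.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
# Assumption: the standard code is appropriate for this nuclear gene model.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate a coding sequence; stop at the first stop codon if to_stop."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        # Codons with ambiguous bases (e.g. N) raise KeyError in this sketch.
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First 21 nucleotides of the CDS sequence listed above.
print(translate("ATGCCTAAACATCAAACGACC"))  # MPKHQTT
```

To check the full record, join the wrapped CDS lines into one string before calling `translate`; the result can then be compared with the joined protein lines.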