; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0022207 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0022207
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Descriptionzf-RVT domain-containing protein
Genome locationchr7:21179022..21180147
RNA-Seq ExpressionLag0022207
SyntenyLag0022207
Gene Ontology termsGO:0016020 - membrane (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR026960 - Reverse transcriptase zinc-binding domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAB2636102.1 hypothetical protein D8674_026636 [Pyrus ussuriensis x Pyrus communis]8.9e-1133.12Show/hide
Query:  SSNSDRMCVWWKSLWKLNVPSKMKFFLWRLFHDRLPT-----KGISDSTIRCGGWTSP----------IGGIEVAQQMGFSSFGVELDSLKLIDVLRDEV
        SSN++ + + WK LW  NVP+K+K   W++  D +PT     K  ++S + C    +P            G  +  Q GF +F +E DSLK++  L D  
Subjt:  SSNSDRMCVWWKSLWKLNVPSKMKFFLWRLFHDRLPT-----KGISDSTIRCGGWTSP----------IGGIEVAQQMGFSSFGVELDSLKLIDVLRDEV

Query:  TALSEIGFLMAEVRRLLQGISVENFLFTPRQGNKVAHVLANFAFV-YSDCVWLE
        T LS IG ++ + + LL  ++        RQ N  AH LA +A +  S+C+W E
Subjt:  TALSEIGFLMAEVRRLLQGISVENFLFTPRQGNKVAHVLANFAFV-YSDCVWLE

RXH82922.1 hypothetical protein DVH24_003420 [Malus domestica]2.0e-1032.02Show/hide
Query:  WKSLWKLNVPSKMKFFLWRLFHDRLPTK--------GISDSTIRCGG-------------------WTSPIG---------GIEVAQQMGFSSFGVELDS
        WK LWK N+P K+K F W   HD LPTK        G+  +   C G                   ++SP G          ++ AQ  GF    VE DS
Subjt:  WKSLWKLNVPSKMKFFLWRLFHDRLPTK--------GISDSTIRCGG-------------------WTSPIG---------GIEVAQQMGFSSFGVELDS

Query:  LKLIDVLRDEVTALSEIGFLMAEVRRLLQGISVENFLFTPRQGNKVAHVLANFAFVYS-DCVWLEDWPCEVSGVLMGD
         K+I VL       S +G +  +V +L    S   F+   R  N VAH LA FA   S + VW+E+ P  +  +L+ D
Subjt:  LKLIDVLRDEVTALSEIGFLMAEVRRLLQGISVENFLFTPRQGNKVAHVLANFAFVYS-DCVWLEDWPCEVSGVLMGD

XP_030509135.1 uncharacterized protein LOC115723805 [Cannabis sativa]1.2e-0753.45Show/hide
Query:  SFTVRSGYRLALSMVTQTRPSSSNSDRMCVWWKSLWKLNVPSKMKFFLWRLFHDRLPT
        S+TV SGY LA + + Q  PSSS+S     WWK LW L +P K+K FLWR+ +D LPT
Subjt:  SFTVRSGYRLALSMVTQTRPSSSNSDRMCVWWKSLWKLNVPSKMKFFLWRLFHDRLPT

XP_042952134.1 uncharacterized protein LOC122289223 [Carya illinoinensis]1.2e-0725.74Show/hide
Query:  FTVRSGYRLALSMVT-QTRPSSSNSDRMCVWWKSLWKLNVPSKMKFFLWRLFHDRLPT-------KGISDSTIRCGGWTSPIG-----------------
        ++V+S Y+  L M T + R   S++    + WK LWKLN+  +++ F WR   + LPT       K I ++  +C    S IG                 
Subjt:  FTVRSGYRLALSMVT-QTRPSSSNSDRMCVWWKSLWKLNVPSKMKFFLWRLFHDRLPT-------KGISDSTIRCGGWTSPIG-----------------

Query:  -----------------GIEVAQQMGFSSFGVELDSLKLIDVLRDEVTALSEIGFLMAEVRRLLQGISVENFLFTPRQGNKVAHVLANFAFVYSD-CVWL
                         GI++   +G     +E+D+L +I+ L+ E  + +    L+ E++ LL  +   +  +  R+ N VAH LA FA+   D  +W 
Subjt:  -----------------GIEVAQQMGFSSFGVELDSLKLIDVLRDEVTALSEIGFLMAEVRRLLQGISVENFLFTPRQGNKVAHVLANFAFVYSD-CVWL

Query:  ED
         D
Subjt:  ED

XP_042969199.1 uncharacterized protein LOC122301911 [Carya illinoinensis]1.5e-1024.42Show/hide
Query:  SFTVRSGYRLALSMVTQTRPSSSNSDRMCVWWKSLWKLNVPSKMKFFLWRLFHDRLPT------KGISDS----------------------TIRCGGWT
        SF+V+S Y         +   SS+ +   V+WKSLW L +P KMK F W+   ++LPT      K + D                        + C    
Subjt:  SFTVRSGYRLALSMVTQTRPSSSNSDRMCVWWKSLWKLNVPSKMKFFLWRLFHDRLPT------KGISDS----------------------TIRCGGWT

Query:  SPIG------------GIEVAQQMGFSSFGVELDSLKLIDVLRDEVTALSEIGFLMAEVRRLLQGISVENFLFTPRQGNKVAHVLANFAFVYSD-CVWLE
          +             G+++  Q G     ++ D L L++ L +    L++  F++ ++RRL+        +   R GN VAH+LA   ++ +D C+W +
Subjt:  SPIG------------GIEVAQQMGFSSFGVELDSLKLIDVLRDEVTALSEIGFLMAEVRRLLQGISVENFLFTPRQGNKVAHVLANFAFVYSD-CVWLE

Query:  DWPCEVSGVLMGDATLI
          P  +S  +  D   I
Subjt:  DWPCEVSGVLMGDATLI

TrEMBL top hitse value%identityAlignment
A0A2N9IYL5 RNase H domain-containing protein3.4e-0823.79Show/hide
Query:  FTVRSGYRLALSMVTQTRPSSSNSDRMCVWWKSLWKLNVPSKMKFFLWRLFHDRLPTKG------------------ISDSTIRC---------------
        ++VRSGY+  L+   +  P SS+  RM   WKS+W LNVP K++ FLWR  H+ LPTK                    S+STI                 
Subjt:  FTVRSGYRLALSMVTQTRPSSSNSDRMCVWWKSLWKLNVPSKMKFFLWRLFHDRLPTKG------------------ISDSTIRC---------------

Query:  ------------GGWTSPIGG-------------------------------------------------------IEVAQQMGFSSFGVELDSLKLIDV
                      W  P  G                                                       I+ A+ +GF+ F +E DS  ++D 
Subjt:  ------------GGWTSPIGG-------------------------------------------------------IEVAQQMGFSSFGVELDSLKLIDV

Query:  LRDEVTALSEIGFLMAEVRRLLQGISVENFLFTPRQGNKVAHVLANFA
        L       +  G ++ +++++ Q +    FL T R+GN +AH+LA  A
Subjt:  LRDEVTALSEIGFLMAEVRRLLQGISVENFLFTPRQGNKVAHVLANFA

A0A498ILS0 Uncharacterized protein9.6e-1132.02Show/hide
Query:  WKSLWKLNVPSKMKFFLWRLFHDRLPTK--------GISDSTIRCGG-------------------WTSPIG---------GIEVAQQMGFSSFGVELDS
        WK LWK N+P K+K F W   HD LPTK        G+  +   C G                   ++SP G          ++ AQ  GF    VE DS
Subjt:  WKSLWKLNVPSKMKFFLWRLFHDRLPTK--------GISDSTIRCGG-------------------WTSPIG---------GIEVAQQMGFSSFGVELDS

Query:  LKLIDVLRDEVTALSEIGFLMAEVRRLLQGISVENFLFTPRQGNKVAHVLANFAFVYS-DCVWLEDWPCEVSGVLMGD
         K+I VL       S +G +  +V +L    S   F+   R  N VAH LA FA   S + VW+E+ P  +  +L+ D
Subjt:  LKLIDVLRDEVTALSEIGFLMAEVRRLLQGISVENFLFTPRQGNKVAHVLANFAFVYS-DCVWLEDWPCEVSGVLMGD

A0A5N5IC60 Uncharacterized protein4.3e-1133.12Show/hide
Query:  SSNSDRMCVWWKSLWKLNVPSKMKFFLWRLFHDRLPT-----KGISDSTIRCGGWTSP----------IGGIEVAQQMGFSSFGVELDSLKLIDVLRDEV
        SSN++ + + WK LW  NVP+K+K   W++  D +PT     K  ++S + C    +P            G  +  Q GF +F +E DSLK++  L D  
Subjt:  SSNSDRMCVWWKSLWKLNVPSKMKFFLWRLFHDRLPT-----KGISDSTIRCGGWTSP----------IGGIEVAQQMGFSSFGVELDSLKLIDVLRDEV

Query:  TALSEIGFLMAEVRRLLQGISVENFLFTPRQGNKVAHVLANFAFV-YSDCVWLE
        T LS IG ++ + + LL  ++        RQ N  AH LA +A +  S+C+W E
Subjt:  TALSEIGFLMAEVRRLLQGISVENFLFTPRQGNKVAHVLANFAFV-YSDCVWLE

A0A803Q328 Uncharacterized protein4.5e-0851.72Show/hide
Query:  SFTVRSGYRLALSMVTQTRPSSSNSDRMCVWWKSLWKLNVPSKMKFFLWRLFHDRLPT
        S+ V SGY LA   +  T PS SN+     WWK+LW L++P K+K FLWR+ HD LPT
Subjt:  SFTVRSGYRLALSMVTQTRPSSSNSDRMCVWWKSLWKLNVPSKMKFFLWRLFHDRLPT

M5W2B9 Uncharacterized protein (Fragment)1.8e-0927.72Show/hide
Query:  PVVPYSF-----TVRSGYRLALSMVTQTRP-SSSNSDRMCVWWKSLWKLNVPSKMKFFLWRLFHDRLPT------KGISDSTI--RCGGWTSP-------
        P+ PY F     +V++GYR+ALS   +      S+S     +WK LWKL VPSK+K  +WR   D L T      + +  S +  +CG +          
Subjt:  PVVPYSF-----TVRSGYRLALSMVTQTRP-SSSNSDRMCVWWKSLWKLNVPSKMKFFLWRLFHDRLPT------KGISDSTI--RCGGWTSP-------

Query:  -------------------IGGIEVAQQ---------MGFSSFGVELDSLKLIDVLRDEVTALSEIGFLMAEVRRLLQGISVENFLFTPRQGNKVAHVLA
                           + GIE+             G++ F +  DS +++ +L+  +   S +G ++ ++RRL+    V + +F PR GN  AH LA
Subjt:  -------------------IGGIEVAQQ---------MGFSSFGVELDSLKLIDVLRDEVTALSEIGFLMAEVRRLLQGISVENFLFTPRQGNKVAHVLA

Query:  NF
         F
Subjt:  NF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCCAAAGTGGACAATATCATACCATTGTGGAGATCTATGTCGCGTCCGGTTGTCCCTTATAGTTTCACTGTCAGGAGTGGGTACCGGTTGGCTCTATCAATGGTTAC
TCAAACACGTCCCTCCTCATCTAACTCTGACCGTATGTGTGTTTGGTGGAAGAGCCTGTGGAAGCTGAATGTTCCAAGCAAGATGAAGTTTTTTCTATGGCGGTTGTTCC
ATGATCGCTTGCCGACGAAGGGGATTTCGGATAGTACAATTCGATGTGGTGGTTGGACGTCGCCAATAGGTGGCATCGAGGTTGCACAGCAAATGGGGTTTTCTAGCTTT
GGTGTGGAGTTGGATTCGTTGAAGTTGATTGATGTGTTGCGCGACGAGGTGACTGCTTTGTCTGAAATTGGGTTCTTGATGGCTGAGGTCCGACGGTTGCTGCAGGGTAT
CTCAGTGGAGAATTTTTTGTTTACACCACGACAAGGGAATAAGGTGGCTCATGTGTTGGCCAATTTTGCTTTTGTTTACTCTGATTGTGTTTGGCTTGAGGATTGGCCTT
GTGAAGTCTCTGGTGTACTGATGGGTGATGCCACCTTAATCTAA
mRNA sequenceShow/hide mRNA sequence
ATGCCCAAAGTGGACAATATCATACCATTGTGGAGATCTATGTCGCGTCCGGTTGTCCCTTATAGTTTCACTGTCAGGAGTGGGTACCGGTTGGCTCTATCAATGGTTAC
TCAAACACGTCCCTCCTCATCTAACTCTGACCGTATGTGTGTTTGGTGGAAGAGCCTGTGGAAGCTGAATGTTCCAAGCAAGATGAAGTTTTTTCTATGGCGGTTGTTCC
ATGATCGCTTGCCGACGAAGGGGATTTCGGATAGTACAATTCGATGTGGTGGTTGGACGTCGCCAATAGGTGGCATCGAGGTTGCACAGCAAATGGGGTTTTCTAGCTTT
GGTGTGGAGTTGGATTCGTTGAAGTTGATTGATGTGTTGCGCGACGAGGTGACTGCTTTGTCTGAAATTGGGTTCTTGATGGCTGAGGTCCGACGGTTGCTGCAGGGTAT
CTCAGTGGAGAATTTTTTGTTTACACCACGACAAGGGAATAAGGTGGCTCATGTGTTGGCCAATTTTGCTTTTGTTTACTCTGATTGTGTTTGGCTTGAGGATTGGCCTT
GTGAAGTCTCTGGTGTACTGATGGGTGATGCCACCTTAATCTAA
Protein sequenceShow/hide protein sequence
MPKVDNIIPLWRSMSRPVVPYSFTVRSGYRLALSMVTQTRPSSSNSDRMCVWWKSLWKLNVPSKMKFFLWRLFHDRLPTKGISDSTIRCGGWTSPIGGIEVAQQMGFSSF
GVELDSLKLIDVLRDEVTALSEIGFLMAEVRRLLQGISVENFLFTPRQGNKVAHVLANFAFVYSDCVWLEDWPCEVSGVLMGDATLI