CuGenDBv2

Lag0022019 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0022019
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: zf-RVT domain-containing protein
Genome location: chr7:16009995..16011065
RNA-Seq Expression: Lag0022019
Synteny: Lag0022019
Gene Ontology terms:
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0008270 - zinc ion binding (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domains:
IPR000679 - Zinc finger, GATA-type
IPR026960 - Reverse transcriptase zinc-binding domain
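
As a small illustration of the genome location field in this record, the Python sketch below parses a `chromosome:start..end` string and reports the span length. The function names `parse_location` and `span_length` are hypothetical, and treating the coordinates as 1-based and inclusive is an assumption, not something stated on this page.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive, so both ends count.
    return abs(end - start) + 1


chrom, start, end = parse_location("chr7:16009995..16011065")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the gene spans 1071 bp on chr7.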


Homology
GenBank top hits (e value, % identity, alignment)
XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]; e value: 6.2e-31; % identity: 38.46
Query:  LHNGSQKRVGNGASIRFFKDPWIPKEVLFKPLGVYQATTQNEMMVADFITPSNGWDMEKLRRVVVEGDALLIQSISIGGRTEEDVWIWHYTKNGEYTTKS
        L  G + RVGNG++I+ F DPW+P+   FKPL         +  VA FIT    WD+  +       D  LI S+ I     +D W+WHY K G Y+ +S
Subjt:  LHNGSQKRVGNGASIRFFKDPWIPKEVLFKPLGVYQATTQNEMMVADFITPSNGWDMEKLRRVVVEGDALLIQSISIGGRTEEDVWIWHYTKNGEYTTKS

Query:  GYKLEAKLRNHQATTSSIAQRSWWTILWKTKVPPKIKLFVWRAYHDCLPTNYCLWRRGMNVSNLCNICRREKESIDHALCSC
        GYKL   L+ +  + S+  + + W  +WK  VP KIK+F+WR+ H+ +PT   L  RG+     C IC   +ESI HA   C
Subjt:  GYKLEAKLRNHQATTSSIAQRSWWTILWKTKVPPKIKLFVWRAYHDCLPTNYCLWRRGMNVSNLCNICRREKESIDHALCSC

XP_024956542.1 uncharacterized protein LOC112498908 [Citrus sinensis]; e value: 1.2e-29; % identity: 37.5
Query:  GSQKRVGNGASIRFFKDPWIPKEVLFKPLGVYQATTQNEMMVADFITPSNGWDMEKLRRVVVEGDALLIQSISIGGRTEEDVWIWHYTKNGEYTTKSGYK
        GS+ R+GNG  +   K  WIPK   FKP  V + T  +E +V++ I   N WD E + +   + DA +I  I +  R  ED  IWH+ K+G+YT KSGY+
Subjt:  GSQKRVGNGASIRFFKDPWIPKEVLFKPLGVYQATTQNEMMVADFITPSNGWDMEKLRRVVVEGDALLIQSISIGGRTEEDVWIWHYTKNGEYTTKSGYK

Query:  LEAKLRNHQATTSSIAQRSWWTILWKTKVPPKIKLFVWRAYHDCLPTNYCLWRRGMNVSNLCNICRREKESIDHALCSCGTLKE
           K+R     +SS + ++ W I+W   +P KI++FVWRA  + LP+   LW+R +     C +C+   E++ HAL  C   K+
Subjt:  LEAKLRNHQATTSSIAQRSWWTILWKTKVPPKIKLFVWRAYHDCLPTNYCLWRRGMNVSNLCNICRREKESIDHALCSCGTLKE

XP_030487384.1 uncharacterized protein LOC115704310 [Cannabis sativa]; e value: 2.0e-29; % identity: 35.98
Query:  LHNGSQKRVGNGASIRFFKDPWIPKEVLFKPLGVY-QATTQNEMMVADFITPSNGWDMEKLRRVVVEGDALLIQSISIGGRTEEDVWIWHYTKNGEYTTK
        +  G + R+GN  S+R  +D W+P+ + FK   +Y +    +++ V D   P   WD E +R V    DA LI S+       ED  +WHY+K+GEY+ +
Subjt:  LHNGSQKRVGNGASIRFFKDPWIPKEVLFKPLGVY-QATTQNEMMVADFITPSNGWDMEKLRRVVVEGDALLIQSISIGGRTEEDVWIWHYTKNGEYTTK

Query:  SGYKLEAKLRNHQATTSSIAQRSWWTILWKTKVPPKIKLFVWRAYHDCLPTNYCLWRRGMNVSNLCNICRR-EKESIDHALCSCGTLKE
        SGY++ A L+     + + A   WW +LWK K+PPK+K FVW+  H  +PTN  L  R + V   C+ C   E E++ H L +C   +E
Subjt:  SGYKLEAKLRNHQATTSSIAQRSWWTILWKTKVPPKIKLFVWRAYHDCLPTNYCLWRRGMNVSNLCNICRR-EKESIDHALCSCGTLKE

XP_030502765.1 uncharacterized protein LOC115717936 [Cannabis sativa]; e value: 1.5e-29; % identity: 38.17
Query:  LHNGSQKRVGNGASIRFFKDPWIPKEVLFKPLGVYQATTQNEMMVADFITPSNGWDMEKLRRVVVEGDALLIQSISIGGRTEEDVWIWHYTKNGEYTTKS
        L  G + RVG+G +I    D W+P    FKP   ++    N +MVAD I+    WD+  L+    + D   I SI +     +DV IW ++  G Y  KS
Subjt:  LHNGSQKRVGNGASIRFFKDPWIPKEVLFKPLGVYQATTQNEMMVADFITPSNGWDMEKLRRVVVEGDALLIQSISIGGRTEEDVWIWHYTKNGEYTTKS

Query:  GYKLEAKLRNHQATTSSIAQRSWWTILWKTKVPPKIKLFVWRAYHDCLPTNYCLWRRGMNVSNLCNICRREKESIDHALCSCGTLK
        GY+L   L     T SS +   WW+  WK K+PPK+++FVW+ +H  LP    L+RR +  S  C IC   +ESI HAL SC   K
Subjt:  GYKLEAKLRNHQATTSSIAQRSWWTILWKTKVPPKIKLFVWRAYHDCLPTNYCLWRRGMNVSNLCNICRREKESIDHALCSCGTLK

XP_030509050.1 uncharacterized protein LOC115723712 [Cannabis sativa]; e value: 8.9e-30; % identity: 39.04
Query:  LHNGSQKRVGNGASIRFFKDPWIPKEVLFKPLGVYQATTQNEMMVADFITPSNGWDMEKLRRVVVEGDALLIQSISIGGRTEEDVWIWHYTKNGEYTTKS
        L  G + R+G+GA +     PWIP    FKPL     +   ++ VADFIT S  WD+ KL++     D   I SI +     EDV +WHY+  G YT KS
Subjt:  LHNGSQKRVGNGASIRFFKDPWIPKEVLFKPLGVYQATTQNEMMVADFITPSNGWDMEKLRRVVVEGDALLIQSISIGGRTEEDVWIWHYTKNGEYTTKS

Query:  GYKLEAKLRNHQATTSSIAQRSWWTILWKTKVPPKIKLFVWRAYHDCLPTNYCLWRRGMNVSNLCNICRREKESIDHALCSCGTLKE
        GYKL + + + Q  +S   + +WW   W  K+P KI++F WRAYH+ LPT   L  R ++ S  C +C+   E+I+HA   C   K+
Subjt:  GYKLEAKLRNHQATTSSIAQRSWWTILWKTKVPPKIKLFVWRAYHDCLPTNYCLWRRGMNVSNLCNICRREKESIDHALCSCGTLKE

TrEMBL top hits (e value, % identity, alignment)
A0A6J1DX30 uncharacterized protein LOC111024874; e value: 3.0e-31; % identity: 38.46
Query:  LHNGSQKRVGNGASIRFFKDPWIPKEVLFKPLGVYQATTQNEMMVADFITPSNGWDMEKLRRVVVEGDALLIQSISIGGRTEEDVWIWHYTKNGEYTTKS
        L  G + RVGNG++I+ F DPW+P+   FKPL         +  VA FIT    WD+  +       D  LI S+ I     +D W+WHY K G Y+ +S
Subjt:  LHNGSQKRVGNGASIRFFKDPWIPKEVLFKPLGVYQATTQNEMMVADFITPSNGWDMEKLRRVVVEGDALLIQSISIGGRTEEDVWIWHYTKNGEYTTKS

Query:  GYKLEAKLRNHQATTSSIAQRSWWTILWKTKVPPKIKLFVWRAYHDCLPTNYCLWRRGMNVSNLCNICRREKESIDHALCSC
        GYKL   L+ +  + S+  + + W  +WK  VP KIK+F+WR+ H+ +PT   L  RG+     C IC   +ESI HA   C
Subjt:  GYKLEAKLRNHQATTSSIAQRSWWTILWKTKVPPKIKLFVWRAYHDCLPTNYCLWRRGMNVSNLCNICRREKESIDHALCSC

A0A803NGM9 Uncharacterized protein; e value: 2.7e-32; % identity: 41.27
Query:  LHNGSQKRVGNGASIRFFKDPWIPKEVLFKPLGVYQATTQNE-MMVADFITPSNGWDMEKLRRVVVEGDALLIQSISIGGRTEEDVWIWHYTKNGEYTTK
        ++NG + RVGNG ++R  +DPW+P+ V FK   +Y      E + VAD       WD   +R V    DA LI S+   G   ED  +WHY+KNGEYT K
Subjt:  LHNGSQKRVGNGASIRFFKDPWIPKEVLFKPLGVYQATTQNE-MMVADFITPSNGWDMEKLRRVVVEGDALLIQSISIGGRTEEDVWIWHYTKNGEYTTK

Query:  SGYKLEAKLRNHQATTSSIAQRSWWTILWKTKVPPKIKLFVWRAYHDCLPTNYCLWRRGMNVSNLCNICR-REKESIDHALCSCGTLKE
        SGYK+   L + Q  +       WW  LW+ K+PPKIK FVW+  ++ +PTN  L +RG++V N+C  C     E+  HAL  C   KE
Subjt:  SGYKLEAKLRNHQATTSSIAQRSWWTILWKTKVPPKIKLFVWRAYHDCLPTNYCLWRRGMNVSNLCNICR-REKESIDHALCSCGTLKE

A0A803PD49 Uncharacterized protein; e value: 4.3e-30; % identity: 40.68
Query:  RVGNGASIRFFKDPWIPKEVLFKPLGVY-QATTQNEMMVADFITPSNGWDMEKLRRVVVEGDALLIQSISIGGRTEEDVWIWHYTKNGEYTTKSGYKLEA
        R+G+G  +R  +DPW+P+ V FK   +Y Q    ++++V D    +  WD E  R V  + DA +I +I   G   +D  +WHYTKNGEYT KSGY++ +
Subjt:  RVGNGASIRFFKDPWIPKEVLFKPLGVY-QATTQNEMMVADFITPSNGWDMEKLRRVVVEGDALLIQSISIGGRTEEDVWIWHYTKNGEYTTKSGYKLEA

Query:  KLRNHQATTSSIAQRSWWTILWKTKVPPKIKLFVWRAYHDCLPTNYCLWRRGMNVSNLCNICRRE-KESIDHALCSC
        +LR  +  +       WW  LW+ K+PPKIK FVW+  +  LPTN  L  R +  S++C  C +E  ESI HAL  C
Subjt:  KLRNHQATTSSIAQRSWWTILWKTKVPPKIKLFVWRAYHDCLPTNYCLWRRGMNVSNLCNICRRE-KESIDHALCSC

A0A803PM68 Uncharacterized protein; e value: 5.1e-31; % identity: 39.67
Query:  LHNGSQKRVGNGASIRFFKDPWIPKEVLFKPLGVYQATTQNE-MMVADFITPSNGWDMEKLRRVVVEGDALLIQSISIGGRTEEDVWIWHYTKNGEYTTK
        +  G + R+GNG S+R  +DPW+P+ V FK   VY   +  E M V D +  +  WD E +R      DA LI  +       ED  +WHY+KNGEY+ +
Subjt:  LHNGSQKRVGNGASIRFFKDPWIPKEVLFKPLGVYQATTQNE-MMVADFITPSNGWDMEKLRRVVVEGDALLIQSISIGGRTEEDVWIWHYTKNGEYTTK

Query:  SGYKLEAKLRNHQATTSSIAQRSWWTILWKTKVPPKIKLFVWRAYHDCLPTNYCLWRRGMNVSNLCNICRR-EKESIDHALCSC
        SGY++ A L+ H   +++ A   WW +LWK K+PPK+K FVW+  H  LPTN  L  R + V   C  C     E++ HAL  C
Subjt:  SGYKLEAKLRNHQATTSSIAQRSWWTILWKTKVPPKIKLFVWRAYHDCLPTNYCLWRRGMNVSNLCNICRR-EKESIDHALCSC

A0A803QFX1 Uncharacterized protein; e value: 2.4e-33; % identity: 41.11
Query:  QKRVGNGASIRFFKDPWIPKEVLFKPLGVYQATTQNEMMVADFITPSNGWDMEKLRRVVVEGDALLIQSISIGGRTEEDVWIWHYTKNGEYTTKSGYKLE
        +K VG+G SI+ F+DPWIP+   F+P+    A     +MV + I  S  WD+  L    +  D   I SI +     ED W WHYT NG YT KSGY + 
Subjt:  QKRVGNGASIRFFKDPWIPKEVLFKPLGVYQATTQNEMMVADFITPSNGWDMEKLRRVVVEGDALLIQSISIGGRTEEDVWIWHYTKNGEYTTKSGYKLE

Query:  AKLRNHQATTSSIAQRSWWTILWKTKVPPKIKLFVWRAYHDCLPTNYCLWRRGMNVSNLCNICRREKESIDHALCSCGTL
          L   +   SS+ Q SWW  LWK K+P K+K+F+WR YH  LP N  L +R + V  +C+ C    E+  HAL  C +L
Subjt:  AKLRNHQATTSSIAQRSWWTILWKTKVPPKIKLFVWRAYHDCLPTNYCLWRRGMNVSNLCNICRREKESIDHALCSCGTL

SwissProt top hits (e value, % identity, alignment)
P0C2F6 Putative ribonuclease H protein At1g65750; e value: 2.9e-15; % identity: 25.86
Query:  GNGASIRFFKDPWIPKEVLFKPLGVYQATTQNEMMVADFITPSNGWDMEKLRRVVVEGDALLIQSISIGGRT-EEDVWIWHYTKNGEYTTKSGYKLEAKL
        G+G  IRF+ D W+  + L +     + T  + ++  D   P  GWD  K+         L ++++ +   T   D   W ++++G+++ +S Y++   L
Subjt:  GNGASIRFFKDPWIPKEVLFKPLGVYQATTQNEMMVADFITPSNGWDMEKLRRVVVEGDALLIQSISIGGRT-EEDVWIWHYTKNGEYTTKSGYKLEAKL

Query:  RNHQATTSSIAQRSWWTILWKTKVPPKIKLFVWRAYHDCLPTNYCLWRRGMNVSNLCNICRREKESIDHALCSC
           +    ++A  S++  LWK +VP ++K F+W   +  + T     RR ++ SN+C +C+   ES+ H L  C
Subjt:  RNHQATTSSIAQRSWWTILWKTKVPPKIKLFVWRAYHDCLPTNYCLWRRGMNVSNLCNICRREKESIDHALCSC

Arabidopsis top hits (e value, % identity, alignment)
AT2G02650.1 Ribonuclease H-like superfamily protein; e value: 7.3e-06; % identity: 39.29
Query:  LWKTKVPPKIKLFVWRAYHDCLPTNYCLWRRGMNVSNLCNICRREKESIDHALCSC
        +WK  V PKIK F+WR     L TN  L  R ++   +C  C  E+E+I H + +C
Subjt:  LWKTKVPPKIKLFVWRAYHDCLPTNYCLWRRGMNVSNLCNICRREKESIDHALCSC

AT2G22440.1 BEST Arabidopsis thaliana protein match is: Ribonuclease H-like superfamily protein (TAIR:AT4G29090.1); e value: 6.6e-07; % identity: 34.09
Query:  RFFKDPWIPKEVLFKPLGVYQATTQNEMMVADFITP-SNGWDMEKLRRVVVEGDALLIQSISIGGRTEEDVWIWHYTKNGEYTTKSGY
        R +KDPWIP  +L +P         + + V D I   +N W +++L+ ++   D  LI  I        D + W +TK+G YT KSGY
Subjt:  RFFKDPWIPKEVLFKPLGVYQATTQNEMMVADFITP-SNGWDMEKLRRVVVEGDALLIQSISIGGRTEEDVWIWHYTKNGEYTTKSGY

AT3G09510.1 Ribonuclease H-like superfamily protein; e value: 1.3e-18; % identity: 31.02
Query:  LHNGSQKRVGNGASIRFFKDPWIPKEVLFKPLGVYQATTQNEMMVADFITPSNG---WDMEKLRRVVVEGDALLIQSISIGGRTEEDVWIWHYTKNGEYT
        L  G++  +G+G +IR   D  +       P  +    T  EM + +          WD  K+ + V + D   I  I +    + D  IW+Y   GEYT
Subjt:  LHNGSQKRVGNGASIRFFKDPWIPKEVLFKPLGVYQATTQNEMMVADFITPSNG---WDMEKLRRVVVEGDALLIQSISIGGRTEEDVWIWHYTKNGEYT

Query:  TKSGYKL--EAKLRNHQATTSSIAQRSWWTILWKTKVPPKIKLFVWRAYHDCLPTNYCLWRRGMNVSNLCNICRREKESIDHALCSC
         +SGY L       N  A           T +W   + PK+K F+WRA    L T   L  RGM +   C  C RE ESI+HAL +C
Subjt:  TKSGYKL--EAKLRNHQATTSSIAQRSWWTILWKTKVPPKIKLFVWRAYHDCLPTNYCLWRRGMNVSNLCNICRREKESIDHALCSC

AT3G26855.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein; e value: 1.9e-06; % identity: 36.07
Query:  SWWTILWKTKVPPKIKLFVWRAYHDCLPTNYCLWRRGMNVSNLCNICRREKESIDHALCSC
        +W   +W  K+ PKIKL +W+A ++ LP    L  R +++   C  C R+ E+I H L +C
Subjt:  SWWTILWKTKVPPKIKLFVWRAYHDCLPTNYCLWRRGMNVSNLCNICRREKESIDHALCSC

AT4G29090.1 Ribonuclease H-like superfamily protein; e value: 1.3e-18; % identity: 26.77
Query:  YRTHSYLHNGSQKRVGNGASIRFFKDPWIPKEVLFKPLGVYQ------ATTQNEMMVADFITPS-NGWDMEKLRRVVVEGDALLIQSISIGGRTEEDVWI
        + +   L  G++  VGNG  I  ++  W+  +     L + +      A+  + + V+D I  S   W  + +  +  E +  LI  +  GGR   D + 
Subjt:  YRTHSYLHNGSQKRVGNGASIRFFKDPWIPKEVLFKPLGVYQ------ATTQNEMMVADFITPS-NGWDMEKLRRVVVEGDALLIQSISIGGRTEEDVWI

Query:  WHYTKNGEYTTKSGYKLEAKLRNHQATTSSIAQRSWWTI---LWKTKVPPKIKLFVWRAYHDCLPTNYCLWRRGMNVSNLCNICRREKESIDHALCSC
        W YT +G+YT KSGY +  ++ N +++   +++ S   I   +WK++  PKI+ F+W+   + LP    L  R ++  + C  C   KE+++H L  C
Subjt:  WHYTKNGEYTTKSGYKLEAKLRNHQATTSSIAQRSWWTI---LWKTKVPPKIKLFVWRAYHDCLPTNYCLWRRGMNVSNLCNICRREKESIDHALCSC


Sequences
CDS sequence
ATGGCTCTGATACCACTTGTAAGGGACTACCGGACACACTCATATCTCCACAATGGCAGTCAGAAACGTGTGGGGAATGGTGCTTCAATACGTTTTTTCAAGGATCCATG
GATTCCTAAAGAAGTACTTTTTAAACCCTTGGGTGTTTACCAGGCCACTACACAGAATGAGATGATGGTAGCGGACTTTATAACTCCGTCGAACGGCTGGGACATGGAGA
AGCTTCGAAGGGTGGTCGTTGAAGGGGATGCTTTATTAATACAATCAATTTCGATTGGTGGGCGAACTGAAGAGGATGTGTGGATATGGCACTACACAAAAAATGGCGAA
TATACTACGAAGAGTGGATATAAGTTGGAAGCGAAATTGCGTAATCACCAAGCAACGACGAGTTCTATTGCTCAACGCTCTTGGTGGACTATATTATGGAAGACTAAGGT
GCCCCCGAAGATTAAATTATTCGTTTGGAGGGCATACCACGATTGCTTACCAACAAATTATTGTCTTTGGAGACGTGGCATGAATGTATCAAATTTGTGCAATATATGCC
GTAGAGAGAAGGAAAGTATTGATCATGCTCTTTGTAGTTGCGGAACGCTCAAAGAAAATTTGTGA
mRNA sequence
ATGGCTCTGATACCACTTGTAAGGGACTACCGGACACACTCATATCTCCACAATGGCAGTCAGAAACGTGTGGGGAATGGTGCTTCAATACGTTTTTTCAAGGATCCATG
GATTCCTAAAGAAGTACTTTTTAAACCCTTGGGTGTTTACCAGGCCACTACACAGAATGAGATGATGGTAGCGGACTTTATAACTCCGTCGAACGGCTGGGACATGGAGA
AGCTTCGAAGGGTGGTCGTTGAAGGGGATGCTTTATTAATACAATCAATTTCGATTGGTGGGCGAACTGAAGAGGATGTGTGGATATGGCACTACACAAAAAATGGCGAA
TATACTACGAAGAGTGGATATAAGTTGGAAGCGAAATTGCGTAATCACCAAGCAACGACGAGTTCTATTGCTCAACGCTCTTGGTGGACTATATTATGGAAGACTAAGGT
GCCCCCGAAGATTAAATTATTCGTTTGGAGGGCATACCACGATTGCTTACCAACAAATTATTGTCTTTGGAGACGTGGCATGAATGTATCAAATTTGTGCAATATATGCC
GTAGAGAGAAGGAAAGTATTGATCATGCTCTTTGTAGTTGCGGAACGCTCAAAGAAAATTTGTGA
Protein sequence
MALIPLVRDYRTHSYLHNGSQKRVGNGASIRFFKDPWIPKEVLFKPLGVYQATTQNEMMVADFITPSNGWDMEKLRRVVVEGDALLIQSISIGGRTEEDVWIWHYTKNGE
YTTKSGYKLEAKLRNHQATTSSIAQRSWWTILWKTKVPPKIKLFVWRAYHDCLPTNYCLWRRGMNVSNLCNICRREKESIDHALCSCGTLKENL
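
The CDS and protein sequences in this record can be cross-checked by translating the CDS with the standard genetic code. The Python sketch below does this for two excerpts only: the first 60 and the last 24 nucleotides of the CDS as listed here. The helper name `translate` is hypothetical, and using the standard code (NCBI translation table 1) is an assumption that seems reasonable for a nuclear plant gene.

```python
# Standard genetic code (NCBI translation table 1), built in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(cds):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    for i in range(0, usable, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3].upper()]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# Excerpts copied from the CDS sequence in this record (both are in frame).
cds_start = "ATGGCTCTGATACCACTTGTAAGGGACTACCGGACACACTCATATCTCCACAATGGCAGT"
cds_end = "GGAACGCTCAAAGAAAATTTGTGA"

print(translate(cds_start))  # compare with the start of the protein sequence
print(translate(cds_end))    # compare with the end of the protein sequence
```

The two translated excerpts can be compared with the first 20 and last 7 residues of the protein sequence; running the same function on the full CDS extends the check to the whole record.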