CuGenDBv2

Lag0036594 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0036594
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: zf-RVT domain-containing protein
Genome location: chr3:48997214..48998617
RNA-Seq Expression: Lag0036594
Synteny: Lag0036594
Gene Ontology terms: GO:0005488 - binding (molecular function)
InterPro domains: IPR026960 - Reverse transcriptase zinc-binding domain
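
The genome location above can be split into its parts and turned into a span length with a few lines of Python. This is a minimal sketch, not part of the database record: the helper name `parse_location` is hypothetical, and the 1-based inclusive coordinate convention is an assumption that the page does not state.

```python
import re


def parse_location(location: str):
    """Split a 'chrom:start..end' string into (chrom, start, end, span_bp)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    # Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
    return chrom, start, end, end - start + 1


print(parse_location("chr3:48997214..48998617"))
```

Under that assumption the gene spans 1404 bp on chr3.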


Homology
GenBank top hits (e value, % identity, alignment)
GAU30519.1 hypothetical protein TSUD_65290 [Trifolium subterraneum] (e value: 2.5e-29; % identity: 30)
Query:  YEGDLGQCINLEKSRICFSKNVPEDTKTYLSSILQMKSVTELGLYLGLPAYFHRSRTKDFKSILDRVWLYLQD---------------------------
        YE   GQ +NL KS +  S+N+ +  K  LS IL +K V   G+YLGLP    RS+   F  I DR+W  +                             
Subjt:  YEGDLGQCINLEKSRICFSKNVPEDTKTYLSSILQMKSVTELGLYLGLPAYFHRSRTKDFKSILDRVWLYLQD---------------------------

Query:  ----------------IEEIIRIPINLSSL--DKWIWHYDKFGCYTIKSGYKMGMVLAAEASSSSSVGISGWWKGLWKLNIPNKVKVFIWRSFNLCLPCM
                         E+I+  P+ +SS+  DK +W  ++ GCY++KSGYK+ M        S    + G W G+WK   P+K +  +WR    CLP  
Subjt:  ----------------IEEIIRIPINLSSL--DKWIWHYDKFGCYTIKSGYKMGMVLAAEASSSSSVGISGWWKGLWKLNIPNKVKVFIWRSFNLCLPCM

Query:  VNLRKHHVPIEVMCPCCRDDFEDMVHAFFLYSRPKEVWNKLGLWEVT-GADFLR-EVQDRWIHICNSVSTSTLERICVSAWAIWNDCNNL
          L +  V   + CP C ++ ED +H FF  +  ++ W+  GL  V   A + +  V DR   +CN+ S+ T+ R+ +  W IW++ + L
Subjt:  VNLRKHHVPIEVMCPCCRDDFEDMVHAFFLYSRPKEVWNKLGLWEVT-GADFLR-EVQDRWIHICNSVSTSTLERICVSAWAIWNDCNNL

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia] (e value: 2.4e-27; % identity: 32.98)
Query:  QDIEEIIRIPINLSSL-DKWIWHYDKFGCYTIKSGYKMGMVLAAEASSSSSVGISGWWKGLWKLNIPNKVKVFIWRSFNLCLPCMVNLRKHHVPIEVMCP
        +D + I+ +PI+  +L D W+WHYDK G Y+++SGYK+ M L   A+S+S+      W  +WKL +P K+K+FIWRS +  +P   NL    +     C 
Subjt:  QDIEEIIRIPINLSSL-DKWIWHYDKFGCYTIKSGYKMGMVLAAEASSSSSVGISGWWKGLWKLNIPNKVKVFIWRSFNLCLPCMVNLRKHHVPIEVMCP

Query:  CCRDDFEDMVHAFFLYSRPKEVWNKL-GLWEVTGADFLREVQDRWIHICNSVSTSTLERICVSAWAIWNDCNNLVHQRPIPSVDYLCD
         C D  E ++HAFF   R +++W  L        A+      + W  +   +    L    ++ W IWND N+L+H + +  V++ C+
Subjt:  CCRDDFEDMVHAFFLYSRPKEVWNKL-GLWEVTGADFLREVQDRWIHICNSVSTSTLERICVSAWAIWNDCNNLVHQRPIPSVDYLCD

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia] (e value: 3.7e-04; % identity: 48.21)
Query:  GQCINLEKSRICFSKNVPEDTKTYLSSILQMKSVTELGLYLGLPAYFHRSRTKDFK
        GQCIN  KS + FS NV  + + YL  IL +K V+  G YLGLP++F R R +  K
Subjt:  GQCINLEKSRICFSKNVPEDTKTYLSSILQMKSVTELGLYLGLPAYFHRSRTKDFK

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia] (e value: 2.0e-26; % identity: 29.02)
Query:  YEGDLGQCINLEKSRICFSKNVPEDTKTYLSSILQMKSVTELGLYLGLPAYFHRSRTKDFKSILDRVW--------------------------------
        YE   GQ +NL KS +  S+N+ +  K  LS IL +K V   G+YLGLP+   RS+   F  I DR+W                                
Subjt:  YEGDLGQCINLEKSRICFSKNVPEDTKTYLSSILQMKSVTELGLYLGLPAYFHRSRTKDFKSILDRVW--------------------------------

Query:  ----------LYLQDI-EEIIRIPINLSSL--DKWIWHYDKFGCYTIKSGYKMGMVLAAEASSSSSVGISGWWKGLWKLNIPNKVKVFIWRSFNLCLPCM
                  L  QD+ E+I+  P+ +SS+  DK +W  ++  CY++KSGYK+ M     +     VG    W  +WK   P+K +  +W     CLP  
Subjt:  ----------LYLQDI-EEIIRIPINLSSL--DKWIWHYDKFGCYTIKSGYKMGMVLAAEASSSSSVGISGWWKGLWKLNIPNKVKVFIWRSFNLCLPCM

Query:  VNLRKHHVPIEVMCPCCRDDFEDMVHAFFLYSRPKEVWNKLGLWEVT-GADFLR-EVQDRWIHICNSVSTSTLERICVSAWAIWND
          L +  V   + CP C ++ ED +H FF  +  ++ W+  GL  +   A + +    DR   +C + S+ T+ R+ +  W IW++
Subjt:  VNLRKHHVPIEVMCPCCRDDFEDMVHAFFLYSRPKEVWNKLGLWEVT-GADFLR-EVQDRWIHICNSVSTSTLERICVSAWAIWND

XP_030479133.1 uncharacterized protein LOC115696372 [Cannabis sativa] (e value: 4.5e-26; % identity: 35.2)
Query:  DIEEIIRIPIN-LSSLDKWIWHYDKFGCYTIKSGYKMGMVLAAEASSSSSVGISGWWKGLWKLNIPNKVKVFIWRSFNLCLPCMVNLRKHHVPIEVMCPC
        D++ I++IP++ L   D+WIWHY+  G Y++ SGY +   L  E  SS S     WWK  WKLN+P+KVK+F W+     +P   +L    +     C  
Subjt:  DIEEIIRIPIN-LSSLDKWIWHYDKFGCYTIKSGYKMGMVLAAEASSSSSVGISGWWKGLWKLNIPNKVKVFIWRSFNLCLPCMVNLRKHHVPIEVMCPC

Query:  CRDDFEDMVHAFFLYSRPKEVWNKLGL-WEVTGADFLREVQDRWIHICNSVSTSTLERICVSAWAIWNDCNNLVHQRPI
        C+  +E + HA F     KEVW   G   + T AD L++  D  +H+ +    S  E I    W IW+D NN +H + +
Subjt:  CRDDFEDMVHAFFLYSRPKEVWNKLGL-WEVTGADFLREVQDRWIHICNSVSTSTLERICVSAWAIWNDCNNLVHQRPI

XP_030497600.1 uncharacterized protein LOC115713257 [Cannabis sativa] (e value: 3.4e-26; % identity: 36.57)
Query:  DIEEIIRIPINLSSL-DKWIWHYDKFGCYTIKSGYKMGMVLAAEASSSSSVGISGWWKGLWKLNIPNKVKVFIWRSFNLCLPCMVNLRKHHVPIEVMCPC
        DI+ I+ IP++ +S  D+W WHYD  G YT+KSGY +   L  +  SSSS     WW+  W LN+P+KV++F WR  N  LP   NL    V     C  
Subjt:  DIEEIIRIPINLSSL-DKWIWHYDKFGCYTIKSGYKMGMVLAAEASSSSSVGISGWWKGLWKLNIPNKVKVFIWRSFNLCLPCMVNLRKHHVPIEVMCPC

Query:  CRDDFEDMVHAFFLYSRPKEVWNKLGL-WEVTGADFLREVQDRWIHICNSVSTSTLERICVSAWAIWNDCNNLVH
        C   +E + HA F     K VW       + T A F+++  D  + +   ++ S LE++  + W IW+D NN +H
Subjt:  CRDDFEDMVHAFFLYSRPKEVWNKLGL-WEVTGADFLREVQDRWIHICNSVSTSTLERICVSAWAIWNDCNNLVH

TrEMBL top hits (e value, % identity, alignment)
A0A2Z6N5F4 Uncharacterized protein (e value: 9.8e-27; % identity: 29.02)
Query:  YEGDLGQCINLEKSRICFSKNVPEDTKTYLSSILQMKSVTELGLYLGLPAYFHRSRTKDFKSILDRVW--------------------------------
        YE   GQ +NL KS +  S+N+ +  K  LS IL +K V   G+YLGLP+   RS+   F  I DR+W                                
Subjt:  YEGDLGQCINLEKSRICFSKNVPEDTKTYLSSILQMKSVTELGLYLGLPAYFHRSRTKDFKSILDRVW--------------------------------

Query:  ----------LYLQDI-EEIIRIPINLSSL--DKWIWHYDKFGCYTIKSGYKMGMVLAAEASSSSSVGISGWWKGLWKLNIPNKVKVFIWRSFNLCLPCM
                  L  QD+ E+I+  P+ +SS+  DK +W  ++  CY++KSGYK+ M     +     VG    W  +WK   P+K +  +W     CLP  
Subjt:  ----------LYLQDI-EEIIRIPINLSSL--DKWIWHYDKFGCYTIKSGYKMGMVLAAEASSSSSVGISGWWKGLWKLNIPNKVKVFIWRSFNLCLPCM

Query:  VNLRKHHVPIEVMCPCCRDDFEDMVHAFFLYSRPKEVWNKLGLWEVT-GADFLR-EVQDRWIHICNSVSTSTLERICVSAWAIWND
          L +  V   + CP C ++ ED +H FF  +  ++ W+  GL  +   A + +    DR   +C + S+ T+ R+ +  W IW++
Subjt:  VNLRKHHVPIEVMCPCCRDDFEDMVHAFFLYSRPKEVWNKLGLWEVT-GADFLR-EVQDRWIHICNSVSTSTLERICVSAWAIWND

A0A2Z6NFP3 Reverse transcriptase domain-containing protein (e value: 1.2e-29; % identity: 30)
Query:  YEGDLGQCINLEKSRICFSKNVPEDTKTYLSSILQMKSVTELGLYLGLPAYFHRSRTKDFKSILDRVWLYLQD---------------------------
        YE   GQ +NL KS +  S+N+ +  K  LS IL +K V   G+YLGLP    RS+   F  I DR+W  +                             
Subjt:  YEGDLGQCINLEKSRICFSKNVPEDTKTYLSSILQMKSVTELGLYLGLPAYFHRSRTKDFKSILDRVWLYLQD---------------------------

Query:  ----------------IEEIIRIPINLSSL--DKWIWHYDKFGCYTIKSGYKMGMVLAAEASSSSSVGISGWWKGLWKLNIPNKVKVFIWRSFNLCLPCM
                         E+I+  P+ +SS+  DK +W  ++ GCY++KSGYK+ M        S    + G W G+WK   P+K +  +WR    CLP  
Subjt:  ----------------IEEIIRIPINLSSL--DKWIWHYDKFGCYTIKSGYKMGMVLAAEASSSSSVGISGWWKGLWKLNIPNKVKVFIWRSFNLCLPCM

Query:  VNLRKHHVPIEVMCPCCRDDFEDMVHAFFLYSRPKEVWNKLGLWEVT-GADFLR-EVQDRWIHICNSVSTSTLERICVSAWAIWNDCNNL
          L +  V   + CP C ++ ED +H FF  +  ++ W+  GL  V   A + +  V DR   +CN+ S+ T+ R+ +  W IW++ + L
Subjt:  VNLRKHHVPIEVMCPCCRDDFEDMVHAFFLYSRPKEVWNKLGLWEVT-GADFLR-EVQDRWIHICNSVSTSTLERICVSAWAIWNDCNNL

A0A6J1DX30 uncharacterized protein LOC111024874 (e value: 1.2e-27; % identity: 32.98)
Query:  QDIEEIIRIPINLSSL-DKWIWHYDKFGCYTIKSGYKMGMVLAAEASSSSSVGISGWWKGLWKLNIPNKVKVFIWRSFNLCLPCMVNLRKHHVPIEVMCP
        +D + I+ +PI+  +L D W+WHYDK G Y+++SGYK+ M L   A+S+S+      W  +WKL +P K+K+FIWRS +  +P   NL    +     C 
Subjt:  QDIEEIIRIPINLSSL-DKWIWHYDKFGCYTIKSGYKMGMVLAAEASSSSSVGISGWWKGLWKLNIPNKVKVFIWRSFNLCLPCMVNLRKHHVPIEVMCP

Query:  CCRDDFEDMVHAFFLYSRPKEVWNKL-GLWEVTGADFLREVQDRWIHICNSVSTSTLERICVSAWAIWNDCNNLVHQRPIPSVDYLCD
         C D  E ++HAFF   R +++W  L        A+      + W  +   +    L    ++ W IWND N+L+H + +  V++ C+
Subjt:  CCRDDFEDMVHAFFLYSRPKEVWNKL-GLWEVTGADFLREVQDRWIHICNSVSTSTLERICVSAWAIWNDCNNLVHQRPIPSVDYLCD

A0A6J1DX30 uncharacterized protein LOC111024874 (e value: 1.8e-04; % identity: 48.21)
Query:  GQCINLEKSRICFSKNVPEDTKTYLSSILQMKSVTELGLYLGLPAYFHRSRTKDFK
        GQCIN  KS + FS NV  + + YL  IL +K V+  G YLGLP++F R R +  K
Subjt:  GQCINLEKSRICFSKNVPEDTKTYLSSILQMKSVTELGLYLGLPAYFHRSRTKDFK

A0A6J1DX30 uncharacterized protein LOC111024874 (e value: 2.6e-27; % identity: 32.8)
Query:  DIEEIIRIPINL-SSLDKWIWHYDKFGCYTIKSGYKMGMVLAAEASSSSSVGISGWWKGLWKLNIPNKVKVFIWRSFNLCLPCMVNLRKHHVPIEVMCPC
        DI+ I+ IP++L  S D  IWHY   GCYT+KSGY+         +  SS   + WW+  W L +P+KV++F WR+F+  LP    L   H+  + +CP 
Subjt:  DIEEIIRIPINL-SSLDKWIWHYDKFGCYTIKSGYKMGMVLAAEASSSSSVGISGWWKGLWKLNIPNKVKVFIWRSFNLCLPCMVNLRKHHVPIEVMCPC

Query:  CRDDFEDMVHAFFLYSRPKEVWNKLGLWEVTGADFLREVQDRWIHICNSVSTSTLERICVSAWAIWNDCNNLVHQRPIPSVDYLCD
        C+   E + HA     RPK+VWN+  L        +  +QD  +H+ + +S    E      W +W++ N   H +    V  + D
Subjt:  CRDDFEDMVHAFFLYSRPKEVWNKLGLWEVTGADFLREVQDRWIHICNSVSTSTLERICVSAWAIWNDCNNLVHQRPIPSVDYLCD

A0A803P5H2 Uncharacterized protein (e value: 4.4e-27; % identity: 35.75)
Query:  DIEEIIRIPINLSSL-DKWIWHYDKFGCYTIKSGYKMGMVLAAEASSSSSVGISGWWKGLWKLNIPNKVKVFIWRSFNLCLPCMVNLRKHHVPIEVMCPC
        DI+ I+ IP++ +S  D+W WHYD  G YT+KSGY +   L  + +SSSS     WW+  W LN+P+KV++F WR  N  LP   NL    V     C  
Subjt:  DIEEIIRIPINLSSL-DKWIWHYDKFGCYTIKSGYKMGMVLAAEASSSSSVGISGWWKGLWKLNIPNKVKVFIWRSFNLCLPCMVNLRKHHVPIEVMCPC

Query:  CRDDFEDMVHAFFLYSRPKEVWNKLGL-WEVTGADFLREVQDRWIHICNSVSTSTLERICVSAWAIWNDCNNLVHQRPI
        C   +E + HA F     K VW       + T A F+++  D  + +   ++ S LE++  + W IW+D NN +H + +
Subjt:  CRDDFEDMVHAFFLYSRPKEVWNKLGL-WEVTGADFLREVQDRWIHICNSVSTSTLERICVSAWAIWNDCNNLVHQRPI

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT2G02650.1 Ribonuclease H-like superfamily protein (e value: 1.3e-07; % identity: 23.31)
Query:  KGLWKLNIPNKVKVFIWRSFNLCLPCMVNLRKHHVPIEVMCPCCRDDFEDMVHAFFLYSRPKEVWNKLGL-----WEVTGADFLREVQDRWIHICNSVST
        + +WKL++  K+K F+WR     L     LR  ++  + +C  C  + E + H  F     + VW    +     W    +    +  +R I +  + +T
Subjt:  KGLWKLNIPNKVKVFIWRSFNLCLPCMVNLRKHHVPIEVMCPCCRDDFEDMVHAFFLYSRPKEVWNKLGL-----WEVTGADFLREVQDRWIHICNSVST

Query:  STLERICV--SAWAIWNDCNNLVHQRPIPSVDY
        ++L+R       W +W   N  + Q+   S DY
Subjt:  STLERICV--SAWAIWNDCNNLVHQRPIPSVDY

AT3G09510.1 Ribonuclease H-like superfamily protein (e value: 2.4e-09; % identity: 25.29)
Query:  DKWIWHYDKFGCYTIKSGYKMGMVLAAEASSSSSV-----GISGWWKGLWKLNIPNKVKVFIWRSFNLCLPCMVNLRKHHVPIEVMCPCCRDDFEDMVHA
        DK IW+Y+  G YT++SGY    +L  + S++        G       +W L I  K+K F+WR+ +  L     L    + I+  CP C  + E + HA
Subjt:  DKWIWHYDKFGCYTIKSGYKMGMVLAAEASSSSSV-----GISGWWKGLWKLNIPNKVKVFIWRSFNLCLPCMVNLRKHHVPIEVMCPCCRDDFEDMVHA

Query:  FFLYSRPKEVWNKLGLWEVTGADFLREVQDRWIHICNSVSTSTLERI-----CVSAWAIWNDCNNLVHQR
         F        W       +       + ++   +I N V  +T+            W IW   NN+V  +
Subjt:  FFLYSRPKEVWNKLGLWEVTGADFLREVQDRWIHICNSVSTSTLERI-----CVSAWAIWNDCNNLVHQR

AT3G25270.1 Ribonuclease H-like superfamily protein (e value: 4.2e-06; % identity: 26.02)
Query:  LWKLNIPNKVKVFIWRSFNLCLPCMVNLRKHHVPIEVMC-PCCRDDFEDMVHAFFLYSRPKEVWNKLGL----WEVTGADFLREVQDRWIHICNSVSTST
        +WKL    K+K F+W+  +  L    NL++ H+     C  CC++D E   H FF     ++VW   G+       TG   +    +  +  C +     
Subjt:  LWKLNIPNKVKVFIWRSFNLCLPCMVNLRKHHVPIEVMC-PCCRDDFEDMVHAFFLYSRPKEVWNKLGL----WEVTGADFLREVQDRWIHICNSVSTST

Query:  LERICV-SAWAIWNDCNNLVHQR
        L  + +   W +W   N LV Q+
Subjt:  LERICV-SAWAIWNDCNNLVHQR

AT3G26855.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein (e value: 7.2e-06; % identity: 36.67)
Query:  SGWWKGLWKLNIPNKVKVFIWRSFNLCLPCMVNLRKHHVPIEVMCPCCRDDFEDMVHAFF
        + W   +W L I  K+K+ IW++ N  LP    L   ++ IE  C  CR DFE + H  F
Subjt:  SGWWKGLWKLNIPNKVKVFIWRSFNLCLPCMVNLRKHHVPIEVMCPCCRDDFEDMVHAFF


Sequences
CDS sequence
ATGGTCGACTATGAAGGAGATTTGGGTCAATGCATAAATCTTGAGAAATCAAGGATATGTTTTTCGAAGAATGTTCCAGAAGACACAAAGACATATCTGAGTTCAATTCT
ACAAATGAAATCGGTTACTGAATTGGGCCTTTACCTTGGTCTTCCTGCATATTTTCATCGTAGTAGGACCAAGGATTTTAAGAGCATTCTTGATCGAGTATGGTTATATC
TTCAAGATATTGAGGAAATTATTAGGATTCCCATCAATTTGTCGTCTTTGGATAAGTGGATTTGGCATTATGATAAATTTGGTTGCTATACTATCAAAAGTGGATATAAA
ATGGGGATGGTGTTGGCCGCTGAGGCATCCTCTTCAAGCTCGGTGGGTATCAGTGGATGGTGGAAAGGGTTGTGGAAATTGAATATTCCAAATAAAGTTAAGGTTTTTAT
CTGGCGATCTTTCAATCTTTGTCTTCCTTGTATGGTGAACCTCAGAAAGCATCATGTTCCAATAGAGGTGATGTGTCCATGTTGTAGGGATGACTTCGAAGATATGGTCC
ATGCATTTTTCCTCTACAGTAGACCTAAAGAAGTATGGAATAAGTTGGGTTTATGGGAGGTGACAGGAGCGGACTTTCTAAGGGAGGTTCAGGATCGGTGGATCCACATT
TGTAATAGTGTTTCCACAAGTACCCTTGAGAGGATTTGTGTGAGTGCCTGGGCCATATGGAATGATTGTAACAATTTGGTTCATCAACGGCCTATTCCATCTGTGGACTA
TCTCTGTGACTTGATCTAG
mRNA sequence
ATGGTCGACTATGAAGGAGATTTGGGTCAATGCATAAATCTTGAGAAATCAAGGATATGTTTTTCGAAGAATGTTCCAGAAGACACAAAGACATATCTGAGTTCAATTCT
ACAAATGAAATCGGTTACTGAATTGGGCCTTTACCTTGGTCTTCCTGCATATTTTCATCGTAGTAGGACCAAGGATTTTAAGAGCATTCTTGATCGAGTATGGTTATATC
TTCAAGATATTGAGGAAATTATTAGGATTCCCATCAATTTGTCGTCTTTGGATAAGTGGATTTGGCATTATGATAAATTTGGTTGCTATACTATCAAAAGTGGATATAAA
ATGGGGATGGTGTTGGCCGCTGAGGCATCCTCTTCAAGCTCGGTGGGTATCAGTGGATGGTGGAAAGGGTTGTGGAAATTGAATATTCCAAATAAAGTTAAGGTTTTTAT
CTGGCGATCTTTCAATCTTTGTCTTCCTTGTATGGTGAACCTCAGAAAGCATCATGTTCCAATAGAGGTGATGTGTCCATGTTGTAGGGATGACTTCGAAGATATGGTCC
ATGCATTTTTCCTCTACAGTAGACCTAAAGAAGTATGGAATAAGTTGGGTTTATGGGAGGTGACAGGAGCGGACTTTCTAAGGGAGGTTCAGGATCGGTGGATCCACATT
TGTAATAGTGTTTCCACAAGTACCCTTGAGAGGATTTGTGTGAGTGCCTGGGCCATATGGAATGATTGTAACAATTTGGTTCATCAACGGCCTATTCCATCTGTGGACTA
TCTCTGTGACTTGATCTAG
Protein sequence
MVDYEGDLGQCINLEKSRICFSKNVPEDTKTYLSSILQMKSVTELGLYLGLPAYFHRSRTKDFKSILDRVWLYLQDIEEIIRIPINLSSLDKWIWHYDKFGCYTIKSGYK
MGMVLAAEASSSSSVGISGWWKGLWKLNIPNKVKVFIWRSFNLCLPCMVNLRKHHVPIEVMCPCCRDDFEDMVHAFFLYSRPKEVWNKLGLWEVTGADFLREVQDRWIHI
CNSVSTSTLERICVSAWAIWNDCNNLVHQRPIPSVDYLCDLI
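
The CDS and protein sequences listed above can be cross-checked by translating the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code applies to this gene; the helper name `translate` is hypothetical and not part of CuGenDB. It is shown on the first 39 nucleotides and the last 12 nucleotides of the CDS; pasting the full CDS in their place should work the same way.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks and other whitespace
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# Start of the CDS listed above (first 39 nt).
print(translate("ATGGTCGACTATGAAGGAGATTTGGGTCAATGCATAAAT"))
# End of the CDS listed above (last 12 nt, including the TAG stop codon).
print(translate("GACTTGATCTAG"))
```

The two fragments translate to MVDYEGDLGQCIN and DLI, which agree with the beginning and end of the protein sequence shown above.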