; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0028556 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0028556
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Descriptionzf-RVT domain-containing protein
Genome locationchr8:24822790..24823526
RNA-Seq ExpressionLag0028556
SyntenyLag0028556
Gene Ontology termsNA
InterPro domainsIPR026960 - Reverse transcriptase zinc-binding domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF4359769.1 hypothetical protein F8388_008331 [Cannabis sativa]1.8e-1528.88Show/hide
Query:  EASLSNMEKETTWWKRVWKMRVPSKVKIFVWKTFHNFIPTIVNLWNHHVSVNGNCPTCQKEMETTDHALFQCTRAREVWEIIYPPMMRNLWVNMDIKDRW
        + S SN E  + WWK +W++ +P K++ FV++   + +PT  NL+N H   +  CP C    E+  HALF+C   ++ W   +   +  LW   + ++  
Subjt:  EASLSNMEKETTWWKRVWKMRVPSKVKIFVWKTFHNFIPTIVNLWNHHVSVNGNCPTCQKEMETTDHALFQCTRAREVWEIIYPPMMRNLWVNMDIKDRW

Query:  LSLADNSMEIPSPMLRSEWINDYLSEFLKANPKSGTIAQTEEDIVNIISDG---GDLIMHTDAFVMETQSKCGIGIVLRDKQGHLKA
         S+  N +  P  +   +   DYL+ +  A  K G  +Q+  D   ++ +    G L ++TDA +   Q+K G G ++RD  G + A
Subjt:  LSLADNSMEIPSPMLRSEWINDYLSEFLKANPKSGTIAQTEEDIVNIISDG---GDLIMHTDAFVMETQSKCGIGIVLRDKQGHLKA

KAF4386115.1 hypothetical protein F8388_016367 [Cannabis sativa]1.4e-1528.88Show/hide
Query:  EASLSNMEKETTWWKRVWKMRVPSKVKIFVWKTFHNFIPTIVNLWNHHVSVNGNCPTCQKEMETTDHALFQCTRAREVWEIIYPPMMRNLWVNMDIKDRW
        + S SN E  T WWK +W++ +P K++ FV++   + +PT  NL+N H   +  CP C    E+  HALF+C   ++ W   +   +  LW   + ++  
Subjt:  EASLSNMEKETTWWKRVWKMRVPSKVKIFVWKTFHNFIPTIVNLWNHHVSVNGNCPTCQKEMETTDHALFQCTRAREVWEIIYPPMMRNLWVNMDIKDRW

Query:  LSLADNSMEIPSPMLRSEWINDYLSEFLKANPKSGTIAQTEEDIVNIISDG---GDLIMHTDAFVMETQSKCGIGIVLRDKQGHLKA
         S+  N +  P  +   +   DYL+ +  A  K G  +Q+  D   ++ +    G L ++TDA +   Q++ G G ++RD  G + A
Subjt:  LSLADNSMEIPSPMLRSEWINDYLSEFLKANPKSGTIAQTEEDIVNIISDG---GDLIMHTDAFVMETQSKCGIGIVLRDKQGHLKA

KAF4395712.1 hypothetical protein G4B88_013486 [Cannabis sativa]1.4e-1528.88Show/hide
Query:  EASLSNMEKETTWWKRVWKMRVPSKVKIFVWKTFHNFIPTIVNLWNHHVSVNGNCPTCQKEMETTDHALFQCTRAREVWEIIYPPMMRNLWVNMDIKDRW
        + S SN E  T WWK +W++ +P K++ FV++   + +PT  NL+N H   +  CP C    E+  HALF+C   ++ W   +   +  LW   + ++  
Subjt:  EASLSNMEKETTWWKRVWKMRVPSKVKIFVWKTFHNFIPTIVNLWNHHVSVNGNCPTCQKEMETTDHALFQCTRAREVWEIIYPPMMRNLWVNMDIKDRW

Query:  LSLADNSMEIPSPMLRSEWINDYLSEFLKANPKSGTIAQTEEDIVNIISDG---GDLIMHTDAFVMETQSKCGIGIVLRDKQGHLKA
         S+  N +  P  +   +   DYL+ +  A  K G  +Q+  D   ++ +    G L ++TDA +   Q++ G G ++RD  G + A
Subjt:  LSLADNSMEIPSPMLRSEWINDYLSEFLKANPKSGTIAQTEEDIVNIISDG---GDLIMHTDAFVMETQSKCGIGIVLRDKQGHLKA

XP_030486818.1 uncharacterized protein LOC115703724 [Cannabis sativa]1.4e-1528.16Show/hide
Query:  WWKRVWKMRVPSKVKIFVWKTFHNFIPTIVNLWNHHVSVNGNCPTCQKEMETTDHALFQCTRAREVWEIIYPPMMRNLWVNMDIKDRWLSL----ADNSM
        WWK+ WK+ +PSKV+IF+W+  H+++P    L++ H++ +  C  C+ E ET  HALF C R + +W+    P+   L   M +K+  L +    +D  +
Subjt:  WWKRVWKMRVPSKVKIFVWKTFHNFIPTIVNLWNHHVSVNGNCPTCQKEMETTDHALFQCTRAREVWEIIYPPMMRNLWVNMDIKDRWLSL----ADNSM

Query:  EIPSPMLRSEWIN---------------------DYLSEFLKA---------NPKSGTIAQTEEDIVNIISDGGDLIMHTDAFVMETQSKCGIGIVLRDK
        E    +L   W                        +L EF  A         +  S ++A T  D+  +    G L ++TDA V    +  G G +LRD 
Subjt:  EIPSPMLRSEWIN---------------------DYLSEFLKA---------NPKSGTIAQTEEDIVNIISDGGDLIMHTDAFVMETQSKCGIGIVLRDK

Query:  QGHLKA
         G++ A
Subjt:  QGHLKA

XP_042939545.1 uncharacterized protein LOC122274584 [Carya illinoinensis]2.7e-1629.03Show/hide
Query:  MEKETTWWKRVWKMRVPSKVKIFVWKTFHNFIPTIVNLWNHHVSVNGNCPTCQKEMETTDHALFQCTRAREVWEIIYPPMMRNLWVNMDIKDRW----LS
        M  +   W+ +WK+ +   VK F+WK  H  +PT +NL+  H+  N  CP C++E+ETT HAL+ C  A +VWE    P+ +    ++D  + W    LS
Subjt:  MEKETTWWKRVWKMRVPSKVKIFVWKTFHNFIPTIVNLWNHHVSVNGNCPTCQKEMETTDHALFQCTRAREVWEIIYPPMMRNLWVNMDIKDRW----LS

Query:  LADNSMEIPSPMLRSEWI--NDYLSEFLKANPKSG-TIAQTEEDIVNII--------SDGGDLIMHTD-----------AFVMETQSKCGIGIVLRDKQG
        L +  +E+ +  LR  W+  N  + E    NP+   +IA+   D    +        SD  + +               A + E + K GIG++ R+ +G
Subjt:  LADNSMEIPSPMLRSEWI--NDYLSEFLKANPKSG-TIAQTEEDIVNII--------SDGGDLIMHTD-----------AFVMETQSKCGIGIVLRDKQG

Query:  HLKAVQNLSSQAANSPL
         +     L+ QAA  P+
Subjt:  HLKAVQNLSSQAANSPL

TrEMBL top hitse value%identityAlignment
A0A7J6HKE1 Uncharacterized protein6.5e-1628.88Show/hide
Query:  EASLSNMEKETTWWKRVWKMRVPSKVKIFVWKTFHNFIPTIVNLWNHHVSVNGNCPTCQKEMETTDHALFQCTRAREVWEIIYPPMMRNLWVNMDIKDRW
        + S SN E  T WWK +W++ +P K++ FV++   + +PT  NL+N H   +  CP C    E+  HALF+C   ++ W   +   +  LW   + ++  
Subjt:  EASLSNMEKETTWWKRVWKMRVPSKVKIFVWKTFHNFIPTIVNLWNHHVSVNGNCPTCQKEMETTDHALFQCTRAREVWEIIYPPMMRNLWVNMDIKDRW

Query:  LSLADNSMEIPSPMLRSEWINDYLSEFLKANPKSGTIAQTEEDIVNIISDG---GDLIMHTDAFVMETQSKCGIGIVLRDKQGHLKA
         S+  N +  P  +   +   DYL+ +  A  K G  +Q+  D   ++ +    G L ++TDA +   Q++ G G ++RD  G + A
Subjt:  LSLADNSMEIPSPMLRSEWINDYLSEFLKANPKSGTIAQTEEDIVNIISDG---GDLIMHTDAFVMETQSKCGIGIVLRDKQGHLKA

A0A803NZC3 Uncharacterized protein2.7e-1727.93Show/hide
Query:  MKGQEASLSNMEKETTWWKRVWKMRVPSKVKIFVWKTFHNFIPTIVNLWNHHVSVNGNCPTCQKEMETTDHALFQCTRAREVWEIIYPPMMRNLWVNMDI
        ++ Q+ S+S M+ +  WWK+ WK+ +PSKV+IF+W+  H+++P    L++ H++ +  C  C+ E ET  HALF C R + +W+    P+   L   M +
Subjt:  MKGQEASLSNMEKETTWWKRVWKMRVPSKVKIFVWKTFHNFIPTIVNLWNHHVSVNGNCPTCQKEMETTDHALFQCTRAREVWEIIYPPMMRNLWVNMDI

Query:  KDRWLSL----ADNSMEIPSPMLRSEWIN---------------------DYLSEFLKA---------NPKSGTIAQTEEDIVNIISDGGDLIMHTDAFV
        K+  L +    +D  +E    +L   W                        +L EF  A         +  S ++A T  D+  +    G L ++TDA V
Subjt:  KDRWLSL----ADNSMEIPSPMLRSEWIN---------------------DYLSEFLKA---------NPKSGTIAQTEEDIVNIISDGGDLIMHTDAFV

Query:  METQSKCGIGIVLRDKQGHLKA
            +  G G +LRD  G++ A
Subjt:  METQSKCGIGIVLRDKQGHLKA

A0A803PQ54 Uncharacterized protein3.1e-1831.43Show/hide
Query:  SLSNMEKETTWWKRVWKMRVPSKVKIFVWKTFHNFIPTIVNLWNHHVSVNGNCPTCQKEMETTDHALFQCTRAREVWEIIYPPMMRNLWVNMDIKDRWLS
        S SN      WWK  W   +P KVK F W+TFH+ +PT  NL+   V  + +C  C   +ET  HAL  C+R R+VW++        L +N DIKD  LS
Subjt:  SLSNMEKETTWWKRVWKMRVPSKVKIFVWKTFHNFIPTIVNLWNHHVSVNGNCPTCQKEMETTDHALFQCTRAREVWEIIYPPMMRNLWVNMDIKDRWLS

Query:  ----LADNSMEIPSPMLRSEWINDYLSEFLKANPKSGTIAQ---------TEEDIVNIISDG---------------GDLIMHTDAFVMETQSKCGIGIV
            L  +   +    L S W       F  ANP +G + Q          E  + NI  D                G   ++TDA + +   K G+G V
Subjt:  ----LADNSMEIPSPMLRSEWINDYLSEFLKANPKSGTIAQ---------TEEDIVNIISDG---------------GDLIMHTDAFVMETQSKCGIGIV

Query:  LRDKQGHLKA
        ++D +G + A
Subjt:  LRDKQGHLKA

A0A803Q6M0 Uncharacterized protein2.2e-1629.19Show/hide
Query:  EASLSNMEKETTWWKRVWKMRVPSKVKIFVWKTFHNFIPTIVNLWNHHVSVNGNCPTCQKEMETTDHALFQCTRAREVWEIIYPPMMRNLWVNMDIKDRW
        + S S+M   T WWK +W + +PSK++ F+++   N IPT  NL++ H   +  CP C    E+  HALF+C   ++ W   +   +  LW   + +   
Subjt:  EASLSNMEKETTWWKRVWKMRVPSKVKIFVWKTFHNFIPTIVNLWNHHVSVNGNCPTCQKEMETTDHALFQCTRAREVWEIIYPPMMRNLWVNMDIKDRW

Query:  LSLADNSMEIPSPMLRSEWINDYLSEFLKANPKSGTIAQTEEDIVNIISDG---GDLIMHTDAFVMETQSKCGIGIVLRDKQGHL
         + A    ++ +P+   +   DYL  +  A  K     Q+  D+ N+  +    G L ++TDA V   Q+K G G+++RD  G +
Subjt:  LSLADNSMEIPSPMLRSEWINDYLSEFLKANPKSGTIAQTEEDIVNIISDG---GDLIMHTDAFVMETQSKCGIGIVLRDKQGHL

A0A803QG04 Uncharacterized protein1.3e-1628.7Show/hide
Query:  SLSNMEKETTWWKRVWKMRVPSKVKIFVWKTFHNFIPTIVNLWNHHVSVNGNCPTCQKEMETTDHALFQCTRAREVWEIIYPPMMRNLWVNMDIKDRWLS
        S SN      WWK  W  ++P K+K F W+TFH+ +PT  NL    V  + +C  C   +ET  HAL  C+R R+VW++           N DIKD  LS
Subjt:  SLSNMEKETTWWKRVWKMRVPSKVKIFVWKTFHNFIPTIVNLWNHHVSVNGNCPTCQKEMETTDHALFQCTRAREVWEIIYPPMMRNLWVNMDIKDRWLS

Query:  L-----ADNSMEI--------------------PSPMLRSEWINDYLSEFLKANPKSGTIAQTEEDIVNIISD---GGDLIMHTDAFVMETQSKCGIGIV
              AD+   +                    P      +W   YLS++++A  K   +  T +D   +++     G   + TDA + E   K G+G V
Subjt:  L-----ADNSMEI--------------------PSPMLRSEWINDYLSEFLKANPKSGTIAQTEEDIVNIISD---GGDLIMHTDAFVMETQSKCGIGIV

Query:  LRDKQGHLKAVQNLSSQAANSPL
        ++D  G + A  ++   A   P+
Subjt:  LRDKQGHLKAVQNLSSQAANSPL

SwissProt top hitse value%identityAlignment
P0C2F6 Putative ribonuclease H protein At1g657505.5e-0426.97Show/hide
Query:  MMKGQEASLSNMEKETTWWKRVWKMRVPSKVKIFVWKTFHNFIPTIVNLWNHHVSVNGNCPTCQKEMETTDHALFQCTRAREVWEIIYP
        M+   E    NM    +++  +WK+RVP +VK F+W   +  + T       H+S +  C  C+  +E+  H L  C     +W  + P
Subjt:  MMKGQEASLSNMEKETTWWKRVWKMRVPSKVKIFVWKTFHNFIPTIVNLWNHHVSVNGNCPTCQKEMETTDHALFQCTRAREVWEIIYP

Arabidopsis top hitse value%identityAlignment
AT2G02650.1 Ribonuclease H-like superfamily protein2.3e-0530.16Show/hide
Query:  VWKMRVPSKVKIFVWKTFHNFIPTIVNLWNHHVSVNGNCPTCQKEMETTDHALFQCTRAREVW
        +WK+ V  K+K F+W+     + T   L + ++  +  C  C  E ET  H +F C   + VW
Subjt:  VWKMRVPSKVKIFVWKTFHNFIPTIVNLWNHHVSVNGNCPTCQKEMETTDHALFQCTRAREVW

AT3G09510.1 Ribonuclease H-like superfamily protein1.7e-0827.71Show/hide
Query:  RVWKMRVPSKVKIFVWKTFHNFIPTIVNLWNHHVSVNGNCPTCQKEMETTDHALFQCTRAREVWEIIYPPMMRNLWVNMDIKD
        R+W + +  K+K F+W+     + T   L    + ++ +CP C +E E+ +HALF C  A   W +    ++RN  ++ D ++
Subjt:  RVWKMRVPSKVKIFVWKTFHNFIPTIVNLWNHHVSVNGNCPTCQKEMETTDHALFQCTRAREVWEIIYPPMMRNLWVNMDIKD

AT3G25270.1 Ribonuclease H-like superfamily protein8.5e-0830.95Show/hide
Query:  RVWKMRVPSKVKIFVWKTFHNFIPTIVNLWNHHVSVNGNCPTCQKEMETTDHALFQCTRAREVWEIIYPP--MMRNLWVNMDIK
        ++WK++   K+K F+WK     + T  NL   H+  +  C  C +E ET+ H  F C  A++VW     P   +R   + M+ K
Subjt:  RVWKMRVPSKVKIFVWKTFHNFIPTIVNLWNHHVSVNGNCPTCQKEMETTDHALFQCTRAREVWEIIYPP--MMRNLWVNMDIK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGAAGGGCCAAGAGGCCTCATTGTCAAATATGGAAAAAGAGACTACTTGGTGGAAAAGGGTGTGGAAGATGAGAGTGCCTAGCAAAGTGAAAATATTCGTCTGGAA
AACTTTTCACAACTTCATCCCAACCATAGTAAACTTATGGAATCATCATGTATCGGTTAACGGGAATTGCCCGACTTGCCAGAAGGAGATGGAGACTACAGACCATGCCC
TATTTCAGTGTACGAGGGCTCGGGAGGTATGGGAAATTATTTATCCACCGATGATGAGGAATTTATGGGTTAATATGGATATCAAAGACCGCTGGTTGAGCTTGGCTGAC
AATTCCATGGAGATCCCTAGTCCAATGCTTAGAAGTGAATGGATTAATGACTATCTGTCAGAGTTCTTGAAGGCCAACCCGAAAAGTGGTACTATTGCTCAAACGGAGGA
AGATATTGTTAATATAATTTCAGACGGTGGAGATCTTATTATGCACACTGACGCATTTGTCATGGAAACACAGAGTAAATGCGGTATTGGAATAGTATTGCGTGATAAAC
AGGGGCATCTCAAGGCGGTGCAGAATCTATCTTCTCAGGCAGCTAACTCTCCTTTGAATGTGCAGCGATACTCGAAGGGAAGCGATAGCGATACTCGAAGGGATGCATCT
GGCTAG
mRNA sequenceShow/hide mRNA sequence
ATGATGAAGGGCCAAGAGGCCTCATTGTCAAATATGGAAAAAGAGACTACTTGGTGGAAAAGGGTGTGGAAGATGAGAGTGCCTAGCAAAGTGAAAATATTCGTCTGGAA
AACTTTTCACAACTTCATCCCAACCATAGTAAACTTATGGAATCATCATGTATCGGTTAACGGGAATTGCCCGACTTGCCAGAAGGAGATGGAGACTACAGACCATGCCC
TATTTCAGTGTACGAGGGCTCGGGAGGTATGGGAAATTATTTATCCACCGATGATGAGGAATTTATGGGTTAATATGGATATCAAAGACCGCTGGTTGAGCTTGGCTGAC
AATTCCATGGAGATCCCTAGTCCAATGCTTAGAAGTGAATGGATTAATGACTATCTGTCAGAGTTCTTGAAGGCCAACCCGAAAAGTGGTACTATTGCTCAAACGGAGGA
AGATATTGTTAATATAATTTCAGACGGTGGAGATCTTATTATGCACACTGACGCATTTGTCATGGAAACACAGAGTAAATGCGGTATTGGAATAGTATTGCGTGATAAAC
AGGGGCATCTCAAGGCGGTGCAGAATCTATCTTCTCAGGCAGCTAACTCTCCTTTGAATGTGCAGCGATACTCGAAGGGAAGCGATAGCGATACTCGAAGGGATGCATCT
GGCTAG
Protein sequenceShow/hide protein sequence
MMKGQEASLSNMEKETTWWKRVWKMRVPSKVKIFVWKTFHNFIPTIVNLWNHHVSVNGNCPTCQKEMETTDHALFQCTRAREVWEIIYPPMMRNLWVNMDIKDRWLSLAD
NSMEIPSPMLRSEWINDYLSEFLKANPKSGTIAQTEEDIVNIISDGGDLIMHTDAFVMETQSKCGIGIVLRDKQGHLKAVQNLSSQAANSPLNVQRYSKGSDSDTRRDAS
G