; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0039262 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0039262
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Descriptionzf-RVT domain-containing protein
Genome locationchr2:40185377..40188053
RNA-Seq ExpressionLag0039262
SyntenyLag0039262
Gene Ontology termsNA
InterPro domainsIPR026960 - Reverse transcriptase zinc-binding domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0059476.1 Transposon TX1 uncharacterized [Cucumis melo var. makuwa]3.1e-2442.68Show/hide
Query:  KDCWCDEQPLKSLFSDLFLISNKE-ATIADHWSNDSQTWNLAFRRGLFDREIGR----------WQVDKIKDVNLVNDHNLIRWKLEGSGNYSTKSMFQS
        +D WC  QPL SL+ D++LIS+K+ A +A +W + +Q W+L  RR +FDREIG           W V++ +D         +R KLE SG +STKS F  
Subjt:  KDCWCDEQPLKSLFSDLFLISNKE-ATIADHWSNDSQTWNLAFRRGLFDREIGR----------WQVDKIKDVNLVNDHNLIRWKLEGSGNYSTKSMFQS

Query:  MVTASPKINQPTSSLIWKHKSPKKVKVFLWSLAYRSLNTDEKLEKKFSQWSLSLYAC
        +   + K+N P  +LIWK K P       WSLAYRSLN  EKL+++F  WSL    C
Subjt:  MVTASPKINQPTSSLIWKHKSPKKVKVFLWSLAYRSLNTDEKLEKKFSQWSLSLYAC

KAG6706807.1 hypothetical protein I3842_07G239500 [Carya illinoinensis]5.8e-2331.82Show/hide
Query:  IFEQFTKLLLKAVEASSFGKDCWCDEQPLKSLFSDLFLIS-NKEATIAD--HWSNDSQTWNLAFRRGLFDREIGRW-----QVDKIKDVNLVNDHNLIRW
        +F   TKLL+      SF +D WC E+ LK  F  + L++ ++EA++AD    S D   WN+ F R   D E+  +     +V  +K   L  D   + W
Subjt:  IFEQFTKLLLKAVEASSFGKDCWCDEQPLKSLFSDLFLIS-NKEATIAD--HWSNDSQTWNLAFRRGLFDREIGRW-----QVDKIKDVNLVNDHNLIRW

Query:  KLEGSGNYSTKSMFQSMVTASPKINQPTSSLIWKHKSPKKVKVFLWSLAYRSLNTDEKLEKK---FSQWSLSLYACRLCLKAAENLDHLFLHCEFAGSAW
        K  G G +S  S ++++ TA P ++ P   L W++K+P K+  F+W++A   + T + L K+    + W      C +C K  E+++HL LHCE A + W
Subjt:  KLEGSGNYSTKSMFQSMVTASPKINQPTSSLIWKHKSPKKVKVFLWSLAYRSLNTDEKLEKK---FSQWSLSLYACRLCLKAAENLDHLFLHCEFAGSAW

Query:  NFVARLLGISFCMPRKIDE----WLIEGLNAWNLKRKAKILASCAFRTTFGFRGKKEMLEPSKI
          V R +G+++ MP+ + E    W + G    ++K   K++  C     +G  GK EM E  KI
Subjt:  NFVARLLGISFCMPRKIDE----WLIEGLNAWNLKRKAKILASCAFRTTFGFRGKKEMLEPSKI

ONI36148.1 hypothetical protein PRUPE_1G572100 [Prunus persica]3.1e-2433.77Show/hide
Query:  SSFGKDCWCDEQPLKSLFSDLFLISNKEATIADHWSNDSQ----TWNLAFRRGLFDREIGR--WQVDKIKDVNL-VNDHNLIRWKLEGSGNYSTKSMFQS
        S F +D W  E  LK +F  LF +S+K  T++ +   D+Q     W+  FRR L +RE       ++ ++ + L  +  +  RW LE SG+++ KS FQS
Subjt:  SSFGKDCWCDEQPLKSLFSDLFLISNKEATIADHWSNDSQ----TWNLAFRRGLFDREIGR--WQVDKIKDVNL-VNDHNLIRWKLEGSGNYSTKSMFQS

Query:  MVTASPKINQ-PTSSLIWKHKSPKKVKVFLWSLAYRSLNTDEKLEKKFSQWSLSLYACRLCLKAAENLDHLFLHCEFAGSAWNFVARLLGISFCMPRKID
         +    +    P  SL+WK KSP KVKVF+W +A   +NT + +++K     LS   C LC    E++DHLFLHC F+ S W  + R +G  + +P+   
Subjt:  MVTASPKINQ-PTSSLIWKHKSPKKVKVFLWSLAYRSLNTDEKLEKKFSQWSLSLYACRLCLKAAENLDHLFLHCEFAGSAWNFVARLLGISFCMPRKID

Query:  EWLIEGLNAWNLKRKAKILASCAFRTTF
        ++L      W L +    L  C   + F
Subjt:  EWLIEGLNAWNLKRKAKILASCAFRTTF

TYK31299.1 protein FAM91A1 [Cucumis melo var. makuwa]1.4e-2952.24Show/hide
Query:  IRWKLEGSGNYSTKSMFQSMVTASPKINQPTSSLIWKHKSPKKVKVFLWSLAYRSLNTDEKLEKKFSQWSLSLYACRLCLKAAENLDHLFLHCEFAGSAW
        +RWKLE SG +STK +F  M   + K N  T +LIWK K  KKVK FLWSLAYRSLN   KL++KF   SLS   C LCLK AE  DHLFLHC+FA   W
Subjt:  IRWKLEGSGNYSTKSMFQSMVTASPKINQPTSSLIWKHKSPKKVKVFLWSLAYRSLNTDEKLEKKFSQWSLSLYACRLCLKAAENLDHLFLHCEFAGSAW

Query:  NFVARLLGISFCMPRKIDEWLIEGLNAWNLKRKA
        N +  L  +  C+P+KID+ + +GL   +   KA
Subjt:  NFVARLLGISFCMPRKIDEWLIEGLNAWNLKRKA

VVA21329.1 Hypothetical predicted protein [Prunus dulcis]3.4e-2332.6Show/hide
Query:  SSFGKDCWCDEQPLKSLFSDLFLISNKEATIADHWSNDS---QTWNLAFRRGLFDREIGR--WQVDKIKDVNL-VNDHNLIRWKLEGSGNYSTKSMFQSM
        S F +D W  E  LK +F  LF +S+K     D + +       W+  FRR L +RE       ++ ++ + L  +  +  RW LE SG+++ KS FQS 
Subjt:  SSFGKDCWCDEQPLKSLFSDLFLISNKEATIADHWSNDS---QTWNLAFRRGLFDREIGR--WQVDKIKDVNL-VNDHNLIRWKLEGSGNYSTKSMFQSM

Query:  VTASPKINQ-PTSSLIWKHKSPKKVKVFLWSLAYRSLNTDEKLEKKFSQWSLSLYACRLCLKAAENLDHLFLHCEFAGSAWNFVARLLGISFCMPRKIDE
        +    +    P  SL+WK KSP KVKVF+W +A   +NT + +++K     LS   C LC    E++DHLFLHC F+ S W  + R +G  + + +   +
Subjt:  VTASPKINQ-PTSSLIWKHKSPKKVKVFLWSLAYRSLNTDEKLEKKFSQWSLSLYACRLCLKAAENLDHLFLHCEFAGSAWNFVARLLGISFCMPRKIDE

Query:  WLIEGLNAWNLKRKAKILASCAFRTTF
        +L      W L +    L  C   + F
Subjt:  WLIEGLNAWNLKRKAKILASCAFRTTF

TrEMBL top hitse value%identityAlignment
A0A251RJG1 zf-RVT domain-containing protein1.5e-2433.77Show/hide
Query:  SSFGKDCWCDEQPLKSLFSDLFLISNKEATIADHWSNDSQ----TWNLAFRRGLFDREIGR--WQVDKIKDVNL-VNDHNLIRWKLEGSGNYSTKSMFQS
        S F +D W  E  LK +F  LF +S+K  T++ +   D+Q     W+  FRR L +RE       ++ ++ + L  +  +  RW LE SG+++ KS FQS
Subjt:  SSFGKDCWCDEQPLKSLFSDLFLISNKEATIADHWSNDSQ----TWNLAFRRGLFDREIGR--WQVDKIKDVNL-VNDHNLIRWKLEGSGNYSTKSMFQS

Query:  MVTASPKINQ-PTSSLIWKHKSPKKVKVFLWSLAYRSLNTDEKLEKKFSQWSLSLYACRLCLKAAENLDHLFLHCEFAGSAWNFVARLLGISFCMPRKID
         +    +    P  SL+WK KSP KVKVF+W +A   +NT + +++K     LS   C LC    E++DHLFLHC F+ S W  + R +G  + +P+   
Subjt:  MVTASPKINQ-PTSSLIWKHKSPKKVKVFLWSLAYRSLNTDEKLEKKFSQWSLSLYACRLCLKAAENLDHLFLHCEFAGSAWNFVARLLGISFCMPRKID

Query:  EWLIEGLNAWNLKRKAKILASCAFRTTF
        ++L      W L +    L  C   + F
Subjt:  EWLIEGLNAWNLKRKAKILASCAFRTTF

A0A5D3BWG7 Transposon TX1 uncharacterized1.5e-2442.68Show/hide
Query:  KDCWCDEQPLKSLFSDLFLISNKE-ATIADHWSNDSQTWNLAFRRGLFDREIGR----------WQVDKIKDVNLVNDHNLIRWKLEGSGNYSTKSMFQS
        +D WC  QPL SL+ D++LIS+K+ A +A +W + +Q W+L  RR +FDREIG           W V++ +D         +R KLE SG +STKS F  
Subjt:  KDCWCDEQPLKSLFSDLFLISNKE-ATIADHWSNDSQTWNLAFRRGLFDREIGR----------WQVDKIKDVNLVNDHNLIRWKLEGSGNYSTKSMFQS

Query:  MVTASPKINQPTSSLIWKHKSPKKVKVFLWSLAYRSLNTDEKLEKKFSQWSLSLYAC
        +   + K+N P  +LIWK K P       WSLAYRSLN  EKL+++F  WSL    C
Subjt:  MVTASPKINQPTSSLIWKHKSPKKVKVFLWSLAYRSLNTDEKLEKKFSQWSLSLYAC

A0A5D3E632 Protein FAM91A16.9e-3052.24Show/hide
Query:  IRWKLEGSGNYSTKSMFQSMVTASPKINQPTSSLIWKHKSPKKVKVFLWSLAYRSLNTDEKLEKKFSQWSLSLYACRLCLKAAENLDHLFLHCEFAGSAW
        +RWKLE SG +STK +F  M   + K N  T +LIWK K  KKVK FLWSLAYRSLN   KL++KF   SLS   C LCLK AE  DHLFLHC+FA   W
Subjt:  IRWKLEGSGNYSTKSMFQSMVTASPKINQPTSSLIWKHKSPKKVKVFLWSLAYRSLNTDEKLEKKFSQWSLSLYACRLCLKAAENLDHLFLHCEFAGSAW

Query:  NFVARLLGISFCMPRKIDEWLIEGLNAWNLKRKA
        N +  L  +  C+P+KID+ + +GL   +   KA
Subjt:  NFVARLLGISFCMPRKIDEWLIEGLNAWNLKRKA

A0A5E4F2L4 zf-RVT domain-containing protein1.6e-2332.6Show/hide
Query:  SSFGKDCWCDEQPLKSLFSDLFLISNKEATIADHWSNDS---QTWNLAFRRGLFDREIGR--WQVDKIKDVNL-VNDHNLIRWKLEGSGNYSTKSMFQSM
        S F +D W  E  LK +F  LF +S+K     D + +       W+  FRR L +RE       ++ ++ + L  +  +  RW LE SG+++ KS FQS 
Subjt:  SSFGKDCWCDEQPLKSLFSDLFLISNKEATIADHWSNDS---QTWNLAFRRGLFDREIGR--WQVDKIKDVNL-VNDHNLIRWKLEGSGNYSTKSMFQSM

Query:  VTASPKINQ-PTSSLIWKHKSPKKVKVFLWSLAYRSLNTDEKLEKKFSQWSLSLYACRLCLKAAENLDHLFLHCEFAGSAWNFVARLLGISFCMPRKIDE
        +    +    P  SL+WK KSP KVKVF+W +A   +NT + +++K     LS   C LC    E++DHLFLHC F+ S W  + R +G  + + +   +
Subjt:  VTASPKINQ-PTSSLIWKHKSPKKVKVFLWSLAYRSLNTDEKLEKKFSQWSLSLYACRLCLKAAENLDHLFLHCEFAGSAWNFVARLLGISFCMPRKIDE

Query:  WLIEGLNAWNLKRKAKILASCAFRTTF
        +L      W L +    L  C   + F
Subjt:  WLIEGLNAWNLKRKAKILASCAFRTTF

M5XJT6 zf-RVT domain-containing protein (Fragment)1.5e-2433.77Show/hide
Query:  SSFGKDCWCDEQPLKSLFSDLFLISNKEATIADHWSNDSQ----TWNLAFRRGLFDREIGR--WQVDKIKDVNL-VNDHNLIRWKLEGSGNYSTKSMFQS
        S F +D W  E  LK +F  LF +S+K  T++ +   D+Q     W+  FRR L +RE       ++ ++ + L  +  +  RW LE SG+++ KS FQS
Subjt:  SSFGKDCWCDEQPLKSLFSDLFLISNKEATIADHWSNDSQ----TWNLAFRRGLFDREIGR--WQVDKIKDVNL-VNDHNLIRWKLEGSGNYSTKSMFQS

Query:  MVTASPKINQ-PTSSLIWKHKSPKKVKVFLWSLAYRSLNTDEKLEKKFSQWSLSLYACRLCLKAAENLDHLFLHCEFAGSAWNFVARLLGISFCMPRKID
         +    +    P  SL+WK KSP KVKVF+W +A   +NT + +++K     LS   C LC    E++DHLFLHC F+ S W  + R +G  + +P+   
Subjt:  MVTASPKINQ-PTSSLIWKHKSPKKVKVFLWSLAYRSLNTDEKLEKKFSQWSLSLYACRLCLKAAENLDHLFLHCEFAGSAWNFVARLLGISFCMPRKID

Query:  EWLIEGLNAWNLKRKAKILASCAFRTTF
        ++L      W L +    L  C   + F
Subjt:  EWLIEGLNAWNLKRKAKILASCAFRTTF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G25270.1 Ribonuclease H-like superfamily protein4.9e-0433.33Show/hide
Query:  IWKHKSPKKVKVFLWSLAYRSLNTDEKLEKKFSQWSLSLYACRLCLKAAENLDHLFLHCEFAGSAW
        IWK K+  K+K FLW L   +L T + L+++  +   +   C  C +  E   HLF  C +A   W
Subjt:  IWKHKSPKKVKVFLWSLAYRSLNTDEKLEKKFSQWSLSLYACRLCLKAAENLDHLFLHCEFAGSAW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGCTACGGCTTCTCGGAAACTAATAAGAGATGCTATCTGTAAAGGGGCGGTTTCACGTCCCTTCCAAGGCGCAAGACACTGCTCTTCTGCTTCTGACACCATTCA
TAAGATCCCTCATACTTCTAAGAAGTTTAGAAACCGGATTTCTGAGCGAGAAGACTTGATGCTTGATGGTGATTGTTTTGTTCTTGTGCTCATATTTGAGCAGTTTACTA
AATTATTGTTAAAAGCGGTAGAAGCATCAAGTTTTGGGAAGGACTGTTGGTGTGATGAGCAACCCCTTAAATCCCTTTTCTCAGATTTATTTCTTATCTCTAACAAGGAG
GCAACCATAGCAGACCATTGGAGTAATGACTCCCAAACATGGAACTTGGCTTTTAGAAGAGGCCTTTTTGATAGAGAAATCGGCCGTTGGCAGGTGGACAAAATTAAGGA
TGTGAATTTGGTGAATGACCACAACCTGATTAGATGGAAGTTGGAAGGCTCAGGTAATTACTCGACAAAGTCTATGTTTCAATCGATGGTTACTGCTTCTCCCAAAATAA
ATCAGCCCACGAGTAGCCTTATCTGGAAACATAAGAGCCCTAAAAAAGTGAAAGTTTTTTTGTGGTCCCTTGCTTATAGAAGCCTAAACACAGATGAGAAGTTGGAAAAG
AAGTTCAGCCAGTGGTCGTTATCCCTCTATGCTTGTAGGCTGTGTCTTAAGGCAGCAGAAAACTTAGACCATCTCTTCTTACACTGTGAATTTGCGGGGTCTGCTTGGAA
TTTCGTTGCAAGGCTGTTGGGAATCTCATTTTGTATGCCGAGGAAGATTGATGAATGGTTAATTGAAGGTTTGAATGCGTGGAACCTTAAGAGGAAAGCGAAGATTTTGG
CTAGTTGCGCGTTCAGGACTACTTTTGGCTTTCGTGGAAAGAAAGAAATGCTAGAACCTTCGAAGATAAGTGCGGTAGTTTCAATTCTTTTTGGATAA
mRNA sequenceShow/hide mRNA sequence
ATGGCGGCTACGGCTTCTCGGAAACTAATAAGAGATGCTATCTGTAAAGGGGCGGTTTCACGTCCCTTCCAAGGCGCAAGACACTGCTCTTCTGCTTCTGACACCATTCA
TAAGATCCCTCATACTTCTAAGAAGTTTAGAAACCGGATTTCTGAGCGAGAAGACTTGATGCTTGATGGTGATTGTTTTGTTCTTGTGCTCATATTTGAGCAGTTTACTA
AATTATTGTTAAAAGCGGTAGAAGCATCAAGTTTTGGGAAGGACTGTTGGTGTGATGAGCAACCCCTTAAATCCCTTTTCTCAGATTTATTTCTTATCTCTAACAAGGAG
GCAACCATAGCAGACCATTGGAGTAATGACTCCCAAACATGGAACTTGGCTTTTAGAAGAGGCCTTTTTGATAGAGAAATCGGCCGTTGGCAGGTGGACAAAATTAAGGA
TGTGAATTTGGTGAATGACCACAACCTGATTAGATGGAAGTTGGAAGGCTCAGGTAATTACTCGACAAAGTCTATGTTTCAATCGATGGTTACTGCTTCTCCCAAAATAA
ATCAGCCCACGAGTAGCCTTATCTGGAAACATAAGAGCCCTAAAAAAGTGAAAGTTTTTTTGTGGTCCCTTGCTTATAGAAGCCTAAACACAGATGAGAAGTTGGAAAAG
AAGTTCAGCCAGTGGTCGTTATCCCTCTATGCTTGTAGGCTGTGTCTTAAGGCAGCAGAAAACTTAGACCATCTCTTCTTACACTGTGAATTTGCGGGGTCTGCTTGGAA
TTTCGTTGCAAGGCTGTTGGGAATCTCATTTTGTATGCCGAGGAAGATTGATGAATGGTTAATTGAAGGTTTGAATGCGTGGAACCTTAAGAGGAAAGCGAAGATTTTGG
CTAGTTGCGCGTTCAGGACTACTTTTGGCTTTCGTGGAAAGAAAGAAATGCTAGAACCTTCGAAGATAAGTGCGGTAGTTTCAATTCTTTTTGGATAA
Protein sequenceShow/hide protein sequence
MAATASRKLIRDAICKGAVSRPFQGARHCSSASDTIHKIPHTSKKFRNRISEREDLMLDGDCFVLVLIFEQFTKLLLKAVEASSFGKDCWCDEQPLKSLFSDLFLISNKE
ATIADHWSNDSQTWNLAFRRGLFDREIGRWQVDKIKDVNLVNDHNLIRWKLEGSGNYSTKSMFQSMVTASPKINQPTSSLIWKHKSPKKVKVFLWSLAYRSLNTDEKLEK
KFSQWSLSLYACRLCLKAAENLDHLFLHCEFAGSAWNFVARLLGISFCMPRKIDEWLIEGLNAWNLKRKAKILASCAFRTTFGFRGKKEMLEPSKISAVVSILFG