; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0004934 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0004934
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Descriptionzf-RVT domain-containing protein
Genome locationchr6:8699455..8700200
RNA-Seq ExpressionLag0004934
SyntenyLag0004934
Gene Ontology termsNA
InterPro domainsIPR026960 - Reverse transcriptase zinc-binding domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
VVA38592.1 PREDICTED: reverse mRNAase, partial [Prunus dulcis]3.5e-1332.18Show/hide
Query:  MEAKDILNIPLGDKNSNDQIIWALEKRGFFSVKSAYRLAMEEASLKE-ASQSDPSKLAKVWKSLWEAKASPRAKICAWKAINDIIPTKANIGKK---RDI
        ++  DI+ IPL  +   D+I+W  +K G F+VKSAYR+A+   S  E  S S  S  + +W+ +W A    + KI AW+  +DI+PTKAN+ KK      
Subjt:  MEAKDILNIPLGDKNSNDQIIWALEKRGFFSVKSAYRLAMEEASLKE-ASQSDPSKLAKVWKSLWEAKASPRAKICAWKAINDIIPTKANIGKK---RDI

Query:  SCCLCG------LH-----------W---------------TPMDYWHWLAENSSKDELAWAIILIWSIWNARS
         C  CG      LH           W               T  D+   + E +SK +    I L+W+IW  R+
Subjt:  SCCLCG------LH-----------W---------------TPMDYWHWLAENSSKDELAWAIILIWSIWNARS

XP_022149515.1 uncharacterized protein LOC111017927 [Momordica charantia]2.7e-1343.69Show/hide
Query:  EAKDILNIPLGDKNSNDQIIWALEKRGFFSVKSAYRLAMEEASLKEASQSDPSKLAKVWKSLWEAKASPRAKICAWKAINDIIPTKANIGKKRDI---SC
        EA  ILNIPL   N +D++IW  +K+  FSVKS YRL +  AS  E   S+  + AK WK LW      + KIC W+  NDII T A + KK  +    C
Subjt:  EAKDILNIPLGDKNSNDQIIWALEKRGFFSVKSAYRLAMEEASLKEASQSDPSKLAKVWKSLWEAKASPRAKICAWKAINDIIPTKANIGKKRDI---SC

Query:  CLC
          C
Subjt:  CLC

XP_023911662.1 uncharacterized protein LOC112023276 [Quercus suber]1.6e-1333.09Show/hide
Query:  MEAKDILNIPLGDKNSNDQIIWALEKRGFFSVKSAYRLAMEEA-SLKEASQSDPSKLAKVWKSLWEAKASPRAKICAWKAINDIIPTKANIGKKRDISCC
        +EA+ I +I L  +   D++IWA    GFF+VKSAY++A+E A +L   S SD S L + WK +W+ +   +     W+A  DI+PTK N+ +++ +   
Subjt:  MEAKDILNIPLGDKNSNDQIIWALEKRGFFSVKSAYRLAMEEA-SLKEASQSDPSKLAKVWKSLWEAKASPRAKICAWKAINDIIPTKANIGKKRDISCC

Query:  LCGLHWTPMDYWHWLAENSSKDELAWAIILIWSIWNARS
           L W  M     + ++  +D +   + + WSIW  R+
Subjt:  LCGLHWTPMDYWHWLAENSSKDELAWAIILIWSIWNARS

XP_023923255.1 uncharacterized protein LOC112034669 [Quercus suber]4.6e-1334.59Show/hide
Query:  EAKDILNIPLGDKNSNDQIIWALEKRGFFSVKSAYRLAMEEA-SLKEASQSDPSKLAKVWKSLWEAKASPRAKICAWKAINDIIPTKANIGKKR---DIS
        +A  IL+ PL ++   ++IIWA  K G FSV+SAYRLAMEE    +    SD S + +VWK +W  +   + +   WKA ++I+ TK N+ K+    D  
Subjt:  EAKDILNIPLGDKNSNDQIIWALEKRGFFSVKSAYRLAMEEA-SLKEASQSDPSKLAKVWKSLWEAKASPRAKICAWKAINDIIPTKANIGKKR---DIS

Query:  CCLCGLHWTPMD-YWHWLAENSSKDELAWAIILI-WSIWNARSRATSNNIRIEAEHLLA
        C  CG  W  +D  W  +  +S+   L   +  I W IW  R       I ++   L A
Subjt:  CCLCGLHWTPMD-YWHWLAENSSKDELAWAIILI-WSIWNARSRATSNNIRIEAEHLLA

XP_024155779.1 uncharacterized protein LOC112163738 [Rosa chinensis]9.3e-1428.24Show/hide
Query:  ILNIPLGDKNSNDQIIWALEKRGFFSVKSAYRLAME-EASLKEASQSDPSKLAKVWKSLWEAKASPRAKICAWKAINDIIPTKANI---GKKRDISCCLC
        +L+IPL  +   D+I W L+KRG FSVKSAY +A +       AS S+    A +WK+LW+A    +  I  W+A ++++PT+  +   G   D++CC+C
Subjt:  ILNIPLGDKNSNDQIIWALEKRGFFSVKSAYRLAME-EASLKEASQSDPSKLAKVWKSLWEAKASPRAKICAWKAINDIIPTKANI---GKKRDISCCLC

Query:  --------------------------GLHWTPMDYWHWL---AENSSKDELAWAIILIWSIWNARSRATSNNIRIEAEHLLAAVNLAW-NEIERSNQKKG
                                   L  +P+ +  WL   A N S       ++L+WS W  R+     N R +    L A ++AW  E  ++N+   
Subjt:  --------------------------GLHWTPMDYWHWL---AENSSKDELAWAIILIWSIWNARSRATSNNIRIEAEHLLAAVNLAW-NEIERSNQKKG

Query:  IDKARNQASHATWKKP
           +  Q +   W  P
Subjt:  IDKARNQASHATWKKP

TrEMBL top hitse value%identityAlignment
A0A2N9H1N4 RNase H domain-containing protein2.0e-1428.9Show/hide
Query:  EAKDILNIPLGDKNSNDQIIWALEKRGFFSVKSAYRLAMEEASLKEASQSDPSKLAKVWKSLWEAKASPRAKICAWKAINDIIPTKANIGKKR---DISC
        EA  IL IPL  +N  D ++W   K G +SVKS Y L + ++  +E+  SDPS+++++WKS+W     P+ +   W+A ++ +PT++N+  +    D  C
Subjt:  EAKDILNIPLGDKNSNDQIIWALEKRGFFSVKSAYRLAMEEASLKEASQSDPSKLAKVWKSLWEAKASPRAKICAWKAINDIIPTKANIGKKR---DISC

Query:  CLCG------LH-----------WTPMDYWHWLAENS---------------SKDELAWAIILIWSIWNARSR
         +C       +H           W  + +   LAE++               S  EL    ++ WSIW  R+R
Subjt:  CLCG------LH-----------WTPMDYWHWLAENS---------------SKDELAWAIILIWSIWNARSR

A0A2N9J5D9 CCHC-type domain-containing protein7.7e-1427.98Show/hide
Query:  ILNIPLGDKNSNDQIIWALEKRGFFSVKSAYRLAMEEASLKEASQSDPSKLAKVWKSLWEAKASPRAKICAWKAINDIIPTKANI---GKKRDISCCLCG
        I  IPL  +   D +IW+  K+G F+VKSAY + + ++S  EA  S   +L+  WK+LW  + +P+ K+ AW+A  +I+PTK  +   G     SC  C 
Subjt:  ILNIPLGDKNSNDQIIWALEKRGFFSVKSAYRLAMEEASLKEASQSDPSKLAKVWKSLWEAKASPRAKICAWKAINDIIPTKANI---GKKRDISCCLCG

Query:  LHWTPMDYWHWLAENSSK--------------------DELAW------------AIILIWSIWNARSRAT-SNNIRIEAEHLLAAVNLAWNEIERSNQK
             +D+  W  E + K                    D LA              I   WS+W AR+     + +   A+  L A   A N +E  N +
Subjt:  LHWTPMDYWHWLAENSSK--------------------DELAW------------AIILIWSIWNARSRAT-SNNIRIEAEHLLAAVNLAWNEIERSNQK

Query:  KGIDKARNQASHATWKKP
             AR       W  P
Subjt:  KGIDKARNQASHATWKKP

A0A2N9J6I3 Uncharacterized protein1.1e-1530.11Show/hide
Query:  ILNIPLGDKNSNDQIIWALEKRGFFSVKSAYRLAMEEASLKEASQSDPSKLAKVWKSLWEAKASPRAKICAWKAINDIIPTKANI---GKKRDISCCLCG
        I  IPL  +   D +IW+  K+G F+VKSAY + + ++S  EA  S   +L+  WK+LW  + +P+ K+ AW+A  +I+PTK  +   G     +C  C 
Subjt:  ILNIPLGDKNSNDQIIWALEKRGFFSVKSAYRLAMEEASLKEASQSDPSKLAKVWKSLWEAKASPRAKICAWKAINDIIPTKANI---GKKRDISCCLCG

Query:  LHWTPMDYWHWLAENSSKDELAWAIILIWSIWNARSRAT-SNNIRIEAEHLLAAVNLAWNEIERSNQKKGIDKARNQASHATWKKP
             +D+  W        E+   I   WS+W AR+     + +   A+  L A   A N +E  N +     AR       W  P
Subjt:  LHWTPMDYWHWLAENSSKDELAWAIILIWSIWNARSRAT-SNNIRIEAEHLLAAVNLAWNEIERSNQKKGIDKARNQASHATWKKP

M5XSK0 Reverse transcriptase domain-containing protein7.0e-1529.9Show/hide
Query:  ILNIPLGDKNSNDQIIWALEKRGFFSVKSAYRLAME-EASLKEASQSDPSKLAKVWKSLWEAKASPRAKICAWKAINDIIPTKANIGKKR---DISCCLC
        I +IPL  + + D ++W  +K+G F+VKSAY +A    +S   AS S+   +A+ W  LW+A    R K   W+ I+ I+PTKAN+ +K+   D  C LC
Subjt:  ILNIPLGDKNSNDQIIWALEKRGFFSVKSAYRLAME-EASLKEASQSDPSKLAKVWKSLWEAKASPRAKICAWKAINDIIPTKANIGKKR---DISCCLC

Query:  ------------------GLHWTPMDYWHWLAENSSKDELAWAIILIWSIWNARSRATSNNIRIEAEHLLAAVNLAWNEIERSNQKKGIDKARNQASHAT
                          G H +P D+    AE  S  + A  +++ W+IW AR+    NN +   E +    +L  ++  R +   G    + Q     
Subjt:  ------------------GLHWTPMDYWHWLAENSSKDELAWAIILIWSIWNARSRATSNNIRIEAEHLLAAVNLAWNEIERSNQKKGIDKARNQASHAT

Query:  WKKP
        W+ P
Subjt:  WKKP

M5Y023 zf-RVT domain-containing protein (Fragment)1.2e-1427.85Show/hide
Query:  ILNIPLGDKNSNDQIIWALEKRGFFSVKSAYRLAME-EASLKEASQSDPSKLAKVWKSLWEAKASPRAKICAWKAINDIIPTKANIGKKR---DISCCLC
        I +IPL  + + D ++W  +K+G F+VKSAY +A    +S   AS S+   +A+ W  LW+A    R K   W+ I+ I+PTKAN+ +K+   D  C LC
Subjt:  ILNIPLGDKNSNDQIIWALEKRGFFSVKSAYRLAME-EASLKEASQSDPSKLAKVWKSLWEAKASPRAKICAWKAINDIIPTKANIGKKR---DISCCLC

Query:  ---------------------------------GLHWTPMDYWHWLAENSSKDELAWAIILIWSIWNARSRATSNNIRIEAEHLLAAVNLAWNEIERSNQ
                                         G H +P D+    AE  S  + A  +++ W+IW AR+    NN +   E +    +L   +  R + 
Subjt:  ---------------------------------GLHWTPMDYWHWLAENSSKDELAWAIILIWSIWNARSRATSNNIRIEAEHLLAAVNLAWNEIERSNQ

Query:  KKGIDKARNQASHATWKKP
          G    + Q     W+ P
Subjt:  KKGIDKARNQASHATWKKP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGGCCAAAGATATCCTGAACATCCCTCTCGGTGACAAGAACTCAAACGATCAGATCATATGGGCCTTGGAAAAAAGAGGTTTTTTTTCAGTTAAAAGTGCTTACCG
TTTGGCTATGGAAGAAGCTTCCCTCAAAGAAGCCTCCCAATCGGATCCCTCTAAACTTGCAAAGGTTTGGAAATCTTTATGGGAAGCAAAAGCTAGTCCTAGAGCCAAAA
TTTGTGCTTGGAAAGCTATCAACGATATAATCCCAACAAAGGCCAATATAGGAAAAAAAAGGGATATCTCTTGTTGTTTATGCGGGCTACATTGGACACCTATGGACTAT
TGGCATTGGTTGGCAGAAAATTCAAGCAAGGATGAATTAGCCTGGGCGATTATCCTAATTTGGTCTATCTGGAATGCAAGGAGCAGAGCAACATCAAACAACATAAGAAT
TGAAGCAGAGCACCTCCTAGCAGCAGTCAATTTAGCGTGGAATGAGATAGAAAGATCTAACCAGAAGAAGGGGATCGACAAGGCGAGGAACCAGGCGAGTCATGCCACTT
GGAAAAAACCCCTCAAACATGTGGAAGCTGAACACGGATGCATCCTGGTTCGCGAACTCGGGAAGAGGCGGCTTGGGATGGACGGTGCGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAGGCCAAAGATATCCTGAACATCCCTCTCGGTGACAAGAACTCAAACGATCAGATCATATGGGCCTTGGAAAAAAGAGGTTTTTTTTCAGTTAAAAGTGCTTACCG
TTTGGCTATGGAAGAAGCTTCCCTCAAAGAAGCCTCCCAATCGGATCCCTCTAAACTTGCAAAGGTTTGGAAATCTTTATGGGAAGCAAAAGCTAGTCCTAGAGCCAAAA
TTTGTGCTTGGAAAGCTATCAACGATATAATCCCAACAAAGGCCAATATAGGAAAAAAAAGGGATATCTCTTGTTGTTTATGCGGGCTACATTGGACACCTATGGACTAT
TGGCATTGGTTGGCAGAAAATTCAAGCAAGGATGAATTAGCCTGGGCGATTATCCTAATTTGGTCTATCTGGAATGCAAGGAGCAGAGCAACATCAAACAACATAAGAAT
TGAAGCAGAGCACCTCCTAGCAGCAGTCAATTTAGCGTGGAATGAGATAGAAAGATCTAACCAGAAGAAGGGGATCGACAAGGCGAGGAACCAGGCGAGTCATGCCACTT
GGAAAAAACCCCTCAAACATGTGGAAGCTGAACACGGATGCATCCTGGTTCGCGAACTCGGGAAGAGGCGGCTTGGGATGGACGGTGCGTGA
Protein sequenceShow/hide protein sequence
MEAKDILNIPLGDKNSNDQIIWALEKRGFFSVKSAYRLAMEEASLKEASQSDPSKLAKVWKSLWEAKASPRAKICAWKAINDIIPTKANIGKKRDISCCLCGLHWTPMDY
WHWLAENSSKDELAWAIILIWSIWNARSRATSNNIRIEAEHLLAAVNLAWNEIERSNQKKGIDKARNQASHATWKKPLKHVEAEHGCILVRELGKRRLGMDGA