; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi06G004210 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi06G004210
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr06:4635302..4637321
RNA-Seq ExpressionLsi06G004210
SyntenyLsi06G004210
Gene Ontology termsGO:0034641 - cellular nitrogen compound metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0044260 - cellular macromolecule metabolic process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0016740 - transferase activity (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF7115099.1 hypothetical protein RHSIM_RhsimUnG0064500 [Rhododendron simsii]8.3e-4844.21Show/hide
Query:  IWKTRMEDLLYCNNLHTPINGESNKPENMSKQDWELLNKRAVAYIRLWM---LYQYISDETSAYSLWKKLEELFERKTNENKLFLVKKLINLKYDEGTSI
        IWK RMED+LY  +L+ PI G++ KP+  S +DW++ N++AVA IR W+   ++ +++ ET+AY LW KLE ++ERKT +NK  L+++L+NLKY +G S+
Subjt:  IWKTRMEDLLYCNNLHTPINGESNKPENMSKQDWELLNKRAVAYIRLWM---LYQYISDETSAYSLWKKLEELFERKTNENKLFLVKKLINLKYDEGTSI

Query:  SSHLSEVQSIMNQLSSMKIVLDDELQALLLLSSLPDDWVRLVESLSNNDSSEKLSIDKVKSSLLNEELMRMALGSLAIESDFLKTKIQGRSRKDNS----
        + H+S+ Q ++NQLS+MK+VLD+ELQALLLLSSLPD W  LV ++SN+  S  +++D VK  + NEE  R   G+ +++++ L  + +GRSR  NS    
Subjt:  SSHLSEVQSIMNQLSSMKIVLDDELQALLLLSSLPDDWVRLVESLSNNDSSEKLSIDKVKSSLLNEELMRMALGSLAIESDFLKTKIQGRSRKDNS----

Query:  --------DCGEPARKNRRENQCYYCKKMGHKKVECRKWKKD
                       K R E +C++C KMGH + ECR  +K+
Subjt:  --------DCGEPARKNRRENQCYYCKKMGHKKVECRKWKKD

KAF7129225.1 hypothetical protein RHSIM_Rhsim10G0050800 [Rhododendron simsii]1.1e-4744.21Show/hide
Query:  IWKTRMEDLLYCNNLHTPINGESNKPENMSKQDWELLNKRAVAYIRLWM---LYQYISDETSAYSLWKKLEELFERKTNENKLFLVKKLINLKYDEGTSI
        IWK RMED+LY  +L+ PI G++ KP+  S +DW++ N++AVA IR W+   ++ +++ ET+AY LW KLE ++ERKT +NK  L+++L+NLKY +G S+
Subjt:  IWKTRMEDLLYCNNLHTPINGESNKPENMSKQDWELLNKRAVAYIRLWM---LYQYISDETSAYSLWKKLEELFERKTNENKLFLVKKLINLKYDEGTSI

Query:  SSHLSEVQSIMNQLSSMKIVLDDELQALLLLSSLPDDWVRLVESLSNNDSSEKLSIDKVKSSLLNEELMRMALGSLAIESDFLKTKIQGRSRKDNS----
        + H+S+ Q ++NQLS+MK+VLD+ELQALLLLSSLPD W  LV ++SN+  S  +++D VK  + NEE  R   G+ +++++ L  + +GRSR  NS    
Subjt:  SSHLSEVQSIMNQLSSMKIVLDDELQALLLLSSLPDDWVRLVESLSNNDSSEKLSIDKVKSSLLNEELMRMALGSLAIESDFLKTKIQGRSRKDNS----

Query:  --------DCGEPARKNRRENQCYYCKKMGHKKVECRKWKKD
                       K R E +C++C KMGH + ECR  +K+
Subjt:  --------DCGEPARKNRRENQCYYCKKMGHKKVECRKWKKD

KAF7129546.1 hypothetical protein RHSIM_Rhsim10G0154200 [Rhododendron simsii]1.8e-4744.21Show/hide
Query:  IWKTRMEDLLYCNNLHTPINGESNKPENMSKQDWELLNKRAVAYIRLWM---LYQYISDETSAYSLWKKLEELFERKTNENKLFLVKKLINLKYDEGTSI
        IWK RMED+LY  +L+ PI G++ KP+  S +DW++ N++AVA IR W+   ++ +++ ET+AY LW KLE ++ERKT +NK  L+++L+NLKY +G S+
Subjt:  IWKTRMEDLLYCNNLHTPINGESNKPENMSKQDWELLNKRAVAYIRLWM---LYQYISDETSAYSLWKKLEELFERKTNENKLFLVKKLINLKYDEGTSI

Query:  SSHLSEVQSIMNQLSSMKIVLDDELQALLLLSSLPDDWVRLVESLSNNDSSEKLSIDKVKSSLLNEELMRMALGSLAIESDFLKTKIQGRSRKDNS----
        + H+S+ Q ++NQLS+MK+VLD+ELQALLLLSSLPD W  LV ++SN+  S  +++D VK  + NEE  R   G+ +++++ L  + +GRSR  NS    
Subjt:  SSHLSEVQSIMNQLSSMKIVLDDELQALLLLSSLPDDWVRLVESLSNNDSSEKLSIDKVKSSLLNEELMRMALGSLAIESDFLKTKIQGRSRKDNS----

Query:  --------DCGEPARKNRRENQCYYCKKMGHKKVECRKWKKD
                       K R E +C++C KMGH + ECR  +K+
Subjt:  --------DCGEPARKNRRENQCYYCKKMGHKKVECRKWKKD

KAF7143526.1 hypothetical protein RHSIM_Rhsim05G0092400 [Rhododendron simsii]8.3e-4846.5Show/hide
Query:  WKTRMEDLLYCNNLHTPINGESNKPENMSKQDWELLNKRAVAYIRLWM---LYQYISDETSAYSLWKKLEELFERKTNENKLFLVKKLINLKYDEGTSIS
        WK +MEDLLYC +LH P+ G+  KP +M ++DW  LN++ V  IR W+   ++ ++S ETSAY LWKKLE L++RK+  NK FL KKL+NLKY EG SI+
Subjt:  WKTRMEDLLYCNNLHTPINGESNKPENMSKQDWELLNKRAVAYIRLWM---LYQYISDETSAYSLWKKLEELFERKTNENKLFLVKKLINLKYDEGTSIS

Query:  SHLSEVQSIMNQLSSMKIVLDDELQALLLLSSLPDDWVRLVESLSNNDSSEKLSIDKVKSSLLNEELMRMALGSLAIESDFLKTKIQGRSRKD---NSDC
         HL+E+ SI+NQL+SMKIV DDELQ L+LLSSLP+ W  LV ++SN+     +S+ +V SSLLNEE  R +  S   E+  +  + + R R     N D 
Subjt:  SHLSEVQSIMNQLSSMKIVLDDELQALLLLSSLPDDWVRLVESLSNNDSSEKLSIDKVKSSLLNEELMRMALGSLAIESDFLKTKIQGRSRKD---NSDC

Query:  GEPARK--------NRRENQCYYCKKMGHKKVECRKWKKDQYN
           + +        ++++ +C+YCKK GH K EC K K  + N
Subjt:  GEPARK--------NRRENQCYYCKKMGHKKVECRKWKKDQYN

KAG5549868.1 hypothetical protein RHGRI_014986 [Rhododendron griersonianum]1.2e-4947.7Show/hide
Query:  WKTRMEDLLYCNNLHTPINGESNKPENMSKQDWELLNKRAVAYIRLWM---LYQYISDETSAYSLWKKLEELFERKTNENKLFLVKKLINLKYDEGTSIS
        WK +MEDLLYC +LH P+ G++ KP +M ++DW +LN++AV  IR W+   ++ ++S ETSAY LWKKLE L++RK+  NK FL KKL+NLK+ EG SI+
Subjt:  WKTRMEDLLYCNNLHTPINGESNKPENMSKQDWELLNKRAVAYIRLWM---LYQYISDETSAYSLWKKLEELFERKTNENKLFLVKKLINLKYDEGTSIS

Query:  SHLSEVQSIMNQLSSMKIVLDDELQALLLLSSLPDDWVRLVESLSNNDSSEKLSIDKVKSSLLNEELMRMALGSLAIESDFLKTKIQGRSRKDNSDCGEP
         HL+E+ SI+NQL+SMKIV DDELQAL+LLSSLP+ W  LV ++SN+     +S  +V SSLLNEE  R + GS   E+  +  + + R R       + 
Subjt:  SHLSEVQSIMNQLSSMKIVLDDELQALLLLSSLPDDWVRLVESLSNNDSSEKLSIDKVKSSLLNEELMRMALGSLAIESDFLKTKIQGRSRKDNSDCGEP

Query:  AR-------KNRRENQCYYCKKMGHKKVECRKWKKDQYN
        +R        ++++ +C+YCKK GH K EC K K  + N
Subjt:  AR-------KNRRENQCYYCKKMGHKKVECRKWKKDQYN

TrEMBL top hitse value%identityAlignment
A0A2N9EZ52 Uncharacterized protein3.5e-4445.3Show/hide
Query:  IWKTRMEDLLYCNNLHTPINGESNKPENMSKQDWELLNKRAVAYIRLWM---LYQYISDETSAYSLWKKLEELFERKTNENKLFLVKKLINLKYDEGTSI
        IWK  MED+LYC +LH PI G+S KP +M  ++W  ++++ +  IR  +   ++ ++S ET A +LWKKLE L+ERKT +NK F ++KL +LK  EG S+
Subjt:  IWKTRMEDLLYCNNLHTPINGESNKPENMSKQDWELLNKRAVAYIRLWM---LYQYISDETSAYSLWKKLEELFERKTNENKLFLVKKLINLKYDEGTSI

Query:  SSHLSEVQSIMNQLSSMKIVLDDELQALLLLSSLPDDWVRLVESLSNNDSSEKLSIDKVKSSLLNEELMRMALGSLAIESDFLKTKIQGRSRKDNSDCGE
        + HLSE Q ++NQL+ M +V+DDELQALLLLSSLPD W  LV SLSN+  +  L +  VK SL N+E  R  +G    ++  L T+ +GRS+  NS    
Subjt:  SSHLSEVQSIMNQLSSMKIVLDDELQALLLLSSLPDDWVRLVESLSNNDSSEKLSIDKVKSSLLNEELMRMALGSLAIESDFLKTKIQGRSRKDNSDCGE

Query:  PAR---KNRRENQCYYCKKMGHKKVECRKWKKDQ
         +R   + + + +C+YC K GH K  C+ WK  Q
Subjt:  PAR---KNRRENQCYYCKKMGHKKVECRKWKKDQ

A0A438HI91 Retrovirus-related Pol polyprotein from transposon TNT 1-941.5e-4746.78Show/hide
Query:  MEDLLYCNNLHTPINGESNKPENMSKQDWELLNKRAVAYIRLWM---LYQYISDETSAYSLWKKLEELFERKTNENKLFLVKKLINLKYDEGTSISSHLS
        MEDLL+C +L+ PI G+S KPE M   +W+ L+++AV +IR W+   ++ ++S E SA+ LW KLE L++RKT  NK FL +KL+N KY EGT I+ HL+
Subjt:  MEDLLYCNNLHTPINGESNKPENMSKQDWELLNKRAVAYIRLWM---LYQYISDETSAYSLWKKLEELFERKTNENKLFLVKKLINLKYDEGTSISSHLS

Query:  EVQSIMNQLSSMKIVLDDELQALLLLSSLPDDWVRLVESLSNNDSSEKLSIDKVKSSLLNEELMRMALGSLAIESDFLKTKIQGRSRKDNS-----DCGE
        E++SI+NQL++MKI  DDELQALLLLSSLP+ W  LV ++SN+     +++ +V SSLLNEE  R + GS   E+  ++ + +GRS+   S       G 
Subjt:  EVQSIMNQLSSMKIVLDDELQALLLLSSLPDDWVRLVESLSNNDSSEKLSIDKVKSSLLNEELMRMALGSLAIESDFLKTKIQGRSRKDNS-----DCGE

Query:  PARKNRRENQCYYCKKMGHKKVECRKWKKDQYN
         +  ++++ +CYYC K GH K ECRK K  + N
Subjt:  PARKNRRENQCYYCKKMGHKKVECRKWKKDQYN

A0A4Y1QYG0 Uncharacterized protein6.4e-4644.63Show/hide
Query:  IWKTRMEDLLYCNNLHTPINGESNKPENMSKQDWELLNKRAVAYIRLWM---LYQYISDETSAYSLWKKLEELFERKTNENKLFLVKKLINLKYDEGTSI
        IW  RMED+LYC +L+ P+     KPE  S   W +LN++ V  IR W+   ++ ++S ET AY LW KL  ++ERKT +NK  ++++L+NLKY +G S+
Subjt:  IWKTRMEDLLYCNNLHTPINGESNKPENMSKQDWELLNKRAVAYIRLWM---LYQYISDETSAYSLWKKLEELFERKTNENKLFLVKKLINLKYDEGTSI

Query:  SSHLSEVQSIMNQLSSMKIVLDDELQALLLLSSLPDDWVRLVESLSNNDSSEKLSIDKVKSSLLNEELMRMALGSLAIESDFLKTKIQGRS------RKD
        + HLS+ Q ++N L++MK+VLDDELQAL+LLSSLPD W  LV SLSN+     L++D VK S+ NEE  R   G +A ES+ L ++ +GR+      R+D
Subjt:  SSHLSEVQSIMNQLSSMKIVLDDELQALLLLSSLPDDWVRLVESLSNNDSSEKLSIDKVKSSLLNEELMRMALGSLAIESDFLKTKIQGRS------RKD

Query:  NS-----DCGEPARKNRRENQCYYCKKMGHKKVECRKWKKDQ
         S     D      K R++ +CY+C  +GH K ECR +K++Q
Subjt:  NS-----DCGEPARKNRRENQCYYCKKMGHKKVECRKWKKDQ

A0A4Y1RJM3 CCHC-type domain-containing protein4.1e-4544.21Show/hide
Query:  IWKTRMEDLLYCNNLHTPINGESNKPENMSKQDWELLNKRAVAYIRLWM---LYQYISDETSAYSLWKKLEELFERKTNENKLFLVKKLINLKYDEGTSI
        IW  RMED+LYC +L+ P+     KP   S   W +LN++ V  IR W+   ++ ++S ET AY LW KL  ++ERKT +NK  ++++L+NLKY +G S+
Subjt:  IWKTRMEDLLYCNNLHTPINGESNKPENMSKQDWELLNKRAVAYIRLWM---LYQYISDETSAYSLWKKLEELFERKTNENKLFLVKKLINLKYDEGTSI

Query:  SSHLSEVQSIMNQLSSMKIVLDDELQALLLLSSLPDDWVRLVESLSNNDSSEKLSIDKVKSSLLNEELMRMALGSLAIESDFLKTKIQGRS------RKD
        + HLS+ Q ++N L++MK+VLDDELQAL+LLSSLPD W  LV SLSN+     L++D VK S+ NEE  R   G +A ES+ L ++ +GR+      R+D
Subjt:  SSHLSEVQSIMNQLSSMKIVLDDELQALLLLSSLPDDWVRLVESLSNNDSSEKLSIDKVKSSLLNEELMRMALGSLAIESDFLKTKIQGRS------RKD

Query:  NS-----DCGEPARKNRRENQCYYCKKMGHKKVECRKWKKDQ
         S     D      K R++ +CY+C  +GH K ECR +K++Q
Subjt:  NS-----DCGEPARKNRRENQCYYCKKMGHKKVECRKWKKDQ

A0A5J5B7H2 CCHC-type domain-containing protein1.2e-4443.22Show/hide
Query:  IWKTRMEDLLYCNNLHTPINGESNKPENMSKQDWELLNKRAVAYIRLWM---LYQYISDETSAYSLWKKLEELFERKTNENKLFLVKKLINLKYDEGTSI
        IWK +MED++YC +L+ PI G+  KP++M  + W+ L+++ +  IR W+   ++ ++S+ET A  LWKKLE  +E+KT  NK FL++KL+N+K+ EG SI
Subjt:  IWKTRMEDLLYCNNLHTPINGESNKPENMSKQDWELLNKRAVAYIRLWM---LYQYISDETSAYSLWKKLEELFERKTNENKLFLVKKLINLKYDEGTSI

Query:  SSHLSEVQSIMNQLSSMKIVLDDELQALLLLSSLPDDWVRLVESLSNNDSSEKLSIDKVKSSLLNEELMRMALGS----LAIESDFLKTKIQGRSRKDNS
          HL+E QS++NQL++MK+V++DELQA LLLSSLPD W  LV ++SN+    KLS+ +V SSL NEE  R   G+      +  +  ++K  G      S
Subjt:  SSHLSEVQSIMNQLSSMKIVLDDELQALLLLSSLPDDWVRLVESLSNNDSSEKLSIDKVKSSLLNEELMRMALGS----LAIESDFLKTKIQGRSRKDNS

Query:  DCGEPAR-KNRRENQCYYCKKMGHKKVECRKWKKDQ
             +R K+    +CY+C K GH K  C  WK++Q
Subjt:  DCGEPAR-KNRRENQCYYCKKMGHKKVECRKWKKDQ

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-942.8e-2234.32Show/hide
Query:  WKTRMEDLLYCNNLHTPINGESNKPENMSKQDWELLNKRAVAYIRLWM---LYQYISDETSAYSLWKKLEELFERKTNENKLFLVKKLINLKYDEGTSIS
        W+ RM DLL    LH  ++ +S KP+ M  +DW  L++RA + IRL +   +   I DE +A  +W +LE L+  KT  NKL+L K+L  L   EGT+  
Subjt:  WKTRMEDLLYCNNLHTPINGESNKPENMSKQDWELLNKRAVAYIRLWM---LYQYISDETSAYSLWKKLEELFERKTNENKLFLVKKLINLKYDEGTSIS

Query:  SHLSEVQSIMNQLSSMKIVLDDELQALLLLSSLPDDWVRLVESLSNNDSSEKLSIDKVKSSLLNEELMRMALGSLAIESDFLKTKIQGRS--RKDNSDCG
        SHL+    ++ QL+++ + +++E +A+LLL+SLP  +  L  ++ +  ++ +L  D   + LLNE++ +        +   L T+ +GRS  R  N+   
Subjt:  SHLSEVQSIMNQLSSMKIVLDDELQALLLLSSLPDDWVRLVESLSNNDSSEKLSIDKVKSSLLNEELMRMALGSLAIESDFLKTKIQGRS--RKDNSDCG

Query:  EPAR---KNR---RENQCYYCKKMGHKKVECRKWKK
          AR   KNR   R   CY C + GH K +C   +K
Subjt:  EPAR---KNR---RENQCYYCKKMGHKKVECRKWKK

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAAAGATATGGAAAACAAGAATGGAAGATCTTCTTTACTGTAACAACCTACATACTCCTATTAATGGTGAGTCAAACAAGCCAGAAAATATGAGCAAACAAGATTG
GGAGTTATTGAATAAACGAGCTGTTGCATATATTCGTTTGTGGATGTTGTACCAATACATTTCAGATGAGACGTCAGCATATTCGTTATGGAAGAAATTAGAAGAATTGT
TCGAGAGAAAAACAAATGAGAATAAACTTTTCTTGGTAAAAAAGCTTATCAACCTGAAATATGACGAGGGTACTTCAATTTCCAGTCATTTGAGTGAGGTGCAAAGCATA
ATGAATCAACTATCATCAATGAAGATAGTTTTAGATGATGAGTTGCAGGCTTTACTGCTTCTTAGTTCTTTGCCAGATGATTGGGTCAGGTTAGTAGAATCGTTGAGTAA
TAACGATTCTAGTGAGAAGTTGAGTATAGATAAGGTTAAGAGTAGTTTGTTAAATGAAGAATTAATGAGGATGGCATTGGGTTCTTTAGCTATAGAGTCGGATTTTTTGA
AGACAAAAATCCAAGGGAGAAGTCGAAAGGACAATAGTGATTGTGGCGAACCGGCAAGGAAAAACAGGAGAGAAAATCAATGCTATTATTGCAAGAAGATGGGACACAAG
AAAGTTGAATGTAGAAAATGGAAAAAGGACCAATATAATTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCAAAGATATGGAAAACAAGAATGGAAGATCTTCTTTACTGTAACAACCTACATACTCCTATTAATGGTGAGTCAAACAAGCCAGAAAATATGAGCAAACAAGATTG
GGAGTTATTGAATAAACGAGCTGTTGCATATATTCGTTTGTGGATGTTGTACCAATACATTTCAGATGAGACGTCAGCATATTCGTTATGGAAGAAATTAGAAGAATTGT
TCGAGAGAAAAACAAATGAGAATAAACTTTTCTTGGTAAAAAAGCTTATCAACCTGAAATATGACGAGGGTACTTCAATTTCCAGTCATTTGAGTGAGGTGCAAAGCATA
ATGAATCAACTATCATCAATGAAGATAGTTTTAGATGATGAGTTGCAGGCTTTACTGCTTCTTAGTTCTTTGCCAGATGATTGGGTCAGGTTAGTAGAATCGTTGAGTAA
TAACGATTCTAGTGAGAAGTTGAGTATAGATAAGGTTAAGAGTAGTTTGTTAAATGAAGAATTAATGAGGATGGCATTGGGTTCTTTAGCTATAGAGTCGGATTTTTTGA
AGACAAAAATCCAAGGGAGAAGTCGAAAGGACAATAGTGATTGTGGCGAACCGGCAAGGAAAAACAGGAGAGAAAATCAATGCTATTATTGCAAGAAGATGGGACACAAG
AAAGTTGAATGTAGAAAATGGAAAAAGGACCAATATAATTAG
Protein sequenceShow/hide protein sequence
MAKIWKTRMEDLLYCNNLHTPINGESNKPENMSKQDWELLNKRAVAYIRLWMLYQYISDETSAYSLWKKLEELFERKTNENKLFLVKKLINLKYDEGTSISSHLSEVQSI
MNQLSSMKIVLDDELQALLLLSSLPDDWVRLVESLSNNDSSEKLSIDKVKSSLLNEELMRMALGSLAIESDFLKTKIQGRSRKDNSDCGEPARKNRRENQCYYCKKMGHK
KVECRKWKKDQYN