CuGenDBv2

Lag0011083 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0011083
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Enzymatic polyprotein
Genome location: chr1:13930374..13931796
RNA-Seq Expression: Lag0011083
Synteny: Lag0011083
Gene Ontology terms:
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR001878 - Zinc finger, CCHC-type
IPR036875 - Zinc finger, CCHC-type superfamily
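
The CCHC-type zinc finger annotation can be spot-checked against the protein sequence listed under "Protein sequence". The sketch below assumes the commonly cited zinc knuckle consensus C-X2-C-X4-H-X4-C; InterPro's own signatures are profile-based, so a regular expression is only a rough approximation. The function name find_cchc and the variable fragment are illustrative, and the fragment is a short excerpt copied from the protein sequence of this record.

```python
import re

# Assumed consensus for a CCHC zinc knuckle: C-X2-C-X4-H-X4-C.
# This is an approximation, not the InterPro IPR001878 signature itself.
CCHC_PATTERN = re.compile(r"C.{2}C.{4}H.{4}C")


def find_cchc(protein: str):
    """Return (start, end, matched_text) for each non-overlapping motif hit."""
    return [(m.start(), m.end(), m.group()) for m in CCHC_PATTERN.finditer(protein)]


# Excerpt of the Lag0011083 protein sequence (hypothetical variable name).
fragment = "SKRNVQCYKCRSSGHYANKCPMKKK"

for start, end, motif in find_cchc(fragment):
    # Positions are 0-based and relative to the fragment, not the full protein.
    print(start, end, motif)
```

Running the same function on the full protein sequence reports positions relative to the whole protein.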


Homology
GenBank top hits (e value, % identity, alignment)
KYP65886.1 hypothetical protein KK1_012162 [Cajanus cajan] (e value: 8.0e-27, identity: 33.09%)
Query:  GLRLCN------ESNVQAKLCKSFGSRRAELGIFCDQYGCGKIRPPSTIKKSSIKRYRKYQE--RQNYRPYRQFQRKHYRKTYNPYNKKRPFYKGGSSKR
        G++ CN      +  +Q KL K  G+ + E+G FC+QYG   IR PS IK+ + ++   +Q+  +++Y  Y++  +K ++K  + + K + F K  S+K+
Subjt:  GLRLCN------ESNVQAKLCKSFGSRRAELGIFCDQYGCGKIRPPSTIKKSSIKRYRKYQE--RQNYRPYRQFQRKHYRKTYNPYNKKRPFYKGGSSKR

Query:  NVQCYKCRSSGHYANKCPMKKKINELEIDDDTKQQLLSI-IQSDSEESMIYSSDE--CQILELQEETDSYSSSDYNDEEYDGKRVCSGCIC----VLTKD
        N+ C+KC  + H+ANKC  ++KINELEID+  K+ L++I I S+SEE    S ++   +I+   EE  S S     D+   G  +C+   C    +LT D
Subjt:  NVQCYKCRSSGHYANKCPMKKKINELEIDDDTKQQLLSI-IQSDSEESMIYSSDE--CQILELQEETDSYSSSDYNDEEYDGKRVCSGCIC----VLTKD

Query:  Q-ETLLNVVDKIDNVELQKSILHSLKNSILEKRDQEEPSNFQPYEWKNVVKKFEEPKPVTIQDLQAEINILK
        Q   L+ ++ ++++  ++  ++  L+   L K+D++E    +  + K +  +F +P P+TI+DLQ EI ILK
Subjt:  Q-ETLLNVVDKIDNVELQKSILHSLKNSILEKRDQEEPSNFQPYEWKNVVKKFEEPKPVTIQDLQAEINILK

KYP65886.1 hypothetical protein KK1_012162 [Cajanus cajan] (e value: 4.7e-03, identity: 50%)
Query:  MFIASNAYRQRGKRDHEIAQLLVAGFTRQLKQWWDQYLTDYDREGL
        M +A++AY++RG  D   A  LV GFT QLK WWD++ T  DRE +
Subjt:  MFIASNAYRQRGKRDHEIAQLLVAGFTRQLKQWWDQYLTDYDREGL

XP_022933039.1 uncharacterized protein LOC111439730 [Cucurbita moschata] (e value: 5.9e-38, identity: 46.32%)
Query:  EGLRLCNESNVQAKLCKSFGSRRAELGIFCDQYGCGKIRPPSTIKKSSIKRYRKYQERQNYRPYRQFQRKHYRKTYNPYNKKRPF-YKGGSSKRNVQCYK
        EGLRL NES +Q KL  S  S R ELG FCDQYGC  I  PST  ++ +K + K     +YRP   ++ K  +     Y++++    K    K+   C+K
Subjt:  EGLRLCNESNVQAKLCKSFGSRRAELGIFCDQYGCGKIRPPSTIKKSSIKRYRKYQERQNYRPYRQFQRKHYRKTYNPYNKKRPF-YKGGSSKRNVQCYK

Query:  CRSSGHYANKCPMKKKINELEIDDDTKQQLLSIIQSDSEESMIYSSDECQILELQEETDSYSSSDYNDEEYDGKRVCSGCICVLTKDQETLLNVVDKIDN
        CR  GHYANKCP++ KINEL+ID + K QLL +  +DSE+     S E +ILELQEE+DSYSS++Y  E+ +GKR C GCI VLTKDQE LL VV+K+ +
Subjt:  CRSSGHYANKCPMKKKINELEIDDDTKQQLLSIIQSDSEESMIYSSDECQILELQEETDSYSSSDYNDEEYDGKRVCSGCICVLTKDQETLLNVVDKIDN

Query:  VELQKSILHSLKNSI-LEKRDQEEPSNFQPY
         E+Q+ I   L++++ + K  + E  N  PY
Subjt:  VELQKSILHSLKNSI-LEKRDQEEPSNFQPY

XP_023520850.1 uncharacterized protein LOC111784362 [Cucurbita pepo subsp. pepo] (e value: 1.0e-34, identity: 47.85%)
Query:  EGLRLCNESNVQAKLCKSFGSRRAELGIFCDQYGCGKIRPPSTIKKSSIKR----YRKYQERQNYR--PYRQFQRKHYRKTYNPYNKKRPFYKGGSSKRN
        EGLRLCNES +Q KL  S  S R ELG FCDQYGC  I  P T ++  +K     Y  Y+ R+ YR  P +  +  + R+ Y P    R        K+ 
Subjt:  EGLRLCNESNVQAKLCKSFGSRRAELGIFCDQYGCGKIRPPSTIKKSSIKR----YRKYQERQNYR--PYRQFQRKHYRKTYNPYNKKRPFYKGGSSKRN

Query:  VQCYKCRSSGHYANKCPMKKKINELEIDDDTKQQLLSIIQSDSEESMIYSSDECQILELQEETDSYSSSDYNDEEYDGKRVCSGCICVLTKDQETLLNVV
          C+KCR  GHYA KCP+K KINEL+ID + K QLL +  ++SE+     S E +IL+LQEE+DS SS+ Y  E+ +GKR C GCI VLTKDQE LL VV
Subjt:  VQCYKCRSSGHYANKCPMKKKINELEIDDDTKQQLLSIIQSDSEESMIYSSDECQILELQEETDSYSSSDYNDEEYDGKRVCSGCICVLTKDQETLLNVV

Query:  DKIDNVELQ
        +K+ + E+Q
Subjt:  DKIDNVELQ

XP_023521035.1 uncharacterized protein LOC111784623 [Cucurbita pepo subsp. pepo] (e value: 1.8e-31, identity: 32.61%)
Query:  MFIASNAYRQRG-KRDHEIAQLLVAGFTRQLKQWWDQYLTDYDREGL----------RLCNESNVQAKL-------------------------------
        M +A+ AY+ +G K DH+IAQ+LV GFT QLK WWD+YL +  R+ +          ++  E     +                                
Subjt:  MFIASNAYRQRG-KRDHEIAQLLVAGFTRQLKQWWDQYLTDYDREGL----------RLCNESNVQAKL-------------------------------

Query:  ----------CKSFG------------------------------------SRR---------------------AELGIFCDQYGCGKIRPPSTIKKSS
                  C + G                                    SRR                      ELG FCDQYGC  I  PST ++  
Subjt:  ----------CKSFG------------------------------------SRR---------------------AELGIFCDQYGCGKIRPPSTIKKSS

Query:  IKR----YRKYQERQNYR--PYRQFQRKHYRKTYNPYNKKRPFYKGGSSKRNVQCYKCRSSGHYANKCPMKKKINELEIDDDTKQQLLSIIQSDSEESMI
        +K     Y  Y+ R+ YR  P +  +  + R+ Y P    R        K+   C+KCR  GHYAN+CP++ KINEL+ID + K QLL +  +DSE+   
Subjt:  IKR----YRKYQERQNYR--PYRQFQRKHYRKTYNPYNKKRPFYKGGSSKRNVQCYKCRSSGHYANKCPMKKKINELEIDDDTKQQLLSIIQSDSEESMI

Query:  YSSDECQILELQEETDSYSSSDYNDEEYDGKRVCSGCICVLTKDQETLLNVVDKIDNVELQKSILHSLKNS
          S E +ILELQEE+DSYSS++Y   + +GKR C GCI VLTKDQE LL VV+K    ++QK    SL +S
Subjt:  YSSDECQILELQEETDSYSSSDYNDEEYDGKRVCSGCICVLTKDQETLLNVVDKIDNVELQKSILHSLKNS

XP_023552915.1 uncharacterized protein LOC111810441 [Cucurbita pepo subsp. pepo] (e value: 2.5e-28, identity: 42.99%)
Query:  EGLRLCNESNVQAKLCKSFGSRRAELGIFCDQYGCGKIRPPSTIKKSSIKR----YRKYQERQNYR--PYRQFQRKHYRKTYNPYNKKRPFYKGGSSKRN
        EGLRLCNES +Q KL  S  S R ELG FCDQYGC  I  PST ++   K     Y  Y+ R+ YR  P +  +  + R+ Y P        K    K+ 
Subjt:  EGLRLCNESNVQAKLCKSFGSRRAELGIFCDQYGCGKIRPPSTIKKSSIKR----YRKYQERQNYR--PYRQFQRKHYRKTYNPYNKKRPFYKGGSSKRN

Query:  VQCYKCRSSGHYANKCPMKKKINELEIDDDTKQQLLSIIQSDSEESMIYSSDECQILELQEETDSYSSSDYNDEEYDGKRVCSGCICVLTKDQETLLNVV
          C+KCR  GHYANKCP++ KINELEID + K QLL +  +DSE+     S + +ILELQEE+DSYS+++Y  E+ +GKR   GC     +   T    +
Subjt:  VQCYKCRSSGHYANKCPMKKKINELEIDDDTKQQLLSIIQSDSEESMIYSSDECQILELQEETDSYSSSDYNDEEYDGKRVCSGCICVLTKDQETLLNVV

Query:  DKIDNVELQKSILHSLKNSIL
         K+ +  L+ S L ++KN  L
Subjt:  DKIDNVELQKSILHSLKNSIL

TrEMBL top hits (e value, % identity, alignment)
A0A151TFW5 CCHC-type domain-containing protein (e value: 3.9e-27, identity: 33.09%)
Query:  GLRLCN------ESNVQAKLCKSFGSRRAELGIFCDQYGCGKIRPPSTIKKSSIKRYRKYQE--RQNYRPYRQFQRKHYRKTYNPYNKKRPFYKGGSSKR
        G++ CN      +  +Q KL K  G+ + E+G FC+QYG   IR PS IK+ + ++   +Q+  +++Y  Y++  +K ++K  + + K + F K  S+K+
Subjt:  GLRLCN------ESNVQAKLCKSFGSRRAELGIFCDQYGCGKIRPPSTIKKSSIKRYRKYQE--RQNYRPYRQFQRKHYRKTYNPYNKKRPFYKGGSSKR

Query:  NVQCYKCRSSGHYANKCPMKKKINELEIDDDTKQQLLSI-IQSDSEESMIYSSDE--CQILELQEETDSYSSSDYNDEEYDGKRVCSGCIC----VLTKD
        N+ C+KC  + H+ANKC  ++KINELEID+  K+ L++I I S+SEE    S ++   +I+   EE  S S     D+   G  +C+   C    +LT D
Subjt:  NVQCYKCRSSGHYANKCPMKKKINELEIDDDTKQQLLSI-IQSDSEESMIYSSDE--CQILELQEETDSYSSSDYNDEEYDGKRVCSGCIC----VLTKD

Query:  Q-ETLLNVVDKIDNVELQKSILHSLKNSILEKRDQEEPSNFQPYEWKNVVKKFEEPKPVTIQDLQAEINILK
        Q   L+ ++ ++++  ++  ++  L+   L K+D++E    +  + K +  +F +P P+TI+DLQ EI ILK
Subjt:  Q-ETLLNVVDKIDNVELQKSILHSLKNSILEKRDQEEPSNFQPYEWKNVVKKFEEPKPVTIQDLQAEINILK

A0A151TFW5 CCHC-type domain-containing protein (e value: 2.3e-03, identity: 50%)
Query:  MFIASNAYRQRGKRDHEIAQLLVAGFTRQLKQWWDQYLTDYDREGL
        M +A++AY++RG  D   A  LV GFT QLK WWD++ T  DRE +
Subjt:  MFIASNAYRQRGKRDHEIAQLLVAGFTRQLKQWWDQYLTDYDREGL

A0A151TFW5 CCHC-type domain-containing protein (e value: 3.3e-26, identity: 37.41%)
Query:  LRLCNESNVQAKLCKSFGSRRAELGIFCDQYGCGKIRPPSTIKKSSIKRYRKYQERQNYRPYRQFQRKHYRKTYNPYNKKRPFYKGG-----SSKRNVQC
        + LC E+    K+ K     R ELG FC QYG    + P   KK   KRY            R+F RK+ +   +P  ++R +YKG      SSK N  C
Subjt:  LRLCNESNVQAKLCKSFGSRRAELGIFCDQYGCGKIRPPSTIKKSSIKRYRKYQERQNYRPYRQFQRKHYRKTYNPYNKKRPFYKGG-----SSKRNVQC

Query:  YKCRSSGHYANKCPMKKKINELEIDDDTKQQLLSIIQSDSEES--MIYSSDECQILELQEETDS-----YSSSDYNDEE--------YDGKRVCSGCICV
        +KC   GHYAN+CP+K KIN L +D++TKQ LL  I++D E S     SS+E  I  LQEE  S     YS S+ +D+E          GK  CSG I V
Subjt:  YKCRSSGHYANKCPMKKKINELEIDDDTKQQLLSIIQSDSEES--MIYSSDECQILELQEETDS-----YSSSDYNDEE--------YDGKRVCSGCICV

Query:  LTKDQETLLNVVDKIDNVELQKSILHSLKNSILEKRDQEEPSNFQPYEWKNVVK--KFEEPKPVTIQDLQAEINILKR
        +TKDQETL +++++I +   +++ L  LK S+ E+  Q+   N   Y +++++   K E   P+ ++DL  E+ ILK+
Subjt:  LTKDQETLLNVVDKIDNVELQKSILHSLKNSILEKRDQEEPSNFQPYEWKNVVK--KFEEPKPVTIQDLQAEINILKR

A0A5A7VRE0 Reverse transcriptase (e value: 3.3e-26, identity: 37.41%)
Query:  LRLCNESNVQAKLCKSFGSRRAELGIFCDQYGCGKIRPPSTIKKSSIKRYRKYQERQNYRPYRQFQRKHYRKTYNPYNKKRPFYKGG-----SSKRNVQC
        + LC E+    K+ K     R ELG FC QYG    + P   KK   KRY            R+F RK+ +   +P  ++R +YKG      SSK N  C
Subjt:  LRLCNESNVQAKLCKSFGSRRAELGIFCDQYGCGKIRPPSTIKKSSIKRYRKYQERQNYRPYRQFQRKHYRKTYNPYNKKRPFYKGG-----SSKRNVQC

Query:  YKCRSSGHYANKCPMKKKINELEIDDDTKQQLLSIIQSDSEES--MIYSSDECQILELQEETDS-----YSSSDYNDEE--------YDGKRVCSGCICV
        +KC   GHYAN+CP+K KIN L +D++TKQ LL  I++D E S     SS+E  I  LQEE  S     YS S+ +D+E          GK  CSG I V
Subjt:  YKCRSSGHYANKCPMKKKINELEIDDDTKQQLLSIIQSDSEES--MIYSSDECQILELQEETDS-----YSSSDYNDEE--------YDGKRVCSGCICV

Query:  LTKDQETLLNVVDKIDNVELQKSILHSLKNSILEKRDQEEPSNFQPYEWKNVVK--KFEEPKPVTIQDLQAEINILKR
        +TKDQETL +++++I +   +++ L  LK S+ E+  Q+   N   Y +++++   K E   P+ ++DL  E+ ILK+
Subjt:  LTKDQETLLNVVDKIDNVELQKSILHSLKNSILEKRDQEEPSNFQPYEWKNVVK--KFEEPKPVTIQDLQAEINILKR

A0A5D3DZV1 Enzymatic polyprotein (e value: 3.9e-27, identity: 33.33%)
Query:  ASNAYRQRGKRDHEIAQLLVAGFTRQLKQWWDQYLTDYDREGLRLCNESNVQ-------------------AKLCKSFGSRRAELGIFCDQYGC--GKIR
        A+  Y  R K  +E  Q+L+ GF   L+ WW   LT+ D++ + +   + V+                    K+ K     R ELG FC QYG   G   
Subjt:  ASNAYRQRGKRDHEIAQLLVAGFTRQLKQWWDQYLTDYDREGLRLCNESNVQ-------------------AKLCKSFGSRRAELGIFCDQYGC--GKIR

Query:  PPSTIKKSSIKRYRKYQERQNYRPYRQFQRKHYRKTYNPYNKKRPFYKGGSSKRNVQCYKCRSSGHYANKCPMKKKINELEIDDDTKQQLLSIIQSDSEE
          +  KK S K+     + ++  P R  +RKHY   YN    K+ +    S K N  C+KC   GHYAN+CP+K +IN L ID++TKQ LL  I++D++ 
Subjt:  PPSTIKKSSIKRYRKYQERQNYRPYRQFQRKHYRKTYNPYNKKRPFYKGGSSKRNVQCYKCRSSGHYANKCPMKKKINELEIDDDTKQQLLSIIQSDSEE

Query:  SMIYSS----DECQIL---ELQEETDSYSSSDYNDEE--------YDGKRVCSGCICVLTKDQETLLNVVDKIDNVELQKSILHSLKNSILEKRDQEEPS
        S    S    D   IL   E   E + YS S+ +D+E          GK  CSG I V+T+DQETL +++++I + E + + L  L+ S+ E+  Q+   
Subjt:  SMIYSS----DECQIL---ELQEETDSYSSSDYNDEE--------YDGKRVCSGCICVLTKDQETLLNVVDKIDNVELQKSILHSLKNSILEKRDQEEPS

Query:  NFQPYEWKNVVK--KFEEPKPVTIQDLQAEINILKR
        N   Y +++++   K E   PV ++DL  E+ ILKR
Subjt:  NFQPYEWKNVVK--KFEEPKPVTIQDLQAEINILKR

A0A6J1EYM2 uncharacterized protein LOC111439730 (e value: 2.8e-38, identity: 46.32%)
Query:  EGLRLCNESNVQAKLCKSFGSRRAELGIFCDQYGCGKIRPPSTIKKSSIKRYRKYQERQNYRPYRQFQRKHYRKTYNPYNKKRPF-YKGGSSKRNVQCYK
        EGLRL NES +Q KL  S  S R ELG FCDQYGC  I  PST  ++ +K + K     +YRP   ++ K  +     Y++++    K    K+   C+K
Subjt:  EGLRLCNESNVQAKLCKSFGSRRAELGIFCDQYGCGKIRPPSTIKKSSIKRYRKYQERQNYRPYRQFQRKHYRKTYNPYNKKRPF-YKGGSSKRNVQCYK

Query:  CRSSGHYANKCPMKKKINELEIDDDTKQQLLSIIQSDSEESMIYSSDECQILELQEETDSYSSSDYNDEEYDGKRVCSGCICVLTKDQETLLNVVDKIDN
        CR  GHYANKCP++ KINEL+ID + K QLL +  +DSE+     S E +ILELQEE+DSYSS++Y  E+ +GKR C GCI VLTKDQE LL VV+K+ +
Subjt:  CRSSGHYANKCPMKKKINELEIDDDTKQQLLSIIQSDSEESMIYSSDECQILELQEETDSYSSSDYNDEEYDGKRVCSGCICVLTKDQETLLNVVDKIDN

Query:  VELQKSILHSLKNSI-LEKRDQEEPSNFQPY
         E+Q+ I   L++++ + K  + E  N  PY
Subjt:  VELQKSILHSLKNSI-LEKRDQEEPSNFQPY

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGTTCATTGCCTCAAACGCTTATCGACAAAGAGGCAAAAGAGATCATGAAATAGCCCAACTGCTTGTTGCAGGATTTACAAGGCAATTGAAACAATGGTGGGATCAATA
TCTGACAGATTACGATCGTGAAGGTCTTCGTCTCTGTAATGAATCAAATGTTCAAGCAAAACTTTGTAAATCCTTTGGATCAAGACGAGCTGAGTTGGGAATTTTCTGCG
ATCAATATGGATGTGGAAAAATTCGTCCACCATCCACAATAAAAAAATCTTCTATCAAAAGGTATCGAAAATATCAAGAACGACAAAACTATAGACCATACAGGCAGTTT
CAAAGGAAGCATTATAGAAAGACGTATAATCCTTATAATAAAAAACGACCTTTCTACAAAGGAGGAAGCTCCAAAAGAAACGTCCAGTGTTACAAATGTAGATCAAGTGG
ACATTATGCCAATAAATGTCCAATGAAAAAGAAGATCAATGAATTAGAAATTGATGATGATACAAAGCAACAACTTTTGAGTATCATTCAATCTGATTCAGAAGAGTCAA
TGATTTATAGTTCGGATGAGTGTCAAATTCTTGAATTACAAGAAGAAACAGATTCGTATTCTAGTAGCGATTATAATGACGAAGAATATGATGGCAAACGAGTCTGCTCA
GGATGCATATGTGTCCTGACAAAAGACCAAGAAACTCTACTCAACGTTGTTGACAAAATAGACAACGTCGAACTCCAAAAATCAATCTTACACAGTCTTAAAAACTCGAT
TCTTGAAAAAAGAGATCAAGAAGAACCAAGTAACTTTCAACCTTATGAATGGAAGAATGTTGTCAAAAAATTTGAAGAACCAAAACCAGTTACCATTCAAGATCTTCAAG
CTGAGATTAATATTCTAAAGAGATAA
mRNA sequence
ATGTTCATTGCCTCAAACGCTTATCGACAAAGAGGCAAAAGAGATCATGAAATAGCCCAACTGCTTGTTGCAGGATTTACAAGGCAATTGAAACAATGGTGGGATCAATA
TCTGACAGATTACGATCGTGAAGGTCTTCGTCTCTGTAATGAATCAAATGTTCAAGCAAAACTTTGTAAATCCTTTGGATCAAGACGAGCTGAGTTGGGAATTTTCTGCG
ATCAATATGGATGTGGAAAAATTCGTCCACCATCCACAATAAAAAAATCTTCTATCAAAAGGTATCGAAAATATCAAGAACGACAAAACTATAGACCATACAGGCAGTTT
CAAAGGAAGCATTATAGAAAGACGTATAATCCTTATAATAAAAAACGACCTTTCTACAAAGGAGGAAGCTCCAAAAGAAACGTCCAGTGTTACAAATGTAGATCAAGTGG
ACATTATGCCAATAAATGTCCAATGAAAAAGAAGATCAATGAATTAGAAATTGATGATGATACAAAGCAACAACTTTTGAGTATCATTCAATCTGATTCAGAAGAGTCAA
TGATTTATAGTTCGGATGAGTGTCAAATTCTTGAATTACAAGAAGAAACAGATTCGTATTCTAGTAGCGATTATAATGACGAAGAATATGATGGCAAACGAGTCTGCTCA
GGATGCATATGTGTCCTGACAAAAGACCAAGAAACTCTACTCAACGTTGTTGACAAAATAGACAACGTCGAACTCCAAAAATCAATCTTACACAGTCTTAAAAACTCGAT
TCTTGAAAAAAGAGATCAAGAAGAACCAAGTAACTTTCAACCTTATGAATGGAAGAATGTTGTCAAAAAATTTGAAGAACCAAAACCAGTTACCATTCAAGATCTTCAAG
CTGAGATTAATATTCTAAAGAGATAA
Protein sequence
MFIASNAYRQRGKRDHEIAQLLVAGFTRQLKQWWDQYLTDYDREGLRLCNESNVQAKLCKSFGSRRAELGIFCDQYGCGKIRPPSTIKKSSIKRYRKYQERQNYRPYRQF
QRKHYRKTYNPYNKKRPFYKGGSSKRNVQCYKCRSSGHYANKCPMKKKINELEIDDDTKQQLLSIIQSDSEESMIYSSDECQILELQEETDSYSSSDYNDEEYDGKRVCS
GCICVLTKDQETLLNVVDKIDNVELQKSILHSLKNSILEKRDQEEPSNFQPYEWKNVVKKFEEPKPVTIQDLQAEINILKR
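
The relationship between the CDS and the protein sequence above can be checked by translating the CDS with the standard genetic code. The following is a minimal sketch, not part of the database record: the names CODON_TABLE and translate are illustrative, and the two input strings are short in-frame excerpts copied from the start and end of the CDS rather than the full sequence.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 36 nt and last 9 nt of the Lag0011083 CDS (both in frame).
print(translate("ATGTTCATTGCCTCAAACGCTTATCGACAAAGAGGC"))
print(translate("AAGAGATAA"))
```

To translate the full CDS, join its lines into a single string with no line breaks before passing it to translate.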