; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0041914 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0041914
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr13:31278651..31279229
RNA-Seq ExpressionLag0041914
SyntenyLag0041914
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0043186.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]3.2e-2940.18Show/hide
Query:  MDAAKPLSENLDEFKKMTTEFKNLGEKIGDENEAFVLLNSLPDSYKEVKNALKYGRVTITTDAIISAIKIKELELMAVKK--ETSDGLFVKGKTR-NKEV
        M+  K L ENLDEFKK T       EK+G   EA +L+NS+ D+YKEVK ALKYGR TIT +++I+A+K KELEL    K    ++ LF KGK    K  
Subjt:  MDAAKPLSENLDEFKKMTTEFKNLGEKIGDENEAFVLLNSLPDSYKEVKNALKYGRVTITTDAIISAIKIKELELMAVKK--ETSDGLFVKGKTR-NKEV

Query:  KQQTEDKGKPKVKCNYCHKEGHIKREC--------------------YSLKRKNQYHRSKKNKQSE-----ASVGENSITYSDALATTDQRSEQQDSKEK
        K Q   K KP +KC  CHKEGH KR C                        R   Y R  + +  E       VG  +  Y++ L  T++++ + D++E+
Subjt:  KQQTEDKGKPKVKCNYCHKEGHIKREC--------------------YSLKRKNQYHRSKKNKQSE-----ASVGENSITYSDALATTDQRSEQQDSKEK

Query:  HDWVIDSGCSFHMTPSKGY
         DWV+DSGC++HMT  K +
Subjt:  HDWVIDSGCSFHMTPSKGY

KAA0050719.1 putative gag-pol polyprotein [Cucumis melo var. makuwa]4.6e-2840.1Show/hide
Query:  MDAAKPLSENLDEFKKMTTEFKNLGEKIGDENEAFVLLNSLPDSYKEVKNALKYGRVTITTDAIISAIKIKELELMAVKKETSDG--LFVKGKT-----R
        MD +K L ENLDEF+K+  +  N+GEK+ DEN+A +LLNSLP++Y+EVK A+KYGR ++T   ++ A+K + LE   +KKE  DG  L  +G++     +
Subjt:  MDAAKPLSENLDEFKKMTTEFKNLGEKIGDENEAFVLLNSLPDSYKEVKNALKYGRVTITTDAIISAIKIKELELMAVKKETSDG--LFVKGKT-----R

Query:  NKEVKQQTEDKGKPKVKCNYCHKEGHIKRECYSLKRKNQYHRSKKNKQSEASV--GENSITYSDALATTDQRSEQQD-----SKEKHD-WVIDSGCSFHM
         KE   +++ KGK + KC  CHKEGH K+ C         ++S++   SEA+V  G NS   +D   + +   E  +      ++  D W++DSGC+FHM
Subjt:  NKEVKQQTEDKGKPKVKCNYCHKEGHIKRECYSLKRKNQYHRSKKNKQSEASV--GENSITYSDALATTDQRSEQQD-----SKEKHD-WVIDSGCSFHM

Query:  TPSKGYL
        TP + +L
Subjt:  TPSKGYL

TYK12279.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]3.2e-2940.18Show/hide
Query:  MDAAKPLSENLDEFKKMTTEFKNLGEKIGDENEAFVLLNSLPDSYKEVKNALKYGRVTITTDAIISAIKIKELELMAVKK--ETSDGLFVKGKTR-NKEV
        M+  K L ENLDEFKK T       EK+G   EA +L+NS+ D+YKEVK ALKYGR TIT +++I+A+K KELEL    K    ++ LF KGK    K  
Subjt:  MDAAKPLSENLDEFKKMTTEFKNLGEKIGDENEAFVLLNSLPDSYKEVKNALKYGRVTITTDAIISAIKIKELELMAVKK--ETSDGLFVKGKTR-NKEV

Query:  KQQTEDKGKPKVKCNYCHKEGHIKREC--------------------YSLKRKNQYHRSKKNKQSE-----ASVGENSITYSDALATTDQRSEQQDSKEK
        K Q   K KP +KC  CHKEGH KR C                        R   Y R  + +  E       VG  +  Y++ L  T++++ + D++E+
Subjt:  KQQTEDKGKPKVKCNYCHKEGHIKREC--------------------YSLKRKNQYHRSKKNKQSE-----ASVGENSITYSDALATTDQRSEQQDSKEK

Query:  HDWVIDSGCSFHMTPSKGY
         DWV+DSGC++HMT  K +
Subjt:  HDWVIDSGCSFHMTPSKGY

TYK25306.1 putative gag-pol polyprotein [Cucumis melo var. makuwa]4.6e-2840.1Show/hide
Query:  MDAAKPLSENLDEFKKMTTEFKNLGEKIGDENEAFVLLNSLPDSYKEVKNALKYGRVTITTDAIISAIKIKELELMAVKKETSDG--LFVKGKT-----R
        MD +K L ENLDEF+K+  +  N+GEK+ DEN+A +LLNSLP++Y+EVK A+KYGR ++T   ++ A+K + LE   +KKE  DG  L  +G++     +
Subjt:  MDAAKPLSENLDEFKKMTTEFKNLGEKIGDENEAFVLLNSLPDSYKEVKNALKYGRVTITTDAIISAIKIKELELMAVKKETSDG--LFVKGKT-----R

Query:  NKEVKQQTEDKGKPKVKCNYCHKEGHIKRECYSLKRKNQYHRSKKNKQSEASV--GENSITYSDALATTDQRSEQQD-----SKEKHD-WVIDSGCSFHM
         KE   +++ KGK + KC  CHKEGH K+ C         ++S++   SEA+V  G NS   +D   + +   E  +      ++  D W++DSGC+FHM
Subjt:  NKEVKQQTEDKGKPKVKCNYCHKEGHIKRECYSLKRKNQYHRSKKNKQSEASV--GENSITYSDALATTDQRSEQQD-----SKEKHD-WVIDSGCSFHM

Query:  TPSKGYL
        TP + +L
Subjt:  TPSKGYL

XP_038885928.1 uncharacterized protein LOC120076236 [Benincasa hispida]1.9e-2949.39Show/hide
Query:  MDAAKPLSENLDEFKKMTTEFKNLGEKIGDENEAFVLLNSLPDSYKEVKNALKYGRVTITTDAIISAIKIKELELMAVKKE--TSDGLFVKGKTRNKEVK
        MD AK L++NL+EFK ++++F+++G+ IG+ENEAF+LLNSLP+++K+VK ALKYGR  ITT AIISA+ +KELEL   KK+    +G F KG  +     
Subjt:  MDAAKPLSENLDEFKKMTTEFKNLGEKIGDENEAFVLLNSLPDSYKEVKNALKYGRVTITTDAIISAIKIKELELMAVKKE--TSDGLFVKGKTRNKEVK

Query:  QQTEDKGKPKVKCNYCHKEGHIKRECYSLKRK-NQYHRSKKNKQSEASVGENSITYSDALATTD
              G+       C ++  +K++CY+LKRK NQ  ++K  KQ+EA+VGENS+ YSDALA T+
Subjt:  QQTEDKGKPKVKCNYCHKEGHIKRECYSLKRK-NQYHRSKKNKQSEASVGENSITYSDALATTD

TrEMBL top hitse value%identityAlignment
A0A5A7TMB4 Pentatricopeptide repeat-containing protein1.5e-2940.18Show/hide
Query:  MDAAKPLSENLDEFKKMTTEFKNLGEKIGDENEAFVLLNSLPDSYKEVKNALKYGRVTITTDAIISAIKIKELELMAVKK--ETSDGLFVKGKTR-NKEV
        M+  K L ENLDEFKK T       EK+G   EA +L+NS+ D+YKEVK ALKYGR TIT +++I+A+K KELEL    K    ++ LF KGK    K  
Subjt:  MDAAKPLSENLDEFKKMTTEFKNLGEKIGDENEAFVLLNSLPDSYKEVKNALKYGRVTITTDAIISAIKIKELELMAVKK--ETSDGLFVKGKTR-NKEV

Query:  KQQTEDKGKPKVKCNYCHKEGHIKREC--------------------YSLKRKNQYHRSKKNKQSE-----ASVGENSITYSDALATTDQRSEQQDSKEK
        K Q   K KP +KC  CHKEGH KR C                        R   Y R  + +  E       VG  +  Y++ L  T++++ + D++E+
Subjt:  KQQTEDKGKPKVKCNYCHKEGHIKREC--------------------YSLKRKNQYHRSKKNKQSE-----ASVGENSITYSDALATTDQRSEQQDSKEK

Query:  HDWVIDSGCSFHMTPSKGY
         DWV+DSGC++HMT  K +
Subjt:  HDWVIDSGCSFHMTPSKGY

A0A5A7UB25 Putative gag-pol polyprotein2.2e-2840.1Show/hide
Query:  MDAAKPLSENLDEFKKMTTEFKNLGEKIGDENEAFVLLNSLPDSYKEVKNALKYGRVTITTDAIISAIKIKELELMAVKKETSDG--LFVKGKT-----R
        MD +K L ENLDEF+K+  +  N+GEK+ DEN+A +LLNSLP++Y+EVK A+KYGR ++T   ++ A+K + LE   +KKE  DG  L  +G++     +
Subjt:  MDAAKPLSENLDEFKKMTTEFKNLGEKIGDENEAFVLLNSLPDSYKEVKNALKYGRVTITTDAIISAIKIKELELMAVKKETSDG--LFVKGKT-----R

Query:  NKEVKQQTEDKGKPKVKCNYCHKEGHIKRECYSLKRKNQYHRSKKNKQSEASV--GENSITYSDALATTDQRSEQQD-----SKEKHD-WVIDSGCSFHM
         KE   +++ KGK + KC  CHKEGH K+ C         ++S++   SEA+V  G NS   +D   + +   E  +      ++  D W++DSGC+FHM
Subjt:  NKEVKQQTEDKGKPKVKCNYCHKEGHIKRECYSLKRKNQYHRSKKNKQSEASV--GENSITYSDALATTDQRSEQQD-----SKEKHD-WVIDSGCSFHM

Query:  TPSKGYL
        TP + +L
Subjt:  TPSKGYL

A0A5D3CPM8 Pentatricopeptide repeat-containing protein1.5e-2940.18Show/hide
Query:  MDAAKPLSENLDEFKKMTTEFKNLGEKIGDENEAFVLLNSLPDSYKEVKNALKYGRVTITTDAIISAIKIKELELMAVKK--ETSDGLFVKGKTR-NKEV
        M+  K L ENLDEFKK T       EK+G   EA +L+NS+ D+YKEVK ALKYGR TIT +++I+A+K KELEL    K    ++ LF KGK    K  
Subjt:  MDAAKPLSENLDEFKKMTTEFKNLGEKIGDENEAFVLLNSLPDSYKEVKNALKYGRVTITTDAIISAIKIKELELMAVKK--ETSDGLFVKGKTR-NKEV

Query:  KQQTEDKGKPKVKCNYCHKEGHIKREC--------------------YSLKRKNQYHRSKKNKQSE-----ASVGENSITYSDALATTDQRSEQQDSKEK
        K Q   K KP +KC  CHKEGH KR C                        R   Y R  + +  E       VG  +  Y++ L  T++++ + D++E+
Subjt:  KQQTEDKGKPKVKCNYCHKEGHIKREC--------------------YSLKRKNQYHRSKKNKQSE-----ASVGENSITYSDALATTDQRSEQQDSKEK

Query:  HDWVIDSGCSFHMTPSKGY
         DWV+DSGC++HMT  K +
Subjt:  HDWVIDSGCSFHMTPSKGY

A0A5D3DNU1 Putative gag-pol polyprotein2.2e-2840.1Show/hide
Query:  MDAAKPLSENLDEFKKMTTEFKNLGEKIGDENEAFVLLNSLPDSYKEVKNALKYGRVTITTDAIISAIKIKELELMAVKKETSDG--LFVKGKT-----R
        MD +K L ENLDEF+K+  +  N+GEK+ DEN+A +LLNSLP++Y+EVK A+KYGR ++T   ++ A+K + LE   +KKE  DG  L  +G++     +
Subjt:  MDAAKPLSENLDEFKKMTTEFKNLGEKIGDENEAFVLLNSLPDSYKEVKNALKYGRVTITTDAIISAIKIKELELMAVKKETSDG--LFVKGKT-----R

Query:  NKEVKQQTEDKGKPKVKCNYCHKEGHIKRECYSLKRKNQYHRSKKNKQSEASV--GENSITYSDALATTDQRSEQQD-----SKEKHD-WVIDSGCSFHM
         KE   +++ KGK + KC  CHKEGH K+ C         ++S++   SEA+V  G NS   +D   + +   E  +      ++  D W++DSGC+FHM
Subjt:  NKEVKQQTEDKGKPKVKCNYCHKEGHIKRECYSLKRKNQYHRSKKNKQSEASV--GENSITYSDALATTDQRSEQQD-----SKEKHD-WVIDSGCSFHM

Query:  TPSKGYL
        TP + +L
Subjt:  TPSKGYL

A0A5D3DVM0 Retrovirus-related Pol polyprotein from transposon TNT 1-942.9e-2838.64Show/hide
Query:  MDAAKPLSENLDEFKKMTTEFKNLGEKIGDENEAFVLLNSLPDSYKEVKNALKYGRVTITTDAIISAIKIKELELMAVKK--ETSDGLFVKGKT--RNKE
        M+  K L ENLDEFKK+T       EK+G E+EA +L+N + D+YKEVK +LKYGR TIT +++I+A+K KELEL    K    ++ LF KG    R   
Subjt:  MDAAKPLSENLDEFKKMTTEFKNLGEKIGDENEAFVLLNSLPDSYKEVKNALKYGRVTITTDAIISAIKIKELELMAVKK--ETSDGLFVKGKT--RNKE

Query:  VKQQTEDKGKPKVKCNYCHKEGHIKREC--------------------YSLKRKNQYHRSKKNKQSE-----ASVGENSITYSDALATTDQRSEQQDSKE
         K Q   + KP +KC  CHKEGH KR C                        R   Y R  + +  E       VG  +  Y+  LA T++R+ + + +E
Subjt:  VKQQTEDKGKPKVKCNYCHKEGHIKREC--------------------YSLKRKNQYHRSKKNKQSE-----ASVGENSITYSDALATTDQRSEQQDSKE

Query:  KHDWVIDSGCSFHMTPSKGY
        + DWV+DSGC+++MT  K +
Subjt:  KHDWVIDSGCSFHMTPSKGY

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-944.1e-1125.93Show/hide
Query:  NLDEFKKMTTEFKNLGEKIGDENEAFVLLNSLPDSYKEVKNALKYGRVTITTDAIISAIKIKELELMAVKKETSDGLFVKGKTRNKE----------VKQ
        +L+ F  + T+  NLG KI +E++A +LLNSLP SY  +   + +G+ TI    + SA+ + E ++    +     L  +G+ R+ +           + 
Subjt:  NLDEFKKMTTEFKNLGEKIGDENEAFVLLNSLPDSYKEVKNALKYGRVTITTDAIISAIKIKELELMAVKKETSDGLFVKGKTRNKE----------VKQ

Query:  QTEDKGKPKVK-CNYCHKEGHIKRECYSLKRKNQYHRSKKNKQSEASVGENSITYSDALATTDQRSEQQDSKEKHDWVIDSGCSFHMTP
        +++++ K +V+ C  C++ GH KR+C + ++       +KN  + A++ +N+      L   ++      S  + +WV+D+  S H TP
Subjt:  QTEDKGKPKVK-CNYCHKEGHIKRECYSLKRKNQYHRSKKNKQSEASVGENSITYSDALATTDQRSEQQDSKEKHDWVIDSGCSFHMTP

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATGCAGCGAAACCATTGTCAGAAAATCTTGATGAGTTCAAGAAAATGACTACCGAGTTCAAGAACCTAGGTGAAAAGATAGGAGATGAGAACGAGGCATTTGTGTT
ACTTAATTCACTTCCAGATTCATATAAAGAAGTGAAAAATGCCCTCAAGTATGGGAGAGTGACTATTACTACCGATGCAATAATTTCAGCTATTAAAATCAAAGAGTTGG
AATTGATGGCTGTAAAGAAAGAAACCTCAGATGGACTTTTTGTTAAGGGTAAAACCAGAAACAAAGAAGTCAAGCAACAGACAGAAGATAAAGGGAAACCAAAAGTGAAA
TGCAATTATTGCCACAAGGAAGGGCATATCAAGAGAGAATGCTACTCCTTGAAGAGAAAGAACCAATACCATCGATCTAAGAAGAATAAACAATCCGAGGCTTCAGTTGG
AGAGAACTCCATTACATATTCGGATGCTTTGGCTACTACAGACCAAAGAAGTGAGCAACAAGACTCAAAGGAGAAGCATGATTGGGTGATAGATTCGGGATGCTCATTCC
ATATGACTCCTTCAAAAGGCTACCTATAG
mRNA sequenceShow/hide mRNA sequence
ATGGATGCAGCGAAACCATTGTCAGAAAATCTTGATGAGTTCAAGAAAATGACTACCGAGTTCAAGAACCTAGGTGAAAAGATAGGAGATGAGAACGAGGCATTTGTGTT
ACTTAATTCACTTCCAGATTCATATAAAGAAGTGAAAAATGCCCTCAAGTATGGGAGAGTGACTATTACTACCGATGCAATAATTTCAGCTATTAAAATCAAAGAGTTGG
AATTGATGGCTGTAAAGAAAGAAACCTCAGATGGACTTTTTGTTAAGGGTAAAACCAGAAACAAAGAAGTCAAGCAACAGACAGAAGATAAAGGGAAACCAAAAGTGAAA
TGCAATTATTGCCACAAGGAAGGGCATATCAAGAGAGAATGCTACTCCTTGAAGAGAAAGAACCAATACCATCGATCTAAGAAGAATAAACAATCCGAGGCTTCAGTTGG
AGAGAACTCCATTACATATTCGGATGCTTTGGCTACTACAGACCAAAGAAGTGAGCAACAAGACTCAAAGGAGAAGCATGATTGGGTGATAGATTCGGGATGCTCATTCC
ATATGACTCCTTCAAAAGGCTACCTATAG
Protein sequenceShow/hide protein sequence
MDAAKPLSENLDEFKKMTTEFKNLGEKIGDENEAFVLLNSLPDSYKEVKNALKYGRVTITTDAIISAIKIKELELMAVKKETSDGLFVKGKTRNKEVKQQTEDKGKPKVK
CNYCHKEGHIKRECYSLKRKNQYHRSKKNKQSEASVGENSITYSDALATTDQRSEQQDSKEKHDWVIDSGCSFHMTPSKGYL