CuGenDBv2

Lag0039775 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0039775
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: chr2:49664895..49665793
RNA-Seq Expression: Lag0039775
Synteny: Lag0039775
Gene Ontology terms:
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
  IPR011016 - Zinc finger, RING-CH-type
  IPR036875 - Zinc finger, CCHC-type superfamily
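The genome location in the record follows a `chromosome:start..end` pattern. As a minimal sketch of how such a string might be split into its parts, the hypothetical helper `parse_location` below is an illustration only and is not part of CuGenDB. It assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end, span).

    Assumption: coordinates are 1-based and inclusive, so the span
    is end - start + 1. The source record does not confirm this.
    """
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return chrom, start, end, end - start + 1


# Location string copied from the record above.
print(parse_location("chr2:49664895..49665793"))
# → ('chr2', 49664895, 49665793, 899)
```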


Homology
GenBank top hits (e value, % identity, alignment)
KAA0043186.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] (e value: 2.7e-28; identity: 36.04%)
Query:  SNSKLGDENEAFVLLNSLLDAYKEVKTTLKYGRENITIDVIIPAIRTKELEILSQKK--ETSEGLFVKSKPKGR-DNKHHPEERNKAKIRCNYCHKEGHL
        +  KLG   EA +L+NS+ D YKEVKT LKYGRE IT++ +I A+++KELE+ ++ K    +E LF K K   R +NK+    ++K  ++C  CHKEGH 
Subjt:  SNSKLGDENEAFVLLNSLLDAYKEVKTTLKYGRENITIDVIIPAIRTKELEILSQKK--ETSEGLFVKSKPKGR-DNKHHPEERNKAKIRCNYCHKEGHL

Query:  KMDC-----------------YSLKRKNQNQRFKKNKPPK--------VVVGENSITYSNTLATSDQCSSDQSSFEKHDWVIDSGCSFHMTLSKDWFSTY
        K +C                 Y  +  N+N+ +++    +          VG  +  Y+  L  +++ + +  + E+ DWV+DSGC++HMT  K+WF  Y
Subjt:  KMDC-----------------YSLKRKNQNQRFKKNKPPK--------VVVGENSITYSNTLATSDQCSSDQSSFEKHDWVIDSGCSFHMTLSKDWFSTY

Query:  REWDGGIVYMGNNNSCRVIGIG
        +  +G  VYMGNN    +IG+G
Subjt:  REWDGGIVYMGNNNSCRVIGIG

KAG6420009.1 hypothetical protein SASPL_116523 [Salvia splendens] (e value: 1.4e-24; identity: 33.5%)
Query:  EHSNSKLGDENEAFVLLNSLLDAYKEVKTTLKYGRENITIDVIIPAIRTKE---LEILSQKKE-TSEGLFVKSKPKGRDNKHHPEERNKAK---IRCNYC
        E+ + K+ +E++A +LL SL  + K  + T+ YG++ I+   +  A+++KE    +I  +     +EGLFV+ +P+ +D ++  + ++K+K    +CNYC
Subjt:  EHSNSKLGDENEAFVLLNSLLDAYKEVKTTLKYGRENITIDVIIPAIRTKE---LEILSQKKE-TSEGLFVKSKPKGRDNKHHPEERNKAK---IRCNYC

Query:  HKEGHLKMDCYSLKRKNQNQRFKKNKPPKVVVGENSITYSNTLATSDQCSSDQSSFEKHDWVIDSGCSFHMTLSKDWFSTYREWDGGIVYMGNNNSCRVI
         K+GH+K DC+ LK K +N   + N   +  V  +    +  +A+++   S        +W++DSGCS+H+  ++D FSTY  +DGG V MGNN+ C+V+
Subjt:  HKEGHLKMDCYSLKRKNQNQRFKKNKPPKVVVGENSITYSNTLATSDQCSSDQSSFEKHDWVIDSGCSFHMTLSKDWFSTYREWDGGIVYMGNNNSCRVI

Query:  GIG
        G G
Subjt:  GIG

PON41343.1 Zinc finger, CCHC-type [Parasponia andersonii] (e value: 1.1e-26; identity: 38.12%)
Query:  EHSNSKLGDENEAFVLLNSLLDAYKEVKTTLKYGRE-NITIDVIIPAIRTKELEILSQKKE--TSEGLFVKSKPKGRDNK---HHPEERNKAKIRCNYCH
        E+   KL DE++A +LLN+L  AY+  K  + YGRE  IT+D +  A++ KEL    + KE  T EGL  + + +  DNK   +    ++K +++C +CH
Subjt:  EHSNSKLGDENEAFVLLNSLLDAYKEVKTTLKYGRE-NITIDVIIPAIRTKELEILSQKKE--TSEGLFVKSKPKGRDNK---HHPEERNKAKIRCNYCH

Query:  KEGHLKMDCYSLKRKNQNQRFKKNKPPKVVVGENSITYSNTLATSDQCSSDQSSFEKHDWVIDSGCSFHMTLSKDWFSTYREWDGGIVYMGNNNSCRVIG
        KEGH K DC   K+K      K   P +  V  +    +  L  +D+ SS        +W++DSGCSFHM  +K WF    + DGG V +GNN  C+V G
Subjt:  KEGHLKMDCYSLKRKNQNQRFKKNKPPKVVVGENSITYSNTLATSDQCSSDQSSFEKHDWVIDSGCSFHMTLSKDWFSTYREWDGGIVYMGNNNSCRVIG

Query:  IG
        IG
Subjt:  IG

TYK12279.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] (e value: 2.7e-28; identity: 36.04%)
Query:  SNSKLGDENEAFVLLNSLLDAYKEVKTTLKYGRENITIDVIIPAIRTKELEILSQKK--ETSEGLFVKSKPKGR-DNKHHPEERNKAKIRCNYCHKEGHL
        +  KLG   EA +L+NS+ D YKEVKT LKYGRE IT++ +I A+++KELE+ ++ K    +E LF K K   R +NK+    ++K  ++C  CHKEGH 
Subjt:  SNSKLGDENEAFVLLNSLLDAYKEVKTTLKYGRENITIDVIIPAIRTKELEILSQKK--ETSEGLFVKSKPKGR-DNKHHPEERNKAKIRCNYCHKEGHL

Query:  KMDC-----------------YSLKRKNQNQRFKKNKPPK--------VVVGENSITYSNTLATSDQCSSDQSSFEKHDWVIDSGCSFHMTLSKDWFSTY
        K +C                 Y  +  N+N+ +++    +          VG  +  Y+  L  +++ + +  + E+ DWV+DSGC++HMT  K+WF  Y
Subjt:  KMDC-----------------YSLKRKNQNQRFKKNKPPK--------VVVGENSITYSNTLATSDQCSSDQSSFEKHDWVIDSGCSFHMTLSKDWFSTY

Query:  REWDGGIVYMGNNNSCRVIGIG
        +  +G  VYMGNN    +IG+G
Subjt:  REWDGGIVYMGNNNSCRVIGIG

TYK27723.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa] (e value: 3.2e-29; identity: 35.87%)
Query:  SNSKLGDENEAFVLLNSLLDAYKEVKTTLKYGRENITIDVIIPAIRTKELEILSQKK--ETSEGLFVKSKPKGR--DNKHHPEERNKAKIRCNYCHKEGH
        +  KLG E+EA +L+N + D YKEVKT+LKYGRE IT++ +I A+++KELE+ ++ K    +E LF K     R   NK+    R+K  ++C  CHKEGH
Subjt:  SNSKLGDENEAFVLLNSLLDAYKEVKTTLKYGRENITIDVIIPAIRTKELEILSQKK--ETSEGLFVKSKPKGR--DNKHHPEERNKAKIRCNYCHKEGH

Query:  LKMDC-----------------YSLKRKNQNQRFKKNKPPK--------VVVGENSITYSNTLATSDQCSSDQSSFEKHDWVIDSGCSFHMTLSKDWFST
         K +C                 Y  +  N+N+ +++    +          VG  +  Y+  LA +++  + +   E+ DWV+DSGC+++MT  K+WF  
Subjt:  LKMDC-----------------YSLKRKNQNQRFKKNKPPK--------VVVGENSITYSNTLATSDQCSSDQSSFEKHDWVIDSGCSFHMTLSKDWFST

Query:  YREWDGGIVYMGNNNSCRVIGIG
        Y+  +G  V MGNN  C +IG+G
Subjt:  YREWDGGIVYMGNNNSCRVIGIG

TrEMBL top hits (e value, % identity, alignment)
A0A2P5AXR9 Zinc finger, CCHC-type (e value: 5.6e-27; identity: 38.12%)
Query:  EHSNSKLGDENEAFVLLNSLLDAYKEVKTTLKYGRE-NITIDVIIPAIRTKELEILSQKKE--TSEGLFVKSKPKGRDNK---HHPEERNKAKIRCNYCH
        E+   KL DE++A +LLN+L  AY+  K  + YGRE  IT+D +  A++ KEL    + KE  T EGL  + + +  DNK   +    ++K +++C +CH
Subjt:  EHSNSKLGDENEAFVLLNSLLDAYKEVKTTLKYGRE-NITIDVIIPAIRTKELEILSQKKE--TSEGLFVKSKPKGRDNK---HHPEERNKAKIRCNYCH

Query:  KEGHLKMDCYSLKRKNQNQRFKKNKPPKVVVGENSITYSNTLATSDQCSSDQSSFEKHDWVIDSGCSFHMTLSKDWFSTYREWDGGIVYMGNNNSCRVIG
        KEGH K DC   K+K      K   P +  V  +    +  L  +D+ SS        +W++DSGCSFHM  +K WF    + DGG V +GNN  C+V G
Subjt:  KEGHLKMDCYSLKRKNQNQRFKKNKPPKVVVGENSITYSNTLATSDQCSSDQSSFEKHDWVIDSGCSFHMTLSKDWFSTYREWDGGIVYMGNNNSCRVIG

Query:  IG
        IG
Subjt:  IG

A0A5A7TMB4 Pentatricopeptide repeat-containing protein (e value: 1.3e-28; identity: 36.04%)
Query:  SNSKLGDENEAFVLLNSLLDAYKEVKTTLKYGRENITIDVIIPAIRTKELEILSQKK--ETSEGLFVKSKPKGR-DNKHHPEERNKAKIRCNYCHKEGHL
        +  KLG   EA +L+NS+ D YKEVKT LKYGRE IT++ +I A+++KELE+ ++ K    +E LF K K   R +NK+    ++K  ++C  CHKEGH 
Subjt:  SNSKLGDENEAFVLLNSLLDAYKEVKTTLKYGRENITIDVIIPAIRTKELEILSQKK--ETSEGLFVKSKPKGR-DNKHHPEERNKAKIRCNYCHKEGHL

Query:  KMDC-----------------YSLKRKNQNQRFKKNKPPK--------VVVGENSITYSNTLATSDQCSSDQSSFEKHDWVIDSGCSFHMTLSKDWFSTY
        K +C                 Y  +  N+N+ +++    +          VG  +  Y+  L  +++ + +  + E+ DWV+DSGC++HMT  K+WF  Y
Subjt:  KMDC-----------------YSLKRKNQNQRFKKNKPPK--------VVVGENSITYSNTLATSDQCSSDQSSFEKHDWVIDSGCSFHMTLSKDWFSTY

Query:  REWDGGIVYMGNNNSCRVIGIG
        +  +G  VYMGNN    +IG+G
Subjt:  REWDGGIVYMGNNNSCRVIGIG

A0A5D3CPM8 Pentatricopeptide repeat-containing protein (e value: 1.3e-28; identity: 36.04%)
Query:  SNSKLGDENEAFVLLNSLLDAYKEVKTTLKYGRENITIDVIIPAIRTKELEILSQKK--ETSEGLFVKSKPKGR-DNKHHPEERNKAKIRCNYCHKEGHL
        +  KLG   EA +L+NS+ D YKEVKT LKYGRE IT++ +I A+++KELE+ ++ K    +E LF K K   R +NK+    ++K  ++C  CHKEGH 
Subjt:  SNSKLGDENEAFVLLNSLLDAYKEVKTTLKYGRENITIDVIIPAIRTKELEILSQKK--ETSEGLFVKSKPKGR-DNKHHPEERNKAKIRCNYCHKEGHL

Query:  KMDC-----------------YSLKRKNQNQRFKKNKPPK--------VVVGENSITYSNTLATSDQCSSDQSSFEKHDWVIDSGCSFHMTLSKDWFSTY
        K +C                 Y  +  N+N+ +++    +          VG  +  Y+  L  +++ + +  + E+ DWV+DSGC++HMT  K+WF  Y
Subjt:  KMDC-----------------YSLKRKNQNQRFKKNKPPK--------VVVGENSITYSNTLATSDQCSSDQSSFEKHDWVIDSGCSFHMTLSKDWFSTY

Query:  REWDGGIVYMGNNNSCRVIGIG
        +  +G  VYMGNN    +IG+G
Subjt:  REWDGGIVYMGNNNSCRVIGIG

A0A5D3DVM0 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 1.6e-29; identity: 35.87%)
Query:  SNSKLGDENEAFVLLNSLLDAYKEVKTTLKYGRENITIDVIIPAIRTKELEILSQKK--ETSEGLFVKSKPKGR--DNKHHPEERNKAKIRCNYCHKEGH
        +  KLG E+EA +L+N + D YKEVKT+LKYGRE IT++ +I A+++KELE+ ++ K    +E LF K     R   NK+    R+K  ++C  CHKEGH
Subjt:  SNSKLGDENEAFVLLNSLLDAYKEVKTTLKYGRENITIDVIIPAIRTKELEILSQKK--ETSEGLFVKSKPKGR--DNKHHPEERNKAKIRCNYCHKEGH

Query:  LKMDC-----------------YSLKRKNQNQRFKKNKPPK--------VVVGENSITYSNTLATSDQCSSDQSSFEKHDWVIDSGCSFHMTLSKDWFST
         K +C                 Y  +  N+N+ +++    +          VG  +  Y+  LA +++  + +   E+ DWV+DSGC+++MT  K+WF  
Subjt:  LKMDC-----------------YSLKRKNQNQRFKKNKPPK--------VVVGENSITYSNTLATSDQCSSDQSSFEKHDWVIDSGCSFHMTLSKDWFST

Query:  YREWDGGIVYMGNNNSCRVIGIG
        Y+  +G  V MGNN  C +IG+G
Subjt:  YREWDGGIVYMGNNNSCRVIGIG

G8DCX4 Putative Ty-1 copia retrotransposon (e value: 4.4e-24; identity: 34.74%)
Query:  ALAHKKTEHSNSKLGDENEAFVLLNSLLDAYKEVKTTLKYGRENITIDVIIPAIRTKEL---EILSQKKETSEGLFV-----KSKPKGRDNKHHPEERNK
        AL  K  + S+ K  DE+ A +LL+SL D ++ ++TTL +G+EN+++DV+  A+ + EL   + +  K  TSE   V     +S+ K R      + R  
Subjt:  ALAHKKTEHSNSKLGDENEAFVLLNSLLDAYKEVKTTLKYGRENITIDVIIPAIRTKEL---EILSQKKETSEGLFV-----KSKPKGRDNKHHPEERNK

Query:  AKIRCNYCHKEGHLKMDCYSLKRKNQNQRFKKNKPPKVVVGENSITYSNTLATSD--QCSSDQSSFEKHDWVIDSGCSFHMTLSKDWFSTYREWDGGIVY
        AK  C +CH++GH K DC  L++K +            V+ + ++    +   SD     S  +S    +W++DSGC++HM   +DWF  ++E DGG+VY
Subjt:  AKIRCNYCHKEGHLKMDCYSLKRKNQNQRFKKNKPPKVVVGENSITYSNTLATSD--QCSSDQSSFEKHDWVIDSGCSFHMTLSKDWFSTYREWDGGIVY

Query:  MGNNNSCRVIGIG
        MGN+N C+ +GIG
Subjt:  MGNNNSCRVIGIG

SwissProt top hits (e value, % identity, alignment)
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 8.3e-12; identity: 25.98%)
Query:  KLGDENEAFVLLNSLLDAYKEVKTTLKYGRENITIDVIIPAIRTKELEILSQKKETSEGLFVKSKPKGRDNKH------------HPEERNKAKIR-CNY
        K+ +E++A +LLNSL  +Y  + TT+ +G+  I +  +  A+   E     +KK  ++G  + ++ +GR  +               + R+K+++R C  
Subjt:  KLGDENEAFVLLNSLLDAYKEVKTTLKYGRENITIDVIIPAIRTKELEILSQKKETSEGLFVKSKPKGRDNKH------------HPEERNKAKIR-CNY

Query:  CHKEGHLKMDCYSLKRKNQNQRFKKNKPPKVVVGENSITYSNTLATSDQCSSDQSSFEKHDWVIDSGCSFHMTLSKDWFSTYREWDGGIVYMGNNNSCRV
        C++ GH K DC + ++       +KN      + +N+      +   ++C     S  + +WV+D+  S H T  +D F  Y   D G V MGN +  ++
Subjt:  CHKEGHLKMDCYSLKRKNQNQRFKKNKPPKVVVGENSITYSNTLATSDQCSSDQSSFEKHDWVIDSGCSFHMTLSKDWFSTYREWDGGIVYMGNNNSCRV

Query:  IGIG
         GIG
Subjt:  IGIG

Arabidopsis top hits (e value, % identity, alignment)
AT5G03180.1 RING/U-box superfamily protein (e value: 7.2e-11; identity: 69.57%)
Query:  ENDDANGEDISEEDVVCRICMVELCEGGETLKMECSCKGALALAHK
        ENDD  GED+ EE+ VCRICMVE+ E  E  KMEC CKG LALAHK
Subjt:  ENDDANGEDISEEDVVCRICMVELCEGGETLKMECSCKGALALAHK

AT5G60580.1 RING/U-box superfamily protein (e value: 2.6e-16; identity: 71.7%)
Query:  SSQSHGPENDDANGEDISEEDVVCRICMVELCEGGETLKMECSCKGALALAHK
        +S++   E  DA+GEDI E++ VCRIC+VELCEGGETLKMECSCKG LALAHK
Subjt:  SSQSHGPENDDANGEDISEEDVVCRICMVELCEGGETLKMECSCKGALALAHK

AT5G60580.2 RING/U-box superfamily protein (e value: 2.6e-16; identity: 71.7%)
Query:  SSQSHGPENDDANGEDISEEDVVCRICMVELCEGGETLKMECSCKGALALAHK
        +S++   E  DA+GEDI E++ VCRIC+VELCEGGETLKMECSCKG LALAHK
Subjt:  SSQSHGPENDDANGEDISEEDVVCRICMVELCEGGETLKMECSCKGALALAHK

AT5G60580.3 RING/U-box superfamily protein (e value: 2.6e-16; identity: 71.7%)
Query:  SSQSHGPENDDANGEDISEEDVVCRICMVELCEGGETLKMECSCKGALALAHK
        +S++   E  DA+GEDI E++ VCRIC+VELCEGGETLKMECSCKG LALAHK
Subjt:  SSQSHGPENDDANGEDISEEDVVCRICMVELCEGGETLKMECSCKGALALAHK

AT5G60580.4 RING/U-box superfamily protein (e value: 2.6e-16; identity: 71.7%)
Query:  SSQSHGPENDDANGEDISEEDVVCRICMVELCEGGETLKMECSCKGALALAHK
        +S++   E  DA+GEDI E++ VCRIC+VELCEGGETLKMECSCKG LALAHK
Subjt:  SSQSHGPENDDANGEDISEEDVVCRICMVELCEGGETLKMECSCKGALALAHK


Sequences
CDS sequence
ATGCTAGTCTCATCTCAATCTCATGGTCCAGAAAATGACGATGCTAATGGCGAAGACATCTCTGAAGAAGACGTCGTTTGTAGAATTTGCATGGTTGAACTGTGTGAAGG
AGGTGAGACTCTAAAGATGGAGTGTAGCTGCAAAGGTGCACTTGCTTTGGCTCACAAAAAAACAGAGCATTCGAACTCAAAGCTTGGTGATGAAAATGAGGCTTTTGTTC
TCCTAAACTCCTTGCTAGATGCCTATAAAGAAGTCAAGACGACCCTCAAATATGGAAGGGAGAATATCACGATTGATGTCATAATACCAGCTATTAGAACTAAAGAGCTG
GAGATATTGTCCCAAAAGAAGGAAACAAGTGAAGGCCTTTTTGTTAAAAGTAAGCCAAAAGGTCGAGACAACAAACATCATCCAGAAGAAAGGAATAAGGCTAAGATTCG
ATGTAATTACTGCCACAAGGAAGGCCACCTAAAAATGGACTGCTACTCTCTTAAAAGGAAGAATCAAAATCAGAGATTCAAGAAAAACAAGCCACCAAAAGTTGTTGTGG
GTGAAAATTCGATCACTTACTCGAATACTTTGGCTACTTCGGACCAGTGTAGCAGTGATCAATCATCATTTGAAAAACACGATTGGGTGATTGATTCAGGTTGTTCCTTC
CATATGACACTCTCTAAAGACTGGTTTAGCACATATCGAGAGTGGGATGGAGGAATAGTCTACATGGGGAACAACAACTCTTGTAGAGTTATTGGAATAGGATAA
mRNA sequence
ATGCTAGTCTCATCTCAATCTCATGGTCCAGAAAATGACGATGCTAATGGCGAAGACATCTCTGAAGAAGACGTCGTTTGTAGAATTTGCATGGTTGAACTGTGTGAAGG
AGGTGAGACTCTAAAGATGGAGTGTAGCTGCAAAGGTGCACTTGCTTTGGCTCACAAAAAAACAGAGCATTCGAACTCAAAGCTTGGTGATGAAAATGAGGCTTTTGTTC
TCCTAAACTCCTTGCTAGATGCCTATAAAGAAGTCAAGACGACCCTCAAATATGGAAGGGAGAATATCACGATTGATGTCATAATACCAGCTATTAGAACTAAAGAGCTG
GAGATATTGTCCCAAAAGAAGGAAACAAGTGAAGGCCTTTTTGTTAAAAGTAAGCCAAAAGGTCGAGACAACAAACATCATCCAGAAGAAAGGAATAAGGCTAAGATTCG
ATGTAATTACTGCCACAAGGAAGGCCACCTAAAAATGGACTGCTACTCTCTTAAAAGGAAGAATCAAAATCAGAGATTCAAGAAAAACAAGCCACCAAAAGTTGTTGTGG
GTGAAAATTCGATCACTTACTCGAATACTTTGGCTACTTCGGACCAGTGTAGCAGTGATCAATCATCATTTGAAAAACACGATTGGGTGATTGATTCAGGTTGTTCCTTC
CATATGACACTCTCTAAAGACTGGTTTAGCACATATCGAGAGTGGGATGGAGGAATAGTCTACATGGGGAACAACAACTCTTGTAGAGTTATTGGAATAGGATAA
Protein sequence
MLVSSQSHGPENDDANGEDISEEDVVCRICMVELCEGGETLKMECSCKGALALAHKKTEHSNSKLGDENEAFVLLNSLLDAYKEVKTTLKYGRENITIDVIIPAIRTKEL
EILSQKKETSEGLFVKSKPKGRDNKHHPEERNKAKIRCNYCHKEGHLKMDCYSLKRKNQNQRFKKNKPPKVVVGENSITYSNTLATSDQCSSDQSSFEKHDWVIDSGCSF
HMTLSKDWFSTYREWDGGIVYMGNNNSCRVIGIG
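
The CDS and protein sequences above can be cross-checked by translating the CDS codon by codon. The sketch below is one way to do that, assuming the standard nuclear genetic code applies to this gene. The names `CODON_TABLE` and `translate` are hypothetical and are not part of any CuGenDB tooling.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS into a protein string, dropping one trailing stop.

    Whitespace and line breaks are removed first, so the wrapped
    sequence can be pasted in directly from the record.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    if protein.endswith("*"):
        protein = protein[:-1]
    return protein


# First 33 nucleotides of the CDS listed above.
print(translate("ATGCTAGTCTCATCTCAATCTCATGGTCCAGAA"))
# → MLVSSQSHGPE
```

Passing the full CDS and comparing the result with the listed protein sequence is a quick consistency check on the record. A codon containing a character other than A, C, G or T would raise a `KeyError` in this sketch.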