; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0039492 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0039492
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr2:44818314..44818715
RNA-Seq ExpressionLag0039492
SyntenyLag0039492
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0043826.1 putative gag-pol polyprotein [Cucumis melo var. makuwa]1.9e-2042.97Show/hide
Query:  MDVGKSLTENLDEFKKMTSEFRNLGEKIGDDNKPFVLLNSLPDAYKEVKTALKYGRESITTYAIISAIRTKELKLLSIKKEA----TKGLFVKGTTRNKE
        MD  KSL ENLDEF+K+  +  N+GEK+ D+N+  +LLNSLP+ Y+EVK A+KYGR+S+T   ++ A++T+ L++   +K+      +G   K + + KE
Subjt:  MDVGKSLTENLDEFKKMTSEFRNLGEKIGDDNKPFVLLNSLPDAYKEVKTALKYGRESITTYAIISAIRTKELKLLSIKKEA----TKGLFVKGTTRNKE

Query:  GKNQSDDKGNTKVKCNYCYKEGHIKREC
           +S  KG ++ KC  C+KEGH K+ C
Subjt:  GKNQSDDKGNTKVKCNYCYKEGHIKREC

KAA0065687.1 retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]2.9e-2161.96Show/hide
Query:  MDVGKSLTENLDEFKKMTSEFRNLGEKIGDDNKPFVLLNSLPDAYKEVKTALKYGRESITTYAIISAIRTKELKLLSIKKEAT--KGLFVKG
        MD  KSL ENL EFKK++S+F+ LG+KIGD+N+ F+LLNSLP+AYKEVK AL+YGR+ ITT  +ISAIRT+EL L S +++ +  +GLF KG
Subjt:  MDVGKSLTENLDEFKKMTSEFRNLGEKIGDDNKPFVLLNSLPDAYKEVKTALKYGRESITTYAIISAIRTKELKLLSIKKEAT--KGLFVKG

TYK27723.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]1.4e-2045.31Show/hide
Query:  MDVGKSLTENLDEFKKMTSEFRNLGEKIGDDNKPFVLLNSLPDAYKEVKTALKYGRESITTYAIISAIRTKELKLLSIKK--EATKGLFVKGTT--RNKE
        M+  K+L ENLDEFKK+T+      EK+G +++  +L+N + D YKEVKT+LKYGRE+IT  ++I+A+++KEL+L +  K   A + LF KG    R   
Subjt:  MDVGKSLTENLDEFKKMTSEFRNLGEKIGDDNKPFVLLNSLPDAYKEVKTALKYGRESITTYAIISAIRTKELKLLSIKK--EATKGLFVKGTT--RNKE

Query:  GKNQSDDKGNTKVKCNYCYKEGHIKREC
         KNQ   +    +KC  C+KEGH KR C
Subjt:  GKNQSDDKGNTKVKCNYCYKEGHIKREC

XP_022152111.1 uncharacterized protein LOC111019900 [Momordica charantia]1.4e-2069.33Show/hide
Query:  MDVGKSLTENLDEFKKMTSEFRNLGEKIGDDNKPFVLLNSLPDAYKEVKTALKYGRESITTYAIISAIRTKELKL
        MD  K+L++NLD+FKK++SEF +LGEKIG +N+ F+LLNSLP++Y+EVK ALKYGRESITT AIISA++TKEL+L
Subjt:  MDVGKSLTENLDEFKKMTSEFRNLGEKIGDDNKPFVLLNSLPDAYKEVKTALKYGRESITTYAIISAIRTKELKL

XP_038885928.1 uncharacterized protein LOC120076236 [Benincasa hispida]1.1e-2348.48Show/hide
Query:  MDVGKSLTENLDEFKKMTSEFRNLGEKIGDDNKPFVLLNSLPDAYKEVKTALKYGRESITTYAIISAIRTKELKLLSIKKEATK--GLFVKGTTRNKEGK
        MD  KSLT+NL+EFK ++S+FR++G+ IG++N+ F+LLNSLP+ +K+VKTALKYGRE ITTYAIISA+  KEL+L   KK+  +  G F KG  +     
Subjt:  MDVGKSLTENLDEFKKMTSEFRNLGEKIGDDNKPFVLLNSLPDAYKEVKTALKYGRESITTYAIISAIRTKELKLLSIKKEATK--GLFVKGTTRNKEGK

Query:  NQSDDKGNTKVKCNYCYKEGHIKRECYSLKRK
        N++            C ++  +K++CY+LKRK
Subjt:  NQSDDKGNTKVKCNYCYKEGHIKRECYSLKRK

TrEMBL top hitse value%identityAlignment
A0A5A7UB25 Putative gag-pol polyprotein9.1e-2142.97Show/hide
Query:  MDVGKSLTENLDEFKKMTSEFRNLGEKIGDDNKPFVLLNSLPDAYKEVKTALKYGRESITTYAIISAIRTKELKLLSIKKEA----TKGLFVKGTTRNKE
        MD  KSL ENLDEF+K+  +  N+GEK+ D+N+  +LLNSLP+ Y+EVK A+KYGR+S+T   ++ A++T+ L++   +K+      +G   K + + KE
Subjt:  MDVGKSLTENLDEFKKMTSEFRNLGEKIGDDNKPFVLLNSLPDAYKEVKTALKYGRESITTYAIISAIRTKELKLLSIKKEA----TKGLFVKGTTRNKE

Query:  GKNQSDDKGNTKVKCNYCYKEGHIKREC
           +S  KG ++ KC  C+KEGH K+ C
Subjt:  GKNQSDDKGNTKVKCNYCYKEGHIKREC

A0A5D3CAI4 Retrovirus-related Pol polyprotein from transposon TNT 1-941.4e-2161.96Show/hide
Query:  MDVGKSLTENLDEFKKMTSEFRNLGEKIGDDNKPFVLLNSLPDAYKEVKTALKYGRESITTYAIISAIRTKELKLLSIKKEAT--KGLFVKG
        MD  KSL ENL EFKK++S+F+ LG+KIGD+N+ F+LLNSLP+AYKEVK AL+YGR+ ITT  +ISAIRT+EL L S +++ +  +GLF KG
Subjt:  MDVGKSLTENLDEFKKMTSEFRNLGEKIGDDNKPFVLLNSLPDAYKEVKTALKYGRESITTYAIISAIRTKELKLLSIKKEAT--KGLFVKG

A0A5D3DNU1 Putative gag-pol polyprotein9.1e-2142.97Show/hide
Query:  MDVGKSLTENLDEFKKMTSEFRNLGEKIGDDNKPFVLLNSLPDAYKEVKTALKYGRESITTYAIISAIRTKELKLLSIKKEA----TKGLFVKGTTRNKE
        MD  KSL ENLDEF+K+  +  N+GEK+ D+N+  +LLNSLP+ Y+EVK A+KYGR+S+T   ++ A++T+ L++   +K+      +G   K + + KE
Subjt:  MDVGKSLTENLDEFKKMTSEFRNLGEKIGDDNKPFVLLNSLPDAYKEVKTALKYGRESITTYAIISAIRTKELKLLSIKKEA----TKGLFVKGTTRNKE

Query:  GKNQSDDKGNTKVKCNYCYKEGHIKREC
           +S  KG ++ KC  C+KEGH K+ C
Subjt:  GKNQSDDKGNTKVKCNYCYKEGHIKREC

A0A5D3DVM0 Retrovirus-related Pol polyprotein from transposon TNT 1-946.9e-2145.31Show/hide
Query:  MDVGKSLTENLDEFKKMTSEFRNLGEKIGDDNKPFVLLNSLPDAYKEVKTALKYGRESITTYAIISAIRTKELKLLSIKK--EATKGLFVKGTT--RNKE
        M+  K+L ENLDEFKK+T+      EK+G +++  +L+N + D YKEVKT+LKYGRE+IT  ++I+A+++KEL+L +  K   A + LF KG    R   
Subjt:  MDVGKSLTENLDEFKKMTSEFRNLGEKIGDDNKPFVLLNSLPDAYKEVKTALKYGRESITTYAIISAIRTKELKLLSIKK--EATKGLFVKGTT--RNKE

Query:  GKNQSDDKGNTKVKCNYCYKEGHIKREC
         KNQ   +    +KC  C+KEGH KR C
Subjt:  GKNQSDDKGNTKVKCNYCYKEGHIKREC

A0A6J1DGM8 uncharacterized protein LOC1110199006.9e-2169.33Show/hide
Query:  MDVGKSLTENLDEFKKMTSEFRNLGEKIGDDNKPFVLLNSLPDAYKEVKTALKYGRESITTYAIISAIRTKELKL
        MD  K+L++NLD+FKK++SEF +LGEKIG +N+ F+LLNSLP++Y+EVK ALKYGRESITT AIISA++TKEL+L
Subjt:  MDVGKSLTENLDEFKKMTSEFRNLGEKIGDDNKPFVLLNSLPDAYKEVKTALKYGRESITTYAIISAIRTKELKL

SwissProt top hitse value%identityAlignment
P04146 Copia protein3.6e-0628.26Show/hide
Query:  SLTENLDEFKKMTSEFRNLGEKIGDDNKPFVLLNSLPDAYKEVKTALK-YGRESITTYAIISAIRTKELKLLSIKKEATK-----------GLFVKGTTR
        SL  +   F ++ SE    G KI + +K   LL +LP  Y  + TA++    E++T   + + +  +E+K+ +   + +K             +     +
Subjt:  SLTENLDEFKKMTSEFRNLGEKIGDDNKPFVLLNSLPDAYKEVKTALK-YGRESITTYAIISAIRTKELKLLSIKKEATK-----------GLFVKGTTR

Query:  NKEGKNQSDDKGNT--KVKCNYCYKEGHIKRECYSLKR
        N+  K +   KGN+  KVKC++C +EGHIK++C+  KR
Subjt:  NKEGKNQSDDKGNT--KVKCNYCYKEGHIKRECYSLKR

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-943.2e-0726.67Show/hide
Query:  MDVGKSLTENLDEFKKMTSEFRNLGEKIGDDNKPFVLLNSLPDAYKEVKTALKYGRESITTYAIISAIRTKELKLLSIKKEATKGLFVKGTTRNKE----
        M  G +   +L+ F  + ++  NLG KI +++K  +LLNSLP +Y  + T + +G+ +I    + SA+   E K+    +   + L  +G  R+ +    
Subjt:  MDVGKSLTENLDEFKKMTSEFRNLGEKIGDDNKPFVLLNSLPDAYKEVKTALKYGRESITTYAIISAIRTKELKLLSIKKEATKGLFVKGTTRNKE----

Query:  ------GKNQSDDKGNTKVK-CNYCYKEGHIKREC
               + +S ++  ++V+ C  C + GH KR+C
Subjt:  ------GKNQSDDKGNTKVK-CNYCYKEGHIKREC

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATGTTGGAAAGTCACTAACCGAGAATCTTGATGAGTTCAAAAAGATGACATCCGAGTTCAGGAATCTTGGAGAGAAGATAGGGGATGATAACAAGCCTTTTGTTCT
ATTAAATTCACTTCCTGATGCTTATAAGGAAGTGAAGACTGCCCTCAAATATGGAAGAGAGTCCATTACTACTTATGCTATTATATCAGCTATCAGAACCAAGGAATTGA
AGTTGTTGTCTATAAAGAAAGAGGCTACTAAGGGTTTGTTTGTCAAAGGTACAACCAGAAATAAAGAGGGGAAAAATCAGTCAGATGATAAGGGCAATACCAAGGTTAAG
TGCAACTACTGTTATAAGGAGGGGCATATCAAGAGGGAATGTTATTCTTTAAAAAGGAAGAACCAAAACTAG
mRNA sequenceShow/hide mRNA sequence
ATGGATGTTGGAAAGTCACTAACCGAGAATCTTGATGAGTTCAAAAAGATGACATCCGAGTTCAGGAATCTTGGAGAGAAGATAGGGGATGATAACAAGCCTTTTGTTCT
ATTAAATTCACTTCCTGATGCTTATAAGGAAGTGAAGACTGCCCTCAAATATGGAAGAGAGTCCATTACTACTTATGCTATTATATCAGCTATCAGAACCAAGGAATTGA
AGTTGTTGTCTATAAAGAAAGAGGCTACTAAGGGTTTGTTTGTCAAAGGTACAACCAGAAATAAAGAGGGGAAAAATCAGTCAGATGATAAGGGCAATACCAAGGTTAAG
TGCAACTACTGTTATAAGGAGGGGCATATCAAGAGGGAATGTTATTCTTTAAAAAGGAAGAACCAAAACTAG
Protein sequenceShow/hide protein sequence
MDVGKSLTENLDEFKKMTSEFRNLGEKIGDDNKPFVLLNSLPDAYKEVKTALKYGRESITTYAIISAIRTKELKLLSIKKEATKGLFVKGTTRNKEGKNQSDDKGNTKVK
CNYCYKEGHIKRECYSLKRKNQN