; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0005745 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0005745
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr6:27989246..27991391
RNA-Seq ExpressionLag0005745
SyntenyLag0005745
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TKR86343.1 hypothetical protein D5086_0000239010 [Populus alba]3.3e-5440.84Show/hide
Query:  MCLSMDVASLVAHETTAVKLMEALTNRYEKPSANNKVYLVKKFFNMQMSEDASVNSYNNEVTTLINQLKSVKIEFSDEVNVIQLLTSLPDSWETMKTTVS
        + LS  VA  V  E +  KLMEAL+  YEKPSANNKV+L+KK FN++M E+ SV  + N   T+ NQL SV+IEF DE+  + LL SLP SWE M+T VS
Subjt:  MCLSMDVASLVAHETTAVKLMEALTNRYEKPSANNKVYLVKKFFNMQMSEDASVNSYNNEVTTLINQLKSVKIEFSDEVNVIQLLTSLPDSWETMKTTVS

Query:  NSTGNNTLKFSEVCDLAIAGEIRRQGSNKESTVGSAL-VMTKGK--DKVDEDNEPSS---SRKKWKSRNEVECYYCHKKGHFKYQCRKFKEDQKRKREAN
        NS G + LK+ ++ DL +A E+RR+ S + S+ GSAL + T+G+  D+        S   S+ K+ SR +VEC+ C K GHF   C K K+ +     A 
Subjt:  NSTGNNTLKFSEVCDLAIAGEIRRQGSNKESTVGSAL-VMTKGK--DKVDEDNEPSS---SRKKWKSRNEVECYYCHKKGHFKYQCRKFKEDQKRKREAN

Query:  I--VEEVVLACVESDTKYSNHSSDWILESAASVHIALDRSLFTSFTGGHHGLVRMGKGRTSKTRGIEDVSLKTECGGKLVLRDVRYVPNIKMNLISIGKL
           V++ ++  V S         +WIL+S AS H      +  ++ GG HG+V +  G      GI DV +KT  G    L++VR+VP +K  LIS+G+L
Subjt:  I--VEEVVLACVESDTKYSNHSSDWILESAASVHIALDRSLFTSFTGGHHGLVRMGKGRTSKTRGIEDVSLKTECGGKLVLRDVRYVPNIKMNLISIGKL

Query:  ADDGYMYEFGSRQCKLKFESQVVAVGHRKSTLY
         D G+   F     K+   + V+A G +  TLY
Subjt:  ADDGYMYEFGSRQCKLKFESQVVAVGHRKSTLY

TKR89927.1 hypothetical protein D5086_0000238200 [Populus alba]2.2e-5340.36Show/hide
Query:  MCLSMDVASLVAHETTAVKLMEALTNRYEKPSANNKVYLVKKFFNMQMSEDASVNSYNNEVTTLINQLKSVKIEFSDEVNVIQLLTSLPDSWETMKTTVS
        + LS  VA  V  E +  KLMEAL+  YEKPSANNKV+L+KK FN++M E+ SV  + N   T+ NQL SV+IEF DE+  + LL SLP SWE M+T VS
Subjt:  MCLSMDVASLVAHETTAVKLMEALTNRYEKPSANNKVYLVKKFFNMQMSEDASVNSYNNEVTTLINQLKSVKIEFSDEVNVIQLLTSLPDSWETMKTTVS

Query:  NSTGNNTLKFSEVCDLAIAGEIRRQGSNKESTVGSAL-VMTKGK--DKVDEDNEPSS---SRKKWKSRNEVECYYCHKKGHFKYQCRKFKEDQKRKREAN
        NS G + LK+ ++ DL +A E+RR+ S + S+ GSAL + T+G+  D+        S   S+ K+ SR +VEC+ C K GHF   C K K+ +     A 
Subjt:  NSTGNNTLKFSEVCDLAIAGEIRRQGSNKESTVGSAL-VMTKGK--DKVDEDNEPSS---SRKKWKSRNEVECYYCHKKGHFKYQCRKFKEDQKRKREAN

Query:  I--VEEVVLACVESDTKYSNHSSDWILESAASVHIALDRSLFTSFTGGHHGLVRMGKGRTSKTRGIEDVSLKTECGGKLVLRDVRYVPNIKMNLISIGKL
           V++ ++  V+S         +WIL+S AS H      +  ++ GG HG+V +  G      GI DV +KT  G    L+++R+VP +K  LIS+G+L
Subjt:  I--VEEVVLACVESDTKYSNHSSDWILESAASVHIALDRSLFTSFTGGHHGLVRMGKGRTSKTRGIEDVSLKTECGGKLVLRDVRYVPNIKMNLISIGKL

Query:  ADDGYMYEFGSRQCKLKFESQVVAVGHRKSTL
         D G+   F     K+   + V+A G +  TL
Subjt:  ADDGYMYEFGSRQCKLKFESQVVAVGHRKSTL

TKS02608.1 hypothetical protein D5086_0000161380 [Populus alba]5.7e-5440.96Show/hide
Query:  MCLSMDVASLVAHETTAVKLMEALTNRYEKPSANNKVYLVKKFFNMQMSEDASVNSYNNEVTTLINQLKSVKIEFSDEVNVIQLLTSLPDSWETMKTTVS
        + LS  VA  V  E +  KLMEAL+  YEKPSANNKV+L+KK FN++M E+ SV  + N   T+ NQL SV+IEF DE+  + LL SLP SWE M+T VS
Subjt:  MCLSMDVASLVAHETTAVKLMEALTNRYEKPSANNKVYLVKKFFNMQMSEDASVNSYNNEVTTLINQLKSVKIEFSDEVNVIQLLTSLPDSWETMKTTVS

Query:  NSTGNNTLKFSEVCDLAIAGEIRRQGSNKESTVGSAL-VMTKGK--DKVDEDNEPSS---SRKKWKSRNEVECYYCHKKGHFKYQCRKFKEDQKRKREAN
        NS G + LK+ ++ DL +A E+RR+ S + S+ GSAL + T+G+  D+        S   S+ K+ SR +VEC+ C K GHF   C K K+ +     A 
Subjt:  NSTGNNTLKFSEVCDLAIAGEIRRQGSNKESTVGSAL-VMTKGK--DKVDEDNEPSS---SRKKWKSRNEVECYYCHKKGHFKYQCRKFKEDQKRKREAN

Query:  I--VEEVVLACVESDTKYSNHSSDWILESAASVHIALDRSLFTSFTGGHHGLVRMGKGRTSKTRGIEDVSLKTECGGKLVLRDVRYVPNIKMNLISIGKL
           V++ ++  V+S         +WIL+S AS H      +  ++ GG HG+V +  G   K  GI DV +KT  G    L++VR+VP +K  LIS+G+L
Subjt:  I--VEEVVLACVESDTKYSNHSSDWILESAASVHIALDRSLFTSFTGGHHGLVRMGKGRTSKTRGIEDVSLKTECGGKLVLRDVRYVPNIKMNLISIGKL

Query:  ADDGYMYEFGSRQCKLKFESQVVAVGHRKSTL
         D G+   F     K+   + V+A G +  TL
Subjt:  ADDGYMYEFGSRQCKLKFESQVVAVGHRKSTL

TKS09800.1 hypothetical protein D5086_0000089010 [Populus alba]4.3e-5440.96Show/hide
Query:  MCLSMDVASLVAHETTAVKLMEALTNRYEKPSANNKVYLVKKFFNMQMSEDASVNSYNNEVTTLINQLKSVKIEFSDEVNVIQLLTSLPDSWETMKTTVS
        + LS  VA  V  E +  KLMEAL+  YEKPSANNKV+L+KK FN++M E+ SV  + N   T+ NQL SV+IEF DE+  + LL SLP SWE M+T VS
Subjt:  MCLSMDVASLVAHETTAVKLMEALTNRYEKPSANNKVYLVKKFFNMQMSEDASVNSYNNEVTTLINQLKSVKIEFSDEVNVIQLLTSLPDSWETMKTTVS

Query:  NSTGNNTLKFSEVCDLAIAGEIRRQGSNKESTVGSAL-VMTKGK--DKVDEDNEPSS---SRKKWKSRNEVECYYCHKKGHFKYQCRKFKEDQKRKREAN
        NS G + LK+ ++ DL +A E+RR+ S + S+ GSAL + T+G+  D+        S   S+ K+ SR +VEC+ C K GHF   C K K+ +     A 
Subjt:  NSTGNNTLKFSEVCDLAIAGEIRRQGSNKESTVGSAL-VMTKGK--DKVDEDNEPSS---SRKKWKSRNEVECYYCHKKGHFKYQCRKFKEDQKRKREAN

Query:  I--VEEVVLACVESDTKYSNHSSDWILESAASVHIALDRSLFTSFTGGHHGLVRMGKGRTSKTRGIEDVSLKTECGGKLVLRDVRYVPNIKMNLISIGKL
           V++ ++  V+S         +WIL+S AS H      +  ++ GG HG+V +  G   K  GI DV +KT  G    L++VR+VP +K  LIS+G+L
Subjt:  I--VEEVVLACVESDTKYSNHSSDWILESAASVHIALDRSLFTSFTGGHHGLVRMGKGRTSKTRGIEDVSLKTECGGKLVLRDVRYVPNIKMNLISIGKL

Query:  ADDGYMYEFGSRQCKLKFESQVVAVGHRKSTL
         D G+   F     K+   + V+A G +  TL
Subjt:  ADDGYMYEFGSRQCKLKFESQVVAVGHRKSTL

TKS13843.1 hypothetical protein D5086_0000049350 [Populus alba]9.7e-5440.66Show/hide
Query:  MCLSMDVASLVAHETTAVKLMEALTNRYEKPSANNKVYLVKKFFNMQMSEDASVNSYNNEVTTLINQLKSVKIEFSDEVNVIQLLTSLPDSWETMKTTVS
        + LS  VA  V  E +  KLMEAL+  YEKPSANNKV+L+KK FN++M E+ SV  + N   T+ NQL SV+IEF DE+  + LL SLP SWE M+T VS
Subjt:  MCLSMDVASLVAHETTAVKLMEALTNRYEKPSANNKVYLVKKFFNMQMSEDASVNSYNNEVTTLINQLKSVKIEFSDEVNVIQLLTSLPDSWETMKTTVS

Query:  NSTGNNTLKFSEVCDLAIAGEIRRQGSNKESTVGSAL-VMTKGK--DKVDEDNEPSS---SRKKWKSRNEVECYYCHKKGHFKYQCRKFKEDQKRKREAN
        NS G + LK+ ++ DL +A E+RR+ S + S+ GSAL + T+G+  D+        S   S+ K+ SR +VEC+ C K GHF   C K K+ +     A 
Subjt:  NSTGNNTLKFSEVCDLAIAGEIRRQGSNKESTVGSAL-VMTKGK--DKVDEDNEPSS---SRKKWKSRNEVECYYCHKKGHFKYQCRKFKEDQKRKREAN

Query:  I--VEEVVLACVESDTKYSNHSSDWILESAASVHIALDRSLFTSFTGGHHGLVRMGKGRTSKTRGIEDVSLKTECGGKLVLRDVRYVPNIKMNLISIGKL
           V++ ++  V+S         +WIL+S AS H      +  ++ GG HG++ +  G      GI DV +KT  G    L++VR+VP +K  LIS+G+L
Subjt:  I--VEEVVLACVESDTKYSNHSSDWILESAASVHIALDRSLFTSFTGGHHGLVRMGKGRTSKTRGIEDVSLKTECGGKLVLRDVRYVPNIKMNLISIGKL

Query:  ADDGYMYEFGSRQCKLKFESQVVAVGHRKSTL
         D GY   F     K+   + V+A G +  TL
Subjt:  ADDGYMYEFGSRQCKLKFESQVVAVGHRKSTL

TrEMBL top hitse value%identityAlignment
A0A4U5NTA0 CCHC-type domain-containing protein1.6e-5440.84Show/hide
Query:  MCLSMDVASLVAHETTAVKLMEALTNRYEKPSANNKVYLVKKFFNMQMSEDASVNSYNNEVTTLINQLKSVKIEFSDEVNVIQLLTSLPDSWETMKTTVS
        + LS  VA  V  E +  KLMEAL+  YEKPSANNKV+L+KK FN++M E+ SV  + N   T+ NQL SV+IEF DE+  + LL SLP SWE M+T VS
Subjt:  MCLSMDVASLVAHETTAVKLMEALTNRYEKPSANNKVYLVKKFFNMQMSEDASVNSYNNEVTTLINQLKSVKIEFSDEVNVIQLLTSLPDSWETMKTTVS

Query:  NSTGNNTLKFSEVCDLAIAGEIRRQGSNKESTVGSAL-VMTKGK--DKVDEDNEPSS---SRKKWKSRNEVECYYCHKKGHFKYQCRKFKEDQKRKREAN
        NS G + LK+ ++ DL +A E+RR+ S + S+ GSAL + T+G+  D+        S   S+ K+ SR +VEC+ C K GHF   C K K+ +     A 
Subjt:  NSTGNNTLKFSEVCDLAIAGEIRRQGSNKESTVGSAL-VMTKGK--DKVDEDNEPSS---SRKKWKSRNEVECYYCHKKGHFKYQCRKFKEDQKRKREAN

Query:  I--VEEVVLACVESDTKYSNHSSDWILESAASVHIALDRSLFTSFTGGHHGLVRMGKGRTSKTRGIEDVSLKTECGGKLVLRDVRYVPNIKMNLISIGKL
           V++ ++  V S         +WIL+S AS H      +  ++ GG HG+V +  G      GI DV +KT  G    L++VR+VP +K  LIS+G+L
Subjt:  I--VEEVVLACVESDTKYSNHSSDWILESAASVHIALDRSLFTSFTGGHHGLVRMGKGRTSKTRGIEDVSLKTECGGKLVLRDVRYVPNIKMNLISIGKL

Query:  ADDGYMYEFGSRQCKLKFESQVVAVGHRKSTLY
         D G+   F     K+   + V+A G +  TLY
Subjt:  ADDGYMYEFGSRQCKLKFESQVVAVGHRKSTLY

A0A4U5P1P0 CCHC-type domain-containing protein1.0e-5340.36Show/hide
Query:  MCLSMDVASLVAHETTAVKLMEALTNRYEKPSANNKVYLVKKFFNMQMSEDASVNSYNNEVTTLINQLKSVKIEFSDEVNVIQLLTSLPDSWETMKTTVS
        + LS  VA  V  E +  KLMEAL+  YEKPSANNKV+L+KK FN++M E+ SV  + N   T+ NQL SV+IEF DE+  + LL SLP SWE M+T VS
Subjt:  MCLSMDVASLVAHETTAVKLMEALTNRYEKPSANNKVYLVKKFFNMQMSEDASVNSYNNEVTTLINQLKSVKIEFSDEVNVIQLLTSLPDSWETMKTTVS

Query:  NSTGNNTLKFSEVCDLAIAGEIRRQGSNKESTVGSAL-VMTKGK--DKVDEDNEPSS---SRKKWKSRNEVECYYCHKKGHFKYQCRKFKEDQKRKREAN
        NS G + LK+ ++ DL +A E+RR+ S + S+ GSAL + T+G+  D+        S   S+ K+ SR +VEC+ C K GHF   C K K+ +     A 
Subjt:  NSTGNNTLKFSEVCDLAIAGEIRRQGSNKESTVGSAL-VMTKGK--DKVDEDNEPSS---SRKKWKSRNEVECYYCHKKGHFKYQCRKFKEDQKRKREAN

Query:  I--VEEVVLACVESDTKYSNHSSDWILESAASVHIALDRSLFTSFTGGHHGLVRMGKGRTSKTRGIEDVSLKTECGGKLVLRDVRYVPNIKMNLISIGKL
           V++ ++  V+S         +WIL+S AS H      +  ++ GG HG+V +  G      GI DV +KT  G    L+++R+VP +K  LIS+G+L
Subjt:  I--VEEVVLACVESDTKYSNHSSDWILESAASVHIALDRSLFTSFTGGHHGLVRMGKGRTSKTRGIEDVSLKTECGGKLVLRDVRYVPNIKMNLISIGKL

Query:  ADDGYMYEFGSRQCKLKFESQVVAVGHRKSTL
         D G+   F     K+   + V+A G +  TL
Subjt:  ADDGYMYEFGSRQCKLKFESQVVAVGHRKSTL

A0A4U5PY83 CCHC-type domain-containing protein2.7e-5440.96Show/hide
Query:  MCLSMDVASLVAHETTAVKLMEALTNRYEKPSANNKVYLVKKFFNMQMSEDASVNSYNNEVTTLINQLKSVKIEFSDEVNVIQLLTSLPDSWETMKTTVS
        + LS  VA  V  E +  KLMEAL+  YEKPSANNKV+L+KK FN++M E+ SV  + N   T+ NQL SV+IEF DE+  + LL SLP SWE M+T VS
Subjt:  MCLSMDVASLVAHETTAVKLMEALTNRYEKPSANNKVYLVKKFFNMQMSEDASVNSYNNEVTTLINQLKSVKIEFSDEVNVIQLLTSLPDSWETMKTTVS

Query:  NSTGNNTLKFSEVCDLAIAGEIRRQGSNKESTVGSAL-VMTKGK--DKVDEDNEPSS---SRKKWKSRNEVECYYCHKKGHFKYQCRKFKEDQKRKREAN
        NS G + LK+ ++ DL +A E+RR+ S + S+ GSAL + T+G+  D+        S   S+ K+ SR +VEC+ C K GHF   C K K+ +     A 
Subjt:  NSTGNNTLKFSEVCDLAIAGEIRRQGSNKESTVGSAL-VMTKGK--DKVDEDNEPSS---SRKKWKSRNEVECYYCHKKGHFKYQCRKFKEDQKRKREAN

Query:  I--VEEVVLACVESDTKYSNHSSDWILESAASVHIALDRSLFTSFTGGHHGLVRMGKGRTSKTRGIEDVSLKTECGGKLVLRDVRYVPNIKMNLISIGKL
           V++ ++  V+S         +WIL+S AS H      +  ++ GG HG+V +  G   K  GI DV +KT  G    L++VR+VP +K  LIS+G+L
Subjt:  I--VEEVVLACVESDTKYSNHSSDWILESAASVHIALDRSLFTSFTGGHHGLVRMGKGRTSKTRGIEDVSLKTECGGKLVLRDVRYVPNIKMNLISIGKL

Query:  ADDGYMYEFGSRQCKLKFESQVVAVGHRKSTL
         D G+   F     K+   + V+A G +  TL
Subjt:  ADDGYMYEFGSRQCKLKFESQVVAVGHRKSTL

A0A4U5QGR0 Uncharacterized protein2.1e-5440.96Show/hide
Query:  MCLSMDVASLVAHETTAVKLMEALTNRYEKPSANNKVYLVKKFFNMQMSEDASVNSYNNEVTTLINQLKSVKIEFSDEVNVIQLLTSLPDSWETMKTTVS
        + LS  VA  V  E +  KLMEAL+  YEKPSANNKV+L+KK FN++M E+ SV  + N   T+ NQL SV+IEF DE+  + LL SLP SWE M+T VS
Subjt:  MCLSMDVASLVAHETTAVKLMEALTNRYEKPSANNKVYLVKKFFNMQMSEDASVNSYNNEVTTLINQLKSVKIEFSDEVNVIQLLTSLPDSWETMKTTVS

Query:  NSTGNNTLKFSEVCDLAIAGEIRRQGSNKESTVGSAL-VMTKGK--DKVDEDNEPSS---SRKKWKSRNEVECYYCHKKGHFKYQCRKFKEDQKRKREAN
        NS G + LK+ ++ DL +A E+RR+ S + S+ GSAL + T+G+  D+        S   S+ K+ SR +VEC+ C K GHF   C K K+ +     A 
Subjt:  NSTGNNTLKFSEVCDLAIAGEIRRQGSNKESTVGSAL-VMTKGK--DKVDEDNEPSS---SRKKWKSRNEVECYYCHKKGHFKYQCRKFKEDQKRKREAN

Query:  I--VEEVVLACVESDTKYSNHSSDWILESAASVHIALDRSLFTSFTGGHHGLVRMGKGRTSKTRGIEDVSLKTECGGKLVLRDVRYVPNIKMNLISIGKL
           V++ ++  V+S         +WIL+S AS H      +  ++ GG HG+V +  G   K  GI DV +KT  G    L++VR+VP +K  LIS+G+L
Subjt:  I--VEEVVLACVESDTKYSNHSSDWILESAASVHIALDRSLFTSFTGGHHGLVRMGKGRTSKTRGIEDVSLKTECGGKLVLRDVRYVPNIKMNLISIGKL

Query:  ADDGYMYEFGSRQCKLKFESQVVAVGHRKSTL
         D G+   F     K+   + V+A G +  TL
Subjt:  ADDGYMYEFGSRQCKLKFESQVVAVGHRKSTL

A0A4U5QS59 CCHC-type domain-containing protein4.7e-5440.66Show/hide
Query:  MCLSMDVASLVAHETTAVKLMEALTNRYEKPSANNKVYLVKKFFNMQMSEDASVNSYNNEVTTLINQLKSVKIEFSDEVNVIQLLTSLPDSWETMKTTVS
        + LS  VA  V  E +  KLMEAL+  YEKPSANNKV+L+KK FN++M E+ SV  + N   T+ NQL SV+IEF DE+  + LL SLP SWE M+T VS
Subjt:  MCLSMDVASLVAHETTAVKLMEALTNRYEKPSANNKVYLVKKFFNMQMSEDASVNSYNNEVTTLINQLKSVKIEFSDEVNVIQLLTSLPDSWETMKTTVS

Query:  NSTGNNTLKFSEVCDLAIAGEIRRQGSNKESTVGSAL-VMTKGK--DKVDEDNEPSS---SRKKWKSRNEVECYYCHKKGHFKYQCRKFKEDQKRKREAN
        NS G + LK+ ++ DL +A E+RR+ S + S+ GSAL + T+G+  D+        S   S+ K+ SR +VEC+ C K GHF   C K K+ +     A 
Subjt:  NSTGNNTLKFSEVCDLAIAGEIRRQGSNKESTVGSAL-VMTKGK--DKVDEDNEPSS---SRKKWKSRNEVECYYCHKKGHFKYQCRKFKEDQKRKREAN

Query:  I--VEEVVLACVESDTKYSNHSSDWILESAASVHIALDRSLFTSFTGGHHGLVRMGKGRTSKTRGIEDVSLKTECGGKLVLRDVRYVPNIKMNLISIGKL
           V++ ++  V+S         +WIL+S AS H      +  ++ GG HG++ +  G      GI DV +KT  G    L++VR+VP +K  LIS+G+L
Subjt:  I--VEEVVLACVESDTKYSNHSSDWILESAASVHIALDRSLFTSFTGGHHGLVRMGKGRTSKTRGIEDVSLKTECGGKLVLRDVRYVPNIKMNLISIGKL

Query:  ADDGYMYEFGSRQCKLKFESQVVAVGHRKSTL
         D GY   F     K+   + V+A G +  TL
Subjt:  ADDGYMYEFGSRQCKLKFESQVVAVGHRKSTL

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.5e-3633.24Show/hide
Query:  LSMDVASLVAHETTAVKLMEALTNRYEKPSANNKVYLVKKFFNMQMSEDASVNSYNNEVTTLINQLKSVKIEFSDEVNVIQLLTSLPDSWETMKTTVSNS
        LS DV + +  E TA  +   L + Y   +  NK+YL K+ + + MSE  +  S+ N    LI QL ++ ++  +E   I LL SLP S++ + TT+ + 
Subjt:  LSMDVASLVAHETTAVKLMEALTNRYEKPSANNKVYLVKKFFNMQMSEDASVNSYNNEVTTLINQLKSVKIEFSDEVNVIQLLTSLPDSWETMKTTVSNS

Query:  TGNNTLKFSEVCDLAIAGEIRRQGSNKESTVGSALVMT-KGKDKVDEDNEPSSSRKKWKSRNEVE-----CYYCHKKGHFKYQC---RKFK-EDQKRKRE
         G  T++  +V    +  E  R+   K    G AL+   +G+      N    S  + KS+N  +     CY C++ GHFK  C   RK K E   +K +
Subjt:  TGNNTLKFSEVCDLAIAGEIRRQGSNKESTVGSALVMT-KGKDKVDEDNEPSSSRKKWKSRNEVE-----CYYCHKKGHFKYQC---RKFK-EDQKRKRE

Query:  ANIV------EEVVLACVESD--TKYSNHSSDWILESAASVHIALDRSLFTSFTGGHHGLVRMGKGRTSKTRGIEDVSLKTECGGKLVLRDVRYVPNIKM
         N        + VVL   E +     S   S+W++++AAS H    R LF  +  G  G V+MG    SK  GI D+ +KT  G  LVL+DVR+VP+++M
Subjt:  ANIV------EEVVLACVESD--TKYSNHSSDWILESAASVHIALDRSLFTSFTGGHHGLVRMGKGRTSKTRGIEDVSLKTECGGKLVLRDVRYVPNIKM

Query:  NLISIGKLADDGYMYEFGSRQCKLKFESQVVAVGHRKSTLYRCQLNVAKGSM
        NLIS   L  DGY   F +++ +L   S V+A G  + TLYR    + +G +
Subjt:  NLISIGKLADDGYMYEFGSRQCKLKFESQVVAVGHRKSTLYRCQLNVAKGSM

P25601 Putative transposon Ty5-1 protein YCL075W1.7e-0534.48Show/hide
Query:  CVESDTKYSNHSSDWILESAASVHIALDRSLFTSFTGGHHGLVRMGKGRTSKTRGIEDVSLKTECGGKLVLRDVRYVPNIKMNLISI
        C+ S T  +  SS+WI ++  + H+  DRS+F+SFT         G G +    G   V++     G + L DV YVP++ +NLIS+
Subjt:  CVESDTKYSNHSSDWILESAASVHIALDRSLFTSFTGGHHGLVRMGKGRTSKTRGIEDVSLKTECGGKLVLRDVRYVPNIKMNLISI

Arabidopsis top hitse value%identityAlignment
AT3G20980.1 Gag-Pol-related retrotransposon family protein1.4e-0529.36Show/hide
Query:  NIVEEVVLACVESDTKYSNHSSDWILESAASVHIALDRSLFTSFTGGHHGLVRMGKGRTSKT-----RGIEDVSLKTECGGKLVLRDVRYVPNIKMNLIS
        +++ EV     E  +KY+ H + W++ S  S H+      FT+        V+   G  S+T      GI DV+  T  G K  +++V YVP I+ N +S
Subjt:  NIVEEVVLACVESDTKYSNHSSDWILESAASVHIALDRSLFTSFTGGHHGLVRMGKGRTSKT-----RGIEDVSLKTECGGKLVLRDVRYVPNIKMNLIS

Query:  IGKLADDGY
        + +L  +G+
Subjt:  IGKLADDGY

AT3G21000.1 Gag-Pol-related retrotransposon family protein6.1e-0621.21Show/hide
Query:  LVKKFFNMQMSEDASVNSYNNEVTTLINQLKSVKIEFSDEVNVIQLLTSLPDSWETMKTTVSNSTGNNTLKFSEVCDLAIAGEIRRQGSNKESTVGSALV
        L K+  +++M +  S +SY ++   ++ +L   K+E SD      + T+L  S++ + + +      + +    + +       R   S+ E  +   L 
Subjt:  LVKKFFNMQMSEDASVNSYNNEVTTLINQLKSVKIEFSDEVNVIQLLTSLPDSWETMKTTVSNSTGNNTLKFSEVCDLAIAGEIRRQGSNKESTVGSALV

Query:  MTKGKDKVDEDNEPSSSRKKWKSRNEVECYYCHKKGHFKYQCRKFKEDQKRKREANIVEEVVLACVESDTKYSNHSSDWILESAASVHIALDRSLFTSFT
            KD             + KS++E  C  C+K  H +  C+      K ++E  IV +  L  V +    +     WI+   A +++      FT+  
Subjt:  MTKGKDKVDEDNEPSSSRKKWKSRNEVECYYCHKKGHFKYQCRKFKEDQKRKREANIVEEVVLACVESDTKYSNHSSDWILESAASVHIALDRSLFTSFT

Query:  GGHHGLVRMGKGRTSKTRGIEDVSLKTECGGKLVLRDVRYVPNIKMNLISIGKLADDGYMYEFG
              V    G      G  DV ++ + G K  +R+V +VP +  N++S GK+    Y    G
Subjt:  GGHHGLVRMGKGRTSKTRGIEDVSLKTECGGKLVLRDVRYVPNIKMNLISIGKLADDGYMYEFG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGTTTGTCAATGGATGTGGCAAGTCTAGTAGCTCATGAGACAACTGCAGTCAAGTTGATGGAAGCGCTTACAAACAGGTATGAAAAACCCTCTGCAAATAATAAGGT
CTACCTAGTTAAGAAGTTTTTCAACATGCAAATGTCTGAGGATGCTTCTGTGAATTCCTATAATAATGAGGTTACCACTTTGATTAATCAGTTAAAATCTGTTAAGATAG
AATTTTCTGATGAGGTGAATGTTATTCAGTTGTTAACGTCTTTACCTGATAGTTGGGAAACGATGAAGACAACAGTGTCTAATTCGACTGGAAATAACACTTTAAAATTT
TCAGAAGTTTGTGATTTAGCCATAGCTGGGGAAATTCGTAGGCAGGGTAGTAATAAGGAGTCTACAGTAGGGTCAGCTTTGGTTATGACTAAAGGTAAAGATAAGGTTGA
TGAAGATAATGAACCGAGTAGCAGTAGGAAAAAGTGGAAAAGTAGGAATGAGGTGGAATGTTATTACTGCCATAAGAAAGGTCACTTCAAGTATCAGTGTAGGAAATTTA
AAGAGGATCAGAAAAGAAAACGAGAGGCAAATATAGTGGAGGAGGTTGTCTTAGCTTGTGTTGAGAGTGACACAAAGTATAGTAACCACTCATCAGATTGGATATTAGAA
AGTGCAGCTTCTGTTCACATAGCTTTAGATAGGAGTTTGTTCACATCATTCACAGGAGGGCATCATGGCCTAGTGAGGATGGGGAAGGGTAGAACCTCCAAAACTAGAGG
GATTGAAGATGTTAGTCTGAAGACAGAATGTGGAGGTAAATTGGTACTGCGAGATGTCAGGTACGTGCCTAATATCAAGATGAATCTTATTTCTATTGGTAAGTTGGCAG
ATGATGGTTACATGTATGAGTTTGGTAGTCGCCAGTGTAAACTCAAGTTCGAATCCCAGGTAGTGGCAGTTGGTCACAGGAAATCTACACTGTACAGATGTCAGTTGAAT
GTTGCCAAAGGTTCAATGAGACAGTGGATGCCGGTTAAAGATGCAGATGGTAGTTGTAGAGGTACAGTTGAGCCAGCAGCAAGGATAGCCAATTTCGATCAGTCCGATCA
AGATCCTTCAGTTCAGAAACAATTGGGAAGTCCAGGAGAGAAAGTTGATGACTATCGTGAATCCCCAATTGTCAGACGGTCGAATGAATTGAACAAGTCGCTTAGGCGAG
TTGAAGCATCAGAGTGGAAGGCAGGAGCAGTTGCTAAAGTCAAAGATGCAAGGAATGAAAAGAGTAAAAGTGGAGAAAAGTCAAATCTCGGTCAACAGCAGGTTAGCGTC
GAGACGCTAGCTCTTGAGCGTCTCGACGCTCACATTCCATATCAGATTAGGCGCGTAAAGCTTACAGCGTCGAGACGCTATGATAGGAAGCGTCCCGACGCTACCGTTTT
TCCTTATTCAGAACGCGCGTATAAGAGGCAGCGTCGCAACGCTGTCTTGACAGCGTCTCGATGCTAA
mRNA sequenceShow/hide mRNA sequence
ATGTGTTTGTCAATGGATGTGGCAAGTCTAGTAGCTCATGAGACAACTGCAGTCAAGTTGATGGAAGCGCTTACAAACAGGTATGAAAAACCCTCTGCAAATAATAAGGT
CTACCTAGTTAAGAAGTTTTTCAACATGCAAATGTCTGAGGATGCTTCTGTGAATTCCTATAATAATGAGGTTACCACTTTGATTAATCAGTTAAAATCTGTTAAGATAG
AATTTTCTGATGAGGTGAATGTTATTCAGTTGTTAACGTCTTTACCTGATAGTTGGGAAACGATGAAGACAACAGTGTCTAATTCGACTGGAAATAACACTTTAAAATTT
TCAGAAGTTTGTGATTTAGCCATAGCTGGGGAAATTCGTAGGCAGGGTAGTAATAAGGAGTCTACAGTAGGGTCAGCTTTGGTTATGACTAAAGGTAAAGATAAGGTTGA
TGAAGATAATGAACCGAGTAGCAGTAGGAAAAAGTGGAAAAGTAGGAATGAGGTGGAATGTTATTACTGCCATAAGAAAGGTCACTTCAAGTATCAGTGTAGGAAATTTA
AAGAGGATCAGAAAAGAAAACGAGAGGCAAATATAGTGGAGGAGGTTGTCTTAGCTTGTGTTGAGAGTGACACAAAGTATAGTAACCACTCATCAGATTGGATATTAGAA
AGTGCAGCTTCTGTTCACATAGCTTTAGATAGGAGTTTGTTCACATCATTCACAGGAGGGCATCATGGCCTAGTGAGGATGGGGAAGGGTAGAACCTCCAAAACTAGAGG
GATTGAAGATGTTAGTCTGAAGACAGAATGTGGAGGTAAATTGGTACTGCGAGATGTCAGGTACGTGCCTAATATCAAGATGAATCTTATTTCTATTGGTAAGTTGGCAG
ATGATGGTTACATGTATGAGTTTGGTAGTCGCCAGTGTAAACTCAAGTTCGAATCCCAGGTAGTGGCAGTTGGTCACAGGAAATCTACACTGTACAGATGTCAGTTGAAT
GTTGCCAAAGGTTCAATGAGACAGTGGATGCCGGTTAAAGATGCAGATGGTAGTTGTAGAGGTACAGTTGAGCCAGCAGCAAGGATAGCCAATTTCGATCAGTCCGATCA
AGATCCTTCAGTTCAGAAACAATTGGGAAGTCCAGGAGAGAAAGTTGATGACTATCGTGAATCCCCAATTGTCAGACGGTCGAATGAATTGAACAAGTCGCTTAGGCGAG
TTGAAGCATCAGAGTGGAAGGCAGGAGCAGTTGCTAAAGTCAAAGATGCAAGGAATGAAAAGAGTAAAAGTGGAGAAAAGTCAAATCTCGGTCAACAGCAGGTTAGCGTC
GAGACGCTAGCTCTTGAGCGTCTCGACGCTCACATTCCATATCAGATTAGGCGCGTAAAGCTTACAGCGTCGAGACGCTATGATAGGAAGCGTCCCGACGCTACCGTTTT
TCCTTATTCAGAACGCGCGTATAAGAGGCAGCGTCGCAACGCTGTCTTGACAGCGTCTCGATGCTAA
Protein sequenceShow/hide protein sequence
MCLSMDVASLVAHETTAVKLMEALTNRYEKPSANNKVYLVKKFFNMQMSEDASVNSYNNEVTTLINQLKSVKIEFSDEVNVIQLLTSLPDSWETMKTTVSNSTGNNTLKF
SEVCDLAIAGEIRRQGSNKESTVGSALVMTKGKDKVDEDNEPSSSRKKWKSRNEVECYYCHKKGHFKYQCRKFKEDQKRKREANIVEEVVLACVESDTKYSNHSSDWILE
SAASVHIALDRSLFTSFTGGHHGLVRMGKGRTSKTRGIEDVSLKTECGGKLVLRDVRYVPNIKMNLISIGKLADDGYMYEFGSRQCKLKFESQVVAVGHRKSTLYRCQLN
VAKGSMRQWMPVKDADGSCRGTVEPAARIANFDQSDQDPSVQKQLGSPGEKVDDYRESPIVRRSNELNKSLRRVEASEWKAGAVAKVKDARNEKSKSGEKSNLGQQQVSV
ETLALERLDAHIPYQIRRVKLTASRRYDRKRPDATVFPYSERAYKRQRRNAVLTASRC