; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc08g04360 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc08g04360
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr8:3176629..3177897
RNA-Seq ExpressionMoc08g04360
SyntenyMoc08g04360
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG5549868.1 hypothetical protein RHGRI_014986 [Rhododendron griersonianum]3.9e-6436.32Show/hide
Query:  MQVNDLLTCKKIHKTL---GERPADMTDKAWNEMDEQVVANIRMALSMGVCSLVAKETTAKELLKVLQDRYEKPSANTKIFLWTKYFNIHMEEGTSVNSH
        +++ DLL CK +H  +     +P DM  + W  ++ + V  IR  +   V   V+ ET+A +L K L+  Y++ SA  K FL+ K  N+  +EG S+  H
Subjt:  MQVNDLLTCKKIHKTL---GERPADMTDKAWNEMDEQVVANIRMALSMGVCSLVAKETTAKELLKVLQDRYEKPSANTKIFLWTKYFNIHMEEGTSVNSH

Query:  INELTDILNKLEGMGVKIEEEVKAMRLLTSLPDSWETMKTAVSNSLGENSLKFLAICDAALSEEARRKLGKMSVTTLGAENGVESALV---AQFKGKGKM
        +NE+  I+N+L  M +  ++E++A+ LL+SLP++WET+   VSNS  +  +    +  + L+EE RRK       + G+ +  E+ +V    + +G+   
Subjt:  INELTDILNKLEGMGVKIEEEVKAMRLLTSLPDSWETMKTAVSNSLGENSLKFLAICDAALSEEARRKLGKMSVTTLGAENGVESALV---AQFKGKGKM

Query:  KYNGKQQHRNNRGSGNSSGEVECFYCHKKDHFKKHCRKLKEDQEN--------EDTLNYVSAE---VLACIEGNTTPVDRSSEWIVDSAASVHVASDRSW
         +N  Q   ++RG   S  ++EC YC KK H K+ C KLK  +EN        ++    VS++   V+ C E     V + + W++DS AS HV S   +
Subjt:  KYNGKQQHRNNRGSGNSSGEVECFYCHKKDHFKKHCRKLKEDQEN--------EDTLNYVSAE---VLACIEGNTTPVDRSSEWIVDSAASVHVASDRSW

Query:  FTSFTVGNHGVVRMGNGRLSKIRGIGDVRLRTDNGTELVLHGVRYVPSFMMNLISAGKLDDEGYRSEFAENRWKLMRGSEIVVVGHRKASVYVLRFDVAR
        FTS+  G+ G VRMGN  LSKI G+G++ L T  G +LVL  VR+VP   +NLIS GKLDDEGY + F   +WKL +GS +V  G +  S+Y ++  +++
Subjt:  FTSFTVGNHGVVRMGNGRLSKIRGIGDVRLRTDNGTELVLHGVRYVPSFMMNLISAGKLDDEGYRSEFAENRWKLMRGSEIVVVGHRKASVYVLRFDVAR

Query:  GL
        GL
Subjt:  GL

KAG7561662.1 Zinc finger CCHC-type superfamily [Arabidopsis thaliana x Arabidopsis arenosa]4.2e-6638.15Show/hide
Query:  MQVNDLLTCKKIHKTLGERPADMTDKAWNEMDEQVVANIRMALSMGVCSLVAKETTAKELLKVLQDRYEKPSANTKIFLWTKYFNIHMEEGTSVNSHINE
        MQ+ D L  KK+H+ L  +P  M  + W+ +D QV+  IR+ LS  V   VAKE T + L+KVL D YEKPSAN K+FL  K F++ MEEG  V +H+NE
Subjt:  MQVNDLLTCKKIHKTLGERPADMTDKAWNEMDEQVVANIRMALSMGVCSLVAKETTAKELLKVLQDRYEKPSANTKIFLWTKYFNIHMEEGTSVNSHINE

Query:  LTDILNKLEGMGVKIEEEVKAMRLLTSLPDSWETMKTAVSNSLGENSLKFLAICDAALSEEARRKLGKMSVTTLGAENGVESALVAQFKGKGKMKYN---
           I+N+L  + ++ ++EV+A+ L+ SLP+SWE M+ AVSNS+G   LKF+ + D  L EE RR      + T   E    SA   + +G+ + + N   
Subjt:  LTDILNKLEGMGVKIEEEVKAMRLLTSLPDSWETMKTAVSNSLGENSLKFLAICDAALSEEARRKLGKMSVTTLGAENGVESALVAQFKGKGKMKYN---

Query:  GKQQHRNNRGSGNSSGEVECFYCHKKDHFKKHC-RKLKEDQENEDTLNYVSAEV----LACIEG-----------NTTPVDRS------SEWIVDSAASV
        G+ + RN +G   S   VEC+ C K  HFK +C    K++       N V+ E+    L  ++            NTT    S        W++DS AS 
Subjt:  GKQQHRNNRGSGNSSGEVECFYCHKKDHFKKHC-RKLKEDQENEDTLNYVSAEV----LACIEG-----------NTTPVDRS------SEWIVDSAASV

Query:  HVASDRSWFTSFTVGNHGVVRMGNGRLSKIRGIGDVRLRTDNGTELVLHGVRYVPSFMMNLISAGKLDDEGYRSEFAENRWKLMRGSEIVVVGHRKASVY
        H   DR+   ++ VGN+G V + NG    I GIGD+ L+  +G    +  VR+VP  M NLIS G+LDD G+   F +  WK+ +GS +V  GH++ S+Y
Subjt:  HVASDRSWFTSFTVGNHGVVRMGNGRLSKIRGIGDVRLRTDNGTELVLHGVRYVPSFMMNLISAGKLDDEGYRSEFAENRWKLMRGSEIVVVGHRKASVY

Query:  V
        +
Subjt:  V

KAG7584790.1 Zinc finger CCHC-type superfamily [Arabidopsis thaliana x Arabidopsis arenosa]7.2e-6638.15Show/hide
Query:  MQVNDLLTCKKIHKTLGERPADMTDKAWNEMDEQVVANIRMALSMGVCSLVAKETTAKELLKVLQDRYEKPSANTKIFLWTKYFNIHMEEGTSVNSHINE
        MQ+ D L  KK+H+ L  +P  M  + W+ +D QV+  IR+ LS  V   VAKE T + L+KVL D YEKPSAN K+FL  K F++ MEEG  V +H+NE
Subjt:  MQVNDLLTCKKIHKTLGERPADMTDKAWNEMDEQVVANIRMALSMGVCSLVAKETTAKELLKVLQDRYEKPSANTKIFLWTKYFNIHMEEGTSVNSHINE

Query:  LTDILNKLEGMGVKIEEEVKAMRLLTSLPDSWETMKTAVSNSLGENSLKFLAICDAALSEEARRKLGKMSVTTLGAENGVESALVAQFKGKGKMKYN---
           I+N+L  + ++ ++EV+A+ LL SLP+SWE M+ AVSNS+G   LKF+ + D  L EE RR      + T   E  + SA   + +G+ + + N   
Subjt:  LTDILNKLEGMGVKIEEEVKAMRLLTSLPDSWETMKTAVSNSLGENSLKFLAICDAALSEEARRKLGKMSVTTLGAENGVESALVAQFKGKGKMKYN---

Query:  GKQQHRNNRGSGNSSGEVECFYCHKKDHFKKHC-RKLKEDQENEDTLNYVSAEV----LACIEG-----------NTTPVDRS------SEWIVDSAASV
        G+ + RN +G   S   VEC+ C K  HFK +C    K++       N V+ E+    L  ++            NTT    S        W++DS AS 
Subjt:  GKQQHRNNRGSGNSSGEVECFYCHKKDHFKKHC-RKLKEDQENEDTLNYVSAEV----LACIEG-----------NTTPVDRS------SEWIVDSAASV

Query:  HVASDRSWFTSFTVGNHGVVRMGNGRLSKIRGIGDVRLRTDNGTELVLHGVRYVPSFMMNLISAGKLDDEGYRSEFAENRWKLMRGSEIVVVGHRKASVY
        H   DR+   ++  GN+G V + NG    I GIGD+ L+  +G    +  VR+VP  M NLIS G+LDD G+   F +  WK+ +GS +V  GH++ S+Y
Subjt:  HVASDRSWFTSFTVGNHGVVRMGNGRLSKIRGIGDVRLRTDNGTELVLHGVRYVPSFMMNLISAGKLDDEGYRSEFAENRWKLMRGSEIVVVGHRKASVY

Query:  V
        +
Subjt:  V

KAG7593230.1 Pentatricopeptide repeat [Arabidopsis thaliana x Arabidopsis arenosa]5.5e-6639.58Show/hide
Query:  MQVNDLLTCKKIHKTLGERPADMTDKAWNEMDEQVVANIRMALSMGVCSLVAKETTAKELLKVLQDRYEKPSANTKIFLWTKYFNIHMEEGTSVNSHINE
        MQ+ D L  KK+H+ L  +P  M  + W+ +D QV+  IR+ LS  V   VAKE T + L+KVL D YEKPSAN K+FL  K F++ MEEG  V +H+NE
Subjt:  MQVNDLLTCKKIHKTLGERPADMTDKAWNEMDEQVVANIRMALSMGVCSLVAKETTAKELLKVLQDRYEKPSANTKIFLWTKYFNIHMEEGTSVNSHINE

Query:  LTDILNKLEGMGVKIEEEVKAMRLLTSLPDSWETMKTAVSNSLGENSLKFLAICDAALSEEARRKLGKMSVTTLGAENGVESALVAQFKGKGKMKYN---
           I+N+L  + ++ ++EV+A+ LL SLP+SWE M+ AVSNS+G   LKF+ + D  L EE RR      + T   E    SA   + +G+ + + N   
Subjt:  LTDILNKLEGMGVKIEEEVKAMRLLTSLPDSWETMKTAVSNSLGENSLKFLAICDAALSEEARRKLGKMSVTTLGAENGVESALVAQFKGKGKMKYN---

Query:  GKQQHRNNRGSGNSSGEVECFYCHKKDHFKKHCRKLKEDQENEDTLNYVSAEVLACIEGNTTPVDRSSEWIVDSAASVHVASDRSWFTSFTVGNHGVVRM
        G+ + RN +G   S   VEC+ C K  HFK              T+N   A     I    +P+D    W++DS AS H   DR+   ++ VGN+G V +
Subjt:  GKQQHRNNRGSGNSSGEVECFYCHKKDHFKKHCRKLKEDQENEDTLNYVSAEVLACIEGNTTPVDRSSEWIVDSAASVHVASDRSWFTSFTVGNHGVVRM

Query:  GNGRLSKIRGIGDVRLRTDNGTELVLHGVRYVPSFMMNLISAGKLDDEGYRSEFAENRWKLMRGSEIVVVGHRKASVYV
         NG    I GIGD+ L+  +G    +  VR+VP  M NLIS G+LDD G+   F +  WK+ +GS +V  GH++ S+Y+
Subjt:  GNGRLSKIRGIGDVRLRTDNGTELVLHGVRYVPSFMMNLISAGKLDDEGYRSEFAENRWKLMRGSEIVVVGHRKASVYV

RVW84195.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]3.3e-6335.68Show/hide
Query:  VNDLLTCKKIHKTL---GERPADMTDKAWNEMDEQVVANIRMALSMGVCSLVAKETTAKELLKVLQDRYEKPSANTKIFLWTKYFNIHMEEGTSVNSHIN
        + DLL CK ++  +     +P  M D  W ++D + V  IR  +   V   V+ E +A  L   L+  Y++ +A  K FL+ K  N   +EGT +  H+N
Subjt:  VNDLLTCKKIHKTL---GERPADMTDKAWNEMDEQVVANIRMALSMGVCSLVAKETTAKELLKVLQDRYEKPSANTKIFLWTKYFNIHMEEGTSVNSHIN

Query:  ELTDILNKLEGMGVKIEEEVKAMRLLTSLPDSWETMKTAVSNSLGENSLKFLAICDAALSEEARRKLGKMSVTTLGAENGVESALVAQFKGKGKMKYNGK
        E+  I+N+L  M +  ++E++A+ LL+SLP+SWET+   VSNS  +  +    +  + L+EE RRK          + +    ALV + +G+G+ K   K
Subjt:  ELTDILNKLEGMGVKIEEEVKAMRLLTSLPDSWETMKTAVSNSLGENSLKFLAICDAALSEEARRKLGKMSVTTLGAENGVESALVAQFKGKGKMKYNGK

Query:  QQHRNNRGSGNSSG----EVECFYCHKKDHFKKHCRKLKEDQENE---------DTLNYVSAEVLACIEG-NTTPVDRSSEWIVDSAASVHVASDRSWFT
          H  ++  G SS     +VEC+YCHKK H K+ CRKLK  ++N+         DT      E++   +  +   + + ++W++DS AS HV S   +FT
Subjt:  QQHRNNRGSGNSSG----EVECFYCHKKDHFKKHCRKLKEDQENE---------DTLNYVSAEVLACIEG-NTTPVDRSSEWIVDSAASVHVASDRSWFT

Query:  SFTVGNHGVVRMGNGRLSKIRGIGDVRLRTDNGTELVLHGVRYVPSFMMNLISAGKLDDEGYRSEFAENRWKLMRGSEIVVVGHRKASVYVLRFDVAR
        S++ G+ G VRMGN  +SKI G+GD+ L T+ G +L+L  VR+VP   +NLIS GKLDDEGY + F++ +WKL +GS +V  G +  S+Y ++  + +
Subjt:  SFTVGNHGVVRMGNGRLSKIRGIGDVRLRTDNGTELVLHGVRYVPSFMMNLISAGKLDDEGYRSEFAENRWKLMRGSEIVVVGHRKASVYVLRFDVAR

TrEMBL top hitse value%identityAlignment
A0A2N9G6Q3 Uncharacterized protein1.4e-6238.42Show/hide
Query:  MQVNDLLTCKKIH-KTLGERPADMTDKAWNEMDEQVVANIRMALSMGVCSLVAKETTAKELLKVLQDRYEKPSANTKIFLWTKYFNIHMEEGTSVNSHIN
        MQ+ D L  KK+H   LGE+P DM D  W  +D QV+  IR+ LS  V   V KE T  EL+  L   YEKPSAN K+ L  K FN+ M EGT+V  H+N
Subjt:  MQVNDLLTCKKIH-KTLGERPADMTDKAWNEMDEQVVANIRMALSMGVCSLVAKETTAKELLKVLQDRYEKPSANTKIFLWTKYFNIHMEEGTSVNSHIN

Query:  ELTDILNKLEGMGVKIEEEVKAMRLLTSLPDSWETMKTAVSNSLGENSLKFLAICDAALSEEARRKLGKMSVTTLGAENGVESALVAQFKGKGK-MKYN-
        E   I N+L  + ++ ++E++A+ +L SLP+SWE M+ AVSNS G+  LK+  I D  L EE RR+         G  +   SAL  + +G+GK   YN 
Subjt:  ELTDILNKLEGMGVKIEEEVKAMRLLTSLPDSWETMKTAVSNSLGENSLKFLAICDAALSEEARRKLGKMSVTTLGAENGVESALVAQFKGKGK-MKYN-

Query:  GKQQHRNNRGSGNSSGEVECFYCHKKDHFKKHCRKLKEDQENEDTLNYVSAEVL-ACIEGNTTPVDRSSEWIVDSAASVHVASDRSWFTSFTVGNHGVVR
        G+ + R  R       ++EC+ C K  H +K+C +LK+  EN D+ N V+ EV  A +    +P++    W++DS AS H  + R    ++  G+ G V 
Subjt:  GKQQHRNNRGSGNSSGEVECFYCHKKDHFKKHCRKLKEDQENEDTLNYVSAEVL-ACIEGNTTPVDRSSEWIVDSAASVHVASDRSWFTSFTVGNHGVVR

Query:  MGNGRLSKIRGIGDVRLRTDNGTELVLHGVRYVPSFMMNLISAGKLDDEGYRSEFAENRWKLMRGSEIVVVGHRKASVYV
        + +     + G+GDVR+   NG+  +L  VR+VP    NLIS G+LD EG+   F    WK+ +G+ +V  G +  ++Y+
Subjt:  MGNGRLSKIRGIGDVRLRTDNGTELVLHGVRYVPSFMMNLISAGKLDDEGYRSEFAENRWKLMRGSEIVVVGHRKASVYV

A0A2N9GHK9 Uncharacterized protein1.4e-6238.42Show/hide
Query:  MQVNDLLTCKKIH-KTLGERPADMTDKAWNEMDEQVVANIRMALSMGVCSLVAKETTAKELLKVLQDRYEKPSANTKIFLWTKYFNIHMEEGTSVNSHIN
        MQ+ D L  KK+H   LGE+P DM D  W  +D QV+  IR+ LS  V   V KE T  EL+  L   YEKPSAN K+ L  K FN+ M EGT+V  H+N
Subjt:  MQVNDLLTCKKIH-KTLGERPADMTDKAWNEMDEQVVANIRMALSMGVCSLVAKETTAKELLKVLQDRYEKPSANTKIFLWTKYFNIHMEEGTSVNSHIN

Query:  ELTDILNKLEGMGVKIEEEVKAMRLLTSLPDSWETMKTAVSNSLGENSLKFLAICDAALSEEARRKLGKMSVTTLGAENGVESALVAQFKGKGK-MKYN-
        E   I N+L  + ++ ++E++A+ +L SLP+SWE M+ AVSNS G+  LK+  I D  L EE RR+         G  +   SAL  + +G+GK   YN 
Subjt:  ELTDILNKLEGMGVKIEEEVKAMRLLTSLPDSWETMKTAVSNSLGENSLKFLAICDAALSEEARRKLGKMSVTTLGAENGVESALVAQFKGKGK-MKYN-

Query:  GKQQHRNNRGSGNSSGEVECFYCHKKDHFKKHCRKLKEDQENEDTLNYVSAEVL-ACIEGNTTPVDRSSEWIVDSAASVHVASDRSWFTSFTVGNHGVVR
        G+ + R  R       ++EC+ C K  H +K+C +LK+  EN D+ N V+ EV  A +    +P++    W++DS AS H  + R    ++  G+ G V 
Subjt:  GKQQHRNNRGSGNSSGEVECFYCHKKDHFKKHCRKLKEDQENEDTLNYVSAEVL-ACIEGNTTPVDRSSEWIVDSAASVHVASDRSWFTSFTVGNHGVVR

Query:  MGNGRLSKIRGIGDVRLRTDNGTELVLHGVRYVPSFMMNLISAGKLDDEGYRSEFAENRWKLMRGSEIVVVGHRKASVYV
        + +     + G+GDVR+   NG+  +L  VR+VP    NLIS G+LD EG+   F    WK+ +G+ +V  G +  ++Y+
Subjt:  MGNGRLSKIRGIGDVRLRTDNGTELVLHGVRYVPSFMMNLISAGKLDDEGYRSEFAENRWKLMRGSEIVVVGHRKASVYV

A0A2N9IKI1 Uncharacterized protein1.4e-6238.42Show/hide
Query:  MQVNDLLTCKKIH-KTLGERPADMTDKAWNEMDEQVVANIRMALSMGVCSLVAKETTAKELLKVLQDRYEKPSANTKIFLWTKYFNIHMEEGTSVNSHIN
        MQ+ D L  KK+H   LGE+P DM D  W  +D QV+  IR+ LS  V   V KE T  EL+  L   YEKPSAN K+ L  K FN+ M EGT+V  H+N
Subjt:  MQVNDLLTCKKIH-KTLGERPADMTDKAWNEMDEQVVANIRMALSMGVCSLVAKETTAKELLKVLQDRYEKPSANTKIFLWTKYFNIHMEEGTSVNSHIN

Query:  ELTDILNKLEGMGVKIEEEVKAMRLLTSLPDSWETMKTAVSNSLGENSLKFLAICDAALSEEARRKLGKMSVTTLGAENGVESALVAQFKGKGK-MKYN-
        E   I N+L  + ++ ++E++A+ +L SLP+SWE M+ AVSNS G+  LK+  I D  L EE RR+         G  +   SAL  + +G+GK   YN 
Subjt:  ELTDILNKLEGMGVKIEEEVKAMRLLTSLPDSWETMKTAVSNSLGENSLKFLAICDAALSEEARRKLGKMSVTTLGAENGVESALVAQFKGKGK-MKYN-

Query:  GKQQHRNNRGSGNSSGEVECFYCHKKDHFKKHCRKLKEDQENEDTLNYVSAEVL-ACIEGNTTPVDRSSEWIVDSAASVHVASDRSWFTSFTVGNHGVVR
        G+ + R  R       ++EC+ C K  H +K+C +LK+  EN D+ N V+ EV  A +    +P++    W++DS AS H  + R    ++  G+ G V 
Subjt:  GKQQHRNNRGSGNSSGEVECFYCHKKDHFKKHCRKLKEDQENEDTLNYVSAEVL-ACIEGNTTPVDRSSEWIVDSAASVHVASDRSWFTSFTVGNHGVVR

Query:  MGNGRLSKIRGIGDVRLRTDNGTELVLHGVRYVPSFMMNLISAGKLDDEGYRSEFAENRWKLMRGSEIVVVGHRKASVYV
        + +     + G+GDVR+   NG+  +L  VR+VP    NLIS G+LD EG+   F    WK+ +G+ +V  G +  ++Y+
Subjt:  MGNGRLSKIRGIGDVRLRTDNGTELVLHGVRYVPSFMMNLISAGKLDDEGYRSEFAENRWKLMRGSEIVVVGHRKASVYV

A0A2N9J3Y8 Uncharacterized protein1.4e-6238.42Show/hide
Query:  MQVNDLLTCKKIH-KTLGERPADMTDKAWNEMDEQVVANIRMALSMGVCSLVAKETTAKELLKVLQDRYEKPSANTKIFLWTKYFNIHMEEGTSVNSHIN
        MQ+ D L  KK+H   LGE+P DM D  W  +D QV+  IR+ LS  V   V KE T  EL+  L   YEKPSAN K+ L  K FN+ M EGT+V  H+N
Subjt:  MQVNDLLTCKKIH-KTLGERPADMTDKAWNEMDEQVVANIRMALSMGVCSLVAKETTAKELLKVLQDRYEKPSANTKIFLWTKYFNIHMEEGTSVNSHIN

Query:  ELTDILNKLEGMGVKIEEEVKAMRLLTSLPDSWETMKTAVSNSLGENSLKFLAICDAALSEEARRKLGKMSVTTLGAENGVESALVAQFKGKGK-MKYN-
        E   I N+L  + ++ ++E++A+ +L SLP+SWE M+ AVSNS G+  LK+  I D  L EE RR+         G  +   SAL  + +G+GK   YN 
Subjt:  ELTDILNKLEGMGVKIEEEVKAMRLLTSLPDSWETMKTAVSNSLGENSLKFLAICDAALSEEARRKLGKMSVTTLGAENGVESALVAQFKGKGK-MKYN-

Query:  GKQQHRNNRGSGNSSGEVECFYCHKKDHFKKHCRKLKEDQENEDTLNYVSAEVL-ACIEGNTTPVDRSSEWIVDSAASVHVASDRSWFTSFTVGNHGVVR
        G+ + R  R       ++EC+ C K  H +K+C +LK+  EN D+ N V+ EV  A +    +P++    W++DS AS H  + R    ++  G+ G V 
Subjt:  GKQQHRNNRGSGNSSGEVECFYCHKKDHFKKHCRKLKEDQENEDTLNYVSAEVL-ACIEGNTTPVDRSSEWIVDSAASVHVASDRSWFTSFTVGNHGVVR

Query:  MGNGRLSKIRGIGDVRLRTDNGTELVLHGVRYVPSFMMNLISAGKLDDEGYRSEFAENRWKLMRGSEIVVVGHRKASVYV
        + +     + G+GDVR+   NG+  +L  VR+VP    NLIS G+LD EG+   F    WK+ +G+ +V  G +  ++Y+
Subjt:  MGNGRLSKIRGIGDVRLRTDNGTELVLHGVRYVPSFMMNLISAGKLDDEGYRSEFAENRWKLMRGSEIVVVGHRKASVYV

A0A438HI91 Retrovirus-related Pol polyprotein from transposon TNT 1-941.6e-6335.68Show/hide
Query:  VNDLLTCKKIHKTL---GERPADMTDKAWNEMDEQVVANIRMALSMGVCSLVAKETTAKELLKVLQDRYEKPSANTKIFLWTKYFNIHMEEGTSVNSHIN
        + DLL CK ++  +     +P  M D  W ++D + V  IR  +   V   V+ E +A  L   L+  Y++ +A  K FL+ K  N   +EGT +  H+N
Subjt:  VNDLLTCKKIHKTL---GERPADMTDKAWNEMDEQVVANIRMALSMGVCSLVAKETTAKELLKVLQDRYEKPSANTKIFLWTKYFNIHMEEGTSVNSHIN

Query:  ELTDILNKLEGMGVKIEEEVKAMRLLTSLPDSWETMKTAVSNSLGENSLKFLAICDAALSEEARRKLGKMSVTTLGAENGVESALVAQFKGKGKMKYNGK
        E+  I+N+L  M +  ++E++A+ LL+SLP+SWET+   VSNS  +  +    +  + L+EE RRK          + +    ALV + +G+G+ K   K
Subjt:  ELTDILNKLEGMGVKIEEEVKAMRLLTSLPDSWETMKTAVSNSLGENSLKFLAICDAALSEEARRKLGKMSVTTLGAENGVESALVAQFKGKGKMKYNGK

Query:  QQHRNNRGSGNSSG----EVECFYCHKKDHFKKHCRKLKEDQENE---------DTLNYVSAEVLACIEG-NTTPVDRSSEWIVDSAASVHVASDRSWFT
          H  ++  G SS     +VEC+YCHKK H K+ CRKLK  ++N+         DT      E++   +  +   + + ++W++DS AS HV S   +FT
Subjt:  QQHRNNRGSGNSSG----EVECFYCHKKDHFKKHCRKLKEDQENE---------DTLNYVSAEVLACIEG-NTTPVDRSSEWIVDSAASVHVASDRSWFT

Query:  SFTVGNHGVVRMGNGRLSKIRGIGDVRLRTDNGTELVLHGVRYVPSFMMNLISAGKLDDEGYRSEFAENRWKLMRGSEIVVVGHRKASVYVLRFDVAR
        S++ G+ G VRMGN  +SKI G+GD+ L T+ G +L+L  VR+VP   +NLIS GKLDDEGY + F++ +WKL +GS +V  G +  S+Y ++  + +
Subjt:  SFTVGNHGVVRMGNGRLSKIRGIGDVRLRTDNGTELVLHGVRYVPSFMMNLISAGKLDDEGYRSEFAENRWKLMRGSEIVVVGHRKASVYVLRFDVAR

SwissProt top hitse value%identityAlignment
P04146 Copia protein2.2e-1724.14Show/hide
Query:  QVNDLLTCKKIHKTLGERPADMTDKAWNEMDEQVVANIRMALSMGVCSLVAKETTAKELLKVLQDRYEKPSANTKIFLWTKYFNIHMEEGTSVNSHINEL
        ++  LL  + + K +     +  D +W + +    + I   LS    +    + TA+++L+ L   YE+ S  +++ L  +  ++ +    S+ SH +  
Subjt:  QVNDLLTCKKIHKTLGERPADMTDKAWNEMDEQVVANIRMALSMGVCSLVAKETTAKELLKVLQDRYEKPSANTKIFLWTKYFNIHMEEGTSVNSHINEL

Query:  TDILNKLEGMGVKIEEEVKAMRLLTSLPDSWETMKTAVSNSLGENSLKFLAICDAALSEEARRKLGKMSVTTLGAENGVESALVAQFKGKGKMK-YNGKQ
         +++++L   G KIEE  K   LL +LP  ++ + TA+  +L E +L    + +  L +E + K      +       V +A+V       K   +  + 
Subjt:  TDILNKLEGMGVKIEEEVKAMRLLTSLPDSWETMKTAVSNSLGENSLKFLAICDAALSEEARRKLGKMSVTTLGAENGVESALVAQFKGKGKMK-YNGKQ

Query:  QHRNNRGSGNSSGEVECFYCHKKDHFKKHCRKLK-----EDQENEDTLNYVSAEVLACI--EGNTTPVDRSSEWIVDSAASVHVASDRSWFTSFTVG---
                GNS  +V+C +C ++ H KK C   K     +++ENE  +   ++  +A +  E N T V  +  +++DS AS H+ +D S +T        
Subjt:  QHRNNRGSGNSSGEVECFYCHKKDHFKKHCRKLK-----EDQENEDTLNYVSAEVLACI--EGNTTPVDRSSEWIVDSAASVHVASDRSWFTSFTVG---

Query:  -NHGVVRMGNGRLSKIRGIGDVRLRTDNGTELVLHGVRYVPSFMMNLISAGKLDDEGYRSEFAENRWKLMRGSEIVV
            V + G    +  RGI  VRLR D+  E+ L  V +      NL+S  +L + G   EF ++   + +   +VV
Subjt:  -NHGVVRMGNGRLSKIRGIGDVRLRTDNGTELVLHGVRYVPSFMMNLISAGKLDDEGYRSEFAENRWKLMRGSEIVV

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.2e-4730.88Show/hide
Query:  QVNDLLTCKKIHKTL---GERPADMTDKAWNEMDEQVVANIRMALSMGVCSLVAKETTAKELLKVLQDRYEKPSANTKIFLWTKYFNIHMEEGTSVNSHI
        ++ DLL  + +HK L    ++P  M  + W ++DE+  + IR+ LS  V + +  E TA+ +   L+  Y   +   K++L  + + +HM EGT+  SH+
Subjt:  QVNDLLTCKKIHKTL---GERPADMTDKAWNEMDEQVVANIRMALSMGVCSLVAKETTAKELLKVLQDRYEKPSANTKIFLWTKYFNIHMEEGTSVNSHI

Query:  NELTDILNKLEGMGVKIEEEVKAMRLLTSLPDSWETMKTAVSNSLGENSLKFLAICDAALSEEARRKLGKMSVTTLGAENGVESALVAQFKGKGKMKYNG
        N    ++ +L  +GVKIEEE KA+ LL SLP S++ + T + +  G+ +++   +  A L  E  RK                 AL+ + +G+   + + 
Subjt:  NELTDILNKLEGMGVKIEEEVKAMRLLTSLPDSWETMKTAVSNSLGENSLKFLAICDAALSEEARRKLGKMSVTTLGAENGVESALVAQFKGKGKMKYNG

Query:  KQQHRNNRGSGNSSGEV---ECFYCHKKDHFKKHC---RKLKED---QENEDTLNYVSAEVLACIEGNTTPV-------------DRSSEWIVDSAASVH
               RG   +  +     C+ C++  HFK+ C   RK K +   Q+N+D          A ++ N   V                SEW+VD+AAS H
Subjt:  KQQHRNNRGSGNSSGEV---ECFYCHKKDHFKKHC---RKLKED---QENEDTLNYVSAEVLACIEGNTTPV-------------DRSSEWIVDSAASVH

Query:  VASDRSWFTSFTVGNHGVVRMGNGRLSKIRGIGDVRLRTDNGTELVLHGVRYVPSFMMNLISAGKLDDEGYRSEFAENRWKLMRGSEIVVVGHRKASVYV
            R  F  +  G+ G V+MGN   SKI GIGD+ ++T+ G  LVL  VR+VP   MNLIS   LD +GY S FA  +W+L +GS ++  G  + ++Y 
Subjt:  VASDRSWFTSFTVGNHGVVRMGNGRLSKIRGIGDVRLRTDNGTELVLHGVRYVPSFMMNLISAGKLDDEGYRSEFAENRWKLMRGSEIVVVGHRKASVYV

Query:  LRFDVARG
           ++ +G
Subjt:  LRFDVARG

P25601 Putative transposon Ty5-1 protein YCL075W1.6e-0736.08Show/hide
Query:  TLNYVSAEVLACIEGNTTPVDRSSEWIVDSAASVHVASDRSWFTSFTVGNHGVVRMGNGRLSKIRGIGDVRLRTDNGTELVLHGVRYVPSFMMNLIS
        T   ++ +   C+  +T P  +SSEWI D+  + H+  DRS F+SFT  +      G G    I G G V + T     + LH V YVP   +NLIS
Subjt:  TLNYVSAEVLACIEGNTTPVDRSSEWIVDSAASVHVASDRSWFTSFTVGNHGVVRMGNGRLSKIRGIGDVRLRTDNGTELVLHGVRYVPSFMMNLIS

Arabidopsis top hitse value%identityAlignment
AT3G21000.1 Gag-Pol-related retrotransposon family protein1.0e-0624.18Show/hide
Query:  SHINELTDILNK-LEGMGVKIEEEVKAMRLLTSLPDSWETMKTAVSNSLGENSLKFLAICDAALSEEARRKLGKMSVTTL-GAENGVESAL-----VAQF
        S   ++ D+L K  E   ++  E+V   RL   L D     K + S+ L + +L+ L     A  E++  ++ K   TTL G+ +G++S L     V + 
Subjt:  SHINELTDILNK-LEGMGVKIEEEVKAMRLLTSLPDSWETMKTAVSNSLGENSLKFLAICDAALSEEARRKLGKMSVTTL-GAENGVESAL-----VAQF

Query:  KGKGKMKYNGKQQHRNNRGSG----------NSSGEVECFYCHKKDHFKKHCR----KLKEDQENEDTLNYVSAEVLACIEGNTTPVDRSSEWIVDSAAS
          K  ++Y   + H ++               S  E  C  C+K +H ++ C+      KE++E+E  ++Y   E +  +   T   D    WI+   A 
Subjt:  KGKGKMKYNGKQQHRNNRGSG----------NSSGEVECFYCHKKDHFKKHCR----KLKEDQENEDTLNYVSAEVLACIEGNTTPVDRSSEWIVDSAAS

Query:  VHVASDRSWFTSFTVGNHGVVRMGNGRLSKIRGIGDVRLRTDNGTELVLHGVRYVPSFMMNLISAGKLDDEGY
        +++     +FT+        V   +G +  + G GDV++R   G +  +  V +VP    N++S GK+  + Y
Subjt:  VHVASDRSWFTSFTVGNHGVVRMGNGRLSKIRGIGDVRLRTDNGTELVLHGVRYVPSFMMNLISAGKLDDEGY

AT3G29785.1 unknown protein3.8e-0938.96Show/hide
Query:  MQVNDLLTCKKIHKTLGERPADMTDKAWNEMDEQVVANIRMALSMGVCSLVAKETTAKELLKVLQDRYEKPSANTKI
        M++ D L  KK+H+ LG++   M+   WN +  QV+  IR+ +S  +   VAKE +   L+KVL D Y+KPS N  +
Subjt:  MQVNDLLTCKKIHKTLGERPADMTDKAWNEMDEQVVANIRMALSMGVCSLVAKETTAKELLKVLQDRYEKPSANTKI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAAGTAAATGATCTTCTTACGTGCAAGAAGATACACAAGACTTTGGGTGAGAGACCAGCGGATATGACAGACAAAGCTTGGAATGAGATGGATGAGCAGGTCGTTGC
AAATATCAGAATGGCATTATCAATGGGGGTATGCAGCCTCGTGGCGAAAGAGACGACTGCAAAAGAGTTATTGAAGGTCTTGCAAGATAGGTATGAAAAACCTTCTGCCA
ATACAAAAATATTTCTATGGACCAAGTATTTTAACATCCACATGGAGGAGGGAACCTCGGTGAATTCCCACATTAATGAGCTCACCGATATCTTGAACAAATTAGAAGGG
ATGGGTGTCAAGATTGAGGAGGAGGTGAAAGCTATGAGGCTGTTGACGTCTTTGCCTGACAGTTGGGAGACGATGAAGACTGCGGTGTCGAATTCGCTAGGAGAAAATAG
CTTGAAATTTTTAGCTATTTGTGATGCCGCCTTATCTGAGGAAGCCCGGAGAAAATTAGGAAAAATGTCTGTAACTACTTTAGGGGCAGAAAATGGGGTTGAATCAGCTT
TGGTAGCTCAGTTTAAGGGGAAGGGCAAGATGAAGTACAACGGGAAGCAGCAACATAGGAATAATAGGGGTAGTGGAAATTCCAGTGGAGAAGTTGAATGTTTTTACTGC
CATAAGAAAGACCACTTCAAGAAACATTGCAGGAAGCTTAAAGAGGATCAGGAAAATGAGGACACTTTAAATTACGTGTCAGCGGAGGTGTTAGCTTGTATTGAAGGTAA
CACAACACCTGTAGACCGTTCATCAGAGTGGATAGTGGACAGTGCAGCTTCGGTGCATGTAGCTTCAGACAGGAGTTGGTTCACGTCCTTTACTGTAGGAAATCATGGTG
TAGTAAGGATGGGAAATGGGAGACTCTCCAAGATCAGAGGAATTGGGGATGTTCGTTTGAGGACTGACAATGGGACCGAGCTAGTTTTGCATGGTGTCAGGTATGTACCC
AGTTTCATGATGAATTTGATATCAGCAGGGAAGCTGGATGACGAAGGCTACAGAAGTGAGTTTGCAGAGAATAGATGGAAACTCATGAGGGGATCCGAGATAGTGGTTGT
TGGCCACAGAAAAGCTTCAGTGTATGTGTTGAGGTTTGATGTTGCCAGAGGATTAGAGAGACAAGTTATGCACAGGGCTGCAGATAGTTCAGGAGGAGACTTGAAAGAAC
TAGCAGCATTGACAGTCAGAACAGATCAGAAGAATCTGCCATCAGTTCAAGTACAATAG
mRNA sequenceShow/hide mRNA sequence
ATGCAAGTAAATGATCTTCTTACGTGCAAGAAGATACACAAGACTTTGGGTGAGAGACCAGCGGATATGACAGACAAAGCTTGGAATGAGATGGATGAGCAGGTCGTTGC
AAATATCAGAATGGCATTATCAATGGGGGTATGCAGCCTCGTGGCGAAAGAGACGACTGCAAAAGAGTTATTGAAGGTCTTGCAAGATAGGTATGAAAAACCTTCTGCCA
ATACAAAAATATTTCTATGGACCAAGTATTTTAACATCCACATGGAGGAGGGAACCTCGGTGAATTCCCACATTAATGAGCTCACCGATATCTTGAACAAATTAGAAGGG
ATGGGTGTCAAGATTGAGGAGGAGGTGAAAGCTATGAGGCTGTTGACGTCTTTGCCTGACAGTTGGGAGACGATGAAGACTGCGGTGTCGAATTCGCTAGGAGAAAATAG
CTTGAAATTTTTAGCTATTTGTGATGCCGCCTTATCTGAGGAAGCCCGGAGAAAATTAGGAAAAATGTCTGTAACTACTTTAGGGGCAGAAAATGGGGTTGAATCAGCTT
TGGTAGCTCAGTTTAAGGGGAAGGGCAAGATGAAGTACAACGGGAAGCAGCAACATAGGAATAATAGGGGTAGTGGAAATTCCAGTGGAGAAGTTGAATGTTTTTACTGC
CATAAGAAAGACCACTTCAAGAAACATTGCAGGAAGCTTAAAGAGGATCAGGAAAATGAGGACACTTTAAATTACGTGTCAGCGGAGGTGTTAGCTTGTATTGAAGGTAA
CACAACACCTGTAGACCGTTCATCAGAGTGGATAGTGGACAGTGCAGCTTCGGTGCATGTAGCTTCAGACAGGAGTTGGTTCACGTCCTTTACTGTAGGAAATCATGGTG
TAGTAAGGATGGGAAATGGGAGACTCTCCAAGATCAGAGGAATTGGGGATGTTCGTTTGAGGACTGACAATGGGACCGAGCTAGTTTTGCATGGTGTCAGGTATGTACCC
AGTTTCATGATGAATTTGATATCAGCAGGGAAGCTGGATGACGAAGGCTACAGAAGTGAGTTTGCAGAGAATAGATGGAAACTCATGAGGGGATCCGAGATAGTGGTTGT
TGGCCACAGAAAAGCTTCAGTGTATGTGTTGAGGTTTGATGTTGCCAGAGGATTAGAGAGACAAGTTATGCACAGGGCTGCAGATAGTTCAGGAGGAGACTTGAAAGAAC
TAGCAGCATTGACAGTCAGAACAGATCAGAAGAATCTGCCATCAGTTCAAGTACAATAG
Protein sequenceShow/hide protein sequence
MQVNDLLTCKKIHKTLGERPADMTDKAWNEMDEQVVANIRMALSMGVCSLVAKETTAKELLKVLQDRYEKPSANTKIFLWTKYFNIHMEEGTSVNSHINELTDILNKLEG
MGVKIEEEVKAMRLLTSLPDSWETMKTAVSNSLGENSLKFLAICDAALSEEARRKLGKMSVTTLGAENGVESALVAQFKGKGKMKYNGKQQHRNNRGSGNSSGEVECFYC
HKKDHFKKHCRKLKEDQENEDTLNYVSAEVLACIEGNTTPVDRSSEWIVDSAASVHVASDRSWFTSFTVGNHGVVRMGNGRLSKIRGIGDVRLRTDNGTELVLHGVRYVP
SFMMNLISAGKLDDEGYRSEFAENRWKLMRGSEIVVVGHRKASVYVLRFDVARGLERQVMHRAADSSGGDLKELAALTVRTDQKNLPSVQVQ