; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc10g14310 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc10g14310
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr10:10977873..10979448
RNA-Seq ExpressionMoc10g14310
SyntenyMoc10g14310
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8654393.1 hypothetical protein F3Y22_tig00117048pilonHSYRG00173 [Hibiscus syriacus]2.9e-6342.06Show/hide
Query:  IGEQSSFEG---IMRFDGANFGYWKMQIKDYLTCKKVHKALKER-PAGMKLEDWEDMNEQAVAIIRLCLSMNVASLVANETTAIGLMNALTNGYEKPSAN
        +G  ++ EG   I +FDGA+FG+WKMQI+D+L  K +++ L E+ P GMK EDW  ++ QA+ +IRL LS NVA  +A E T  GLM AL++ YEKPSA+
Subjt:  IGEQSSFEG---IMRFDGANFGYWKMQIKDYLTCKKVHKALKER-PAGMKLEDWEDMNEQAVAIIRLCLSMNVASLVANETTAIGLMNALTNGYEKPSAN

Query:  NKVYLVRRYFNIRMDENASVNSHINEVTNLINQLASRNITFSDEVNAILLLTSLPESWETMKTTVSNSLGDKSLKFTEICDAAIAEEIYRKESGKESTSS
        NKV+L+RR FN+RM E ASV  H+NE+  +  QL+S  I F DEV A++LL+SLP+SW      VS+S G+  LKF  + D  ++EEI R+ESG+ STSS
Subjt:  NKVYLVRRYFNIRMDENASVNSHINEVTNLINQLASRNITFSDEVNAILLLTSLPESWETMKTTVSNSLGDKSLKFTEICDAAIAEEIYRKESGKESTSS

Query:  SASGSALAVQKGKEKVEYDDRQHRNSR----RNGNGEIECYYCHKKGHYKRHCR--KLRGYEKAGCKCKCSCNYGMVRMGN--GRLSKTRGIGNISLKTD
        +    +    +G+      +R    SR    R G  +  CY C KKGH+KR CR  K     +          + M+   N      K  G G+I LK  
Subjt:  SASGSALAVQKGKEKVEYDDRQHRNSR----RNGNGEIECYYCHKKGHYKRHCR--KLRGYEKAGCKCKCSCNYGMVRMGN--GRLSKTRGIGNISLKTD

Query:  SGTELVLRDVRYVPSIRMNLISAGKLDDDGYRSEFAGNMWKLTRGSELLVVGHRTSTVY
        + T   L+ VR++P ++ NLIS G+LD +G+ + F+G  WK+T+G+ ++    +T T+Y
Subjt:  SGTELVLRDVRYVPSIRMNLISAGKLDDDGYRSEFAGNMWKLTRGSELLVVGHRTSTVY

KAE8660210.1 F-box family protein [Hibiscus syriacus]6.9e-6539.39Show/hide
Query:  IGEQSSFEG---IMRFDGANFGYWKMQIKDYLTCKKVHKALK-ERPAGMKLEDWEDMNEQAVAIIRLCLSMNVASLVANETTAIGLMNALTNGYEKPSAN
        +G  ++ EG   I +FDGA+FG+WKMQI+D+L  K +++ L  ++P GMK EDW  ++ QA+ +IRL LS NVA  +A E T  GLM AL++ YEKPSA+
Subjt:  IGEQSSFEG---IMRFDGANFGYWKMQIKDYLTCKKVHKALK-ERPAGMKLEDWEDMNEQAVAIIRLCLSMNVASLVANETTAIGLMNALTNGYEKPSAN

Query:  NKVYLVRRYFNIRMDENASVNSHINEVTNLINQLASRNITFSDEVNAILLLTSLPESWETMKTTVSNSLGDKSLKFTEICDAAIAEEIYRKESGKESTSS
        NKV+L+RR FN+RM E ASV  H+NE+  +  QL+S  I F DEV A++LL+SLP+SW    T VS+S G+  LKF ++ D  ++EEI R+ESG+ STSS
Subjt:  NKVYLVRRYFNIRMDENASVNSHINEVTNLINQLASRNITFSDEVNAILLLTSLPESWETMKTTVSNSLGDKSLKFTEICDAAIAEEIYRKESGKESTSS

Query:  SASGSALAVQKGKEKVEYDDRQHRNSRRNGNGEIECYYCHKKGHYKRHCR---KLRGYEKA-------------------------------GCKCK---
        +    +      +       +  R   R GN +  CY C KKGH+KR CR   K  G +++                                  C+   
Subjt:  SASGSALAVQKGKEKVEYDDRQHRNSRRNGNGEIECYYCHKKGHYKRHCR---KLRGYEKA-------------------------------GCKCK---

Query:  ---CSCNYGMVRMGNGRLSKTRGIGNISLKTDSGTELVLRDVRYVPSIRMNLISAGKLDDDGYRSEFAGNMWKLTRGSELLVVGHRTSTVY
            S ++G V + +    K  G G+I LK  + T   L  VR++P ++ NLIS G+LD +GY + F+G  WK+T+G+ ++  G +T T+Y
Subjt:  ---CSCNYGMVRMGNGRLSKTRGIGNISLKTDSGTELVLRDVRYVPSIRMNLISAGKLDDDGYRSEFAGNMWKLTRGSELLVVGHRTSTVY

KAE8714488.1 hypothetical protein F3Y22_tig00110195pilonHSYRG00090 [Hibiscus syriacus]2.6e-6440.21Show/hide
Query:  IMRFDGANFGYWKMQIKDYLTCKKVHKALK-ERPAGMKLEDWEDMNEQAVAIIRLCLSMNVASLVANETTAIGLMNALTNGYEKPSANNKVYLVRRYFNI
        I +FDGA+FG+WKMQI+D+L  K +++ L  ++P GMK EDW  ++ QA+ +IRL LS NVA  +A E T  GLM AL++ YEKPSA+NKV+L+RR FN+
Subjt:  IMRFDGANFGYWKMQIKDYLTCKKVHKALK-ERPAGMKLEDWEDMNEQAVAIIRLCLSMNVASLVANETTAIGLMNALTNGYEKPSANNKVYLVRRYFNI

Query:  RMDENASVNSHINEVTNLINQLASRNITFSDEVNAILLLTSLPESWETMKTTVSNSLGDKSLKFTEICDAAIAEEIYRKESGKESTSSSASGSALAVQKG
        RM E ASV  H+NE+  +  QL+S  I F DEV A++LL+SLP+SW    T VS+S G+  LKF ++ D  ++EEI R+ESG+ STSS+    +    +G
Subjt:  RMDENASVNSHINEVTNLINQLASRNITFSDEVNAILLLTSLPESWETMKTTVSNSLGDKSLKFTEICDAAIAEEIYRKESGKESTSSSASGSALAVQKG

Query:  KEKVEYDDRQHRNSR----RNGNGEIECYYCHKKGHYKRHCRKLR----------------------------------GYEKAGCKCK------CSCNY
        +    Y +R    SR    R G  +  CY C KKGH+KR CR L+                                  G       C+       S ++
Subjt:  KEKVEYDDRQHRNSR----RNGNGEIECYYCHKKGHYKRHCRKLR----------------------------------GYEKAGCKCK------CSCNY

Query:  GMVRMGNGRLSKTRGIGNISLKTDSGTELVLRDVRYVPSIRMNLISAGKLDDDGYRSEFAGNMWKLTRGSELLVVGHRTSTVY
        G V + +    K  G G+I LK  + T   L  VR++P ++ NLIS G+LD +GY + F+G  WK+T+G+ ++  G +T T+Y
Subjt:  GMVRMGNGRLSKTRGIGNISLKTDSGTELVLRDVRYVPSIRMNLISAGKLDDDGYRSEFAGNMWKLTRGSELLVVGHRTSTVY

TKS02608.1 hypothetical protein D5086_0000161380 [Populus alba]1.7e-6338.96Show/hide
Query:  GIMRFDGANFGYWKMQIKDYLTCKKVH-KALKERPAGMKLEDWEDMNEQAVAIIRLCLSMNVASLVANETTAIGLMNALTNGYEKPSANNKVYLVRRYFN
        GI +FDG +FGYWKMQI+DYL  KK+H   L  +P  M+ E+W+ ++ Q + IIRL LS  VA  V  E +   LM AL+  YEKPSANNKV+L+++ FN
Subjt:  GIMRFDGANFGYWKMQIKDYLTCKKVH-KALKERPAGMKLEDWEDMNEQAVAIIRLCLSMNVASLVANETTAIGLMNALTNGYEKPSANNKVYLVRRYFN

Query:  IRMDENASVNSHINEVTNLINQLASRNITFSDEVNAILLLTSLPESWETMKTTVSNSLGDKSLKFTEICDAAIAEEIYRKESGKESTSSSASGSALAV--
        ++M EN SV  H+N    + NQL+S  I F DE+ A++LL SLP SWE M+T VSNS G   LK+ +I D  +AEE+ RK+SG+    SS+SGSAL +  
Subjt:  IRMDENASVNSHINEVTNLINQLASRNITFSDEVNAILLLTSLPESWETMKTTVSNSLGDKSLKFTEICDAAIAEEIYRKESGKESTSSSASGSALAV--

Query:  -----QKGKEKVEYDDRQHRNSRRNGNGEIECYYCHKKGHYKRHCRKLRGYE----------------------------KAGCKCKCS-----------
              +G  +     R    S+     ++EC+ C K GH+ R+C K +  E                             +G    C+           
Subjt:  -----QKGKEKVEYDDRQHRNSRRNGNGEIECYYCHKKGHYKRHCRKLRGYE----------------------------KAGCKCKCS-----------

Query:  CNYGMVRMGNGRLSKTRGIGNISLKTDSGTELVLRDVRYVPSIRMNLISAGKLDDDGYRSEFAGNMWKLTRGSELLVVGHRTSTV
         ++G+V + +G   K  GIG++ +KT +G+   L++VR+VP ++  LIS G+LDD G+   FAG MWK+++G+ +L  G +T T+
Subjt:  CNYGMVRMGNGRLSKTRGIGNISLKTDSGTELVLRDVRYVPSIRMNLISAGKLDDDGYRSEFAGNMWKLTRGSELLVVGHRTSTV

TKS09800.1 hypothetical protein D5086_0000089010 [Populus alba]1.7e-6338.96Show/hide
Query:  GIMRFDGANFGYWKMQIKDYLTCKKVH-KALKERPAGMKLEDWEDMNEQAVAIIRLCLSMNVASLVANETTAIGLMNALTNGYEKPSANNKVYLVRRYFN
        GI +FDG +FGYWKMQI+DYL  KK+H   L  +P  M+ E+W+ ++ Q + IIRL LS  VA  V  E +   LM AL+  YEKPSANNKV+L+++ FN
Subjt:  GIMRFDGANFGYWKMQIKDYLTCKKVH-KALKERPAGMKLEDWEDMNEQAVAIIRLCLSMNVASLVANETTAIGLMNALTNGYEKPSANNKVYLVRRYFN

Query:  IRMDENASVNSHINEVTNLINQLASRNITFSDEVNAILLLTSLPESWETMKTTVSNSLGDKSLKFTEICDAAIAEEIYRKESGKESTSSSASGSALAV--
        ++M EN SV  H+N    + NQL+S  I F DE+ A++LL SLP SWE M+T VSNS G   LK+ +I D  +AEE+ RK+SG+    SS+SGSAL +  
Subjt:  IRMDENASVNSHINEVTNLINQLASRNITFSDEVNAILLLTSLPESWETMKTTVSNSLGDKSLKFTEICDAAIAEEIYRKESGKESTSSSASGSALAV--

Query:  -----QKGKEKVEYDDRQHRNSRRNGNGEIECYYCHKKGHYKRHCRKLRGYE----------------------------KAGCKCKCS-----------
              +G  +     R    S+     ++EC+ C K GH+ R+C K +  E                             +G    C+           
Subjt:  -----QKGKEKVEYDDRQHRNSRRNGNGEIECYYCHKKGHYKRHCRKLRGYE----------------------------KAGCKCKCS-----------

Query:  CNYGMVRMGNGRLSKTRGIGNISLKTDSGTELVLRDVRYVPSIRMNLISAGKLDDDGYRSEFAGNMWKLTRGSELLVVGHRTSTV
         ++G+V + +G   K  GIG++ +KT +G+   L++VR+VP ++  LIS G+LDD G+   FAG MWK+++G+ +L  G +T T+
Subjt:  CNYGMVRMGNGRLSKTRGIGNISLKTDSGTELVLRDVRYVPSIRMNLISAGKLDDDGYRSEFAGNMWKLTRGSELLVVGHRTSTV

TrEMBL top hitse value%identityAlignment
A0A4U5PY83 CCHC-type domain-containing protein8.2e-6438.96Show/hide
Query:  GIMRFDGANFGYWKMQIKDYLTCKKVH-KALKERPAGMKLEDWEDMNEQAVAIIRLCLSMNVASLVANETTAIGLMNALTNGYEKPSANNKVYLVRRYFN
        GI +FDG +FGYWKMQI+DYL  KK+H   L  +P  M+ E+W+ ++ Q + IIRL LS  VA  V  E +   LM AL+  YEKPSANNKV+L+++ FN
Subjt:  GIMRFDGANFGYWKMQIKDYLTCKKVH-KALKERPAGMKLEDWEDMNEQAVAIIRLCLSMNVASLVANETTAIGLMNALTNGYEKPSANNKVYLVRRYFN

Query:  IRMDENASVNSHINEVTNLINQLASRNITFSDEVNAILLLTSLPESWETMKTTVSNSLGDKSLKFTEICDAAIAEEIYRKESGKESTSSSASGSALAV--
        ++M EN SV  H+N    + NQL+S  I F DE+ A++LL SLP SWE M+T VSNS G   LK+ +I D  +AEE+ RK+SG+    SS+SGSAL +  
Subjt:  IRMDENASVNSHINEVTNLINQLASRNITFSDEVNAILLLTSLPESWETMKTTVSNSLGDKSLKFTEICDAAIAEEIYRKESGKESTSSSASGSALAV--

Query:  -----QKGKEKVEYDDRQHRNSRRNGNGEIECYYCHKKGHYKRHCRKLRGYE----------------------------KAGCKCKCS-----------
              +G  +     R    S+     ++EC+ C K GH+ R+C K +  E                             +G    C+           
Subjt:  -----QKGKEKVEYDDRQHRNSRRNGNGEIECYYCHKKGHYKRHCRKLRGYE----------------------------KAGCKCKCS-----------

Query:  CNYGMVRMGNGRLSKTRGIGNISLKTDSGTELVLRDVRYVPSIRMNLISAGKLDDDGYRSEFAGNMWKLTRGSELLVVGHRTSTV
         ++G+V + +G   K  GIG++ +KT +G+   L++VR+VP ++  LIS G+LDD G+   FAG MWK+++G+ +L  G +T T+
Subjt:  CNYGMVRMGNGRLSKTRGIGNISLKTDSGTELVLRDVRYVPSIRMNLISAGKLDDDGYRSEFAGNMWKLTRGSELLVVGHRTSTV

A0A4U5QGR0 Uncharacterized protein8.2e-6438.96Show/hide
Query:  GIMRFDGANFGYWKMQIKDYLTCKKVH-KALKERPAGMKLEDWEDMNEQAVAIIRLCLSMNVASLVANETTAIGLMNALTNGYEKPSANNKVYLVRRYFN
        GI +FDG +FGYWKMQI+DYL  KK+H   L  +P  M+ E+W+ ++ Q + IIRL LS  VA  V  E +   LM AL+  YEKPSANNKV+L+++ FN
Subjt:  GIMRFDGANFGYWKMQIKDYLTCKKVH-KALKERPAGMKLEDWEDMNEQAVAIIRLCLSMNVASLVANETTAIGLMNALTNGYEKPSANNKVYLVRRYFN

Query:  IRMDENASVNSHINEVTNLINQLASRNITFSDEVNAILLLTSLPESWETMKTTVSNSLGDKSLKFTEICDAAIAEEIYRKESGKESTSSSASGSALAV--
        ++M EN SV  H+N    + NQL+S  I F DE+ A++LL SLP SWE M+T VSNS G   LK+ +I D  +AEE+ RK+SG+    SS+SGSAL +  
Subjt:  IRMDENASVNSHINEVTNLINQLASRNITFSDEVNAILLLTSLPESWETMKTTVSNSLGDKSLKFTEICDAAIAEEIYRKESGKESTSSSASGSALAV--

Query:  -----QKGKEKVEYDDRQHRNSRRNGNGEIECYYCHKKGHYKRHCRKLRGYE----------------------------KAGCKCKCS-----------
              +G  +     R    S+     ++EC+ C K GH+ R+C K +  E                             +G    C+           
Subjt:  -----QKGKEKVEYDDRQHRNSRRNGNGEIECYYCHKKGHYKRHCRKLRGYE----------------------------KAGCKCKCS-----------

Query:  CNYGMVRMGNGRLSKTRGIGNISLKTDSGTELVLRDVRYVPSIRMNLISAGKLDDDGYRSEFAGNMWKLTRGSELLVVGHRTSTV
         ++G+V + +G   K  GIG++ +KT +G+   L++VR+VP ++  LIS G+LDD G+   FAG MWK+++G+ +L  G +T T+
Subjt:  CNYGMVRMGNGRLSKTRGIGNISLKTDSGTELVLRDVRYVPSIRMNLISAGKLDDDGYRSEFAGNMWKLTRGSELLVVGHRTSTV

A0A6A2WXJ4 F-box family protein3.3e-6539.39Show/hide
Query:  IGEQSSFEG---IMRFDGANFGYWKMQIKDYLTCKKVHKALK-ERPAGMKLEDWEDMNEQAVAIIRLCLSMNVASLVANETTAIGLMNALTNGYEKPSAN
        +G  ++ EG   I +FDGA+FG+WKMQI+D+L  K +++ L  ++P GMK EDW  ++ QA+ +IRL LS NVA  +A E T  GLM AL++ YEKPSA+
Subjt:  IGEQSSFEG---IMRFDGANFGYWKMQIKDYLTCKKVHKALK-ERPAGMKLEDWEDMNEQAVAIIRLCLSMNVASLVANETTAIGLMNALTNGYEKPSAN

Query:  NKVYLVRRYFNIRMDENASVNSHINEVTNLINQLASRNITFSDEVNAILLLTSLPESWETMKTTVSNSLGDKSLKFTEICDAAIAEEIYRKESGKESTSS
        NKV+L+RR FN+RM E ASV  H+NE+  +  QL+S  I F DEV A++LL+SLP+SW    T VS+S G+  LKF ++ D  ++EEI R+ESG+ STSS
Subjt:  NKVYLVRRYFNIRMDENASVNSHINEVTNLINQLASRNITFSDEVNAILLLTSLPESWETMKTTVSNSLGDKSLKFTEICDAAIAEEIYRKESGKESTSS

Query:  SASGSALAVQKGKEKVEYDDRQHRNSRRNGNGEIECYYCHKKGHYKRHCR---KLRGYEKA-------------------------------GCKCK---
        +    +      +       +  R   R GN +  CY C KKGH+KR CR   K  G +++                                  C+   
Subjt:  SASGSALAVQKGKEKVEYDDRQHRNSRRNGNGEIECYYCHKKGHYKRHCR---KLRGYEKA-------------------------------GCKCK---

Query:  ---CSCNYGMVRMGNGRLSKTRGIGNISLKTDSGTELVLRDVRYVPSIRMNLISAGKLDDDGYRSEFAGNMWKLTRGSELLVVGHRTSTVY
            S ++G V + +    K  G G+I LK  + T   L  VR++P ++ NLIS G+LD +GY + F+G  WK+T+G+ ++  G +T T+Y
Subjt:  ---CSCNYGMVRMGNGRLSKTRGIGNISLKTDSGTELVLRDVRYVPSIRMNLISAGKLDDDGYRSEFAGNMWKLTRGSELLVVGHRTSTVY

A0A6A3BGE7 Uncharacterized protein1.3e-6440.21Show/hide
Query:  IMRFDGANFGYWKMQIKDYLTCKKVHKALK-ERPAGMKLEDWEDMNEQAVAIIRLCLSMNVASLVANETTAIGLMNALTNGYEKPSANNKVYLVRRYFNI
        I +FDGA+FG+WKMQI+D+L  K +++ L  ++P GMK EDW  ++ QA+ +IRL LS NVA  +A E T  GLM AL++ YEKPSA+NKV+L+RR FN+
Subjt:  IMRFDGANFGYWKMQIKDYLTCKKVHKALK-ERPAGMKLEDWEDMNEQAVAIIRLCLSMNVASLVANETTAIGLMNALTNGYEKPSANNKVYLVRRYFNI

Query:  RMDENASVNSHINEVTNLINQLASRNITFSDEVNAILLLTSLPESWETMKTTVSNSLGDKSLKFTEICDAAIAEEIYRKESGKESTSSSASGSALAVQKG
        RM E ASV  H+NE+  +  QL+S  I F DEV A++LL+SLP+SW    T VS+S G+  LKF ++ D  ++EEI R+ESG+ STSS+    +    +G
Subjt:  RMDENASVNSHINEVTNLINQLASRNITFSDEVNAILLLTSLPESWETMKTTVSNSLGDKSLKFTEICDAAIAEEIYRKESGKESTSSSASGSALAVQKG

Query:  KEKVEYDDRQHRNSR----RNGNGEIECYYCHKKGHYKRHCRKLR----------------------------------GYEKAGCKCK------CSCNY
        +    Y +R    SR    R G  +  CY C KKGH+KR CR L+                                  G       C+       S ++
Subjt:  KEKVEYDDRQHRNSR----RNGNGEIECYYCHKKGHYKRHCRKLR----------------------------------GYEKAGCKCK------CSCNY

Query:  GMVRMGNGRLSKTRGIGNISLKTDSGTELVLRDVRYVPSIRMNLISAGKLDDDGYRSEFAGNMWKLTRGSELLVVGHRTSTVY
        G V + +    K  G G+I LK  + T   L  VR++P ++ NLIS G+LD +GY + F+G  WK+T+G+ ++  G +T T+Y
Subjt:  GMVRMGNGRLSKTRGIGNISLKTDSGTELVLRDVRYVPSIRMNLISAGKLDDDGYRSEFAGNMWKLTRGSELLVVGHRTSTVY

A0A6A3BK95 Uncharacterized protein1.4e-6339.58Show/hide
Query:  IMRFDGANFGYWKMQIKDYLTCKKVHKALK-ERPAGMKLEDWEDMNEQAVAIIRLCLSMNVASLVANETTAIGLMNALTNGYEKPSANNKVYLVRRYFNI
        I +FDGA+FG+WKMQI+ +L  K +++ L  ++P GMK EDW  ++ QA+ +IRL LS NVA  +A E T  GLM AL++ YEKPSA+NKV+L+RR FN+
Subjt:  IMRFDGANFGYWKMQIKDYLTCKKVHKALK-ERPAGMKLEDWEDMNEQAVAIIRLCLSMNVASLVANETTAIGLMNALTNGYEKPSANNKVYLVRRYFNI

Query:  RMDENASVNSHINEVTNLINQLASRNITFSDEVNAILLLTSLPESWETMKTTVSNSLGDKSLKFTEICDAAIAEEIYRKESGKESTSSSASGSALAVQKG
        RM E ASV  H+NE+  +  QL+S  I F DEV A++LL+SLP+SW    T VS+S G+  LKF ++ D  ++EEI R+ESG+ STSS+    +      
Subjt:  RMDENASVNSHINEVTNLINQLASRNITFSDEVNAILLLTSLPESWETMKTTVSNSLGDKSLKFTEICDAAIAEEIYRKESGKESTSSSASGSALAVQKG

Query:  KEKVEYDDRQHRNSRRNGNGEIECYYCHKKGHYKRHCR---KLRGYEKA-------------------------------GCKCK------CSCNYGMVR
        +       +  R   R GN +  CY C KKGH+KR CR   K  G +++                                  C+       S ++G V 
Subjt:  KEKVEYDDRQHRNSRRNGNGEIECYYCHKKGHYKRHCR---KLRGYEKA-------------------------------GCKCK------CSCNYGMVR

Query:  MGNGRLSKTRGIGNISLKTDSGTELVLRDVRYVPSIRMNLISAGKLDDDGYRSEFAGNMWKLTRGSELLVVGHRTSTVY
        + +    K  G G+I LK  + T   L  VR++P ++ NLIS G+LD +GY + F+G  WK+T+G+ ++  G +T T+Y
Subjt:  MGNGRLSKTRGIGNISLKTDSGTELVLRDVRYVPSIRMNLISAGKLDDDGYRSEFAGNMWKLTRGSELLVVGHRTSTVY

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-943.7e-4530.71Show/hide
Query:  IMRFDGAN-FGYWKMQIKDYLTCKKVHKAL---KERPAGMKLEDWEDMNEQAVAIIRLCLSMNVASLVANETTAIGLMNALTNGYEKPSANNKVYLVRRY
        + +F+G N F  W+ +++D L  + +HK L    ++P  MK EDW D++E+A + IRL LS +V + + +E TA G+   L + Y   +  NK+YL ++ 
Subjt:  IMRFDGAN-FGYWKMQIKDYLTCKKVHKAL---KERPAGMKLEDWEDMNEQAVAIIRLCLSMNVASLVANETTAIGLMNALTNGYEKPSANNKVYLVRRY

Query:  FNIRMDENASVNSHINEVTNLINQLASRNITFSDEVNAILLLTSLPESWETMKTTVSNSLGDKSLKFTEICDAAIAEEIYRKESGKESTSSSASGSALAV
        + + M E  +  SH+N    LI QLA+  +   +E  AILLL SLP S++ + TT+ +  G  +++  ++  A +  E  RK+   +  +    G   + 
Subjt:  FNIRMDENASVNSHINEVTNLINQLASRNITFSDEVNAILLLTSLPESWETMKTTVSNSLGDKSLKFTEICDAAIAEEIYRKESGKESTSSSASGSALAV

Query:  QKGKEKVEYDDRQHRNSRRNGNGEIECYYCHKKGHYKRHC---RKLRGYEKAGCK---------------------------------------------
        Q+          + ++  R+ +    CY C++ GH+KR C   RK +G E +G K                                             
Subjt:  QKGKEKVEYDDRQHRNSRRNGNGEIECYYCHKKGHYKRHC---RKLRGYEKAGCK---------------------------------------------

Query:  -------CK-CSCNYGMVRMGNGRLSKTRGIGNISLKTDSGTELVLRDVRYVPSIRMNLISAGKLDDDGYRSEFAGNMWKLTRGSELLVVGHRTSTVYTV
               C+  + ++G V+MGN   SK  GIG+I +KT+ G  LVL+DVR+VP +RMNLIS   LD DGY S FA   W+LT+GS ++  G    T+Y  
Subjt:  -------CK-CSCNYGMVRMGNGRLSKTRGIGNISLKTDSGTELVLRDVRYVPSIRMNLISAGKLDDDGYRSEFAGNMWKLTRGSELLVVGHRTSTVYTV

Query:  RFDVAKG
          ++ +G
Subjt:  RFDVAKG

Arabidopsis top hitse value%identityAlignment
AT3G29785.1 unknown protein8.5e-1339.77Show/hide
Query:  RFDGANFGYWKMQIKDYLTCKKVHKALKERPAGMKLEDWEDMNEQAVAIIRLCLSMNVASLVANETTAIGLMNALTNGYEKPSANNKV
        + DG ++ + +M+I+DYL  KK+H+ L ++   M  +DW  +  Q + +IRL +S N+A  VA E +  GLM  L++ Y+KPS NN V
Subjt:  RFDGANFGYWKMQIKDYLTCKKVHKALKERPAGMKLEDWEDMNEQAVAIIRLCLSMNVASLVANETTAIGLMNALTNGYEKPSANNKV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAATTGGTGAGCAGTCAAGTTTCGAAGGAATTATGAGGTTCGATGGAGCCAACTTTGGGTATTGGAAAATGCAGATTAAGGATTACTTAACCTGCAAAAAGGTGCA
CAAAGCGTTGAAGGAACGACCAGCTGGAATGAAGCTTGAAGATTGGGAAGATATGAATGAACAGGCAGTTGCAATTATCAGGTTGTGTCTATCAATGAATGTAGCGAGTC
TCGTGGCAAATGAGACAACGGCGATAGGTCTGATGAATGCACTGACGAACGGGTATGAAAAACCATCTGCCAATAATAAGGTATATCTTGTTAGAAGATATTTTAACATT
CGTATGGATGAGAATGCTTCTGTGAATTCCCATATAAATGAAGTCACTAATTTAATCAATCAGTTAGCGTCTCGGAATATTACATTTAGTGATGAGGTGAATGCTATTTT
ATTGTTGACTTCTTTGCCTGAAAGTTGGGAAACAATGAAGACGACTGTGTCTAATTCATTAGGAGATAAGAGTTTAAAATTTACAGAAATTTGTGATGCTGCAATTGCTG
AGGAAATTTACAGGAAAGAGAGTGGAAAAGAATCCACTTCTAGTTCTGCATCTGGCTCAGCATTGGCTGTTCAGAAAGGTAAAGAAAAGGTTGAGTATGATGATAGACAA
CATAGAAATAGCAGGAGAAATGGGAATGGTGAGATTGAATGTTATTACTGCCACAAGAAGGGCCACTATAAGAGGCACTGTAGAAAATTGAGAGGATATGAAAAAGCTGG
ATGCAAATGCAAATGCAGTTGCAACTATGGCATGGTGAGGATGGGAAATGGTAGGCTCTCCAAGACTAGAGGAATTGGAAATATCAGTCTGAAGACCGATAGTGGGACTG
AGTTGGTTCTGCGAGATGTCAGGTATGTACCCAGCATCAGAATGAATTTAATATCTGCAGGGAAGTTGGATGACGATGGGTACAGAAGTGAGTTTGCTGGAAATATGTGG
AAACTCACTAGAGGATCCGAGTTGCTGGTTGTTGGCCACAGGACATCTACAGTGTACACAGTGAGGTTTGATGTTGCCAAAGGATCAGAGAGACAGAGGATGCATAGAGC
TGCAAATGGTTTAGATGGAGACGGGATAAAACCAACAGCATTGACAACCAAGATAGATCCATCAGTTCATGTTCAACAATTGGGAGACAAGGATAAGGAAAAGAAAAAGA
GTTTAGTTGGTTGTCAGGTGATAGCCCCAGTTGTTAGACGGTTCAACGAATTGATGAAGTCGCATAGGCGAAAAGATGCATCGAAGAGGAAAACTACAGTTGGTGCTGAG
GTCGAGGTTGAAGTCTCTAGCTTGGTAACAAACTTAAGTGGGAGTGTCAATTCATCGAAGAAGAATTCTTTCTTTGGGAGTCGTTGGTCACGATCGAAGAAGGAAGCGTT
GGGAGGTACCACTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGAATTGGTGAGCAGTCAAGTTTCGAAGGAATTATGAGGTTCGATGGAGCCAACTTTGGGTATTGGAAAATGCAGATTAAGGATTACTTAACCTGCAAAAAGGTGCA
CAAAGCGTTGAAGGAACGACCAGCTGGAATGAAGCTTGAAGATTGGGAAGATATGAATGAACAGGCAGTTGCAATTATCAGGTTGTGTCTATCAATGAATGTAGCGAGTC
TCGTGGCAAATGAGACAACGGCGATAGGTCTGATGAATGCACTGACGAACGGGTATGAAAAACCATCTGCCAATAATAAGGTATATCTTGTTAGAAGATATTTTAACATT
CGTATGGATGAGAATGCTTCTGTGAATTCCCATATAAATGAAGTCACTAATTTAATCAATCAGTTAGCGTCTCGGAATATTACATTTAGTGATGAGGTGAATGCTATTTT
ATTGTTGACTTCTTTGCCTGAAAGTTGGGAAACAATGAAGACGACTGTGTCTAATTCATTAGGAGATAAGAGTTTAAAATTTACAGAAATTTGTGATGCTGCAATTGCTG
AGGAAATTTACAGGAAAGAGAGTGGAAAAGAATCCACTTCTAGTTCTGCATCTGGCTCAGCATTGGCTGTTCAGAAAGGTAAAGAAAAGGTTGAGTATGATGATAGACAA
CATAGAAATAGCAGGAGAAATGGGAATGGTGAGATTGAATGTTATTACTGCCACAAGAAGGGCCACTATAAGAGGCACTGTAGAAAATTGAGAGGATATGAAAAAGCTGG
ATGCAAATGCAAATGCAGTTGCAACTATGGCATGGTGAGGATGGGAAATGGTAGGCTCTCCAAGACTAGAGGAATTGGAAATATCAGTCTGAAGACCGATAGTGGGACTG
AGTTGGTTCTGCGAGATGTCAGGTATGTACCCAGCATCAGAATGAATTTAATATCTGCAGGGAAGTTGGATGACGATGGGTACAGAAGTGAGTTTGCTGGAAATATGTGG
AAACTCACTAGAGGATCCGAGTTGCTGGTTGTTGGCCACAGGACATCTACAGTGTACACAGTGAGGTTTGATGTTGCCAAAGGATCAGAGAGACAGAGGATGCATAGAGC
TGCAAATGGTTTAGATGGAGACGGGATAAAACCAACAGCATTGACAACCAAGATAGATCCATCAGTTCATGTTCAACAATTGGGAGACAAGGATAAGGAAAAGAAAAAGA
GTTTAGTTGGTTGTCAGGTGATAGCCCCAGTTGTTAGACGGTTCAACGAATTGATGAAGTCGCATAGGCGAAAAGATGCATCGAAGAGGAAAACTACAGTTGGTGCTGAG
GTCGAGGTTGAAGTCTCTAGCTTGGTAACAAACTTAAGTGGGAGTGTCAATTCATCGAAGAAGAATTCTTTCTTTGGGAGTCGTTGGTCACGATCGAAGAAGGAAGCGTT
GGGAGGTACCACTTAG
Protein sequenceShow/hide protein sequence
MGIGEQSSFEGIMRFDGANFGYWKMQIKDYLTCKKVHKALKERPAGMKLEDWEDMNEQAVAIIRLCLSMNVASLVANETTAIGLMNALTNGYEKPSANNKVYLVRRYFNI
RMDENASVNSHINEVTNLINQLASRNITFSDEVNAILLLTSLPESWETMKTTVSNSLGDKSLKFTEICDAAIAEEIYRKESGKESTSSSASGSALAVQKGKEKVEYDDRQ
HRNSRRNGNGEIECYYCHKKGHYKRHCRKLRGYEKAGCKCKCSCNYGMVRMGNGRLSKTRGIGNISLKTDSGTELVLRDVRYVPSIRMNLISAGKLDDDGYRSEFAGNMW
KLTRGSELLVVGHRTSTVYTVRFDVAKGSERQRMHRAANGLDGDGIKPTALTTKIDPSVHVQQLGDKDKEKKKSLVGCQVIAPVVRRFNELMKSHRRKDASKRKTTVGAE
VEVEVSSLVTNLSGSVNSSKKNSFFGSRWSRSKKEALGGTT