CuGenDBv2

Moc03g01120 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc03g01120
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: chr3:812455..814239
RNA-Seq Expression: Moc03g01120
Synteny: Moc03g01120
Gene Ontology terms:
    GO:0003676 - nucleic acid binding (molecular function)
    GO:0008270 - zinc ion binding (molecular function)
    GO:0016740 - transferase activity (molecular function)
InterPro domains: IPR036875 - Zinc finger, CCHC-type superfamily
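
The genome location above follows a `chromosome:start..end` pattern. A minimal sketch of parsing it is given here; the names `Location` and `parse_location` are hypothetical, and treating the coordinates as 1-based and inclusive is an assumption that this page does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive (not stated on the page).
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'chrom:start..end' string such as 'chr3:812455..814239'."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end = match.groups()
    return Location(chrom, int(start), int(end))


loc = parse_location("chr3:812455..814239")
print(loc.chrom, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the locus spans 1,785 bp on chr3.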


Homology
GenBank top hits (e value, % identity, alignment)
KAF7121453.1 hypothetical protein RHSIM_Rhsim13G0116100 [Rhododendron simsii] (e value: 1.9e-65; identity: 40.32%)
Query:  DEDWVETNEQAVAFIRLCLSMNVASLVANETTTIDLMKALENRYEKPSANNKVYLVRRFFNIQMEKNTSVNSHINEVTKLINQLASVKITFSDEVNAIQL
        DEDW  TN +AVA IR  +  +V   VA ET   +L   LE  YE+ +A NK  L+RR  N++ +   SV  H+++   LINQL+++K+   +E+ A+ L
Subjt:  DEDWVETNEQAVAFIRLCLSMNVASLVANETTTIDLMKALENRYEKPSANNKVYLVRRFFNIQMEKNTSVNSHINEVTKLINQLASVKITFSDEVNAIQL

Query:  LTSLPESWEMMKTTMSNLLGDKSLKFSEICDAAITEEVRRKRSG----KESTFVQ--GSTLAVEKSKEKVASDDRQQKHSRWDWK--RDVECFHCHKKDH
        L+SLP+SWE +  T+SN      +    + D    EE RRK  G     E+  VQ  G +       ++  S DR +  SR   K   ++ECFHCHK  H
Subjt:  LTSLPESWEMMKTTMSNLLGDKSLKFSEICDAAITEEVRRKRSG----KESTFVQ--GSTLAVEKSKEKVASDDRQQKHSRWDWK--RDVECFHCHKKDH

Query:  IKKNCRMLKEDLKR-YMAESNAVVDNA---------LVCVESNTEIGNQSSEWVINSAASVYISSNRRLFTSFRRGNCGHVRMGNGKLSKTKGIGNIRLK
        ++K CR+L+++LK+  + ES+   D           +VC +    +  Q   WVI+S AS +++S R  F S+  G+ GHVRMGN  +SK  G+G++ L+
Subjt:  IKKNCRMLKEDLKR-YMAESNAVVDNA---------LVCVESNTEIGNQSSEWVINSAASVYISSNRRLFTSFRRGNCGHVRMGNGKLSKTKGIGNIRLK

Query:  TNSETKLLLRDVRYVPSIRMNLISTGKLDDDGYQSEFGGNQWKLTKGSKLVAVGHRKSTVYTSQLSVARGSL
        TN+  KLLL+DVR+VP IR+NLIS GKLDD+GY+++FG  +WKL+KGS +VA G + ST+Y  Q  +++G +
Subjt:  TNSETKLLLRDVRYVPSIRMNLISTGKLDDDGYQSEFGGNQWKLTKGSKLVAVGHRKSTVYTSQLSVARGSL

KAF7129225.1 hypothetical protein RHSIM_Rhsim10G0050800 [Rhododendron simsii] (e value: 4.1e-65; identity: 40.05%)
Query:  DEDWVETNEQAVAFIRLCLSMNVASLVANETTTIDLMKALENRYEKPSANNKVYLVRRFFNIQMEKNTSVNSHINEVTKLINQLASVKITFSDEVNAIQL
        DEDW  TN +AVA IR  +  +V   VA ET   +L   LE  YE+ +A NK  L+RR  N++ +   SV  H+++   LINQL+++K+   +E+ A+ L
Subjt:  DEDWVETNEQAVAFIRLCLSMNVASLVANETTTIDLMKALENRYEKPSANNKVYLVRRFFNIQMEKNTSVNSHINEVTKLINQLASVKITFSDEVNAIQL

Query:  LTSLPESWEMMKTTMSNLLGDKSLKFSEICDAAITEEVRRKRSG----KESTFVQ--GSTLAVEKSKEKVASDDRQQKHSRWDWK--RDVECFHCHKKDH
        L+SLP+SWE +  T+SN      +    + D    EE RRK  G     E+  VQ  G +       ++  S DR +  SR   K   ++ECFHCHK  H
Subjt:  LTSLPESWEMMKTTMSNLLGDKSLKFSEICDAAITEEVRRKRSG----KESTFVQ--GSTLAVEKSKEKVASDDRQQKHSRWDWK--RDVECFHCHKKDH

Query:  IKKNCRMLKEDLKRYMA-------ESNAVVDNA---LVCVESNTEIGNQSSEWVINSAASVYISSNRRLFTSFRRGNCGHVRMGNGKLSKTKGIGNIRLK
        ++K CR+L+++LK+          ++ AV  +    +VC +    +  Q   WVI+S AS +++S R  F S+  G+ GHVRMGN  +SK  G+G++ L+
Subjt:  IKKNCRMLKEDLKRYMA-------ESNAVVDNA---LVCVESNTEIGNQSSEWVINSAASVYISSNRRLFTSFRRGNCGHVRMGNGKLSKTKGIGNIRLK

Query:  TNSETKLLLRDVRYVPSIRMNLISTGKLDDDGYQSEFGGNQWKLTKGSKLVAVGHRKSTVYTSQLSVARGSL
        TN+  KLLL+DVR+VP+IR+NLIS GKLDD+GY+++FG  +WKL+KGS +VA G + ST+Y  Q  +++G +
Subjt:  TNSETKLLLRDVRYVPSIRMNLISTGKLDDDGYQSEFGGNQWKLTKGSKLVAVGHRKSTVYTSQLSVARGSL

KAF7129546.1 hypothetical protein RHSIM_Rhsim10G0154200 [Rhododendron simsii] (e value: 7.1e-65; identity: 39.47%)
Query:  DEDWVETNEQAVAFIRLCLSMNVASLVANETTTIDLMKALENRYEKPSANNKVYLVRRFFNIQMEKNTSVNSHINEVTKLINQLASVKITFSDEVNAIQL
        DEDW  TN +AVA IR  +  +V   VA ET   +L   LE  YE+ +A NK  L+RR  N++ +   SV  H+++   LINQL+++K+   +E+ A+ L
Subjt:  DEDWVETNEQAVAFIRLCLSMNVASLVANETTTIDLMKALENRYEKPSANNKVYLVRRFFNIQMEKNTSVNSHINEVTKLINQLASVKITFSDEVNAIQL

Query:  LTSLPESWEMMKTTMSNLLGDKSLKFSEICDAAITEEVRRKRSGKESTFVQGSTLAVE---KSKEKVASDDRQQKHSRWDWK--------RDVECFHCHK
        L+SLP+SWE +  T+SN      +    + D    EE RRK  G   T V    L V+   +S+ + +  DR +   R   K         ++ECFHCHK
Subjt:  LTSLPESWEMMKTTMSNLLGDKSLKFSEICDAAITEEVRRKRSGKESTFVQGSTLAVE---KSKEKVASDDRQQKHSRWDWK--------RDVECFHCHK

Query:  KDHIKKNCRMLKEDLKRYMA-------ESNAVVDNA---LVCVESNTEIGNQSSEWVINSAASVYISSNRRLFTSFRRGNCGHVRMGNGKLSKTKGIGNI
          H++K CR+L+++LK+          ++ AV  +    +VC +    +  Q   WVI+S AS +++S R  F S+  G+ GHVRMGN  +SK  G+G++
Subjt:  KDHIKKNCRMLKEDLKRYMA-------ESNAVVDNA---LVCVESNTEIGNQSSEWVINSAASVYISSNRRLFTSFRRGNCGHVRMGNGKLSKTKGIGNI

Query:  RLKTNSETKLLLRDVRYVPSIRMNLISTGKLDDDGYQSEFGGNQWKLTKGSKLVAVGHRKSTVYTSQLSVARGSL
         L+TN+  KLLL+DVR+VP IR+NLIS GKLDD+GY+++FG  +WKL+KGS +VA G + ST+Y  Q+ +++G +
Subjt:  RLKTNSETKLLLRDVRYVPSIRMNLISTGKLDDDGYQSEFGGNQWKLTKGSKLVAVGHRKSTVYTSQLSVARGSL

RVW14266.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera] (e value: 2.9e-66; identity: 39.67%)
Query:  MKDEDWVETNEQAVAFIRLCLSMNVASLVANETTTIDLMKALENRYEKPSANNKVYLVRRFFNIQMEKNTSVNSHINEVTKLINQLASVKITFSDEVNAI
        MKD++W + + +AV FIR  +  +V   V+ E +   L   LE+ Y++ +A NK +L R+  N + ++ T +  H+NE+  ++NQLA++KITF DE+ A+
Subjt:  MKDEDWVETNEQAVAFIRLCLSMNVASLVANETTTIDLMKALENRYEKPSANNKVYLVRRFFNIQMEKNTSVNSHINEVTKLINQLASVKITFSDEVNAI

Query:  QLLTSLPESWEMMKTTMSNLLGDKSLKFSEICDAAITEEVRRKRSGKESTFVQGSTLAVE-----KSKEKVASD-DRQQKHSRWDWKRDVECFHCHKKDH
         LL+SLPESWE +  T+SN   D  +  S++  + + EE RRK SG          L +E     +SK K + + D+ +  S    K+DVEC++CHKK H
Subjt:  QLLTSLPESWEMMKTTMSNLLGDKSLKFSEICDAAITEEVRRKRSGKESTFVQGSTLAVE-----KSKEKVASD-DRQQKHSRWDWKRDVECFHCHKKDH

Query:  IKKNCRMLK-------EDLKRYMAESNAVVDNALVCVESNTEIG--NQSSEWVINSAASVYISSNRRLFTSFRRGNCGHVRMGNGKLSKTKGIGNIRLKT
        +K+ CR LK       ++ ++   ++  V D  L+ +  +  +    Q ++WVI+S AS +++S    FTS+ +G+ G+VRMGN  +SK  G+G+I L+T
Subjt:  IKKNCRMLK-------EDLKRYMAESNAVVDNALVCVESNTEIG--NQSSEWVINSAASVYISSNRRLFTSFRRGNCGHVRMGNGKLSKTKGIGNIRLKT

Query:  NSETKLLLRDVRYVPSIRMNLISTGKLDDDGYQSEFGGNQWKLTKGSKLVAVGHRKSTVYTSQLSVAR
        N+  KLLLRDVR+VP IR+NLISTGKLDD+GY + F   +WKL+KGS +VA G +  ++YT Q  + +
Subjt:  NSETKLLLRDVRYVPSIRMNLISTGKLDDDGYQSEFGGNQWKLTKGSKLVAVGHRKSTVYTSQLSVAR

RVW84195.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera] (e value: 2.9e-66; identity: 39.67%)
Query:  MKDEDWVETNEQAVAFIRLCLSMNVASLVANETTTIDLMKALENRYEKPSANNKVYLVRRFFNIQMEKNTSVNSHINEVTKLINQLASVKITFSDEVNAI
        MKD++W + + +AV FIR  +  +V   V+ E +   L   LE+ Y++ +A NK +L R+  N + ++ T +  H+NE+  ++NQLA++KITF DE+ A+
Subjt:  MKDEDWVETNEQAVAFIRLCLSMNVASLVANETTTIDLMKALENRYEKPSANNKVYLVRRFFNIQMEKNTSVNSHINEVTKLINQLASVKITFSDEVNAI

Query:  QLLTSLPESWEMMKTTMSNLLGDKSLKFSEICDAAITEEVRRKRSGKESTFVQGSTLAVE-----KSKEKVASD-DRQQKHSRWDWKRDVECFHCHKKDH
         LL+SLPESWE +  T+SN   D  +  S++  + + EE RRK SG          L +E     +SK K + + D+ +  S    K+DVEC++CHKK H
Subjt:  QLLTSLPESWEMMKTTMSNLLGDKSLKFSEICDAAITEEVRRKRSGKESTFVQGSTLAVE-----KSKEKVASD-DRQQKHSRWDWKRDVECFHCHKKDH

Query:  IKKNCRMLK-------EDLKRYMAESNAVVDNALVCVESNTEIG--NQSSEWVINSAASVYISSNRRLFTSFRRGNCGHVRMGNGKLSKTKGIGNIRLKT
        +K+ CR LK       ++ ++   ++  V D  L+ +  +  +    Q ++WVI+S AS +++S    FTS+ +G+ G+VRMGN  +SK  G+G+I L+T
Subjt:  IKKNCRMLK-------EDLKRYMAESNAVVDNALVCVESNTEIG--NQSSEWVINSAASVYISSNRRLFTSFRRGNCGHVRMGNGKLSKTKGIGNIRLKT

Query:  NSETKLLLRDVRYVPSIRMNLISTGKLDDDGYQSEFGGNQWKLTKGSKLVAVGHRKSTVYTSQLSVAR
        N+  KLLLRDVR+VP IR+NLISTGKLDD+GY + F   +WKL+KGS +VA G +  ++YT Q  + +
Subjt:  NSETKLLLRDVRYVPSIRMNLISTGKLDDDGYQSEFGGNQWKLTKGSKLVAVGHRKSTVYTSQLSVAR

TrEMBL top hits (e value, % identity, alignment)
A0A438BTH6 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 1.4e-66; identity: 39.67%)
Query:  MKDEDWVETNEQAVAFIRLCLSMNVASLVANETTTIDLMKALENRYEKPSANNKVYLVRRFFNIQMEKNTSVNSHINEVTKLINQLASVKITFSDEVNAI
        MKD++W + + +AV FIR  +  +V   V+ E +   L   LE+ Y++ +A NK +L R+  N + ++ T +  H+NE+  ++NQLA++KITF DE+ A+
Subjt:  MKDEDWVETNEQAVAFIRLCLSMNVASLVANETTTIDLMKALENRYEKPSANNKVYLVRRFFNIQMEKNTSVNSHINEVTKLINQLASVKITFSDEVNAI

Query:  QLLTSLPESWEMMKTTMSNLLGDKSLKFSEICDAAITEEVRRKRSGKESTFVQGSTLAVE-----KSKEKVASD-DRQQKHSRWDWKRDVECFHCHKKDH
         LL+SLPESWE +  T+SN   D  +  S++  + + EE RRK SG          L +E     +SK K + + D+ +  S    K+DVEC++CHKK H
Subjt:  QLLTSLPESWEMMKTTMSNLLGDKSLKFSEICDAAITEEVRRKRSGKESTFVQGSTLAVE-----KSKEKVASD-DRQQKHSRWDWKRDVECFHCHKKDH

Query:  IKKNCRMLK-------EDLKRYMAESNAVVDNALVCVESNTEIG--NQSSEWVINSAASVYISSNRRLFTSFRRGNCGHVRMGNGKLSKTKGIGNIRLKT
        +K+ CR LK       ++ ++   ++  V D  L+ +  +  +    Q ++WVI+S AS +++S    FTS+ +G+ G+VRMGN  +SK  G+G+I L+T
Subjt:  IKKNCRMLK-------EDLKRYMAESNAVVDNALVCVESNTEIG--NQSSEWVINSAASVYISSNRRLFTSFRRGNCGHVRMGNGKLSKTKGIGNIRLKT

Query:  NSETKLLLRDVRYVPSIRMNLISTGKLDDDGYQSEFGGNQWKLTKGSKLVAVGHRKSTVYTSQLSVAR
        N+  KLLLRDVR+VP IR+NLISTGKLDD+GY + F   +WKL+KGS +VA G +  ++YT Q  + +
Subjt:  NSETKLLLRDVRYVPSIRMNLISTGKLDDDGYQSEFGGNQWKLTKGSKLVAVGHRKSTVYTSQLSVAR

A0A438HI91 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 1.4e-66; identity: 39.67%)
Query:  MKDEDWVETNEQAVAFIRLCLSMNVASLVANETTTIDLMKALENRYEKPSANNKVYLVRRFFNIQMEKNTSVNSHINEVTKLINQLASVKITFSDEVNAI
        MKD++W + + +AV FIR  +  +V   V+ E +   L   LE+ Y++ +A NK +L R+  N + ++ T +  H+NE+  ++NQLA++KITF DE+ A+
Subjt:  MKDEDWVETNEQAVAFIRLCLSMNVASLVANETTTIDLMKALENRYEKPSANNKVYLVRRFFNIQMEKNTSVNSHINEVTKLINQLASVKITFSDEVNAI

Query:  QLLTSLPESWEMMKTTMSNLLGDKSLKFSEICDAAITEEVRRKRSGKESTFVQGSTLAVE-----KSKEKVASD-DRQQKHSRWDWKRDVECFHCHKKDH
         LL+SLPESWE +  T+SN   D  +  S++  + + EE RRK SG          L +E     +SK K + + D+ +  S    K+DVEC++CHKK H
Subjt:  QLLTSLPESWEMMKTTMSNLLGDKSLKFSEICDAAITEEVRRKRSGKESTFVQGSTLAVE-----KSKEKVASD-DRQQKHSRWDWKRDVECFHCHKKDH

Query:  IKKNCRMLK-------EDLKRYMAESNAVVDNALVCVESNTEIG--NQSSEWVINSAASVYISSNRRLFTSFRRGNCGHVRMGNGKLSKTKGIGNIRLKT
        +K+ CR LK       ++ ++   ++  V D  L+ +  +  +    Q ++WVI+S AS +++S    FTS+ +G+ G+VRMGN  +SK  G+G+I L+T
Subjt:  IKKNCRMLK-------EDLKRYMAESNAVVDNALVCVESNTEIG--NQSSEWVINSAASVYISSNRRLFTSFRRGNCGHVRMGNGKLSKTKGIGNIRLKT

Query:  NSETKLLLRDVRYVPSIRMNLISTGKLDDDGYQSEFGGNQWKLTKGSKLVAVGHRKSTVYTSQLSVAR
        N+  KLLLRDVR+VP IR+NLISTGKLDD+GY + F   +WKL+KGS +VA G +  ++YT Q  + +
Subjt:  NSETKLLLRDVRYVPSIRMNLISTGKLDDDGYQSEFGGNQWKLTKGSKLVAVGHRKSTVYTSQLSVAR

A0A438IBT7 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 2.2e-59; identity: 37.77%)
Query:  MKDEDWVETNEQAVAFIRLCLSMNVASLVANETTTIDLMKALENRYEKPSANNKVYLVRRFFNIQMEKNTSVNSHINEVTKLINQLASVKITFSDEVNAI
        MKD++W + + +AV FIR                               +A NK +L R+  N + ++ T +  H+NE+  ++NQLA++KITF DE+ A+
Subjt:  MKDEDWVETNEQAVAFIRLCLSMNVASLVANETTTIDLMKALENRYEKPSANNKVYLVRRFFNIQMEKNTSVNSHINEVTKLINQLASVKITFSDEVNAI

Query:  QLLTSLPESWEMMKTTMSNLLGDKSLKFSEICDAAITEEVRRKRSGKESTFVQGSTLAVE-----KSKEKVASD-DRQQKHSRWDWKRDVECFHCHKKDH
         LL+SLPESWE +  T+SN   D  +  S++  + + EE RRK SG          L +E     KSK K + + D+ +  S    K+DVEC++CHKK H
Subjt:  QLLTSLPESWEMMKTTMSNLLGDKSLKFSEICDAAITEEVRRKRSGKESTFVQGSTLAVE-----KSKEKVASD-DRQQKHSRWDWKRDVECFHCHKKDH

Query:  IKKNCRMLK-------EDLKRYMAESNAVVDNALVCVESNTEIG--NQSSEWVINSAASVYISSNRRLFTSFRRGNCGHVRMGNGKLSKTKGIGNIRLKT
        +K+ CR LK       ++ ++   ++  V D  L+ +  +  +    Q ++WVI+S AS +++S    FTS+ +G+ G+VRMGN  +SK  G+G+I L+T
Subjt:  IKKNCRMLK-------EDLKRYMAESNAVVDNALVCVESNTEIG--NQSSEWVINSAASVYISSNRRLFTSFRRGNCGHVRMGNGKLSKTKGIGNIRLKT

Query:  NSETKLLLRDVRYVPSIRMNLISTGKLDDDGYQSEFGGNQWKLTKGSKLVAVGHRKSTVYTSQLSVAR
        N+  KLLLRDVR+VP IR+NLIS GKLDD+GY + F   +WKL+KGS +VA G +  ++YT Q  + +
Subjt:  NSETKLLLRDVRYVPSIRMNLISTGKLDDDGYQSEFGGNQWKLTKGSKLVAVGHRKSTVYTSQLSVAR

A0A6A2XKG3 Uncharacterized protein (e value: 3.1e-58; identity: 36.89%)
Query:  MKDEDWVETNEQAVAFIRLCLSMNVASLVANETTTIDLMKALENRYEKPSANNKVYLVRRFFNIQMEKNTSVNSHINEVTKLINQLASVKITFSDEVNAI
        MKDEDW   + QA+  IRL LS NVA  +A E TT  LM AL + YEKPSA+NKV+L+RR FN++M +  S+  H+NE+  +  QL+SV+I F DEV A+
Subjt:  MKDEDWVETNEQAVAFIRLCLSMNVASLVANETTTIDLMKALENRYEKPSANNKVYLVRRFFNIQMEKNTSVNSHINEVTKLINQLASVKITFSDEVNAI

Query:  QLLTSLPESWEMMKTTMSNLLGDKSLKFSEICDAAITEEVRRKRSGKESTFVQGSTLAVEKSKEKVASDDR--QQKHSRWDWKRDVECFHCHKKDHIKKN
         LL+SLP+SW    T +S+  G+  LKF ++ D  ++EE+RR+ SG+ ST     T +  ++ E+ ++  R   ++       +D  C++C K  H K++
Subjt:  QLLTSLPESWEMMKTTMSNLLGDKSLKFSEICDAAITEEVRRKRSGKESTFVQGSTLAVEKSKEKVASDDR--QQKHSRWDWKRDVECFHCHKKDHIKKN

Query:  CRMLKEDLKRYMAESNAVVDNALVCVESNTEIGNQSSEWVINSAASVYISSNRRLFTSFRRGNCGHVRMGNGKLSKTKGIGNIRLKTNSETKLLLRDVRY
        CR LK+D     + +        + +  N+ I      W+++S AS + +S + +  ++  G+ G V + + +  K  G G+IRLK  ++T   L  VR+
Subjt:  CRMLKEDLKRYMAESNAVVDNALVCVESNTEIGNQSSEWVINSAASVYISSNRRLFTSFRRGNCGHVRMGNGKLSKTKGIGNIRLKTNSETKLLLRDVRY

Query:  VPSIRMNLISTGKLDDDGYQSEFGGNQWKLTKGSKLVAVGHRKSTVYTSQLSVARGSLKQQMQVAD
        +P ++ NLIS G+LD +GY + F G +WK+TKG+ ++A G +  T+Y +       +L+  ++VAD
Subjt:  VPSIRMNLISTGKLDDDGYQSEFGGNQWKLTKGSKLVAVGHRKSTVYTSQLSVARGSLKQQMQVAD

A0A6A3BGE7 Uncharacterized protein (e value: 4.8e-59; identity: 38.11%)
Query:  MKDEDWVETNEQAVAFIRLCLSMNVASLVANETTTIDLMKALENRYEKPSANNKVYLVRRFFNIQMEKNTSVNSHINEVTKLINQLASVKITFSDEVNAI
        MKDEDW   + QA+  IRL LS NVA  +A E TT  LM AL + YEKPSA+NKV+L+RR FN++M +  SV  H+NE+  +  QL+SV+I F DEV A+
Subjt:  MKDEDWVETNEQAVAFIRLCLSMNVASLVANETTTIDLMKALENRYEKPSANNKVYLVRRFFNIQMEKNTSVNSHINEVTKLINQLASVKITFSDEVNAI

Query:  QLLTSLPESWEMMKTTMSNLLGDKSLKFSEICDAAITEEVRRKRSGKESTFVQGSTLAVEKSKEKVASDDR--QQKHSRWDWKRDVECFHCHKKDHIKKN
         LL+SLP+SW    T +S+  G+  LKF ++ D  ++EE+RR+ SG+ ST     T +  ++ E+ ++  R   ++      K+D  C++C KK H K++
Subjt:  QLLTSLPESWEMMKTTMSNLLGDKSLKFSEICDAAITEEVRRKRSGKESTFVQGSTLAVEKSKEKVASDDR--QQKHSRWDWKRDVECFHCHKKDHIKKN

Query:  CRMLKEDLKRYMAESNAVVDNALVCVESNTEIGNQSSEWVINSAASVYISSNRRLFTSFRRGNCGHVRMGNGKLSKTKGIGNIRLKTNSETKLLLRDVRY
        CR LK+D     + +        + +  N+ I      W+++S AS + +  + +  ++  G+ G V + + +  K  G G+IRLK  ++T   L  VR+
Subjt:  CRMLKEDLKRYMAESNAVVDNALVCVESNTEIGNQSSEWVINSAASVYISSNRRLFTSFRRGNCGHVRMGNGKLSKTKGIGNIRLKTNSETKLLLRDVRY

Query:  VPSIRMNLISTGKLDDDGYQSEFGGNQWKLTKGSKLVAVGHRKSTVYTS
        +P ++ NLIS G+LD +GY + F G +WK+TKG+ ++A G +  T+Y +
Subjt:  VPSIRMNLISTGKLDDDGYQSEFGGNQWKLTKGSKLVAVGHRKSTVYTS

SwissProt top hits (e value, % identity, alignment)
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 4.1e-47; identity: 32.79%)
Query:  MKDEDWVETNEQAVAFIRLCLSMNVASLVANETTTIDLMKALENRYEKPSANNKVYLVRRFFNIQMEKNTSVNSHINEVTKLINQLASVKITFSDEVNAI
        MK EDW + +E+A + IRL LS +V + + +E T   +   LE+ Y   +  NK+YL ++ + + M + T+  SH+N    LI QLA++ +   +E  AI
Subjt:  MKDEDWVETNEQAVAFIRLCLSMNVASLVANETTTIDLMKALENRYEKPSANNKVYLVRRFFNIQMEKNTSVNSHINEVTKLINQLASVKITFSDEVNAI

Query:  QLLTSLPESWEMMKTTMSNLLGDKSLKFSEICDAAITEEVRRKR--SGKESTFVQGSTLAVEKSKEKVASDDRQQKHSRWDWKRDVECFHCHKKDHIKKN
         LL SLP S++ + TT+  L G  +++  ++  A +  E  RK+  +  ++   +G   + ++S         + K       R   C++C++  H K++
Subjt:  QLLTSLPESWEMMKTTMSNLLGDKSLKFSEICDAAITEEVRRKR--SGKESTFVQGSTLAVEKSKEKVASDDRQQKHSRWDWKRDVECFHCHKKDHIKKN

Query:  C---RMLKEDLKRYMAESNAVV-----DNALVCVESNTE---IGNQSSEWVINSAASVYISSNRRLFTSFRRGNCGHVRMGNGKLSKTKGIGNIRLKTNS
        C   R  K +      + N        DN ++ +    E   +    SEWV+++AAS + +  R LF  +  G+ G V+MGN   SK  GIG+I +KTN 
Subjt:  C---RMLKEDLKRYMAESNAVV-----DNALVCVESNTE---IGNQSSEWVINSAASVYISSNRRLFTSFRRGNCGHVRMGNGKLSKTKGIGNIRLKTNS

Query:  ETKLLLRDVRYVPSIRMNLISTGKLDDDGYQSEFGGNQWKLTKGSKLVAVGHRKSTVYTSQLSVARGSL
           L+L+DVR+VP +RMNLIS   LD DGY+S F   +W+LTKGS ++A G  + T+Y +   + +G L
Subjt:  ETKLLLRDVRYVPSIRMNLISTGKLDDDGYQSEFGGNQWKLTKGSKLVAVGHRKSTVYTSQLSVARGSL

Arabidopsis top hits (e value, % identity, alignment)
AT3G21000.1 Gag-Pol-related retrotransposon family protein (e value: 2.6e-04; identity: 18.4%)
Query:  KDEDWVETNEQAVAFIRLCLSMNVASLVANETTTIDLMKALENRYEKPSANNKVY-----LVRRFFNIQMEKNTSVNSHINEVTKLINQLASVKITFSDE
        K  D+V  + +A+  ++  L+ +V     + ++  D+   L    E+ +           L ++  +++M    S +S++++  +++ +L   K+  SD 
Subjt:  KDEDWVETNEQAVAFIRLCLSMNVASLVANETTTIDLMKALENRYEKPSANNKVY-----LVRRFFNIQMEKNTSVNSHINEVTKLINQLASVKITFSDE

Query:  VNAIQLLTSLPESWEMMKTTMSNLLGDKSLKFSEICDAAITEEVRRKRSGKESTFVQGSTLAVEKSKEKVASDDRQQKHSRWDWKRDVECFHCHKKDHIK
             + T+L  S++ + + +  L+    +    + +         + S +E+ F     L ++   EK              W     C  C+K +H +
Subjt:  VNAIQLLTSLPESWEMMKTTMSNLLGDKSLKFSEICDAAITEEVRRKRSGKESTFVQGSTLAVEKSKEKVASDDRQQKHSRWDWKRDVECFHCHKKDHIK

Query:  KNCRMLKEDLKRYMAESNAVVDNALVCVESNTEIGNQSSEWVINSAASVYISSNRRLFTSFRRGNCGHVRMGNGKLSKTKGIGNIRLKTNSETKLLLRDV
        ++C+  +    +   E   VVD  L  V +          W+I+  A + ++   + FT+  R     V   +G +   +G G+++++     K  +R+V
Subjt:  KNCRMLKEDLKRYMAESNAVVDNALVCVESNTEIGNQSSEWVINSAASVYISSNRRLFTSFRRGNCGHVRMGNGKLSKTKGIGNIRLKTNSETKLLLRDV

Query:  RYVPSIRMNLISTGKLDDDGYQSEFG
         +VP +  N++S GK+    Y    G
Subjt:  RYVPSIRMNLISTGKLDDDGYQSEFG

AT3G29785.1 unknown protein (e value: 2.0e-04; identity: 43.64%)
Query:  MKDEDWVETNEQAVAFIRLCLSMNVASLVANETTTIDLMKALENRYEKPSANNKV
        M  +DW     Q +  IRL +S N+A  VA E +   LMK L + Y+KPS NN V
Subjt:  MKDEDWVETNEQAVAFIRLCLSMNVASLVANETTTIDLMKALENRYEKPSANNKV
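
The % identity values listed with each hit can be related to the alignment midline. The sketch below assumes the usual BLAST-style convention: the midline shows a letter for an identical residue, `+` for a conservative substitution, and a space otherwise, and identity is the number of identical columns divided by the number of alignment columns. The function name `percent_identity` is hypothetical. For multi-block alignments, the blocks would first have to be concatenated.

```python
def percent_identity(query: str, midline: str) -> float:
    """Return identical columns / alignment columns * 100, rounded to 2 places.

    Assumption: letters in the midline mark identical residues; '+' and
    spaces mark non-identical columns (BLAST-style convention).
    """
    identical = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identical / len(query), 2)


# Query and midline copied from the AT3G29785.1 alignment above.
query = "MKDEDWVETNEQAVAFIRLCLSMNVASLVANETTTIDLMKALENRYEKPSANNKV"
midline = "M  +DW     Q +  IRL +S N+A  VA E +   LMK L + Y+KPS NN V"
print(percent_identity(query, midline))  # 24 identical columns out of 55
```

For this hit the calculation gives 43.64, which agrees with the identity reported for AT3G29785.1.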


Sequences
CDS sequence
ATGAAAGATGAAGATTGGGTGGAGACGAATGAACAGGCAGTTGCCTTTATCAGATTATGTTTGTCGATGAATGTGGCAAGTCTCGTAGCAAACGAAACGACAACAATAGA
TTTGATGAAAGCACTGGAGAATAGGTACGAGAAACCCTCAGCTAATAATAAGGTATATCTCGTAAGGAGGTTCTTTAACATTCAAATGGAAAAGAATACTTCTGTCAATT
CCCACATAAATGAGGTCACAAAATTAATCAATCAATTGGCATCAGTTAAGATTACTTTTAGTGACGAAGTGAATGCTATTCAGTTGCTAACATCTTTACCTGAAAGTTGG
GAAATGATGAAGACAACAATGTCCAATTTGTTAGGAGACAAATCTTTGAAATTTTCAGAAATTTGTGATGCGGCAATTACTGAAGAAGTTCGCAGAAAGCGAAGTGGAAA
AGAATCTACTTTTGTACAAGGTTCAACATTGGCTGTCGAAAAAAGTAAAGAGAAGGTTGCGTCTGATGACAGGCAGCAGAAGCATAGTAGGTGGGATTGGAAGAGAGATG
TTGAATGTTTTCACTGCCACAAAAAAGACCACATCAAGAAAAACTGTAGAATGTTAAAAGAGGATCTGAAAAGGTATATGGCGGAGTCAAATGCAGTTGTAGACAATGCC
CTCGTCTGTGTTGAAAGCAACACAGAAATAGGAAACCAGTCATCAGAGTGGGTAATAAACAGCGCAGCTTCAGTATACATATCTTCAAATAGAAGATTATTCACATCTTT
CAGAAGAGGCAATTGCGGCCACGTGAGGATGGGGAATGGAAAACTTTCCAAGACCAAAGGGATTGGAAATATACGATTGAAGACCAATAGTGAGACTAAGTTATTACTAC
GAGATGTCAGGTATGTACCCAGTATCAGAATGAACCTTATCTCTACAGGTAAGTTGGACGATGATGGTTATCAAAGTGAGTTTGGTGGGAACCAGTGGAAGCTCACCAAA
GGATCCAAGTTGGTGGCAGTTGGCCATAGAAAATCTACAGTTTACACGTCGCAATTGAGTGTTGCCAGAGGATCATTGAAACAGCAGATGCAAGTTGCAGATGGTGTCCA
AAGAGGAAGGATTGAACCACCAACAAAGACAACCAGAACAGATCAGGAGAATCTGCCATCAATTCAGGAAGAACAATTGGGAAGTCAAAGAACGATAACATGTTCTTTAG
GTTGTCTGGGATTATCTCCAGTTCTCAGACGAATGGGTGAATTGATGAAGTCGCGAGAAGCCTTGAAGATTTTTTCAAGAAAATCACACGTTGTTGCAGTAGAAGCCTTG
GAGTTTTCCAAGATTATCATAGACTGTTGTTACAGTGGGAGGCAAGATCGATGTTTGGTCTCCCAGTGGGAGATTGTTGGGATAATGGAGCCAAAACAGATGATCCGTGT
CCTACGAGGAAAAGGATTTTCAAACACTACTATCAATTCCTACAAGGAGTAG
mRNA sequence
ATGAAAGATGAAGATTGGGTGGAGACGAATGAACAGGCAGTTGCCTTTATCAGATTATGTTTGTCGATGAATGTGGCAAGTCTCGTAGCAAACGAAACGACAACAATAGA
TTTGATGAAAGCACTGGAGAATAGGTACGAGAAACCCTCAGCTAATAATAAGGTATATCTCGTAAGGAGGTTCTTTAACATTCAAATGGAAAAGAATACTTCTGTCAATT
CCCACATAAATGAGGTCACAAAATTAATCAATCAATTGGCATCAGTTAAGATTACTTTTAGTGACGAAGTGAATGCTATTCAGTTGCTAACATCTTTACCTGAAAGTTGG
GAAATGATGAAGACAACAATGTCCAATTTGTTAGGAGACAAATCTTTGAAATTTTCAGAAATTTGTGATGCGGCAATTACTGAAGAAGTTCGCAGAAAGCGAAGTGGAAA
AGAATCTACTTTTGTACAAGGTTCAACATTGGCTGTCGAAAAAAGTAAAGAGAAGGTTGCGTCTGATGACAGGCAGCAGAAGCATAGTAGGTGGGATTGGAAGAGAGATG
TTGAATGTTTTCACTGCCACAAAAAAGACCACATCAAGAAAAACTGTAGAATGTTAAAAGAGGATCTGAAAAGGTATATGGCGGAGTCAAATGCAGTTGTAGACAATGCC
CTCGTCTGTGTTGAAAGCAACACAGAAATAGGAAACCAGTCATCAGAGTGGGTAATAAACAGCGCAGCTTCAGTATACATATCTTCAAATAGAAGATTATTCACATCTTT
CAGAAGAGGCAATTGCGGCCACGTGAGGATGGGGAATGGAAAACTTTCCAAGACCAAAGGGATTGGAAATATACGATTGAAGACCAATAGTGAGACTAAGTTATTACTAC
GAGATGTCAGGTATGTACCCAGTATCAGAATGAACCTTATCTCTACAGGTAAGTTGGACGATGATGGTTATCAAAGTGAGTTTGGTGGGAACCAGTGGAAGCTCACCAAA
GGATCCAAGTTGGTGGCAGTTGGCCATAGAAAATCTACAGTTTACACGTCGCAATTGAGTGTTGCCAGAGGATCATTGAAACAGCAGATGCAAGTTGCAGATGGTGTCCA
AAGAGGAAGGATTGAACCACCAACAAAGACAACCAGAACAGATCAGGAGAATCTGCCATCAATTCAGGAAGAACAATTGGGAAGTCAAAGAACGATAACATGTTCTTTAG
GTTGTCTGGGATTATCTCCAGTTCTCAGACGAATGGGTGAATTGATGAAGTCGCGAGAAGCCTTGAAGATTTTTTCAAGAAAATCACACGTTGTTGCAGTAGAAGCCTTG
GAGTTTTCCAAGATTATCATAGACTGTTGTTACAGTGGGAGGCAAGATCGATGTTTGGTCTCCCAGTGGGAGATTGTTGGGATAATGGAGCCAAAACAGATGATCCGTGT
CCTACGAGGAAAAGGATTTTCAAACACTACTATCAATTCCTACAAGGAGTAG
Protein sequence
MKDEDWVETNEQAVAFIRLCLSMNVASLVANETTTIDLMKALENRYEKPSANNKVYLVRRFFNIQMEKNTSVNSHINEVTKLINQLASVKITFSDEVNAIQLLTSLPESW
EMMKTTMSNLLGDKSLKFSEICDAAITEEVRRKRSGKESTFVQGSTLAVEKSKEKVASDDRQQKHSRWDWKRDVECFHCHKKDHIKKNCRMLKEDLKRYMAESNAVVDNA
LVCVESNTEIGNQSSEWVINSAASVYISSNRRLFTSFRRGNCGHVRMGNGKLSKTKGIGNIRLKTNSETKLLLRDVRYVPSIRMNLISTGKLDDDGYQSEFGGNQWKLTK
GSKLVAVGHRKSTVYTSQLSVARGSLKQQMQVADGVQRGRIEPPTKTTRTDQENLPSIQEEQLGSQRTITCSLGCLGLSPVLRRMGELMKSREALKIFSRKSHVVAVEAL
EFSKIIIDCCYSGRQDRCLVSQWEIVGIMEPKQMIRVLRGKGFSNTTINSYKE
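
The protein sequence can be checked against the CDS by translating it codon by codon. The sketch below assumes the standard genetic code (NCBI translation table 1); the names `CODON_TABLE` and `translate` are hypothetical. Only the first 27 and last 12 nucleotides of the CDS are used, to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGAAAGATGAAGATTGGGTGGAGACG"))  # first 27 nt -> MKDEDWVET
print(translate("TACAAGGAGTAG"))                 # last 12 nt  -> YKE
```

Both fragments match the start (`MKDEDWVET`) and end (`YKE`) of the protein sequence listed above.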