; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc09g09940 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc09g09940
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionCCHC-type domain-containing protein
Genome locationchr9:8409031..8414673
RNA-Seq ExpressionMoc09g09940
SyntenyMoc09g09940
Gene Ontology termsGO:0006259 - DNA metabolic process (biological process)
GO:0016310 - phosphorylation (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0016301 - kinase activity (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GFY97663.1 hypothetical protein Acr_12g0002040 [Actinidia rufa]6.5e-3537.15Show/hide
Query:  RGNQRHNPRYR-----EPNEYKMKVDLPVFYGKLDIEGFLEWIKNVESFFGYMNTPDHKKVDLVALKLKGGTSAWWEQLLLRNDNAGGS--RSIADYNEE
        RG   H+   R     EP++Y+MK+DL  F G L IEGFL+WI  VE FF YM  PD K+V LVA KL GG SAWW+ L         +  R+   Y EE
Subjt:  RGNQRHNPRYR-----EPNEYKMKVDLPVFYGKLDIEGFLEWIKNVESFFGYMNTPDHKKVDLVALKLKGGTSAWWEQLLLRNDNAGGS--RSIADYNEE

Query:  FHRLDVAGF--KKGTSTVDKTVN-QGVASSSKTKGCDDIKVGDQTKKLN--NTYGRPMLGKCFRCGQVGHLSNECPQRKTINFVE-------DGDN--LD
        F++L       +     + + VN   VA   +TK    +  G  T++    N Y +    KC++CG+  H SN C +R T+N VE        GDN  L 
Subjt:  FHRLDVAGF--KKGTSTVDKTVN-QGVASSSKTKGCDDIKVGDQTKKLN--NTYGRPMLGKCFRCGQVGHLSNECPQRKTINFVE-------DGDN--LD

Query:  HDVDINDEEDRLVLEPDEGDPNK-STGKPPFEVVYTKLPRLTIDITNLPSSIDVSLEAEQMAERIAKLHQEVRDHIEATNSKYKEAADSKRRPVHFQIGD
          VD+ D    L+  P + D +    GK    V Y    ++ +      S+  V  E  + +  +     EVR  +EA+N+KYK AAD+KRR   F  GD
Subjt:  HDVDINDEEDRLVLEPDEGDPNK-STGKPPFEVVYTKLPRLTIDITNLPSSIDVSLEAEQMAERIAKLHQEVRDHIEATNSKYKEAADSKRRPVHFQIGD

Query:  LVMIYLRKGRLHMGKYSKLNKKK
        LVM+YL K R   G Y+KL  KK
Subjt:  LVMIYLRKGRLHMGKYSKLNKKK

VFR02798.1 unnamed protein product [Cuscuta campestris]1.6e-3328.47Show/hide
Query:  NPRYREP--------NEYK----MKVDLPVFYGKLDIEGFLEWIKNVESFFGYMNTPDHKKVDLVALKLKGGTSAWWEQLLLRNDNAG------------
        NP +REP        ++Y+     KV++P F G L  + F++W+  V+  F Y + P+  KV LVA+KL G  SAWWEQ+    +  G            
Subjt:  NPRYREP--------NEYK----MKVDLPVFYGKLDIEGFLEWIKNVESFFGYMNTPDHKKVDLVALKLKGGTSAWWEQLLLRNDNAG------------

Query:  ------------------------GSRSIADYNEEFHRLD------------VAGFKKGTSTVDKTVNQGVASSSKTKGCDDIKVGDQTKKLNNTYGRPM
                                G+RS+ DY EEF++L             VA +  G   ++K   +   ++++     + ++   T+ +  T G   
Subjt:  ------------------------GSRSIADYNEEFHRLD------------VAGFKKGTSTVDKTVNQGVASSSKTKGCDDIKVGDQTKKLNNTYGRPM

Query:  LG-KCFRCGQVGHLSNECP----QRKTINFVEDG-----------------DNLDHDVDINDEEDRLVLE------PDEGDPNKSTGKPPFEVVYTKLPR
           KC+ CG+  H +NEC     Q+    FVE+                  D  + DV   D  + LV+        D+   + +TG  PF++V  + P 
Subjt:  LG-KCFRCGQVGHLSNECP----QRKTINFVEDG-----------------DNLDHDVDINDEEDRLVLE------PDEGDPNKSTGKPPFEVVYTKLPR

Query:  LTIDITNLPSSIDVSLEAEQMAERIAKLHQEVRDHIEATNSKYKEAADSKRRPVHFQIGDLVMIYLRKGRLHMGKYSKLNKKKIGPFPIQEKCGDNAYEI
          +D+  +         +E+M E +  +H +VR  IE +N+KYK   D  RR V F +GD V   L K R  +G+Y KL  +KIGP  I +K  DNAY +
Subjt:  LTIDITNLPSSIDVSLEAEQMAERIAKLHQEVRDHIEATNSKYKEAADSKRRPVHFQIGDLVMIYLRKGRLHMGKYSKLNKKKIGPFPIQEKCGDNAYEI

Query:  QLPSTISTQSL
        QLPS + T  +
Subjt:  QLPSTISTQSL

XP_031741035.1 uncharacterized protein LOC116403692 [Cucumis sativus]1.2e-3331.77Show/hide
Query:  IDNLQEHQDQPRKRNGKDIHGQGIVTGRGEQQAVPRMYQEPLPFQDDEYPEWEMN-----RRGNQRHNPRYREPNEYKMKVDLPVFYGKLDIEGFLEWIK
        + N+   +   R+R G  I  Q     + +Q+A     ++     D++   W  N      R NQR   R  E ++YKMK+DLP++ GK +IE FL+WIK
Subjt:  IDNLQEHQDQPRKRNGKDIHGQGIVTGRGEQQAVPRMYQEPLPFQDDEYPEWEMN-----RRGNQRHNPRYREPNEYKMKVDLPVFYGKLDIEGFLEWIK

Query:  NVESFFGYMNTPDHKKVDLVALKLKGGTSAWWEQLLLRNDNAG------------------------------------GSRSIADYNEEFHRL------
        + E+FF YM+TP+ KKV LVALKL+ G SAWW+QL +     G                                    G RS+A+Y EEFHRL      
Subjt:  NVESFFGYMNTPDHKKVDLVALKLKGGTSAWWEQLLLRNDNAG------------------------------------GSRSIADYNEEFHRL------

Query:  ------DVAGFKKG---------------------------------------------TSTVDKTVNQGVASSSKTKG--CDDIKVGDQTKKL------
               VA F  G                                             T++     N   ++S+K KG   D+ +V  + KK       
Subjt:  ------DVAGFKKG---------------------------------------------TSTVDKTVNQGVASSSKTKG--CDDIKVGDQTKKL------

Query:  -NNTYGRPMLGKCFRCGQVGHLSNECPQRKTINFVEDGDNLDHDVDINDEEDRLVLEPDEGD
          N+Y RP LGKCFRCGQ GHLS+ CPQRKTI   E+G  +  D  I  EE+  ++E D+G+
Subjt:  -NNTYGRPMLGKCFRCGQVGHLSNECPQRKTINFVEDGDNLDHDVDINDEEDRLVLEPDEGD

XP_031743026.1 uncharacterized protein LOC116404533 [Cucumis sativus]4.7e-3331.49Show/hide
Query:  IDNLQEHQDQPRKRNGKDIHGQGIVTGRGEQQAVPRMYQEPLPFQDDEYPEWEMN-----RRGNQRHNPRYREPNEYKMKVDLPVFYGKLDIEGFLEWIK
        + N+   +   R+R G  I  Q     + +Q+      ++     D++   W  N      R N+R   R  E ++YKMK+DLP++YGK +IE FL+WIK
Subjt:  IDNLQEHQDQPRKRNGKDIHGQGIVTGRGEQQAVPRMYQEPLPFQDDEYPEWEMN-----RRGNQRHNPRYREPNEYKMKVDLPVFYGKLDIEGFLEWIK

Query:  NVESFFGYMNTPDHKKVDLVALKLKGGTSAWWEQLLLRNDNAG------------------------------------GSRSIADYNEEFHRL------
        + E+FF YM+TP+ KKV LVALKL+ G SAWW+QL +     G                                    G R++A+Y EEFHRL      
Subjt:  NVESFFGYMNTPDHKKVDLVALKLKGGTSAWWEQLLLRNDNAG------------------------------------GSRSIADYNEEFHRL------

Query:  ------DVAGFKKG---------------------------------------------TSTVDKTVNQGVASSSKTKG--CDDIKVGDQTKKL------
               VA F  G                                             T++     N   ++S+K KG   D+ +V  + KK       
Subjt:  ------DVAGFKKG---------------------------------------------TSTVDKTVNQGVASSSKTKG--CDDIKVGDQTKKL------

Query:  -NNTYGRPMLGKCFRCGQVGHLSNECPQRKTINFVEDGDNLDHDVDINDEEDRLVLEPDEGD
          N Y RP LGKCFRCGQ GHLSN CPQRKTI   E+G     D  I  EE+  ++E D+G+
Subjt:  -NNTYGRPMLGKCFRCGQVGHLSNECPQRKTINFVEDGDNLDHDVDINDEEDRLVLEPDEGD

XP_031744062.1 uncharacterized protein LOC116404773 [Cucumis sativus]1.0e-3537.76Show/hide
Query:  WEMN-----RRGNQRHNPRYREPNEYKMKVDLPVFYGKLDIEGFLEWIKNVESFFGYMNTPDHKKVDLVALKLKGGTSAWWEQLLLRNDNAG--------
        W +N      R N+R   R  E ++YKMK+DLP++ GK +IE FL+WIK+ E+FF YM+TP+ KKV LVALKL+ G SAWW+QL +     G        
Subjt:  WEMN-----RRGNQRHNPRYREPNEYKMKVDLPVFYGKLDIEGFLEWIKNVESFFGYMNTPDHKKVDLVALKLKGGTSAWWEQLLLRNDNAG--------

Query:  ----------------------------GSRSIADYNEEFHRL------------DVAGFK---------------------KGTSTVDKTVNQ-GVASS
                                    G RS+ADY EEFHRL             VA F                      + TST  KT +Q   ++ 
Subjt:  ----------------------------GSRSIADYNEEFHRL------------DVAGFK---------------------KGTSTVDKTVNQ-GVASS

Query:  SKTKGCDDIKVGDQTKKL-------NNTYGRPMLGKCFRCGQVGHLSNECPQRKTINFVEDGDNLDHDVDINDEEDRLVLEPDEGD
         K K  D+ +V  + KK         N+Y RP LGKCFRCGQ GHLSN CPQRKTI   E+G     D  I  EE+  ++E D+G+
Subjt:  SKTKGCDDIKVGDQTKKL-------NNTYGRPMLGKCFRCGQVGHLSNECPQRKTINFVEDGDNLDHDVDINDEEDRLVLEPDEGD

TrEMBL top hitse value%identityAlignment
A0A484NRH7 CCHC-type domain-containing protein7.8e-3428.47Show/hide
Query:  NPRYREP--------NEYK----MKVDLPVFYGKLDIEGFLEWIKNVESFFGYMNTPDHKKVDLVALKLKGGTSAWWEQLLLRNDNAG------------
        NP +REP        ++Y+     KV++P F G L  + F++W+  V+  F Y + P+  KV LVA+KL G  SAWWEQ+    +  G            
Subjt:  NPRYREP--------NEYK----MKVDLPVFYGKLDIEGFLEWIKNVESFFGYMNTPDHKKVDLVALKLKGGTSAWWEQLLLRNDNAG------------

Query:  ------------------------GSRSIADYNEEFHRLD------------VAGFKKGTSTVDKTVNQGVASSSKTKGCDDIKVGDQTKKLNNTYGRPM
                                G+RS+ DY EEF++L             VA +  G   ++K   +   ++++     + ++   T+ +  T G   
Subjt:  ------------------------GSRSIADYNEEFHRLD------------VAGFKKGTSTVDKTVNQGVASSSKTKGCDDIKVGDQTKKLNNTYGRPM

Query:  LG-KCFRCGQVGHLSNECP----QRKTINFVEDG-----------------DNLDHDVDINDEEDRLVLE------PDEGDPNKSTGKPPFEVVYTKLPR
           KC+ CG+  H +NEC     Q+    FVE+                  D  + DV   D  + LV+        D+   + +TG  PF++V  + P 
Subjt:  LG-KCFRCGQVGHLSNECP----QRKTINFVEDG-----------------DNLDHDVDINDEEDRLVLE------PDEGDPNKSTGKPPFEVVYTKLPR

Query:  LTIDITNLPSSIDVSLEAEQMAERIAKLHQEVRDHIEATNSKYKEAADSKRRPVHFQIGDLVMIYLRKGRLHMGKYSKLNKKKIGPFPIQEKCGDNAYEI
          +D+  +         +E+M E +  +H +VR  IE +N+KYK   D  RR V F +GD V   L K R  +G+Y KL  +KIGP  I +K  DNAY +
Subjt:  LTIDITNLPSSIDVSLEAEQMAERIAKLHQEVRDHIEATNSKYKEAADSKRRPVHFQIGDLVMIYLRKGRLHMGKYSKLNKKKIGPFPIQEKCGDNAYEI

Query:  QLPSTISTQSL
        QLPS + T  +
Subjt:  QLPSTISTQSL

A0A5N5JVJ8 CCHC-type domain-containing protein5.6e-3226.94Show/hide
Query:  MQPSSLHDHRPIVEGAVENLQRNVVEIKQILSVLVDKIDNLQEHQDQPRKRNGKDIHGQGIVTGRGEQQAVPRMYQEPLPFQDDEYPEWEMNRRGNQRHN
        M P     +R  VE A  N +  V +++Q ++++ +++  L  +Q+     +G + +  G   G  E+       Q  +P    + PE + NRR      
Subjt:  MQPSSLHDHRPIVEGAVENLQRNVVEIKQILSVLVDKIDNLQEHQDQPRKRNGKDIHGQGIVTGRGEQQAVPRMYQEPLPFQDDEYPEWEMNRRGNQRHN

Query:  PRYREPNEYKMKVDLPVFYGKLDIEGFLEWIKNVESFFGYMNTPDHKKVDLVALKLKGGTSAWWEQLLLRNDNAGGSRSIADY----NEEFHRLDVAG-F
               E  M+ ++P F G L  E FL+W+  VE    + N P   +V LVA  L+G  +AWW+QL +     G ++ I D+     +   R  + G F
Subjt:  PRYREPNEYKMKVDLPVFYGKLDIEGFLEWIKNVESFFGYMNTPDHKKVDLVALKLKGGTSAWWEQLLLRNDNAGGSRSIADY----NEEFHRLDVAG-F

Query:  KKGTSTVDKTVNQGVASSSKTKGCDDIKVG-DQTKKLNNTYGRPMLG-KCFRCGQVGHLSNECPQ-RKTINFVEDGDNLDHDVDIN-----DEEDRLVLE
         +G+S +   +    +SSS          G       +N   R + G KCF CG+VGH  ++C +  K   F +  +  ++D  I      DE++ ++ +
Subjt:  KKGTSTVDKTVNQGVASSSKTKGCDDIKVG-DQTKKLNNTYGRPMLG-KCFRCGQVGHLSNECPQ-RKTINFVEDGDNLDHDVDIN-----DEEDRLVLE

Query:  PDEGDP----------------------------------------------------------NKSTGKPPFEVVYTKLPRLTIDITNLPSSIDVSLEA
          EGD                                                           N+STG  PF++VY+ +PR  +D+  LP +  +  + 
Subjt:  PDEGDP----------------------------------------------------------NKSTGKPPFEVVYTKLPRLTIDITNLPSSIDVSLEA

Query:  EQMAERIAKLHQEVRDHIEATNSKYKEAADSKRRPVHFQIGDLVMIYLRKGRLHMGKYSKLNKKKIGPFPIQEKCGDNAYEIQLPSTIST
              +  +H +V +++E T  KYK AAD KRR + F +GD V   L K R  +G+Y+KL  KKIGP  + EK   NAY ++LPS I T
Subjt:  EQMAERIAKLHQEVRDHIEATNSKYKEAADSKRRPVHFQIGDLVMIYLRKGRLHMGKYSKLNKKKIGPFPIQEKCGDNAYEIQLPSTIST

A0A6J1CCQ8 uncharacterized protein LOC111009540 isoform X21.2e-3129.44Show/hide
Query:  MQPSSLHDHRPIVEGAVENLQRNVVEIKQI-------LSVLVDKIDNLQ---EHQDQ-------------PRKRNGKDIHGQGIV-------TGRGEQQA
        + P S  +   ++E ++  ++ NV  I  +       L ++ D+ + LQ   +H D+             PR  +G++   Q +        T  G++  
Subjt:  MQPSSLHDHRPIVEGAVENLQRNVVEIKQI-------LSVLVDKIDNLQ---EHQDQ-------------PRKRNGKDIHGQGIV-------TGRGEQQA

Query:  VPRMYQEPLPFQDDEYPEWEMNRRGNQRHNPRYREPNEYKMKVDLPVFYGKLDIEGFLEWIKNVESFFGYMNTPDHKKVDLVALKLKGGTSAWWEQLLLR
           + Q P   +++ Y E     RG  R         ++KMK+DLP F GK+D+E FL+ +KNVE+FF Y NTP+ KKV LVA K++ G SAWW+QL + 
Subjt:  VPRMYQEPLPFQDDEYPEWEMNRRGNQRHNPRYREPNEYKMKVDLPVFYGKLDIEGFLEWIKNVESFFGYMNTPDHKKVDLVALKLKGGTSAWWEQLLLR

Query:  NDNAG------------------------------------GSRSIADYNEEFHRL------------DVAGFKKG------------------------
            G                                    G ++IADY E FHRL             +A F  G                        
Subjt:  NDNAG------------------------------------GSRSIADYNEEFHRL------------DVAGFKKG------------------------

Query:  -------------------------TSTVD--KTVNQGVASSSKTKGCDD------IKVGD-QTKKLNNTYGRPMLGKCFRCGQVGHLSNECPQRKTINF
                                 T+T D  K +  G  S+S TK  DD       K  D  +K+  N Y RP LGKCFRCGQV HLSNECPQR+ +  
Subjt:  -------------------------TSTVD--KTVNQGVASSSKTKGCDD------IKVGD-QTKKLNNTYGRPMLGKCFRCGQVGHLSNECPQRKTINF

Query:  VEDGDNLDHDVDINDEEDRLVLEPDEGD
        V+  D L+ D+D+  E+D   +EPDEGD
Subjt:  VEDGDNLDHDVDINDEEDRLVLEPDEGD

A0A7J0FGD7 CCHC-type domain-containing protein3.2e-3537.15Show/hide
Query:  RGNQRHNPRYR-----EPNEYKMKVDLPVFYGKLDIEGFLEWIKNVESFFGYMNTPDHKKVDLVALKLKGGTSAWWEQLLLRNDNAGGS--RSIADYNEE
        RG   H+   R     EP++Y+MK+DL  F G L IEGFL+WI  VE FF YM  PD K+V LVA KL GG SAWW+ L         +  R+   Y EE
Subjt:  RGNQRHNPRYR-----EPNEYKMKVDLPVFYGKLDIEGFLEWIKNVESFFGYMNTPDHKKVDLVALKLKGGTSAWWEQLLLRNDNAGGS--RSIADYNEE

Query:  FHRLDVAGF--KKGTSTVDKTVN-QGVASSSKTKGCDDIKVGDQTKKLN--NTYGRPMLGKCFRCGQVGHLSNECPQRKTINFVE-------DGDN--LD
        F++L       +     + + VN   VA   +TK    +  G  T++    N Y +    KC++CG+  H SN C +R T+N VE        GDN  L 
Subjt:  FHRLDVAGF--KKGTSTVDKTVN-QGVASSSKTKGCDDIKVGDQTKKLN--NTYGRPMLGKCFRCGQVGHLSNECPQRKTINFVE-------DGDN--LD

Query:  HDVDINDEEDRLVLEPDEGDPNK-STGKPPFEVVYTKLPRLTIDITNLPSSIDVSLEAEQMAERIAKLHQEVRDHIEATNSKYKEAADSKRRPVHFQIGD
          VD+ D    L+  P + D +    GK    V Y    ++ +      S+  V  E  + +  +     EVR  +EA+N+KYK AAD+KRR   F  GD
Subjt:  HDVDINDEEDRLVLEPDEGDPNK-STGKPPFEVVYTKLPRLTIDITNLPSSIDVSLEAEQMAERIAKLHQEVRDHIEATNSKYKEAADSKRRPVHFQIGD

Query:  LVMIYLRKGRLHMGKYSKLNKKK
        LVM+YL K R   G Y+KL  KK
Subjt:  LVMIYLRKGRLHMGKYSKLNKKK

A0A7J0H1N6 CCHC-type domain-containing protein1.9e-3228.51Show/hide
Query:  RGNQRHNPRYR-----EPNEYKMKVDLPVFYGKLDIEGFLEWIKNVESFFGYMNTPDHKKVDLVALKLKGGTSAWWEQLLLRNDNAGGSRSIADYNEEFH
        RG   H+   R     E  +Y+MK+DLP F G L I+GFL+WI  VE FF YM  PD K++ LVA KLKGG SAW            G R+   Y EEF+
Subjt:  RGNQRHNPRYR-----EPNEYKMKVDLPVFYGKLDIEGFLEWIKNVESFFGYMNTPDHKKVDLVALKLKGGTSAWWEQLLLRNDNAGGSRSIADYNEEFH

Query:  RL------------DVAGFKKGTSTVDKTVNQGVASS-SKTKGCDDIKVGDQTKKLN--NTYGRPMLGKCFRCGQVGHLSNECPQRKTINFVE------D
        RL             +A F  G     +  +Q V +S S+TK    +  G  T++    N Y +    KC+RCG+ GH SN CP+R T+N VE      D
Subjt:  RL------------DVAGFKKGTSTVDKTVNQGVASS-SKTKGCDDIKVGDQTKKLN--NTYGRPMLGKCFRCGQVGHLSNECPQRKTINFVE------D

Query:  GDNLDHDVD--------INDEED-----------RLVLEPDEGDP----------------------NKSTGK-------------------PPFEVVYT
        G + + + D          D+E+           +L+L P   D                       N  +G+                    P+ + + 
Subjt:  GDNLDHDVD--------INDEED-----------RLVLEPDEGDP----------------------NKSTGK-------------------PPFEVVYT

Query:  K----------------LPRLTID---------------------------------------------ITNLPSSIDVSLEAEQMAERIAKLH--QEVR
        K                + +  +D                                             ++ L  S    +  E+    +  +H   EVR
Subjt:  K----------------LPRLTID---------------------------------------------ITNLPSSIDVSLEAEQMAERIAKLH--QEVR

Query:  DHIEATNSKYKEAADSKRRPVHFQIGDLVMIYLRKGRLHMGKYSKLNKKKIGPFPIQEKCGDNAYEIQLPSTISTQS
          +EA+N+KYK  AD+KRR   F  G+LVM+YLRK R   G Y+KL  KK GPF I +K  +NAY + LP+ +   S
Subjt:  DHIEATNSKYKEAADSKRRPVHFQIGDLVMIYLRKGRLHMGKYSKLNKKKIGPFPIQEKCGDNAYEIQLPSTISTQS

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein1.9e-0834.92Show/hide
Query:  NKSTGKPPFEVVYTKLPRLTIDITNLPSSIDVSLEAEQMAERIAKLHQEVRDHIEATNSKYKEAADSKRRPV-HFQIGDLVMIYLRK-GRLHMGKYSKLN
        + +T   PFE+V+   P L+     LPS  D   + ++ ++   ++ Q V++H+   N K K+  D K + +  FQ GDLVM+   K G LH  K +KL 
Subjt:  NKSTGKPPFEVVYTKLPRLTIDITNLPSSIDVSLEAEQMAERIAKLHQEVRDHIEATNSKYKEAADSKRRPV-HFQIGDLVMIYLRK-GRLHMGKYSKLN

Query:  KKKIGPFPIQEKCGDNAYEIQLPSTI
            GPF + +K G N YE+ LP +I
Subjt:  KKKIGPFPIQEKCGDNAYEIQLPSTI

P0CT35 Transposon Tf2-2 polyprotein1.9e-0834.92Show/hide
Query:  NKSTGKPPFEVVYTKLPRLTIDITNLPSSIDVSLEAEQMAERIAKLHQEVRDHIEATNSKYKEAADSKRRPV-HFQIGDLVMIYLRK-GRLHMGKYSKLN
        + +T   PFE+V+   P L+     LPS  D   + ++ ++   ++ Q V++H+   N K K+  D K + +  FQ GDLVM+   K G LH  K +KL 
Subjt:  NKSTGKPPFEVVYTKLPRLTIDITNLPSSIDVSLEAEQMAERIAKLHQEVRDHIEATNSKYKEAADSKRRPV-HFQIGDLVMIYLRK-GRLHMGKYSKLN

Query:  KKKIGPFPIQEKCGDNAYEIQLPSTI
            GPF + +K G N YE+ LP +I
Subjt:  KKKIGPFPIQEKCGDNAYEIQLPSTI

P0CT36 Transposon Tf2-3 polyprotein1.9e-0834.92Show/hide
Query:  NKSTGKPPFEVVYTKLPRLTIDITNLPSSIDVSLEAEQMAERIAKLHQEVRDHIEATNSKYKEAADSKRRPV-HFQIGDLVMIYLRK-GRLHMGKYSKLN
        + +T   PFE+V+   P L+     LPS  D   + ++ ++   ++ Q V++H+   N K K+  D K + +  FQ GDLVM+   K G LH  K +KL 
Subjt:  NKSTGKPPFEVVYTKLPRLTIDITNLPSSIDVSLEAEQMAERIAKLHQEVRDHIEATNSKYKEAADSKRRPV-HFQIGDLVMIYLRK-GRLHMGKYSKLN

Query:  KKKIGPFPIQEKCGDNAYEIQLPSTI
            GPF + +K G N YE+ LP +I
Subjt:  KKKIGPFPIQEKCGDNAYEIQLPSTI

P0CT41 Transposon Tf2-12 polyprotein1.9e-0834.92Show/hide
Query:  NKSTGKPPFEVVYTKLPRLTIDITNLPSSIDVSLEAEQMAERIAKLHQEVRDHIEATNSKYKEAADSKRRPV-HFQIGDLVMIYLRK-GRLHMGKYSKLN
        + +T   PFE+V+   P L+     LPS  D   + ++ ++   ++ Q V++H+   N K K+  D K + +  FQ GDLVM+   K G LH  K +KL 
Subjt:  NKSTGKPPFEVVYTKLPRLTIDITNLPSSIDVSLEAEQMAERIAKLHQEVRDHIEATNSKYKEAADSKRRPV-HFQIGDLVMIYLRK-GRLHMGKYSKLN

Query:  KKKIGPFPIQEKCGDNAYEIQLPSTI
            GPF + +K G N YE+ LP +I
Subjt:  KKKIGPFPIQEKCGDNAYEIQLPSTI

Q9UR07 Transposon Tf2-11 polyprotein1.9e-0834.92Show/hide
Query:  NKSTGKPPFEVVYTKLPRLTIDITNLPSSIDVSLEAEQMAERIAKLHQEVRDHIEATNSKYKEAADSKRRPV-HFQIGDLVMIYLRK-GRLHMGKYSKLN
        + +T   PFE+V+   P L+     LPS  D   + ++ ++   ++ Q V++H+   N K K+  D K + +  FQ GDLVM+   K G LH  K +KL 
Subjt:  NKSTGKPPFEVVYTKLPRLTIDITNLPSSIDVSLEAEQMAERIAKLHQEVRDHIEATNSKYKEAADSKRRPV-HFQIGDLVMIYLRK-GRLHMGKYSKLN

Query:  KKKIGPFPIQEKCGDNAYEIQLPSTI
            GPF + +K G N YE+ LP +I
Subjt:  KKKIGPFPIQEKCGDNAYEIQLPSTI

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAGCCGTCCTCTCTCCACGATCATCGACCAATAGTTGAAGGTGCTGTAGAGAATTTGCAGCGGAATGTGGTAGAGATTAAGCAAATCTTGAGCGTTTTGGTGGACAA
GATTGACAACTTGCAAGAACATCAAGATCAGCCAAGAAAGAGGAACGGTAAGGACATTCATGGGCAAGGGATTGTTACTGGGAGAGGGGAACAGCAAGCAGTCCCAAGAA
TGTATCAAGAACCCCTCCCTTTTCAAGATGATGAGTATCCGGAATGGGAAATGAATCGCAGGGGCAACCAACGACATAACCCAAGGTATCGTGAACCTAACGAGTATAAA
ATGAAAGTTGACCTCCCTGTTTTTTATGGTAAACTTGACATTGAAGGTTTTCTTGAGTGGATCAAGAATGTTGAAAGTTTCTTTGGATATATGAACACCCCGGATCATAA
AAAAGTCGACCTTGTTGCCTTAAAACTCAAAGGTGGAACTTCCGCTTGGTGGGAGCAATTGCTCTTAAGAAACGACAATGCAGGGGGGAGCCGATCCATTGCCGACTACA
ATGAGGAATTCCACCGCTTAGATGTTGCTGGATTCAAGAAAGGCACGTCAACCGTGGATAAAACCGTAAACCAAGGTGTGGCTTCTAGCTCTAAAACGAAGGGGTGTGAT
GACATTAAAGTGGGGGACCAAACCAAGAAACTTAACAATACTTATGGCCGTCCCATGTTGGGAAAATGTTTTCGGTGTGGCCAAGTTGGTCATTTATCAAATGAATGTCC
CCAAAGAAAGACAATCAACTTCGTGGAAGATGGTGACAATCTTGATCACGATGTTGATATCAATGACGAGGAGGATCGCTTAGTTCTTGAACCCGATGAGGGTGATCCCA
ACAAATCTACGGGGAAGCCCCCATTTGAAGTAGTTTATACTAAACTCCCTCGCTTAACCATAGACATTACTAATTTACCATCTTCTATTGATGTTAGTTTGGAGGCGGAG
CAAATGGCAGAAAGGATAGCAAAATTGCATCAGGAAGTCCGTGATCATATTGAGGCAACCAACTCCAAATATAAAGAAGCTGCAGATAGTAAAAGAAGACCAGTACACTT
CCAAATTGGAGATCTGGTCATGATTTATTTAAGGAAAGGAAGACTACATATGGGCAAGTATAGTAAGCTTAACAAAAAGAAGATTGGTCCCTTTCCAATCCAAGAAAAGT
GCGGTGACAACGCCTACGAGATTCAGCTGCCGAGTACAATATCAACCCAATCTTTAATGTAG
mRNA sequenceShow/hide mRNA sequence
ATGCAGCCGTCCTCTCTCCACGATCATCGACCAATAGTTGAAGGTGCTGTAGAGAATTTGCAGCGGAATGTGGTAGAGATTAAGCAAATCTTGAGCGTTTTGGTGGACAA
GATTGACAACTTGCAAGAACATCAAGATCAGCCAAGAAAGAGGAACGGTAAGGACATTCATGGGCAAGGGATTGTTACTGGGAGAGGGGAACAGCAAGCAGTCCCAAGAA
TGTATCAAGAACCCCTCCCTTTTCAAGATGATGAGTATCCGGAATGGGAAATGAATCGCAGGGGCAACCAACGACATAACCCAAGGTATCGTGAACCTAACGAGTATAAA
ATGAAAGTTGACCTCCCTGTTTTTTATGGTAAACTTGACATTGAAGGTTTTCTTGAGTGGATCAAGAATGTTGAAAGTTTCTTTGGATATATGAACACCCCGGATCATAA
AAAAGTCGACCTTGTTGCCTTAAAACTCAAAGGTGGAACTTCCGCTTGGTGGGAGCAATTGCTCTTAAGAAACGACAATGCAGGGGGGAGCCGATCCATTGCCGACTACA
ATGAGGAATTCCACCGCTTAGATGTTGCTGGATTCAAGAAAGGCACGTCAACCGTGGATAAAACCGTAAACCAAGGTGTGGCTTCTAGCTCTAAAACGAAGGGGTGTGAT
GACATTAAAGTGGGGGACCAAACCAAGAAACTTAACAATACTTATGGCCGTCCCATGTTGGGAAAATGTTTTCGGTGTGGCCAAGTTGGTCATTTATCAAATGAATGTCC
CCAAAGAAAGACAATCAACTTCGTGGAAGATGGTGACAATCTTGATCACGATGTTGATATCAATGACGAGGAGGATCGCTTAGTTCTTGAACCCGATGAGGGTGATCCCA
ACAAATCTACGGGGAAGCCCCCATTTGAAGTAGTTTATACTAAACTCCCTCGCTTAACCATAGACATTACTAATTTACCATCTTCTATTGATGTTAGTTTGGAGGCGGAG
CAAATGGCAGAAAGGATAGCAAAATTGCATCAGGAAGTCCGTGATCATATTGAGGCAACCAACTCCAAATATAAAGAAGCTGCAGATAGTAAAAGAAGACCAGTACACTT
CCAAATTGGAGATCTGGTCATGATTTATTTAAGGAAAGGAAGACTACATATGGGCAAGTATAGTAAGCTTAACAAAAAGAAGATTGGTCCCTTTCCAATCCAAGAAAAGT
GCGGTGACAACGCCTACGAGATTCAGCTGCCGAGTACAATATCAACCCAATCTTTAATGTAG
Protein sequenceShow/hide protein sequence
MQPSSLHDHRPIVEGAVENLQRNVVEIKQILSVLVDKIDNLQEHQDQPRKRNGKDIHGQGIVTGRGEQQAVPRMYQEPLPFQDDEYPEWEMNRRGNQRHNPRYREPNEYK
MKVDLPVFYGKLDIEGFLEWIKNVESFFGYMNTPDHKKVDLVALKLKGGTSAWWEQLLLRNDNAGGSRSIADYNEEFHRLDVAGFKKGTSTVDKTVNQGVASSSKTKGCD
DIKVGDQTKKLNNTYGRPMLGKCFRCGQVGHLSNECPQRKTINFVEDGDNLDHDVDINDEEDRLVLEPDEGDPNKSTGKPPFEVVYTKLPRLTIDITNLPSSIDVSLEAE
QMAERIAKLHQEVRDHIEATNSKYKEAADSKRRPVHFQIGDLVMIYLRKGRLHMGKYSKLNKKKIGPFPIQEKCGDNAYEIQLPSTISTQSLM