; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc03g06340 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc03g06340
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr3:4593845..4600574
RNA-Seq ExpressionMoc03g06340
SyntenyMoc03g06340
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN81130.1 hypothetical protein VITISV_003944 [Vitis vinifera]1.2e-12356.9Show/hide
Query:  FYGTDFTYWKDQIVDYLHSKELELS-LDKKPDDMEEAKWKKLDRKVLGTIRLTLTKNVQSSVAKETTTMGLMSALSNIYEKPSVNNKVYLATKFFNLKMD
        F GTDF YW+ QI DYL+ ++L L  L  KP+ M+  +W  LDR+VLG IRLTL+++V  +V KE TT  LM ALS +YEKPS NNKV+L TK FNLKM 
Subjt:  FYGTDFTYWKDQIVDYLHSKELELS-LDKKPDDMEEAKWKKLDRKVLGTIRLTLTKNVQSSVAKETTTMGLMSALSNIYEKPSVNNKVYLATKFFNLKMD

Query:  EGTSVAAHINEFDTLINKLVAVDLTFMDELNAILLLRSLPNSWEPMKAAISNSWGKEKLKFADVRDAALGEEIRRKDSGIASTSGTALNVD-RGRNNNRD
        E  SVA H+NEF+T+ N+L +V++ F DE+ A+++L SLPNSWE M+ A+SNS GKEKLK+ D+RD  L EEIRR+D+G  S SG+ALN++ RGR NNR+
Subjt:  EGTSVAAHINEFDTLINKLVAVDLTFMDELNAILLLRSLPNSWEPMKAAISNSWGKEKLKFADVRDAALGEEIRRKDSGIASTSGTALNVD-RGRNNNRD

Query:  -HVKRGKSRN---NRSKSKN-SRLECWNCGKTGHLRRNCKAPKKAEGKEAGANVVAEEIHDALVLTVEGAHNTWVVDSGASFHTTGQRDILENYVAGNHG
         +  R  SRN   NRSKS++  +++CWNCGKTGH +R CK+PKK + ++  AN V EE+ DAL+L V+   + WV+DSGASFHTT  R+I++NYVAG+ G
Subjt:  -HVKRGKSRN---NRSKSKN-SRLECWNCGKTGHLRRNCKAPKKAEGKEAGANVVAEEIHDALVLTVEGAHNTWVVDSGASFHTTGQRDILENYVAGNHG

Query:  KVYLADGKPLDIIGIGDVNLKVANDSVWMIRKVRHVQNMMKNLISVGQLDNEGCEISFGQGNWKVTKGSMVIARGRKLGTLYVNDNDKDMIAVVDHSSQT
        KVYLADG  LD++G+GDV + + N SVW++ KVRH+ ++ +NLISVGQLD+EG  I F  G WKVTKG+ V+ARG+K GTLY+    +D IAV D S+ T
Subjt:  KVYLADGKPLDIIGIGDVNLKVANDSVWMIRKVRHVQNMMKNLISVGQLDNEGCEISFGQGNWKVTKGSMVIARGRKLGTLYVNDNDKDMIAVVDHSSQT

Query:  QIWQSRLGHMSEK
         +W  RLGHMSEK
Subjt:  QIWQSRLGHMSEK

CAN81130.1 hypothetical protein VITISV_003944 [Vitis vinifera]2.7e-0648.57Show/hide
Query:  LGHMSEKAEIGYRFWDDQNKKIIRSKNVIFNEKVLYKDRNKVDSRKTELDVSTENTQYISPPEVETKTTE
        +G+  EK   GYRFWD+QN+KIIRS+NVIFNE+V+YKDR+ V S  TE+D   + +++++  E+   T +
Subjt:  LGHMSEKAEIGYRFWDDQNKKIIRSKNVIFNEKVLYKDRNKVDSRKTELDVSTENTQYISPPEVETKTTE

CAN81130.1 hypothetical protein VITISV_003944 [Vitis vinifera]1.6e-12356.9Show/hide
Query:  FYGTDFTYWKDQIVDYLHSKELELS-LDKKPDDMEEAKWKKLDRKVLGTIRLTLTKNVQSSVAKETTTMGLMSALSNIYEKPSVNNKVYLATKFFNLKMD
        F GTDF YW+ QI DYL+ ++L LS L  KP+ M+  KW  LDR+VLG IRLTL+++V  +V KE TT  LM ALS +YEKPS NNKV+L  K FNLKM 
Subjt:  FYGTDFTYWKDQIVDYLHSKELELS-LDKKPDDMEEAKWKKLDRKVLGTIRLTLTKNVQSSVAKETTTMGLMSALSNIYEKPSVNNKVYLATKFFNLKMD

Query:  EGTSVAAHINEFDTLINKLVAVDLTFMDELNAILLLRSLPNSWEPMKAAISNSWGKEKLKFADVRDAALGEEIRRKDSGIASTSGTALNVD-RGRNNNRD
        +  SVA H+NEF+T+ N+L +V++ F DE+ A+++L SLPNSWE M+ A+SNS GKEKLK+ D+RD  L EEIRR+D+G  S SG+ALN++ RGR NNR+
Subjt:  EGTSVAAHINEFDTLINKLVAVDLTFMDELNAILLLRSLPNSWEPMKAAISNSWGKEKLKFADVRDAALGEEIRRKDSGIASTSGTALNVD-RGRNNNRD

Query:  -HVKRGKSRN---NRSKSKN-SRLECWNCGKTGHLRRNCKAPKKAEGKEAGANVVAEEIHDALVLTVEGAHNTWVVDSGASFHTTGQRDILENYVAGNHG
         +  R  SRN   NRSKS++  +++CWNCGKTGH +R CK+PKK + ++  AN V EE+ DAL+L V+   + WV+DSGASFHTT  R+I++NYVAG+ G
Subjt:  -HVKRGKSRN---NRSKSKN-SRLECWNCGKTGHLRRNCKAPKKAEGKEAGANVVAEEIHDALVLTVEGAHNTWVVDSGASFHTTGQRDILENYVAGNHG

Query:  KVYLADGKPLDIIGIGDVNLKVANDSVWMIRKVRHVQNMMKNLISVGQLDNEGCEISFGQGNWKVTKGSMVIARGRKLGTLYVNDNDKDMIAVVDHSSQT
        KVYLADG  LD++G+GDV + + N SVW++ KVRH+ ++ +NLISVGQLD+EG  I F  G WKVTKG+ V+ARG+K GTLY+    +D IAV D S+ T
Subjt:  KVYLADGKPLDIIGIGDVNLKVANDSVWMIRKVRHVQNMMKNLISVGQLDNEGCEISFGQGNWKVTKGSMVIARGRKLGTLYVNDNDKDMIAVVDHSSQT

Query:  QIWQSRLGHMSEK
         +W  RLGHMSEK
Subjt:  QIWQSRLGHMSEK

KAG7561662.1 Zinc finger CCHC-type superfamily [Arabidopsis thaliana x Arabidopsis arenosa]1.7e-12555.99Show/hide
Query:  FYGTDFTYWKDQIVDYLHSKELELSLDKKPDDMEEAKWKKLDRKVLGTIRLTLTKNVQSSVAKETTTMGLMSALSNIYEKPSVNNKVYLATKFFNLKMDE
        F GTDF +W+ QI DYL+ K+L   L  KP+ M + +W  LDR+VLG IRLTL+KNV  +VAKE TT GLM  LS++YEKPS NNKV+L  K F+LKM+E
Subjt:  FYGTDFTYWKDQIVDYLHSKELELSLDKKPDDMEEAKWKKLDRKVLGTIRLTLTKNVQSSVAKETTTMGLMSALSNIYEKPSVNNKVYLATKFFNLKMDE

Query:  GTSVAAHINEFDTLINKLVAVDLTFMDELNAILLLRSLPNSWEPMKAAISNSWGKEKLKFADVRDAALGEEIRRKDSGIASTSGTALNVD-RGRNNNRDH
        G  VA H+NEF+T++N+L +V++ F DE+ A++L+ SLPNSWEPM+AA+SNS G +KLKF DVRD  LGEE+RR D+G  STS +A NV+ RGR+ NR +
Subjt:  GTSVAAHINEFDTLINKLVAVDLTFMDELNAILLLRSLPNSWEPMKAAISNSWGKEKLKFADVRDAALGEEIRRKDSGIASTSGTALNVD-RGRNNNRDH

Query:  VKRG--KSRNNRSKSKNSR-LECWNCGKTGHLRRNCKA-PKKAEGKEAGANVVAEEIHDALVLTVEG-----AHNT------------------WVVDSG
          RG  KSRN + +SK+ + +ECWNCGKTGH + NC A PKK   K  GAN V +EI DAL+++V+       H+T                  WV+DSG
Subjt:  VKRG--KSRNNRSKSKNSR-LECWNCGKTGHLRRNCKA-PKKAEGKEAGANVVAEEIHDALVLTVEG-----AHNT------------------WVVDSG

Query:  ASFHTTGQRDILENYVAGNHGKVYLADGKPLDIIGIGDVNLKVANDSVWMIRKVRHVQNMMKNLISVGQLDNEGCEISFGQGNWKVTKGSMVIARGRKLG
        ASFHTT  R+I+ENYV GN+GKVYLA+G PLDI+GIGD+NLK+++  VW I KVRHV  +M+NLISVGQLD+ G  ++FG G WKV KGSMV+ARG K G
Subjt:  ASFHTTGQRDILENYVAGNHGKVYLADGKPLDIIGIGDVNLKVANDSVWMIRKVRHVQNMMKNLISVGQLDNEGCEISFGQGNWKVTKGSMVIARGRKLG

Query:  TLYVNDNDKDMIAVVDHSSQTQIWQSRLGHMSEK
        +LY+  + ++ IA+V+++ QTQ+W  RLGHMSEK
Subjt:  TLYVNDNDKDMIAVVDHSSQTQIWQSRLGHMSEK

KAG7561662.1 Zinc finger CCHC-type superfamily [Arabidopsis thaliana x Arabidopsis arenosa]5.2e-0539.6Show/hide
Query:  EIGYRFWDDQNKKIIRSKNVIFNEKVLYKDRNKVDSRKTE---------LDVSTENTQYISPPEVETKTTEIE-DQNTITLEETTVEFDEQVDELDKPVV
        E GYRFWDDQN+KIIRSKNV+FNE  LYKD+ K  S + E         L    ++T      E + +  E++ +  TI +  T V    +   + KPVV
Subjt:  EIGYRFWDDQNKKIIRSKNVIFNEKVLYKDRNKVDSRKTE---------LDVSTENTQYISPPEVETKTTEIE-DQNTITLEETTVEFDEQVDELDKPVV

Query:  K
        +
Subjt:  K

KAG7561662.1 Zinc finger CCHC-type superfamily [Arabidopsis thaliana x Arabidopsis arenosa]2.2e-12556.45Show/hide
Query:  FYGTDFTYWKDQIVDYLHSKELELSLDKKPDDMEEAKWKKLDRKVLGTIRLTLTKNVQSSVAKETTTMGLMSALSNIYEKPSVNNKVYLATKFFNLKMDE
        F GTDF +W+ QI DYL+ K+L   L  KP+ M + +W  LDR+VLG IRLTL+KNV  +VAKE TT GLM  LS++YEKPS NNKV+L  K F+LKM+E
Subjt:  FYGTDFTYWKDQIVDYLHSKELELSLDKKPDDMEEAKWKKLDRKVLGTIRLTLTKNVQSSVAKETTTMGLMSALSNIYEKPSVNNKVYLATKFFNLKMDE

Query:  GTSVAAHINEFDTLINKLVAVDLTFMDELNAILLLRSLPNSWEPMKAAISNSWGKEKLKFADVRDAALGEEIRRKDSGIASTSGTALNVD-RGRNNNRDH
        G  VA H+NEF+T++N+L +V++ F DE+ A++LL SLPNSWEPM+AA+SNS G +KLKF DVRD  LGEE+RR D+G  S S +A NV+ RGR+ NR +
Subjt:  GTSVAAHINEFDTLINKLVAVDLTFMDELNAILLLRSLPNSWEPMKAAISNSWGKEKLKFADVRDAALGEEIRRKDSGIASTSGTALNVD-RGRNNNRDH

Query:  VKRG--KSRNNRSKSKNSR-LECWNCGKTGHLRRNCKA-PKKAEGKEAGANVVAEEIHDALVLTVEG-----AHNT------------------WVVDSG
          RG  KSRN + +SK+ + +ECWNCGKTGH + NC A PKK   K  GAN V +EI DAL++ V+       H+T                  WV+DSG
Subjt:  VKRG--KSRNNRSKSKNSR-LECWNCGKTGHLRRNCKA-PKKAEGKEAGANVVAEEIHDALVLTVEG-----AHNT------------------WVVDSG

Query:  ASFHTTGQRDILENYVAGNHGKVYLADGKPLDIIGIGDVNLKVANDSVWMIRKVRHVQNMMKNLISVGQLDNEGCEISFGQGNWKVTKGSMVIARGRKLG
        ASFHTT  R+I+ENYVAGN+GKVYLA+G PLDI+GIGD+NLK+++  VW I KVRHV  +M+NLISVGQLD+ G  ++FG G WKV KGSMV+ARG K G
Subjt:  ASFHTTGQRDILENYVAGNHGKVYLADGKPLDIIGIGDVNLKVANDSVWMIRKVRHVQNMMKNLISVGQLDNEGCEISFGQGNWKVTKGSMVIARGRKLG

Query:  TLYVNDNDKDMIAVVDHSSQTQIWQSRLGHMSEK
        +LY+  + ++ IAVV+++ QTQ+W  RLGHMSEK
Subjt:  TLYVNDNDKDMIAVVDHSSQTQIWQSRLGHMSEK

RVW23526.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]1.8e-0543.43Show/hide
Query:  LGHMSEKAEIGYRFWDDQNKKIIRSKNVIFNEKVLYKDRNKVDSRKTELDVSTENTQYISPPEVETKTTEIEDQNTITLEETTVEFDEQVDELDKPVVK
        +G+  EK   GYRFWD+QN+KIIRS+NVIFNE+V+YKDR+ V S  TE+D   + +++++  E+   T +         EE     + QVD L  PVV+
Subjt:  LGHMSEKAEIGYRFWDDQNKKIIRSKNVIFNEKVLYKDRNKVDSRKTELDVSTENTQYISPPEVETKTTEIEDQNTITLEETTVEFDEQVDELDKPVVK

XP_022152845.1 uncharacterized protein LOC111020469 [Momordica charantia]2.0e-15073.35Show/hide
Query:  IVDYLHSKELELSLDKKPDDMEEAKWKKLDRKVLGTIRLTLTKNVQSSVAKETTTMGLMSALSNIYEKPSVNNKVYLATKFFNLKMDEGTSVAAHINEFD
        ++DYLHSKELE  L+ KPDDM E +WKKLDRKVLGTIRLTLTKNVQSSVAK TTTMGLM+AL+N+YEK SVNNKVYLATKFFNLKM E T + AH+NEFD
Subjt:  IVDYLHSKELELSLDKKPDDMEEAKWKKLDRKVLGTIRLTLTKNVQSSVAKETTTMGLMSALSNIYEKPSVNNKVYLATKFFNLKMDEGTSVAAHINEFD

Query:  TLINKLVAVDLTFMDELNAILLLRSLPNSWEPMKAAISNSWGKEKLKFADVRDAALGEEIRRKDSGIASTSGTALNVDRGRNNNRDHVKRGKSRNNRSKS
         LINKLVAVDL F  E+ AILLLRSLP+SWEPM+AAISNS  KEKLKF DVRDAAL EEIRRKDSGIA TSG+ LNVDRGRNNNR +  RGKS+NNRS+S
Subjt:  TLINKLVAVDLTFMDELNAILLLRSLPNSWEPMKAAISNSWGKEKLKFADVRDAALGEEIRRKDSGIASTSGTALNVDRGRNNNRDHVKRGKSRNNRSKS

Query:  KNSRLECWNCGKTGHLRRNCKAPKKAEGKEAGANVVAEEIHDALVLTVEGAHNTWVVDSGASFHTTGQRDILENYVAGNHGKVYLADGKPLDIIGIGDVN
        +NSR ECWNCGK GHL+ NCKAPKK EG EA AN VAE+IHDALV+ VE AH+TWV+DS                  GNHGKVYLADG+PLDIIGIG+VN
Subjt:  KNSRLECWNCGKTGHLRRNCKAPKKAEGKEAGANVVAEEIHDALVLTVEGAHNTWVVDSGASFHTTGQRDILENYVAGNHGKVYLADGKPLDIIGIGDVN

Query:  LKVANDSVWMIRKVRHVQNMMKNLISVGQLDNEGCEISFGQGNWKVTKGSMVIARGRKLGTLYVNDNDKDMIAVVDHSSQTQIWQSRLGHMSEK
        LK+AN SVW IRK                LDNEGCEISFGQGNWKVTKG+MVIARG K GTLYVN+NDKDM+AVVDHSS TQ+W + LGHMSEK
Subjt:  LKVANDSVWMIRKVRHVQNMMKNLISVGQLDNEGCEISFGQGNWKVTKGSMVIARGRKLGTLYVNDNDKDMIAVVDHSSQTQIWQSRLGHMSEK

TrEMBL top hitse value%identityAlignment
A0A2N9G6Q3 Uncharacterized protein8.6e-0663.64Show/hide
Query:  GYRFWDDQNKKIIRSKNVIFNEKVLYKDRNKVDSRKTELDVSTE
        GYRFWDDQN+K+IRS+NVIFNE+V+YKDR  V  R+   D   E
Subjt:  GYRFWDDQNKKIIRSKNVIFNEKVLYKDRNKVDSRKTELDVSTE

A0A2N9GHK9 Uncharacterized protein5.6e-0549.32Show/hide
Query:  GYRFWDDQNKKIIRSKNVIFNEKVLYKDRN-------KVDSRKTE---LDVSTENT-QYISPPEVETKTTEIE
        GYRFWDDQN+K+IRS+NVIFNE+V+YKDR+       KV+ +K+E   LD  + NT Q     E E    ++E
Subjt:  GYRFWDDQNKKIIRSKNVIFNEKVLYKDRN-------KVDSRKTE---LDVSTENT-QYISPPEVETKTTEIE

A0A2N9GHK9 Uncharacterized protein2.0e-12757.21Show/hide
Query:  MTGEDKLVI----FYGTDFTYWKDQIVDYLHSKELELS-LDKKPDDMEEAKWKKLDRKVLGTIRLTLTKNVQSSVAKETTTMGLMSALSNIYEKPSVNNK
        MTGE+  V     F GTDF YW+ QI DYL+ K+L L  L +KP+DME+A+W  LDR+VLG IRLTL++ V  +V KE TT  LM+AL  +YEKPS NNK
Subjt:  MTGEDKLVI----FYGTDFTYWKDQIVDYLHSKELELS-LDKKPDDMEEAKWKKLDRKVLGTIRLTLTKNVQSSVAKETTTMGLMSALSNIYEKPSVNNK

Query:  VYLATKFFNLKMDEGTSVAAHINEFDTLINKLVAVDLTFMDELNAILLLRSLPNSWEPMKAAISNSWGKEKLKFADVRDAALGEEIRRKDSGIASTSGTA
        V+L  K FNLKM EGT+VA H+NEF+T+ N+L +V++ F DE+ A+++L SLPNSWE M+ A+SNS GK KLK+ D+RD  LGEE+RR+D+G  S+SG+A
Subjt:  VYLATKFFNLKMDEGTSVAAHINEFDTLINKLVAVDLTFMDELNAILLLRSLPNSWEPMKAAISNSWGKEKLKFADVRDAALGEEIRRKDSGIASTSGTA

Query:  LNVD-RGRNNNRDHVK-RGKSRNNRSKSKNSR-LECWNCGKTGHLRRNCKAPKKAEGKEAGANVVAEEIHDALVLTVEGAHNTWVVDSGASFHTTGQRDI
        LN++ RGR  +R++ + R KSR  RSKSK  R LECWNCGKTGH+R+NC   KK + +   ANVV EE+HDAL+L+V+    +WV+DSGASFHTT  R+I
Subjt:  LNVD-RGRNNNRDHVK-RGKSRNNRSKSKNSR-LECWNCGKTGHLRRNCKAPKKAEGKEAGANVVAEEIHDALVLTVEGAHNTWVVDSGASFHTTGQRDI

Query:  LENYVAGNHGKVYLADGKPLDIIGIGDVNLKVANDSVWMIRKVRHVQNMMKNLISVGQLDNEGCEISFGQGNWKVTKGSMVIARGRKLGTLYVNDNDKDM
        ++NYVAG+ GKVYLAD + LD++G+GDV + + N SVW+++KVRHV  + +NLISVGQLD EG  I F  G WK+TKG+MV+ARG+K GTLY+  + +D 
Subjt:  LENYVAGNHGKVYLADGKPLDIIGIGDVNLKVANDSVWMIRKVRHVQNMMKNLISVGQLDNEGCEISFGQGNWKVTKGSMVIARGRKLGTLYVNDNDKDM

Query:  IAVVDHSSQTQIWQSRLGHMSEK
        IAV +  + T +W  RLGHMSEK
Subjt:  IAVVDHSSQTQIWQSRLGHMSEK

A0A2N9IKI1 Uncharacterized protein2.0e-12757.21Show/hide
Query:  MTGEDKLVI----FYGTDFTYWKDQIVDYLHSKELELS-LDKKPDDMEEAKWKKLDRKVLGTIRLTLTKNVQSSVAKETTTMGLMSALSNIYEKPSVNNK
        MTGE+  V     F GTDF YW+ QI DYL+ K+L L  L +KP+DME+A+W  LDR+VLG IRLTL++ V  +V KE TT  LM+AL  +YEKPS NNK
Subjt:  MTGEDKLVI----FYGTDFTYWKDQIVDYLHSKELELS-LDKKPDDMEEAKWKKLDRKVLGTIRLTLTKNVQSSVAKETTTMGLMSALSNIYEKPSVNNK

Query:  VYLATKFFNLKMDEGTSVAAHINEFDTLINKLVAVDLTFMDELNAILLLRSLPNSWEPMKAAISNSWGKEKLKFADVRDAALGEEIRRKDSGIASTSGTA
        V+L  K FNLKM EGT+VA H+NEF+T+ N+L +V++ F DE+ A+++L SLPNSWE M+ A+SNS GK KLK+ D+RD  LGEE+RR+D+G  S+SG+A
Subjt:  VYLATKFFNLKMDEGTSVAAHINEFDTLINKLVAVDLTFMDELNAILLLRSLPNSWEPMKAAISNSWGKEKLKFADVRDAALGEEIRRKDSGIASTSGTA

Query:  LNVD-RGRNNNRDHVK-RGKSRNNRSKSKNSR-LECWNCGKTGHLRRNCKAPKKAEGKEAGANVVAEEIHDALVLTVEGAHNTWVVDSGASFHTTGQRDI
        LN++ RGR  +R++ + R KSR  RSKSK  R LECWNCGKTGH+R+NC   KK + +   ANVV EE+HDAL+L+V+    +WV+DSGASFHTT  R+I
Subjt:  LNVD-RGRNNNRDHVK-RGKSRNNRSKSKNSR-LECWNCGKTGHLRRNCKAPKKAEGKEAGANVVAEEIHDALVLTVEGAHNTWVVDSGASFHTTGQRDI

Query:  LENYVAGNHGKVYLADGKPLDIIGIGDVNLKVANDSVWMIRKVRHVQNMMKNLISVGQLDNEGCEISFGQGNWKVTKGSMVIARGRKLGTLYVNDNDKDM
        ++NYVAG+ GKVYLAD + LD++G+GDV + + N SVW+++KVRHV  + +NLISVGQLD EG  I F  G WK+TKG+MV+ARG+K GTLY+  + +D 
Subjt:  LENYVAGNHGKVYLADGKPLDIIGIGDVNLKVANDSVWMIRKVRHVQNMMKNLISVGQLDNEGCEISFGQGNWKVTKGSMVIARGRKLGTLYVNDNDKDM

Query:  IAVVDHSSQTQIWQSRLGHMSEK
        IAV +  + T +W  RLGHMSEK
Subjt:  IAVVDHSSQTQIWQSRLGHMSEK

A0A2N9IKI1 Uncharacterized protein5.6e-0549.32Show/hide
Query:  GYRFWDDQNKKIIRSKNVIFNEKVLYKDRN-------KVDSRKTE---LDVSTENT-QYISPPEVETKTTEIE
        GYRFWDDQN+K+IRS+NVIFNE+V+YKDR+       KV+ +K+E   LD  + NT Q     E E    ++E
Subjt:  GYRFWDDQNKKIIRSKNVIFNEKVLYKDRN-------KVDSRKTE---LDVSTENT-QYISPPEVETKTTEIE

A0A2N9IKI1 Uncharacterized protein2.0e-12757.21Show/hide
Query:  MTGEDKLVI----FYGTDFTYWKDQIVDYLHSKELELS-LDKKPDDMEEAKWKKLDRKVLGTIRLTLTKNVQSSVAKETTTMGLMSALSNIYEKPSVNNK
        MTGE+  V     F GTDF YW+ QI DYL+ K+L L  L +KP+DME+A+W  LDR+VLG IRLTL++ V  +V KE TT  LM+AL  +YEKPS NNK
Subjt:  MTGEDKLVI----FYGTDFTYWKDQIVDYLHSKELELS-LDKKPDDMEEAKWKKLDRKVLGTIRLTLTKNVQSSVAKETTTMGLMSALSNIYEKPSVNNK

Query:  VYLATKFFNLKMDEGTSVAAHINEFDTLINKLVAVDLTFMDELNAILLLRSLPNSWEPMKAAISNSWGKEKLKFADVRDAALGEEIRRKDSGIASTSGTA
        V+L  K FNLKM EGT+VA H+NEF+T+ N+L +V++ F DE+ A+++L SLPNSWE M+ A+SNS GK KLK+ D+RD  LGEE+RR+D+G  S+SG+A
Subjt:  VYLATKFFNLKMDEGTSVAAHINEFDTLINKLVAVDLTFMDELNAILLLRSLPNSWEPMKAAISNSWGKEKLKFADVRDAALGEEIRRKDSGIASTSGTA

Query:  LNVD-RGRNNNRDHVK-RGKSRNNRSKSKNSR-LECWNCGKTGHLRRNCKAPKKAEGKEAGANVVAEEIHDALVLTVEGAHNTWVVDSGASFHTTGQRDI
        LN++ RGR  +R++ + R KSR  RSKSK  R LECWNCGKTGH+R+NC   KK + +   ANVV EE+HDAL+L+V+    +WV+DSGASFHTT  R+I
Subjt:  LNVD-RGRNNNRDHVK-RGKSRNNRSKSKNSR-LECWNCGKTGHLRRNCKAPKKAEGKEAGANVVAEEIHDALVLTVEGAHNTWVVDSGASFHTTGQRDI

Query:  LENYVAGNHGKVYLADGKPLDIIGIGDVNLKVANDSVWMIRKVRHVQNMMKNLISVGQLDNEGCEISFGQGNWKVTKGSMVIARGRKLGTLYVNDNDKDM
        ++NYVAG+ GKVYLAD + LD++G+GDV + + N SVW+++KVRHV  + +NLISVGQLD EG  I F  G WK+TKG+MV+ARG+K GTLY+  + +D 
Subjt:  LENYVAGNHGKVYLADGKPLDIIGIGDVNLKVANDSVWMIRKVRHVQNMMKNLISVGQLDNEGCEISFGQGNWKVTKGSMVIARGRKLGTLYVNDNDKDM

Query:  IAVVDHSSQTQIWQSRLGHMSEK
        IAV +  + T +W  RLGHMSEK
Subjt:  IAVVDHSSQTQIWQSRLGHMSEK

A0A2N9J3Y8 Uncharacterized protein5.6e-0549.32Show/hide
Query:  GYRFWDDQNKKIIRSKNVIFNEKVLYKDRN-------KVDSRKTE---LDVSTENT-QYISPPEVETKTTEIE
        GYRFWDDQN+K+IRS+NVIFNE+V+YKDR+       KV+ +K+E   LD  + NT Q     E E    ++E
Subjt:  GYRFWDDQNKKIIRSKNVIFNEKVLYKDRN-------KVDSRKTE---LDVSTENT-QYISPPEVETKTTEIE

A0A2N9J3Y8 Uncharacterized protein2.0e-12757.21Show/hide
Query:  MTGEDKLVI----FYGTDFTYWKDQIVDYLHSKELELS-LDKKPDDMEEAKWKKLDRKVLGTIRLTLTKNVQSSVAKETTTMGLMSALSNIYEKPSVNNK
        MTGE+  V     F GTDF YW+ QI DYL+ K+L L  L +KP+DME+A+W  LDR+VLG IRLTL++ V  +V KE TT  LM+AL  +YEKPS NNK
Subjt:  MTGEDKLVI----FYGTDFTYWKDQIVDYLHSKELELS-LDKKPDDMEEAKWKKLDRKVLGTIRLTLTKNVQSSVAKETTTMGLMSALSNIYEKPSVNNK

Query:  VYLATKFFNLKMDEGTSVAAHINEFDTLINKLVAVDLTFMDELNAILLLRSLPNSWEPMKAAISNSWGKEKLKFADVRDAALGEEIRRKDSGIASTSGTA
        V+L  K FNLKM EGT+VA H+NEF+T+ N+L +V++ F DE+ A+++L SLPNSWE M+ A+SNS GK KLK+ D+RD  LGEE+RR+D+G  S+SG+A
Subjt:  VYLATKFFNLKMDEGTSVAAHINEFDTLINKLVAVDLTFMDELNAILLLRSLPNSWEPMKAAISNSWGKEKLKFADVRDAALGEEIRRKDSGIASTSGTA

Query:  LNVD-RGRNNNRDHVK-RGKSRNNRSKSKNSR-LECWNCGKTGHLRRNCKAPKKAEGKEAGANVVAEEIHDALVLTVEGAHNTWVVDSGASFHTTGQRDI
        LN++ RGR  +R++ + R KSR  RSKSK  R LECWNCGKTGH+R+NC   KK + +   ANVV EE+HDAL+L+V+    +WV+DSGASFHTT  R+I
Subjt:  LNVD-RGRNNNRDHVK-RGKSRNNRSKSKNSR-LECWNCGKTGHLRRNCKAPKKAEGKEAGANVVAEEIHDALVLTVEGAHNTWVVDSGASFHTTGQRDI

Query:  LENYVAGNHGKVYLADGKPLDIIGIGDVNLKVANDSVWMIRKVRHVQNMMKNLISVGQLDNEGCEISFGQGNWKVTKGSMVIARGRKLGTLYVNDNDKDM
        ++NYVAG+ GKVYLAD + LD++G+GDV + + N SVW+++KVRHV  + +NLISVGQLD EG  I F  G WK+TKG+MV+ARG+K GTLY+  + +D 
Subjt:  LENYVAGNHGKVYLADGKPLDIIGIGDVNLKVANDSVWMIRKVRHVQNMMKNLISVGQLDNEGCEISFGQGNWKVTKGSMVIARGRKLGTLYVNDNDKDM

Query:  IAVVDHSSQTQIWQSRLGHMSEK
        IAV +  + T +W  RLGHMSEK
Subjt:  IAVVDHSSQTQIWQSRLGHMSEK

A0A6J1DF43 uncharacterized protein LOC1110204699.7e-15173.35Show/hide
Query:  IVDYLHSKELELSLDKKPDDMEEAKWKKLDRKVLGTIRLTLTKNVQSSVAKETTTMGLMSALSNIYEKPSVNNKVYLATKFFNLKMDEGTSVAAHINEFD
        ++DYLHSKELE  L+ KPDDM E +WKKLDRKVLGTIRLTLTKNVQSSVAK TTTMGLM+AL+N+YEK SVNNKVYLATKFFNLKM E T + AH+NEFD
Subjt:  IVDYLHSKELELSLDKKPDDMEEAKWKKLDRKVLGTIRLTLTKNVQSSVAKETTTMGLMSALSNIYEKPSVNNKVYLATKFFNLKMDEGTSVAAHINEFD

Query:  TLINKLVAVDLTFMDELNAILLLRSLPNSWEPMKAAISNSWGKEKLKFADVRDAALGEEIRRKDSGIASTSGTALNVDRGRNNNRDHVKRGKSRNNRSKS
         LINKLVAVDL F  E+ AILLLRSLP+SWEPM+AAISNS  KEKLKF DVRDAAL EEIRRKDSGIA TSG+ LNVDRGRNNNR +  RGKS+NNRS+S
Subjt:  TLINKLVAVDLTFMDELNAILLLRSLPNSWEPMKAAISNSWGKEKLKFADVRDAALGEEIRRKDSGIASTSGTALNVDRGRNNNRDHVKRGKSRNNRSKS

Query:  KNSRLECWNCGKTGHLRRNCKAPKKAEGKEAGANVVAEEIHDALVLTVEGAHNTWVVDSGASFHTTGQRDILENYVAGNHGKVYLADGKPLDIIGIGDVN
        +NSR ECWNCGK GHL+ NCKAPKK EG EA AN VAE+IHDALV+ VE AH+TWV+DS                  GNHGKVYLADG+PLDIIGIG+VN
Subjt:  KNSRLECWNCGKTGHLRRNCKAPKKAEGKEAGANVVAEEIHDALVLTVEGAHNTWVVDSGASFHTTGQRDILENYVAGNHGKVYLADGKPLDIIGIGDVN

Query:  LKVANDSVWMIRKVRHVQNMMKNLISVGQLDNEGCEISFGQGNWKVTKGSMVIARGRKLGTLYVNDNDKDMIAVVDHSSQTQIWQSRLGHMSEK
        LK+AN SVW IRK                LDNEGCEISFGQGNWKVTKG+MVIARG K GTLYVN+NDKDM+AVVDHSS TQ+W + LGHMSEK
Subjt:  LKVANDSVWMIRKVRHVQNMMKNLISVGQLDNEGCEISFGQGNWKVTKGSMVIARGRKLGTLYVNDNDKDMIAVVDHSSQTQIWQSRLGHMSEK

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-945.7e-5534.35Show/hide
Query:  FTYWKDQIVDYLHSKELELSLD---KKPDDMEEAKWKKLDRKVLGTIRLTLTKNVQSSVAKETTTMGLMSALSNIYEKPSVNNKVYLATKFFNLKMDEGT
        F+ W+ ++ D L  + L   LD   KKPD M+   W  LD +    IRL L+ +V +++  E T  G+ + L ++Y   ++ NK+YL  + + L M EGT
Subjt:  FTYWKDQIVDYLHSKELELSLD---KKPDDMEEAKWKKLDRKVLGTIRLTLTKNVQSSVAKETTTMGLMSALSNIYEKPSVNNKVYLATKFFNLKMDEGT

Query:  SVAAHINEFDTLINKLVAVDLTFMDELNAILLLRSLPNSWEPMKAAISNSWGKEKLKFADVRDA-ALGEEIRRKDSGIAS---TSGTALNVDRGRNNNRD
        +  +H+N F+ LI +L  + +   +E  AILLL SLP+S++ +   I +  GK  ++  DV  A  L E++R+K         T G   +  R  NN   
Subjt:  SVAAHINEFDTLINKLVAVDLTFMDELNAILLLRSLPNSWEPMKAAISNSWGKEKLKFADVRDA-ALGEEIRRKDSGIAS---TSGTALNVDRGRNNNRD

Query:  HVKRGKSRNNRSKSKNSRLECWNCGKTGHLRRNCKAPKKAEGKEAGA--------------NVVAEEIHDALVLTVEGAHNTWVVDSGASFHTTGQRDIL
           RGKS+N   +SK+    C+NC + GH +R+C  P+K +G+ +G               NVV     +   + + G  + WVVD+ AS H T  RD+ 
Subjt:  HVKRGKSRNNRSKSKNSRLECWNCGKTGHLRRNCKAPKKAEGKEAGA--------------NVVAEEIHDALVLTVEGAHNTWVVDSGASFHTTGQRDIL

Query:  ENYVAGNHGKVYLADGKPLDIIGIGDVNLKVANDSVWMIRKVRHVQNMMKNLISVGQLDNEGCEISFGQGNWKVTKGSMVIARGRKLGTLYVNDND---K
          YVAG+ G V + +     I GIGD+ +K       +++ VRHV ++  NLIS   LD +G E  F    W++TKGS+VIA+G   GTLY  + +    
Subjt:  ENYVAGNHGKVYLADGKPLDIIGIGDVNLKVANDSVWMIRKVRHVQNMMKNLISVGQLDNEGCEISFGQGNWKVTKGSMVIARGRKLGTLYVNDND---K

Query:  DMIAVVDHSSQTQIWQSRLGHMSEK
        ++ A  D  S   +W  R+GHMSEK
Subjt:  DMIAVVDHSSQTQIWQSRLGHMSEK

Arabidopsis top hitse value%identityAlignment
AT3G29785.1 unknown protein8.5e-1447.06Show/hide
Query:  GTDFTYWKDQIVDYLHSKELELSLDKKPDDMEEAKWKKLDRKVLGTIRLTLTKNVQSSVAKETTTMGLMSALSNIYEKPSVNNKV
        GT +++ + +I DYL+ K+L   L KK + M +  W  L R+VL  IRLT++KN+  +VAKE +  GLM  LS+IY+KPS NN V
Subjt:  GTDFTYWKDQIVDYLHSKELELSLDKKPDDMEEAKWKKLDRKVLGTIRLTLTKNVQSSVAKETTTMGLMSALSNIYEKPSVNNKV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGACTGGAGAAGATAAATTGGTTATTTTTTATGGAACTGATTTTACATACTGGAAGGATCAGATAGTAGATTATCTACATTCAAAGGAATTGGAATTGTCA
TTAGACAAGAAGCCAGATGACATGGAAGAAGCCAAATGGAAAAAGTTGGACAGGAAGGTGTTGGGTACGATTCGCCTAACATTAACAAAAAATGTGCAGAGCAGC
GTAGCTAAGGAGACTACCACAATGGGGTTGATGAGTGCATTGTCCAACATATATGAGAAGCCCTCAGTAAATAATAAGGTGTATCTCGCAACTAAATTTTTTAAT
TTGAAGATGGATGAAGGTACATCTGTAGCTGCCCATATAAATGAATTTGATACGTTGATTAACAAACTGGTTGCTGTGGATTTAACATTTATGGATGAATTAAAT
GCTATCTTGTTGTTGAGATCTTTACCTAACAGTTGGGAGCCTATGAAGGCAGCTATTTCAAATTCTTGGGGAAAAGAGAAATTGAAATTTGCAGATGTCAGAGAT
GCAGCTCTTGGAGAGGAGATTCGCAGAAAGGATTCTGGTATTGCGTCTACTTCTGGTACAGCATTGAATGTGGACAGAGGAAGAAATAATAACAGAGACCACGTA
AAACGTGGAAAGTCAAGAAACAACAGAAGCAAGTCCAAAAACAGCAGACTAGAATGTTGGAATTGTGGTAAGACAGGACATCTGAGGAGGAACTGCAAAGCTCCA
AAGAAAGCTGAGGGTAAAGAAGCTGGTGCAAATGTTGTTGCTGAAGAAATACATGATGCTCTAGTTCTTACAGTTGAGGGCGCTCATAACACATGGGTGGTGGAT
TCAGGTGCGTCTTTTCATACAACAGGACAACGTGACATTCTTGAGAATTATGTTGCAGGAAATCATGGAAAGGTGTATCTTGCTGATGGAAAGCCTTTGGACATC
ATTGGGATTGGTGACGTTAATTTAAAAGTGGCGAACGATTCAGTCTGGATGATTCGCAAGGTACGTCACGTTCAGAATATGATGAAGAACCTGATTTCCGTGGGG
CAGCTGGATAATGAAGGATGTGAAATATCCTTCGGTCAAGGAAACTGGAAAGTTACAAAGGGTTCGATGGTGATCGCTCGAGGAAGAAAGTTAGGAACTTTGTAT
GTCAACGACAACGACAAAGATATGATAGCTGTTGTAGATCATTCAAGTCAGACCCAAATATGGCAAAGTAGGCTGGGACATATGAGTGAAAAAGCTGAGATAGGT
TACAGATTTTGGGATGACCAAAACAAGAAAATTATCAGAAGCAAGAACGTGATCTTCAATGAGAAAGTCTTATACAAAGACAGAAATAAAGTTGATTCAAGAAAG
ACAGAGTTAGATGTAAGCACAGAGAATACTCAGTATATAAGTCCTCCTGAGGTTGAGACTAAAACAACAGAGATTGAAGATCAGAATACAATTACTCTTGAAGAA
ACAACTGTGGAATTTGATGAACAAGTTGATGAACTTGATAAACCAGTTGTGAAAACTGATCAGGTCCTTCCTCACCAAGTGACACTCAAGCTCGACTTAATACAC
GATCTTGGATAG
mRNA sequenceShow/hide mRNA sequence
ATGATGACTGGAGAAGATAAATTGGTTATTTTTTATGGAACTGATTTTACATACTGGAAGGATCAGATAGTAGATTATCTACATTCAAAGGAATTGGAATTGTCA
TTAGACAAGAAGCCAGATGACATGGAAGAAGCCAAATGGAAAAAGTTGGACAGGAAGGTGTTGGGTACGATTCGCCTAACATTAACAAAAAATGTGCAGAGCAGC
GTAGCTAAGGAGACTACCACAATGGGGTTGATGAGTGCATTGTCCAACATATATGAGAAGCCCTCAGTAAATAATAAGGTGTATCTCGCAACTAAATTTTTTAAT
TTGAAGATGGATGAAGGTACATCTGTAGCTGCCCATATAAATGAATTTGATACGTTGATTAACAAACTGGTTGCTGTGGATTTAACATTTATGGATGAATTAAAT
GCTATCTTGTTGTTGAGATCTTTACCTAACAGTTGGGAGCCTATGAAGGCAGCTATTTCAAATTCTTGGGGAAAAGAGAAATTGAAATTTGCAGATGTCAGAGAT
GCAGCTCTTGGAGAGGAGATTCGCAGAAAGGATTCTGGTATTGCGTCTACTTCTGGTACAGCATTGAATGTGGACAGAGGAAGAAATAATAACAGAGACCACGTA
AAACGTGGAAAGTCAAGAAACAACAGAAGCAAGTCCAAAAACAGCAGACTAGAATGTTGGAATTGTGGTAAGACAGGACATCTGAGGAGGAACTGCAAAGCTCCA
AAGAAAGCTGAGGGTAAAGAAGCTGGTGCAAATGTTGTTGCTGAAGAAATACATGATGCTCTAGTTCTTACAGTTGAGGGCGCTCATAACACATGGGTGGTGGAT
TCAGGTGCGTCTTTTCATACAACAGGACAACGTGACATTCTTGAGAATTATGTTGCAGGAAATCATGGAAAGGTGTATCTTGCTGATGGAAAGCCTTTGGACATC
ATTGGGATTGGTGACGTTAATTTAAAAGTGGCGAACGATTCAGTCTGGATGATTCGCAAGGTACGTCACGTTCAGAATATGATGAAGAACCTGATTTCCGTGGGG
CAGCTGGATAATGAAGGATGTGAAATATCCTTCGGTCAAGGAAACTGGAAAGTTACAAAGGGTTCGATGGTGATCGCTCGAGGAAGAAAGTTAGGAACTTTGTAT
GTCAACGACAACGACAAAGATATGATAGCTGTTGTAGATCATTCAAGTCAGACCCAAATATGGCAAAGTAGGCTGGGACATATGAGTGAAAAAGCTGAGATAGGT
TACAGATTTTGGGATGACCAAAACAAGAAAATTATCAGAAGCAAGAACGTGATCTTCAATGAGAAAGTCTTATACAAAGACAGAAATAAAGTTGATTCAAGAAAG
ACAGAGTTAGATGTAAGCACAGAGAATACTCAGTATATAAGTCCTCCTGAGGTTGAGACTAAAACAACAGAGATTGAAGATCAGAATACAATTACTCTTGAAGAA
ACAACTGTGGAATTTGATGAACAAGTTGATGAACTTGATAAACCAGTTGTGAAAACTGATCAGGTCCTTCCTCACCAAGTGACACTCAAGCTCGACTTAATACAC
GATCTTGGATAG
Protein sequenceShow/hide protein sequence
MMTGEDKLVIFYGTDFTYWKDQIVDYLHSKELELSLDKKPDDMEEAKWKKLDRKVLGTIRLTLTKNVQSSVAKETTTMGLMSALSNIYEKPSVNNKVYLATKFFN
LKMDEGTSVAAHINEFDTLINKLVAVDLTFMDELNAILLLRSLPNSWEPMKAAISNSWGKEKLKFADVRDAALGEEIRRKDSGIASTSGTALNVDRGRNNNRDHV
KRGKSRNNRSKSKNSRLECWNCGKTGHLRRNCKAPKKAEGKEAGANVVAEEIHDALVLTVEGAHNTWVVDSGASFHTTGQRDILENYVAGNHGKVYLADGKPLDI
IGIGDVNLKVANDSVWMIRKVRHVQNMMKNLISVGQLDNEGCEISFGQGNWKVTKGSMVIARGRKLGTLYVNDNDKDMIAVVDHSSQTQIWQSRLGHMSEKAEIG
YRFWDDQNKKIIRSKNVIFNEKVLYKDRNKVDSRKTELDVSTENTQYISPPEVETKTTEIEDQNTITLEETTVEFDEQVDELDKPVVKTDQVLPHQVTLKLDLIH
DLG