CuGenDBv2

Cucsat.G5016 (gene) of Cucumber (B10) v3 genome

Gene ID: Cucsat.G5016
Organism: Cucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: ctg1227:1989189..1993183
RNA-Seq Expression: Cucsat.G5016
Synteny: Cucsat.G5016
Gene Ontology terms:
GO:0031347 - regulation of defense response (biological process)
GO:0006631 - fatty acid metabolic process (biological process)
GO:0006979 - response to oxidative stress (biological process)
GO:0016122 - xanthophyll metabolic process (biological process)
GO:0009266 - response to temperature stimulus (biological process)
GO:0009644 - response to high light intensity (biological process)
GO:0009915 - phloem sucrose loading (biological process)
GO:0010189 - vitamin E biosynthetic process (biological process)
GO:0015994 - chlorophyll metabolic process (biological process)
GO:0015074 - DNA integration (biological process)
GO:0000325 - plant-type vacuole (cellular component)
GO:0043231 - intracellular membrane-bounded organelle (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0010287 - plastoglobule (cellular component)
GO:0005737 - cytoplasm (cellular component)
GO:0009976 - tocopherol cyclase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domains:
IPR001878 - Zinc finger, CCHC-type
IPR025724 - GAG-pre-integrase domain
IPR036875 - Zinc finger, CCHC-type superfamily


Homology
GenBank top hits (e value; % identity; alignment)
KAA0033068.1 gag-pol polyprotein [Cucumis melo var. makuwa] (e value: 9.36e-198; % identity: 70.25)
Query:  MDDHMTEDPPKDAKQKKDWLRDDARLYLQIKNSIESEIIGLVDHCESVKELLEFLDFLYSGKEQVHRMFEVCMQFFRAEQKAESVTSYFMRLKKIIAELG
        MDDHMTED PKDAK+KKDWLRDDARLYLQIKNSIESEIIGLV          ++++               C++FFRAEQKAESVT+YFMRLKKI A L 
Subjt:  MDDHMTEDPPKDAKQKKDWLRDDARLYLQIKNSIESEIIGLVDHCESVKELLEFLDFLYSGKEQVHRMFEVCMQFFRAEQKAESVTSYFMRLKKIIAELG

Query:  LLLPFSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDDAFTRVLRIESSPTSVSIPQPSSALFSKNNNPRAP-------QRNSTDHRK
        LLLPFSPDVKVQQ QREKM V IFLNGLLPEFGM K QILSDSKIPSLDDAFTRVLRIESSP  VSIPQ SSAL SKNNNPRAP       QR S DHRK
Subjt:  LLLPFSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDDAFTRVLRIESSPTSVSIPQPSSALFSKNNNPRAP-------QRNSTDHRK

Query:  PESVEIVCNYCRKPGHMKRDCRKLLYKNSQRSQHAQIASTCDIPEASVTISADEFAKFQNYQESLQASSSSTPIASTVAPGNIKCLLTSSTKWVIDSGAT
        P S +IVCNYC KPGH+KRDCRKLLYKNSQ+SQHAQIASTCDIPEASVTISADE+ KFQNYQ+ LQASSSSTPIASTVAPGN KCLLTSSTKWVIDS AT
Subjt:  PESVEIVCNYCRKPGHMKRDCRKLLYKNSQRSQHAQIASTCDIPEASVTISADEFAKFQNYQESLQASSSSTPIASTVAPGNIKCLLTSSTKWVIDSGAT

Query:  AHMTGNSHLFSRPLSPAPFPSVTLADGSTSSVLGSGTIHLTPSFSLSSVLHLPNLSFNLISTSQLTHDLNCVVMFFSGYCLFQDRVTKKIIGRGYESGGL
         HMTGNS LFSRPLSPAPFPSVTL                                                           DRV KKIIGRGYESGGL
Subjt:  AHMTGNSHLFSRPLSPAPFPSVTLADGSTSSVLGSGTIHLTPSFSLSSVLHLPNLSFNLISTSQLTHDLNCVVMFFSGYCLFQDRVTKKIIGRGYESGGL

Query:  YFFDHQVSQAVACPVVPSPFEVHCRLGHPSLFVLKKLYPEFRSLSSL
        Y FDHQVSQAVAC VVPSPFEVHCRLGHPSLFVLKKLYPEFR  SSL
Subjt:  YFFDHQVSQAVACPVVPSPFEVHCRLGHPSLFVLKKLYPEFRSLSSL

KAG8636760.1 hypothetical protein MANES_15G035050v8 [Manihot esculenta] (e value: 4.33e-187; % identity: 53.71)
Query:  MADIKNLVVSNVIPLASKITEHKLNGSNYYDWRRTILFYLRSTDMDDHMTEDPPKDAKQKKDWLRDDARLYLQIKNSIESEIIGLVDHCESVKELLEFLD
        MA++K   V++VIP+ +KITEHKLNGSN+ DW +TI  YLRS  MDDH+T+DPP D + ++DW+RDDARL+LQI+NSI SE+I L+++CE VK+L+E+L 
Subjt:  MADIKNLVVSNVIPLASKITEHKLNGSNYYDWRRTILFYLRSTDMDDHMTEDPPKDAKQKKDWLRDDARLYLQIKNSIESEIIGLVDHCESVKELLEFLD

Query:  FLYSGKEQVHRMFEVCMQFFRAEQKAESVTSYFMRLKKIIAELGLLLPFSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDDAFTRVL
        FLYS KE +  +++VC  F+R ++  +++TSYFM  K++  EL +L+PFS DVK QQ QRE+M VM FL GL  EF  AK+ IL DS+I SL D F RVL
Subjt:  FLYSGKEQVHRMFEVCMQFFRAEQKAESVTSYFMRLKKIIAELGLLLPFSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDDAFTRVL

Query:  RIESSPTSVSIPQPSSALFSKNN----NPRAPQRNSTDHRK-----------PESVEIVCNYCRKPGHMKRDCRKLLYKNSQRSQHAQIASTCDIPEASV
        R +S   S     P+SAL S+N+    N R  QR   +  K            +S  I+C YCR+PGH K+ C+KL  KN QR+Q A +A      +  +
Subjt:  RIESSPTSVSIPQPSSALFSKNN----NPRAPQRNSTDHRK-----------PESVEIVCNYCRKPGHMKRDCRKLLYKNSQRSQHAQIASTCDIPEASV

Query:  TISADEFAKFQNYQESLQASSSSTPIASTVAPGNIKCLLTSSTKWVIDSGATAHMTGNSHLFSRPLSPAPFPSVTLADGSTSSVLGSGTIHLTPSFSLSS
         IS DE+A+F  YQ SL++S+SS+  A   +  +  CL++SS+KWVIDSGAT HM+GNS L S   S A    VTLADG+ S V+ SG ++LTPS S+SS
Subjt:  TISADEFAKFQNYQESLQASSSSTPIASTVAPGNIKCLLTSSTKWVIDSGATAHMTGNSHLFSRPLSPAPFPSVTLADGSTSSVLGSGTIHLTPSFSLSS

Query:  VLHLPNLSFNLISTSQLTHDLNCVVMFFSGYCLFQDRVTKKIIGRGYESGGLYFFDHQVSQAVACPVVPSPFEVHCRLGHPSLFVLKKLYPEFRSLSSLN
        VL LP  +FNL+S S+LT  LNC   FF  +C+FQD +TK+IIGRG ES GLY  D Q+ +++AC    +PF VHCRL HPSL  LKKLYP+F SLS L+
Subjt:  VLHLPNLSFNLISTSQLTHDLNCVVMFFSGYCLFQDRVTKKIIGRGYESGGLYFFDHQVSQAVACPVVPSPFEVHCRLGHPSLFVLKKLYPEFRSLSSLN

Query:  CDSCQFAKFHRLSSSPRVDKRAIAPFELVHSDIWGPCPVVSQTGFRYFVTFVD
        C+SCQFAK H L S  RV+KRA +PFELVHSD+WGPCP+VS++GF+YFVTFVD
Subjt:  CDSCQFAKFHRLSSSPRVDKRAIAPFELVHSDIWGPCPVVSQTGFRYFVTFVD

RVW38649.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera] (e value: 1.96e-184; % identity: 53.45)
Query:  MADIKNLVVSNVIPLASKITEHKLNGSNYYDWRRTILFYLRSTDMDDHMTEDPPKDAKQKKDWLRDDARLYLQIKNSIESEIIGLVDHCESVKELLEFLD
        M + KN V ++++P+ SKITEHKLNGSNY +W +TI  YLRS   DDH+TE+PP D  +K  W++DDARL+LQ+KNSI S+I+GL+ HCE VKEL+++LD
Subjt:  MADIKNLVVSNVIPLASKITEHKLNGSNYYDWRRTILFYLRSTDMDDHMTEDPPKDAKQKKDWLRDDARLYLQIKNSIESEIIGLVDHCESVKELLEFLD

Query:  FLYSGKEQVHRMFEVCMQFFRAEQKAESVTSYFMRLKKIIAELGLLLPFSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDDAFTRVL
        FLYSGK  V RM++V   F   E+ A+S+T+YFM  KK+  EL  L+PFSPDV+VQQ QRE+MAVM FL+GL  EF  AK+QILS S I SL + F+RVL
Subjt:  FLYSGKEQVHRMFEVCMQFFRAEQKAESVTSYFMRLKKIIAELGLLLPFSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDDAFTRVL

Query:  RIESSPTSVSIPQPSSALFSKNNNPRAPQR-------NSTDHRKPESVEIVCNYCRKPGHMKRDCRKLLYKNSQRSQHAQIAST-----CDIPEASVTIS
        R E    +VS  Q ++ L +K  N    +R        + ++R  +S  IVC YC + GH K++C+KL  +N +R Q A +A++      D     VT++
Subjt:  RIESSPTSVSIPQPSSALFSKNNNPRAPQR-------NSTDHRKPESVEIVCNYCRKPGHMKRDCRKLLYKNSQRSQHAQIAST-----CDIPEASVTIS

Query:  ADEFAKFQNYQESLQASSSSTPIASTVAPGNIKCLLTSSTKWVIDSGATAHMTGNSHLFSRPLSPAPFPSVTLADGSTSSVLGSGTIHLTPSFSLSSVLH
        A+EF+K+  YQ++L+AS   TP+++    G   CL++SS KW+IDSGAT HMTGN   FS   + +  P VT+ADGST  + GSGT+  T S +LSSVL+
Subjt:  ADEFAKFQNYQESLQASSSSTPIASTVAPGNIKCLLTSSTKWVIDSGATAHMTGNSHLFSRPLSPAPFPSVTLADGSTSSVLGSGTIHLTPSFSLSSVLH

Query:  LPNLSFNLISTSQLTHDLNCVVMFFSGYCLFQDRVTKKIIGRGYESGGLYFFDHQVSQAVACPVVPSPFEVHCRLGHPSLFVLKKLYPEFRSLSSLNCDS
        LPNL+FNLIS S+LT +LNC V FF  +C+FQD +TK+  G+G+ S GLY  D  V + VAC    SP E HCRLGHPSL  LKKL P+F +L SL+C+S
Subjt:  LPNLSFNLISTSQLTHDLNCVVMFFSGYCLFQDRVTKKIIGRGYESGGLYFFDHQVSQAVACPVVPSPFEVHCRLGHPSLFVLKKLYPEFRSLSSLNCDS

Query:  CQFAKFHRLSSSPRVDKRAIAPFELVHSDIWGPCPVVSQTGFRYFVTFVD
        C FAK HR S  PR++KRA + FELVHSD+WGPCPV SQTGFRYFVTFVD
Subjt:  CQFAKFHRLSSSPRVDKRAIAPFELVHSDIWGPCPVVSQTGFRYFVTFVD

RVW84740.1 Retrovirus-related Pol polyprotein from transposon RE2 [Vitis vinifera] (e value: 2.34e-182; % identity: 52.91)
Query:  MADIKNLVVSNVIPLASKITEHKLNGSNYYDWRRTILFYLRSTDMDDHMTEDPPKDAKQKKDWLRDDARLYLQIKNSIESEIIGLVDHCESVKELLEFLD
        M + KN V ++++P+ SKITEHKLNGSNY +W +TI  YLRS   DDH+TE+PP D  +K  W++DDARL+LQ+KNSI S+I+GL+ HCE VKEL+++LD
Subjt:  MADIKNLVVSNVIPLASKITEHKLNGSNYYDWRRTILFYLRSTDMDDHMTEDPPKDAKQKKDWLRDDARLYLQIKNSIESEIIGLVDHCESVKELLEFLD

Query:  FLYSGKEQVHRMFEVCMQFFRAEQKAESVTSYFMRLKKIIAELGLLLPFSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDDAFTRVL
        FLYSGK  V RM++V   F   E++A+S+T+YFM  KK+  EL  L+PFSPDV+VQQ QRE+MAVM FL+GL  EF  AK+QILS S I SL + F+RVL
Subjt:  FLYSGKEQVHRMFEVCMQFFRAEQKAESVTSYFMRLKKIIAELGLLLPFSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDDAFTRVL

Query:  RIESSPTSVSIPQPSSALFSKNNNPRAPQ-------RNSTDHRKPESVEIVCNYCRKPGHMKRDCRKLLYKNSQRSQHAQIAST-----CDIPEASVTIS
        R ++ P+S    Q ++ L +K  N    +         + ++R  +S  IVC YC K GH K++ RKL  +N +R Q A +A++      D  +  VT++
Subjt:  RIESSPTSVSIPQPSSALFSKNNNPRAPQ-------RNSTDHRKPESVEIVCNYCRKPGHMKRDCRKLLYKNSQRSQHAQIAST-----CDIPEASVTIS

Query:  ADEFAKFQNYQESLQASSSSTPIASTVAPGNIKCLLTSSTKWVIDSGATAHMTGNSHLFSRPLSPAPFPSVTLADGSTSSVLGSGTIHLTPSFSLSSVLH
        A+EFAK+  YQ++L+AS   TP+++    G   CL++SS KW+IDSGAT HMTGN   FS   + +  P VT+AD ST  + GSG +  T S +LSSVL+
Subjt:  ADEFAKFQNYQESLQASSSSTPIASTVAPGNIKCLLTSSTKWVIDSGATAHMTGNSHLFSRPLSPAPFPSVTLADGSTSSVLGSGTIHLTPSFSLSSVLH

Query:  LPNLSFNLISTSQLTHDLNCVVMFFSGYCLFQDRVTKKIIGRGYESGGLYFFDHQVSQAVACPVVPSPFEVHCRLGHPSLFVLKKLYPEFRSLSSLNCDS
        LPNL+FNLIS S+LT +LNC V FF  +C+FQD +TK+  G+G+ S GLY  D  V + VAC    SP E HCRLGHPSL VLKKL P+F +L SL+C+S
Subjt:  LPNLSFNLISTSQLTHDLNCVVMFFSGYCLFQDRVTKKIIGRGYESGGLYFFDHQVSQAVACPVVPSPFEVHCRLGHPSLFVLKKLYPEFRSLSSLNCDS

Query:  CQFAKFHRLSSSPRVDKRAIAPFELVHSDIWGPCPVVSQTGFRYFVTFVD
        C FAK HR S  PR++KRA + FELVHSD+WG CPV S+TGFRYFVTFVD
Subjt:  CQFAKFHRLSSSPRVDKRAIAPFELVHSDIWGPCPVVSQTGFRYFVTFVD

XP_031744753.1 uncharacterized protein LOC101212255 isoform X1 [Cucumis sativus] (e value: 0.0; % identity: 99.81)
Query:  MADIKNLVVSNVIPLASKITEHKLNGSNYYDWRRTILFYLRSTDMDDHMTEDPPKDAKQKKDWLRDDARLYLQIKNSIESEIIGLVDHCESVKELLEFLD
        MADIKNLVVSNVIPLASKITEHKLNGSNYYDWRRTILFYLRSTDMDDHMTEDPPKDAKQKKDWLRDDARLYLQIKNSIESEIIGLVDHCESVKELLEFLD
Subjt:  MADIKNLVVSNVIPLASKITEHKLNGSNYYDWRRTILFYLRSTDMDDHMTEDPPKDAKQKKDWLRDDARLYLQIKNSIESEIIGLVDHCESVKELLEFLD

Query:  FLYSGKEQVHRMFEVCMQFFRAEQKAESVTSYFMRLKKIIAELGLLLPFSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDDAFTRVL
        FLYSGKEQVHRMFEVCMQFFRAEQKAESVTSYFMRLKKIIAELGLLLPFSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDDAFTRVL
Subjt:  FLYSGKEQVHRMFEVCMQFFRAEQKAESVTSYFMRLKKIIAELGLLLPFSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDDAFTRVL

Query:  RIESSPTSVSIPQPSSALFSKNNNPRAPQRNSTDHRKPESVEIVCNYCRKPGHMKRDCRKLLYKNSQRSQHAQIASTCDIPEASVTISADEFAKFQNYQE
        RIESSPTSVSIPQPSSALFSKNNNPRAPQRNSTDHRKPESVEIVCNYCRKPGHMKRDCRKLLYKNSQRSQHAQIASTCDIPEASVTISADEFAKFQNYQE
Subjt:  RIESSPTSVSIPQPSSALFSKNNNPRAPQRNSTDHRKPESVEIVCNYCRKPGHMKRDCRKLLYKNSQRSQHAQIASTCDIPEASVTISADEFAKFQNYQE

Query:  SLQASSSSTPIASTVAPGNIKCLLTSSTKWVIDSGATAHMTGNSHLFSRPLSPAPFPSVTLADGSTSSVLGSGTIHLTPSFSLSSVLHLPNLSFNLISTS
        SLQASSSSTPIASTVAPGNIKCLLTSSTKWVIDSGATAHMTGNSHLFSRPLSPAPFPSVTLADGSTSSVLGSGTIHLTPSFSLSSVLHLPNLSFNLISTS
Subjt:  SLQASSSSTPIASTVAPGNIKCLLTSSTKWVIDSGATAHMTGNSHLFSRPLSPAPFPSVTLADGSTSSVLGSGTIHLTPSFSLSSVLHLPNLSFNLISTS

Query:  QLTHDLNCVVMFFSGYCLFQDRVTKKIIGRGYESGGLYFFDHQVSQAVACPVVPSPFEVHCRLGHPSLFVLKKLYPEFRSLSSLNCDSCQFAKFHRLSSS
        QLTHDLNCVVMFFSGYCLFQDRVTKKIIGRGYESGGLY FDHQVSQAVACPVVPSPFEVHCRLGHPSLFVLKKLYPEFRSLSSLNCDSCQFAKFHRLSSS
Subjt:  QLTHDLNCVVMFFSGYCLFQDRVTKKIIGRGYESGGLYFFDHQVSQAVACPVVPSPFEVHCRLGHPSLFVLKKLYPEFRSLSSLNCDSCQFAKFHRLSSS

Query:  PRVDKRAIAPFELVHSDIWGPCPVVSQTGFRYFVTFVD
        PRVDKRAIAPFELVHSDIWGPCPVVSQTGFRYFVTFVD
Subjt:  PRVDKRAIAPFELVHSDIWGPCPVVSQTGFRYFVTFVD

TrEMBL top hits (e value; % identity; alignment)
A0A438DT29 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 9.48e-185; % identity: 53.45)
Query:  MADIKNLVVSNVIPLASKITEHKLNGSNYYDWRRTILFYLRSTDMDDHMTEDPPKDAKQKKDWLRDDARLYLQIKNSIESEIIGLVDHCESVKELLEFLD
        M + KN V ++++P+ SKITEHKLNGSNY +W +TI  YLRS   DDH+TE+PP D  +K  W++DDARL+LQ+KNSI S+I+GL+ HCE VKEL+++LD
Subjt:  MADIKNLVVSNVIPLASKITEHKLNGSNYYDWRRTILFYLRSTDMDDHMTEDPPKDAKQKKDWLRDDARLYLQIKNSIESEIIGLVDHCESVKELLEFLD

Query:  FLYSGKEQVHRMFEVCMQFFRAEQKAESVTSYFMRLKKIIAELGLLLPFSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDDAFTRVL
        FLYSGK  V RM++V   F   E+ A+S+T+YFM  KK+  EL  L+PFSPDV+VQQ QRE+MAVM FL+GL  EF  AK+QILS S I SL + F+RVL
Subjt:  FLYSGKEQVHRMFEVCMQFFRAEQKAESVTSYFMRLKKIIAELGLLLPFSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDDAFTRVL

Query:  RIESSPTSVSIPQPSSALFSKNNNPRAPQR-------NSTDHRKPESVEIVCNYCRKPGHMKRDCRKLLYKNSQRSQHAQIAST-----CDIPEASVTIS
        R E    +VS  Q ++ L +K  N    +R        + ++R  +S  IVC YC + GH K++C+KL  +N +R Q A +A++      D     VT++
Subjt:  RIESSPTSVSIPQPSSALFSKNNNPRAPQR-------NSTDHRKPESVEIVCNYCRKPGHMKRDCRKLLYKNSQRSQHAQIAST-----CDIPEASVTIS

Query:  ADEFAKFQNYQESLQASSSSTPIASTVAPGNIKCLLTSSTKWVIDSGATAHMTGNSHLFSRPLSPAPFPSVTLADGSTSSVLGSGTIHLTPSFSLSSVLH
        A+EF+K+  YQ++L+AS   TP+++    G   CL++SS KW+IDSGAT HMTGN   FS   + +  P VT+ADGST  + GSGT+  T S +LSSVL+
Subjt:  ADEFAKFQNYQESLQASSSSTPIASTVAPGNIKCLLTSSTKWVIDSGATAHMTGNSHLFSRPLSPAPFPSVTLADGSTSSVLGSGTIHLTPSFSLSSVLH

Query:  LPNLSFNLISTSQLTHDLNCVVMFFSGYCLFQDRVTKKIIGRGYESGGLYFFDHQVSQAVACPVVPSPFEVHCRLGHPSLFVLKKLYPEFRSLSSLNCDS
        LPNL+FNLIS S+LT +LNC V FF  +C+FQD +TK+  G+G+ S GLY  D  V + VAC    SP E HCRLGHPSL  LKKL P+F +L SL+C+S
Subjt:  LPNLSFNLISTSQLTHDLNCVVMFFSGYCLFQDRVTKKIIGRGYESGGLYFFDHQVSQAVACPVVPSPFEVHCRLGHPSLFVLKKLYPEFRSLSSLNCDS

Query:  CQFAKFHRLSSSPRVDKRAIAPFELVHSDIWGPCPVVSQTGFRYFVTFVD
        C FAK HR S  PR++KRA + FELVHSD+WGPCPV SQTGFRYFVTFVD
Subjt:  CQFAKFHRLSSSPRVDKRAIAPFELVHSDIWGPCPVVSQTGFRYFVTFVD

A0A438HJT7 Retrovirus-related Pol polyprotein from transposon RE2 (e value: 1.13e-182; % identity: 52.91)
Query:  MADIKNLVVSNVIPLASKITEHKLNGSNYYDWRRTILFYLRSTDMDDHMTEDPPKDAKQKKDWLRDDARLYLQIKNSIESEIIGLVDHCESVKELLEFLD
        M + KN V ++++P+ SKITEHKLNGSNY +W +TI  YLRS   DDH+TE+PP D  +K  W++DDARL+LQ+KNSI S+I+GL+ HCE VKEL+++LD
Subjt:  MADIKNLVVSNVIPLASKITEHKLNGSNYYDWRRTILFYLRSTDMDDHMTEDPPKDAKQKKDWLRDDARLYLQIKNSIESEIIGLVDHCESVKELLEFLD

Query:  FLYSGKEQVHRMFEVCMQFFRAEQKAESVTSYFMRLKKIIAELGLLLPFSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDDAFTRVL
        FLYSGK  V RM++V   F   E++A+S+T+YFM  KK+  EL  L+PFSPDV+VQQ QRE+MAVM FL+GL  EF  AK+QILS S I SL + F+RVL
Subjt:  FLYSGKEQVHRMFEVCMQFFRAEQKAESVTSYFMRLKKIIAELGLLLPFSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDDAFTRVL

Query:  RIESSPTSVSIPQPSSALFSKNNNPRAPQ-------RNSTDHRKPESVEIVCNYCRKPGHMKRDCRKLLYKNSQRSQHAQIAST-----CDIPEASVTIS
        R ++ P+S    Q ++ L +K  N    +         + ++R  +S  IVC YC K GH K++ RKL  +N +R Q A +A++      D  +  VT++
Subjt:  RIESSPTSVSIPQPSSALFSKNNNPRAPQ-------RNSTDHRKPESVEIVCNYCRKPGHMKRDCRKLLYKNSQRSQHAQIAST-----CDIPEASVTIS

Query:  ADEFAKFQNYQESLQASSSSTPIASTVAPGNIKCLLTSSTKWVIDSGATAHMTGNSHLFSRPLSPAPFPSVTLADGSTSSVLGSGTIHLTPSFSLSSVLH
        A+EFAK+  YQ++L+AS   TP+++    G   CL++SS KW+IDSGAT HMTGN   FS   + +  P VT+AD ST  + GSG +  T S +LSSVL+
Subjt:  ADEFAKFQNYQESLQASSSSTPIASTVAPGNIKCLLTSSTKWVIDSGATAHMTGNSHLFSRPLSPAPFPSVTLADGSTSSVLGSGTIHLTPSFSLSSVLH

Query:  LPNLSFNLISTSQLTHDLNCVVMFFSGYCLFQDRVTKKIIGRGYESGGLYFFDHQVSQAVACPVVPSPFEVHCRLGHPSLFVLKKLYPEFRSLSSLNCDS
        LPNL+FNLIS S+LT +LNC V FF  +C+FQD +TK+  G+G+ S GLY  D  V + VAC    SP E HCRLGHPSL VLKKL P+F +L SL+C+S
Subjt:  LPNLSFNLISTSQLTHDLNCVVMFFSGYCLFQDRVTKKIIGRGYESGGLYFFDHQVSQAVACPVVPSPFEVHCRLGHPSLFVLKKLYPEFRSLSSLNCDS

Query:  CQFAKFHRLSSSPRVDKRAIAPFELVHSDIWGPCPVVSQTGFRYFVTFVD
        C FAK HR S  PR++KRA + FELVHSD+WG CPV S+TGFRYFVTFVD
Subjt:  CQFAKFHRLSSSPRVDKRAIAPFELVHSDIWGPCPVVSQTGFRYFVTFVD

A0A5A7SR90 Gag-pol polyprotein (e value: 4.53e-198; % identity: 70.25)
Query:  MDDHMTEDPPKDAKQKKDWLRDDARLYLQIKNSIESEIIGLVDHCESVKELLEFLDFLYSGKEQVHRMFEVCMQFFRAEQKAESVTSYFMRLKKIIAELG
        MDDHMTED PKDAK+KKDWLRDDARLYLQIKNSIESEIIGLV          ++++               C++FFRAEQKAESVT+YFMRLKKI A L 
Subjt:  MDDHMTEDPPKDAKQKKDWLRDDARLYLQIKNSIESEIIGLVDHCESVKELLEFLDFLYSGKEQVHRMFEVCMQFFRAEQKAESVTSYFMRLKKIIAELG

Query:  LLLPFSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDDAFTRVLRIESSPTSVSIPQPSSALFSKNNNPRAP-------QRNSTDHRK
        LLLPFSPDVKVQQ QREKM V IFLNGLLPEFGM K QILSDSKIPSLDDAFTRVLRIESSP  VSIPQ SSAL SKNNNPRAP       QR S DHRK
Subjt:  LLLPFSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDDAFTRVLRIESSPTSVSIPQPSSALFSKNNNPRAP-------QRNSTDHRK

Query:  PESVEIVCNYCRKPGHMKRDCRKLLYKNSQRSQHAQIASTCDIPEASVTISADEFAKFQNYQESLQASSSSTPIASTVAPGNIKCLLTSSTKWVIDSGAT
        P S +IVCNYC KPGH+KRDCRKLLYKNSQ+SQHAQIASTCDIPEASVTISADE+ KFQNYQ+ LQASSSSTPIASTVAPGN KCLLTSSTKWVIDS AT
Subjt:  PESVEIVCNYCRKPGHMKRDCRKLLYKNSQRSQHAQIASTCDIPEASVTISADEFAKFQNYQESLQASSSSTPIASTVAPGNIKCLLTSSTKWVIDSGAT

Query:  AHMTGNSHLFSRPLSPAPFPSVTLADGSTSSVLGSGTIHLTPSFSLSSVLHLPNLSFNLISTSQLTHDLNCVVMFFSGYCLFQDRVTKKIIGRGYESGGL
         HMTGNS LFSRPLSPAPFPSVTL                                                           DRV KKIIGRGYESGGL
Subjt:  AHMTGNSHLFSRPLSPAPFPSVTLADGSTSSVLGSGTIHLTPSFSLSSVLHLPNLSFNLISTSQLTHDLNCVVMFFSGYCLFQDRVTKKIIGRGYESGGL

Query:  YFFDHQVSQAVACPVVPSPFEVHCRLGHPSLFVLKKLYPEFRSLSSL
        Y FDHQVSQAVAC VVPSPFEVHCRLGHPSLFVLKKLYPEFR  SSL
Subjt:  YFFDHQVSQAVACPVVPSPFEVHCRLGHPSLFVLKKLYPEFRSLSSL

A5BI89 Uncharacterized protein (e value: 6.87e-178; % identity: 53.75)
Query:  MADIKNLVVSNVIPLASKITEHKLNGSNYYDWRRTILFYLRSTDMDDHMTEDPPKDAKQKKDWLRDDARLYLQIKNSIESEIIGLVDHCESVKELLEFLD
        M + KN V ++++P+ SKITEHKLNGSNY +W +TI  YLRS   DDH+TE+PP D  +K  W++DDA L+LQ+KNSI S+I+GL+ HCE VKEL+++LD
Subjt:  MADIKNLVVSNVIPLASKITEHKLNGSNYYDWRRTILFYLRSTDMDDHMTEDPPKDAKQKKDWLRDDARLYLQIKNSIESEIIGLVDHCESVKELLEFLD

Query:  FLYSGKEQVHRMFEVCMQFFRAEQKAESVTSYFMRLKKIIAELGLLLPFSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDDAFTRVL
        FLYSGK  V RM++V   F   E+ A+S+T+YFM  KK+  EL  L+PFSPDV+VQQ QRE+MAVM FL+GL  EF  AK+QILS S I SL + F+RVL
Subjt:  FLYSGKEQVHRMFEVCMQFFRAEQKAESVTSYFMRLKKIIAELGLLLPFSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDDAFTRVL

Query:  RIE----SSPTSVSIPQPSSALFSKNNNPRAPQRNSTDHRKPESVEIVCNYCRKPGHMKRDCRKLLYKNSQRSQHAQIAST-----CDIPEASVTISADE
        R E    S  T+V + +  +A  ++  N R   R   +     S  IVC YC + GH K++CRKL  +N +R Q A +A++      D     VT++A+E
Subjt:  RIE----SSPTSVSIPQPSSALFSKNNNPRAPQRNSTDHRKPESVEIVCNYCRKPGHMKRDCRKLLYKNSQRSQHAQIAST-----CDIPEASVTISADE

Query:  FAKFQNYQESLQASSSSTPIASTVAPGNIKCLLTSSTKWVIDSGATAHMTGNSHLFSRPLSPAPFPSVTLADGSTSSVLGSGTIHLTPSFSLSSVLHLPN
        F+K+  YQ++L+AS   TP+++    G   CL++SS KW+IDSGAT HMTGN   FS   + +  P VT+ADGST  + GSGT+  T S +LSSVL+LPN
Subjt:  FAKFQNYQESLQASSSSTPIASTVAPGNIKCLLTSSTKWVIDSGATAHMTGNSHLFSRPLSPAPFPSVTLADGSTSSVLGSGTIHLTPSFSLSSVLHLPN

Query:  LSFNLISTSQLTHDLNCVVMFFSGYCLFQDRVTKKIIGRGYESGGLYFFDHQVSQAVACPVVPSPFEVHCRLGHPSLFVLKKLYPEFRSLSSLNCDSCQF
        L+FNLIS S+LT +LNC V FF  +C+FQD +TK+  G+G+ S GLY  D  V + VAC    SP E HCRLGHPSL VLKKL P+F +L SL+C+SC F
Subjt:  LSFNLISTSQLTHDLNCVVMFFSGYCLFQDRVTKKIIGRGYESGGLYFFDHQVSQAVACPVVPSPFEVHCRLGHPSLFVLKKLYPEFRSLSSLNCDSCQF

Query:  AKFHRLSSSPRVDKRAIAPFELVHSDIWGPCPVVSQTGFRYFVTFVD
        AK HR S  PR++KRA + FELVHSD+WGPCPV SQTGFRYFVTFVD
Subjt:  AKFHRLSSSPRVDKRAIAPFELVHSDIWGPCPVVSQTGFRYFVTFVD

B0FBS2 Uncharacterized protein (e value: 3.73e-177; % identity: 54.11)
Query:  MADIKNLVVSNVIPLASKITEHKLNGSNYYDWRRTILFYLRSTDMDDHMTEDPPKDAKQKKDWLRDDARLYLQIKNSIESEIIGLVDHCESVKELLEFLD
        M + KN V ++++P+ SKITEHKLNGSNY +W +TI  YLRS   DDH+TE+PP D  +K  W++DDARL+LQ+KNSI S+I+GL+ HCE VKEL+++LD
Subjt:  MADIKNLVVSNVIPLASKITEHKLNGSNYYDWRRTILFYLRSTDMDDHMTEDPPKDAKQKKDWLRDDARLYLQIKNSIESEIIGLVDHCESVKELLEFLD

Query:  FLYSGKEQVHRMFEVCMQFFRAEQKAESVTSYFMRLKKIIAELGLLLPFSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDDAFTRVL
        FLYSGK  V RM++V   F   E+ A+S+T+YFM  KK+  EL  L+PFSPDV+VQQ QRE+MAVM FL+GL  EF  AK+QILS S I SL + F+RVL
Subjt:  FLYSGKEQVHRMFEVCMQFFRAEQKAESVTSYFMRLKKIIAELGLLLPFSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDDAFTRVL

Query:  RIE----SSPTSVSIPQPSSALFSKNNNPRAPQRNSTDHRKPESVEIVCNYCRKPGHMKRDCRKLLYKNSQRSQHAQIAST-----CDIPEASVTISADE
        R E    S  T+V I +  +A  ++  N R   R   +     S  IVC YC + GH K++CRKL  +N +R Q A +A++      D     VT++A+E
Subjt:  RIE----SSPTSVSIPQPSSALFSKNNNPRAPQRNSTDHRKPESVEIVCNYCRKPGHMKRDCRKLLYKNSQRSQHAQIAST-----CDIPEASVTISADE

Query:  FAKFQNYQESLQASSSSTPIASTVAPGNIKCLLTSSTKWVIDSGATAHMTGNSHLFSRPLSPAPFPSVTLADGSTSSVLGSGTIHLTPSFSLSSVLHLPN
        F+K+  YQ++L+AS   TP+++    G   CL++SS KW+IDSGAT HMTGN   FS   + +  P VT+ADGST  + GSGT+  T S +LSSVL+LPN
Subjt:  FAKFQNYQESLQASSSSTPIASTVAPGNIKCLLTSSTKWVIDSGATAHMTGNSHLFSRPLSPAPFPSVTLADGSTSSVLGSGTIHLTPSFSLSSVLHLPN

Query:  LSFNLISTSQLTHDLNCVVMFFSGYCLFQDRVTKKIIGRGYESGGLYFFDHQVSQAVACPVVPSPFEVHCRLGHPSLFVLKKLYPEFRSLSSLNCDSCQF
        L+FNLIS S+LT +LNC V FF  +C+FQD +TK+  G+G+ S GLY  D  V + VAC    SP E HCRLGHPSL VLKKL P+F +L SL+C+SC F
Subjt:  LSFNLISTSQLTHDLNCVVMFFSGYCLFQDRVTKKIIGRGYESGGLYFFDHQVSQAVACPVVPSPFEVHCRLGHPSLFVLKKLYPEFRSLSSLNCDSCQF

Query:  AKFHRLSSSPRVDKRAIAPFELVHSDIWGPCPVVSQTGFRYFVTFVD
        AK HR S  PR++KRA + FELVHSD+WGPCPV SQTGFRYFVTFVD
Subjt:  AKFHRLSSSPRVDKRAIAPFELVHSDIWGPCPVVSQTGFRYFVTFVD

SwissProt top hits (e value; % identity; alignment)
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 7.1e-21; % identity: 24.12)
Query:  KLNGSN-YYDWRRTILFYLRSTDMDDHMTEDPPK-DAKQKKDWLRDDARLYLQIKNSIESEIIGLVDHCESVKELLEFLDFLYSGKEQVHRMFEVCMQFF
        K NG N +  W+R +   L    +   +  D  K D  + +DW   D R    I+  +  +++  +   ++ + +   L+ LY  K   ++++     + 
Subjt:  KLNGSN-YYDWRRTILFYLRSTDMDDHMTEDPPK-DAKQKKDWLRDDARLYLQIKNSIESEIIGLVDHCESVKELLEFLDFLYSGKEQVHRMFEVCMQFF

Query:  RAEQKAESVTSYFMRLKKIIAELGLLLPFSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDDAFTRVL---RIESSPTS-----VSIP
            +  +  S+      +I +L  L      VK++    E+   ++ LN L   +    T IL       L D  + +L   ++   P +     ++  
Subjt:  RAEQKAESVTSYFMRLKKIIAELGLLLPFSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDDAFTRVL---RIESSPTS-----VSIP

Query:  QPSSALFSKNNNPRAPQRNSTDHRKPESVEIVCNYCRKPGHMKRDCRKLLYKNSQRSQHAQIASTCDIPEASVTISADEFAKFQNYQESLQASSSSTPIA
        +  S   S NN  R+  R  + +R    V   C  C +PGH KRDC      N ++ +        D   A++  + D    F N +E     S      
Subjt:  QPSSALFSKNNNPRAPQRNSTDHRKPESVEIVCNYCRKPGHMKRDCRKLLYKNSQRSQHAQIASTCDIPEASVTISADEFAKFQNYQESLQASSSSTPIA

Query:  STVAPGNIKCLLTSSTKWVIDSGATAHMTGNSHLFSRPLSPAPFPSVTLADGSTSSVLGSGTIHLTPSFS----LSSVLHLPNLSFNLISTSQLTHDLNC
                       ++WV+D+ A+ H T    LF R ++   F +V + + S S + G G I +  +      L  V H+P+L  NLIS   L  D   
Subjt:  STVAPGNIKCLLTSSTKWVIDSGATAHMTGNSHLFSRPLSPAPFPSVTLADGSTSSVLGSGTIHLTPSFS----LSSVLHLPNLSFNLISTSQLTHDLNC

Query:  VVMFFSGYCLFQDRVTK--KIIGRGYESGGLYFFDHQVSQAV--ACPVVPSPFEVHCRLGHPS-----LFVLKKLYPEFRSLSSLNCDSCQFAKFHRLSS
           + S +   + R+TK   +I +G   G LY  + ++ Q    A     S    H R+GH S     +   K L    +  +   CD C F K HR+S 
Subjt:  VVMFFSGYCLFQDRVTK--KIIGRGYESGGLYFFDHQVSQAV--ACPVVPSPFEVHCRLGHPS-----LFVLKKLYPEFRSLSSLNCDSCQFAKFHRLSS

Query:  SPRVDKRAIAPFELVHSDIWGPCPVVSQTGFRYFVTFVD
             +R +   +LV+SD+ GP  + S  G +YFVTF+D
Subjt:  SPRVDKRAIAPFELVHSDIWGPCPVVSQTGFRYFVTFVD

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 (e value: 1.4e-21; % identity: 24.7)
Query:  ADIKNLVVSNVIPLASKITE-HKLNGSNYYDWRRTI--LF--YLRSTDMDDHMTEDP--------PKDAKQKKDWLRDDARLYLQIKNSIESEIIGLVDH
        A  + LV++N   L   ++   KL  +NY  W R +  LF  Y  +  +D   T  P        P+       W R D  +Y  +  +I   +   V  
Subjt:  ADIKNLVVSNVIPLASKITE-HKLNGSNYYDWRRTI--LF--YLRSTDMDDHMTEDP--------PKDAKQKKDWLRDDARLYLQIKNSIESEIIGLVDH

Query:  CESVKELLEFLDFLYSGKEQVHRMFEVCMQFFRAEQKAESVTSYFMRLKKIIAELGLL-LPFSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILSDS
          +  ++ E L  +Y+     H + ++  Q  +  +  +++  Y   L     +L LL  P   D +V++V          L  L  E+     QI +  
Subjt:  CESVKELLEFLDFLYSGKEQVHRMFEVCMQFFRAEQKAESVTSYFMRLKKIIAELGLL-LPFSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILSDS

Query:  KIPSLDDAFTRVLRIESSPTSVS----IPQPSSALFSK-----NNNPRAPQRNSTDHRKPESVEIVCNYCRKPGHMKRDCRKLLYKNSQRSQHAQIASTC
          P+L +   R+L  ES   +VS    IP  ++A+  +     NNN    + N  D+R         N   KP   ++        N+Q   +      C
Subjt:  KIPSLDDAFTRVLRIESSPTSVS----IPQPSSALFSK-----NNNPRAPQRNSTDHRKPESVEIVCNYCRKPGHMKRDCRKLLYKNSQRSQHAQIASTC

Query:  DIPEASVTISADEFAKFQNYQESLQASSSSTPIASTVAPGNIKC-LLTSSTKWVIDSGATAHMTGNSHLFSRPLSPAPFPSVTLADGSTSSVLGSGTIHL
         +       SA   ++ Q++  S+ +    +P        N+      SS  W++DSGAT H+T + +  S          V +ADGST  +  +G+  L
Subjt:  DIPEASVTISADEFAKFQNYQESLQASSSSTPIASTVAPGNIKC-LLTSSTKWVIDSGATAHMTGNSHLFSRPLSPAPFPSVTLADGSTSSVLGSGTIHL

Query:  TPS---FSLSSVLHLPNLSFNLISTSQLTHDLNCVVMFFSGYCLFQDRVTKKIIGRGYESGGLYFFDHQVSQAVACPVVPSPFEV----HCRLGHPSLFV
        +      +L ++L++PN+  NLIS  +L +     V FF      +D  T   + +G     LY +    SQ V+    PS        H RLGHP+  +
Subjt:  TPS---FSLSSVLHLPNLSFNLISTSQLTHDLNCVVMFFSGYCLFQDRVTKKIIGRGYESGGLYFFDHQVSQAVACPVVPSPFEV----HCRLGHPSLFV

Query:  LKKLYPEFRSLSSLN-------CDSCQFAKFHRLSSSPRVDKRAIAPFELVHSDIWGPCPVVSQTGFRYFVTFVD
        L  +   + SLS LN       C  C   K +++  S +    +  P E ++SD+W   P++S   +RY+V FVD
Subjt:  LKKLYPEFRSLSSLN-------CDSCQFAKFHRLSSSPRVDKRAIAPFELVHSDIWGPCPVVSQTGFRYFVTFVD

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 (e value: 1.3e-19; % identity: 24.38)
Query:  LVVSNVIPL-ASKITEHKLNGSNYYDWRRTILFYLRSTDMD---DHMTEDPP----KDAKQKKD-----WLRDDARLYLQIKNSIESEIIGLVDHCESVK
        LV +N++ +  S +T  KL  +NY  W R +       ++    D  T  PP     DA  + +     W R D  +Y  I  +I   +   V    +  
Subjt:  LVVSNVIPL-ASKITEHKLNGSNYYDWRRTILFYLRSTDMD---DHMTEDPP----KDAKQKKD-----WLRDDARLYLQIKNSIESEIIGLVDHCESVK

Query:  ELLEFLDFLYSGKEQVHRMFEVCMQFFRAEQKAESVTSYFMRLKKIIAELGLL-LPFSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSL
        ++ E L  +Y+     H                  VT   +R      +L LL  P   D +V++V          L  L  ++     QI +    PSL
Subjt:  ELLEFLDFLYSGKEQVHRMFEVCMQFFRAEQKAESVTSYFMRLKKIIAELGLL-LPFSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSL

Query:  DDAFTRVLRIESSPTSVS----IPQPSSALFSKNNNPRAPQRNSTDHRKPESVEIVCNYCRKPGHMKRDCRKLLYKNSQRSQHAQIASTCDIPEASVTIS
         +   R++  ES   +++    +P  ++ +  +N N    Q N  D+R   +     N  +      R        N Q   +      C +       S
Subjt:  DDAFTRVLRIESSPTSVS----IPQPSSALFSKNNNPRAPQRNSTDHRKPESVEIVCNYCRKPGHMKRDCRKLLYKNSQRSQHAQIASTCDIPEASVTIS

Query:  ADEFAKFQNYQESLQASSSSTPIASTVAPGNIKCLLT-SSTKWVIDSGATAHMTG--NSHLFSRPLSPAPFPSVTLADGSTSSVLGSGTIHL---TPSFS
        A    +   +Q +     S++P        N+      ++  W++DSGAT H+T   N+  F +P +      V +ADGST  +  +G+  L   + S  
Subjt:  ADEFAKFQNYQESLQASSSSTPIASTVAPGNIKCLLT-SSTKWVIDSGATAHMTG--NSHLFSRPLSPAPFPSVTLADGSTSSVLGSGTIHL---TPSFS

Query:  LSSVLHLPNLSFNLISTSQLTHDLNCVVMFFSGYCLFQDRVTKKIIGRGYESGGLYFFDHQVSQAVACPVVP----SPFEVHCRLGHPSLFVLKKLYPEF
        L+ VL++PN+  NLIS  +L +     V FF      +D  T   + +G     LY +    SQAV+    P    +    H RLGHPSL +L  +    
Subjt:  LSSVLHLPNLSFNLISTSQLTHDLNCVVMFFSGYCLFQDRVTKKIIGRGYESGGLYFFDHQVSQAVACPVVP----SPFEVHCRLGHPSLFVLKKLYPEF

Query:  R------SLSSLNCDSCQFAKFHRLSSSPRVDKRAIAPFELVHSDIWGPCPVVSQTGFRYFVTFVD
               S   L+C  C   K H++  S      +  P E ++SD+W   P++S   +RY+V FVD
Subjt:  R------SLSSLNCDSCQFAKFHRLSSSPRVDKRAIAPFELVHSDIWGPCPVVSQTGFRYFVTFVD

Arabidopsis top hits (e value; % identity; alignment)
AT5G53670.1 unknown protein (e value: 6.2e-04; % identity: 26.12)
Query:  LNGSNYYDWRRTILFYLRSTDMDDHMTEDPPKDAKQKKDWLRDDARLYLQIKNSIESEIIGLV-DHCESVKELLEFLDFLYSGKEQVHRMFEVCMQFFRA
        L+GSN+ +W+  +L  L   D+D  +  + P   K+ K W R +    + +K  I     G+V D   + K+ L  L+  ++  E+  R          +
Subjt:  LNGSNYYDWRRTILFYLRSTDMDDHMTEDPPKDAKQKKDWLRDDARLYLQIKNSIESEIIGLV-DHCESVKELLEFLDFLYSGKEQVHRMFEVCMQFFRA

Query:  EQKAESVTSYFMRLKKIIAE---LGLLLPFSPDV
          + E+V    MR+K + A+   LG+   FS D+
Subjt:  EQKAESVTSYFMRLKKIIAE---LGLLLPFSPDV


Sequences
CDS sequence
ATGGCCGACATAAAAAATTTGGTAGTCTCCAACGTTATTCCCTTGGCCTCTAAGATCACAGAACATAAGTTAAATGGATCCAATTATTACGATTGGCGTCGGACAATTTT
ATTTTATTTGAGAAGTACTGATATGGATGATCATATGACTGAAGATCCCCCAAAAGATGCAAAGCAGAAGAAGGATTGGCTTCGTGATGATGCCCGTTTATATCTTCAGA
TCAAGAATTCAATTGAGAGTGAGATAATTGGATTGGTTGATCACTGTGAGTCTGTTAAAGAACTTTTGGAATTTTTGGATTTTCTATACTCAGGTAAAGAGCAAGTGCAT
AGAATGTTTGAAGTTTGTATGCAATTTTTTCGTGCGGAACAGAAAGCTGAGTCTGTCACCAGCTACTTTATGCGGCTTAAGAAGATCATTGCCGAGCTTGGCTTGTTGTT
ACCTTTTAGTCCTGATGTTAAAGTTCAACAAGTTCAACGAGAGAAGATGGCTGTTATGATTTTTCTGAATGGACTCTTACCTGAATTTGGAATGGCAAAGACACAGATTC
TCTCTGACTCCAAGATTCCATCATTAGATGATGCCTTCACTCGAGTCCTTCGCATTGAAAGCTCTCCGACTAGTGTGTCTATTCCTCAACCCAGTAGTGCTCTCTTTAGC
AAGAACAATAACCCTCGGGCACCTCAGAGGAATAGTACTGATCATCGAAAACCAGAGTCTGTAGAGATTGTTTGTAACTACTGTCGTAAGCCAGGCCATATGAAACGTGA
TTGTCGGAAATTGCTATATAAGAATAGTCAACGATCTCAACATGCTCAGATAGCCTCCACATGCGATATACCAGAGGCGTCAGTTACTATTTCTGCAGATGAGTTTGCTA
AGTTTCAGAATTACCAAGAGTCATTACAAGCGTCATCTTCCTCTACTCCGATTGCATCCACTGTTGCCCCAGGTAATATAAAGTGTCTTCTTACATCATCTACCAAATGG
GTCATAGACTCTGGTGCCACAGCTCATATGACAGGTAATTCTCACCTATTTTCTAGACCGTTGTCCCCTGCCCCTTTCCCATCTGTTACATTGGCCGATGGCTCCACATC
TTCTGTTCTTGGCTCTGGCACTATTCACCTTACCCCATCCTTTTCTCTCTCTTCTGTGTTACATTTGCCTAACTTATCCTTTAATTTAATTTCTACTAGTCAACTTACTC
ATGACCTAAATTGTGTTGTCATGTTCTTTTCTGGTTATTGCTTGTTTCAGGATCGTGTGACGAAGAAGATTATTGGTAGAGGATATGAGTCAGGAGGCCTTTATTTCTTT
GATCATCAAGTATCGCAAGCTGTGGCGTGTCCTGTCGTTCCCTCTCCTTTTGAAGTCCATTGTCGTTTAGGTCATCCATCTTTGTTTGTGTTGAAGAAACTTTATCCAGA
ATTTAGGTCTTTGTCCTCTTTAAATTGTGATTCGTGTCAATTTGCGAAATTTCATCGTCTTAGTTCGAGTCCTCGAGTCGATAAACGAGCAATTGCTCCATTTGAGTTAG
TTCATTCTGATATTTGGGGTCCGTGTCCAGTTGTATCTCAAACAGGCTTTCGTTATTTTGTTACTTTTGTTGAC
mRNA sequence
ATGGCCGACATAAAAAATTTGGTAGTCTCCAACGTTATTCCCTTGGCCTCTAAGATCACAGAACATAAGTTAAATGGATCCAATTATTACGATTGGCGTCGGACAATTTT
ATTTTATTTGAGAAGTACTGATATGGATGATCATATGACTGAAGATCCCCCAAAAGATGCAAAGCAGAAGAAGGATTGGCTTCGTGATGATGCCCGTTTATATCTTCAGA
TCAAGAATTCAATTGAGAGTGAGATAATTGGATTGGTTGATCACTGTGAGTCTGTTAAAGAACTTTTGGAATTTTTGGATTTTCTATACTCAGGTAAAGAGCAAGTGCAT
AGAATGTTTGAAGTTTGTATGCAATTTTTTCGTGCGGAACAGAAAGCTGAGTCTGTCACCAGCTACTTTATGCGGCTTAAGAAGATCATTGCCGAGCTTGGCTTGTTGTT
ACCTTTTAGTCCTGATGTTAAAGTTCAACAAGTTCAACGAGAGAAGATGGCTGTTATGATTTTTCTGAATGGACTCTTACCTGAATTTGGAATGGCAAAGACACAGATTC
TCTCTGACTCCAAGATTCCATCATTAGATGATGCCTTCACTCGAGTCCTTCGCATTGAAAGCTCTCCGACTAGTGTGTCTATTCCTCAACCCAGTAGTGCTCTCTTTAGC
AAGAACAATAACCCTCGGGCACCTCAGAGGAATAGTACTGATCATCGAAAACCAGAGTCTGTAGAGATTGTTTGTAACTACTGTCGTAAGCCAGGCCATATGAAACGTGA
TTGTCGGAAATTGCTATATAAGAATAGTCAACGATCTCAACATGCTCAGATAGCCTCCACATGCGATATACCAGAGGCGTCAGTTACTATTTCTGCAGATGAGTTTGCTA
AGTTTCAGAATTACCAAGAGTCATTACAAGCGTCATCTTCCTCTACTCCGATTGCATCCACTGTTGCCCCAGGTAATATAAAGTGTCTTCTTACATCATCTACCAAATGG
GTCATAGACTCTGGTGCCACAGCTCATATGACAGGTAATTCTCACCTATTTTCTAGACCGTTGTCCCCTGCCCCTTTCCCATCTGTTACATTGGCCGATGGCTCCACATC
TTCTGTTCTTGGCTCTGGCACTATTCACCTTACCCCATCCTTTTCTCTCTCTTCTGTGTTACATTTGCCTAACTTATCCTTTAATTTAATTTCTACTAGTCAACTTACTC
ATGACCTAAATTGTGTTGTCATGTTCTTTTCTGGTTATTGCTTGTTTCAGGATCGTGTGACGAAGAAGATTATTGGTAGAGGATATGAGTCAGGAGGCCTTTATTTCTTT
GATCATCAAGTATCGCAAGCTGTGGCGTGTCCTGTCGTTCCCTCTCCTTTTGAAGTCCATTGTCGTTTAGGTCATCCATCTTTGTTTGTGTTGAAGAAACTTTATCCAGA
ATTTAGGTCTTTGTCCTCTTTAAATTGTGATTCGTGTCAATTTGCGAAATTTCATCGTCTTAGTTCGAGTCCTCGAGTCGATAAACGAGCAATTGCTCCATTTGAGTTAG
TTCATTCTGATATTTGGGGTCCGTGTCCAGTTGTATCTCAAACAGGCTTTCGTTATTTTGTTACTTTTGTTGAC
Protein sequence
MADIKNLVVSNVIPLASKITEHKLNGSNYYDWRRTILFYLRSTDMDDHMTEDPPKDAKQKKDWLRDDARLYLQIKNSIESEIIGLVDHCESVKELLEFLDFLYSGKEQVH
RMFEVCMQFFRAEQKAESVTSYFMRLKKIIAELGLLLPFSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDDAFTRVLRIESSPTSVSIPQPSSALFS
KNNNPRAPQRNSTDHRKPESVEIVCNYCRKPGHMKRDCRKLLYKNSQRSQHAQIASTCDIPEASVTISADEFAKFQNYQESLQASSSSTPIASTVAPGNIKCLLTSSTKW
VIDSGATAHMTGNSHLFSRPLSPAPFPSVTLADGSTSSVLGSGTIHLTPSFSLSSVLHLPNLSFNLISTSQLTHDLNCVVMFFSGYCLFQDRVTKKIIGRGYESGGLYFF
DHQVSQAVACPVVPSPFEVHCRLGHPSLFVLKKLYPEFRSLSSLNCDSCQFAKFHRLSSSPRVDKRAIAPFELVHSDIWGPCPVVSQTGFRYFVTFVD