CuGenDBv2

Tan0013578 (gene) of Snake gourd v1 genome

Gene ID: Tan0013578
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Gag/pol protein
Genome location: LG06:59167944..59169212
RNA-Seq Expression: Tan0013578
Synteny: Tan0013578
Gene Ontology terms:
GO:0006807 - nitrogen compound metabolic process (biological process)
GO:0043170 - macromolecule metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0005488 - binding (molecular function)
InterPro domains:
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily
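The genome location string in the record above can be split into a sequence ID and coordinates. The sketch below is illustrative only: the helper name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive, which the page does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int, int]:
    """Split 'SEQID:START..END' into its parts and compute the span length."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    seqid = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    # Assumption: both coordinates are 1-based and inclusive.
    return seqid, start, end, end - start + 1


seqid, start, end, span = parse_location("LG06:59167944..59169212")
print(seqid, start, end, span)  # LG06 59167944 59169212 1269
```

Under that assumption the locus spans 1269 bp on LG06.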


Homology
GenBank top hits | e-value | % identity
KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa] | 1.3e-136 | 65.48
Query:  MLKSIRILLSIATFYDYEIWQMDVKTTFLNGNLEESIYMDKPEGFIVQGQEQK---------------------------------NIDEPCVYKKIVNT
        MLKSIRILLSIATFYDYEIWQMDVKT FLNGNLEESI+M +PEGFI QGQEQK                                 N+DEPCVYKKI   
Subjt:  MLKSIRILLSIATFYDYEIWQMDVKTTFLNGNLEESIYMDKPEGFIVQGQEQK---------------------------------NIDEPCVYKKIVNT

Query:  TVAFLVLYVDDILLIGNDAGFLTGVKQWLATQFQMKDLGEAQFVLGIQIVWNRKNKTLALSQAFYIDKML------------------------------
         VAFLVLYVDDILLIGND G+LT VK WLA QFQMKDLGEAQ+VLGIQI+ +RKNKTLALSQA YIDK+L                              
Subjt:  TVAFLVLYVDDILLIGNDAGFLTGVKQWLATQFQMKDLGEAQFVLGIQIVWNRKNKTLALSQAFYIDKML------------------------------

Query:  ------------------------------------------------TAVKNILKYLRRTRDYMLVYGAKDLILAGYTDSYFQTDVDSRKSTSGSVFTL
                                                        TAVK +LKYLRRTRDYMLVYGAKDLIL GYTDS FQTD DSRKSTSGSVFTL
Subjt:  ------------------------------------------------TAVKNILKYLRRTRDYMLVYGAKDLILAGYTDSYFQTDVDSRKSTSGSVFTL

Query:  NGGAIVWRSIKQGCIVDSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKCGKHIERKYHLIREIVQRGDVIVPQI
        NGGA+VWRSIKQGCI DSTMEAEYVAACEAAKEAVWLRKFL DLEVVPNMNLPITLYCDNSGAVANSKEPRSHK GKHIERKYHLIREIVQRGDVIV +I
Subjt:  NGGAIVWRSIKQGCIVDSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKCGKHIERKYHLIREIVQRGDVIVPQI

Query:  ASEHNIVDPFTKPLTAKVFE
        ASEHNI DPFTK LTAKVFE
Subjt:  ASEHNIVDPFTKPLTAKVFE

KAA0035907.1 gag/pol protein [Cucumis melo var. makuwa] | 1.0e-133 | 64.52
Query:  MLKSIRILLSIATFYDYEIWQMDVKTTFLNGNLEESIYMDKPEGFIVQGQEQK---------------------------------NIDEPCVYKKIVNT
        MLKSIRILLSIA FYDYEIWQMDVKT FLNGNLEESI+M +PEGFI QGQEQK                                 N+DEPCVYKKI   
Subjt:  MLKSIRILLSIATFYDYEIWQMDVKTTFLNGNLEESIYMDKPEGFIVQGQEQK---------------------------------NIDEPCVYKKIVNT

Query:  TVAFLVLYVDDILLIGNDAGFLTGVKQWLATQFQMKDLGEAQFVLGIQIVWNRKNKTLALSQAFYIDKML------------------------------
         VAFLVLYVDDILLIGND G+LT VK WLA QFQMKDLGE Q+VLGIQI+ +RKNKTLALSQA YIDK+L                              
Subjt:  TVAFLVLYVDDILLIGNDAGFLTGVKQWLATQFQMKDLGEAQFVLGIQIVWNRKNKTLALSQAFYIDKML------------------------------

Query:  ------------------------------------------------TAVKNILKYLRRTRDYMLVYGAKDLILAGYTDSYFQTDVDSRKSTSGSVFTL
                                                        TAVK ILKYLRRTRDYMLVYGAKDLIL GYT+S FQTD DSRKSTS SVFTL
Subjt:  ------------------------------------------------TAVKNILKYLRRTRDYMLVYGAKDLILAGYTDSYFQTDVDSRKSTSGSVFTL

Query:  NGGAIVWRSIKQGCIVDSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKCGKHIERKYHLIREIVQRGDVIVPQI
        NGGA+VWRSIKQGCI DSTMEAEYVAACEAAKEAVWL+KFL DLEVVPNMNLPITLYCDNSGAVANSKEPRSHK GKHIERKYHLIREIVQRGDVIV +I
Subjt:  NGGAIVWRSIKQGCIVDSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKCGKHIERKYHLIREIVQRGDVIVPQI

Query:  ASEHNIVDPFTKPLTAKVFE
        ASEHNI DPFTK LTAKVFE
Subjt:  ASEHNIVDPFTKPLTAKVFE

KAA0048404.1 gag/pol protein [Cucumis melo var. makuwa] | 5.5e-132 | 62.86
Query:  MLKSIRILLSIATFYDYEIWQMDVKTTFLNGNLEESIYMDKPEGFIVQGQEQK---------------------------------NIDEPCVYKKIVNT
        MLKSIRILLSIA ++DYEIWQMDVKT FLNGNLEE+IYM +PEGFI+ GQEQK                                  +DEPCVYK+I+N 
Subjt:  MLKSIRILLSIATFYDYEIWQMDVKTTFLNGNLEESIYMDKPEGFIVQGQEQK---------------------------------NIDEPCVYKKIVNT

Query:  TVAFLVLYVDDILLIGNDAGFLTGVKQWLATQFQMKDLGEAQFVLGIQIVWNRKNKTLALSQAFYIDKML------------------------------
        +VAFLVLYVDDILLIGND G LT +KQWLATQFQMKDLGEAQFVLGIQI  +RKNK LALSQA YIDK++                              
Subjt:  TVAFLVLYVDDILLIGNDAGFLTGVKQWLATQFQMKDLGEAQFVLGIQIVWNRKNKTLALSQAFYIDKML------------------------------

Query:  ------------------------------------------------TAVKNILKYLRRTRDYMLVYGAKDLILAGYTDSYFQTDVDSRKSTSGSVFTL
                                                        TAVK ILKYLRRTRDY LVYG+KDLIL GYTDS FQTD DSRKSTSGSVFTL
Subjt:  ------------------------------------------------TAVKNILKYLRRTRDYMLVYGAKDLILAGYTDSYFQTDVDSRKSTSGSVFTL

Query:  NGGAIVWRSIKQGCIVDSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKCGKHIERKYHLIREIVQRGDVIVPQI
        NGGA+VWRSIKQGCI DSTMEAEYVAACEAAKEAVWLR FL DLEVVPNM+ PITLYCDNSGAVANS+EPRSHK GKHIERKYHLIREIV RGDVIV QI
Subjt:  NGGAIVWRSIKQGCIVDSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKCGKHIERKYHLIREIVQRGDVIVPQI

Query:  ASEHNIVDPFTKPLTAKVFE
        AS HN+ DPFTKPLTAKVFE
Subjt:  ASEHNIVDPFTKPLTAKVFE

KAA0059226.1 gag/pol protein [Cucumis melo var. makuwa] | 1.3e-136 | 65.48
Query:  MLKSIRILLSIATFYDYEIWQMDVKTTFLNGNLEESIYMDKPEGFIVQGQEQK---------------------------------NIDEPCVYKKIVNT
        MLKSIRILLSIATFYDYEIWQMDVKT FLNGNLEESI+M +PEGFI QGQEQK                                 N+DEPCVYKKI   
Subjt:  MLKSIRILLSIATFYDYEIWQMDVKTTFLNGNLEESIYMDKPEGFIVQGQEQK---------------------------------NIDEPCVYKKIVNT

Query:  TVAFLVLYVDDILLIGNDAGFLTGVKQWLATQFQMKDLGEAQFVLGIQIVWNRKNKTLALSQAFYIDKML------------------------------
         VAFLVLYVDDILLIGND G+LT VK WLA QFQMKDLGEAQ+VLGIQI+ +RKNKTLALSQA YIDK+L                              
Subjt:  TVAFLVLYVDDILLIGNDAGFLTGVKQWLATQFQMKDLGEAQFVLGIQIVWNRKNKTLALSQAFYIDKML------------------------------

Query:  ------------------------------------------------TAVKNILKYLRRTRDYMLVYGAKDLILAGYTDSYFQTDVDSRKSTSGSVFTL
                                                        TAVK +LKYLRRTRDYMLVYGAKDLIL GYTDS FQTD DSRKSTSGSVFTL
Subjt:  ------------------------------------------------TAVKNILKYLRRTRDYMLVYGAKDLILAGYTDSYFQTDVDSRKSTSGSVFTL

Query:  NGGAIVWRSIKQGCIVDSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKCGKHIERKYHLIREIVQRGDVIVPQI
        NGGA+VWRSIKQGCI DSTMEAEYVAACEAAKEAVWLRKFL DLEVVPNMNLPITLYCDNSGAVANSKEPRSHK GKHIERKYHLIREIVQRGDVIV +I
Subjt:  NGGAIVWRSIKQGCIVDSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKCGKHIERKYHLIREIVQRGDVIVPQI

Query:  ASEHNIVDPFTKPLTAKVFE
        ASEHNI DPFTK LTAKVFE
Subjt:  ASEHNIVDPFTKPLTAKVFE

TYJ97618.1 gag/pol protein [Cucumis melo var. makuwa] | 5.5e-132 | 62.86
Query:  MLKSIRILLSIATFYDYEIWQMDVKTTFLNGNLEESIYMDKPEGFIVQGQEQK---------------------------------NIDEPCVYKKIVNT
        MLKSIRILLSIA ++DYEIWQMDVKT FLNGNLEE+IYM +PEGFI+ GQEQK                                  +DEPCVYK+I+N 
Subjt:  MLKSIRILLSIATFYDYEIWQMDVKTTFLNGNLEESIYMDKPEGFIVQGQEQK---------------------------------NIDEPCVYKKIVNT

Query:  TVAFLVLYVDDILLIGNDAGFLTGVKQWLATQFQMKDLGEAQFVLGIQIVWNRKNKTLALSQAFYIDKML------------------------------
        +VAFLVLYVDDILLIGND G LT +KQWLATQFQMKDLGEAQFVLGIQI  +RKNK LALSQA YIDK++                              
Subjt:  TVAFLVLYVDDILLIGNDAGFLTGVKQWLATQFQMKDLGEAQFVLGIQIVWNRKNKTLALSQAFYIDKML------------------------------

Query:  ------------------------------------------------TAVKNILKYLRRTRDYMLVYGAKDLILAGYTDSYFQTDVDSRKSTSGSVFTL
                                                        TAVK ILKYLRRTRDY LVYG+KDLIL GYTDS FQTD DSRKSTSGSVFTL
Subjt:  ------------------------------------------------TAVKNILKYLRRTRDYMLVYGAKDLILAGYTDSYFQTDVDSRKSTSGSVFTL

Query:  NGGAIVWRSIKQGCIVDSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKCGKHIERKYHLIREIVQRGDVIVPQI
        NGGA+VWRSIKQGCI DSTMEAEYVAACEAAKEAVWLR FL DLEVVPNM+ PITLYCDNSGAVANS+EPRSHK GKHIERKYHLIREIV RGDVIV QI
Subjt:  NGGAIVWRSIKQGCIVDSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKCGKHIERKYHLIREIVQRGDVIVPQI

Query:  ASEHNIVDPFTKPLTAKVFE
        AS HN+ DPFTKPLTAKVFE
Subjt:  ASEHNIVDPFTKPLTAKVFE

TrEMBL top hits | e-value | % identity
A0A5A7SMH8 Gag/pol protein | 2.7e-132 | 62.86
Query:  MLKSIRILLSIATFYDYEIWQMDVKTTFLNGNLEESIYMDKPEGFIVQGQEQK---------------------------------NIDEPCVYKKIVNT
        MLKSIRILLSIA ++DYEIWQMDVKT FLNGNLEE+IYM +PEGFI+ GQEQK                                  +DEPCVYK+I+N 
Subjt:  MLKSIRILLSIATFYDYEIWQMDVKTTFLNGNLEESIYMDKPEGFIVQGQEQK---------------------------------NIDEPCVYKKIVNT

Query:  TVAFLVLYVDDILLIGNDAGFLTGVKQWLATQFQMKDLGEAQFVLGIQIVWNRKNKTLALSQAFYIDKML------------------------------
        +VAFLVLYVDDILLIGND G LT +KQWLATQFQMKDLGEAQFVLGIQI  +RKNK LALSQA YIDK++                              
Subjt:  TVAFLVLYVDDILLIGNDAGFLTGVKQWLATQFQMKDLGEAQFVLGIQIVWNRKNKTLALSQAFYIDKML------------------------------

Query:  ------------------------------------------------TAVKNILKYLRRTRDYMLVYGAKDLILAGYTDSYFQTDVDSRKSTSGSVFTL
                                                        TAVK ILKYLRRTRDY LVYG+KDLIL GYTDS FQTD DSRKSTSGSVFTL
Subjt:  ------------------------------------------------TAVKNILKYLRRTRDYMLVYGAKDLILAGYTDSYFQTDVDSRKSTSGSVFTL

Query:  NGGAIVWRSIKQGCIVDSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKCGKHIERKYHLIREIVQRGDVIVPQI
        NGGA+VWRSIKQGCI DSTMEAEYVAACEAAKEAVWLR FL DLEVVPNM+ PITLYCDNSGAVANS+EPRSHK GKHIERKYHLIREIV RGDVIV QI
Subjt:  NGGAIVWRSIKQGCIVDSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKCGKHIERKYHLIREIVQRGDVIVPQI

Query:  ASEHNIVDPFTKPLTAKVFE
        AS HN+ DPFTKPLTAKVFE
Subjt:  ASEHNIVDPFTKPLTAKVFE

A0A5A7T2V9 Gag/pol protein | 4.9e-134 | 64.52
Query:  MLKSIRILLSIATFYDYEIWQMDVKTTFLNGNLEESIYMDKPEGFIVQGQEQK---------------------------------NIDEPCVYKKIVNT
        MLKSIRILLSIA FYDYEIWQMDVKT FLNGNLEESI+M +PEGFI QGQEQK                                 N+DEPCVYKKI   
Subjt:  MLKSIRILLSIATFYDYEIWQMDVKTTFLNGNLEESIYMDKPEGFIVQGQEQK---------------------------------NIDEPCVYKKIVNT

Query:  TVAFLVLYVDDILLIGNDAGFLTGVKQWLATQFQMKDLGEAQFVLGIQIVWNRKNKTLALSQAFYIDKML------------------------------
         VAFLVLYVDDILLIGND G+LT VK WLA QFQMKDLGE Q+VLGIQI+ +RKNKTLALSQA YIDK+L                              
Subjt:  TVAFLVLYVDDILLIGNDAGFLTGVKQWLATQFQMKDLGEAQFVLGIQIVWNRKNKTLALSQAFYIDKML------------------------------

Query:  ------------------------------------------------TAVKNILKYLRRTRDYMLVYGAKDLILAGYTDSYFQTDVDSRKSTSGSVFTL
                                                        TAVK ILKYLRRTRDYMLVYGAKDLIL GYT+S FQTD DSRKSTS SVFTL
Subjt:  ------------------------------------------------TAVKNILKYLRRTRDYMLVYGAKDLILAGYTDSYFQTDVDSRKSTSGSVFTL

Query:  NGGAIVWRSIKQGCIVDSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKCGKHIERKYHLIREIVQRGDVIVPQI
        NGGA+VWRSIKQGCI DSTMEAEYVAACEAAKEAVWL+KFL DLEVVPNMNLPITLYCDNSGAVANSKEPRSHK GKHIERKYHLIREIVQRGDVIV +I
Subjt:  NGGAIVWRSIKQGCIVDSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKCGKHIERKYHLIREIVQRGDVIVPQI

Query:  ASEHNIVDPFTKPLTAKVFE
        ASEHNI DPFTK LTAKVFE
Subjt:  ASEHNIVDPFTKPLTAKVFE

A0A5A7TZD0 Gag/pol protein | 6.2e-137 | 65.48
Query:  MLKSIRILLSIATFYDYEIWQMDVKTTFLNGNLEESIYMDKPEGFIVQGQEQK---------------------------------NIDEPCVYKKIVNT
        MLKSIRILLSIATFYDYEIWQMDVKT FLNGNLEESI+M +PEGFI QGQEQK                                 N+DEPCVYKKI   
Subjt:  MLKSIRILLSIATFYDYEIWQMDVKTTFLNGNLEESIYMDKPEGFIVQGQEQK---------------------------------NIDEPCVYKKIVNT

Query:  TVAFLVLYVDDILLIGNDAGFLTGVKQWLATQFQMKDLGEAQFVLGIQIVWNRKNKTLALSQAFYIDKML------------------------------
         VAFLVLYVDDILLIGND G+LT VK WLA QFQMKDLGEAQ+VLGIQI+ +RKNKTLALSQA YIDK+L                              
Subjt:  TVAFLVLYVDDILLIGNDAGFLTGVKQWLATQFQMKDLGEAQFVLGIQIVWNRKNKTLALSQAFYIDKML------------------------------

Query:  ------------------------------------------------TAVKNILKYLRRTRDYMLVYGAKDLILAGYTDSYFQTDVDSRKSTSGSVFTL
                                                        TAVK +LKYLRRTRDYMLVYGAKDLIL GYTDS FQTD DSRKSTSGSVFTL
Subjt:  ------------------------------------------------TAVKNILKYLRRTRDYMLVYGAKDLILAGYTDSYFQTDVDSRKSTSGSVFTL

Query:  NGGAIVWRSIKQGCIVDSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKCGKHIERKYHLIREIVQRGDVIVPQI
        NGGA+VWRSIKQGCI DSTMEAEYVAACEAAKEAVWLRKFL DLEVVPNMNLPITLYCDNSGAVANSKEPRSHK GKHIERKYHLIREIVQRGDVIV +I
Subjt:  NGGAIVWRSIKQGCIVDSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKCGKHIERKYHLIREIVQRGDVIVPQI

Query:  ASEHNIVDPFTKPLTAKVFE
        ASEHNI DPFTK LTAKVFE
Subjt:  ASEHNIVDPFTKPLTAKVFE

A0A5A7UYE8 Gag/pol protein | 6.2e-137 | 65.48
Query:  MLKSIRILLSIATFYDYEIWQMDVKTTFLNGNLEESIYMDKPEGFIVQGQEQK---------------------------------NIDEPCVYKKIVNT
        MLKSIRILLSIATFYDYEIWQMDVKT FLNGNLEESI+M +PEGFI QGQEQK                                 N+DEPCVYKKI   
Subjt:  MLKSIRILLSIATFYDYEIWQMDVKTTFLNGNLEESIYMDKPEGFIVQGQEQK---------------------------------NIDEPCVYKKIVNT

Query:  TVAFLVLYVDDILLIGNDAGFLTGVKQWLATQFQMKDLGEAQFVLGIQIVWNRKNKTLALSQAFYIDKML------------------------------
         VAFLVLYVDDILLIGND G+LT VK WLA QFQMKDLGEAQ+VLGIQI+ +RKNKTLALSQA YIDK+L                              
Subjt:  TVAFLVLYVDDILLIGNDAGFLTGVKQWLATQFQMKDLGEAQFVLGIQIVWNRKNKTLALSQAFYIDKML------------------------------

Query:  ------------------------------------------------TAVKNILKYLRRTRDYMLVYGAKDLILAGYTDSYFQTDVDSRKSTSGSVFTL
                                                        TAVK +LKYLRRTRDYMLVYGAKDLIL GYTDS FQTD DSRKSTSGSVFTL
Subjt:  ------------------------------------------------TAVKNILKYLRRTRDYMLVYGAKDLILAGYTDSYFQTDVDSRKSTSGSVFTL

Query:  NGGAIVWRSIKQGCIVDSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKCGKHIERKYHLIREIVQRGDVIVPQI
        NGGA+VWRSIKQGCI DSTMEAEYVAACEAAKEAVWLRKFL DLEVVPNMNLPITLYCDNSGAVANSKEPRSHK GKHIERKYHLIREIVQRGDVIV +I
Subjt:  NGGAIVWRSIKQGCIVDSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKCGKHIERKYHLIREIVQRGDVIVPQI

Query:  ASEHNIVDPFTKPLTAKVFE
        ASEHNI DPFTK LTAKVFE
Subjt:  ASEHNIVDPFTKPLTAKVFE

A0A5D3CPJ6 Gag/pol protein | 2.7e-132 | 62.86
Query:  MLKSIRILLSIATFYDYEIWQMDVKTTFLNGNLEESIYMDKPEGFIVQGQEQK---------------------------------NIDEPCVYKKIVNT
        MLKSIRILLSIA ++DYEIWQMDVKT FLNGNLEE+IYM +PEGFI+ GQEQK                                  +DEPCVYK+I+N 
Subjt:  MLKSIRILLSIATFYDYEIWQMDVKTTFLNGNLEESIYMDKPEGFIVQGQEQK---------------------------------NIDEPCVYKKIVNT

Query:  TVAFLVLYVDDILLIGNDAGFLTGVKQWLATQFQMKDLGEAQFVLGIQIVWNRKNKTLALSQAFYIDKML------------------------------
        +VAFLVLYVDDILLIGND G LT +KQWLATQFQMKDLGEAQFVLGIQI  +RKNK LALSQA YIDK++                              
Subjt:  TVAFLVLYVDDILLIGNDAGFLTGVKQWLATQFQMKDLGEAQFVLGIQIVWNRKNKTLALSQAFYIDKML------------------------------

Query:  ------------------------------------------------TAVKNILKYLRRTRDYMLVYGAKDLILAGYTDSYFQTDVDSRKSTSGSVFTL
                                                        TAVK ILKYLRRTRDY LVYG+KDLIL GYTDS FQTD DSRKSTSGSVFTL
Subjt:  ------------------------------------------------TAVKNILKYLRRTRDYMLVYGAKDLILAGYTDSYFQTDVDSRKSTSGSVFTL

Query:  NGGAIVWRSIKQGCIVDSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKCGKHIERKYHLIREIVQRGDVIVPQI
        NGGA+VWRSIKQGCI DSTMEAEYVAACEAAKEAVWLR FL DLEVVPNM+ PITLYCDNSGAVANS+EPRSHK GKHIERKYHLIREIV RGDVIV QI
Subjt:  NGGAIVWRSIKQGCIVDSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKCGKHIERKYHLIREIVQRGDVIVPQI

Query:  ASEHNIVDPFTKPLTAKVFE
        AS HN+ DPFTKPLTAKVFE
Subjt:  ASEHNIVDPFTKPLTAKVFE

SwissProt top hits | e-value | % identity
P04146 Copia protein | 1.3e-35 | 28.61
Query:  LKSIRILLSIATFYDYEIWQMDVKTTFLNGNLEESIYMDKPEG-------------------------FIVQGQEQKNID------EPCVY---KKIVNT
        + S R +LS+   Y+ ++ QMDVKT FLNG L+E IYM  P+G                         F V  Q  K  +      + C+Y   K  +N 
Subjt:  LKSIRILLSIATFYDYEIWQMDVKTTFLNGNLEESIYMDKPEG-------------------------FIVQGQEQKNID------EPCVY---KKIVNT

Query:  TVAFLVLYVDDILLIGNDAGFLTGVKQWLATQFQMKDLGEAQFVLGIQIVWNRKNKTLALSQAFYIDKML------------------------------
         + +++LYVDD+++   D   +   K++L  +F+M DL E +  +GI+I    +   + LSQ+ Y+ K+L                              
Subjt:  TVAFLVLYVDDILLIGNDAGFLTGVKQWLATQFQMKDLGEAQFVLGIQIVWNRKNKTLALSQAFYIDKML------------------------------

Query:  -----------------------TAV------------------KNILKYLRRTRDYMLVYG---AKDLILAGYTDSYFQTDVDSRKSTSGSVFTL-NGG
                               TAV                  K +L+YL+ T D  L++    A +  + GY DS +      RKST+G +F + +  
Subjt:  -----------------------TAV------------------KNILKYLRRTRDYMLVYG---AKDLILAGYTDSYFQTDVDSRKSTSGSVFTL-NGG

Query:  AIVWRSIKQGCIVDSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKCGKHIERKYHLIREIVQRGDVIVPQIASE
         I W + +Q  +  S+ EAEY+A  EA +EA+WL+  LT + +   +  PI +Y DN G ++ +  P  HK  KHI+ KYH  RE VQ   + +  I +E
Subjt:  AIVWRSIKQGCIVDSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKCGKHIERKYHLIREIVQRGDVIVPQIASE

Query:  HNIVDPFTKPLTAKVF
        + + D FTKPL A  F
Subjt:  HNIVDPFTKPLTAKVF

P0CV72 Secreted RxLR effector protein 161 | 6.4e-14 | 48.31
Query:  AVKNILKYLRRTRDYMLVY-GAKDLILAGYTDSYFQTDVDSRKSTSGSVFTLNGGAIVWRSIKQGCIVDSTMEAEYVAACEAAKEAVWL
        A+K +L+YL+ T+ Y L +  A    L GY+D+ +  DV+SR+STSG +F LNGG + WRS KQ  +  S+ E EY+A  EA +EAVWL
Subjt:  AVKNILKYLRRTRDYMLVY-GAKDLILAGYTDSYFQTDVDSRKSTSGSVFTLNGGAIVWRSIKQGCIVDSTMEAEYVAACEAAKEAVWL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 6.9e-53 | 33.33
Query:  LKSIRILLSIATFYDYEIWQMDVKTTFLNGNLEESIYMDKPEGFIVQGQEQ---------------------------------KNIDEPCVY-KKIVNT
        + SIR +LS+A   D E+ Q+DVKT FL+G+LEE IYM++PEGF V G++                                  K   +PCVY K+    
Subjt:  LKSIRILLSIATFYDYEIWQMDVKTTFLNGNLEESIYMDKPEGFIVQGQEQ---------------------------------KNIDEPCVY-KKIVNT

Query:  TVAFLVLYVDDILLIGNDAGFLTGVKQWLATQFQMKDLGEAQFVLGIQIVWNRKNKTLALSQAFYIDKML------------------------------
            L+LYVDD+L++G D G +  +K  L+  F MKDLG AQ +LG++IV  R ++ L LSQ  YI+++L                              
Subjt:  TVAFLVLYVDDILLIGNDAGFLTGVKQWLATQFQMKDLGEAQFVLGIQIVWNRKNKTLALSQAFYIDKML------------------------------

Query:  ------------------------------------------------TAVKNILKYLRRTRDYMLVYGAKDLILAGYTDSYFQTDVDSRKSTSGSVFTL
                                                         AVK IL+YLR T    L +G  D IL GYTD+    D+D+RKS++G +FT 
Subjt:  ------------------------------------------------TAVKNILKYLRRTRDYMLVYGAKDLILAGYTDSYFQTDVDSRKSTSGSVFTL

Query:  NGGAIVWRSIKQGCIVDSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKCGKHIERKYHLIREIVQRGDVIVPQI
        +GGAI W+S  Q C+  ST EAEY+AA E  KE +WL++FL +L +         +YCD+  A+  SK    H   KHI+ +YH IRE+V    + V +I
Subjt:  NGGAIVWRSIKQGCIVDSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKCGKHIERKYHLIREIVQRGDVIVPQI

Query:  ASEHNIVDPFTKPLTAKVFE
        ++  N  D  TK +    FE
Subjt:  ASEHNIVDPFTKPLTAKVFE

P25600 Putative transposon Ty5-1 protein YCL074W | 1.1e-05 | 27.15
Query:  MDVKTTFLNGNLEESIYMDKPEGFIVQGQEQKNID--------------EPCVYKKIVNTTV-----------------------AFLVLYVDDILLIGN
        MDV T FLN  ++E IY+ +P GF+     ++N D               P ++ + +N T+                        ++ +YVDD+L+   
Subjt:  MDVKTTFLNGNLEESIYMDKPEGFIVQGQEQKNID--------------EPCVYKKIVNTTV-----------------------AFLVLYVDDILLIGN

Query:  DAGFLTGVKQWLATQFQMKDLGEAQFVLGIQIVWNRKNKTLALSQAFYIDK
               VKQ L   + MKDLG+    LG+ I     N  + LS   YI K
Subjt:  DAGFLTGVKQWLATQFQMKDLGEAQFVLGIQIVWNRKNKTLALSQAFYIDK

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | 2.1e-09 | 28.32
Query:  SIRILLSIATFYDYEIWQMDVKTTFLNGNLEESIYMDKPEGFI--------------VQGQEQ-------------------KNIDEPCVYKKIVNTTVA
        SIRI+L +A    + I Q+DV   FL G L + +YM +P GFI              + G +Q                    ++ +  ++      ++ 
Subjt:  SIRILLSIATFYDYEIWQMDVKTTFLNGNLEESIYMDKPEGFI--------------VQGQEQ-------------------KNIDEPCVYKKIVNTTVA

Query:  FLVLYVDDILLIGNDAGFLTGVKQWLATQFQMKDLGEAQFVLGIQIVWNRKNKTLALSQAFYIDKMLTAVKNI
        ++++YVDDIL+ GND   L      L+ +F +KD  E  + LGI+    R    L LSQ  YI  +L     I
Subjt:  FLVLYVDDILLIGNDAGFLTGVKQWLATQFQMKDLGEAQFVLGIQIVWNRKNKTLALSQAFYIDKMLTAVKNI

Arabidopsis top hits | e-value | % identity
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8 | 1.1e-24 | 24.02
Query:  LKSIRILLSIATFYDYEIWQMDVKTTFLNGNLEESIYMDKPEGFIVQGQEQKNIDEPCVYK-------------------------------------KI
        L S++++L+I+  Y++ + Q+D+   FLNG+L+E IYM  P G+  +  +    +  C  K                                     KI
Subjt:  LKSIRILLSIATFYDYEIWQMDVKTTFLNGNLEESIYMDKPEGFIVQGQEQKNIDEPCVYK-------------------------------------KI

Query:  VNTTVAFLVLYVDDILLIGNDAGFLTGVKQWLATQFQMKDLGEAQFVLGIQI--------VWNRKN----------------------------------
          T    +++YVDDI++  N+   +  +K  L + F+++DLG  ++ LG++I        +  RK                                   
Subjt:  VNTTVAFLVLYVDDILLIGNDAGFLTGVKQWLATQFQMKDLGEAQFVLGIQI--------VWNRKN----------------------------------

Query:  ----------------KTLALSQAFYIDKM-----------LTAVKNILKYLRRTRDYMLVYGAK-DLILAGYTDSYFQTDVDSRKSTSGSVFTLNGGAI
                        +   L  +F ++K+             AV  IL Y++ T    L Y ++ ++ L  ++D+ FQ+  D+R+ST+G    L    I
Subjt:  ----------------KTLALSQAFYIDKM-----------LTAVKNILKYLRRTRDYMLVYGAK-DLILAGYTDSYFQTDVDSRKSTSGSVFTLNGGAI

Query:  VWRSIKQGCIVDSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKCGKHIERKYHLIRE
         W+S KQ  +  S+ EAEY A   A  E +WL +F  +L++   ++ P  L+CDN+ A+  +     H+  KHIE   H +RE
Subjt:  VWRSIKQGCIVDSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKCGKHIERKYHLIRE

ATMG00810.1 DNA/RNA polymerases superfamily protein | 1.9e-05 | 43.28
Query:  FLVLYVDDILLIGNDAGFLTGVKQWLATQFQMKDLGEAQFVLGIQIVWNRKNKTLALSQAFYIDKML
        +L+LYVDDILL G+    L  +   L++ F MKDLG   + LGIQI  +     L LSQ  Y +++L
Subjt:  FLVLYVDDILLIGNDAGFLTGVKQWLATQFQMKDLGEAQFVLGIQIVWNRKNKTLALSQAFYIDKML
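The % identity column can be cross-checked against the alignment text. The sketch below assumes the usual BLAST convention, in which identity is the number of identical columns (letters on the middle line) divided by the alignment length; the function name `percent_identity` is hypothetical. It is applied to the ATMG00810.1 alignment above, copied from this page.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent identity from a BLAST-style query line and its middle line."""
    # The alignment length is the number of columns, gaps included.
    aligned_columns = len(query)
    # Letters on the middle line mark identical residues; '+' marks a
    # conservative substitution and a space marks a mismatch or gap.
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / aligned_columns


QUERY = "FLVLYVDDILLIGNDAGFLTGVKQWLATQFQMKDLGEAQFVLGIQIVWNRKNKTLALSQAFYIDKML"
MIDLINE = "+L+LYVDDILL G+    L  +   L++ F MKDLG   + LGIQI  +     L LSQ  Y +++L"

print(round(percent_identity(QUERY, MIDLINE), 2))  # 43.28
```

Here 29 identical columns over 67 aligned columns gives 43.28, which agrees with the value listed for ATMG00810.1.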


Sequences
CDS sequence
ATGCTTAAGTCCATTAGAATACTCTTGTCCATTGCCACGTTTTATGACTATGAAATTTGGCAAATGGATGTCAAGACAACCTTTCTTAATGGCAATCTTGAAGAGAGTAT
CTATATGGATAAACCAGAGGGGTTCATAGTTCAGGGTCAAGAGCAAAAGAATATTGACGAGCCTTGTGTTTACAAGAAGATAGTCAATACCACTGTAGCTTTCTTAGTTC
TATACGTAGACGATATCTTACTCATTGGGAATGATGCAGGATTCCTAACTGGCGTTAAGCAATGGCTAGCAACCCAATTCCAAATGAAAGATTTGGGAGAGGCTCAGTTT
GTTCTTGGAATCCAAATTGTTTGGAATCGCAAGAACAAAACGCTAGCGTTGTCTCAGGCATTTTATATTGACAAGATGTTGACTGCCGTTAAGAACATCCTCAAGTATCT
TAGGAGAACGAGGGACTATATGCTTGTGTATGGCGCAAAGGATTTGATCCTTGCAGGATACACTGACTCTTATTTTCAGACTGATGTAGATTCGAGGAAATCCACATCAG
GATCAGTGTTCACTCTTAATGGAGGAGCTATAGTATGGAGGAGTATAAAGCAAGGTTGTATTGTTGACTCTACCATGGAGGCAGAGTACGTCGCAGCTTGTGAAGCAGCG
AAGGAGGCAGTATGGCTTAGGAAGTTCTTGACTGATTTGGAAGTTGTTCCAAATATGAATTTGCCTATCACCCTTTATTGTGATAATAGTGGTGCAGTGGCAAATTCTAA
AGAACCTAGAAGCCATAAGTGCGGAAAGCACATAGAGCGCAAATACCATCTCATCAGGGAGATTGTGCAACGAGGAGATGTGATCGTCCCGCAGATCGCTTCGGAGCACA
ACATTGTTGATCCGTTTACAAAGCCCCTCACGGCTAAAGTGTTTGAGGACACCTAG
mRNA sequence
ATGCTTAAGTCCATTAGAATACTCTTGTCCATTGCCACGTTTTATGACTATGAAATTTGGCAAATGGATGTCAAGACAACCTTTCTTAATGGCAATCTTGAAGAGAGTAT
CTATATGGATAAACCAGAGGGGTTCATAGTTCAGGGTCAAGAGCAAAAGAATATTGACGAGCCTTGTGTTTACAAGAAGATAGTCAATACCACTGTAGCTTTCTTAGTTC
TATACGTAGACGATATCTTACTCATTGGGAATGATGCAGGATTCCTAACTGGCGTTAAGCAATGGCTAGCAACCCAATTCCAAATGAAAGATTTGGGAGAGGCTCAGTTT
GTTCTTGGAATCCAAATTGTTTGGAATCGCAAGAACAAAACGCTAGCGTTGTCTCAGGCATTTTATATTGACAAGATGTTGACTGCCGTTAAGAACATCCTCAAGTATCT
TAGGAGAACGAGGGACTATATGCTTGTGTATGGCGCAAAGGATTTGATCCTTGCAGGATACACTGACTCTTATTTTCAGACTGATGTAGATTCGAGGAAATCCACATCAG
GATCAGTGTTCACTCTTAATGGAGGAGCTATAGTATGGAGGAGTATAAAGCAAGGTTGTATTGTTGACTCTACCATGGAGGCAGAGTACGTCGCAGCTTGTGAAGCAGCG
AAGGAGGCAGTATGGCTTAGGAAGTTCTTGACTGATTTGGAAGTTGTTCCAAATATGAATTTGCCTATCACCCTTTATTGTGATAATAGTGGTGCAGTGGCAAATTCTAA
AGAACCTAGAAGCCATAAGTGCGGAAAGCACATAGAGCGCAAATACCATCTCATCAGGGAGATTGTGCAACGAGGAGATGTGATCGTCCCGCAGATCGCTTCGGAGCACA
ACATTGTTGATCCGTTTACAAAGCCCCTCACGGCTAAAGTGTTTGAGGACACCTAG
Protein sequence
MLKSIRILLSIATFYDYEIWQMDVKTTFLNGNLEESIYMDKPEGFIVQGQEQKNIDEPCVYKKIVNTTVAFLVLYVDDILLIGNDAGFLTGVKQWLATQFQMKDLGEAQF
VLGIQIVWNRKNKTLALSQAFYIDKMLTAVKNILKYLRRTRDYMLVYGAKDLILAGYTDSYFQTDVDSRKSTSGSVFTLNGGAIVWRSIKQGCIVDSTMEAEYVAACEAA
KEAVWLRKFLTDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKCGKHIERKYHLIREIVQRGDVIVPQIASEHNIVDPFTKPLTAKVFEDT
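The relationship between the CDS and protein sequences above can be checked by translating the CDS. The sketch below is a minimal illustration and assumes the standard genetic code; the `translate` helper and the constant names are hypothetical. It translates only the first 30 and last 18 nucleotides of the CDS listed on this page.

```python
from itertools import product

# Standard genetic code (an assumption), in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


CDS_START = "ATGCTTAAGTCCATTAGAATACTCTTGTCC"  # first 30 nt of the CDS
CDS_END = "GTGTTTGAGGACACCTAG"                # last 18 nt, including the stop codon

print(translate(CDS_START))  # MLKSIRILLS
print(translate(CDS_END))    # VFEDT
```

Both fragments match the start and end of the listed protein sequence. Passing the full CDS with its line breaks removed would translate the whole sequence in the same way.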