; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

IVF0008379 (gene) of Melon (IVF77) v1 genome

Gene IDIVF0008379
OrganismCucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
DescriptionGag-pol polyprotein
Genome locationchr06:22379172..22386033
RNA-Seq ExpressionIVF0008379
SyntenyIVF0008379
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0035157.1 gag-pol polyprotein [Cucumis melo var. makuwa]1.52e-15737.56Show/hide
Query:  MDESIALLTKQFSKGVRRLKNMNTAGLNAQTSNQYRRKDGENATRRYNEASNRRNGDYGKKKEGEGRFFRCRECGGVGHYQAECPTILRGQKKDYHATLS
        MDESIALLT+QF+  +R L N N  G+N QTSNQYRR++    TRR NE SNRR+  Y KKKEG+ + FRCRECGGVGHYQ ECP  +R QKK++   L 
Subjt:  MDESIALLTKQFSKGVRRLKNMNTAGLNAQTSNQYRRKDGENATRRYNEASNRRNGDYGKKKEGEGRFFRCRECGGVGHYQAECPTILRGQKKDYHATLS

Query:  DEDTDDTKDDS-SMNAFTACFTAIDFEDDSECSDEVGDEDLTFKDLKMLRKEDTEAKAIQKEIIQDLMEKNERLMK------------------------
        DE++ D++DD  ++NAF    T    +DDSECS E  +++L+ + L+ L KED EA+AIQKE IQDL+E+NERLM                         
Subjt:  DEDTDDTKDDS-SMNAFTACFTAIDFEDDSECSDEVGDEDLTFKDLKMLRKEDTEAKAIQKEIIQDLMEKNERLMK------------------------

Query:  ---AETE--------------------ATTAGSPHA----------------------------KSPRRICYYCRQKAAVRI------------------
           + TE                     T+A S  A                            KS  R CYYC +K  +R+                  
Subjt:  ---AETE--------------------ATTAGSPHA----------------------------KSPRRICYYCRQKAAVRI------------------

Query:  ------------------------------------------------------------------------NDKDDETLNMHVDTSMLLVEVPKVDTLP
                                                                                ND++DET NM    +   VE  K D   
Subjt:  ------------------------------------------------------------------------NDKDDETLNMHVDTSMLLVEVPKVDTLP

Query:  NGTDINSETISK---EVIVDNSELVPSAHVRKNHPPSSMICDPSTGIITKKKEKVDYSKMIVDLCYTSAIESSTID-VALKDKYWINAMQEKLLQFKHNN
        N +D   +++ K   E+I   S+L+PSAHV+KNHP SS+I DPS G+ T++K+K+DY KM+ DLCY S IE ST+D  ALKD+YW+NAMQE+LLQF+ NN
Subjt:  NGTDINSETISK---EVIVDNSELVPSAHVRKNHPPSSMICDPSTGIITKKKEKVDYSKMIVDLCYTSAIESSTID-VALKDKYWINAMQEKLLQFKHNN

Query:  VWTLV----------------------------------------EGVDFDETFAPVARLQAIRLLLRISFLKKFKLYQIDVKSDFLNGYINKEVYVAQP
        VW LV                                        EGVDFDETFAPVARL+AIRLLL IS ++KFKLYQ+DVKSDFLNGY+N+EVYVAQP
Subjt:  VWTLV----------------------------------------EGVDFDETFAPVARLQAIRLLLRISFLKKFKLYQIDVKSDFLNGYINKEVYVAQP

Query:  NEFVNFEYHQHVYKLNKALYGLKQAPRAWYERLTIYLRCKGYSKGEADKTLFIHRTNDQLIVAQIYVDDVIFEDFPQDLVNN------------------
          FV+FE+ +HVYKLNKALYGLKQAP AWY+RLT+YLR +GYS+GE DKTLFI   +DQL+V QIYVDD+IF  FPQ L  N                  
Subjt:  NEFVNFEYHQHVYKLNKALYGLKQAPRAWYERLTIYLRCKGYSKGEADKTLFIHRTNDQLIVAQIYVDDVIFEDFPQDLVNN------------------

Query:  ---------------FIDVMKSEFEMSM-------ADPRTSHLEVVKRILKYVH--------------------GTSDFRILYSYDTTSILVGYCDVDGQ
                       ++   + +   ++       A+ R +HLE VKRILKYV                        +  + Y +D  +I++ YCD   +
Subjt:  ---------------FIDVMKSEFEMSM-------ADPRTSHLEVVKRILKYVH--------------------GTSDFRILYSYDTTSILVGYCDVDGQ

Query:  VLLMIGKAS--------------LEVVSVLGTILSH-GSGLYMVKSTKDELAAQ----ISSPSVQ--KVRGQRFKSTPPQRLYRLPSKKSQAKVSNKLPK
        + +                     E+V      L H  S L +       L A     +S+  VQ  +VRG RFK TPP+R YRL S+K Q +  ++L +
Subjt:  VLLMIGKAS--------------LEVVSVLGTILSH-GSGLYMVKSTKDELAAQ----ISSPSVQ--KVRGQRFKSTPPQRLYRLPSKKSQAKVSNKLPK

Query:  QIIDFSYPPTLGTYAP--------NISETPLSDMDLDDLDDVPLAQLLKKTNVLEVTVEMPASPSMSVHSQESSSTE
         +     P    + AP          SE  +SDMD DD D+V L +LLKK +       +P+ P  S+H QESSSTE
Subjt:  QIIDFSYPPTLGTYAP--------NISETPLSDMDLDDLDDVPLAQLLKKTNVLEVTVEMPASPSMSVHSQESSSTE

KAA0035323.1 putative mitochondrial protein [Cucumis melo var. makuwa]2.80e-16150.49Show/hide
Query:  MHVDTSMLLVEVPKVDTLPNGTDINSETISKEVIVDNSELVPSAHVRKNHPPSSMICDPSTGIITKKKEKVDYSKMIVDLCYTSAIESSTIDVALKDKYW
        M  D+S L VEVPKVD   +GTDINS+ ISKEVI DNSELVPSAH+R NHPPSS+I DPS GIIT+K+EKV+YSKMI DLCY S IE ST+DVALKD+YW
Subjt:  MHVDTSMLLVEVPKVDTLPNGTDINSETISKEVIVDNSELVPSAHVRKNHPPSSMICDPSTGIITKKKEKVDYSKMIVDLCYTSAIESSTIDVALKDKYW

Query:  INAMQEKLLQFKHNNVWTLV----------------------------------------EGVDFDETFAPVARLQAIRLLLRISFLKKFKLYQIDVKSD
        INAMQ++L QF  NNVWTLV                                        +GVDFDETFAPVARL+AIRLLL IS +KKFKLYQ+DVKS 
Subjt:  INAMQEKLLQFKHNNVWTLV----------------------------------------EGVDFDETFAPVARLQAIRLLLRISFLKKFKLYQIDVKSD

Query:  FLNGYINKEVYVAQPNEFVNFEYHQHVYKLNKALYGLKQAPRAWYERLTIYLR-CKGYSKGEADKTLFIHRTNDQLIVAQIYVDDVIFEDFPQDLVNNFI
        FLNGY+N+EVYVAQP  F++FEY QHVYKLNKALY LKQAP+A    +   +R C  Y                                          
Subjt:  FLNGYINKEVYVAQPNEFVNFEYHQHVYKLNKALYGLKQAPRAWYERLTIYLR-CKGYSKGEADKTLFIHRTNDQLIVAQIYVDDVIFEDFPQDLVNNFI

Query:  DVMKSEFEMSMADPRTSHLEVVKRILKYVHGTSDFRILYSYDTTSILVGYCDVDGQVLLMIGKASLEVVSVLGTILSHGSGLYMVKSTKDELAAQISSPS
                   AD RT+HLE  KRILKYVHGTSDF +LYSYDTTSILVGYCDVD        K++   + V   +++   G YMVKST+DE  AQISSPS
Subjt:  DVMKSEFEMSMADPRTSHLEVVKRILKYVHGTSDFRILYSYDTTSILVGYCDVDGQVLLMIGKASLEVVSVLGTILSHGSGLYMVKSTKDELAAQISSPS

Query:  VQKVRGQRFKSTPPQRLYRLPSKKSQAKVSNKLPKQIIDFSYPPTLGTYAPNISETPLSDMDLDDLDDVPLAQLLKKTNVLEVTVEMPASPSMSVHSQES
        VQKV+G+RFKSTPP R Y LPS+KSQ                                                                       +ES
Subjt:  VQKVRGQRFKSTPPQRLYRLPSKKSQAKVSNKLPKQIIDFSYPPTLGTYAPNISETPLSDMDLDDLDDVPLAQLLKKTNVLEVTVEMPASPSMSVHSQES

Query:  SSTEGVFVPTSGIRHTSNFQPGQLVHSSPSASLPFEPDVAHASVLGNISVVPEGRTDVRSDENEVDPPNLDIRFEEVLTNFDDNLTVPLGSPKIPVAPKL
        SSTEGVFVPT GI HTSN QP    HS  SASLP EP+V HASV  + S  PE R D RSDENEV+PPN DIR +EVL N  DN T P GS +IP AP+ 
Subjt:  SSTEGVFVPTSGIRHTSNFQPGQLVHSSPSASLPFEPDVAHASVLGNISVVPEGRTDVRSDENEVDPPNLDIRFEEVLTNFDDNLTVPLGSPKIPVAPKL

Query:  AKRKSQQN
        AK+KSQQN
Subjt:  AKRKSQQN

KAA0065844.1 putative mitochondrial protein [Cucumis melo var. makuwa]8.40e-231100Show/hide
Query:  MDESIALLTKQFSKGVRRLKNMNTAGLNAQTSNQYRRKDGENATRRYNEASNRRNGDYGKKKEGEGRFFRCRECGGVGHYQAECPTILRGQKKDYHATLS
        MDESIALLTKQFSKGVRRLKNMNTAGLNAQTSNQYRRKDGENATRRYNEASNRRNGDYGKKKEGEGRFFRCRECGGVGHYQAECPTILRGQKKDYHATLS
Subjt:  MDESIALLTKQFSKGVRRLKNMNTAGLNAQTSNQYRRKDGENATRRYNEASNRRNGDYGKKKEGEGRFFRCRECGGVGHYQAECPTILRGQKKDYHATLS

Query:  DEDTDDTKDDSSMNAFTACFTAIDFEDDSECSDEVGDEDLTFKDLKMLRKEDTEAKAIQKEIIQDLMEKNERLMKAETEATTAGSPHAKSPRRICYYCRQ
        DEDTDDTKDDSSMNAFTACFTAIDFEDDSECSDEVGDEDLTFKDLKMLRKEDTEAKAIQKEIIQDLMEKNERLMKAETEATTAGSPHAKSPRRICYYCRQ
Subjt:  DEDTDDTKDDSSMNAFTACFTAIDFEDDSECSDEVGDEDLTFKDLKMLRKEDTEAKAIQKEIIQDLMEKNERLMKAETEATTAGSPHAKSPRRICYYCRQ

Query:  KAAVRINDKDDETLNMHVDTSMLLVEVPKVDTLPNGTDINSETISKEVIVDNSELVPSAHVRKNHPPSSMICDPSTGIITKKKEKVDYSKMIVDLCYTSA
        KAAVRINDKDDETLNMHVDTSMLLVEVPKVDTLPNGTDINSETISKEVIVDNSELVPSAHVRKNHPPSSMICDPSTGIITKKKEKVDYSKMIVDLCYTSA
Subjt:  KAAVRINDKDDETLNMHVDTSMLLVEVPKVDTLPNGTDINSETISKEVIVDNSELVPSAHVRKNHPPSSMICDPSTGIITKKKEKVDYSKMIVDLCYTSA

Query:  IESSTIDVALKDKYWINAMQEKLLQFKHNNVWTLV
        IESSTIDVALKDKYWINAMQEKLLQFKHNNVWTLV
Subjt:  IESSTIDVALKDKYWINAMQEKLLQFKHNNVWTLV

TYJ98295.1 gag-pol polyprotein [Cucumis melo var. makuwa]3.99e-15743.99Show/hide
Query:  DESIALLTKQFSKGVRRLKNMNTAGLNAQTSNQYRRKDGENATRRYNEASNRRNGDYGKKKEGEGRFFRCRECGGVGHYQAECPTILRGQKKDYHATLSD
        DESIALLTKQFSK  R+ K++NTAG   +T     R DGEN TR+ N+ S RRN D+GKKKE  GR FRCREC    HYQAECPT LR QKK Y ATLSD
Subjt:  DESIALLTKQFSKGVRRLKNMNTAGLNAQTSNQYRRKDGENATRRYNEASNRRNGDYGKKKEGEGRFFRCRECGGVGHYQAECPTILRGQKKDYHATLSD

Query:  EDTDDTKDDSSMNAFTACFTAIDFEDDSECSDEVGDEDLTFKDLKMLRKEDTEAKAIQKEIIQDLMEKNERLMKA-------------ETEATTAGSPHA
        ED+ D +DD  MN FTA  T I+ E DSECSD   DE+LT ++LKMLRKED+EA+ IQKE IQDLM++NERLM+              +  ++  G    
Subjt:  EDTDDTKDDSSMNAFTACFTAIDFEDDSECSDEVGDEDLTFKDLKMLRKEDTEAKAIQKEIIQDLMEKNERLMKA-------------ETEATTAGSPHA

Query:  KSPRRICYYCRQKAAVRINDK-----DDETLNMHVDTSML----------------------LVEVPKV-------------------------------
         S R +      +  +RI        D+E LN    T  +                      L E+ +V                               
Subjt:  KSPRRICYYCRQKAAVRINDK-----DDETLNMHVDTSML----------------------LVEVPKV-------------------------------

Query:  DTLPNGTDINSETISKEVIVDNSELVPSAHVRKNHPPSSMICDPSTGIITKKKEKVDYSKMIVDLCYTSAIESSTIDVALKDKYWINAMQEKLLQFKHNN
        D  P+    NS  I+ EVI + + LVPSAHV+KNHP SS+I + S GI T+ KEK                          D+YWIN MQE+LLQFKHNN
Subjt:  DTLPNGTDINSETISKEVIVDNSELVPSAHVRKNHPPSSMICDPSTGIITKKKEKVDYSKMIVDLCYTSAIESSTIDVALKDKYWINAMQEKLLQFKHNN

Query:  VWTLV----------------------------------------EGVDFDETFAPVARLQAIRLLLRISFLKKFKLYQIDVKSDFLNGYINKEVYVAQP
        VWTLV                                        EGVDFDETFAPVARL+AIRLLLRIS  +KFKL+Q+DVKS FLNGY+N+EVYVAQP
Subjt:  VWTLV----------------------------------------EGVDFDETFAPVARLQAIRLLLRISFLKKFKLYQIDVKSDFLNGYINKEVYVAQP

Query:  NEFVNFEYHQHVYKLNKALYGLKQAPRAWYERLTIYLRCKGYSKGEADKTLFIHRTNDQLIVAQIYVDDVIFEDFPQDLVNNFIDVMKSEFEMSM-----
          F++ E+ Q+VYK+NKALYGLKQAPRAWYERL IYL  +GYSKGE DKTLFI+RT+  LIVAQIYVDD+IF  FP+ LVNNFI+++KSEFE+S+     
Subjt:  NEFVNFEYHQHVYKLNKALYGLKQAPRAWYERLTIYLRCKGYSKGEADKTLFIHRTNDQLIVAQIYVDDVIFEDFPQDLVNNFIDVMKSEFEMSM-----

Query:  -----------------------------------------------------------------------------------------ADPRTSHLEVV
                                                                                                 +DPRTSHL  V
Subjt:  -----------------------------------------------------------------------------------------ADPRTSHLEVV

Query:  KRILKYVHGTSDFRILYSYDTTSILVGYCDVD
        KRI+KYVHGT+DF ILYSYDT+S LVGYCD D
Subjt:  KRILKYVHGTSDFRILYSYDTTSILVGYCDVD

TYJ98791.1 gag-pol polyprotein [Cucumis melo var. makuwa]7.75e-17138.85Show/hide
Query:  LNAQTSNQYRRKDGENATRRYNEASNRRNGDYGKKKEGEGRFFRCRECGGVGHYQAECPTILRGQKKDYHATLSDEDTDDTKDDSSMNAFTACFTAIDFE
        +NAQTSNQYRR++ ++ T+R NE S+RR+  Y KKKEG+ R FRCRECGGV HYQAECPT LR QK ++  TLSDE  D   DDS++NAFT   T  +  
Subjt:  LNAQTSNQYRRKDGENATRRYNEASNRRNGDYGKKKEGEGRFFRCRECGGVGHYQAECPTILRGQKKDYHATLSDEDTDDTKDDSSMNAFTACFTAIDFE

Query:  DDSECSDEVGDEDLTFKDLKMLRKEDTEAKAIQKEIIQDLMEKNERLMKA-----------------------------ETEATTAGSPHAKSPR-----
        D+SECS+E   ++LT + L+ L KED EA+ IQKE IQDL+E+NE L+                               E   +   S H  S R     
Subjt:  DDSECSDEVGDEDLTFKDLKMLRKEDTEAKAIQKEIIQDLMEKNERLMKA-----------------------------ETEATTAGSPHAKSPR-----

Query:  -----------------------------------------RICYYCRQKA-------------------------------------------------
                                                 R+CYYC +KA                                                 
Subjt:  -----------------------------------------RICYYCRQKA-------------------------------------------------

Query:  ---------------------------------------------------------------------------AVR-INDKDDETLNMHVDTSMLLVE
                                                                                   A++ IND++DE  NM    +   V+
Subjt:  ---------------------------------------------------------------------------AVR-INDKDDETLNMHVDTSMLLVE

Query:  VPKVDTLPNGTDINSETISKEVIVDNSELVPSAHVRKNHPPSSMICDPSTGIITKKKEKVDYSKMIVDLCYTSAIESSTIDVALKDKYWINAMQEKLLQF
        V K D   + +    E   +E I   SEL+  AHV+KNHP SS+I DPS G+ T++KEK+DY KM+ DLCY S  E ST+D +L+D+Y +NAMQE+LLQF
Subjt:  VPKVDTLPNGTDINSETISKEVIVDNSELVPSAHVRKNHPPSSMICDPSTGIITKKKEKVDYSKMIVDLCYTSAIESSTIDVALKDKYWINAMQEKLLQF

Query:  KHNNVWTLV----------------------------------------EGVDFDETFAPVARLQAIRLLLRISFLKKFKLYQIDVKSDFLNGYINKEVY
        K NNVWTLV                                        EG+DFDETF+PVARL+AIRLLL IS ++KFKLYQ+DVKS FLNGY+N+EVY
Subjt:  KHNNVWTLV----------------------------------------EGVDFDETFAPVARLQAIRLLLRISFLKKFKLYQIDVKSDFLNGYINKEVY

Query:  VAQPNEFVNFEYHQHVYKLNKALYGLKQAPRAWYERLTIYLRCKGYSKGEADKTLFIHRTNDQLIVAQIYVDDVIFEDFPQDLVNNFIDVMKSEFEMSM-
        VAQP +FV+ E+ +HVYKLNKALYGLKQAPRAWY+RLT+YLR KGYS+GE DKTLFIHR +DQL+VAQIYVDD+IF  FPQ LVNNFI VM+SEFEMSM 
Subjt:  VAQPNEFVNFEYHQHVYKLNKALYGLKQAPRAWYERLTIYLRCKGYSKGEADKTLFIHRTNDQLIVAQIYVDDVIFEDFPQDLVNNFIDVMKSEFEMSM-

Query:  ------------------------------------ADPRTSHLEVVKRILKYVHGTSDFRILYSYDTTSILVGYCDVDGQVLLMIGKAS-LEVVSVLGT
                                            ADPR ++LE VKRILKY+HGT+DF ++Y YDTT  LVGYCD D       G     ++  +L  
Subjt:  ------------------------------------ADPRTSHLEVVKRILKYVHGTSDFRILYSYDTTSILVGYCDVDGQVLLMIGKAS-LEVVSVLGT

Query:  ILSHGSGLYMVKSTKDELAAQISSPSVQ--KVRGQRFKSTPPQRLYRLPSKKSQAKVSNKLPKQIIDFSYPPTLGTYAPNISETPLSDMDLDDLDDVPLA
        +++   G Y+            S P +Q  +VRG+RFKSTPP+R YRLPS+K Q + SN                                 D DDV L 
Subjt:  ILSHGSGLYMVKSTKDELAAQISSPSVQ--KVRGQRFKSTPPQRLYRLPSKKSQAKVSNKLPKQIIDFSYPPTLGTYAPNISETPLSDMDLDDLDDVPLA

Query:  QLLKKTNVLEVTVEMPASPSMSVHSQESSSTE-GVFVPTSGI
         LLKKT+   ++ ++P+ P  S+HSQESSST+ G  +P   I
Subjt:  QLLKKTNVLEVTVEMPASPSMSVHSQESSSTE-GVFVPTSGI

TrEMBL top hitse value%identityAlignment
A0A5A7T0Q0 Gag-pol polyprotein9.1e-13737.27Show/hide
Query:  MDESIALLTKQFSKGVRRLKNMNTAGLNAQTSNQYRRKDGENATRRYNEASNRRNGDYGKKKEGEGRFFRCRECGGVGHYQAECPTILRGQKKDYHATLS
        MDESIALLT+QF+  +R L N N  G+N QTSNQYRR++    TRR NE SNRR+  Y KKKEG+ + FRCRECGGVGHYQ ECP  +R QKK++   L 
Subjt:  MDESIALLTKQFSKGVRRLKNMNTAGLNAQTSNQYRRKDGENATRRYNEASNRRNGDYGKKKEGEGRFFRCRECGGVGHYQAECPTILRGQKKDYHATLS

Query:  DEDTDDTK-DDSSMNAFTACFTAIDFEDDSECSDEVGDEDLTFKDLKMLRKEDTEAKAIQKEIIQDLMEKNERLM-------------------------
        DE++ D++ DD ++NAF    T    +DDSECS E  +++L+ + L+ L KED EA+AIQKE IQDL+E+NERLM                         
Subjt:  DEDTDDTK-DDSSMNAFTACFTAIDFEDDSECSDEVGDEDLTFKDLKMLRKEDTEAKAIQKEIIQDLMEKNERLM-------------------------

Query:  --KAETE--------------------ATTAGSPHA----------------------------KSPRRICYYCRQKAAVRI------------------
           + TE                     T+A S  A                            KS  R CYYC +K  +R+                  
Subjt:  --KAETE--------------------ATTAGSPHA----------------------------KSPRRICYYCRQKAAVRI------------------

Query:  ------------------------------------------------------------------------NDKDDETLNMHVDTSMLLVEVPKVDTLP
                                                                                ND++DET NM    +   VE  K D   
Subjt:  ------------------------------------------------------------------------NDKDDETLNMHVDTSMLLVEVPKVDTLP

Query:  NGTDINSETISKEVIVDNSELVPSAHVRKNHPPSSMICDPSTGIITKKKEKVDYSKMIVDLCYTSAIESSTID-VALKDKYWINAMQEKLLQFKHNNVW-
        +    + +   +E+I   S+L+PSAHV+KNHP SS+I DPS G+ T++K+K+DY KM+ DLCY S IE ST+D  ALKD+YW+NAMQE+LLQF+ NNVW 
Subjt:  NGTDINSETISKEVIVDNSELVPSAHVRKNHPPSSMICDPSTGIITKKKEKVDYSKMIVDLCYTSAIESSTID-VALKDKYWINAMQEKLLQFKHNNVW-

Query:  ---------------------------------------TLVEGVDFDETFAPVARLQAIRLLLRISFLKKFKLYQIDVKSDFLNGYINKEVYVAQPNEF
                                               T VEGVDFDETFAPVARL+AIRLLL IS ++KFKLYQ+DVKSDFLNGY+N+EVYVAQP  F
Subjt:  ---------------------------------------TLVEGVDFDETFAPVARLQAIRLLLRISFLKKFKLYQIDVKSDFLNGYINKEVYVAQPNEF

Query:  VNFEYHQHVYKLNKALYGLKQAPRAWYERLTIYLRCKGYSKGEADKTLFIHRTNDQLIVAQIYVDDVIFEDFPQDLVNN---------------------
        V+FE+ +HVYKLNKALYGLKQAP AWY+RLT+YLR +GYS+GE DKTLFI   +DQL+V QIYVDD+IF  FPQ L  N                     
Subjt:  VNFEYHQHVYKLNKALYGLKQAPRAWYERLTIYLRCKGYSKGEADKTLFIHRTNDQLIVAQIYVDDVIFEDFPQDLVNN---------------------

Query:  ------------FIDVMKSEFEMSM-------ADPRTSHLEVVKRILKYVH--------------------GTSDFRILYSYDTTSILVGYCDVDGQVLL
                    ++   + +   ++       A+ R +HLE VKRILKYV                        +  + Y +D  +I++ YCD   ++ +
Subjt:  ------------FIDVMKSEFEMSM-------ADPRTSHLEVVKRILKYVH--------------------GTSDFRILYSYDTTSILVGYCDVDGQVLL

Query:  MIGKAS--------------LEVVSVLGTILSH-GSGLYMVKSTKDELAA----QISSPSVQ--KVRGQRFKSTPPQRLYRLPSKKSQAKVSNKLPK---
                             E+V      L H  S L +       L A     +S+  VQ  +VRG RFK TPP+R YRL S+K Q +  ++L +   
Subjt:  MIGKAS--------------LEVVSVLGTILSH-GSGLYMVKSTKDELAA----QISSPSVQ--KVRGQRFKSTPPQRLYRLPSKKSQAKVSNKLPK---

Query:  -----QIIDFSYPPTLGTYAPNISETPLSDMDLDDLDDVPLAQLLKKTNVLEVTVEMPASPSMSVHSQESSSTE
             ++ +   P +   +A   SE  +SDMD DD D+V L +LLKK +       +P+ P  S+H QESSSTE
Subjt:  -----QIIDFSYPPTLGTYAPNISETPLSDMDLDDLDDVPLAQLLKKTNVLEVTVEMPASPSMSVHSQESSSTE

A0A5A7VEW7 Putative mitochondrial protein1.8e-185100Show/hide
Query:  MDESIALLTKQFSKGVRRLKNMNTAGLNAQTSNQYRRKDGENATRRYNEASNRRNGDYGKKKEGEGRFFRCRECGGVGHYQAECPTILRGQKKDYHATLS
        MDESIALLTKQFSKGVRRLKNMNTAGLNAQTSNQYRRKDGENATRRYNEASNRRNGDYGKKKEGEGRFFRCRECGGVGHYQAECPTILRGQKKDYHATLS
Subjt:  MDESIALLTKQFSKGVRRLKNMNTAGLNAQTSNQYRRKDGENATRRYNEASNRRNGDYGKKKEGEGRFFRCRECGGVGHYQAECPTILRGQKKDYHATLS

Query:  DEDTDDTKDDSSMNAFTACFTAIDFEDDSECSDEVGDEDLTFKDLKMLRKEDTEAKAIQKEIIQDLMEKNERLMKAETEATTAGSPHAKSPRRICYYCRQ
        DEDTDDTKDDSSMNAFTACFTAIDFEDDSECSDEVGDEDLTFKDLKMLRKEDTEAKAIQKEIIQDLMEKNERLMKAETEATTAGSPHAKSPRRICYYCRQ
Subjt:  DEDTDDTKDDSSMNAFTACFTAIDFEDDSECSDEVGDEDLTFKDLKMLRKEDTEAKAIQKEIIQDLMEKNERLMKAETEATTAGSPHAKSPRRICYYCRQ

Query:  KAAVRINDKDDETLNMHVDTSMLLVEVPKVDTLPNGTDINSETISKEVIVDNSELVPSAHVRKNHPPSSMICDPSTGIITKKKEKVDYSKMIVDLCYTSA
        KAAVRINDKDDETLNMHVDTSMLLVEVPKVDTLPNGTDINSETISKEVIVDNSELVPSAHVRKNHPPSSMICDPSTGIITKKKEKVDYSKMIVDLCYTSA
Subjt:  KAAVRINDKDDETLNMHVDTSMLLVEVPKVDTLPNGTDINSETISKEVIVDNSELVPSAHVRKNHPPSSMICDPSTGIITKKKEKVDYSKMIVDLCYTSA

Query:  IESSTIDVALKDKYWINAMQEKLLQFKHNNVWTLV
        IESSTIDVALKDKYWINAMQEKLLQFKHNNVWTLV
Subjt:  IESSTIDVALKDKYWINAMQEKLLQFKHNNVWTLV

A0A5D3BJA9 Gag-pol polyprotein3.7e-14638.63Show/hide
Query:  LKNMNTAGLNAQTSNQYRRKDGENATRRYNEASNRRNGDYGKKKEGEGRFFRCRECGGVGHYQAECPTILRGQKKDYHATLSDEDTDDTKDDSSMNAFTA
        +  ++   +NAQTSNQYRR++ ++ T+R NE S+RR+  Y KKKEG+ R FRCRECGGV HYQAECPT LR QK ++  TLSDE  D   DDS++NAFT 
Subjt:  LKNMNTAGLNAQTSNQYRRKDGENATRRYNEASNRRNGDYGKKKEGEGRFFRCRECGGVGHYQAECPTILRGQKKDYHATLSDEDTDDTKDDSSMNAFTA

Query:  CFTAIDFEDDSECSDEVGDEDLTFKDLKMLRKEDTEAKAIQKEIIQDLMEKNERLMKA-----------------------------ETEATTAGSPHAK
          T  +  D+SECS+E   ++LT + L+ L KED EA+ IQKE IQDL+E+NE L+                               E   +   S H  
Subjt:  CFTAIDFEDDSECSDEVGDEDLTFKDLKMLRKEDTEAKAIQKEIIQDLMEKNERLMKA-----------------------------ETEATTAGSPHAK

Query:  SPR----------------------------------------------RICYYC---------------------------------------------
        S R                                              R+CYYC                                             
Subjt:  SPR----------------------------------------------RICYYC---------------------------------------------

Query:  -------------------------------------RQK-------------------------------------------AAVRINDKDDETLNMHV
                                             RQK                                           A  +IND++DE  NM  
Subjt:  -------------------------------------RQK-------------------------------------------AAVRINDKDDETLNMHV

Query:  DTSMLLVEVPKVDTLPNGTDINSETISKEVIVDNSELVPSAHVRKNHPPSSMICDPSTGIITKKKEKVDYSKMIVDLCYTSAIESSTIDVALKDKYWINA
          +   V+V K D   + +    E   +E I   SEL+  AHV+KNHP SS+I DPS G+ T++KEK+DY KM+ DLCY S  E ST+D +L+D+Y +NA
Subjt:  DTSMLLVEVPKVDTLPNGTDINSETISKEVIVDNSELVPSAHVRKNHPPSSMICDPSTGIITKKKEKVDYSKMIVDLCYTSAIESSTIDVALKDKYWINA

Query:  MQEKLLQFKHNNVWTL----------------------------------------VEGVDFDETFAPVARLQAIRLLLRISFLKKFKLYQIDVKSDFLN
        MQE+LLQFK NNVWTL                                        VEG+DFDETF+PVARL+AIRLLL IS ++KFKLYQ+DVKS FLN
Subjt:  MQEKLLQFKHNNVWTL----------------------------------------VEGVDFDETFAPVARLQAIRLLLRISFLKKFKLYQIDVKSDFLN

Query:  GYINKEVYVAQPNEFVNFEYHQHVYKLNKALYGLKQAPRAWYERLTIYLRCKGYSKGEADKTLFIHRTNDQLIVAQIYVDDVIFEDFPQDLVNNFIDVMK
        GY+N+EVYVAQP +FV+ E+ +HVYKLNKALYGLKQAPRAWY+RLT+YLR KGYS+GE DKTLFIHR +DQL+VAQIYVDD+IF  FPQ LVNNFI VM+
Subjt:  GYINKEVYVAQPNEFVNFEYHQHVYKLNKALYGLKQAPRAWYERLTIYLRCKGYSKGEADKTLFIHRTNDQLIVAQIYVDDVIFEDFPQDLVNNFIDVMK

Query:  SEFEMSM-------------------------------------ADPRTSHLEVVKRILKYVHGTSDFRILYSYDTTSILVGYCDVDGQVLLMIGKA-SL
        SEFEMSM                                     ADPR ++LE VKRILKY+HGT+DF ++Y YDTT  LVGYCD D       G     
Subjt:  SEFEMSM-------------------------------------ADPRTSHLEVVKRILKYVHGTSDFRILYSYDTTSILVGYCDVDGQVLLMIGKA-SL

Query:  EVVSVLGTILSHGSGLYMVKSTKDELAAQISSPSVQ--KVRGQRFKSTPPQRLYRLPSKKSQAKVSNKLPKQIIDFSYPPTLGTYAPNISETPLSDMDLD
        ++  +L  +++   G Y+            S P +Q  +VRG+RFKSTPP+R YRLPS+K Q + SN                                 
Subjt:  EVVSVLGTILSHGSGLYMVKSTKDELAAQISSPSVQ--KVRGQRFKSTPPQRLYRLPSKKSQAKVSNKLPKQIIDFSYPPTLGTYAPNISETPLSDMDLD

Query:  DLDDVPLAQLLKKTNVLEVTVEMPASPSMSVHSQESSST-EGVFVPTSGI
        D DDV L  LLKKT+   ++ ++P+ P  S+HSQESSST +G  +P   I
Subjt:  DLDDVPLAQLLKKTNVLEVTVEMPASPSMSVHSQESSST-EGVFVPTSGI

A0A5D3BPJ2 Gag-pol polyprotein2.2e-13538.9Show/hide
Query:  MNTAGLNAQTSNQYRRKDGENATRRYNEASNRRNGDYGKKKEGEGRFFRCRECGGVGHYQAECPTILRGQKKDYHATLSDEDTDDTKDDSSMNAFTACFT
        MNTA    +T     + DG N+TR+ ++ S+RRN D+ KKKE  GR FRCREC G GHYQAECPT LR QKK+Y+ATL+DED+DD + D   NAFTAC T
Subjt:  MNTAGLNAQTSNQYRRKDGENATRRYNEASNRRNGDYGKKKEGEGRFFRCRECGGVGHYQAECPTILRGQKKDYHATLSDEDTDDTKDDSSMNAFTACFT

Query:  AIDFEDDSECSDEVGDEDLTFKDLKMLRKEDTEAKAIQKEIIQDLMEKNERL----------------------------------------MKAETE--
         I+ EDD+ECSD   DE+LT + LKMLRKED++A+AIQKE IQ L+E+NER                                         +K  TE  
Subjt:  AIDFEDDSECSDEVGDEDLTFKDLKMLRKEDTEAKAIQKEIIQDLMEKNERL----------------------------------------MKAETE--

Query:  -ATTAGSPHAKSPRRI--------------------------------------------------------------CYYCRQKA-AVRINDKDDETLN
         AT   +  AKS R +                                                              CY    +    + N K D+ + 
Subjt:  -ATTAGSPHAKSPRRI--------------------------------------------------------------CYYCRQKA-AVRINDKDDETLN

Query:  M---------HVDTSMLLVEVPKVDTLPNGTDINSETISKEVIVDNSELVP--SAHVRKNHP--PSSMICDPSTGIITKKKEKVDYSKMIVDLCYTSAIE
        +          V    L + +  ++ + N  D  S      +  D + ++P  ++ + K  P   S +  DPS GI T++KEKVDY+KMIVDLCY S IE
Subjt:  M---------HVDTSMLLVEVPKVDTLPNGTDINSETISKEVIVDNSELVP--SAHVRKNHP--PSSMICDPSTGIITKKKEKVDYSKMIVDLCYTSAIE

Query:  SSTIDVALKDKYWINAMQEKLLQFKHNNVWTLV------EGVDFDETFAPVARLQAIRLLLRISFLKKFKLYQIDVKSDFLNGYINKEVYVAQPNEFVNF
         ++I+ ALKD+YWINAMQE+LLQFK NN+WTLV      EG++FDETFAPVARL+AIRLLL IS + KFKLYQ+DVKS F+NGY+N+EV+V QP E VN 
Subjt:  SSTIDVALKDKYWINAMQEKLLQFKHNNVWTLV------EGVDFDETFAPVARLQAIRLLLRISFLKKFKLYQIDVKSDFLNGYINKEVYVAQPNEFVNF

Query:  EYHQHVYKLNKALYGLKQAPRAWYERLTIYLRCKGYSKGEADKTLFIHRTNDQLIVAQIYVDDVIFEDFPQDLVNNFIDVMKSEFEMSMADPRTSHLEVV
        E+ +HVYKLNKALYGLKQAPRAWYERLTIYL  KGYS+G  DKTLFI+RT+ +LIVAQIYVDD+IF  FP+ LV+NFI++MKSEFEMSM    +  L   
Subjt:  EYHQHVYKLNKALYGLKQAPRAWYERLTIYLRCKGYSKGEADKTLFIHRTNDQLIVAQIYVDDVIFEDFPQDLVNNFIDVMKSEFEMSMADPRTSHLEVV

Query:  KRILKYVHGTSDFRILYSYDTTSILVGYCDVDGQVLLMIGKASLEVVSVLGTILSHGSGLYMVKSTKDELAAQI-------SSPSVQKVRGQR--FKSTP
           L+    + D  I      T                             T+  HG    ++ +++ ++A  +       S P    +   +   K   
Subjt:  KRILKYVHGTSDFRILYSYDTTSILVGYCDVDGQVLLMIGKASLEVVSVLGTILSHGSGLYMVKSTKDELAAQI-------SSPSVQKVRGQR--FKSTP

Query:  PQRLYRLPSKKSQA----KVSNKLPKQII--------DFSYPPTLGTYAPNISETPLSDMDLDDLDDVPLAQLLKKTNVLEVTVEMPASPSMSVHSQESS
         +  +RL   + Q+    + S+KL + ++        + S P +   +AP +  T +SDMD DD DDVPLA+L K+T + +V+ ++P  P  S+HSQE+ 
Subjt:  PQRLYRLPSKKSQA----KVSNKLPKQII--------DFSYPPTLGTYAPNISETPLSDMDLDDLDDVPLAQLLKKTNVLEVTVEMPASPSMSVHSQESS

Query:  STEGVFVPTSGIRHTSNFQPGQLVHSSPSASLPFEPDVAHASVL----GNISVVPEGRTDVRSDENEVDPPNLDIRFEEVLTNFDDNLTVPLGSPKIPVA
                                   P+   PF    AH S+     G+I   P  + + R +E+EV+P  L+I  EEV  N DDN  +   S  IP  
Subjt:  STEGVFVPTSGIRHTSNFQPGQLVHSSPSASLPFEPDVAHASVL----GNISVVPEGRTDVRSDENEVDPPNLDIRFEEVLTNFDDNLTVPLGSPKIPVA

Query:  PKLAKRKSQQ
         K A+RK QQ
Subjt:  PKLAKRKSQQ

A0A5D3DZD2 Gag-pol polyprotein2.9e-13538Show/hide
Query:  MDESIALLTKQFSKGVRRLKNMNTAGLNAQTSNQYRRKDGENATRRYNEASNRRNGDYGKKKEGEGRFFRCRECGGVGHYQAECPTILRGQKKDYHATLS
        MDESIALLT+QF+  +R L N N  G+N QTSNQYRR++    TRR NE SNRR+ DY KKKEG+ + FRCRECGGVGHYQ ECP  LR QKK++  TL 
Subjt:  MDESIALLTKQFSKGVRRLKNMNTAGLNAQTSNQYRRKDGENATRRYNEASNRRNGDYGKKKEGEGRFFRCRECGGVGHYQAECPTILRGQKKDYHATLS

Query:  DEDTDDTK-DDSSMNAFTACFTAIDFEDDSECSDEVGDEDLTFKDLKMLRKEDTEAKAIQKEIIQDLMEKNERLM-------------------------
        DE++ +++ DD ++NAF    T    +DDSECS E  +++L+ + L+ L KED EA+AIQKE IQDL+E+NERLM                         
Subjt:  DEDTDDTK-DDSSMNAFTACFTAIDFEDDSECSDEVGDEDLTFKDLKMLRKEDTEAKAIQKEIIQDLMEKNERLM-------------------------

Query:  --KAETE--------------------ATTAGSPHA----------------------------KSPRRICYYCRQKAAVRI------------------
           + TE                     T+A S  A                            KS  R CYYC +K  +R+                  
Subjt:  --KAETE--------------------ATTAGSPHA----------------------------KSPRRICYYCRQKAAVRI------------------

Query:  ------------------------------------------------------------------------NDKDDETLNMHVDTSMLLVEVPKVDTLP
                                                                                ND++DET NM    +   VE  K D   
Subjt:  ------------------------------------------------------------------------NDKDDETLNMHVDTSMLLVEVPKVDTLP

Query:  NGTDINSETISKEVIVDNSELVPSAHVRKNHPPSSMICDPSTGIITKKKEKVDYSKMIVDLCYTSAIESSTID-VALKDKYWINAMQEKLLQFKHNNVW-
        +  + + +  S+E+I   S+L+ SAHV+KNHP SS+I DPS G+ T++K+K+DY KM+ DLCY S+IE ST+D  A KD+YW+NAMQE+LL+ + NNVW 
Subjt:  NGTDINSETISKEVIVDNSELVPSAHVRKNHPPSSMICDPSTGIITKKKEKVDYSKMIVDLCYTSAIESSTID-VALKDKYWINAMQEKLLQFKHNNVW-

Query:  ---------------------------------------TLVEGVDFDETFAPVARLQAIRLLLRISFLKKFKLYQIDVKSDFLNGYINKEVYVAQPNEF
                                               T VEGVDFDETFAPVARL+AIRLLL IS ++KFKLYQ+DVKSDFLNGY+N+EVYVAQP  F
Subjt:  ---------------------------------------TLVEGVDFDETFAPVARLQAIRLLLRISFLKKFKLYQIDVKSDFLNGYINKEVYVAQPNEF

Query:  VNFEYHQHVYKLNKALYGLKQAPRAWYERLTIYLRCKGYSKGEADKTLFIHRTNDQLIVAQIYVDDVIFEDFPQDL-VNNFIDVMKSEFEMSM-------
        V+FE+ +HVYKLNKALYGLKQAP AWY+RLT+YLR +GYS+GE DKTLFI   +DQL+VAQIYVDD+IF  FPQ L  N+ I + + ++  +M       
Subjt:  VNFEYHQHVYKLNKALYGLKQAPRAWYERLTIYLRCKGYSKGEADKTLFIHRTNDQLIVAQIYVDDVIFEDFPQDL-VNNFIDVMKSEFEMSM-------

Query:  --------------------ADPRTSHLEVVKRILKYVHGTSDFRILYSYDTTSILVGYCDVDGQVLLMIGKASLEVVSVLGTILSHGSGLYMVKSTKDE
                            A+ R +HLE VKRILKY HGTSDF ++                                                     
Subjt:  --------------------ADPRTSHLEVVKRILKYVHGTSDFRILYSYDTTSILVGYCDVDGQVLLMIGKASLEVVSVLGTILSHGSGLYMVKSTKDE

Query:  LAAQISSPSVQKVRGQRFKSTPPQRLYRLPSKKSQAKVSNKLPKQIIDFSYPPTLGTYAP--------NISETPLSDMDLDDLDDVPLAQLLKKTNVLEV
                    VRG RFK T  +R YRL S+K Q +  ++L + +     P    + AP          SE  +SDMD DD D+V L +LLKK +    
Subjt:  LAAQISSPSVQKVRGQRFKSTPPQRLYRLPSKKSQAKVSNKLPKQIIDFSYPPTLGTYAP--------NISETPLSDMDLDDLDDVPLAQLLKKTNVLEV

Query:  TVEMPASPSMSVHSQESSSTE
           +P+ P  S+  QESSST+
Subjt:  TVEMPASPSMSVHSQESSSTE

SwissProt top hitse value%identityAlignment
P04146 Copia protein4.1e-1735.33Show/hide
Query:  VDFDETFAPVARLQAIRLLLRISFLKKFKLYQIDVKSDFLNGYINKEVYVAQPNEFVNFEYHQHVYKLNKALYGLKQAPRAWYERLTIYLRCKGYSKGEA
        +D++ETFAPVAR+ + R +L +      K++Q+DVK+ FLNG + +E+Y+  P          +V KLNKA+YGLKQA R W+E     L+   +     
Subjt:  VDFDETFAPVARLQAIRLLLRISFLKKFKLYQIDVKSDFLNGYINKEVYVAQPNEFVNFEYHQHVYKLNKALYGLKQAPRAWYERLTIYLRCKGYSKGEA

Query:  DKTLFI---HRTNDQLIVAQIYVDDVIFEDFPQDLVNNFIDVMKSEFEMS
        D+ ++I      N+ + V  +YVDDV+        +NNF   +  +F M+
Subjt:  DKTLFI---HRTNDQLIVAQIYVDDVIFEDFPQDLVNNFIDVMKSEFEMS

P0C2J7 Transposon Ty4-H Gag-Pol polyprotein9.7e-1130.08Show/hide
Query:  IRLLLRISFLKKFKLYQIDVKSDFLNGYINKEVYVAQPNEFVNFEYHQHVYKLNKALYGLKQAPRAWYERLTIYLRCKGYSKGEADKTLFIHRTNDQLIV
        I++ L I+  +   +  +D+   FL   + +E+Y+  P++       + V KLNKALYGLKQ+P+ W + L  YL   G    +   T  +++T D+ ++
Subjt:  IRLLLRISFLKKFKLYQIDVKSDFLNGYINKEVYVAQPNEFVNFEYHQHVYKLNKALYGLKQAPRAWYERLTIYLRCKGYSKGEADKTLFIHRTNDQLIV

Query:  AQIYVDDVIFEDFPQDLVNNFIDVMKSEFEMSM
          +YVDD +     +  ++ FI+ +KS FE+ +
Subjt:  AQIYVDDVIFEDFPQDLVNNFIDVMKSEFEMSM

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-945.1e-2033.77Show/hide
Query:  EGVDFDETFAPVARLQAIRLLLRISFLKKFKLYQIDVKSDFLNGYINKEVYVAQPNEFVNFEYHQHVYKLNKALYGLKQAPRAWYERLTIYLRCKGYSKG
        +G+DFDE F+PV ++ +IR +L ++     ++ Q+DVK+ FL+G + +E+Y+ QP  F        V KLNK+LYGLKQAPR WY +   +++ + Y K 
Subjt:  EGVDFDETFAPVARLQAIRLLLRISFLKKFKLYQIDVKSDFLNGYINKEVYVAQPNEFVNFEYHQHVYKLNKALYGLKQAPRAWYERLTIYLRCKGYSKG

Query:  EADKTLFIHR-TNDQLIVAQIYVDDVIFEDFPQDLVNNFIDVMKSEFEMSMADP
         +D  ++  R + +  I+  +YVDD++     + L+      +   F+M    P
Subjt:  EADKTLFIHR-TNDQLIVAQIYVDDVIFEDFPQDLVNNFIDVMKSEFEMSMADP

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE16.5e-2330.04Show/hide
Query:  ESSTIDVALKDKYWINAMQEKLLQFKHNNVWTLV-----------------------------------------EGVDFDETFAPVARLQAIRLLLRIS
        E  T   ALKD+ W NAM  ++     N+ W LV                                          G+D+ ETF+PV +  +IR++L ++
Subjt:  ESSTIDVALKDKYWINAMQEKLLQFKHNNVWTLV-----------------------------------------EGVDFDETFAPVARLQAIRLLLRIS

Query:  FLKKFKLYQIDVKSDFLNGYINKEVYVAQPNEFVNFEYHQHVYKLNKALYGLKQAPRAWYERLTIYLRCKGYSKGEADKTLFIHRTNDQLIVAQIYVDDV
          + + + Q+DV + FL G +  +VY++QP  F++ +   +V KL KALYGLKQAPRAWY  L  YL   G+    +D +LF+ +    ++   +YVDD+
Subjt:  FLKKFKLYQIDVKSDFLNGYINKEVYVAQPNEFVNFEYHQHVYKLNKALYGLKQAPRAWYERLTIYLRCKGYSKGEADKTLFIHRTNDQLIVAQIYVDDV

Query:  IFEDFPQDLVNNFIDVMKSEFEM
        +       L++N +D +   F +
Subjt:  IFEDFPQDLVNNFIDVMKSEFEM

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE25.5e-2236.05Show/hide
Query:  GVDFDETFAPVARLQAIRLLLRISFLKKFKLYQIDVKSDFLNGYINKEVYVAQPNEFVNFEYHQHVYKLNKALYGLKQAPRAWYERLTIYLRCKGYSKGE
        G+D+ ETF+PV +  +IR++L ++  + + + Q+DV + FL G +  EVY++QP  FV+ +   +V +L KA+YGLKQAPRAWY  L  YL   G+    
Subjt:  GVDFDETFAPVARLQAIRLLLRISFLKKFKLYQIDVKSDFLNGYINKEVYVAQPNEFVNFEYHQHVYKLNKALYGLKQAPRAWYERLTIYLRCKGYSKGE

Query:  ADKTLFIHRTNDQLIVAQIYVDDVIFEDFPQDLVNNFIDVMKSEFEM
        +D +LF+ +    +I   +YVDD++       L+ + +D +   F +
Subjt:  ADKTLFIHRTNDQLIVAQIYVDDVIFEDFPQDLVNNFIDVMKSEFEM

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 83.4e-1932.92Show/hide
Query:  WTLVEGVDFDETFAPVARLQAIRLLLRISFLKKFKLYQIDVKSDFLNGYINKEVYVAQPNEFVNFEYH----QHVYKLNKALYGLKQAPRAWYERLTIYL
        +T  EG+DF ETF+PV +L +++L+L IS +  F L+Q+D+ + FLNG +++E+Y+  P  +   +        V  L K++YGLKQA R W+ + ++ L
Subjt:  WTLVEGVDFDETFAPVARLQAIRLLLRISFLKKFKLYQIDVKSDFLNGYINKEVYVAQPNEFVNFEYH----QHVYKLNKALYGLKQAPRAWYERLTIYL

Query:  RCKGYSKGEADKTLFIHRTNDQLIVAQIYVDDVIFEDFPQDLVNNFIDVMKSEFEMSMADP
           G+ +  +D T F+  T    +   +YVDD+I        V+     +KS F++    P
Subjt:  RCKGYSKGEADKTLFIHRTNDQLIVAQIYVDDVIFEDFPQDLVNNFIDVMKSEFEMSMADP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATGAGTCTATAGCTCTACTGACCAAGCAGTTCTCTAAAGGGGTCAGAAGATTAAAAAATATGAATACTGCAGGATTGAATGCTCAAACTTCAAATCAATATCGAAG
AAAGGATGGTGAGAACGCTACTAGAAGGTATAATGAAGCCTCAAATAGGAGGAATGGTGATTATGGAAAGAAAAAGGAGGGAGAAGGAAGGTTTTTCAGATGTAGAGAAT
GTGGGGGCGTTGGGCATTACCAGGCTGAATGTCCTACAATTTTAAGAGGACAAAAGAAAGATTATCATGCTACCTTGTCAGATGAGGACACTGATGATACTAAAGATGAC
AGTAGCATGAATGCCTTCACAGCATGTTTTACCGCAATTGATTTCGAAGACGATAGTGAATGTTCTGATGAAGTTGGTGATGAAGATCTAACTTTTAAAGATCTCAAGAT
GTTGAGGAAAGAAGACACTGAAGCCAAAGCAATTCAAAAAGAAATAATTCAAGATCTAATGGAAAAAAATGAACGATTAATGAAAGCTGAAACTGAAGCAACTACTGCCG
GTAGTCCTCATGCTAAATCTCCCAGAAGGATTTGTTATTATTGTAGACAAAAAGCTGCCGTACGAATTAATGATAAGGATGATGAGACTCTAAATATGCATGTGGATACT
TCTATGCTTCTTGTGGAAGTACCTAAAGTTGATACTCTGCCAAATGGTACTGATATAAACTCAGAAACGATATCTAAGGAAGTTATAGTTGATAACTCTGAACTTGTTCC
ATCTGCACATGTGAGAAAGAATCATCCACCAAGTTCTATGATATGTGATCCTTCAACTGGAATTATTACCAAAAAGAAAGAGAAAGTAGATTACTCGAAGATGATTGTTG
ATTTATGTTATACATCTGCCATTGAATCTTCAACTATTGATGTTGCTCTTAAAGATAAATATTGGATAAACGCAATGCAAGAAAAACTACTTCAATTCAAGCATAACAAC
GTCTGGACATTGGTCGAGGGGGTTGACTTTGATGAAACATTTGCGCCTGTTGCTAGGCTTCAAGCCATTCGCTTGTTACTCAGAATATCTTTTTTAAAAAAATTTAAATT
ATATCAAATTGATGTAAAAAGTGATTTCCTGAATGGTTACATAAACAAAGAAGTCTATGTTGCTCAACCCAACGAATTTGTCAATTTTGAGTATCATCAGCATGTGTATA
AGCTTAATAAAGCTCTATATGGGCTTAAGCAAGCTCCTAGAGCTTGGTACGAACGATTGACGATCTATTTAAGATGTAAAGGGTATTCCAAAGGCGAGGCTGACAAAACA
TTATTTATTCACAGAACAAATGATCAACTCATTGTTGCTCAAATCTATGTTGATGATGTCATATTTGAGGATTTTCCTCAAGATCTTGTTAATAACTTCATTGATGTCAT
GAAGTCAGAATTTGAGATGAGCATGGCTGATCCTCGAACATCACATTTAGAAGTTGTTAAACGGATCCTCAAGTATGTTCATGGAACAAGTGATTTTAGAATTCTGTATT
CCTATGACACGACTTCGATTTTGGTTGGATATTGCGATGTCGATGGGCAGGTTCTTCTGATGATAGGAAAAGCATCTCTAGAGGTTGTTTCTGTCTTGGGAACAATCTTA
TCTCATGGTTCAGGTTTGTATATGGTGAAATCGACCAAAGACGAACTTGCAGCTCAAATTTCATCACCCTCTGTACAAAAGGTGAGAGGCCAGAGGTTCAAAAGTACTCC
ACCACAGAGATTGTATCGTCTCCCATCCAAAAAATCTCAAGCGAAGGTTTCAAACAAACTACCTAAACAGATTATTGATTTCTCTTATCCTCCGACTTTAGGTACATATG
CACCAAATATCTCTGAGACTCCTTTGTCTGATATGGATTTAGATGACTTGGATGATGTTCCTTTGGCTCAATTGTTGAAGAAGACCAATGTTCTTGAGGTTACTGTTGAA
ATGCCTGCATCTCCTTCCATGTCTGTTCATTCTCAGGAAAGCTCGTCGACTGAAGGAGTGTTTGTTCCTACTTCTGGTATTCGTCATACTTCTAATTTTCAACCTGGACA
GTTAGTTCATTCTTCTCCTTCTGCGTCACTTCCCTTTGAACCGGATGTTGCCCATGCATCTGTTCTTGGTAATATTTCTGTTGTACCTGAAGGGAGAACTGATGTTCGAA
GTGATGAGAATGAGGTGGATCCTCCAAATCTTGACATTCGCTTTGAAGAAGTTCTTACCAATTTTGATGATAATCTCACTGTTCCACTTGGTTCTCCTAAAATCCCCGTT
GCACCAAAGCTAGCAAAGAGGAAATCTCAACAAAATTGA
mRNA sequenceShow/hide mRNA sequence
ATGGATGAGTCTATAGCTCTACTGACCAAGCAGTTCTCTAAAGGGGTCAGAAGATTAAAAAATATGAATACTGCAGGATTGAATGCTCAAACTTCAAATCAATATCGAAG
AAAGGATGGTGAGAACGCTACTAGAAGGTATAATGAAGCCTCAAATAGGAGGAATGGTGATTATGGAAAGAAAAAGGAGGGAGAAGGAAGGTTTTTCAGATGTAGAGAAT
GTGGGGGCGTTGGGCATTACCAGGCTGAATGTCCTACAATTTTAAGAGGACAAAAGAAAGATTATCATGCTACCTTGTCAGATGAGGACACTGATGATACTAAAGATGAC
AGTAGCATGAATGCCTTCACAGCATGTTTTACCGCAATTGATTTCGAAGACGATAGTGAATGTTCTGATGAAGTTGGTGATGAAGATCTAACTTTTAAAGATCTCAAGAT
GTTGAGGAAAGAAGACACTGAAGCCAAAGCAATTCAAAAAGAAATAATTCAAGATCTAATGGAAAAAAATGAACGATTAATGAAAGCTGAAACTGAAGCAACTACTGCCG
GTAGTCCTCATGCTAAATCTCCCAGAAGGATTTGTTATTATTGTAGACAAAAAGCTGCCGTACGAATTAATGATAAGGATGATGAGACTCTAAATATGCATGTGGATACT
TCTATGCTTCTTGTGGAAGTACCTAAAGTTGATACTCTGCCAAATGGTACTGATATAAACTCAGAAACGATATCTAAGGAAGTTATAGTTGATAACTCTGAACTTGTTCC
ATCTGCACATGTGAGAAAGAATCATCCACCAAGTTCTATGATATGTGATCCTTCAACTGGAATTATTACCAAAAAGAAAGAGAAAGTAGATTACTCGAAGATGATTGTTG
ATTTATGTTATACATCTGCCATTGAATCTTCAACTATTGATGTTGCTCTTAAAGATAAATATTGGATAAACGCAATGCAAGAAAAACTACTTCAATTCAAGCATAACAAC
GTCTGGACATTGGTCGAGGGGGTTGACTTTGATGAAACATTTGCGCCTGTTGCTAGGCTTCAAGCCATTCGCTTGTTACTCAGAATATCTTTTTTAAAAAAATTTAAATT
ATATCAAATTGATGTAAAAAGTGATTTCCTGAATGGTTACATAAACAAAGAAGTCTATGTTGCTCAACCCAACGAATTTGTCAATTTTGAGTATCATCAGCATGTGTATA
AGCTTAATAAAGCTCTATATGGGCTTAAGCAAGCTCCTAGAGCTTGGTACGAACGATTGACGATCTATTTAAGATGTAAAGGGTATTCCAAAGGCGAGGCTGACAAAACA
TTATTTATTCACAGAACAAATGATCAACTCATTGTTGCTCAAATCTATGTTGATGATGTCATATTTGAGGATTTTCCTCAAGATCTTGTTAATAACTTCATTGATGTCAT
GAAGTCAGAATTTGAGATGAGCATGGCTGATCCTCGAACATCACATTTAGAAGTTGTTAAACGGATCCTCAAGTATGTTCATGGAACAAGTGATTTTAGAATTCTGTATT
CCTATGACACGACTTCGATTTTGGTTGGATATTGCGATGTCGATGGGCAGGTTCTTCTGATGATAGGAAAAGCATCTCTAGAGGTTGTTTCTGTCTTGGGAACAATCTTA
TCTCATGGTTCAGGTTTGTATATGGTGAAATCGACCAAAGACGAACTTGCAGCTCAAATTTCATCACCCTCTGTACAAAAGGTGAGAGGCCAGAGGTTCAAAAGTACTCC
ACCACAGAGATTGTATCGTCTCCCATCCAAAAAATCTCAAGCGAAGGTTTCAAACAAACTACCTAAACAGATTATTGATTTCTCTTATCCTCCGACTTTAGGTACATATG
CACCAAATATCTCTGAGACTCCTTTGTCTGATATGGATTTAGATGACTTGGATGATGTTCCTTTGGCTCAATTGTTGAAGAAGACCAATGTTCTTGAGGTTACTGTTGAA
ATGCCTGCATCTCCTTCCATGTCTGTTCATTCTCAGGAAAGCTCGTCGACTGAAGGAGTGTTTGTTCCTACTTCTGGTATTCGTCATACTTCTAATTTTCAACCTGGACA
GTTAGTTCATTCTTCTCCTTCTGCGTCACTTCCCTTTGAACCGGATGTTGCCCATGCATCTGTTCTTGGTAATATTTCTGTTGTACCTGAAGGGAGAACTGATGTTCGAA
GTGATGAGAATGAGGTGGATCCTCCAAATCTTGACATTCGCTTTGAAGAAGTTCTTACCAATTTTGATGATAATCTCACTGTTCCACTTGGTTCTCCTAAAATCCCCGTT
GCACCAAAGCTAGCAAAGAGGAAATCTCAACAAAATTGA
Protein sequenceShow/hide protein sequence
MDESIALLTKQFSKGVRRLKNMNTAGLNAQTSNQYRRKDGENATRRYNEASNRRNGDYGKKKEGEGRFFRCRECGGVGHYQAECPTILRGQKKDYHATLSDEDTDDTKDD
SSMNAFTACFTAIDFEDDSECSDEVGDEDLTFKDLKMLRKEDTEAKAIQKEIIQDLMEKNERLMKAETEATTAGSPHAKSPRRICYYCRQKAAVRINDKDDETLNMHVDT
SMLLVEVPKVDTLPNGTDINSETISKEVIVDNSELVPSAHVRKNHPPSSMICDPSTGIITKKKEKVDYSKMIVDLCYTSAIESSTIDVALKDKYWINAMQEKLLQFKHNN
VWTLVEGVDFDETFAPVARLQAIRLLLRISFLKKFKLYQIDVKSDFLNGYINKEVYVAQPNEFVNFEYHQHVYKLNKALYGLKQAPRAWYERLTIYLRCKGYSKGEADKT
LFIHRTNDQLIVAQIYVDDVIFEDFPQDLVNNFIDVMKSEFEMSMADPRTSHLEVVKRILKYVHGTSDFRILYSYDTTSILVGYCDVDGQVLLMIGKASLEVVSVLGTIL
SHGSGLYMVKSTKDELAAQISSPSVQKVRGQRFKSTPPQRLYRLPSKKSQAKVSNKLPKQIIDFSYPPTLGTYAPNISETPLSDMDLDDLDDVPLAQLLKKTNVLEVTVE
MPASPSMSVHSQESSSTEGVFVPTSGIRHTSNFQPGQLVHSSPSASLPFEPDVAHASVLGNISVVPEGRTDVRSDENEVDPPNLDIRFEEVLTNFDDNLTVPLGSPKIPV
APKLAKRKSQQN