; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

IVF0006317 (gene) of Melon (IVF77) v1 genome

Gene IDIVF0006317
OrganismCucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
DescriptionGag-pol polyprotein
Genome locationchr06:4707810..4717998
RNA-Seq ExpressionIVF0006317
SyntenyIVF0006317
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR036875 - Zinc finger, CCHC-type superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0035966.1 F5J5.1 [Cucumis melo var. makuwa]0.046.89Show/hide
Query:  MIFFIKTLDGKAWRVLVAGYKPPMVIVDGVSVPKPEVDWTDAEEQASVGNARALNAIFNGVDLNVFKLINSCTTAKEAWKILEVAYEGTSKVKISRLQLI
        MIFFIKTL GKAWR LVAGY PP++ V+GVSVPKPEVDWT+AEEQASVGN RALNAIFNGVDLNVFKLINSC+TAKEAWK LEVAYEGTSKVKISRLQLI
Subjt:  MIFFIKTLDGKAWRVLVAGYKPPMVIVDGVSVPKPEVDWTDAEEQASVGNARALNAIFNGVDLNVFKLINSCTTAKEAWKILEVAYEGTSKVKISRLQLI

Query:  TSKLEALKMIEDESISEYNERVLEIAHDSLLLGEKISESKIVRKVLRSLPRKFDMKVTAIEEAQDITTLELDELFGSLLTFKMTISDRESKKDKGVAFKS
        TSK EAL+M EDES+S+YN+RVL+IA++SLLLGEKI +SKIVRKVLRS+ RKFDMKVTAIEEA DITTL+LDELFGSLLTF+M  +DRESKK KG++FKS
Subjt:  TSKLEALKMIEDESISEYNERVLEIAHDSLLLGEKISESKIVRKVLRSLPRKFDMKVTAIEEAQDITTLELDELFGSLLTFKMTISDRESKKDKGVAFKS

Query:  AYEQKTPVNQSDNEVNPDESIALLTKQCSKMEVQKYEYGWTNCKTWKKNGENFTRKADELSNRRNDNYGKKKEEVRRSFRCRECEGFGHYQVECPMFLRR
         +  K      D + N DESIALLTKQ            +TN                 L N +N N                         ECP FLR+
Subjt:  AYEQKTPVNQSDNEVNPDESIALLTKQCSKMEVQKYEYGWTNCKTWKKNGENFTRKADELSNRRNDNYGKKKEEVRRSFRCRECEGFGHYQVECPMFLRR

Query:  QKKNYYATLLDEDSNDNEVDHG-LNAFTTCITEINLD-NSECFDKDEDEDLTFEELKML--SHSEARAIQKERIQDLMEENERLMGVISSLKIKLKEVQG
        QKKN+  TL DE+S D+  D G +NAF   IT+ N D NSEC  + ++++L+ E+LK L     EAR IQKE IQDL+EENE LM VISSLK+KL+EVQ 
Subjt:  QKKNYYATLLDEDSNDNEVDHG-LNAFTTCITEINLD-NSECFDKDEDEDLTFEELKML--SHSEARAIQKERIQDLMEENERLMGVISSLKIKLKEVQG

Query:  KYNQTIKSVKMLNSGTENLDSILNSRQNSSSKYGLGFDASIRNVKSTSKVKFVPAAVKAKTETTCTIAISNPSTRSFRWTCYYCGRKGHIRPFCYKLQRD
        + +Q +KSVKMLNSG +NLDSIL +  N S +YGLGF +S  + K+TS++KFVPA+++ + +T         S +S   T YYCG+KGHIR  CYKL++D
Subjt:  KYNQTIKSVKMLNSGTENLDSILNSRQNSSSKYGLGFDASIRNVKSTSKVKFVPAAVKAKTETTCTIAISNPSTRSFRWTCYYCGRKGHIRPFCYKLQRD

Query:  KRYQQKAEFESHKNKSSFAKHNRSIRKTHMVLRVKTSDNCKVAFTTIQTTNDAWYFDSGCSRHMTGDRSFFSKLKECASRYVTFGNGAKGRIIAKENIDK
        +  QQK               NRS  +  MV R+K  + CK+AFT++QT +D WYFDSGCSRHMTG+RS+F  L +C   +VTFG+GAKG+IIAK NI+K
Subjt:  KRYQQKAEFESHKNKSSFAKHNRSIRKTHMVLRVKTSDNCKVAFTTIQTTNDAWYFDSGCSRHMTGDRSFFSKLKECASRYVTFGNGAKGRIIAKENIDK

Query:  NNLPCLSDVRYVDRLKGILIS-------------------------------------------------------------------------------
        ++LP L+DVRYVD LK  LIS                                                                               
Subjt:  NNLPCLSDVRYVDRLKGILIS-------------------------------------------------------------------------------

Query:  ------------------------------------------KLLTQRVTFTTGLLLDLERL-----------------------------------SLM
                                                  K  T  V     L L L+R                                    S+M
Subjt:  ------------------------------------------KLLTQRVTFTTGLLLDLERL-----------------------------------SLM

Query:  ETINVVDDETVNIPVDNSLCPVEVPKTDALIDDASMNSKKISKEVIADNPEFVPSAHVKKNHLSSSIIGDPVAGIITRKKEKVDYLKMIADLCYTSDIEP
        ETINVV ++     +D+S+  +   + D   + + + +  I +E  ADN    P+         SSIIGDP  G+ TR+K+K+DYLKM+ADL Y   IEP
Subjt:  ETINVVDDETVNIPVDNSLCPVEVPKTDALIDDASMNSKKISKEVIADNPEFVPSAHVKKNHLSSSIIGDPVAGIITRKKEKVDYLKMIADLCYTSDIEP

Query:  STVDIALKDEYWINAMQEELLQFRQNNVWTLV-------EGVNFDETFAPVARLEAIRLLLRVSYIQNFKLYQMDVKSAFLNRYLNEEVYVTQPKGFVDS
        STVD  +KDEYW+NAMQEELLQFR+NN+WTLV       EGV+FDETFA VARLEAIRLLL +S IQ FKLYQMDVKSAFLN YLN EVYV QPKGFVD 
Subjt:  STVDIALKDEYWINAMQEELLQFRQNNVWTLV-------EGVNFDETFAPVARLEAIRLLLRVSYIQNFKLYQMDVKSAFLNRYLNEEVYVTQPKGFVDS

Query:  EHPQHVYTLNKALYGLKQAPKA------------------------------------------------------------------------------
        EHP+HVY LNKALYGLKQAP+A                                                                              
Subjt:  EHPQHVYTLNKALYGLKQAPKA------------------------------------------------------------------------------

Query:  -------------------------CQPDITYVVGICARYQADPQTSHLEVVKRILKYVHGTSNFEIMYSYDTTYNLVGYCDVDWAGSSDDRK-------
                                  + DI Y VGICARYQ DP+ +HLEVVKRILKYVHG S+F +MYSY+TT  LVGY DVDWAGS+DDRK       
Subjt:  -------------------------CQPDITYVVGICARYQADPQTSHLEVVKRILKYVHGTSNFEIMYSYDTTYNLVGYCDVDWAGSSDDRK-------

Query:  -STFKGTHTPNISMTPLSNMDSDDLDNVPLAHLLKKTNVPEVTDEI-PVAPSVSAH-----------------------SQESSSTEGVFFPTLSIAPAS
           FK T        P   +  +      L   L+   +P+V +   PV+P+V AH                       SQESSSTEGVF PTL     S
Subjt:  -STFKGTHTPNISMTPLSNMDSDDLDNVPLAHLLKKTNVPEVTDEI-PVAPSVSAH-----------------------SQESSSTEGVFFPTLSIAPAS

Query:  NVQP-GSSVCSPPSVSL--AFAPDDVHAYVLDNVPSDVSTTSEGQTDVQSNENEVDPSNPAVCADEVPTNADDSLAVPPSS---SEVPVALKPVKRKSQQ
           P G S    PS+S      PD V A++L+   +  S           N+ E+ P       +++    DD +A  PSS   ++ P   KP KRK+QQ
Subjt:  NVQP-GSSVCSPPSVSL--AFAPDDVHAYVLDNVPSDVSTTSEGQTDVQSNENEVDPSNPAVCADEVPTNADDSLAVPPSS---SEVPVALKPVKRKSQQ

Query:  NRRNKTTKIGRKKVPPNIPSVPIDGILFHHEENVQCWKFVVQRWLADEVNVSDKHQSCMSIMDLIERAGLAKTILNVGPFYPQLIREFIINLPNEFSDPS
         RRN TTKIGRKK+P N+PSVPIDGI FHH+E+VQ WKFV++R + DE                                   LI+EFI+NLP+EF+DPS
Subjt:  NRRNKTTKIGRKKVPPNIPSVPIDGILFHHEENVQCWKFVVQRWLADEVNVSDKHQSCMSIMDLIERAGLAKTILNVGPFYPQLIREFIINLPNEFSDPS

Query:  --------------------------NVVDIDCSPSSPSTDVLASVLSGGTLSTWPVNGIFAVALSIKYAIMHKIGIANWFPSSHASNVSTALGTFLYQI
                                  N VDIDCSPS  + ++LA+VLS GTLSTWPVNGI A ALS+KYAI+HKIGI NWFPS HAS++S ALGTFLYQI
Subjt:  --------------------------NVVDIDCSPSSPSTDVLASVLSGGTLSTWPVNGIFAVALSIKYAIMHKIGIANWFPSSHASNVSTALGTFLYQI

Query:  CNDNKVDTCAFIYNQQLRHVGSFRVKVSIALPRFFSSLLLHLNGAVLTASDAPRPNPKTLSLSYRLFQGSHVPDIDHDVHLSCGSCIFDRSDWDESDEGF
        CND+KVDT  FIYNQ LRHVGSF VKV IA PR FSSLLLHLNG VLT SDAP P PKT++L YRLFQGSH+PDIDHDVH + G  IFD +DWD+S EGF
Subjt:  CNDNKVDTCAFIYNQQLRHVGSFRVKVSIALPRFFSSLLLHLNGAVLTASDAPRPNPKTLSLSYRLFQGSHVPDIDHDVHLSCGSCIFDRSDWDESDEGF

Query:  FVDRKLASCIVNSLTAESRALSTSINLLSERCLEVDSSHSSFKDFAPFTNRRE
        +V+R+LA+ I+NSLTAESRA++ SI LLSER LEVD+     K   P T+R++
Subjt:  FVDRKLASCIVNSLTAESRALSTSINLLSERCLEVDSSHSSFKDFAPFTNRRE

KAA0036824.1 uncharacterized protein E6C27_scaffold20G001240 [Cucumis melo var. makuwa]0.043.72Show/hide
Query:  MEIIREGPSASRPPILDGKNYSYWKPCMIFFIKTLDGKAWRVLVAGYKPPMVIVDGVSVPKPEVDWTDAEEQASVGNARALNAIFNGVDLNVFKLINSCT
        MEII+EGPSASRPP+L+GKNYSYWKP MIFFIKTLDGKAWR L+AGY PPM  V+GVSVPKPEVDWTDAEEQA VGNARALNAIFNGV+LN+ KLIN C+
Subjt:  MEIIREGPSASRPPILDGKNYSYWKPCMIFFIKTLDGKAWRVLVAGYKPPMVIVDGVSVPKPEVDWTDAEEQASVGNARALNAIFNGVDLNVFKLINSCT

Query:  TAKEAWKILEVAYEGTSKVKISRLQLITSKLEALKMIEDESISEYNERVLEIAHDSLLLGEKISESKIVRKVLRSLPRKFDMKVTAIEEAQDITTLELDE
         AKEAWK LEVAYE                           +S+YN+RVLEI ++SLLLGEKI + KIVRKVLRSLPRKFD+KVTAIEEA DI TL+LDE
Subjt:  TAKEAWKILEVAYEGTSKVKISRLQLITSKLEALKMIEDESISEYNERVLEIAHDSLLLGEKISESKIVRKVLRSLPRKFDMKVTAIEEAQDITTLELDE

Query:  LFGSLLTFKMTISDRESKKDKGVAFKSAYEQKTPVNQSDNEVNPDESIALLTKQCSKMEVQKYEYGWTNCKTWKKNGENFTRKADELSNRRNDNYGKKKE
        LFGSLLTFKMTI+DRESKK KG+AFKS +                                                                       
Subjt:  LFGSLLTFKMTISDRESKKDKGVAFKSAYEQKTPVNQSDNEVNPDESIALLTKQCSKMEVQKYEYGWTNCKTWKKNGENFTRKADELSNRRNDNYGKKKE

Query:  EVRRSFRCRECEGFGHYQVECPMFLRRQKKNYYATLLDEDSNDN-EVDHGLNAFTTCITEINLDN-SECFDKDEDEDLTFEELKMLSHSEARAIQKERIQ
                                            +DE+S D+ + D  +N FT  IT+ N D+ SEC  + ++++LT E+L+        A+ KE  +
Subjt:  EVRRSFRCRECEGFGHYQVECPMFLRRQKKNYYATLLDEDSNDN-EVDHGLNAFTTCITEINLDN-SECFDKDEDEDLTFEELKMLSHSEARAIQKERIQ

Query:  DLMEENERLMGVISSLKIKLKEVQGKYNQTIKSVKMLNSGTENLDSILNSRQNSSSKYGLGFDASIRNVKSTSKVKFVPAAVKAKTETTCTIAISNPSTR
                         +KL+EVQ + +Q +K VKMLNSGTENLDSIL S +N S ++GLGF  S   +K+TS++KFVPA++  + E   T      + +
Subjt:  DLMEENERLMGVISSLKIKLKEVQGKYNQTIKSVKMLNSGTENLDSILNSRQNSSSKYGLGFDASIRNVKSTSKVKFVPAAVKAKTETTCTIAISNPSTR

Query:  SFRWTCYYCGRKGHIRPFCYKLQRDKRYQQKAEFESHKNKSSFAKHNRSIRKTHMVLRVKTSDNCKVAFTTIQTTNDAWYFDSGCSRHMTGDRSFFSKLK
        S   TCYYC RKGHIR  CYKL++D+  QQK  +             RS  +  M L + T                        S     +R   S  K
Subjt:  SFRWTCYYCGRKGHIRPFCYKLQRDKRYQQKAEFESHKNKSSFAKHNRSIRKTHMVLRVKTSDNCKVAFTTIQTTNDAWYFDSGCSRHMTGDRSFFSKLK

Query:  ECASRYVTFGNGAKGRIIAKENIDKNNLPCLSDVRYVDRLKGILISKLLTQRVTFTTGLLLDLERLSLMETINVVDDETVNIPVDNSLCPVEVPKTDALI
        E    Y  F N   G ++   N+  N+L                                      S +  +N  +DET N+    +   VEV K     
Subjt:  ECASRYVTFGNGAKGRIIAKENIDKNNLPCLSDVRYVDRLKGILISKLLTQRVTFTTGLLLDLERLSLMETINVVDDETVNIPVDNSLCPVEVPKTDALI

Query:  DDASMNSKKISKEVIADNPEFVPSAHVKKNHLSSSIIGDPVAGIITRKKEKVDYLKMIADLCYTSDIEPSTVDIALKDEYWINAMQEELLQFRQNNVWTL
                       ADNP   P              GDP A + TR+KEK+DY+KM+ADLCY S IEPST D A KDEYW+NAMQEELLQFR+NNVWTL
Subjt:  DDASMNSKKISKEVIADNPEFVPSAHVKKNHLSSSIIGDPVAGIITRKKEKVDYLKMIADLCYTSDIEPSTVDIALKDEYWINAMQEELLQFRQNNVWTL

Query:  V----------------------------------------EGVNFDETFAPVARLEAIRLLLRVSYIQNFKLYQMDVKSAFLNRYLNEEVYVTQPKGFV
        V                                        EG+ FDETFAPVA+LEAIRLLL  ++++  K                            
Subjt:  V----------------------------------------EGVNFDETFAPVARLEAIRLLLRVSYIQNFKLYQMDVKSAFLNRYLNEEVYVTQPKGFV

Query:  DSEHPQHVYTLNKALYGLKQAPKACQPDITYVVGICARYQADPQTSHLEVVKRILKYVHGTSNFEIMYSYDTTYNLVGYCDVDWAGSSDDRKS-------
        D++  +  + L +++ G      A  PDI Y VGICARYQADP  S LEVVKRILKYVHGTS+F +MYSYDTT  LVGYCD DWAG +DDRK+       
Subjt:  DSEHPQHVYTLNKALYGLKQAPKACQPDITYVVGICARYQADPQTSHLEVVKRILKYVHGTSNFEIMYSYDTTYNLVGYCDVDWAGSSDDRKS-------

Query:  ----------------------------------------TFKGTHT-------PNISMTP---------------------------------------
                                                T KG++        PN+  +P                                       
Subjt:  ----------------------------------------TFKGTHT-------PNISMTP---------------------------------------

Query:  ------LSNMDSDDLDNVPLAHLLKKTNVPEVTDEIPVAPSVSAHSQESSSTEGVFFPT---LSIAPASNVQPGSSVCSPPSVSLAFAPDDVHAYVLDNV
              +  +DSDD D+VPL  LLKKT+ P + ++   A     HS      EGVF PT   L  +PA+      SV  P S SLA  PD    ++L N 
Subjt:  ------LSNMDSDDLDNVPLAHLLKKTNVPEVTDEIPVAPSVSAHSQESSSTEGVFFPT---LSIAPASNVQPGSSVCSPPSVSLAFAPDDVHAYVLDNV

Query:  PS----DVSTTSEGQTDVQSNENEVDPSNPAVCADEVPTNADDSLAVPPSSS---EVPVALKPVKRKSQQNRRNKTTKIGRKKVPPNIPSVPIDGILFHH
         +     +  +          ++++ P N       +P   DD  A  PS+    E P   +P KRK+QQ RRN TTK GRKK+  NIPSVPIDGI FHH
Subjt:  PS----DVSTTSEGQTDVQSNENEVDPSNPAVCADEVPTNADDSLAVPPSSS---EVPVALKPVKRKSQQNRRNKTTKIGRKKVPPNIPSVPIDGILFHH

Query:  EENVQCWKFVVQRWLADEVNVSDKHQSCMSIMDLIERAGLAKTILNVGPFYPQLIREFIINLPNEFSDPS--------------------------NVVD
        EENVQ WKFVVQR +ADEVN+S KHQSCMSIMDLI RAGL KTI NVGPFYPQLIREFI+NLP+EF++PS                          N +D
Subjt:  EENVQCWKFVVQRWLADEVNVSDKHQSCMSIMDLIERAGLAKTILNVGPFYPQLIREFIINLPNEFSDPS--------------------------NVVD

Query:  IDCSPSSPSTDVLASVLSGGTLSTWPVNGIFAVALSIKYAIMHKIGIANWFPSSHASNVSTALGTFLYQICNDNKVDTCAFIYNQQLRHVGSFRVKVSIA
        ID S S P T+VLA VLSGGTLSTWPVNGI   ALSIKY  +HKIGIANWFPSSHA ++STALG FLYQICND+KVDT AFIYNQ +RHVGSF VKV IA
Subjt:  IDCSPSSPSTDVLASVLSGGTLSTWPVNGIFAVALSIKYAIMHKIGIANWFPSSHASNVSTALGTFLYQICNDNKVDTCAFIYNQQLRHVGSFRVKVSIA

Query:  LPRFFSSLLLHLNGAVLTASDAPRPNPKTLSLSYRLFQGSHVPDIDHDVHLSCGSCIFDRSDWDESDEGFFVDRKLASCIVNSLTAESRALSTSINLLSE
        LPRFFSSLLLHLNGAVL A+DAP P PKT++ SY+LFQGSHV DI+HDVHL+    IFD +DWDES +GF VD +LA+ IVNSLTAESRAL+ SINLLSE
Subjt:  LPRFFSSLLLHLNGAVLTASDAPRPNPKTLSLSYRLFQGSHVPDIDHDVHLSCGSCIFDRSDWDESDEGFFVDRKLASCIVNSLTAESRALSTSINLLSE

Query:  RCLEVDSSHSSFKDFAPFTNRREN
        R LEVD+     K  AP T R+++
Subjt:  RCLEVDSSHSSFKDFAPFTNRREN

KAA0066164.1 gag-pol polyprotein [Cucumis melo var. makuwa]3.13e-31538.87Show/hide
Query:  MEIIREGPSASRPPILDGKNYSYWKPCMIFFIKTLDGKAWRVLVAGYKPPMVIVDGVSVPKPEVDWTDAEEQASVGNARALNAIFNGVDLNVFKLINSCT
        M+IIREGP  S PPILDGKNYSYWKP MIFFIKTLDGKAWR LVAGY PPM+ V+GVSVPKPEVDWT+AEEQASV NARALNAIFNGV+LNVFKLINSC+
Subjt:  MEIIREGPSASRPPILDGKNYSYWKPCMIFFIKTLDGKAWRVLVAGYKPPMVIVDGVSVPKPEVDWTDAEEQASVGNARALNAIFNGVDLNVFKLINSCT

Query:  TAKEAWKILEVAYEGTSKVKISRLQLITSKLEALKMIEDESISEYNERVLEIAHDSLLLGEKISESKIVRKVLRSLPRKFDMKVTAIEEAQDITTLELDE
        TAKEAWK L VA                                    VLEIA++SLLLGEKI +SKIVRKVLRSLPRKFDMKVTAIEEA DITTL+LDE
Subjt:  TAKEAWKILEVAYEGTSKVKISRLQLITSKLEALKMIEDESISEYNERVLEIAHDSLLLGEKISESKIVRKVLRSLPRKFDMKVTAIEEAQDITTLELDE

Query:  LFGSLLTFKMTISDRESKKDKGVAFKSAYEQKTPVNQSDNEVNPDESIALLTKQCSKMEVQKYEYGWTNCKTWKKNGENFTRKADELSNRRNDNYGKKKE
        LFGSLLTF+M  +DRESKK KG+AFKS              +  DE            E + + Y       W  +G +                     
Subjt:  LFGSLLTFKMTISDRESKKDKGVAFKSAYEQKTPVNQSDNEVNPDESIALLTKQCSKMEVQKYEYGWTNCKTWKKNGENFTRKADELSNRRNDNYGKKKE

Query:  EVRRSFRCRECEGFGHYQVECPMFLRRQKKNYYATLLDEDSNDN-EVDHGLNAFTTCITEINLDN-SECFDKDEDEDLTFEELKML--SHSEARAIQKER
                             P FLR+QK N+  TL DE+S D+ + D  +NAFT  IT+ N D  S+C ++ + ++LT E+L+ L   + EARA     
Subjt:  EVRRSFRCRECEGFGHYQVECPMFLRRQKKNYYATLLDEDSNDN-EVDHGLNAFTTCITEINLDN-SECFDKDEDEDLTFEELKML--SHSEARAIQKER

Query:  IQDLMEENERLMGVISSLKIKLKEVQGKYNQTIKSVKMLNSGTENLDSILNSRQNSSSKYGLGFDASIRNVKSTSKVKFVPAAVKAKTETTCTIAISNPS
          DL                   ++  K N ++   ++                               NVK                            
Subjt:  IQDLMEENERLMGVISSLKIKLKEVQGKYNQTIKSVKMLNSGTENLDSILNSRQNSSSKYGLGFDASIRNVKSTSKVKFVPAAVKAKTETTCTIAISNPS

Query:  TRSFRWTCYYCGRKGHIRPFCYKLQRDKRYQQKAEFESHKNKSSFAKHNRSIRKTHMVLRVKTSDNCKVAFTTIQTTNDAWYFDSGCSRHMTGDRSFFSK
                                                         R I +    L +K     +V     +TT+DAWYFDSGCSRHMTG+RS+F+ 
Subjt:  TRSFRWTCYYCGRKGHIRPFCYKLQRDKRYQQKAEFESHKNKSSFAKHNRSIRKTHMVLRVKTSDNCKVAFTTIQTTNDAWYFDSGCSRHMTGDRSFFSK

Query:  LKECASRYVTFGNGAKGRIIAKENIDKNNLPCLSDVRYVDRLKGILISKLLTQRV------------TFTTGLLLDLERLS------------LMETINV
        LK+C + +VTF NGAKG+IIAK NI+ NNLP L+D+R ++ L   L+  +  + +                G+ L   + S            +METINV
Subjt:  LKECASRYVTFGNGAKGRIIAKENIDKNNLPCLSDVRYVDRLKGILISKLLTQRV------------TFTTGLLLDLERLS------------LMETINV

Query:  V--------------DDETVNIPVDNSLCPVEVPKTDALIDDASMNSKKISKEVIADNPEFVPSAHVKKNHLSSSIIGDPVAGIITRKKEKVDYLKMIAD
        V              +DET N+    +   VE PK     DD+  + +K SKE I    E + SAHVKKNH +SSIIGDP  G+ TR+KEK+DY+KM+AD
Subjt:  V--------------DDETVNIPVDNSLCPVEVPKTDALIDDASMNSKKISKEVIADNPEFVPSAHVKKNHLSSSIIGDPVAGIITRKKEKVDYLKMIAD

Query:  LCYTSDIEPSTVDIALKDEYWINAMQEELLQFRQNNVWTLV----------------------------------------EGVNFDETFAPVARLEAIR
        LCY S +EPSTVD AL+DEYW+NAMQEELLQFRQNNVWTLV                                        EG++FDETFA VARLEAIR
Subjt:  LCYTSDIEPSTVDIALKDEYWINAMQEELLQFRQNNVWTLV----------------------------------------EGVNFDETFAPVARLEAIR

Query:  LLLRVSYIQNFKLYQMDVKSAFLNRYLNEEVYVTQPKGFVDSEHPQHVYTLNKALYGLKQAPKACQPDIT------------------------------
        LLL +S IQ FKLYQMDVKSAFL+ YLNEEVYV QPKGFVDSEHP+H+Y LNKALYGLKQA +A    +T                              
Subjt:  LLLRVSYIQNFKLYQMDVKSAFLNRYLNEEVYVTQPKGFVDSEHPQHVYTLNKALYGLKQAPKACQPDIT------------------------------

Query:  -YVVGICAR----------------------------------------------------------------------------------YQ--ADPQT
         YV  I  R                                                                                  Y+  ADP+ 
Subjt:  -YVVGICAR----------------------------------------------------------------------------------YQ--ADPQT

Query:  SHLEVVKRILKYVHGTSNFEIMYSYDTTYNLVGYCDVDWAGSSDDRKSTFK---------------------------------GTHTPNISMTPLSNMD
        +HLE VKRILKYVHGTS+F +MYSYDTT  LVGYCD +WAGS+DD K+  K                                       IS  P+ +  
Subjt:  SHLEVVKRILKYVHGTSNFEIMYSYDTTYNLVGYCDVDWAGSSDDRKSTFK---------------------------------GTHTPNISMTPLSNMD

Query:  SDDLD----------------------NVPLAHL------------------------------------------------------------------
        +  +D                      N+ LA +                                                                  
Subjt:  SDDLD----------------------NVPLAHL------------------------------------------------------------------

Query:  -LKKTNVPEV-----------------TDEIPVAPSVSAHSQESSSTEGVFFPTLSIAPASNVQPGSSVCSPPSVSLAFAPDDVHAYVLDNVPSDVSTTS
         L+   VPEV                 T+ +P  PS S HSQESSS EGVF PT          PG    SP   S+             ++P   +   
Subjt:  -LKKTNVPEV-----------------TDEIPVAPSVSAHSQESSSTEGVFFPTLSIAPASNVQPGSSVCSPPSVSLAFAPDDVHAYVLDNVPSDVSTTS

Query:  EGQTDVQSNENEVDPSNP-AVCADEVPTNADDSLAVPPSSSEVPVALKPVKRKSQQNRRNKTTKIGRKKVPPNIPSVPIDGILFHHEENVQCWKFVVQRW
        E QTD   N++     N   +  +++P   DD +A  PSS   P + K                   +K        P+  I FHHE++VQ WKFV+QR 
Subjt:  EGQTDVQSNENEVDPSNP-AVCADEVPTNADDSLAVPPSSSEVPVALKPVKRKSQQNRRNKTTKIGRKKVPPNIPSVPIDGILFHHEENVQCWKFVVQRW

Query:  LADEVNVSDKHQSCMSIMDLIERAGLAKTILNVGPFYPQLIREFIINLPNEFSDPS--------------------------NVVDIDCSPSSPSTDVLA
        +A+E                                   LIR+FI+NLP++F+DPS                          N VDIDCSPS P+T+VLA
Subjt:  LADEVNVSDKHQSCMSIMDLIERAGLAKTILNVGPFYPQLIREFIINLPNEFSDPS--------------------------NVVDIDCSPSSPSTDVLA

Query:  SVLSGGTLSTWPVNGIFAVALSIKYAIMHKIGIANWFPSSHASNVSTALGTFLYQICNDNKVDTCAFIYNQQLRHVGSFRVKVSIALPRFFSSLLLHLNG
        +VLSG TLSTW VN I A ALS+KYAI+HKI IANWF SSHA ++  ALGTFLYQ CND+KVDT  FIYNQ LRHVGSF VKV IALP+ FSSLLLHLN 
Subjt:  SVLSGGTLSTWPVNGIFAVALSIKYAIMHKIGIANWFPSSHASNVSTALGTFLYQICNDNKVDTCAFIYNQQLRHVGSFRVKVSIALPRFFSSLLLHLNG

Query:  AVLTASDAPRPNPKTLSLSYRLFQGSHVPDIDHDVHLSCGSCIFDRSDWDESDEGFFVDRKLASCIVNSLTAESRALSTSINLLSERCLEVDS
         VLTA+DA  P PKT++LSYRLFQGSHVPDIDHDVH + G  IF+ +DWDES EGFFVDR+LA+ IVNSLT ES AL+TSI LL+ER  EVD+
Subjt:  AVLTASDAPRPNPKTLSLSYRLFQGSHVPDIDHDVHLSCGSCIFDRSDWDESDEGFFVDRKLASCIVNSLTAESRALSTSINLLSERCLEVDS

TYK30437.1 F5J5.1 [Cucumis melo var. makuwa]0.046.78Show/hide
Query:  MIFFIKTLDGKAWRVLVAGYKPPMVIVDGVSVPKPEVDWTDAEEQASVGNARALNAIFNGVDLNVFKLINSCTTAKEAWKILEVAYEGTSKVKISRLQLI
        MIFFIKTL GKAWR LVAGY PP++ V+GVSVPKPEVDWT+AEEQASVGN RALNAIFNGVDLNVFKLINSC+TAKEAWK LEVAYEGTSKVKISRLQLI
Subjt:  MIFFIKTLDGKAWRVLVAGYKPPMVIVDGVSVPKPEVDWTDAEEQASVGNARALNAIFNGVDLNVFKLINSCTTAKEAWKILEVAYEGTSKVKISRLQLI

Query:  TSKLEALKMIEDESISEYNERVLEIAHDSLLLGEKISESKIVRKVLRSLPRKFDMKVTAIEEAQDITTLELDELFGSLLTFKMTISDRESKKDKGVAFKS
        TSK EAL+M EDES+S+YN+RVL+IA++SLLLGEKI +SKIVRKVLRS+ RKFDMKVTAIEEA DITTL+LDELFGSLLTF+M  +DRESKK KG++FKS
Subjt:  TSKLEALKMIEDESISEYNERVLEIAHDSLLLGEKISESKIVRKVLRSLPRKFDMKVTAIEEAQDITTLELDELFGSLLTFKMTISDRESKKDKGVAFKS

Query:  AYEQKTPVNQSDNEVNPDESIALLTKQCSKMEVQKYEYGWTNCKTWKKNGENFTRKADELSNRRNDNYGKKKEEVRRSFRCRECEGFGHYQVECPMFLRR
         +  K      D + N DESIALLTKQ            +TN                 L N +N N                         ECP FLR+
Subjt:  AYEQKTPVNQSDNEVNPDESIALLTKQCSKMEVQKYEYGWTNCKTWKKNGENFTRKADELSNRRNDNYGKKKEEVRRSFRCRECEGFGHYQVECPMFLRR

Query:  QKKNYYATLLDEDSNDNEVDHG-LNAFTTCITEINLD-NSECFDKDEDEDLTFEELKML--SHSEARAIQKERIQDLMEENERLMGVISSLKIKLKEVQG
        QKKN+  TL DE+S D+  D G +NAF   IT+ N D NSEC  + ++++L+ E+LK L     EAR IQKE IQDL+EENE LM VISSLK+KL+EVQ 
Subjt:  QKKNYYATLLDEDSNDNEVDHG-LNAFTTCITEINLD-NSECFDKDEDEDLTFEELKML--SHSEARAIQKERIQDLMEENERLMGVISSLKIKLKEVQG

Query:  KYNQTIKSVKMLNSGTENLDSILNSRQNSSSKYGLGFDASIRNVKSTSKVKFVPAAVKAKTETTCTIAISNPSTRSFRWTCYYCGRKGHIRPFCYKLQRD
        + +Q +KSVKMLNSG +NLDSIL +  N S +YGLGF +S  + K+TS++KFVPA+++ + +T         S +S   T YYCG+KGHIR  CYKL++D
Subjt:  KYNQTIKSVKMLNSGTENLDSILNSRQNSSSKYGLGFDASIRNVKSTSKVKFVPAAVKAKTETTCTIAISNPSTRSFRWTCYYCGRKGHIRPFCYKLQRD

Query:  KRYQQKAEFESHKNKSSFAKHNRSIRKTHMVLRVKTSDNCKVAFTTIQTTNDAWYFDSGCSRHMTGDRSFFSKLKECASRYVTFGNGAKGRIIAKENIDK
        +  QQK               NRS  +  MV R+K  + CK+AFT++QT +D WYFDSGCSRHMTG+RS+F  L +C   +VTFG+GAKG+IIAK NI+K
Subjt:  KRYQQKAEFESHKNKSSFAKHNRSIRKTHMVLRVKTSDNCKVAFTTIQTTNDAWYFDSGCSRHMTGDRSFFSKLKECASRYVTFGNGAKGRIIAKENIDK

Query:  NNLPCLSDVRYVDRLKGILIS-------------------------------------------------------------------------------
        ++LP L+DVRYVD LK  LIS                                                                               
Subjt:  NNLPCLSDVRYVDRLKGILIS-------------------------------------------------------------------------------

Query:  ------------------------------------------KLLTQRVTFTTGLLLDLERL-----------------------------------SLM
                                                  K  T  V     L L L+R                                    S+M
Subjt:  ------------------------------------------KLLTQRVTFTTGLLLDLERL-----------------------------------SLM

Query:  ETINVVDDETVNIPVDNSLCPVEVPKTDALIDDASMNSKKISKEVIADNPEFVPSAHVKKNHLSSSIIGDPVAGIITRKKEKVDYLKMIADLCYTSDIEP
        ETINVV ++     +D+S+  +   + D   + + + +  I +E  ADN    P+         SSIIGDP  G+ TR+K+K+DYLKM+ADL Y   IEP
Subjt:  ETINVVDDETVNIPVDNSLCPVEVPKTDALIDDASMNSKKISKEVIADNPEFVPSAHVKKNHLSSSIIGDPVAGIITRKKEKVDYLKMIADLCYTSDIEP

Query:  STVDIALKDEYWINAMQEELLQFRQNNVWTLV-------EGVNFDETFAPVARLEAIRLLLRVSYIQNFKLYQMDVKSAFLNRYLNEEVYVTQPKGFVDS
        STVD  +KDEYW+NAMQEELLQFR+NN+WTLV       EGV+FDETFA VARLEAIRLLL +S IQ FKLYQMDVKSAFLN YLN EVYV QPKGFVD 
Subjt:  STVDIALKDEYWINAMQEELLQFRQNNVWTLV-------EGVNFDETFAPVARLEAIRLLLRVSYIQNFKLYQMDVKSAFLNRYLNEEVYVTQPKGFVDS

Query:  EHPQHVYTLNKALYGLKQAPKA------------------------------------------------------------------------------
        EHP+HVY LNKALYGLKQAP+A                                                                              
Subjt:  EHPQHVYTLNKALYGLKQAPKA------------------------------------------------------------------------------

Query:  -------------------------CQPDITYVVGICARYQADPQTSHLEVVKRILKYVHGTSNFEIMYSYDTTYNLVGYCDVDWAGSSDDRK-------
                                  + DI Y VGICARYQ DP+ +HLEVVKRILKYVHG S+F +MYSY+TT  LVGY DVDWAGS+DDRK       
Subjt:  -------------------------CQPDITYVVGICARYQADPQTSHLEVVKRILKYVHGTSNFEIMYSYDTTYNLVGYCDVDWAGSSDDRK-------

Query:  -STFKGTHTPNISMTPLSNMDSDDLDNVPLAHLLKKTNVPEVTDEI-PVAPSVSAH-----------------------SQESSSTEGVFFPTLSIAPAS
           FK T        P   +  +      L   L+   +P+V +   PV+P+V AH                       SQESSSTEGVF PTL     S
Subjt:  -STFKGTHTPNISMTPLSNMDSDDLDNVPLAHLLKKTNVPEVTDEI-PVAPSVSAH-----------------------SQESSSTEGVFFPTLSIAPAS

Query:  NVQP-GSSVCSPPSVSL--AFAPDDVHAYVLDNVPSDVSTTSEGQTDVQSNENEVDPSNPAVCADEVPTNADDSLAVPPSS---SEVPVALKPVKRKSQQ
           P G S    PS+S      PD V A++L+     ++T +E                  +  +++    DD +A  PSS   ++ P   KP KRK+QQ
Subjt:  NVQP-GSSVCSPPSVSL--AFAPDDVHAYVLDNVPSDVSTTSEGQTDVQSNENEVDPSNPAVCADEVPTNADDSLAVPPSS---SEVPVALKPVKRKSQQ

Query:  NRRNKTTKIGRKKVPPNIPSVPIDGILFHHEENVQCWKFVVQRWLADEVNVSDKHQSCMSIMDLIERAGLAKTILNVGPFYPQLIREFIINLPNEFSDPS
         RRN TTKIGRKK+P N+PSVPIDGI FHH+E+VQ WKFV++R + DE                                   LI+EFI+NLP+EF+DPS
Subjt:  NRRNKTTKIGRKKVPPNIPSVPIDGILFHHEENVQCWKFVVQRWLADEVNVSDKHQSCMSIMDLIERAGLAKTILNVGPFYPQLIREFIINLPNEFSDPS

Query:  --------------------------NVVDIDCSPSSPSTDVLASVLSGGTLSTWPVNGIFAVALSIKYAIMHKIGIANWFPSSHASNVSTALGTFLYQI
                                  N VDIDCSPS  + ++LA+VLS GTLSTWPVNGI A ALS+KYAI+HKIGI NWFPS HAS++S ALGTFLYQI
Subjt:  --------------------------NVVDIDCSPSSPSTDVLASVLSGGTLSTWPVNGIFAVALSIKYAIMHKIGIANWFPSSHASNVSTALGTFLYQI

Query:  CNDNKVDTCAFIYNQQLRHVGSFRVKVSIALPRFFSSLLLHLNGAVLTASDAPRPNPKTLSLSYRLFQGSHVPDIDHDVHLSCGSCIFDRSDWDESDEGF
        CND+KVDT  FIYNQ LRHVGSF VKV IA PR FSSLLLHLNG VLT SDAP P PKT++L YRLFQGSH+PDIDHDVH + G  IFD +DWD+S EGF
Subjt:  CNDNKVDTCAFIYNQQLRHVGSFRVKVSIALPRFFSSLLLHLNGAVLTASDAPRPNPKTLSLSYRLFQGSHVPDIDHDVHLSCGSCIFDRSDWDESDEGF

Query:  FVDRKLASCIVNSLTAESRALSTSINLLSERCLEVDSSHSSFKDFAPFTNRRE
        +V+R+LA+ I+NSLTAESRA++ SI LLSER LEVD+     K   P T+R++
Subjt:  FVDRKLASCIVNSLTAESRALSTSINLLSERCLEVDSSHSSFKDFAPFTNRRE

XP_008456227.1 PREDICTED: uncharacterized protein LOC103496232 [Cucumis melo]0.099.79Show/hide
Query:  MDSDDLDNVPLAHLLKKTNVPEVTDEIPVAPSVSAHSQESSSTEGVFFPTLSIAPASNVQPGSSVCSPPSVSLAFAPDDVHAYVLDNVPSDVSTTSEGQT
        MDSDDLDNVPLAHLLKKTNVPEVTDEI VAPSVSAHSQESSSTEGVFFPTLSIAPASNVQPGSSVCSPPSVSLAFAPDDVHAYVLDNVPSDVSTTSEGQT
Subjt:  MDSDDLDNVPLAHLLKKTNVPEVTDEIPVAPSVSAHSQESSSTEGVFFPTLSIAPASNVQPGSSVCSPPSVSLAFAPDDVHAYVLDNVPSDVSTTSEGQT

Query:  DVQSNENEVDPSNPAVCADEVPTNADDSLAVPPSSSEVPVALKPVKRKSQQNRRNKTTKIGRKKVPPNIPSVPIDGILFHHEENVQCWKFVVQRWLADEV
        DVQSNENEVDPSNPAVCADEVPTNADDSLAVPPSSSEVPVALKPVKRKSQQNRRNKTTKIGRKKVPPNIPSVPIDGILFHHEENVQCWKFVVQRWLADEV
Subjt:  DVQSNENEVDPSNPAVCADEVPTNADDSLAVPPSSSEVPVALKPVKRKSQQNRRNKTTKIGRKKVPPNIPSVPIDGILFHHEENVQCWKFVVQRWLADEV

Query:  NVSDKHQSCMSIMDLIERAGLAKTILNVGPFYPQLIREFIINLPNEFSDPSNVVDIDCSPSSPSTDVLASVLSGGTLSTWPVNGIFAVALSIKYAIMHKI
        NVSDKHQSCMSIMDLIERAGLAKTILNVGPFYPQLIREFIINLPNEFSDPSNVVDIDCSPSSPSTDVLASVLSGGTLSTWPVNGIFAVALSIKYAIMHKI
Subjt:  NVSDKHQSCMSIMDLIERAGLAKTILNVGPFYPQLIREFIINLPNEFSDPSNVVDIDCSPSSPSTDVLASVLSGGTLSTWPVNGIFAVALSIKYAIMHKI

Query:  GIANWFPSSHASNVSTALGTFLYQICNDNKVDTCAFIYNQQLRHVGSFRVKVSIALPRFFSSLLLHLNGAVLTASDAPRPNPKTLSLSYRLFQGSHVPDI
        GIANWFPSSHASNVSTALGTFLYQICNDNKVDTCAFIYNQQLRHVGSFRVKVSIALPRFFSSLLLHLNGAVLTASDAPRPNPKTLSLSYRLFQGSHVPDI
Subjt:  GIANWFPSSHASNVSTALGTFLYQICNDNKVDTCAFIYNQQLRHVGSFRVKVSIALPRFFSSLLLHLNGAVLTASDAPRPNPKTLSLSYRLFQGSHVPDI

Query:  DHDVHLSCGSCIFDRSDWDESDEGFFVDRKLASCIVNSLTAESRALSTSINLLSERCLEVDSSHSSFKDFAPFTNRRENG
        DHDVHLSCGSCIFDRSDWDESDEGFFVDRKLASCIVNSLTAESRALSTSINLLSERCLEVDSSHSSFKDFAPFTNRRENG
Subjt:  DHDVHLSCGSCIFDRSDWDESDEGFFVDRKLASCIVNSLTAESRALSTSINLLSERCLEVDSSHSSFKDFAPFTNRRENG

TrEMBL top hitse value%identityAlignment
A0A1S3C405 uncharacterized protein LOC1034962324.1e-27299.79Show/hide
Query:  MDSDDLDNVPLAHLLKKTNVPEVTDEIPVAPSVSAHSQESSSTEGVFFPTLSIAPASNVQPGSSVCSPPSVSLAFAPDDVHAYVLDNVPSDVSTTSEGQT
        MDSDDLDNVPLAHLLKKTNVPEVTDEI VAPSVSAHSQESSSTEGVFFPTLSIAPASNVQPGSSVCSPPSVSLAFAPDDVHAYVLDNVPSDVSTTSEGQT
Subjt:  MDSDDLDNVPLAHLLKKTNVPEVTDEIPVAPSVSAHSQESSSTEGVFFPTLSIAPASNVQPGSSVCSPPSVSLAFAPDDVHAYVLDNVPSDVSTTSEGQT

Query:  DVQSNENEVDPSNPAVCADEVPTNADDSLAVPPSSSEVPVALKPVKRKSQQNRRNKTTKIGRKKVPPNIPSVPIDGILFHHEENVQCWKFVVQRWLADEV
        DVQSNENEVDPSNPAVCADEVPTNADDSLAVPPSSSEVPVALKPVKRKSQQNRRNKTTKIGRKKVPPNIPSVPIDGILFHHEENVQCWKFVVQRWLADEV
Subjt:  DVQSNENEVDPSNPAVCADEVPTNADDSLAVPPSSSEVPVALKPVKRKSQQNRRNKTTKIGRKKVPPNIPSVPIDGILFHHEENVQCWKFVVQRWLADEV

Query:  NVSDKHQSCMSIMDLIERAGLAKTILNVGPFYPQLIREFIINLPNEFSDPSNVVDIDCSPSSPSTDVLASVLSGGTLSTWPVNGIFAVALSIKYAIMHKI
        NVSDKHQSCMSIMDLIERAGLAKTILNVGPFYPQLIREFIINLPNEFSDPSNVVDIDCSPSSPSTDVLASVLSGGTLSTWPVNGIFAVALSIKYAIMHKI
Subjt:  NVSDKHQSCMSIMDLIERAGLAKTILNVGPFYPQLIREFIINLPNEFSDPSNVVDIDCSPSSPSTDVLASVLSGGTLSTWPVNGIFAVALSIKYAIMHKI

Query:  GIANWFPSSHASNVSTALGTFLYQICNDNKVDTCAFIYNQQLRHVGSFRVKVSIALPRFFSSLLLHLNGAVLTASDAPRPNPKTLSLSYRLFQGSHVPDI
        GIANWFPSSHASNVSTALGTFLYQICNDNKVDTCAFIYNQQLRHVGSFRVKVSIALPRFFSSLLLHLNGAVLTASDAPRPNPKTLSLSYRLFQGSHVPDI
Subjt:  GIANWFPSSHASNVSTALGTFLYQICNDNKVDTCAFIYNQQLRHVGSFRVKVSIALPRFFSSLLLHLNGAVLTASDAPRPNPKTLSLSYRLFQGSHVPDI

Query:  DHDVHLSCGSCIFDRSDWDESDEGFFVDRKLASCIVNSLTAESRALSTSINLLSERCLEVDSSHSSFKDFAPFTNRRENG
        DHDVHLSCGSCIFDRSDWDESDEGFFVDRKLASCIVNSLTAESRALSTSINLLSERCLEVDSSHSSFKDFAPFTNRRENG
Subjt:  DHDVHLSCGSCIFDRSDWDESDEGFFVDRKLASCIVNSLTAESRALSTSINLLSERCLEVDSSHSSFKDFAPFTNRRENG

A0A5A7SZY3 Reverse transcriptase Ty1/copia-type domain-containing protein2.0e-30643.92Show/hide
Query:  MEIIREGPSASRPPILDGKNYSYWKPCMIFFIKTLDGKAWRVLVAGYKPPMVIVDGVSVPKPEVDWTDAEEQASVGNARALNAIFNGVDLNVFKLINSCT
        MEII+EGPSASRPP+L+GKNYSYWKP MIFFIKTLDGKAWR L+AGY PPM  V+GVSVPKPEVDWTDAEEQA VGNARALNAIFNGV+LN+ KLIN C+
Subjt:  MEIIREGPSASRPPILDGKNYSYWKPCMIFFIKTLDGKAWRVLVAGYKPPMVIVDGVSVPKPEVDWTDAEEQASVGNARALNAIFNGVDLNVFKLINSCT

Query:  TAKEAWKILEVAYEGTSKVKISRLQLITSKLEALKMIEDESISEYNERVLEIAHDSLLLGEKISESKIVRKVLRSLPRKFDMKVTAIEEAQDITTLELDE
         AKEAWK LEVAYE                           +S+YN+RVLEI ++SLLLGEKI + KIVRKVLRSLPRKFD+KVTAIEEA DI TL+LDE
Subjt:  TAKEAWKILEVAYEGTSKVKISRLQLITSKLEALKMIEDESISEYNERVLEIAHDSLLLGEKISESKIVRKVLRSLPRKFDMKVTAIEEAQDITTLELDE

Query:  LFGSLLTFKMTISDRESKKDKGVAFKSAYEQKTPVNQSDNEVNPDESIALLTKQCSKMEVQKYEYGWTNCKTWKKNGENFTRKADELSNRRNDNYGKKKE
        LFGSLLTFKMTI+DRESKK KG+AFKS                                                                         
Subjt:  LFGSLLTFKMTISDRESKKDKGVAFKSAYEQKTPVNQSDNEVNPDESIALLTKQCSKMEVQKYEYGWTNCKTWKKNGENFTRKADELSNRRNDNYGKKKE

Query:  EVRRSFRCRECEGFGHYQVECPMFLRRQKKNYYATLLDEDSNDN-EVDHGLNAFTTCITEINLDN-SECFDKDEDEDLTFEELKMLSHSEARAIQKERIQ
                                          T +DE+S D+ + D  +N FT  IT+ N D+ SEC  + ++++LT E+L+        A+ KE   
Subjt:  EVRRSFRCRECEGFGHYQVECPMFLRRQKKNYYATLLDEDSNDN-EVDHGLNAFTTCITEINLDN-SECFDKDEDEDLTFEELKMLSHSEARAIQKERIQ

Query:  DLMEENERLMGVISSLKIKLKEVQGKYNQTIKSVKMLNSGTENLDSILNSRQNSSSKYGLGFDASIRNVKSTSKVKFVPAAVKAKTETTCTIAISNPSTR
                        ++KL+EVQ + +Q +K VKMLNSGTENLDSIL S +N S ++GLGF  S   +K+TS++KFVPA++  + E   T      + +
Subjt:  DLMEENERLMGVISSLKIKLKEVQGKYNQTIKSVKMLNSGTENLDSILNSRQNSSSKYGLGFDASIRNVKSTSKVKFVPAAVKAKTETTCTIAISNPSTR

Query:  SFRWTCYYCGRKGHIRPFCYKLQRDKRYQQKAEFESHKNKSSFAKHNRSIRKTHMVLRVKTSDNCKVAFTTIQTTNDAWYFDSGCSRHMTGDRSFFSKLK
        S   TCYYC RKGHIR  CYKL++D+  QQK  +             RS  +  M L + T                         R    + +  S  K
Subjt:  SFRWTCYYCGRKGHIRPFCYKLQRDKRYQQKAEFESHKNKSSFAKHNRSIRKTHMVLRVKTSDNCKVAFTTIQTTNDAWYFDSGCSRHMTGDRSFFSKLK

Query:  ECASRYVTFGNGAKGRIIAKENIDKNNLPCLSDVRYVDRLKGILISKLLTQRVTFTTGLLLDLERLSLMETINVVDDETVNIPVDNSLCPVEVPKTDALI
        E    Y  F N   G ++   N+  N+L                                      S +  +N  +DET N+    +   VEV K     
Subjt:  ECASRYVTFGNGAKGRIIAKENIDKNNLPCLSDVRYVDRLKGILISKLLTQRVTFTTGLLLDLERLSLMETINVVDDETVNIPVDNSLCPVEVPKTDALI

Query:  DDASMNSKKISKEVIADNPEFVPSAHVKKNHLSSSIIGDPVAGIITRKKEKVDYLKMIADLCYTSDIEPSTVDIALKDEYWINAMQEELLQFRQNNVWTL
                       ADNP   P              GDP A + TR+KEK+DY+KM+ADLCY S IEPST D A KDEYW+NAMQEELLQFR+NNVWTL
Subjt:  DDASMNSKKISKEVIADNPEFVPSAHVKKNHLSSSIIGDPVAGIITRKKEKVDYLKMIADLCYTSDIEPSTVDIALKDEYWINAMQEELLQFRQNNVWTL

Query:  ----------------------------------------VEGVNFDETFAPVARLEAIRLLLRVSYIQNFKLYQMDVKSAFLNRYLNEEVYVTQPKGFV
                                                VEG+ FDETFAPVA+LEAIRLLL  ++++  K                            
Subjt:  ----------------------------------------VEGVNFDETFAPVARLEAIRLLLRVSYIQNFKLYQMDVKSAFLNRYLNEEVYVTQPKGFV

Query:  DSEHPQHVYTLNKALYGLKQAPKACQPDITYVVGICARYQADPQTSHLEVVKRILKYVHGTSNFEIMYSYDTTYNLVGYCDVDWAGSSDDRK--------
        D++  +  + L +++ G      A  PDI Y VGICARYQADP  S LEVVKRILKYVHGTS+F +MYSYDTT  LVGYCD DWAG +DDRK        
Subjt:  DSEHPQHVYTLNKALYGLKQAPKACQPDITYVVGICARYQADPQTSHLEVVKRILKYVHGTSNFEIMYSYDTTYNLVGYCDVDWAGSSDDRK--------

Query:  ---------------------------------------STFKGTH-------TPNISMTP---------------------------------------
                                               +T KG++        PN+  +P                                       
Subjt:  ---------------------------------------STFKGTH-------TPNISMTP---------------------------------------

Query:  ------LSNMDSDDLDNVPLAHLLKKTNVPEVTDEIPVAPSVSAHSQESSSTEGVFFPT---LSIAPASNVQPGSSVCSPPSVSLAFAPDDVHAYVLDNV
              +  +DSDD D+VPL  LLKKT+ P + ++   A     HS      EGVF PT   L  +PA+      SV  P S SLA  PD    ++L N 
Subjt:  ------LSNMDSDDLDNVPLAHLLKKTNVPEVTDEIPVAPSVSAHSQESSSTEGVFFPT---LSIAPASNVQPGSSVCSPPSVSLAFAPDDVHAYVLDNV

Query:  PSDVSTTSEGQTDVQSNENE-VDPSNPAVCADEVPTNADDSLAVPPSSS---EVPVALKPVKRKSQQNRRNKTTKIGRKKVPPNIPSVPIDGILFHHEEN
            +   E Q  V   E++    +   +    +P   DD  A  PS+    E P   +P KRK+QQ RRN TTK GRKK+  NIPSVPIDGI FHHEEN
Subjt:  PSDVSTTSEGQTDVQSNENE-VDPSNPAVCADEVPTNADDSLAVPPSSS---EVPVALKPVKRKSQQNRRNKTTKIGRKKVPPNIPSVPIDGILFHHEEN

Query:  VQCWKFVVQRWLADEVNVSDKHQSCMSIMDLIERAGLAKTILNVGPFYPQLIREFIINLPNEFSDPS--------------------------NVVDIDC
        VQ WKFVVQR +ADEVN+S KHQSCMSIMDLI RAGL KTI NVGPFYPQLIREFI+NLP+EF++PS                          N +DID 
Subjt:  VQCWKFVVQRWLADEVNVSDKHQSCMSIMDLIERAGLAKTILNVGPFYPQLIREFIINLPNEFSDPS--------------------------NVVDIDC

Query:  SPSSPSTDVLASVLSGGTLSTWPVNGIFAVALSIKYAIMHKIGIANWFPSSHASNVSTALGTFLYQICNDNKVDTCAFIYNQQLRHVGSFRVKVSIALPR
        S S P T+VLA VLSGGTLSTWPVNGI   ALSIKY  +HKIGIANWFPSSHA ++STALG FLYQICND+KVDT AFIYNQ +RHVGSF VKV IALPR
Subjt:  SPSSPSTDVLASVLSGGTLSTWPVNGIFAVALSIKYAIMHKIGIANWFPSSHASNVSTALGTFLYQICNDNKVDTCAFIYNQQLRHVGSFRVKVSIALPR

Query:  FFSSLLLHLNGAVLTASDAPRPNPKTLSLSYRLFQGSHVPDIDHDVHLSCGSCIFDRSDWDESDEGFFVDRKLASCIVNSLTAESRALSTSINLLSERCL
        FFSSLLLHLNGAVL A+DAP P PKT++ SY+LFQGSHV DI+HDVHL+    IFD +DWDES +GF VD +LA+ IVNSLTAESRAL+ SINLLSER L
Subjt:  FFSSLLLHLNGAVLTASDAPRPNPKTLSLSYRLFQGSHVPDIDHDVHLSCGSCIFDRSDWDESDEGFFVDRKLASCIVNSLTAESRALSTSINLLSERCL

Query:  EVDSSHSSFKDFAPFTNRREN
        EVD+     K  AP T R+++
Subjt:  EVDSSHSSFKDFAPFTNRREN

A0A5A7T169 F5J5.10.0e+0046.89Show/hide
Query:  MIFFIKTLDGKAWRVLVAGYKPPMVIVDGVSVPKPEVDWTDAEEQASVGNARALNAIFNGVDLNVFKLINSCTTAKEAWKILEVAYEGTSKVKISRLQLI
        MIFFIKTL GKAWR LVAGY PP++ V+GVSVPKPEVDWT+AEEQASVGN RALNAIFNGVDLNVFKLINSC+TAKEAWK LEVAYEGTSKVKISRLQLI
Subjt:  MIFFIKTLDGKAWRVLVAGYKPPMVIVDGVSVPKPEVDWTDAEEQASVGNARALNAIFNGVDLNVFKLINSCTTAKEAWKILEVAYEGTSKVKISRLQLI

Query:  TSKLEALKMIEDESISEYNERVLEIAHDSLLLGEKISESKIVRKVLRSLPRKFDMKVTAIEEAQDITTLELDELFGSLLTFKMTISDRESKKDKGVAFKS
        TSK EAL+M EDES+S+YN+RVL+IA++SLLLGEKI +SKIVRKVLRS+ RKFDMKVTAIEEA DITTL+LDELFGSLLTF+M  +DRESKK KG++FKS
Subjt:  TSKLEALKMIEDESISEYNERVLEIAHDSLLLGEKISESKIVRKVLRSLPRKFDMKVTAIEEAQDITTLELDELFGSLLTFKMTISDRESKKDKGVAFKS

Query:  AYEQKTPVNQSDNEVNPDESIALLTKQCSKMEVQKYEYGWTNCKTWKKNGENFTRKADELSNRRNDNYGKKKEEVRRSFRCRECEGFGHYQVECPMFLRR
         +  K      D + N DESIALLTKQ            +TN                 L N +N N                         ECP FLR+
Subjt:  AYEQKTPVNQSDNEVNPDESIALLTKQCSKMEVQKYEYGWTNCKTWKKNGENFTRKADELSNRRNDNYGKKKEEVRRSFRCRECEGFGHYQVECPMFLRR

Query:  QKKNYYATLLDEDSNDNEVDHG-LNAFTTCITEINL-DNSECFDKDEDEDLTFEELKML--SHSEARAIQKERIQDLMEENERLMGVISSLKIKLKEVQG
        QKKN+  TL DE+S D+  D G +NAF   IT+ N  DNSEC  + ++++L+ E+LK L     EAR IQKE IQDL+EENE LM VISSLK+KL+EVQ 
Subjt:  QKKNYYATLLDEDSNDNEVDHG-LNAFTTCITEINL-DNSECFDKDEDEDLTFEELKML--SHSEARAIQKERIQDLMEENERLMGVISSLKIKLKEVQG

Query:  KYNQTIKSVKMLNSGTENLDSILNSRQNSSSKYGLGFDASIRNVKSTSKVKFVPAAVKAKTETTCTIAISNPSTRSFRWTCYYCGRKGHIRPFCYKLQRD
        + +Q +KSVKMLNSG +NLDSIL +  N S +YGLGF +S  + K+TS++KFVPA+++ + +T         S +S   T YYCG+KGHIR  CYKL++D
Subjt:  KYNQTIKSVKMLNSGTENLDSILNSRQNSSSKYGLGFDASIRNVKSTSKVKFVPAAVKAKTETTCTIAISNPSTRSFRWTCYYCGRKGHIRPFCYKLQRD

Query:  KRYQQKAEFESHKNKSSFAKHNRSIRKTHMVLRVKTSDNCKVAFTTIQTTNDAWYFDSGCSRHMTGDRSFFSKLKECASRYVTFGNGAKGRIIAKENIDK
        +  QQK               NRS  +  MV R+K  + CK+AFT++QT +D WYFDSGCSRHMTG+RS+F  L +C   +VTFG+GAKG+IIAK NI+K
Subjt:  KRYQQKAEFESHKNKSSFAKHNRSIRKTHMVLRVKTSDNCKVAFTTIQTTNDAWYFDSGCSRHMTGDRSFFSKLKECASRYVTFGNGAKGRIIAKENIDK

Query:  NNLPCLSDVRYVDRLKGILIS-------------------------------------------------------------------------------
        ++LP L+DVRYVD LK  LIS                                                                               
Subjt:  NNLPCLSDVRYVDRLKGILIS-------------------------------------------------------------------------------

Query:  ------------------------------------------KLLTQRVTFTTGLLLDLERL-----------------------------------SLM
                                                  K  T  V     L L L+R                                    S+M
Subjt:  ------------------------------------------KLLTQRVTFTTGLLLDLERL-----------------------------------SLM

Query:  ETINVVDDETVNIPVDNSLCPVEVPKTDALIDDASMNSKKISKEVIADNPEFVPSAHVKKNHLSSSIIGDPVAGIITRKKEKVDYLKMIADLCYTSDIEP
        ETINVV ++     +D+S+  +   + D   + + + +  I +E  ADN           N  +SSIIGDP  G+ TR+K+K+DYLKM+ADL Y   IEP
Subjt:  ETINVVDDETVNIPVDNSLCPVEVPKTDALIDDASMNSKKISKEVIADNPEFVPSAHVKKNHLSSSIIGDPVAGIITRKKEKVDYLKMIADLCYTSDIEP

Query:  STVDIALKDEYWINAMQEELLQFRQNNVWTL-------VEGVNFDETFAPVARLEAIRLLLRVSYIQNFKLYQMDVKSAFLNRYLNEEVYVTQPKGFVDS
        STVD  +KDEYW+NAMQEELLQFR+NN+WTL       VEGV+FDETFA VARLEAIRLLL +S IQ FKLYQMDVKSAFLN YLN EVYV QPKGFVD 
Subjt:  STVDIALKDEYWINAMQEELLQFRQNNVWTL-------VEGVNFDETFAPVARLEAIRLLLRVSYIQNFKLYQMDVKSAFLNRYLNEEVYVTQPKGFVDS

Query:  EHPQHVYTLNKALYGLKQAPK-------------------------------------------------------------------------------
        EHP+HVY LNKALYGLKQAP+                                                                               
Subjt:  EHPQHVYTLNKALYGLKQAPK-------------------------------------------------------------------------------

Query:  ------------------------ACQPDITYVVGICARYQADPQTSHLEVVKRILKYVHGTSNFEIMYSYDTTYNLVGYCDVDWAGSSDDRK-------
                                A + DI Y VGICARYQ DP+ +HLEVVKRILKYVHG S+F +MYSY+TT  LVGY DVDWAGS+DDRK       
Subjt:  ------------------------ACQPDITYVVGICARYQADPQTSHLEVVKRILKYVHGTSNFEIMYSYDTTYNLVGYCDVDWAGSSDDRK-------

Query:  -STFKGTHTPNISMTPLSNMDSDDLDNVPLAHLLKKTNVPEVTDE-IPVAPSV-----------------------SAHSQESSSTEGVFFPTLSIAPAS
           FK T        P   +  +      L   L+   +P+V +   PV+P+V                       S HSQESSSTEGVF PTL     S
Subjt:  -STFKGTHTPNISMTPLSNMDSDDLDNVPLAHLLKKTNVPEVTDE-IPVAPSV-----------------------SAHSQESSSTEGVFFPTLSIAPAS

Query:  NVQP-GSSVCSPPSVSL--AFAPDDVHAYVLDNVPSDVSTTSEGQTDVQSNENEVDPSNPAVCADEVPTNADDSLAVPPSS---SEVPVALKPVKRKSQQ
           P G S    PS+S      PD V A++L+   +  S           N+ E+ P       +++    DD +A  PSS   ++ P   KP KRK+QQ
Subjt:  NVQP-GSSVCSPPSVSL--AFAPDDVHAYVLDNVPSDVSTTSEGQTDVQSNENEVDPSNPAVCADEVPTNADDSLAVPPSS---SEVPVALKPVKRKSQQ

Query:  NRRNKTTKIGRKKVPPNIPSVPIDGILFHHEENVQCWKFVVQRWLADEVNVSDKHQSCMSIMDLIERAGLAKTILNVGPFYPQLIREFIINLPNEFSDPS
         RRN TTKIGRKK+P N+PSVPIDGI FHH+E+VQ WKFV++R + DE                                   LI+EFI+NLP+EF+DPS
Subjt:  NRRNKTTKIGRKKVPPNIPSVPIDGILFHHEENVQCWKFVVQRWLADEVNVSDKHQSCMSIMDLIERAGLAKTILNVGPFYPQLIREFIINLPNEFSDPS

Query:  --------------------------NVVDIDCSPSSPSTDVLASVLSGGTLSTWPVNGIFAVALSIKYAIMHKIGIANWFPSSHASNVSTALGTFLYQI
                                  N VDIDCSPS  + ++LA+VLS GTLSTWPVNGI A ALS+KYAI+HKIGI NWFPS HAS++S ALGTFLYQI
Subjt:  --------------------------NVVDIDCSPSSPSTDVLASVLSGGTLSTWPVNGIFAVALSIKYAIMHKIGIANWFPSSHASNVSTALGTFLYQI

Query:  CNDNKVDTCAFIYNQQLRHVGSFRVKVSIALPRFFSSLLLHLNGAVLTASDAPRPNPKTLSLSYRLFQGSHVPDIDHDVHLSCGSCIFDRSDWDESDEGF
        CND+KVDT  FIYNQ LRHVGSF VKV IA PR FSSLLLHLNG VLT SDAP P PKT++L YRLFQGSH+PDIDHDVH + G  IFD +DWD+S EGF
Subjt:  CNDNKVDTCAFIYNQQLRHVGSFRVKVSIALPRFFSSLLLHLNGAVLTASDAPRPNPKTLSLSYRLFQGSHVPDIDHDVHLSCGSCIFDRSDWDESDEGF

Query:  FVDRKLASCIVNSLTAESRALSTSINLLSERCLEVDSSHSSFKDFAPFTNRRE
        +V+R+LA+ I+NSLTAESRA++ SI LLSER LEVD+     K   P T+R++
Subjt:  FVDRKLASCIVNSLTAESRALSTSINLLSERCLEVDSSHSSFKDFAPFTNRRE

A0A5D3DCZ8 Gag-pol polyprotein1.9e-24842.32Show/hide
Query:  MEIIREGPSASRPPILDGKNYSYWKPCMIFFIKTLDGKAWRVLVAGYKPPMVIVDGVSVPKPEVDWTDAEEQASVGNARALNAIFNGVDLNVFKLINSCT
        MEIIREGPSASRPPIL GKNYSYWKP MIFFIKTLDGKAWR LV+GY+P M+ ++GVSVPKPE+DWTDAEEQASVGNARA+NAIFNGV+L+VFKLINSC 
Subjt:  MEIIREGPSASRPPILDGKNYSYWKPCMIFFIKTLDGKAWRVLVAGYKPPMVIVDGVSVPKPEVDWTDAEEQASVGNARALNAIFNGVDLNVFKLINSCT

Query:  TAKEAWKILEVAYEGTSKVKISRLQLITSKLEALKMIEDESISEYNERVLEIAHDSLLLGEKISESKIVRKVLRSLPRKFDMKVTAIEEAQDITTLELDE
        TAKEAWKILEVA+E TSKVKISRLQLITSK EALKM EDE++SEYNERVLEIA+DSLLLGEKI ESKIV KVLRSLPRKFDMKVTAIEEAQDI TL+LDE
Subjt:  TAKEAWKILEVAYEGTSKVKISRLQLITSKLEALKMIEDESISEYNERVLEIAHDSLLLGEKISESKIVRKVLRSLPRKFDMKVTAIEEAQDITTLELDE

Query:  LFGSLLTFKMTISDRESKKDKGVAFKSAYEQKTPVNQSDNEVNPDESIALLTKQCSKM--EVQKYEYGWTNCKTWKKNGENFTRKADELSNRRNDNYGKK
        LFGSLLTF+M ISDRESKK KG+AFKS Y+Q+  VNQS NE N DESI LLTKQ SKM  + +    G    K+ + +GEN  RK ++ S RRN ++GKK
Subjt:  LFGSLLTFKMTISDRESKKDKGVAFKSAYEQKTPVNQSDNEVNPDESIALLTKQCSKM--EVQKYEYGWTNCKTWKKNGENFTRKADELSNRRNDNYGKK

Query:  KEEVRRSFRCRECEGFGHYQVECPMFLRRQKKNYYATLLDEDSNDNEVDHGLNAFTTCITEINLDNSECFDKDEDEDLTFEELKML--SHSEARAIQKER
        KE+V RSFRCREC+G                                                    EC D DED++LT EELK+L    SEA+ IQKER
Subjt:  KEEVRRSFRCRECEGFGHYQVECPMFLRRQKKNYYATLLDEDSNDNEVDHGLNAFTTCITEINLDNSECFDKDEDEDLTFEELKML--SHSEARAIQKER

Query:  IQDLMEENERLMGVISSLKIKLKEVQGKYNQTIKSVKMLNSGTENLDSILNSRQNSSSKYGLGFDASIRNVKSTSKVKFVPAAVKAKTETTCTIAISNPS
        IQDLM+ENERLMG+ISSLK+KLKEVQ  Y+QTIKS KMLNSGT++LDSIL+  QN SSKYGLGFD S R VK   +VKFVPA+V+  T+ +C        
Subjt:  IQDLMEENERLMGVISSLKIKLKEVQGKYNQTIKSVKMLNSGTENLDSILNSRQNSSSKYGLGFDASIRNVKSTSKVKFVPAAVKAKTETTCTIAISNPS

Query:  TRSFRWTCYYCGRKGHIRPFCYKLQRDKRYQQKAEFESHKNKSSFAKHNRSIRKTHMVLRVKTSDNCKVAFTTIQTTNDAWY---FDSGCSRHMTGDRSF
         +  RW  YYC R+GHIR FCYKL RD+R+QQ+ +  + +NK    K N  +R TH + RVK S+ C VAFTT+QT  DA           + +   RS 
Subjt:  TRSFRWTCYYCGRKGHIRPFCYKLQRDKRYQQKAEFESHKNKSSFAKHNRSIRKTHMVLRVKTSDNCKVAFTTIQTTNDAWY---FDSGCSRHMTGDRSF

Query:  FSKL--KECASRYVTFGNGAKGRIIAKENIDKNNLPCLSDVRYVDRLKGILISKLLT---------------QRVTFTTGLLLDLERL------------
          K    E  + +   G G   + +      +N +    ++   +  + ++ +K L                 RVT  +G+ + L  L            
Subjt:  FSKL--KECASRYVTFGNGAKGRIIAKENIDKNNLPCLSDVRYVDRLKGILISKLLT---------------QRVTFTTGLLLDLERL------------

Query:  ---------------------------------------------SLMETINVV--------------DDETVNIPVDNSLCPVEVPKTDALIDDASMNS
                                                     ++METINVV              DDET   P   S    E+PK ++ +  A  +S
Subjt:  ---------------------------------------------SLMETINVV--------------DDETVNIPVDNSLCPVEVPKTDALIDDASMNS

Query:  KKISKEVIADNPEFVPSAHVKKNHLSSSIIGDPVAGIITRKKEKVDYLKMIA-------------------DLCYTSDIEPSTVDIALKDEYWINAMQEE
          I+ EVI +    VPSAHVKKNH SSSII DP AGI TR+KE ++    I+                   DLCY S IEP++V+ +LKDEYWI  MQEE
Subjt:  KKISKEVIADNPEFVPSAHVKKNHLSSSIIGDPVAGIITRKKEKVDYLKMIA-------------------DLCYTSDIEPSTVDIALKDEYWINAMQEE

Query:  LLQFRQNNVWTL----------------------------------------VEGVNFDETFAPVARLEAIRLLLRVSYIQNFKLYQMDVKSAFLNRYLN
         LQF++NNVWTL                                        VEGV+ DETFAPVARLEAIRLLL +S  Q FKL+QMDVKSAFLN YLN
Subjt:  LLQFRQNNVWTL----------------------------------------VEGVNFDETFAPVARLEAIRLLLRVSYIQNFKLYQMDVKSAFLNRYLN

Query:  EEVYVTQPKGFVDSEHPQHVYTLNKALYGLKQAPKAC---------------------------------------------------------------
        EEV V +PKGF+DSE PQ+VY LNKALYGLKQAP+AC                                                               
Subjt:  EEVYVTQPKGFVDSEHPQHVYTLNKALYGLKQAPKAC---------------------------------------------------------------

Query:  ------------------------------------------------------QPDITYVVGICARYQADPQTSH---------LEVVKRILKYVHGTS
                                                              +PDI Y VGICARYQ++P+++            +   ++ +     
Subjt:  ------------------------------------------------------QPDITYVVGICARYQADPQTSH---------LEVVKRILKYVHGTS

Query:  NFEIMYSYDTTYNLV--------------------------------GYCDVDWAGSSDD------RKSTFKGTHT-PNISMTPLSNMDSDDLDNVPLAH
        N   + + +  Y                                    + D+  +GS++        ++  K   T P  S   LS++DSDDLD+V LA 
Subjt:  NFEIMYSYDTTYNLV--------------------------------GYCDVDWAGSSDD------RKSTFKGTHT-PNISMTPLSNMDSDDLDNVPLAH

Query:  LLKKTNVPEVTDEIP--VAPSVSAHSQESSSTEGVFFPTLSIAPASNVQPGSSV----CSPPSVSLAFAPDDVHAYVLDNVPSDVSTTSEGQTDVQSNEN
        L+KK  VP   D IP  V   V + SQESSS+EGVF PTL +   S+++PG S+      PP  ++A AP D                +E       N +
Subjt:  LLKKTNVPEVTDEIP--VAPSVSAHSQESSSTEGVFFPTLSIAPASNVQPGSSV----CSPPSVSLAFAPDDVHAYVLDNVPSDVSTTSEGQTDVQSNEN

Query:  EVDPSNPAVCADEVPTNADDSLAVPPSSSEVP
        +V+P  P    +EV        + P  +  VP
Subjt:  EVDPSNPAVCADEVPTNADDSLAVPPSSSEVP

A0A5D3E2Y4 F5J5.10.0e+0046.78Show/hide
Query:  MIFFIKTLDGKAWRVLVAGYKPPMVIVDGVSVPKPEVDWTDAEEQASVGNARALNAIFNGVDLNVFKLINSCTTAKEAWKILEVAYEGTSKVKISRLQLI
        MIFFIKTL GKAWR LVAGY PP++ V+GVSVPKPEVDWT+AEEQASVGN RALNAIFNGVDLNVFKLINSC+TAKEAWK LEVAYEGTSKVKISRLQLI
Subjt:  MIFFIKTLDGKAWRVLVAGYKPPMVIVDGVSVPKPEVDWTDAEEQASVGNARALNAIFNGVDLNVFKLINSCTTAKEAWKILEVAYEGTSKVKISRLQLI

Query:  TSKLEALKMIEDESISEYNERVLEIAHDSLLLGEKISESKIVRKVLRSLPRKFDMKVTAIEEAQDITTLELDELFGSLLTFKMTISDRESKKDKGVAFKS
        TSK EAL+M EDES+S+YN+RVL+IA++SLLLGEKI +SKIVRKVLRS+ RKFDMKVTAIEEA DITTL+LDELFGSLLTF+M  +DRESKK KG++FKS
Subjt:  TSKLEALKMIEDESISEYNERVLEIAHDSLLLGEKISESKIVRKVLRSLPRKFDMKVTAIEEAQDITTLELDELFGSLLTFKMTISDRESKKDKGVAFKS

Query:  AYEQKTPVNQSDNEVNPDESIALLTKQCSKMEVQKYEYGWTNCKTWKKNGENFTRKADELSNRRNDNYGKKKEEVRRSFRCRECEGFGHYQVECPMFLRR
         +  K      D + N DESIALLTKQ            +TN                 L N +N N                         ECP FLR+
Subjt:  AYEQKTPVNQSDNEVNPDESIALLTKQCSKMEVQKYEYGWTNCKTWKKNGENFTRKADELSNRRNDNYGKKKEEVRRSFRCRECEGFGHYQVECPMFLRR

Query:  QKKNYYATLLDEDSNDNEVDHG-LNAFTTCITEINL-DNSECFDKDEDEDLTFEELKML--SHSEARAIQKERIQDLMEENERLMGVISSLKIKLKEVQG
        QKKN+  TL DE+S D+  D G +NAF   IT+ N  DNSEC  + ++++L+ E+LK L     EAR IQKE IQDL+EENE LM VISSLK+KL+EVQ 
Subjt:  QKKNYYATLLDEDSNDNEVDHG-LNAFTTCITEINL-DNSECFDKDEDEDLTFEELKML--SHSEARAIQKERIQDLMEENERLMGVISSLKIKLKEVQG

Query:  KYNQTIKSVKMLNSGTENLDSILNSRQNSSSKYGLGFDASIRNVKSTSKVKFVPAAVKAKTETTCTIAISNPSTRSFRWTCYYCGRKGHIRPFCYKLQRD
        + +Q +KSVKMLNSG +NLDSIL +  N S +YGLGF +S  + K+TS++KFVPA+++ + +T         S +S   T YYCG+KGHIR  CYKL++D
Subjt:  KYNQTIKSVKMLNSGTENLDSILNSRQNSSSKYGLGFDASIRNVKSTSKVKFVPAAVKAKTETTCTIAISNPSTRSFRWTCYYCGRKGHIRPFCYKLQRD

Query:  KRYQQKAEFESHKNKSSFAKHNRSIRKTHMVLRVKTSDNCKVAFTTIQTTNDAWYFDSGCSRHMTGDRSFFSKLKECASRYVTFGNGAKGRIIAKENIDK
        +  QQK               NRS  +  MV R+K  + CK+AFT++QT +D WYFDSGCSRHMTG+RS+F  L +C   +VTFG+GAKG+IIAK NI+K
Subjt:  KRYQQKAEFESHKNKSSFAKHNRSIRKTHMVLRVKTSDNCKVAFTTIQTTNDAWYFDSGCSRHMTGDRSFFSKLKECASRYVTFGNGAKGRIIAKENIDK

Query:  NNLPCLSDVRYVDRLKGILIS-------------------------------------------------------------------------------
        ++LP L+DVRYVD LK  LIS                                                                               
Subjt:  NNLPCLSDVRYVDRLKGILIS-------------------------------------------------------------------------------

Query:  ------------------------------------------KLLTQRVTFTTGLLLDLERL-----------------------------------SLM
                                                  K  T  V     L L L+R                                    S+M
Subjt:  ------------------------------------------KLLTQRVTFTTGLLLDLERL-----------------------------------SLM

Query:  ETINVVDDETVNIPVDNSLCPVEVPKTDALIDDASMNSKKISKEVIADNPEFVPSAHVKKNHLSSSIIGDPVAGIITRKKEKVDYLKMIADLCYTSDIEP
        ETINVV ++     +D+S+  +   + D   + + + +  I +E  ADN           N  +SSIIGDP  G+ TR+K+K+DYLKM+ADL Y   IEP
Subjt:  ETINVVDDETVNIPVDNSLCPVEVPKTDALIDDASMNSKKISKEVIADNPEFVPSAHVKKNHLSSSIIGDPVAGIITRKKEKVDYLKMIADLCYTSDIEP

Query:  STVDIALKDEYWINAMQEELLQFRQNNVWTL-------VEGVNFDETFAPVARLEAIRLLLRVSYIQNFKLYQMDVKSAFLNRYLNEEVYVTQPKGFVDS
        STVD  +KDEYW+NAMQEELLQFR+NN+WTL       VEGV+FDETFA VARLEAIRLLL +S IQ FKLYQMDVKSAFLN YLN EVYV QPKGFVD 
Subjt:  STVDIALKDEYWINAMQEELLQFRQNNVWTL-------VEGVNFDETFAPVARLEAIRLLLRVSYIQNFKLYQMDVKSAFLNRYLNEEVYVTQPKGFVDS

Query:  EHPQHVYTLNKALYGLKQAPK-------------------------------------------------------------------------------
        EHP+HVY LNKALYGLKQAP+                                                                               
Subjt:  EHPQHVYTLNKALYGLKQAPK-------------------------------------------------------------------------------

Query:  ------------------------ACQPDITYVVGICARYQADPQTSHLEVVKRILKYVHGTSNFEIMYSYDTTYNLVGYCDVDWAGSSDDRK-------
                                A + DI Y VGICARYQ DP+ +HLEVVKRILKYVHG S+F +MYSY+TT  LVGY DVDWAGS+DDRK       
Subjt:  ------------------------ACQPDITYVVGICARYQADPQTSHLEVVKRILKYVHGTSNFEIMYSYDTTYNLVGYCDVDWAGSSDDRK-------

Query:  -STFKGTHTPNISMTPLSNMDSDDLDNVPLAHLLKKTNVPEVTDE-IPVAPSV-----------------------SAHSQESSSTEGVFFPTLSIAPAS
           FK T        P   +  +      L   L+   +P+V +   PV+P+V                       S HSQESSSTEGVF PTL     S
Subjt:  -STFKGTHTPNISMTPLSNMDSDDLDNVPLAHLLKKTNVPEVTDE-IPVAPSV-----------------------SAHSQESSSTEGVFFPTLSIAPAS

Query:  NVQP-GSSVCSPPSVSL--AFAPDDVHAYVLDNVPSDVSTTSEGQTDVQSNENEVDPSNPAVCADEVPTNADDSLAVPPSS---SEVPVALKPVKRKSQQ
           P G S    PS+S      PD V A++L     +++T +E                  +  +++    DD +A  PSS   ++ P   KP KRK+QQ
Subjt:  NVQP-GSSVCSPPSVSL--AFAPDDVHAYVLDNVPSDVSTTSEGQTDVQSNENEVDPSNPAVCADEVPTNADDSLAVPPSS---SEVPVALKPVKRKSQQ

Query:  NRRNKTTKIGRKKVPPNIPSVPIDGILFHHEENVQCWKFVVQRWLADEVNVSDKHQSCMSIMDLIERAGLAKTILNVGPFYPQLIREFIINLPNEFSDPS
         RRN TTKIGRKK+P N+PSVPIDGI FHH+E+VQ WKFV++R + DE                                   LI+EFI+NLP+EF+DPS
Subjt:  NRRNKTTKIGRKKVPPNIPSVPIDGILFHHEENVQCWKFVVQRWLADEVNVSDKHQSCMSIMDLIERAGLAKTILNVGPFYPQLIREFIINLPNEFSDPS

Query:  --------------------------NVVDIDCSPSSPSTDVLASVLSGGTLSTWPVNGIFAVALSIKYAIMHKIGIANWFPSSHASNVSTALGTFLYQI
                                  N VDIDCSPS  + ++LA+VLS GTLSTWPVNGI A ALS+KYAI+HKIGI NWFPS HAS++S ALGTFLYQI
Subjt:  --------------------------NVVDIDCSPSSPSTDVLASVLSGGTLSTWPVNGIFAVALSIKYAIMHKIGIANWFPSSHASNVSTALGTFLYQI

Query:  CNDNKVDTCAFIYNQQLRHVGSFRVKVSIALPRFFSSLLLHLNGAVLTASDAPRPNPKTLSLSYRLFQGSHVPDIDHDVHLSCGSCIFDRSDWDESDEGF
        CND+KVDT  FIYNQ LRHVGSF VKV IA PR FSSLLLHLNG VLT SDAP P PKT++L YRLFQGSH+PDIDHDVH + G  IFD +DWD+S EGF
Subjt:  CNDNKVDTCAFIYNQQLRHVGSFRVKVSIALPRFFSSLLLHLNGAVLTASDAPRPNPKTLSLSYRLFQGSHVPDIDHDVHLSCGSCIFDRSDWDESDEGF

Query:  FVDRKLASCIVNSLTAESRALSTSINLLSERCLEVDSSHSSFKDFAPFTNRRE
        +V+R+LA+ I+NSLTAESRA++ SI LLSER LEVD+     K   P T+R++
Subjt:  FVDRKLASCIVNSLTAESRALSTSINLLSERCLEVDSSHSSFKDFAPFTNRRE

SwissProt top hitse value%identityAlignment
P04146 Copia protein9.9e-1348.75Show/hide
Query:  VNFDETFAPVARLEAIRLLLRVSYIQNFKLYQMDVKSAFLNRYLNEEVYVTQPKGFVDSEHPQHVYTLNKALYGLKQAPK
        ++++ETFAPVAR+ + R +L +    N K++QMDVK+AFLN  L EE+Y+  P+G   S +  +V  LNKA+YGLKQA +
Subjt:  VNFDETFAPVARLEAIRLLLRVSYIQNFKLYQMDVKSAFLNRYLNEEVYVTQPKGFVDSEHPQHVYTLNKALYGLKQAPK

P0CV72 Secreted RxLR effector protein 1612.5e-0836.23Show/hide
Query:  QPDITYVVGICARYQADPQTSHLEVVKRILKYVHGTSNFEIMYSYDTTYNLVGYCDVDWAGSSDDRKST
        +PD+   VG+ +++ +DP  +H + +KR+L+Y+  T  + + ++   T  LVGY D DWAG  + R+ST
Subjt:  QPDITYVVGICARYQADPQTSHLEVVKRILKYVHGTSNFEIMYSYDTTYNLVGYCDVDWAGSSDDRKST

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-944.4e-1345.12Show/hide
Query:  EGVNFDETFAPVARLEAIRLLLRVSYIQNFKLYQMDVKSAFLNRYLNEEVYVTQPKGFVDSEHPQHVYTLNKALYGLKQAPK
        +G++FDE F+PV ++ +IR +L ++   + ++ Q+DVK+AFL+  L EE+Y+ QP+GF  +     V  LNK+LYGLKQAP+
Subjt:  EGVNFDETFAPVARLEAIRLLLRVSYIQNFKLYQMDVKSAFLNRYLNEEVYVTQPKGFVDSEHPQHVYTLNKALYGLKQAPK

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE13.0e-1734.16Show/hide
Query:  SDIEPSTVDIALKDEYWINAMQEELLQFRQNNVWTLV-----------------------------------------EGVNFDETFAPVARLEAIRLLL
        ++ EP T   ALKDE W NAM  E+     N+ W LV                                          G+++ ETF+PV +  +IR++L
Subjt:  SDIEPSTVDIALKDEYWINAMQEELLQFRQNNVWTLV-----------------------------------------EGVNFDETFAPVARLEAIRLLL

Query:  RVSYIQNFKLYQMDVKSAFLNRYLNEEVYVTQPKGFVDSEHPQHVYTLNKALYGLKQAPKA
         V+  +++ + Q+DV +AFL   L ++VY++QP GF+D + P +V  L KALYGLKQAP+A
Subjt:  RVSYIQNFKLYQMDVKSAFLNRYLNEEVYVTQPKGFVDSEHPQHVYTLNKALYGLKQAPKA

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE26.6e-1730.57Show/hide
Query:  ITRKKEKVDYLKMIADLCYTSDIEPSTVDIALKDEYWINAMQEELLQFRQNNVWTLV-----------------------------------------EG
        I +  +K  Y   +A     ++ EP T   A+KD+ W  AM  E+     N+ W LV                                          G
Subjt:  ITRKKEKVDYLKMIADLCYTSDIEPSTVDIALKDEYWINAMQEELLQFRQNNVWTLV-----------------------------------------EG

Query:  VNFDETFAPVARLEAIRLLLRVSYIQNFKLYQMDVKSAFLNRYLNEEVYVTQPKGFVDSEHPQHVYTLNKALYGLKQAPKACQPDI-TYVVGI
        +++ ETF+PV +  +IR++L V+  +++ + Q+DV +AFL   L +EVY++QP GFVD + P +V  L KA+YGLKQAP+A   ++ TY++ +
Subjt:  VNFDETFAPVARLEAIRLLLRVSYIQNFKLYQMDVKSAFLNRYLNEEVYVTQPKGFVDSEHPQHVYTLNKALYGLKQAPKACQPDI-TYVVGI

Arabidopsis top hitse value%identityAlignment
AT3G21000.1 Gag-Pol-related retrotransposon family protein5.4e-0627.72Show/hide
Query:  ILDGKNYSYWKPCMIFFIKTLDGKAWRVLVAGYKPPMVIVDGVSVPKPEVDWTDAEEQAS------VGNARALNAIFNGVDLNVFKLINSCTTAKEAWKI
        + D  +Y  W P  I     ++   W V+V G       V       PE+  T   E+ S      V +A+AL  + + +  +VF+   S ++AK+ W +
Subjt:  ILDGKNYSYWKPCMIFFIKTLDGKAWRVLVAGYKPPMVIVDGVSVPKPEVDWTDAEEQAS------VGNARALNAIFNGVDLNVFKLINSCTTAKEAWKI

Query:  LEVAYEGTSKVKISRLQLIT-----SKLEALKMIEDESISEYNERVLEIAHDSLLLGEKISESKIVRKVLRSLPRKFDMKVTAIEEAQDI---TTLELDE
        L    +G  +  I RL+ +T      +LE LKM++ ES S Y ++ LEI         + S+ +I + V  +L   FD   + +EE  D+   T+  L E
Subjt:  LEVAYEGTSKVKISRLQLIT-----SKLEALKMIEDESISEYNERVLEIAHDSLLLGEKISESKIVRKVLRSLPRKFDMKVTAIEEAQDI---TTLELDE

Query:  LF
         F
Subjt:  LF

AT4G05360.1 Zinc knuckle (CCHC-type) family protein1.8e-0926.16Show/hide
Query:  RCRECEGFGHYQVECPMFLRRQKKNYYATLLDEDSNDNEVDHGLNAFTTCITEI--------------------------NLDNSECFDKDEDEDLTFEE
        RC EC+GF H   EC   ++ ++K +  +  + DS+D E    L AFTT  + I                            DN    D+ +D+DL+  +
Subjt:  RCRECEGFGHYQVECPMFLRRQKKNYYATLLDEDSNDNEVDHGLNAFTTCITEI--------------------------NLDNSECFDKDEDEDLTFEE

Query:  LKMLSHSEARAIQKERIQDLMEENERL--------MGVISSLKI--KLKEVQGKYNQTIKSVKMLNSGTENLDSILNSRQNSSSKYGLGF----------
         +   + +A     E    ++EEN  L          V+ +LK   + +E   +  +T K+++MLN+GT+ L  IL+     + K GLGF          
Subjt:  LKMLSHSEARAIQKERIQDLMEENERL--------MGVISSLKI--KLKEVQGKYNQTIKSVKMLNSGTENLDSILNSRQNSSSKYGLGF----------

Query:  -------DASIRNVKSTSKVKFVPAAVK--AKTETTCTIA------ISNPSTRSFRWTCYYCGRKGHIRPFCYKLQRDK
                ++   VK T+ V  + +  +  ++T+T  T        + +   R FR  C++CG  GHIRP C++L R+K
Subjt:  -------DASIRNVKSTSKVKFVPAAVK--AKTETTCTIA------ISNPSTRSFRWTCYYCGRKGHIRPFCYKLQRDK

AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 89.8e-1631.46Show/hide
Query:  EKVDYLKMIADLCYTSDIEPSTVDIALKDEYWINAMQEELLQFRQNNVW----------------------------------------TLVEGVNFDET
        EKV  L     +C     EPST + A +   W  AM +E+      + W                                        T  EG++F ET
Subjt:  EKVDYLKMIADLCYTSDIEPSTVDIALKDEYWINAMQEELLQFRQNNVW----------------------------------------TLVEGVNFDET

Query:  FAPVARLEAIRLLLRVSYIQNFKLYQMDVKSAFLNRYLNEEVYVTQPKGFV----DSEHPQHVYTLNKALYGLKQAPK
        F+PV +L +++L+L +S I NF L+Q+D+ +AFLN  L+EE+Y+  P G+     DS  P  V  L K++YGLKQA +
Subjt:  FAPVARLEAIRLLLRVSYIQNFKLYQMDVKSAFLNRYLNEEVYVTQPKGFV----DSEHPQHVYTLNKALYGLKQAPK

AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.6e-0545.1Show/hide
Query:  ISQIKQRVGDEFEINDLENLKYFLGIEVTSSKEGISMSKRKYTLYLLTETG
        + ++K ++   F++ DL  LKYFLG+E+  S  GI++ +RKY L LL ETG
Subjt:  ISQIKQRVGDEFEINDLENLKYFLGIEVTSSKEGISMSKRKYTLYLLTETG

ATMG00810.1 DNA/RNA polymerases superfamily protein9.8e-0831.71Show/hide
Query:  KALYGLKQAPKACQPDITYVVGICARYQADPQTSHLEVVKRILKYVHGTSNFEIMYSYDTTYNLVGYCDVDWAGSSDDRKST
        +++ G  Q     +PDI+Y V I  +   +P  +  +++KR+L+YV GT    +    ++  N+  +CD DWAG +  R+ST
Subjt:  KALYGLKQAPKACQPDITYVVGICARYQADPQTSHLEVVKRILKYVHGTSNFEIMYSYDTTYNLVGYCDVDWAGSSDDRKST

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)1.3e-0429.6Show/hide
Query:  IITRKKEKVDYLKMIADLCYTSDI--EPSTVDIALKDEYWINAMQEELLQFRQNNVWTLV----------------------------------------
        ++TR K  ++ L     L  T+ I  EP +V  ALKD  W  AMQEEL    +N  W LV                                        
Subjt:  IITRKKEKVDYLKMIADLCYTSDI--EPSTVDIALKDEYWINAMQEELLQFRQNNVWTLV----------------------------------------

Query:  EGVNFDETFAPVARLEAIRLLLRVS
        EG+ F ET++PV R   IR +L V+
Subjt:  EGVNFDETFAPVARLEAIRLLLRVS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGATAATCAGAGAGGGACCATCAGCTTCACGTCCTCCTATACTGGATGGAAAAAATTACTCATATTGGAAGCCTTGTATGATATTCTTTATTAAAACATTAGATGG
AAAAGCGTGGAGAGTCCTTGTTGCTGGTTATAAACCCCCAATGGTTATTGTGGATGGAGTGTCTGTACCAAAACCTGAGGTTGATTGGACAGATGCTGAAGAACAGGCAT
CGGTTGGGAATGCTAGAGCCCTAAATGCTATATTCAATGGTGTGGATCTAAATGTGTTCAAACTTATAAACTCTTGTACTACAGCTAAAGAAGCATGGAAAATACTGGAA
GTTGCCTATGAAGGTACTTCTAAAGTTAAAATATCCAGACTGCAGTTGATAACTTCGAAACTCGAAGCTTTAAAAATGATAGAAGATGAATCAATTTCAGAGTATAATGA
GAGGGTTCTGGAGATTGCTCATGATTCACTACTGCTTGGTGAAAAGATATCTGAATCTAAAATTGTTCGAAAGGTGTTACGTTCTCTACCAAGGAAATTTGACATGAAGG
TTACTGCCATAGAAGAAGCTCAGGATATAACCACTTTGGAACTTGATGAGTTATTTGGGTCACTACTCACATTTAAAATGACTATATCTGATAGAGAAAGCAAGAAAGAC
AAGGGGGTCGCTTTTAAATCTGCATATGAACAGAAGACTCCAGTGAATCAATCTGATAATGAAGTCAATCCAGATGAGTCAATAGCTCTCTTGACTAAACAATGCTCTAA
AATGGAAGTTCAAAAGTATGAATACGGTTGGACCAACTGCAAAACCTGGAAGAAAAATGGTGAGAACTTTACAAGAAAGGCAGATGAACTCTCAAACAGGAGAAATGACA
ACTACGGAAAGAAGAAAGAGGAAGTAAGGAGGTCTTTCAGATGTAGAGAGTGTGAAGGTTTCGGTCATTATCAAGTTGAATGTCCCATGTTTCTAAGAAGACAAAAGAAA
AATTATTATGCTACCTTGTTAGATGAAGACTCTAATGATAATGAAGTTGATCATGGATTGAATGCCTTCACAACATGCATTACAGAAATCAATCTTGATAATAGTGAGTG
TTTTGATAAAGATGAGGATGAGGATCTAACGTTTGAAGAACTCAAGATGTTAAGTCACTCAGAAGCTAGAGCTATACAAAAAGAAAGAATTCAAGATTTGATGGAAGAAA
ATGAACGATTGATGGGTGTCATATCATCTCTAAAAATAAAATTGAAAGAGGTTCAAGGTAAGTATAATCAGACAATTAAGTCTGTAAAGATGCTGAACTCAGGAACAGAG
AACTTAGACTCAATACTAAACTCAAGACAGAATAGTTCAAGTAAATATGGTCTTGGGTTTGATGCTTCCATAAGGAATGTTAAGTCTACATCTAAAGTGAAATTTGTTCC
TGCTGCAGTAAAAGCAAAAACTGAAACAACTTGTACAATTGCTATTTCAAATCCATCTACTAGATCTTTCCGATGGACCTGTTATTACTGTGGTCGGAAAGGTCATATTA
GACCATTTTGCTACAAGTTACAAAGGGACAAAAGGTATCAGCAGAAGGCAGAATTTGAAAGTCATAAGAATAAGTCTAGTTTTGCTAAACACAACAGAAGTATAAGGAAA
ACTCACATGGTTTTGAGAGTTAAAACATCTGACAATTGCAAGGTTGCATTTACAACTATTCAAACCACAAATGATGCTTGGTACTTTGATAGTGGATGTTCAAGGCATAT
GACTGGTGATAGGTCATTCTTTTCTAAATTAAAGGAATGTGCCTCAAGATATGTTACTTTTGGTAATGGTGCCAAAGGAAGAATTATTGCCAAAGAAAATATTGATAAAA
ATAATCTACCCTGTCTAAGTGATGTTAGATATGTGGATAGATTAAAAGGAATCTTGATTAGTAAGCTGTTAACACAGCGTGTCACATTCACAACAGGATTACTACTTGAT
TTGGAACGACTGTCACTTATGGAAACGATCAATGTTGTGGATGATGAGACTGTAAATATACCTGTGGATAATTCGCTGTGTCCTGTGGAGGTACCTAAAACTGATGCTTT
AATAGATGATGCTAGTATGAACTCAAAAAAGATATCTAAGGAAGTTATAGCTGATAATCCTGAGTTTGTTCCTTCTGCACATGTGAAGAAAAATCATCTATCAAGTTCTA
TAATAGGTGATCCTGTGGCTGGAATTATTACTCGAAAGAAAGAGAAAGTAGATTACTTGAAGATGATTGCTGACTTATGTTATACGTCTGATATTGAACCCTCAACTGTT
GACATTGCTCTTAAAGATGAATATTGGATTAACGCAATGCAAGAAGAACTACTCCAATTCAGGCAAAACAATGTCTGGACATTGGTCGAGGGGGTTAACTTTGATGAAAC
ATTTGCACCTGTTGCCAGACTTGAAGCTATTCGCTTGTTACTCAGAGTATCTTATATTCAAAATTTTAAATTATATCAAATGGATGTAAAAAGTGCTTTCTTGAATAGAT
ACTTAAATGAAGAAGTCTATGTTACTCAACCTAAGGGTTTTGTTGATTCCGAACATCCTCAACATGTGTATACGCTTAATAAAGCACTATATGGGCTTAAGCAAGCTCCT
AAAGCTTGTCAACCTGACATAACTTATGTTGTTGGAATATGTGCCCGTTATCAGGCTGATCCTCAAACGTCACATTTAGAGGTTGTTAAACGTATTCTTAAGTATGTTCA
TGGGACAAGTAACTTTGAAATTATGTATTCCTATGACACGACTTATAATTTGGTTGGATATTGTGATGTTGATTGGGCAGGTTCTTCTGATGATAGGAAGAGCACCTTTA
AAGGTACACATACACCAAATATCTCTATGACTCCTTTATCTAACATGGACTCAGATGATTTGGATAATGTCCCTTTAGCTCATTTGTTGAAGAAGACTAATGTCCCAGAG
GTTACTGATGAAATACCAGTGGCTCCTTCTGTGTCTGCTCATTCTCAAGAGAGCTCGTCAACTGAGGGAGTATTTTTTCCTACTCTTAGTATTGCTCCGGCTTCTAATGT
TCAACCTGGATCGTCAGTGTGTTCCCCTCCTTCTGTGTCACTTGCCTTTGCACCAGATGATGTCCATGCATATGTTCTTGATAATGTTCCTAGTGATGTTTCTACTACAT
CTGAAGGGCAAACTGACGTTCAAAGTAATGAGAATGAGGTGGACCCTTCAAACCCTGCTGTGTGCGCTGATGAAGTTCCCACAAATGCTGATGATAGTCTTGCTGTTCCA
CCAAGTTCTTCTGAAGTCCCGGTTGCACTAAAGCCAGTGAAGAGGAAATCACAACAAAATCGACGCAATAAAACCACCAAAATAGGTAGAAAGAAGGTTCCTCCTAATAT
TCCATCTGTTCCCATTGATGGAATCTTGTTTCATCATGAGGAGAATGTTCAGTGTTGGAAGTTTGTGGTTCAACGGTGGTTAGCTGATGAGGTAAATGTTTCTGACAAAC
ATCAATCGTGCATGAGTATCATGGACCTCATTGAAAGGGCAGGTTTAGCAAAAACTATTTTGAATGTTGGTCCGTTTTATCCTCAGCTTATTAGGGAGTTTATAATCAAT
TTGCCTAATGAATTTAGTGATCCAAGTAATGTTGTTGATATTGACTGTTCTCCATCGTCTCCCTCCACTGATGTTCTGGCTTCTGTATTATCTGGTGGAACTTTATCTAC
GTGGCCTGTAAATGGAATCTTTGCAGTCGCTCTTAGCATTAAATATGCCATTATGCATAAGATTGGTATTGCTAATTGGTTCCCCTCCTCTCATGCCTCCAATGTATCTA
CAGCCTTAGGAACTTTTTTGTATCAAATTTGTAATGACAATAAGGTGGATACATGTGCTTTTATATATAATCAGCAGTTAAGGCATGTTGGTTCGTTTAGGGTAAAGGTT
TCTATTGCTCTCCCGCGATTTTTCTCTAGTCTACTACTTCACTTAAATGGTGCCGTGTTAACTGCATCAGATGCTCCTCGACCTAATCCTAAAACATTATCACTCAGTTA
CAGACTTTTTCAGGGCAGTCATGTGCCTGATATTGATCATGATGTGCATCTGTCTTGCGGCTCATGCATTTTCGATAGAAGTGACTGGGATGAGTCTGATGAAGGCTTCT
TTGTTGATAGGAAGTTGGCTTCTTGTATTGTTAATTCACTTACTGCTGAATCTCGTGCTTTGTCTACTTCTATCAATCTGTTGTCTGAACGGTGCTTAGAGGTGGATAGC
TCTCATTCGTCATTTAAAGATTTTGCCCCATTTACCAATCGAAGGGAAAATGGGCCGTTTCTATACATTATTCTGTGTGGAAAAAGTGTAGAAGTGAATGGTCTACGGCT
GAAGGTTTTTGAAGGATTCACAGAAGCTGCTGAAACTAAAAAGATAGAAATCAGTCAAATAAAGCAGAGAGTGGGTGATGAATTTGAAATCAATGATTTAGAAAATCTAA
AATATTTCCTTGGAATAGAGGTAACCAGCTCTAAAGAAGGTATCTCCATGTCTAAGAGAAAATACACCCTTTATTTACTAACTGAGACAGGAAACTTTGATGATCAACTT
CCAGTTGATCAAGAACAATATCAATGCCTTGTGGGTAAATTGAATTACTTATCCCATACTCATCCTGATATTTCCTTTACTGTGCCAAGAAAGTCAAAGAAGGTGGATAC
AACCCAACAAATTATTGATGAAAATGAAGCACTTCGTAAACGTATTCGAGAGTTGGAACAAAAAGTTCAAAGCATCCCAACATCGGAACATGGTAGTTGCTCCAAGAGCA
AAGTCCAAGAGAAGGAGAAAGTTTTAGAAAATGTAGCAAAAGAGAAGAAAGGGATTGCTACTAGTGTACCACCGGTATCTCTTACATCAAAGAAGAAGGTCGTAGAAGAA
GAGGAAGTGAAAGAGAGGAAGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAGATAATCAGAGAGGGACCATCAGCTTCACGTCCTCCTATACTGGATGGAAAAAATTACTCATATTGGAAGCCTTGTATGATATTCTTTATTAAAACATTAGATGG
AAAAGCGTGGAGAGTCCTTGTTGCTGGTTATAAACCCCCAATGGTTATTGTGGATGGAGTGTCTGTACCAAAACCTGAGGTTGATTGGACAGATGCTGAAGAACAGGCAT
CGGTTGGGAATGCTAGAGCCCTAAATGCTATATTCAATGGTGTGGATCTAAATGTGTTCAAACTTATAAACTCTTGTACTACAGCTAAAGAAGCATGGAAAATACTGGAA
GTTGCCTATGAAGGTACTTCTAAAGTTAAAATATCCAGACTGCAGTTGATAACTTCGAAACTCGAAGCTTTAAAAATGATAGAAGATGAATCAATTTCAGAGTATAATGA
GAGGGTTCTGGAGATTGCTCATGATTCACTACTGCTTGGTGAAAAGATATCTGAATCTAAAATTGTTCGAAAGGTGTTACGTTCTCTACCAAGGAAATTTGACATGAAGG
TTACTGCCATAGAAGAAGCTCAGGATATAACCACTTTGGAACTTGATGAGTTATTTGGGTCACTACTCACATTTAAAATGACTATATCTGATAGAGAAAGCAAGAAAGAC
AAGGGGGTCGCTTTTAAATCTGCATATGAACAGAAGACTCCAGTGAATCAATCTGATAATGAAGTCAATCCAGATGAGTCAATAGCTCTCTTGACTAAACAATGCTCTAA
AATGGAAGTTCAAAAGTATGAATACGGTTGGACCAACTGCAAAACCTGGAAGAAAAATGGTGAGAACTTTACAAGAAAGGCAGATGAACTCTCAAACAGGAGAAATGACA
ACTACGGAAAGAAGAAAGAGGAAGTAAGGAGGTCTTTCAGATGTAGAGAGTGTGAAGGTTTCGGTCATTATCAAGTTGAATGTCCCATGTTTCTAAGAAGACAAAAGAAA
AATTATTATGCTACCTTGTTAGATGAAGACTCTAATGATAATGAAGTTGATCATGGATTGAATGCCTTCACAACATGCATTACAGAAATCAATCTTGATAATAGTGAGTG
TTTTGATAAAGATGAGGATGAGGATCTAACGTTTGAAGAACTCAAGATGTTAAGTCACTCAGAAGCTAGAGCTATACAAAAAGAAAGAATTCAAGATTTGATGGAAGAAA
ATGAACGATTGATGGGTGTCATATCATCTCTAAAAATAAAATTGAAAGAGGTTCAAGGTAAGTATAATCAGACAATTAAGTCTGTAAAGATGCTGAACTCAGGAACAGAG
AACTTAGACTCAATACTAAACTCAAGACAGAATAGTTCAAGTAAATATGGTCTTGGGTTTGATGCTTCCATAAGGAATGTTAAGTCTACATCTAAAGTGAAATTTGTTCC
TGCTGCAGTAAAAGCAAAAACTGAAACAACTTGTACAATTGCTATTTCAAATCCATCTACTAGATCTTTCCGATGGACCTGTTATTACTGTGGTCGGAAAGGTCATATTA
GACCATTTTGCTACAAGTTACAAAGGGACAAAAGGTATCAGCAGAAGGCAGAATTTGAAAGTCATAAGAATAAGTCTAGTTTTGCTAAACACAACAGAAGTATAAGGAAA
ACTCACATGGTTTTGAGAGTTAAAACATCTGACAATTGCAAGGTTGCATTTACAACTATTCAAACCACAAATGATGCTTGGTACTTTGATAGTGGATGTTCAAGGCATAT
GACTGGTGATAGGTCATTCTTTTCTAAATTAAAGGAATGTGCCTCAAGATATGTTACTTTTGGTAATGGTGCCAAAGGAAGAATTATTGCCAAAGAAAATATTGATAAAA
ATAATCTACCCTGTCTAAGTGATGTTAGATATGTGGATAGATTAAAAGGAATCTTGATTAGTAAGCTGTTAACACAGCGTGTCACATTCACAACAGGATTACTACTTGAT
TTGGAACGACTGTCACTTATGGAAACGATCAATGTTGTGGATGATGAGACTGTAAATATACCTGTGGATAATTCGCTGTGTCCTGTGGAGGTACCTAAAACTGATGCTTT
AATAGATGATGCTAGTATGAACTCAAAAAAGATATCTAAGGAAGTTATAGCTGATAATCCTGAGTTTGTTCCTTCTGCACATGTGAAGAAAAATCATCTATCAAGTTCTA
TAATAGGTGATCCTGTGGCTGGAATTATTACTCGAAAGAAAGAGAAAGTAGATTACTTGAAGATGATTGCTGACTTATGTTATACGTCTGATATTGAACCCTCAACTGTT
GACATTGCTCTTAAAGATGAATATTGGATTAACGCAATGCAAGAAGAACTACTCCAATTCAGGCAAAACAATGTCTGGACATTGGTCGAGGGGGTTAACTTTGATGAAAC
ATTTGCACCTGTTGCCAGACTTGAAGCTATTCGCTTGTTACTCAGAGTATCTTATATTCAAAATTTTAAATTATATCAAATGGATGTAAAAAGTGCTTTCTTGAATAGAT
ACTTAAATGAAGAAGTCTATGTTACTCAACCTAAGGGTTTTGTTGATTCCGAACATCCTCAACATGTGTATACGCTTAATAAAGCACTATATGGGCTTAAGCAAGCTCCT
AAAGCTTGTCAACCTGACATAACTTATGTTGTTGGAATATGTGCCCGTTATCAGGCTGATCCTCAAACGTCACATTTAGAGGTTGTTAAACGTATTCTTAAGTATGTTCA
TGGGACAAGTAACTTTGAAATTATGTATTCCTATGACACGACTTATAATTTGGTTGGATATTGTGATGTTGATTGGGCAGGTTCTTCTGATGATAGGAAGAGCACCTTTA
AAGGTACACATACACCAAATATCTCTATGACTCCTTTATCTAACATGGACTCAGATGATTTGGATAATGTCCCTTTAGCTCATTTGTTGAAGAAGACTAATGTCCCAGAG
GTTACTGATGAAATACCAGTGGCTCCTTCTGTGTCTGCTCATTCTCAAGAGAGCTCGTCAACTGAGGGAGTATTTTTTCCTACTCTTAGTATTGCTCCGGCTTCTAATGT
TCAACCTGGATCGTCAGTGTGTTCCCCTCCTTCTGTGTCACTTGCCTTTGCACCAGATGATGTCCATGCATATGTTCTTGATAATGTTCCTAGTGATGTTTCTACTACAT
CTGAAGGGCAAACTGACGTTCAAAGTAATGAGAATGAGGTGGACCCTTCAAACCCTGCTGTGTGCGCTGATGAAGTTCCCACAAATGCTGATGATAGTCTTGCTGTTCCA
CCAAGTTCTTCTGAAGTCCCGGTTGCACTAAAGCCAGTGAAGAGGAAATCACAACAAAATCGACGCAATAAAACCACCAAAATAGGTAGAAAGAAGGTTCCTCCTAATAT
TCCATCTGTTCCCATTGATGGAATCTTGTTTCATCATGAGGAGAATGTTCAGTGTTGGAAGTTTGTGGTTCAACGGTGGTTAGCTGATGAGGTAAATGTTTCTGACAAAC
ATCAATCGTGCATGAGTATCATGGACCTCATTGAAAGGGCAGGTTTAGCAAAAACTATTTTGAATGTTGGTCCGTTTTATCCTCAGCTTATTAGGGAGTTTATAATCAAT
TTGCCTAATGAATTTAGTGATCCAAGTAATGTTGTTGATATTGACTGTTCTCCATCGTCTCCCTCCACTGATGTTCTGGCTTCTGTATTATCTGGTGGAACTTTATCTAC
GTGGCCTGTAAATGGAATCTTTGCAGTCGCTCTTAGCATTAAATATGCCATTATGCATAAGATTGGTATTGCTAATTGGTTCCCCTCCTCTCATGCCTCCAATGTATCTA
CAGCCTTAGGAACTTTTTTGTATCAAATTTGTAATGACAATAAGGTGGATACATGTGCTTTTATATATAATCAGCAGTTAAGGCATGTTGGTTCGTTTAGGGTAAAGGTT
TCTATTGCTCTCCCGCGATTTTTCTCTAGTCTACTACTTCACTTAAATGGTGCCGTGTTAACTGCATCAGATGCTCCTCGACCTAATCCTAAAACATTATCACTCAGTTA
CAGACTTTTTCAGGGCAGTCATGTGCCTGATATTGATCATGATGTGCATCTGTCTTGCGGCTCATGCATTTTCGATAGAAGTGACTGGGATGAGTCTGATGAAGGCTTCT
TTGTTGATAGGAAGTTGGCTTCTTGTATTGTTAATTCACTTACTGCTGAATCTCGTGCTTTGTCTACTTCTATCAATCTGTTGTCTGAACGGTGCTTAGAGGTGGATAGC
TCTCATTCGTCATTTAAAGATTTTGCCCCATTTACCAATCGAAGGGAAAATGGGCCGTTTCTATACATTATTCTGTGTGGAAAAAGTGTAGAAGTGAATGGTCTACGGCT
GAAGGTTTTTGAAGGATTCACAGAAGCTGCTGAAACTAAAAAGATAGAAATCAGTCAAATAAAGCAGAGAGTGGGTGATGAATTTGAAATCAATGATTTAGAAAATCTAA
AATATTTCCTTGGAATAGAGGTAACCAGCTCTAAAGAAGGTATCTCCATGTCTAAGAGAAAATACACCCTTTATTTACTAACTGAGACAGGAAACTTTGATGATCAACTT
CCAGTTGATCAAGAACAATATCAATGCCTTGTGGGTAAATTGAATTACTTATCCCATACTCATCCTGATATTTCCTTTACTGTGCCAAGAAAGTCAAAGAAGGTGGATAC
AACCCAACAAATTATTGATGAAAATGAAGCACTTCGTAAACGTATTCGAGAGTTGGAACAAAAAGTTCAAAGCATCCCAACATCGGAACATGGTAGTTGCTCCAAGAGCA
AAGTCCAAGAGAAGGAGAAAGTTTTAGAAAATGTAGCAAAAGAGAAGAAAGGGATTGCTACTAGTGTACCACCGGTATCTCTTACATCAAAGAAGAAGGTCGTAGAAGAA
GAGGAAGTGAAAGAGAGGAAGTGA
Protein sequenceShow/hide protein sequence
MEIIREGPSASRPPILDGKNYSYWKPCMIFFIKTLDGKAWRVLVAGYKPPMVIVDGVSVPKPEVDWTDAEEQASVGNARALNAIFNGVDLNVFKLINSCTTAKEAWKILE
VAYEGTSKVKISRLQLITSKLEALKMIEDESISEYNERVLEIAHDSLLLGEKISESKIVRKVLRSLPRKFDMKVTAIEEAQDITTLELDELFGSLLTFKMTISDRESKKD
KGVAFKSAYEQKTPVNQSDNEVNPDESIALLTKQCSKMEVQKYEYGWTNCKTWKKNGENFTRKADELSNRRNDNYGKKKEEVRRSFRCRECEGFGHYQVECPMFLRRQKK
NYYATLLDEDSNDNEVDHGLNAFTTCITEINLDNSECFDKDEDEDLTFEELKMLSHSEARAIQKERIQDLMEENERLMGVISSLKIKLKEVQGKYNQTIKSVKMLNSGTE
NLDSILNSRQNSSSKYGLGFDASIRNVKSTSKVKFVPAAVKAKTETTCTIAISNPSTRSFRWTCYYCGRKGHIRPFCYKLQRDKRYQQKAEFESHKNKSSFAKHNRSIRK
THMVLRVKTSDNCKVAFTTIQTTNDAWYFDSGCSRHMTGDRSFFSKLKECASRYVTFGNGAKGRIIAKENIDKNNLPCLSDVRYVDRLKGILISKLLTQRVTFTTGLLLD
LERLSLMETINVVDDETVNIPVDNSLCPVEVPKTDALIDDASMNSKKISKEVIADNPEFVPSAHVKKNHLSSSIIGDPVAGIITRKKEKVDYLKMIADLCYTSDIEPSTV
DIALKDEYWINAMQEELLQFRQNNVWTLVEGVNFDETFAPVARLEAIRLLLRVSYIQNFKLYQMDVKSAFLNRYLNEEVYVTQPKGFVDSEHPQHVYTLNKALYGLKQAP
KACQPDITYVVGICARYQADPQTSHLEVVKRILKYVHGTSNFEIMYSYDTTYNLVGYCDVDWAGSSDDRKSTFKGTHTPNISMTPLSNMDSDDLDNVPLAHLLKKTNVPE
VTDEIPVAPSVSAHSQESSSTEGVFFPTLSIAPASNVQPGSSVCSPPSVSLAFAPDDVHAYVLDNVPSDVSTTSEGQTDVQSNENEVDPSNPAVCADEVPTNADDSLAVP
PSSSEVPVALKPVKRKSQQNRRNKTTKIGRKKVPPNIPSVPIDGILFHHEENVQCWKFVVQRWLADEVNVSDKHQSCMSIMDLIERAGLAKTILNVGPFYPQLIREFIIN
LPNEFSDPSNVVDIDCSPSSPSTDVLASVLSGGTLSTWPVNGIFAVALSIKYAIMHKIGIANWFPSSHASNVSTALGTFLYQICNDNKVDTCAFIYNQQLRHVGSFRVKV
SIALPRFFSSLLLHLNGAVLTASDAPRPNPKTLSLSYRLFQGSHVPDIDHDVHLSCGSCIFDRSDWDESDEGFFVDRKLASCIVNSLTAESRALSTSINLLSERCLEVDS
SHSSFKDFAPFTNRRENGPFLYIILCGKSVEVNGLRLKVFEGFTEAAETKKIEISQIKQRVGDEFEINDLENLKYFLGIEVTSSKEGISMSKRKYTLYLLTETGNFDDQL
PVDQEQYQCLVGKLNYLSHTHPDISFTVPRKSKKVDTTQQIIDENEALRKRIRELEQKVQSIPTSEHGSCSKSKVQEKEKVLENVAKEKKGIATSVPPVSLTSKKKVVEE
EEVKERK