CuGenDBv2

Sgr014443 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr014443
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Integrase catalytic domain-containing protein
Genome location: tig00000589:412608..432231
RNA-Seq Expression: Sgr014443
Synteny: Sgr014443
Gene Ontology terms:
  GO:0015074 - DNA integration (biological process)
  GO:0045037 - protein import into chloroplast stroma (biological process)
  GO:0009706 - chloroplast inner membrane (cellular component)
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0003824 - catalytic activity (molecular function)
  GO:0008320 - protein transmembrane transporter activity (molecular function)
InterPro domains:
  IPR001584 - Integrase, catalytic core
  IPR012337 - Ribonuclease H-like superfamily
  IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
  IPR025640 - GYF domain 2
  IPR036397 - Ribonuclease H superfamily
  IPR037471 - Protein TIC56
  IPR043502 - DNA/RNA polymerase superfamily
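
The genome location in this record is written as contig:start..end. As a minimal sketch for readers who want to work with that field programmatically, the Python below splits it into its parts and computes the span length. It assumes the coordinates are 1-based and inclusive, which this page does not state; the names GenomeLocation and parse_location are hypothetical helpers, not part of any CuGenDB interface.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """A contig name plus start/end coordinates (assumed 1-based, inclusive)."""
    contig: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive coordinates: both end points count toward the span.
        return self.end - self.start + 1


# Matches strings such as "tig00000589:412608..432231".
_LOCATION_RE = re.compile(r"^(?P<contig>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> GenomeLocation:
    """Parse a 'contig:start..end' string (hypothetical helper)."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"not a contig:start..end location: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError(f"start is greater than end in {text!r}")
    return GenomeLocation(match["contig"], start, end)


loc = parse_location("tig00000589:412608..432231")
print(loc.contig, loc.start, loc.end, loc.length)
# prints: tig00000589 412608 432231 19624
```

Under the inclusive-coordinate assumption, the locus spans 19,624 bp on contig tig00000589.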


Homology
GenBank top hits (e value, % identity, alignment)
KAA0068025.1 Cysteine-rich RLK (receptor-like protein kinase) 8 [Cucumis melo var. makuwa]; e value: 1.6e-309; identity: 59.82%
Query:  QSVGEPQRTRRLHNVRSIKSEACVIESMHDNIDTCGICPLAKQKRLSFTSNNSLSAKAFDLVHADIWGPFSSPTYAGHSYFLTLVDDSTRYTWIFLLKHK
        Q +G P        + S++    +  S  D +  C ICPLAKQ++LSFTSNN LS+ AFDL+H DIWGPFS+ TY+ +SYFLT+VDD+TRYTW+F+LK K
Subjt:  QSVGEPQRTRRLHNVRSIKSEACVIESMHDNIDTCGICPLAKQKRLSFTSNNSLSAKAFDLVHADIWGPFSSPTYAGHSYFLTLVDDSTRYTWIFLLKHK

Query:  SDVLSVVPQFFKLVETQFGCCIKQFRSDNAPELAFGDFFRSKGVIHQFSCVERPQQNSVVERKHQHILNVARALYFQSRVPIRFWGDCVLTAVYLINRTP
        SDV+S++P FFKL+ETQ+G  IK+ RSDNA +L F  FF  KGVIHQ+SCV+ PQQNSVVE+KHQHILN ARALYFQS+VP+ FWGDC++TAVYLI+RTP
Subjt:  SDVLSVVPQFFKLVETQFGCCIKQFRSDNAPELAFGDFFRSKGVIHQFSCVERPQQNSVVERKHQHILNVARALYFQSRVPIRFWGDCVLTAVYLINRTP

Query:  SPLLKWCTPFELLNGSKADYSSIRVFGCLCFASTLSSHRSKFHPRAIPAVFLGYPPGMKAYKLYDIEQQKLFISRDVIFHEEIFPFHTVTEASVLPDPFP
        S LL+W  PF+ LN    DY+S++VFG LC+AS+L  + SKF PRAIP+VF+GYP GMK YKLYDIE +K+FISRD                        
Subjt:  SPLLKWCTPFELLNGSKADYSSIRVFGCLCFASTLSSHRSKFHPRAIPAVFLGYPPGMKAYKLYDIEQQKLFISRDVIFHEEIFPFHTVTEASVLPDPFP

Query:  GFVLPKPFNVSTEVEPPSLSYNQASASAQVSSPSSFLPDISPVIHTTQQPLMDAHNIVPQQPPLIDPPLVRRSARISKPPSYLHAYHCSLLSSTSLPT-T
              P N     E   +  N+ + S Q                              +QP   +   +RRS RI+KPPSYL AYHCSLL++ S PT  
Subjt:  GFVLPKPFNVSTEVEPPSLSYNQASASAQVSSPSSFLPDISPVIHTTQQPLMDAHNIVPQQPPLIDPPLVRRSARISKPPSYLHAYHCSLLSSTSLPT-T

Query:  SSRFPLQQFISYSRLSSTHRNLILNVSTHFEPQFYHQAVVFDHWRAAMDAELAAMEDNKTWSVVPLPHGKHTIGCRWIYKIKHKADGSIERYKARLVAKG
        S+++P+ Q++SY  L  T++  IL  ST  E  FYH+AVV   W  AM AEL AME N+TWS+VPLP GK++IGCRW+YKIKHKADGSIERYK RLVAKG
Subjt:  SSRFPLQQFISYSRLSSTHRNLILNVSTHFEPQFYHQAVVFDHWRAAMDAELAAMEDNKTWSVVPLPHGKHTIGCRWIYKIKHKADGSIERYKARLVAKG

Query:  YTQQEGLDFLETFSPVAKLVTVKTLLTVAVSKGWSLMQLDVNNAFLHGDLFEEVYMDLPLGYNSHVDSKGEHLVCKLHKSIYGLKQASRQWFAKFSHFLI
        YTQQEGLD+ ETFS V K+VTVKTLLT+AVSK W L+QLDVNNAFLHG+LFEEVYMDLPLGY    + +GE LVC+LHKSIYGLKQASRQWF KFS FL+
Subjt:  YTQQEGLDFLETFSPVAKLVTVKTLLTVAVSKGWSLMQLDVNNAFLHGDLFEEVYMDLPLGYNSHVDSKGEHLVCKLHKSIYGLKQASRQWFAKFSHFLI

Query:  SLGFFQSKADYSLFVKGTGNSFVALLVYVDDIIITGSNASTIQHLKDLLNAHFKLKDLGSLKYFLGLELTRSSTGIFVSQRHYTLQLLEDTGFLGCKPTS
        SLGF QSKA+YSLF++G  +SF+ALLVYVDDIIITG+NA  IQ LK  LN  F LKDLG LK+FLGLEL R+S+G+F+SQ++YTLQL+EDTG LG KPT 
Subjt:  SLGFFQSKADYSLFVKGTGNSFVALLVYVDDIIITGSNASTIQHLKDLLNAHFKLKDLGSLKYFLGLELTRSSTGIFVSQRHYTLQLLEDTGFLGCKPTS

Query:  VPMDPQLKLCSSNVDLLADPSIYRRLIGRLLYLTISRPDITFAVHKLSQFMSKPCKSHLAAAHDLLRYLK------------------------------
        VPMDP  KL +S+ D+L D + YRRLIGRLLYLTISRPDITFAVHKLSQFM+KP  +H+ AA+ L++YLK                              
Subjt:  VPMDPQLKLCSSNVDLLADPSIYRRLIGRLLYLTISRPDITFAVHKLSQFMSKPCKSHLAAAHDLLRYLK------------------------------

Query:  --RSITGFCVFLGDSLVSWKSKKQQTISRSSAEAEYRALAVTACEIIWLTNLLRDLQVQVSLPALLFCDNQAAVHIASNPVFHERTKHIELDCHFIRDKL
          RS+TGFCVFLG+SLVSWKSKKQQT++RSSAEAEY+AL VT CE+IWL +LL +L+++   PALLFCDNQAA++IA+NP+FHE+TKHIELDCHF+RD++
Subjt:  --RSITGFCVFLGDSLVSWKSKKQQTISRSSAEAEYRALAVTACEIIWLTNLLRDLQVQVSLPALLFCDNQAAVHIASNPVFHERTKHIELDCHFIRDKL

Query:  IDGTIKLLPVR
        IDG+IKLLPVR
Subjt:  IDGTIKLLPVR

KZV25004.1 Cysteine-rich RLK (receptor-like protein kinase) 8 [Dorcoceras hygrometricum]; e value: 1.3e-274; identity: 53.29%
Query:  DNIDTCGICPLAKQKRLSFTSNNSLSAKAFDLVHADIWGPFSSPTYAGHSYFLTLVDDSTRYTWIFLLKHKSDVLSVVPQFFKLVETQFGCCIKQFRSDN
        D ++ C  C L+KQ+RL   S N++SA+ F+L+H D WGPFS  +  G  +F T+VDD +RYTW+++LK KSDVLS+ P F ++V TQFG  +K  RSDN
Subjt:  DNIDTCGICPLAKQKRLSFTSNNSLSAKAFDLVHADIWGPFSSPTYAGHSYFLTLVDDSTRYTWIFLLKHKSDVLSVVPQFFKLVETQFGCCIKQFRSDN

Query:  APELAFGDFFRSKGVIHQFSCVERPQQNSVVERKHQHILNVARALYFQSRVPIRFWGDCVLTAVYLINRTPSPLLKWCTPFELLNGSKADYSSIRVFGCL
        APEL F DFF   G+ H  SCVERPQQNSVVERKHQHILNVARAL FQS +P+ +W DC+ T+VYLINRTPSP+L   TPFELL+G    YS ++VFGCL
Subjt:  APELAFGDFFRSKGVIHQFSCVERPQQNSVVERKHQHILNVARALYFQSRVPIRFWGDCVLTAVYLINRTPSPLLKWCTPFELLNGSKADYSSIRVFGCL

Query:  CFASTLSSHRSKFHPRAIPAVFLGYPPGMKAYKLYDIEQQKLFISRDVIFHEEIFPFHTVTEASVLPDPFPGFVLPKPFNVSTEVEPPSLSYNQASASAQ
        C+ASTL S R KF PRAI  VF+GYPPG K YKL ++E  ++FISRDVIFHE  FP+   +  S+              +++ EV P S       A AQ
Subjt:  CFASTLSSHRSKFHPRAIPAVFLGYPPGMKAYKLYDIEQQKLFISRDVIFHEEIFPFHTVTEASVLPDPFPGFVLPKPFNVSTEVEPPSLSYNQASASAQ

Query:  VSSPSSFLPDISPVIHTTQQPLMDAHNIVPQQPPLIDPPLVRRSARISKPPSYLHAYHCSLLSSTSLPTTSSRFPLQQFISYSRLSSTHRNLILNVSTHF
          S                                       R++R    PS+L  YHC  +S+    +TS+  P+   ++YS+LSS+HR  + N+S+  
Subjt:  VSSPSSFLPDISPVIHTTQQPLMDAHNIVPQQPPLIDPPLVRRSARISKPPSYLHAYHCSLLSSTSLPTTSSRFPLQQFISYSRLSSTHRNLILNVSTHF

Query:  EPQFYHQAVVFDHWRAAMDAELAAMEDNKTWSVVPLPHGKHTIGCRWIYKIKHKADGSIERYKARLVAKGYTQQEGLDFLETFSPVAKLVTVKTLLTVAV
        EP  + QAV    WR AMD EL A+E N TWS+V LP GK  +GCRW+YK K  ADGS++RYKARLVAKGYTQQEGLD+LETFSPVAKLVTV+TLL +A 
Subjt:  EPQFYHQAVVFDHWRAAMDAELAAMEDNKTWSVVPLPHGKHTIGCRWIYKIKHKADGSIERYKARLVAKGYTQQEGLDFLETFSPVAKLVTVKTLLTVAV

Query:  SKGWSLMQLDVNNAFLHGDLFEEVYMDLPLGYNSHVDSKGEHLVCKLHKSIYGLKQASRQWFAKFSHFLISLGFFQSKADYSLFVKGTGNSFVALLVYVD
         +GW L+QLDVNNAFLHGDL EEVYM LP G+ S  +      VCKLHKSIYGLKQASRQWFAKFS  L+S+GF QS AD SLF++   N F+AL+VYVD
Subjt:  SKGWSLMQLDVNNAFLHGDLFEEVYMDLPLGYNSHVDSKGEHLVCKLHKSIYGLKQASRQWFAKFSHFLISLGFFQSKADYSLFVKGTGNSFVALLVYVD

Query:  DIIITGSNASTIQHLKDLLNAHFKLKDLGSLKYFLGLELTRSSTGIFVSQRHYTLQLLEDTGFLGCKPTSVPMDPQLKLCSSNVDLLADPSIYRRLIGRL
        DI+I  ++ +    LKD LN+ FKLKDLG+LKYFLG+E+ RS+ G+ + QR+Y + LL + G LGCKP + PM+   KL   + ++L+DP+ YRRLIGRL
Subjt:  DIIITGSNASTIQHLKDLLNAHFKLKDLGSLKYFLGLELTRSSTGIFVSQRHYTLQLLEDTGFLGCKPTSVPMDPQLKLCSSNVDLLADPSIYRRLIGRL

Query:  LYLTISRPDITFAVHKLSQFMSKPCKSHLAAAHDLLRYLK--------------------------------RSITGFCVFLGDSLVSWKSKKQQTISRS
        LYLTI+RPD+ FAV+KLSQ++S P   H+ AA ++L+Y+K                                RS+TG+CVFLG+SL+SW++KKQQT+SRS
Subjt:  LYLTISRPDITFAVHKLSQFMSKPCKSHLAAAHDLLRYLK--------------------------------RSITGFCVFLGDSLVSWKSKKQQTISRS

Query:  SAEAEYRALAVTACEIIWLTNLLRDLQVQVSLPALLFCDNQAAVHIASNPVFHERTKHIELDCHFIRDKLIDGTIKLLPVRSHSQLADPFTKPLSAALLF
        SAEAEYR+LA + CEI+W+  LL DL V  + P +LFCD+QAAVHIASNPVFHERTKHI++DCH +R+K+    +KL+ V S  QLAD FTKPL  +   
Subjt:  SAEAEYRALAVTACEIIWLTNLLRDLQVQVSLPALLFCDNQAAVHIASNPVFHERTKHIELDCHFIRDKLIDGTIKLLPVRSHSQLADPFTKPLSAALLF

Query:  PLLSKMGVLDIH
         LL KMGV +IH
Subjt:  PLLSKMGVLDIH

KZV53534.1 hypothetical protein F511_42283 [Dorcoceras hygrometricum]; e value: 1.9e-273; identity: 52.62%
Query:  ESMHDNIDTCGICPLAKQKRLSFTSNNSLSAKAFDLVHADIWGPFSSPTYAGHSYFLTLVDDSTRYTWIFLLKHKSDVLSVVPQFFKLVETQFGCCIKQF
        +S   +I+ C +C ++KQKRL F S+N  +A +F+L+H D+WGPFS  +  G+ +FLT+VDD TR+TW+++L+ KS+V S+ P F ++V+TQFG  IK  
Subjt:  ESMHDNIDTCGICPLAKQKRLSFTSNNSLSAKAFDLVHADIWGPFSSPTYAGHSYFLTLVDDSTRYTWIFLLKHKSDVLSVVPQFFKLVETQFGCCIKQF

Query:  RSDNAPELAFGDFFRSKGVIHQFSCVERPQQNSVVERKHQHILNVARALYFQSRVPIRFWGDCVLTAVYLINRTPSPLLKWCTPFELLNGSKADYSSIRV
        RSDNAP+L F + F   G++H +SCVERPQQNS+VERKHQHILNVARAL FQS VPI +W DC++T++YLINRTPS +L   TPFELL+G    YS +++
Subjt:  RSDNAPELAFGDFFRSKGVIHQFSCVERPQQNSVVERKHQHILNVARALYFQSRVPIRFWGDCVLTAVYLINRTPSPLLKWCTPFELLNGSKADYSSIRV

Query:  FGCLCFASTLSSHRSKFHPRAIPAVFLGYPPGMKAYKLYDIEQQKLFISRDVIFHEEIFPFHTVTEASVLPDPFPGFVLPKPFNVSTEVEPPSLSYNQAS
        FGCLC+ASTL S R KF PRAI  VFLGYPPG + YKL +++  ++ IS DVIFHE  FPF     +    D  P ++                S N   
Subjt:  FGCLCFASTLSSHRSKFHPRAIPAVFLGYPPGMKAYKLYDIEQQKLFISRDVIFHEEIFPFHTVTEASVLPDPFPGFVLPKPFNVSTEVEPPSLSYNQAS

Query:  ASAQVSSPSSFLPDISPVIHTTQQPLMDAHNIVPQQPPLIDPPLVRRSARISKPPSYLHAYHCSLLSSTSLPTTSSRFPLQQFISYSRLSSTHRNLILNV
          +Q+++ S  +PD  P+   ++Q                      RS RI +PP +L  YHC + SS+  P+TS+  PL  F++YS+LS  HRNL+ N+
Subjt:  ASAQVSSPSSFLPDISPVIHTTQQPLMDAHNIVPQQPPLIDPPLVRRSARISKPPSYLHAYHCSLLSSTSLPTTSSRFPLQQFISYSRLSSTHRNLILNV

Query:  STHFEPQFYHQAVVFDHWRAAMDAELAAMEDNKTWSVVPLPHGKHTIGCRWIYKIKHKADGSIERYKARLVAKGYTQQEGLDFLETFSPVAKLVTVKTLL
        S+  EP  + QAV    W+ AM  EL A+E N TWS+V LP GK  +GCRW+YK K  ADGS++RYKARLVAKGYTQQEGLD+LETFSPVAK+VTV+TLL
Subjt:  STHFEPQFYHQAVVFDHWRAAMDAELAAMEDNKTWSVVPLPHGKHTIGCRWIYKIKHKADGSIERYKARLVAKGYTQQEGLDFLETFSPVAKLVTVKTLL

Query:  TVAVSKGWSLMQLDVNNAFLHGDLFEEVYMDLPLGYNSHVDSKGEHLVCKLHKSIYGLKQASRQWFAKFSHFLISLGFFQSKADYSLFVKGTGNSFVALL
         +A ++GWSL+QL V+NAFLHG+L EEVYM LP GY+S         VCKLHKS+YGLKQASRQWFAKFS  L+S+GF QS AD SLF+K   N F+ LL
Subjt:  TVAVSKGWSLMQLDVNNAFLHGDLFEEVYMDLPLGYNSHVDSKGEHLVCKLHKSIYGLKQASRQWFAKFSHFLISLGFFQSKADYSLFVKGTGNSFVALL

Query:  VYVDDIIITGSNASTIQHLKDLLNAHFKLKDLGSLKYFLGLELTRSSTGIFVSQRHYTLQLLEDTGFLGCKPTSVPMDPQLKLCSSNVDLLADPSIYRRL
        VYVDDI+I  +N      LK  LN  FKLKDLG LKYFLG+E+ RSS GI + QR+Y +  L + G +GC+P S PM+  +K+   + ++L DPS YRRL
Subjt:  VYVDDIIITGSNASTIQHLKDLLNAHFKLKDLGSLKYFLGLELTRSSTGIFVSQRHYTLQLLEDTGFLGCKPTSVPMDPQLKLCSSNVDLLADPSIYRRL

Query:  IGRLLYLTISRPDITFAVHKLSQFMSKPCKSHLAAAHDLLRYLK--------------------------------RSITGFCVFLGDSLVSWKSKKQQT
        IGRLLYLT++RPD+ FAV+KLSQ++SKP   H+ AA ++LRY+K                                RS+TG+CVFLG+S++SW++KKQ T
Subjt:  IGRLLYLTISRPDITFAVHKLSQFMSKPCKSHLAAAHDLLRYLK--------------------------------RSITGFCVFLGDSLVSWKSKKQQT

Query:  ISRSSAEAEYRALAVTACEIIWLTNLLRDLQVQVSLPALLFCDNQAAVHIASNPVFHERTKHIELDCHFIRDKLIDGTIKLLPVRSHSQLADPFTKPLSA
        +SRSSAEAEYR++A   CEI+W+ +LL DL V+   PA LFCD+QAA+HIASNPVFHERTKHI++DCH IR+K+  G +KL+ V S  QLAD FTK L  
Subjt:  ISRSSAEAEYRALAVTACEIIWLTNLLRDLQVQVSLPALLFCDNQAAVHIASNPVFHERTKHIELDCHFIRDKLIDGTIKLLPVRSHSQLADPFTKPLSA

Query:  ALLFPLLSKMGVLDIH
        +    LLSKMG+ +IH
Subjt:  ALLFPLLSKMGVLDIH

XP_022145657.1 protein TIC 56, chloroplastic [Momordica charantia]; e value: 1.2e-288; identity: 93.75%
Query:  MASINFNPFENWFSKRPNPIPPINLIAFRDSLSQKSSTSPNFASTSLSNVFRKPQKPEKAPDEPGYYGKMLEQFYWECDSLPDNRHTPEVEKILNEDPVF
        MASINFNPFENWFS+RPNPIPP+NL AFRDSLSQKSSTSPNFASTSLSN F+K QKP+KA DEPGYYGKMLEQF+WECD+LPD RHTPEVEKILNEDPV 
Subjt:  MASINFNPFENWFSKRPNPIPPINLIAFRDSLSQKSSTSPNFASTSLSNVFRKPQKPEKAPDEPGYYGKMLEQFYWECDSLPDNRHTPEVEKILNEDPVF

Query:  EKKENPTEEELEKNEKLWKALRTSPVVQFLERAEEIAAKYNELELKENENPYRDEDKKLWRAIPHVPGLDGRPMPRKAIKTKRESDDKFWDFARQFFFGL
        + KENP+EEE+EKNEKLWKALR SPVVQFLERAEEIAAKYNELELKENENPYRDEDKKLWRAIPHVPGLDGRPMPRKAIKTKRES+DKFWDFARQFFFGL
Subjt:  EKKENPTEEELEKNEKLWKALRTSPVVQFLERAEEIAAKYNELELKENENPYRDEDKKLWRAIPHVPGLDGRPMPRKAIKTKRESDDKFWDFARQFFFGL

Query:  WGFRQRPYPPGRPIDVAQAIGYKRLEKRYYDFNMRSGGWYYKDRLGRTRGPCELINLKTAWGGGIIDKDTFIWGEDMDEWAPIHMIYGLERAIATWEVRL
        WGFRQRPYPPGRPIDVAQAIGYKRLEKRYYDF MRSGGWYYKDRLGRTRGP ELI LKTAWGGGIIDKDTFIWGEDMDEWAPIHM+YGLERAIATWEVRL
Subjt:  WGFRQRPYPPGRPIDVAQAIGYKRLEKRYYDFNMRSGGWYYKDRLGRTRGPCELINLKTAWGGGIIDKDTFIWGEDMDEWAPIHMIYGLERAIATWEVRL

Query:  GAAATAFLHKLQKGIPPWVPLKGQEKKTYKQLQQEALESKRRDLAVLAANDGVWPGVRIPSHALFLWASGSELTSTLEADHMPNKYIPRDLRYKLAKIIP
        GAAATAFLHKLQKGIPPWVPLKG EKKTYKQLQQEALESKRRDLAVLAANDGVWPGVRIPSHALFLWASGSELTS LEADHMPNKYIP+DLRYKLAKIIP
Subjt:  GAAATAFLHKLQKGIPPWVPLKGQEKKTYKQLQQEALESKRRDLAVLAANDGVWPGVRIPSHALFLWASGSELTSTLEADHMPNKYIPRDLRYKLAKIIP

Query:  GLRPWEVLSVEQAMEQITYNGEWHREPLGTYTTGPPYIRHWNKDVKRMFRIFFNLSTRVYNKMERTIPGFRKIMEKVQADAAARDARRKERREATKRAEY
        GLRPWEVLSVEQAMEQITYNGEWHREPLGTYTTGPPYIRHWNKDVKRMFRIFFNLSTRVYNKMERTIPGF KIM+KVQADAAARDARRKERREA KRAE+
Subjt:  GLRPWEVLSVEQAMEQITYNGEWHREPLGTYTTGPPYIRHWNKDVKRMFRIFFNLSTRVYNKMERTIPGFRKIMEKVQADAAARDARRKERREATKRAEY

Query:  ERAIFGEVTKDQ
        ER IFG V KDQ
Subjt:  ERAIFGEVTKDQ

XP_038905710.1 protein TIC 56, chloroplastic [Benincasa hispida]; e value: 4.4e-286; identity: 93.36%
Query:  MASINFNPFENWFSKRPNPIPPINLIAFRDSLSQKSSTSPNFASTSLSNVFRKPQKPEKAPDEPGYYGKMLEQFYWECDSLPDNRHTPEVEKILNEDPVF
        MASINFNPFENWFS+RPNPIP  NLIAFRDSLSQKSS SPNFAS SLSNVF+K QKPEKA DEPGYYGKMLEQFYWEC++LPD RH PEVEKIL+EDPVF
Subjt:  MASINFNPFENWFSKRPNPIPPINLIAFRDSLSQKSSTSPNFASTSLSNVFRKPQKPEKAPDEPGYYGKMLEQFYWECDSLPDNRHTPEVEKILNEDPVF

Query:  EKKENPTEEELEKNEKLWKALRTSPVVQFLERAEEIAAKYNELELKENENPYRDEDKKLWRAIPHVPGLDGRPMPRKAIKTKRESDDKFWDFARQFFFGL
        E KENPT+EELEKNEKLWK +R SPVV+FLERAEEIAAKYNELELKENENPYRDEDKKLWRAIPHVPGLDGRPMPRKAIKTKRESDDKFWDFARQFFFGL
Subjt:  EKKENPTEEELEKNEKLWKALRTSPVVQFLERAEEIAAKYNELELKENENPYRDEDKKLWRAIPHVPGLDGRPMPRKAIKTKRESDDKFWDFARQFFFGL

Query:  WGFRQRPYPPGRPIDVAQAIGYKRLEKRYYDFNMRSGGWYYKDRLGRTRGPCELINLKTAWGGGIIDKDTFIWGEDMDEWAPIHMIYGLERAIATWEVRL
        WGFRQRPYPPGRPIDVAQAIGYKRLEKRYYDF MRSGGWYYKDRLGRTRGP ELI LKTAWGGGIIDKDTFIWGEDMDEWAPIHM+YGLERAIATWEVRL
Subjt:  WGFRQRPYPPGRPIDVAQAIGYKRLEKRYYDFNMRSGGWYYKDRLGRTRGPCELINLKTAWGGGIIDKDTFIWGEDMDEWAPIHMIYGLERAIATWEVRL

Query:  GAAATAFLHKLQKGIPPWVPLKGQEKKTYKQLQQEALESKRRDLAVLAANDGVWPGVRIPSHALFLWASGSELTSTLEADHMPNKYIPRDLRYKLAKIIP
        GAAATAFLHKLQKGIPPWVPLKGQEKKTYKQLQQEALESKRRDLAVLAANDGVWPGVRIPSHALFLWASGSELTSTLEADHMPNKYIPRDLRYKLAKIIP
Subjt:  GAAATAFLHKLQKGIPPWVPLKGQEKKTYKQLQQEALESKRRDLAVLAANDGVWPGVRIPSHALFLWASGSELTSTLEADHMPNKYIPRDLRYKLAKIIP

Query:  GLRPWEVLSVEQAMEQITYNGEWHREPLGTYTTGPPYIRHWNKDVKRMFRIFFNLSTRVYNKMERTIPGFRKIMEKVQADAAARDARRKERREATKRAEY
        GLRPWEVLSVEQAMEQITYNGEWHREPLGTYTTGPPYIRHWNKDVKRMFR FF+LSTRVYNKMERTIPGF KIMEKVQ DAAAR+ARRKERREA KRAE+
Subjt:  GLRPWEVLSVEQAMEQITYNGEWHREPLGTYTTGPPYIRHWNKDVKRMFRIFFNLSTRVYNKMERTIPGFRKIMEKVQADAAARDARRKERREATKRAEY

Query:  ERAIFGEVTKDQ
        ER IFGEV  D+
Subjt:  ERAIFGEVTKDQ

TrEMBL top hits (e value, % identity, alignment)
A0A2N9EHN7 Integrase catalytic domain-containing protein; e value: 4.5e-276; identity: 55.31%
Query:  DNIDTCGICPLAKQKRLSFTSNNSLSAKAFDLVHADIWGPFSSPTYAGHSYFLTLVDDSTRYTWIFLLKHKSDVLSVVPQFFKLVETQFGCCIKQFRSDN
        ++  TC +CPLAKQKRL F + N LS  +FDL+H DIWGP+  PT  G+ YFLTLVDD TR TWI+L++ KSD   ++  F  +++TQF   IKQ RSDN
Subjt:  DNIDTCGICPLAKQKRLSFTSNNSLSAKAFDLVHADIWGPFSSPTYAGHSYFLTLVDDSTRYTWIFLLKHKSDVLSVVPQFFKLVETQFGCCIKQFRSDN

Query:  APELAFGDFFRSKGVIHQFSCVERPQQNSVVERKHQHILNVARALYFQSRVPIRFWGDCVLTAVYLINRTPSPLLKWCTPFELLNGSKADYSSIRVFGCL
          E    +F+ SKG+IHQ SCVE PQQNSVVERKHQHILNVAR+L FQS +P+++WG C+ TAVYLINR P P+L   +PFE L      Y+ ++VFGCL
Subjt:  APELAFGDFFRSKGVIHQFSCVERPQQNSVVERKHQHILNVARALYFQSRVPIRFWGDCVLTAVYLINRTPSPLLKWCTPFELLNGSKADYSSIRVFGCL

Query:  CFASTLSSHRSKFHPRAIPAVFLGYPPGMKAYKLYDIEQQKLFISRDVIFHEEIFPFHT-------VTEASVLPDPF---PGFVLPKPFNVSTEVEPPSL
        CFASTLSSHR+KF PRA   VFLGYP G+K YKL D+   K+FISRDV+FHE IFPF T        T  +  P+P    P F+ P    ++ ++ P S 
Subjt:  CFASTLSSHRSKFHPRAIPAVFLGYPPGMKAYKLYDIEQQKLFISRDVIFHEEIFPFHT-------VTEASVLPDPF---PGFVLPKPFNVSTEVEPPSL

Query:  SYNQASASAQVSSPSSFLPDISPVIHTTQQPLMDAHNIVPQQPPLIDPPLVRRSARISKPPSYLHAYHCSL---LSSTSLPTTSSR---FPLQQFISYSR
            A   +  +SP  F  DISP +  T        +I    P       +RRS R+ KPP+YL  YHC L   + STS P  +S    +PL   +SY  
Subjt:  SYNQASASAQVSSPSSFLPDISPVIHTTQQPLMDAHNIVPQQPPLIDPPLVRRSARISKPPSYLHAYHCSL---LSSTSLPTTSSR---FPLQQFISYSR

Query:  LSSTHRNLILNVSTHFEPQFYHQAVVFDHWRAAMDAELAAMEDNKTWSVVPLPHGKHTIGCRWIYKIKHKADGSIERYKARLVAKGYTQQEGLDFLETFS
        LS THRN  L+V+   EP  +HQA    HW+ AM AELAA+E N TW++ PLP GKH IGC+W+YK+K K+DGS+ERYKARLVAKGYTQQEGLD+ ETFS
Subjt:  LSSTHRNLILNVSTHFEPQFYHQAVVFDHWRAAMDAELAAMEDNKTWSVVPLPHGKHTIGCRWIYKIKHKADGSIERYKARLVAKGYTQQEGLDFLETFS

Query:  PVAKLVTVKTLLTVAVSKGWSLMQLDVNNAFLHGDLFEEVYMDLPLGYNSHVDSKGE--HLVCKLHKSIYGLKQASRQWFAKFSHFLISLGFFQSKADYS
        PVAK  TV+TLL VA +K WSL QLDVNNAFLHGDL EEVYM LPLG+     SKGE  +LVCKL+KS+YGLKQASRQWFAKFS  +I  GF QS +DYS
Subjt:  PVAKLVTVKTLLTVAVSKGWSLMQLDVNNAFLHGDLFEEVYMDLPLGYNSHVDSKGE--HLVCKLHKSIYGLKQASRQWFAKFSHFLISLGFFQSKADYS

Query:  LFVKGTGNSFVALLVYVDDIIITGSNASTIQHLKDLLNAHFKLKDLGSLKYFLGLELTRSSTGIFVSQRHYTLQLLEDTGFLGCKPTSVPMDPQLKLCSS
        LF +  G  F+ALLVYVDDI+I  ++  ++  LKD L+A FKLKDLG+LK+FLGLE+ RS+ GI + QR Y L +L D+G LG KP + PM+  LK+  S
Subjt:  LFVKGTGNSFVALLVYVDDIIITGSNASTIQHLKDLLNAHFKLKDLGSLKYFLGLELTRSSTGIFVSQRHYTLQLLEDTGFLGCKPTSVPMDPQLKLCSS

Query:  NVDLLADPSIYRRLIGRLLYLTISRPDITFAVHKLSQFMSKPCKSHLAAAHDLLRYLK--------------------------------RSITGFCVFL
          ++LADPS YRRLIGRLLYLT++RPDI+++V +LSQFMSKP   HL AA+ +LRY+K                                RSITG+CV++
Subjt:  NVDLLADPSIYRRLIGRLLYLTISRPDITFAVHKLSQFMSKPCKSHLAAAHDLLRYLK--------------------------------RSITGFCVFL

Query:  GDSLVSWKSKKQQTISRSSAEAEYRALAVTACEIIWLTNLLRDLQVQVSLPALLFCDNQAAVHIASNPVFHERTKHIELDCHFIRDKLIDGTIKLLPVRS
        G SL+SWKSKKQ T+SRSSAEAEYRA+A   CE++WL  LL +LQ      ALLFCD+QAA+HIA+NPV+HERTKHIELDCH IR+K+  G ++ L V S
Subjt:  GDSLVSWKSKKQQTISRSSAEAEYRALAVTACEIIWLTNLLRDLQVQVSLPALLFCDNQAAVHIASNPVFHERTKHIELDCHFIRDKLIDGTIKLLPVRS

Query:  HSQLADPFTKPLS
         +QLAD  TK L+
Subjt:  HSQLADPFTKPLS

A0A2N9H2Y3 Integrase catalytic domain-containing protein; e value: 1.5e-279; identity: 54.35%
Query:  VGEPQRTRRLHNVRSIKSEACVIESMHDNIDTCGICPLAKQKRLSFTSNNSLSAKAFDLVHADIWGPFSSPTYAGHSYFLTLVDDSTRYTWIFLLKHKSD
        +G P  + R+H + SI   A       ++  TC +CPLAKQ++L F +NN LS K+FDL+H DIWGP+  PT  G+ YFLTLVDD TR TWI+L++ KSD
Subjt:  VGEPQRTRRLHNVRSIKSEACVIESMHDNIDTCGICPLAKQKRLSFTSNNSLSAKAFDLVHADIWGPFSSPTYAGHSYFLTLVDDSTRYTWIFLLKHKSD

Query:  VLSVVPQFFKLVETQFGCCIKQFRSDNAPELAFGDFFRSKGVIHQFSCVERPQQNSVVERKHQHILNVARALYFQSRVPIRFWGDCVLTAVYLINRTPSP
          +++  F  ++ TQF   IKQ RSDN  E    DF+ SKG+IHQ SCVE PQQNSVVERKHQHILNVARAL FQS +P+++WG C+ TAVYLINR P P
Subjt:  VLSVVPQFFKLVETQFGCCIKQFRSDNAPELAFGDFFRSKGVIHQFSCVERPQQNSVVERKHQHILNVARALYFQSRVPIRFWGDCVLTAVYLINRTPSP

Query:  LLKWCTPFELLNGSKADYSSIRVFGCLCFASTLSSHRSKFHPRAIPAVFLGYPPGMKAYKLYDIEQQKLFISRDVIFHEEIFPFHTVTEASVLPDPFPGF
        +L   +PFE L      Y+ ++VFGCLCFASTLS HR+KF PRA    FLGYP G+K YKL ++   K+ ISRDV+FHE IFPF   T    LPD F  F
Subjt:  LLKWCTPFELLNGSKADYSSIRVFGCLCFASTLSSHRSKFHPRAIPAVFLGYPPGMKAYKLYDIEQQKLFISRDVIFHEEIFPFHTVTEASVLPDPFPGF

Query:  V--LPKPFNVSTEVEPPSLSYNQASASAQVSSPSSFLPDISPVIHTTQQPLMDAHN---------IVPQQPPLIDPPLVRRSARISKPPSYLHAYHCSL-
        +   P+P + +    PP  S+  A   +  S+P+   P +S  +       +  HN         I    P       +RRS R+ KPP+YL  YHC L 
Subjt:  V--LPKPFNVSTEVEPPSLSYNQASASAQVSSPSSFLPDISPVIHTTQQPLMDAHN---------IVPQQPPLIDPPLVRRSARISKPPSYLHAYHCSL-

Query:  --LSSTSLPTTSSR---FPLQQFISYSRLSSTHRNLILNVSTHFEPQFYHQAVVFDHWRAAMDAELAAMEDNKTWSVVPLPHGKHTIGCRWIYKIKHKAD
          + STS P  +S    +PL   +SY  LS THRN  L+V+   EP F+HQA    HW+ AM AELAA+E N TW++ PLP GKH IGC+W+YK+K K+D
Subjt:  --LSSTSLPTTSSR---FPLQQFISYSRLSSTHRNLILNVSTHFEPQFYHQAVVFDHWRAAMDAELAAMEDNKTWSVVPLPHGKHTIGCRWIYKIKHKAD

Query:  GSIERYKARLVAKGYTQQEGLDFLETFSPVAKLVTVKTLLTVAVSKGWSLMQLDVNNAFLHGDLFEEVYMDLPLGYNSHVDSKGE--HLVCKLHKSIYGL
        GS+ERYKARLVAKGYTQQEGLD+ ETFSPVAK  TV+TLL VA  K WSL QLDVNNAFLHGDL EEVYM LP G+     SKGE  +LVCKL+KS+YGL
Subjt:  GSIERYKARLVAKGYTQQEGLDFLETFSPVAKLVTVKTLLTVAVSKGWSLMQLDVNNAFLHGDLFEEVYMDLPLGYNSHVDSKGE--HLVCKLHKSIYGL

Query:  KQASRQWFAKFSHFLISLGFFQSKADYSLFVKGTGNSFVALLVYVDDIIITGSNASTIQHLKDLLNAHFKLKDLGSLKYFLGLELTRSSTGIFVSQRHYT
        KQASRQWFAKFS  +I  GF QSK+DYSLF +  G +F+ALLVYVDDI+I  ++   +  LKD L+A FKLKDLG+LKYFLGLE+ RS+ GI + QR Y 
Subjt:  KQASRQWFAKFSHFLISLGFFQSKADYSLFVKGTGNSFVALLVYVDDIIITGSNASTIQHLKDLLNAHFKLKDLGSLKYFLGLELTRSSTGIFVSQRHYT

Query:  LQLLEDTGFLGCKPTSVPMDPQLKLCSSNVDLLADPSIYRRLIGRLLYLTISRPDITFAVHKLSQFMSKPCKSHLAAAHDLLRYLK--------------
        L +L D+G LG KP + PM+  LK+  S  ++L DPS YRRL+GRLLYLT++RPDI+++V KLSQFMSKP   HL+AA+ +LRY+K              
Subjt:  LQLLEDTGFLGCKPTSVPMDPQLKLCSSNVDLLADPSIYRRLIGRLLYLTISRPDITFAVHKLSQFMSKPCKSHLAAAHDLLRYLK--------------

Query:  ------------------RSITGFCVFLGDSLVSWKSKKQQTISRSSAEAEYRALAVTACEIIWLTNLLRDLQVQVSLPALLFCDNQAAVHIASNPVFHE
                          RSITG+CV++GDSL+SWKSKKQ T+SRSSAEAEYRA+A   CE++WL  LL +LQ      ALLFCD+QAA+HIA+NPV+HE
Subjt:  ------------------RSITGFCVFLGDSLVSWKSKKQQTISRSSAEAEYRALAVTACEIIWLTNLLRDLQVQVSLPALLFCDNQAAVHIASNPVFHE

Query:  RTKHIELDCHFIRDKLIDGTIKLLPVRSHSQLADPFTKPLSAALLFPLLSKMG
        RTKHIELDCH IR+K+  G ++ L V S +QLAD  TK L +A    LL KMG
Subjt:  RTKHIELDCHFIRDKLIDGTIKLLPVRSHSQLADPFTKPLSAALLFPLLSKMG

A0A2N9IZK3 Uncharacterized protein; e value: 6.1e-281; identity: 55.42%
Query:  CGICPLAKQKRLSFTSNNSLSAKAFDLVHADIWGPFSSPTYAGHSYFLTLVDDSTRYTWIFLLKHKSDVLSVVPQFFKLVETQFGCCIKQFRSDNAPELA
        C +CPLAKQKRL F +NN +S+ AFD++H DIWGP+  PT  G+ YFLTLVDD TR TW++L+K KS+   ++  F  +++TQFG  +K  RSDN  E +
Subjt:  CGICPLAKQKRLSFTSNNSLSAKAFDLVHADIWGPFSSPTYAGHSYFLTLVDDSTRYTWIFLLKHKSDVLSVVPQFFKLVETQFGCCIKQFRSDNAPELA

Query:  FGDFFRSKGVIHQFSCVERPQQNSVVERKHQHILNVARALYFQSRVPIRFWGDCVLTAVYLINRTPSPLLKWCTPFELLNGSKADYSSIRVFGCLCFAST
          DF+ ++G+IHQ SCVE PQQNSVVERKHQHILNVAR+L FQS +P++FWG  VLTAVYLINR PSP+L   +P+E L      YS +RVFGCLCFAST
Subjt:  FGDFFRSKGVIHQFSCVERPQQNSVVERKHQHILNVARALYFQSRVPIRFWGDCVLTAVYLINRTPSPLLKWCTPFELLNGSKADYSSIRVFGCLCFAST

Query:  LSSHRSKFHPRAIPAVFLGYPPGMKAYKLYDIEQQKLFISRDVIFHEEIFPFHTVTEASV------LPDPFPGFV-LPKPFNVSTEVEPPSLSYNQASAS
        LS+HR+KF PRA P VFLGYP G+K YKL D+    + ISRDVIFHE +FPF     A        LP   P F  +P    +S  +     S    S S
Subjt:  LSSHRSKFHPRAIPAVFLGYPPGMKAYKLYDIEQQKLFISRDVIFHEEIFPFHTVTEASV------LPDPFPGFV-LPKPFNVSTEVEPPSLSYNQASAS

Query:  AQV-SSPSSFLPDISPVIHTTQQPLMDAHNIVPQQPPLIDPPLVRRSARISKPPSYLHAYHCSL------LSSTSLPTTSSRFPLQQFISYSRLSSTHRN
          + +SPS+  P I              H  VP     +  PL RRS R+SKPP+YL  YHC +       SS+S  +T + +PL   +SY  LS +HR 
Subjt:  AQV-SSPSSFLPDISPVIHTTQQPLMDAHNIVPQQPPLIDPPLVRRSARISKPPSYLHAYHCSL------LSSTSLPTTSSRFPLQQFISYSRLSSTHRN

Query:  LILNVSTHFEPQFYHQAVVFDHWRAAMDAELAAMEDNKTWSVVPLPHGKHTIGCRWIYKIKHKADGSIERYKARLVAKGYTQQEGLDFLETFSPVAKLVT
          L+V+   EP  + QA    HWR AM  EL A+E N TWS+  LP GKH IGC+W+YK+K KADGS+ERYKARLVAKGYTQQEGLD+ ETFSPVAK  T
Subjt:  LILNVSTHFEPQFYHQAVVFDHWRAAMDAELAAMEDNKTWSVVPLPHGKHTIGCRWIYKIKHKADGSIERYKARLVAKGYTQQEGLDFLETFSPVAKLVT

Query:  VKTLLTVAVSKGWSLMQLDVNNAFLHGDLFEEVYMDLPLGYNSHVDSKGE-HLVCKLHKSIYGLKQASRQWFAKFSHFLISLGFFQSKADYSLFVKGTGN
        V+TLL +A ++ WSL QLDVNNAFLHGDL EEVYM LP G+     SKGE +LVCKL KS+YGLKQASRQWFAKFS  LI  GF QSK+DYSLF +  G 
Subjt:  VKTLLTVAVSKGWSLMQLDVNNAFLHGDLFEEVYMDLPLGYNSHVDSKGE-HLVCKLHKSIYGLKQASRQWFAKFSHFLISLGFFQSKADYSLFVKGTGN

Query:  SFVALLVYVDDIIITGSNASTIQHLKDLLNAHFKLKDLGSLKYFLGLELTRSSTGIFVSQRHYTLQLLEDTGFLGCKPTSVPMDPQLKLCSSNVDLLADP
        +F+ LLVYVDDI+I  +N + +  LKD L+A FKLKDLG+LKYFLGLE+ RSS GI + QR Y L +L D+G LG KP   PM+ +LKL  S+ D L+DP
Subjt:  SFVALLVYVDDIIITGSNASTIQHLKDLLNAHFKLKDLGSLKYFLGLELTRSSTGIFVSQRHYTLQLLEDTGFLGCKPTSVPMDPQLKLCSSNVDLLADP

Query:  SIYRRLIGRLLYLTISRPDITFAVHKLSQFMSKPCKSHLAAAHDLLRYLK--------------------------------RSITGFCVFLGDSLVSWK
        S YRRL+GRLLYLT++RPDI+++V +LSQFM+KP  +HLAAA+ +L+Y+K                                RS+TG+CVFLG+SL+SWK
Subjt:  SIYRRLIGRLLYLTISRPDITFAVHKLSQFMSKPCKSHLAAAHDLLRYLK--------------------------------RSITGFCVFLGDSLVSWK

Query:  SKKQQTISRSSAEAEYRALAVTACEIIWLTNLLRDLQVQVSLPALLFCDNQAAVHIASNPVFHERTKHIELDCHFIRDKLIDGTIKLLPVRSHSQLADPF
        SKKQ TISRSSAEAEYRA+A   CE++WL  LL++LQV     ALL+CD+QAA+HIA+NPVFHERTKHIELDCH IR+K+ DG ++ L V S +QLAD  
Subjt:  SKKQQTISRSSAEAEYRALAVTACEIIWLTNLLRDLQVQVSLPALLFCDNQAAVHIASNPVFHERTKHIELDCHFIRDKLIDGTIKLLPVRSHSQLADPF

Query:  TKPLSAALLFPLLSKMGVLDIH
        TK L +     LLSKMGV +I+
Subjt:  TKPLSAALLFPLLSKMGVLDIH

A0A5A7VQN7 Cysteine-rich RLK (Receptor-like protein kinase) 8; e value: 7.7e-310; identity: 59.82%
Query:  QSVGEPQRTRRLHNVRSIKSEACVIESMHDNIDTCGICPLAKQKRLSFTSNNSLSAKAFDLVHADIWGPFSSPTYAGHSYFLTLVDDSTRYTWIFLLKHK
        Q +G P        + S++    +  S  D +  C ICPLAKQ++LSFTSNN LS+ AFDL+H DIWGPFS+ TY+ +SYFLT+VDD+TRYTW+F+LK K
Subjt:  QSVGEPQRTRRLHNVRSIKSEACVIESMHDNIDTCGICPLAKQKRLSFTSNNSLSAKAFDLVHADIWGPFSSPTYAGHSYFLTLVDDSTRYTWIFLLKHK

Query:  SDVLSVVPQFFKLVETQFGCCIKQFRSDNAPELAFGDFFRSKGVIHQFSCVERPQQNSVVERKHQHILNVARALYFQSRVPIRFWGDCVLTAVYLINRTP
        SDV+S++P FFKL+ETQ+G  IK+ RSDNA +L F  FF  KGVIHQ+SCV+ PQQNSVVE+KHQHILN ARALYFQS+VP+ FWGDC++TAVYLI+RTP
Subjt:  SDVLSVVPQFFKLVETQFGCCIKQFRSDNAPELAFGDFFRSKGVIHQFSCVERPQQNSVVERKHQHILNVARALYFQSRVPIRFWGDCVLTAVYLINRTP

Query:  SPLLKWCTPFELLNGSKADYSSIRVFGCLCFASTLSSHRSKFHPRAIPAVFLGYPPGMKAYKLYDIEQQKLFISRDVIFHEEIFPFHTVTEASVLPDPFP
        S LL+W  PF+ LN    DY+S++VFG LC+AS+L  + SKF PRAIP+VF+GYP GMK YKLYDIE +K+FISRD                        
Subjt:  SPLLKWCTPFELLNGSKADYSSIRVFGCLCFASTLSSHRSKFHPRAIPAVFLGYPPGMKAYKLYDIEQQKLFISRDVIFHEEIFPFHTVTEASVLPDPFP

Query:  GFVLPKPFNVSTEVEPPSLSYNQASASAQVSSPSSFLPDISPVIHTTQQPLMDAHNIVPQQPPLIDPPLVRRSARISKPPSYLHAYHCSLLSSTSLPT-T
              P N     E   +  N+ + S Q                              +QP   +   +RRS RI+KPPSYL AYHCSLL++ S PT  
Subjt:  GFVLPKPFNVSTEVEPPSLSYNQASASAQVSSPSSFLPDISPVIHTTQQPLMDAHNIVPQQPPLIDPPLVRRSARISKPPSYLHAYHCSLLSSTSLPT-T

Query:  SSRFPLQQFISYSRLSSTHRNLILNVSTHFEPQFYHQAVVFDHWRAAMDAELAAMEDNKTWSVVPLPHGKHTIGCRWIYKIKHKADGSIERYKARLVAKG
        S+++P+ Q++SY  L  T++  IL  ST  E  FYH+AVV   W  AM AEL AME N+TWS+VPLP GK++IGCRW+YKIKHKADGSIERYK RLVAKG
Subjt:  SSRFPLQQFISYSRLSSTHRNLILNVSTHFEPQFYHQAVVFDHWRAAMDAELAAMEDNKTWSVVPLPHGKHTIGCRWIYKIKHKADGSIERYKARLVAKG

Query:  YTQQEGLDFLETFSPVAKLVTVKTLLTVAVSKGWSLMQLDVNNAFLHGDLFEEVYMDLPLGYNSHVDSKGEHLVCKLHKSIYGLKQASRQWFAKFSHFLI
        YTQQEGLD+ ETFS V K+VTVKTLLT+AVSK W L+QLDVNNAFLHG+LFEEVYMDLPLGY    + +GE LVC+LHKSIYGLKQASRQWF KFS FL+
Subjt:  YTQQEGLDFLETFSPVAKLVTVKTLLTVAVSKGWSLMQLDVNNAFLHGDLFEEVYMDLPLGYNSHVDSKGEHLVCKLHKSIYGLKQASRQWFAKFSHFLI

Query:  SLGFFQSKADYSLFVKGTGNSFVALLVYVDDIIITGSNASTIQHLKDLLNAHFKLKDLGSLKYFLGLELTRSSTGIFVSQRHYTLQLLEDTGFLGCKPTS
        SLGF QSKA+YSLF++G  +SF+ALLVYVDDIIITG+NA  IQ LK  LN  F LKDLG LK+FLGLEL R+S+G+F+SQ++YTLQL+EDTG LG KPT 
Subjt:  SLGFFQSKADYSLFVKGTGNSFVALLVYVDDIIITGSNASTIQHLKDLLNAHFKLKDLGSLKYFLGLELTRSSTGIFVSQRHYTLQLLEDTGFLGCKPTS

Query:  VPMDPQLKLCSSNVDLLADPSIYRRLIGRLLYLTISRPDITFAVHKLSQFMSKPCKSHLAAAHDLLRYLK------------------------------
        VPMDP  KL +S+ D+L D + YRRLIGRLLYLTISRPDITFAVHKLSQFM+KP  +H+ AA+ L++YLK                              
Subjt:  VPMDPQLKLCSSNVDLLADPSIYRRLIGRLLYLTISRPDITFAVHKLSQFMSKPCKSHLAAAHDLLRYLK------------------------------

Query:  --RSITGFCVFLGDSLVSWKSKKQQTISRSSAEAEYRALAVTACEIIWLTNLLRDLQVQVSLPALLFCDNQAAVHIASNPVFHERTKHIELDCHFIRDKL
          RS+TGFCVFLG+SLVSWKSKKQQT++RSSAEAEY+AL VT CE+IWL +LL +L+++   PALLFCDNQAA++IA+NP+FHE+TKHIELDCHF+RD++
Subjt:  --RSITGFCVFLGDSLVSWKSKKQQTISRSSAEAEYRALAVTACEIIWLTNLLRDLQVQVSLPALLFCDNQAAVHIASNPVFHERTKHIELDCHFIRDKL

Query:  IDGTIKLLPVR
        IDG+IKLLPVR
Subjt:  IDGTIKLLPVR

A0A6J1CXB7 protein TIC 56, chloroplastic; e value: 6.0e-289; identity: 93.75%
Query:  MASINFNPFENWFSKRPNPIPPINLIAFRDSLSQKSSTSPNFASTSLSNVFRKPQKPEKAPDEPGYYGKMLEQFYWECDSLPDNRHTPEVEKILNEDPVF
        MASINFNPFENWFS+RPNPIPP+NL AFRDSLSQKSSTSPNFASTSLSN F+K QKP+KA DEPGYYGKMLEQF+WECD+LPD RHTPEVEKILNEDPV 
Subjt:  MASINFNPFENWFSKRPNPIPPINLIAFRDSLSQKSSTSPNFASTSLSNVFRKPQKPEKAPDEPGYYGKMLEQFYWECDSLPDNRHTPEVEKILNEDPVF

Query:  EKKENPTEEELEKNEKLWKALRTSPVVQFLERAEEIAAKYNELELKENENPYRDEDKKLWRAIPHVPGLDGRPMPRKAIKTKRESDDKFWDFARQFFFGL
        + KENP+EEE+EKNEKLWKALR SPVVQFLERAEEIAAKYNELELKENENPYRDEDKKLWRAIPHVPGLDGRPMPRKAIKTKRES+DKFWDFARQFFFGL
Subjt:  EKKENPTEEELEKNEKLWKALRTSPVVQFLERAEEIAAKYNELELKENENPYRDEDKKLWRAIPHVPGLDGRPMPRKAIKTKRESDDKFWDFARQFFFGL

Query:  WGFRQRPYPPGRPIDVAQAIGYKRLEKRYYDFNMRSGGWYYKDRLGRTRGPCELINLKTAWGGGIIDKDTFIWGEDMDEWAPIHMIYGLERAIATWEVRL
        WGFRQRPYPPGRPIDVAQAIGYKRLEKRYYDF MRSGGWYYKDRLGRTRGP ELI LKTAWGGGIIDKDTFIWGEDMDEWAPIHM+YGLERAIATWEVRL
Subjt:  WGFRQRPYPPGRPIDVAQAIGYKRLEKRYYDFNMRSGGWYYKDRLGRTRGPCELINLKTAWGGGIIDKDTFIWGEDMDEWAPIHMIYGLERAIATWEVRL

Query:  GAAATAFLHKLQKGIPPWVPLKGQEKKTYKQLQQEALESKRRDLAVLAANDGVWPGVRIPSHALFLWASGSELTSTLEADHMPNKYIPRDLRYKLAKIIP
        GAAATAFLHKLQKGIPPWVPLKG EKKTYKQLQQEALESKRRDLAVLAANDGVWPGVRIPSHALFLWASGSELTS LEADHMPNKYIP+DLRYKLAKIIP
Subjt:  GAAATAFLHKLQKGIPPWVPLKGQEKKTYKQLQQEALESKRRDLAVLAANDGVWPGVRIPSHALFLWASGSELTSTLEADHMPNKYIPRDLRYKLAKIIP

Query:  GLRPWEVLSVEQAMEQITYNGEWHREPLGTYTTGPPYIRHWNKDVKRMFRIFFNLSTRVYNKMERTIPGFRKIMEKVQADAAARDARRKERREATKRAEY
        GLRPWEVLSVEQAMEQITYNGEWHREPLGTYTTGPPYIRHWNKDVKRMFRIFFNLSTRVYNKMERTIPGF KIM+KVQADAAARDARRKERREA KRAE+
Subjt:  GLRPWEVLSVEQAMEQITYNGEWHREPLGTYTTGPPYIRHWNKDVKRMFRIFFNLSTRVYNKMERTIPGFRKIMEKVQADAAARDARRKERREATKRAEY

Query:  ERAIFGEVTKDQ
        ER IFG V KDQ
Subjt:  ERAIFGEVTKDQ

SwissProt top hits (e value, % identity, alignment)
P04146 Copia protein; e value: 1.2e-108; identity: 29.07%
Query:  RSIKSEACVIESMHDNIDTCGICPLAKQKRLSF--TSNNSLSAKAFDLVHADIWGPFSSPTYAGHSYFLTLVDDSTRYTWIFLLKHKSDVLSVVPQFFKL
        +++ S+  ++ ++  + + C  C   KQ RL F    + +   +   +VH+D+ GP +  T    +YF+  VD  T Y   +L+K+KSDV S+   F   
Subjt:  RSIKSEACVIESMHDNIDTCGICPLAKQKRLSF--TSNNSLSAKAFDLVHADIWGPFSSPTYAGHSYFLTLVDDSTRYTWIFLLKHKSDVLSVVPQFFKL

Query:  VETQFGCCIKQFRSDNAPELAFGD---FFRSKGVIHQFSCVERPQQNSVVERKHQHILNVARALYFQSRVPIRFWGDCVLTAVYLINRTPSPLL--KWCT
         E  F   +     DN  E    +   F   KG+ +  +    PQ N V ER  + I   AR +   +++   FWG+ VLTA YLINR PS  L     T
Subjt:  VETQFGCCIKQFRSDNAPELAFGD---FFRSKGVIHQFSCVERPQQNSVVERKHQHILNVARALYFQSRVPIRFWGDCVLTAVYLINRTPSPLL--KWCT

Query:  PFELLNGSKADYSSIRVFGCLCFASTLSSHRSKFHPRAIPAVFLGYPPGMKAYKLYDIEQQKLFISRDVIFHE------EIFPFHTVTEASVLPDPFPGF
        P+E+ +  K     +RVFG   +   + + + KF  ++  ++F+GY P    +KL+D   +K  ++RDV+  E          F TV            F
Subjt:  PFELLNGSKADYSSIRVFGCLCFASTLSSHRSKFHPRAIPAVFLGYPPGMKAYKLYDIEQQKLFISRDVIFHE------EIFPFHTVTEASVLPDPFPGF

Query:  VLPKPFNVSTEV--EPPSLSYNQASASAQVSSPSSFLPDISPVIHTT-QQPLMDAHNIVPQQPPLIDPPLVRRSARISKPPSYL----------------
               + TE   E       Q    ++ S   +F  D   +I T       +  NI   +            ++  K   +L                
Subjt:  VLPKPFNVSTEV--EPPSLSYNQASASAQVSSPSSFLPDISPVIHTT-QQPLMDAHNIVPQQPPLIDPPLVRRSARISKPPSYL----------------

Query:  ---HAYHCSLLSSTS------LPTTSSRFPLQQFISYSRLSSTHRNLILNVSTHFE--PQFYHQAVVFD---HWRAAMDAELAAMEDNKTWSVVPLPHGK
           H     + + T       +   S R   +  ISY+   ++   ++LN  T F   P  + +    D    W  A++ EL A + N TW++   P  K
Subjt:  ---HAYHCSLLSSTS------LPTTSSRFPLQQFISYSRLSSTHRNLILNVSTHFE--PQFYHQAVVFD---HWRAAMDAELAAMEDNKTWSVVPLPHGK

Query:  HTIGCRWIYKIKHKADGSIERYKARLVAKGYTQQEGLDFLETFSPVAKLVTVKTLLTVAVSKGWSLMQLDVNNAFLHGDLFEEVYMDLPLGYNSHVDSKG
        + +  RW++ +K+   G+  RYKARLVA+G+TQ+  +D+ ETF+PVA++ + + +L++ +     + Q+DV  AFL+G L EE+YM LP G + + D+  
Subjt:  HTIGCRWIYKIKHKADGSIERYKARLVAKGYTQQEGLDFLETFSPVAKLVTVKTLLTVAVSKGWSLMQLDVNNAFLHGDLFEEVYMDLPLGYNSHVDSKG

Query:  EHLVCKLHKSIYGLKQASRQWFAKFSHFLISLGFFQSKADYSLFV--KGTGNSFVALLVYVDDIIITGSNASTIQHLKDLLNAHFKLKDLGSLKYFLGLE
           VCKL+K+IYGLKQA+R WF  F   L    F  S  D  +++  KG  N  + +L+YVDD++I   + + + + K  L   F++ DL  +K+F+G+ 
Subjt:  EHLVCKLHKSIYGLKQASRQWFAKFSHFLISLGFFQSKADYSLFV--KGTGNSFVALLVYVDDIIITGSNASTIQHLKDLLNAHFKLKDLGSLKYFLGLE

Query:  LTRSSTGIFVSQRHYTLQLLEDTGFLGCKPTSVPMDPQL--KLCSSNVDLLADPSIYRRLIGRLLYLTI-SRPDITFAVHKLSQFMSKPCKSHLAAAHDL
        +      I++SQ  Y  ++L       C   S P+  ++  +L +S+ D     +  R LIG L+Y+ + +RPD+T AV+ LS++ SK           +
Subjt:  LTRSSTGIFVSQRHYTLQLLEDTGFLGCKPTSVPMDPQL--KLCSSNVDLLADPSIYRRLIGRLLYLTI-SRPDITFAVHKLSQFMSKPCKSHLAAAHDL

Query:  LRYLK----------------------------------RSITGFCVFLGD-SLVSWKSKKQQTISRSSAEAEYRALAVTACEIIWLTNLLRDLQVQVSL
        LRYLK                                  +S TG+   + D +L+ W +K+Q +++ SS EAEY AL     E +WL  LL  + +++  
Subjt:  LRYLK----------------------------------RSITGFCVFLGD-SLVSWKSKKQQTISRSSAEAEYRALAVTACEIIWLTNLLRDLQVQVSL

Query:  PALLFCDNQAAVHIASNPVFHERTKHIELDCHFIRDKLIDGTIKLLPVRSHSQLADPFTKPLSAALLFPLLSKMGVL
        P  ++ DNQ  + IA+NP  H+R KHI++  HF R+++ +  I L  + + +QLAD FTKPL AA    L  K+G+L
Subjt:  PALLFCDNQAAVHIASNPVFHERTKHIELDCHFIRDKLIDGTIKLLPVRSHSQLADPFTKPLSAALLFPLLSKMGVL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94; e value: 7.0e-133; identity: 32.67%
Query:  IDTCGICPLAKQKRLSFTSNNSLSAKAFDLVHADIWGPFSSPTYAGHSYFLTLVDDSTRYTWIFLLKHKSDVLSVVPQFFKLVETQFGCCIKQFRSDNAP
        +  C  C   KQ R+SF +++       DLV++D+ GP    +  G+ YF+T +DD++R  W+++LK K  V  V  +F  LVE + G  +K+ RSDN  
Subjt:  IDTCGICPLAKQKRLSFTSNNSLSAKAFDLVHADIWGPFSSPTYAGHSYFLTLVDDSTRYTWIFLLKHKSDVLSVVPQFFKLVETQFGCCIKQFRSDNAP

Query:  ELA---FGDFFRSKGVIHQFSCVERPQQNSVVERKHQHILNVARALYFQSRVPIRFWGDCVLTAVYLINRTPSPLLKWCTPFELLNGSKADYSSIRVFGC
        E     F ++  S G+ H+ +    PQ N V ER ++ I+   R++   +++P  FWG+ V TA YLINR+PS  L +  P  +    +  YS ++VFGC
Subjt:  ELA---FGDFFRSKGVIHQFSCVERPQQNSVVERKHQHILNVARALYFQSRVPIRFWGDCVLTAVYLINRTPSPLLKWCTPFELLNGSKADYSSIRVFGC

Query:  LCFASTLSSHRSKFHPRAIPAVFLGYPPGMKAYKLYDIEQQKLFISRDVIFHEEIFPFHTVTEASVLPDPFPGFVLPKPFNVSTEVEPPSLSYNQASASA
          FA      R+K   ++IP +F+GY      Y+L+D  ++K+  SRDV+F E       V  A+ + +     ++P    +      PS S N  SA +
Subjt:  LCFASTLSSHRSKFHPRAIPAVFLGYPPGMKAYKLYDIEQQKLFISRDVIFHEEIFPFHTVTEASVLPDPFPGFVLPKPFNVSTEVEPPSLSYNQASASA

Query:  QVSSPSSFLPDISPVIHTTQQPLMDAHNIVPQQPPLIDPPLVRRSARISKPPSYLHAYHCSLLSSTSLPTTSSRFPLQQFISYSRLSSTHRNLILNVSTH
             S        VI   +Q                    +       + P+     H  L  S      S R+P  +++  S         +  V +H
Subjt:  QVSSPSSFLPDISPVIHTTQQPLMDAHNIVPQQPPLIDPPLVRRSARISKPPSYLHAYHCSLLSSTSLPTTSSRFPLQQFISYSRLSSTHRNLILNVSTH

Query:  FEPQFYHQAVVFDHWRAAMDAELAAMEDNKTWSVVPLPHGKHTIGCRWIYKIKHKADGSIERYKARLVAKGYTQQEGLDFLETFSPVAKLVTVKTLLTVA
         E          +    AM  E+ +++ N T+ +V LP GK  + C+W++K+K   D  + RYKARLV KG+ Q++G+DF E FSPV K+ +++T+L++A
Subjt:  FEPQFYHQAVVFDHWRAAMDAELAAMEDNKTWSVVPLPHGKHTIGCRWIYKIKHKADGSIERYKARLVAKGYTQQEGLDFLETFSPVAKLVTVKTLLTVA

Query:  VSKGWSLMQLDVNNAFLHGDLFEEVYMDLPLGYNSHVDSKGEHLVCKLHKSIYGLKQASRQWFAKFSHFLISLGFFQSKADYSLFVKG-TGNSFVALLVY
         S    + QLDV  AFLHGDL EE+YM+ P G+   V  K +H+VCKL+KS+YGLKQA RQW+ KF  F+ S  + ++ +D  ++ K  + N+F+ LL+Y
Subjt:  VSKGWSLMQLDVNNAFLHGDLFEEVYMDLPLGYNSHVDSKGEHLVCKLHKSIYGLKQASRQWFAKFSHFLISLGFFQSKADYSLFVKG-TGNSFVALLVY

Query:  VDDIIITGSNASTIQHLKDLLNAHFKLKDLGSLKYFLGLELTRSSTG--IFVSQRHYTLQLLEDTGFLGCKPTSVPMDPQLKL----CSSNVDLLADPS-
        VDD++I G +   I  LK  L+  F +KDLG  +  LG+++ R  T   +++SQ  Y  ++LE       KP S P+   LKL    C + V+   + + 
Subjt:  VDDIIITGSNASTIQHLKDLLNAHFKLKDLGSLKYFLGLELTRSSTG--IFVSQRHYTLQLLEDTGFLGCKPTSVPMDPQLKL----CSSNVDLLADPS-

Query:  -IYRRLIGRLLY-LTISRPDITFAVHKLSQFMSKPCKSHLAAAHDLLRYLKRSITGFCVFLGDS--------------------------------LVSW
          Y   +G L+Y +  +RPDI  AV  +S+F+  P K H  A   +LRYL R  TG C+  G S                                 +SW
Subjt:  -IYRRLIGRLLY-LTISRPDITFAVHKLSQFMSKPCKSHLAAAHDLLRYLKRSITGFCVFLGDS--------------------------------LVSW

Query:  KSKKQQTISRSSAEAEYRALAVTACEIIWLTNLLRDLQVQVSLPALLFCDNQAAVHIASNPVFHERTKHIELDCHFIRDKLIDGTIKLLPVRSHSQLADP
        +SK Q+ ++ S+ EAEY A   T  E+IWL   L++L +      +++CD+Q+A+ ++ N ++H RTKHI++  H+IR+ + D ++K+L + ++   AD 
Subjt:  KSKKQQTISRSSAEAEYRALAVTACEIIWLTNLLRDLQVQVSLPALLFCDNQAAVHIASNPVFHERTKHIELDCHFIRDKLIDGTIKLLPVRSHSQLADP

Query:  FTK
         TK
Subjt:  FTK

Q7Y1W1 Protein TIC 56, chloroplastic (e value: 1.8e-226; % identity: 71.93)
Query:  MASINFNPFENWFSKRPNPIPPINLIAFRDSLSQKSSTSPNFASTSLSNVFRKPQKPEKA-PDEPGYYGKMLEQFYWECDSLPDNRHTPEVEKILNEDPV
        M+S+NFNPF+NWF K PNP+P IN ++  DS   KS  SPNFAS  L    +K  KPE A  DEPG Y ++ EQF WEC+++PD RHTPEV+K+LNEDPV
Subjt:  MASINFNPFENWFSKRPNPIPPINLIAFRDSLSQKSSTSPNFASTSLSNVFRKPQKPEKA-PDEPGYYGKMLEQFYWECDSLPDNRHTPEVEKILNEDPV

Query:  FEKKENPTEEELEKNEKLWKALRTSPVVQFLERAEEIAAKYNELELKENENPYRDEDKKLWRAIPHVPGLDGRPMPRKAIKTKRESDDKFWDFARQFFFG
        FEKKENP+ EE+E  +K W++ R SPVVQF+ RAEEIA   N++EL++N+ PYR EDK  WRAIPHVPG DGRPMPRKAIK+K ESDDKFWDF +QF FG
Subjt:  FEKKENPTEEELEKNEKLWKALRTSPVVQFLERAEEIAAKYNELELKENENPYRDEDKKLWRAIPHVPGLDGRPMPRKAIKTKRESDDKFWDFARQFFFG

Query:  LWGFRQRPYPPGRPIDVAQAIGYKRLEKRYYDFNMRSGGWYYKDRLGRTRGPCELINLKTAWGGGIIDKDTFIWGEDMDEWAPIHMIYGLERAIATWEVR
        LWGFRQRPYPPGRPIDVAQAIGYKRLEKRYYDF M++GGW+YKDRLGR+RGPCE+I LKTA+G GIID+DTFIWGEDMDEWAPIHM+YGLE AIATWEVR
Subjt:  LWGFRQRPYPPGRPIDVAQAIGYKRLEKRYYDFNMRSGGWYYKDRLGRTRGPCELINLKTAWGGGIIDKDTFIWGEDMDEWAPIHMIYGLERAIATWEVR

Query:  LGAAATAFLHKLQKGIPPWVPLKGQEKKTYKQLQQEALESKRRDLAVLAANDGVWPGVRIPSHALFLWASGSELTSTLEADHMPNKYIPRDLRYKLAKII
        LGAAATAFLHKLQKGIPPWVPLKG+E KTYKQLQ+EA+ESK+RD+AVL AN GVWPGVR PSHALFLWASGSELT+ LE+DHMPNK+IP+ LR +LAK+I
Subjt:  LGAAATAFLHKLQKGIPPWVPLKGQEKKTYKQLQQEALESKRRDLAVLAANDGVWPGVRIPSHALFLWASGSELTSTLEADHMPNKYIPRDLRYKLAKII

Query:  PGLRPWEVLSVEQAMEQITYNGEWHREPLGTYTTGPPYIRHWNKDVKRMFRIFFNLSTRVYNKMERTIPGFRKIMEKVQADAAARDARRKERREATKRAE
        PGLRPWEV+S+EQAM+QI+Y GEW+REPLGTYTTGPPYIR WN+ V R+FRIF+NLS RV  K+ERT+PGF  IM+KVQ D   R ARR +RRE   R E
Subjt:  PGLRPWEVLSVEQAMEQITYNGEWHREPLGTYTTGPPYIRHWNKDVKRMFRIFFNLSTRVYNKMERTIPGFRKIMEKVQADAAARDARRKERREATKRAE

Query:  YERAIFGEVTKDQ
          +   G   +D+
Subjt:  YERAIFGEVTKDQ

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 (e value: 1.4e-162; % identity: 36.26)
Query:  TCGICPLAKQKRLSFTSNNSLSAKAFDLVHADIWGPFSSP--TYAGHSYFLTLVDDSTRYTWIFLLKHKSDVLSVVPQFFKLVETQFGCCIKQFRSDNAP
        +C  C + K  ++ F+ +   S +  + +++D+W   SSP  ++  + Y++  VD  TRYTW++ LK KS V      F  L+E +F   I  F SDN  
Subjt:  TCGICPLAKQKRLSFTSNNSLSAKAFDLVHADIWGPFSSP--TYAGHSYFLTLVDDSTRYTWIFLLKHKSDVLSVVPQFFKLVETQFGCCIKQFRSDNAP

Query:  E-LAFGDFFRSKGVIHQFSCVERPQQNSVVERKHQHILNVARALYFQSRVPIRFWGDCVLTAVYLINRTPSPLLKWCTPFELLNGSKADYSSIRVFGCLC
        E +A  ++F   G+ H  S    P+ N + ERKH+HI+     L   + +P  +W      AVYLINR P+PLL+  +PF+ L G+  +Y  +RVFGC C
Subjt:  E-LAFGDFFRSKGVIHQFSCVERPQQNSVVERKHQHILNVARALYFQSRVPIRFWGDCVLTAVYLINRTPSPLLKWCTPFELLNGSKADYSSIRVFGCLC

Query:  FASTLSSHRSKFHPRAIPAVFLGYPPGMKAYKLYDIEQQKLFISRDVIFHEEIFPF-----------------------HTV--TEASVLPDPF---PGF
        +      ++ K   ++   VFLGY     AY    ++  +L+ISR V F E  FPF                       HT   T   VLP P    P  
Subjt:  FASTLSSHRSKFHPRAIPAVFLGYPPGMKAYKLYDIEQQKLFISRDVIFHEEIFPF-----------------------HTV--TEASVLPDPF---PGF

Query:  VLPKPFNVSTEVEPPSLSYNQASASAQVSSPSSFLPDI------SPVIHTTQ-QPLMDAHNIVPQQPPLIDPP--LVRRSARISKPPSYLHAYHCSLLSS
            P + S       +S +   +S   S PSS  P         P    TQ Q    +     Q  P  + P  L +  +  ++  S   +   S  SS
Subjt:  VLPKPFNVSTEVEPPSLSYNQASASAQVSSPSSFLPDI------SPVIHTTQ-QPLMDAHNIVPQQPPLIDPP--LVRRSARISKPPSYLHAYHCSLLSS

Query:  TSLPTTSSRF-----PLQQFISYSRLS--STHR----------------NLILNVSTHFEPQFYHQAVVFDHWRAAMDAELAAMEDNKTWSVVPLPHGKH
        ++ PT  S       PL Q ++ +  +  +TH                 +L ++++   EP+   QA+  + WR AM +E+ A   N TW +VP P    
Subjt:  TSLPTTSSRF-----PLQQFISYSRLS--STHR----------------NLILNVSTHFEPQFYHQAVVFDHWRAAMDAELAAMEDNKTWSVVPLPHGKH

Query:  TI-GCRWIYKIKHKADGSIERYKARLVAKGYTQQEGLDFLETFSPVAKLVTVKTLLTVAVSKGWSLMQLDVNNAFLHGDLFEEVYMDLPLGYNSHVDSKG
        TI GCRWI+  K+ +DGS+ RYKARLVAKGY Q+ GLD+ ETFSPV K  +++ +L VAV + W + QLDVNNAFL G L ++VYM  P G+   +D   
Subjt:  TI-GCRWIYKIKHKADGSIERYKARLVAKGYTQQEGLDFLETFSPVAKLVTVKTLLTVAVSKGWSLMQLDVNNAFLHGDLFEEVYMDLPLGYNSHVDSKG

Query:  EHLVCKLHKSIYGLKQASRQWFAKFSHFLISLGFFQSKADYSLFVKGTGNSFVALLVYVDDIIITGSNASTIQHLKDLLNAHFKLKDLGSLKYFLGLELT
         + VCKL K++YGLKQA R W+ +  ++L+++GF  S +D SLFV   G S V +LVYVDDI+ITG++ + + +  D L+  F +KD   L YFLG+E  
Subjt:  EHLVCKLHKSIYGLKQASRQWFAKFSHFLISLGFFQSKADYSLFVKGTGNSFVALLVYVDDIIITGSNASTIQHLKDLLNAHFKLKDLGSLKYFLGLELT

Query:  RSSTGIFVSQRHYTLQLLEDTGFLGCKPTSVPMDPQLKLCSSNVDLLADPSIYRRLIGRLLYLTISRPDITFAVHKLSQFMSKPCKSHLAAAHDLLRYLK
        R  TG+ +SQR Y L LL  T  +  KP + PM P  KL   +   L DP+ YR ++G L YL  +RPDI++AV++LSQFM  P + HL A   +LRYL 
Subjt:  RSSTGIFVSQRHYTLQLLEDTGFLGCKPTSVPMDPQLKLCSSNVDLLADPSIYRRLIGRLLYLTISRPDITFAVHKLSQFMSKPCKSHLAAAHDLLRYLK

Query:  R--------------------------------SITGFCVFLGDSLVSWKSKKQQTISRSSAEAEYRALAVTACEIIWLTNLLRDLQVQVSLPALLFCDN
                                         S  G+ V+LG   +SW SKKQ+ + RSS EAEYR++A T+ E+ W+ +LL +L ++++ P +++CDN
Subjt:  R--------------------------------SITGFCVFLGDSLVSWKSKKQQTISRSSAEAEYRALAVTACEIIWLTNLLRDLQVQVSLPALLFCDN

Query:  QAAVHIASNPVFHERTKHIELDCHFIRDKLIDGTIKLLPVRSHSQLADPFTKPLSAALLFPLLSKMGV
          A ++ +NPVFH R KHI +D HFIR+++  G ++++ V +H QLAD  TKPLS        SK+GV
Subjt:  QAAVHIASNPVFHERTKHIELDCHFIRDKLIDGTIKLLPVRSHSQLADPFTKPLSAALLFPLLSKMGV

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 (e value: 1.3e-160; % identity: 36.36)
Query:  TCGICPLAKQKRLSFTSNNSLSAKAFDLVHADIWGPFSSPTYA--GHSYFLTLVDDSTRYTWIFLLKHKSDVLSVVPQFFKLVETQFGCCIKQFRSDNAP
        +C  C + K  ++ F+++   S+K  + +++D+W   SSP  +   + Y++  VD  TRYTW++ LK KS V      F  LVE +F   I    SDN  
Subjt:  TCGICPLAKQKRLSFTSNNSLSAKAFDLVHADIWGPFSSPTYA--GHSYFLTLVDDSTRYTWIFLLKHKSDVLSVVPQFFKLVETQFGCCIKQFRSDNAP

Query:  E-LAFGDFFRSKGVIHQFSCVERPQQNSVVERKHQHILNVARALYFQSRVPIRFWGDCVLTAVYLINRTPSPLLKWCTPFELLNGSKADYSSIRVFGCLC
        E +   D+    G+ H  S    P+ N + ERKH+HI+ +   L   + VP  +W      AVYLINR P+PLL+  +PF+ L G   +Y  ++VFGC C
Subjt:  E-LAFGDFFRSKGVIHQFSCVERPQQNSVVERKHQHILNVARALYFQSRVPIRFWGDCVLTAVYLINRTPSPLLKWCTPFELLNGSKADYSSIRVFGCLC

Query:  FASTLSSHRSKFHPRAIPAVFLGYPPGMKAYKLYDIEQQKLFISRDVIFHEEIFPFHTV-----TEASVLPDPFPGF-----------VLPKP----FNV
        +      +R K   ++    F+GY     AY    I   +L+ SR V F E  FPF T      T      D  P +           VLP P     ++
Subjt:  FASTLSSHRSKFHPRAIPAVFLGYPPGMKAYKLYDIEQQKLFISRDVIFHEEIFPFHTV-----TEASVLPDPFPGF-----------VLPKP----FNV

Query:  STEVEPPS----LSYNQASA----SAQVSSPSSFLPDISPVIHTTQQPLMDAH----------------------NIVPQQPPLIDPPLVRRSARISKPP
         T   PPS    L   Q S+    S+ +SSPSS  P  +   H   QP    H                      N   Q  PL   P+   S  I  P 
Subjt:  STEVEPPS----LSYNQASA----SAQVSSPSSFLPDISPVIHTTQQPLMDAH----------------------NIVPQQPPLIDPPLVRRSARISKPP

Query:  SYLHAYHCSLLSSTSLPTTSSRFPLQQFISYSRLS--STHR----------------NLILNVSTHFEPQFYHQAVVFDHWRAAMDAELAAMEDNKTWSV
        + +   +    SSTS P      P    I  +  +  +TH                 +   +++ + EP+   QA+  D WR AM +E+ A   N TW +
Subjt:  SYLHAYHCSLLSSTSLPTTSSRFPLQQFISYSRLS--STHR----------------NLILNVSTHFEPQFYHQAVVFDHWRAAMDAELAAMEDNKTWSV

Query:  VPLPHGKHTI-GCRWIYKIKHKADGSIERYKARLVAKGYTQQEGLDFLETFSPVAKLVTVKTLLTVAVSKGWSLMQLDVNNAFLHGDLFEEVYMDLPLGY
        VP P    TI GCRWI+  K  +DGS+ RYKARLVAKGY Q+ GLD+ ETFSPV K  +++ +L VAV + W + QLDVNNAFL G L +EVYM  P G+
Subjt:  VPLPHGKHTI-GCRWIYKIKHKADGSIERYKARLVAKGYTQQEGLDFLETFSPVAKLVTVKTLLTVAVSKGWSLMQLDVNNAFLHGDLFEEVYMDLPLGY

Query:  NSHVDSKGEHLVCKLHKSIYGLKQASRQWFAKFSHFLISLGFFQSKADYSLFVKGTGNSFVALLVYVDDIIITGSNASTIQHLKDLLNAHFKLKDLGSLK
           VD      VC+L K+IYGLKQA R W+ +   +L+++GF  S +D SLFV   G S + +LVYVDDI+ITG++   ++H  D L+  F +K+   L 
Subjt:  NSHVDSKGEHLVCKLHKSIYGLKQASRQWFAKFSHFLISLGFFQSKADYSLFVKGTGNSFVALLVYVDDIIITGSNASTIQHLKDLLNAHFKLKDLGSLK

Query:  YFLGLELTRSSTGIFVSQRHYTLQLLEDTGFLGCKPTSVPMDPQLKLCSSNVDLLADPSIYRRLIGRLLYLTISRPDITFAVHKLSQFMSKPCKSHLAAA
        YFLG+E  R   G+ +SQR YTL LL  T  L  KP + PM    KL   +   L DP+ YR ++G L YL  +RPD+++AV++LSQ+M  P   H  A 
Subjt:  YFLGLELTRSSTGIFVSQRHYTLQLLEDTGFLGCKPTSVPMDPQLKLCSSNVDLLADPSIYRRLIGRLLYLTISRPDITFAVHKLSQFMSKPCKSHLAAA

Query:  HDLLRYLKR--------------------------------SITGFCVFLGDSLVSWKSKKQQTISRSSAEAEYRALAVTACEIIWLTNLLRDLQVQVSL
          +LRYL                                  S  G+ V+LG   +SW SKKQ+ + RSS EAEYR++A T+ E+ W+ +LL +L +Q+S 
Subjt:  HDLLRYLKR--------------------------------SITGFCVFLGDSLVSWKSKKQQTISRSSAEAEYRALAVTACEIIWLTNLLRDLQVQVSL

Query:  PALLFCDNQAAVHIASNPVFHERTKHIELDCHFIRDKLIDGTIKLLPVRSHSQLADPFTKPLSAALLFPLLSKMGVLDI
        P +++CDN  A ++ +NPVFH R KHI LD HFIR+++  G ++++ V +H QLAD  TKPLS         K+GV+ +
Subjt:  PALLFCDNQAAVHIASNPVFHERTKHIELDCHFIRDKLIDGTIKLLPVRSHSQLADPFTKPLSAALLFPLLSKMGVLDI

Arabidopsis top hits (e value, % identity, alignment)
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8 (e value: 1.9e-146; % identity: 47.84)
Query:  PLMDAHNIVPQQPPLIDPPLVRRSARISKPPSYLHAYHCSLLSSTSLPTTSSRFPLQQFISYSRLSSTHRNLILNVSTHFEPQFYHQAVVFDHWRAAMDA
        P  +  N VP+       P V  S R ++ P+YL  Y+C  ++S ++        + QF+SY ++S  + + ++ ++   EP  Y++A  F  W  AMD 
Subjt:  PLMDAHNIVPQQPPLIDPPLVRRSARISKPPSYLHAYHCSLLSSTSLPTTSSRFPLQQFISYSRLSSTHRNLILNVSTHFEPQFYHQAVVFDHWRAAMDA

Query:  ELAAMEDNKTWSVVPLPHGKHTIGCRWIYKIKHKADGSIERYKARLVAKGYTQQEGLDFLETFSPVAKLVTVKTLLTVAVSKGWSLMQLDVNNAFLHGDL
        E+ AME   TW +  LP  K  IGC+W+YKIK+ +DG+IERYKARLVAKGYTQQEG+DF+ETFSPV KL +VK +L ++    ++L QLD++NAFL+GDL
Subjt:  ELAAMEDNKTWSVVPLPHGKHTIGCRWIYKIKHKADGSIERYKARLVAKGYTQQEGLDFLETFSPVAKLVTVKTLLTVAVSKGWSLMQLDVNNAFLHGDL

Query:  FEEVYMDLPLGYNSHV-DSKGEHLVCKLHKSIYGLKQASRQWFAKFSHFLISLGFFQSKADYSLFVKGTGNSFVALLVYVDDIIITGSNASTIQHLKDLL
         EE+YM LP GY +   DS   + VC L KSIYGLKQASRQWF KFS  LI  GF QS +D++ F+K T   F+ +LVYVDDIII  +N + +  LK  L
Subjt:  FEEVYMDLPLGYNSHV-DSKGEHLVCKLHKSIYGLKQASRQWFAKFSHFLISLGFFQSKADYSLFVKGTGNSFVALLVYVDDIIITGSNASTIQHLKDLL

Query:  NAHFKLKDLGSLKYFLGLELTRSSTGIFVSQRHYTLQLLEDTGFLGCKPTSVPMDPQLKLCSSNVDLLADPSIYRRLIGRLLYLTISRPDITFAVHKLSQ
         + FKL+DLG LKYFLGLE+ RS+ GI + QR Y L LL++TG LGCKP+SVPMDP +   + +     D   YRRLIGRL+YL I+R DI+FAV+KLSQ
Subjt:  NAHFKLKDLGSLKYFLGLELTRSSTGIFVSQRHYTLQLLEDTGFLGCKPTSVPMDPQLKLCSSNVDLLADPSIYRRLIGRLLYLTISRPDITFAVHKLSQ

Query:  FMSKPCKSHLAAAHDLLRYLK--------------------------------RSITGFCVFLGDSLVSWKSKKQQTISRSSAEAEYRALAVTACEIIWL
        F   P  +H  A   +L Y+K                                RS  G+C+FLG SL+SWKSKKQQ +S+SSAEAEYRAL+    E++WL
Subjt:  FMSKPCKSHLAAAHDLLRYLK--------------------------------RSITGFCVFLGDSLVSWKSKKQQTISRSSAEAEYRALAVTACEIIWL

Query:  TNLLRDLQVQVSLPALLFCDNQAAVHIASNPVFHERTKHIELDCHFIRDKLIDGTIKLLPVRSHSQLADPFTKPLSAAL
            R+LQ+ +S P LLFCDN AA+HIA+N VFHERTKHIE DCH +R++ +         +++ +  D FT+ LS  L
Subjt:  TNLLRDLQVQVSLPALLFCDNQAAVHIASNPVFHERTKHIELDCHFIRDKLIDGTIKLLPVRSHSQLADPFTKPLSAAL

AT5G01590.1 unknown protein (e value: 1.3e-227; % identity: 71.93)
Query:  MASINFNPFENWFSKRPNPIPPINLIAFRDSLSQKSSTSPNFASTSLSNVFRKPQKPEKA-PDEPGYYGKMLEQFYWECDSLPDNRHTPEVEKILNEDPV
        M+S+NFNPF+NWF K PNP+P IN ++  DS   KS  SPNFAS  L    +K  KPE A  DEPG Y ++ EQF WEC+++PD RHTPEV+K+LNEDPV
Subjt:  MASINFNPFENWFSKRPNPIPPINLIAFRDSLSQKSSTSPNFASTSLSNVFRKPQKPEKA-PDEPGYYGKMLEQFYWECDSLPDNRHTPEVEKILNEDPV

Query:  FEKKENPTEEELEKNEKLWKALRTSPVVQFLERAEEIAAKYNELELKENENPYRDEDKKLWRAIPHVPGLDGRPMPRKAIKTKRESDDKFWDFARQFFFG
        FEKKENP+ EE+E  +K W++ R SPVVQF+ RAEEIA   N++EL++N+ PYR EDK  WRAIPHVPG DGRPMPRKAIK+K ESDDKFWDF +QF FG
Subjt:  FEKKENPTEEELEKNEKLWKALRTSPVVQFLERAEEIAAKYNELELKENENPYRDEDKKLWRAIPHVPGLDGRPMPRKAIKTKRESDDKFWDFARQFFFG

Query:  LWGFRQRPYPPGRPIDVAQAIGYKRLEKRYYDFNMRSGGWYYKDRLGRTRGPCELINLKTAWGGGIIDKDTFIWGEDMDEWAPIHMIYGLERAIATWEVR
        LWGFRQRPYPPGRPIDVAQAIGYKRLEKRYYDF M++GGW+YKDRLGR+RGPCE+I LKTA+G GIID+DTFIWGEDMDEWAPIHM+YGLE AIATWEVR
Subjt:  LWGFRQRPYPPGRPIDVAQAIGYKRLEKRYYDFNMRSGGWYYKDRLGRTRGPCELINLKTAWGGGIIDKDTFIWGEDMDEWAPIHMIYGLERAIATWEVR

Query:  LGAAATAFLHKLQKGIPPWVPLKGQEKKTYKQLQQEALESKRRDLAVLAANDGVWPGVRIPSHALFLWASGSELTSTLEADHMPNKYIPRDLRYKLAKII
        LGAAATAFLHKLQKGIPPWVPLKG+E KTYKQLQ+EA+ESK+RD+AVL AN GVWPGVR PSHALFLWASGSELT+ LE+DHMPNK+IP+ LR +LAK+I
Subjt:  LGAAATAFLHKLQKGIPPWVPLKGQEKKTYKQLQQEALESKRRDLAVLAANDGVWPGVRIPSHALFLWASGSELTSTLEADHMPNKYIPRDLRYKLAKII

Query:  PGLRPWEVLSVEQAMEQITYNGEWHREPLGTYTTGPPYIRHWNKDVKRMFRIFFNLSTRVYNKMERTIPGFRKIMEKVQADAAARDARRKERREATKRAE
        PGLRPWEV+S+EQAM+QI+Y GEW+REPLGTYTTGPPYIR WN+ V R+FRIF+NLS RV  K+ERT+PGF  IM+KVQ D   R ARR +RRE   R E
Subjt:  PGLRPWEVLSVEQAMEQITYNGEWHREPLGTYTTGPPYIRHWNKDVKRMFRIFFNLSTRVYNKMERTIPGFRKIMEKVQADAAARDARRKERREATKRAE

Query:  YERAIFGEVTKDQ
          +   G   +D+
Subjt:  YERAIFGEVTKDQ

ATMG00240.1 Gag-Pol-related retrotransposon family protein (e value: 8.2e-04; % identity: 31.9)
Query:  LYLTISRPDITFAVHKLSQFMSKPCKSHLAAAHDLLRYLKRSITGFCVFLGDSLVSWKSKKQQTISRSSAEAEYRALAVTACEII--WLTNLLRDLQVQV
        +YLTI+RPD+TFAV++LSQF S    + + A + +L Y+K ++ G  +F   S  S    K    S  ++  + R      C ++  W    LR     +
Subjt:  LYLTISRPDITFAVHKLSQFMSKPCKSHLAAAHDLLRYLKRSITGFCVFLGDSLVSWKSKKQQTISRSSAEAEYRALAVTACEII--WLTNLLRDLQVQV

Query:  SLPALLFCDNQAAVHI
          P LL   N  A+H+
Subjt:  SLPALLFCDNQAAVHI

ATMG00810.1 DNA/RNA polymerases superfamily protein (e value: 6.0e-39; % identity: 40.44)
Query:  LLVYVDDIIITGSNASTIQHLKDLLNAHFKLKDLGSLKYFLGLELTRSSTGIFVSQRHYTLQLLEDTGFLGCKPTSVPMDPQLKLCSS-NVDLLADPSIY
        LL+YVDDI++TGS+ + +  L   L++ F +KDLG + YFLG+++    +G+F+SQ  Y  Q+L + G L CKP S P+   LKL SS +     DPS +
Subjt:  LLVYVDDIIITGSNASTIQHLKDLLNAHFKLKDLGSLKYFLGLELTRSSTGIFVSQRHYTLQLLEDTGFLGCKPTSVPMDPQLKLCSS-NVDLLADPSIY

Query:  RRLIGRLLYLTISRPDITFAVHKLSQFMSKPCKSHLAAAHDLLRYLK--------------------------------RSITGFCVFLGDSLVSWKSKK
        R ++G L YLT++RPDI++AV+ + Q M +P  +       +LRY+K                                RS TGFC FLG +++SW +K+
Subjt:  RRLIGRLLYLTISRPDITFAVHKLSQFMSKPCKSHLAAAHDLLRYLK--------------------------------RSITGFCVFLGDSLVSWKSKK

Query:  QQTISRSSAEAEYRALAVTACEIIW
        Q T+SRSS E EYRALA+TA E+ W
Subjt:  QQTISRSSAEAEYRALAVTACEIIW

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase) (e value: 5.7e-21; % identity: 43.12)
Query:  NLILNVSTHFEPQFYHQAVVFDHWRAAMDAELAAMEDNKTWSVVPLPHGKHTIGCRWIYKIKHKADGSIERYKARLVAKGYTQQEGLDFLETFSPVAKLV
        +L +  +   EP+    A+    W  AM  EL A+  NKTW +VP P  ++ +GC+W++K K  +DG+++R KARLVAKG+ Q+EG+ F+ET+SPV +  
Subjt:  NLILNVSTHFEPQFYHQAVVFDHWRAAMDAELAAMEDNKTWSVVPLPHGKHTIGCRWIYKIKHKADGSIERYKARLVAKGYTQQEGLDFLETFSPVAKLV

Query:  TVKTLLTVA
        T++T+L VA
Subjt:  TVKTLLTVA


Sequences
CDS sequence
ATGATTGTTTTGGCATGGGGTGGAAATGAGAATGACGCTCGATACTTGAATAATGCTGCAGTCTTGATATTATCATGTTCTAGCACATCCGGAGATACTTTGAGAACTAA
CCCAACCGTTGACGGCCGGAAGTGGATCGTCGCCGTGAAGGAACCTCGTGACTTGTCTCATCGTCGGCCGAGCTTCTGGTTTCTGCCGATCGTGCCGACAACGCCGGTCG
TGTGCGATATTTCGTCGTGGTCGTATTGCCTGGCCAGACCAAAATCGCTCAATCGGGCATTCATGTCGGCATCTATTAGAACATTGCTTGGCTTTACATCCCGATTAACC
AAGTTCTTGTGTCTTAATCGTCCCAAGCTTTCGATTTCCGCTGCGAATTCCTTCATTCCTTGCTTTGAGTTTCTTCTTACCCTCTTCACAGCAATCTCAATTCCAGTTGA
ACGACAATCTCTTTCCCAGTCCTCAAGCCTCTCAGTTTGCCACATTCTTTTAAACAAGAAAGCTAAGAAAAGAATTCCCAGAAGGGCCACAACAGATAAAACTGCTATAA
AGACCTTGAACTGGGAATTGGGTGGTGAAGAAGGATCTTGCTCTTTTGGTGGTTTGGGAAGGAGATCGACATTTAGCGGCGGTGCCGGTCCGTTCACTGCAAAGCTCCAT
CCCAAAATGAGTCAGGTCAATCGACTGCGAAAGGAGTGGCTTTGTTGGTTTTGGCTGCTTCAATGGAGCTACACTGACGTTCACAATTCTGTTGGGGCCGTCATATTCAA
TCCAAGCAACGATTGCATCACCAGAATCCATTTGCAAGTCGCCTTCATTAGGATCGCCATAATTGGAATAAGCAGCAGCCAAGGCGAAGGCCAAGCCATAGCCACCTTGG
CCAGGGCTCGATGGATCAATAGCAAACACAAAACTTGTGCTGAAAGACGAAGCATTCGGATATGCTTCAGAACTTTTATCAAACATCTGGAATCGAATCGGATAGAATGC
GTGGCCGACAACGTTTTGCAATCTGTTGGTGAGCCTCAACGCACCAGACGGCTTCACAACGTACGCTCCATCAAGAGCGAAGCTTGTGTCATTGAATCCATGCATGATAA
TATTGATACTTGTGGGATTTGTCCTTTGGCTAAGCAAAAGCGTTTGTCCTTTACTTCAAACAATAGTCTTTCGGCCAAAGCTTTTGATCTTGTGCATGCTGACATTTGGG
GACCATTTTCTAGTCCTACTTATGCTGGTCATTCTTACTTTCTTACACTTGTTGATGATTCTACAAGATATACATGGATTTTCTTGCTTAAACATAAGTCTGATGTGTTA
TCTGTGGTTCCTCAATTTTTTAAACTGGTTGAGACACAATTTGGTTGCTGCATAAAACAATTTCGTTCAGATAATGCACCGGAACTTGCTTTTGGTGATTTCTTTCGTAG
TAAAGGTGTCATTCATCAGTTTTCTTGTGTGGAGCGTCCTCAACAAAATTCGGTTGTAGAAAGGAAACACCAACATATTTTGAATGTTGCTCGTGCTCTTTATTTCCAAT
CTAGAGTACCCATTCGATTTTGGGGTGATTGTGTTCTTACTGCTGTCTATCTGATTAATAGAACTCCTTCTCCTTTGTTAAAATGGTGTACTCCATTTGAGTTGTTGAAT
GGATCTAAGGCAGACTATAGCTCCATCCGTGTGTTTGGATGTTTGTGTTTTGCGTCAACTCTCTCTTCGCATCGCTCAAAGTTTCATCCCCGAGCCATTCCTGCTGTTTT
TCTTGGCTATCCTCCAGGTATGAAAGCTTATAAGTTGTATGACATTGAGCAGCAAAAATTATTCATATCAAGGGATGTTATTTTTCACGAGGAAATATTTCCTTTTCATA
CTGTTACTGAGGCCTCTGTACTCCCAGACCCCTTTCCTGGTTTTGTTTTGCCTAAGCCCTTTAATGTCAGCACTGAAGTTGAGCCACCAAGTCTCTCTTATAACCAGGCT
TCAGCATCTGCACAGGTCTCTTCTCCATCATCCTTTTTGCCTGATATTTCTCCTGTCATTCATACAACTCAGCAGCCTTTGATGGATGCTCATAACATTGTGCCTCAACA
GCCTCCTCTCATTGACCCACCCCTTGTTCGACGTTCTGCACGTATCTCTAAGCCACCTTCTTATTTACATGCTTATCATTGTAGTCTTCTTTCCTCTACCTCTTTGCCTA
CCACCTCTTCTCGTTTTCCCCTACAACAATTTATTTCTTACTCTCGACTTTCCTCAACTCATCGCAATTTGATTCTTAATGTTTCTACACATTTTGAGCCACAATTTTAC
CACCAAGCAGTCGTCTTTGACCATTGGCGGGCTGCTATGGATGCAGAGTTGGCAGCTATGGAGGATAATAAGACTTGGAGCGTTGTTCCTCTCCCTCATGGTAAGCACAC
TATTGGATGTCGGTGGATATATAAGATTAAACATAAAGCTGATGGTTCTATTGAACGTTACAAGGCTCGTTTAGTTGCTAAAGGTTATACACAGCAAGAGGGCTTGGATT
TCCTTGAAACTTTCTCTCCGGTTGCCAAGTTAGTCACTGTTAAGACCCTTCTTACTGTTGCAGTTTCTAAAGGGTGGTCTTTGATGCAATTGGATGTTAACAACGCTTTC
TTACATGGTGATTTGTTTGAGGAGGTTTATATGGATTTACCTTTAGGATACAATTCTCATGTTGATAGTAAGGGGGAGCATTTAGTCTGTAAATTGCATAAATCAATCTA
TGGCCTCAAGCAAGCCTCTAGGCAATGGTTTGCCAAATTTTCTCACTTCTTAATTTCTTTGGGATTCTTCCAGTCGAAGGCCGACTATTCATTGTTTGTCAAAGGGACTG
GGAATTCTTTTGTTGCTCTTTTAGTATACGTCGATGACATCATTATTACTGGGTCTAATGCTTCAACTATTCAGCACTTGAAGGATCTTCTTAATGCACATTTCAAACTC
AAGGATCTTGGCTCTCTTAAATATTTTCTTGGGCTTGAACTTACTCGTTCCTCTACTGGTATTTTTGTATCACAGAGACATTATACTTTACAGCTACTGGAAGACACAGG
TTTCCTTGGCTGTAAGCCCACTAGTGTTCCTATGGATCCTCAGTTGAAGTTGTGTTCTTCTAATGTTGATTTATTGGCTGACCCTTCAATATATCGACGTCTTATTGGCC
GCCTCTTATACTTGACTATATCTCGACCTGACATAACATTTGCGGTTCATAAACTGAGTCAGTTTATGTCGAAGCCCTGTAAATCTCACCTTGCGGCTGCTCATGATTTG
TTGCGCTACTTAAAACGCTCCATCACTGGTTTTTGTGTATTTTTAGGAGATTCATTGGTGTCGTGGAAATCAAAGAAGCAACAAACCATTTCCAGATCATCAGCAGAGGC
AGAGTATCGTGCTCTTGCTGTAACAGCATGTGAAATCATTTGGTTGACTAATCTTTTACGTGATTTACAGGTTCAAGTTAGCCTTCCAGCTCTTTTGTTTTGTGACAATC
AGGCAGCAGTTCATATTGCCTCTAATCCTGTTTTCCATGAGCGCACAAAACACATTGAGTTAGATTGTCATTTTATTCGAGACAAACTCATTGATGGGACCATCAAACTT
CTTCCAGTTAGATCACATTCCCAGCTTGCGGATCCTTTTACAAAACCTCTATCTGCTGCGCTTCTTTTCCCATTGTTGTCCAAGATGGGCGTTTTGGATATACATCGCCA
CTTCAAGGAGCAAAACTCCATAGGCAAACACATCTGTAGTTTTAGATGCTTTTCCAATGCGAGCCAACTCGGGCGAGATGTAGCCGATTGTGCCGACGACACTAGTTTCG
ACCCAAGCTTTCGATCTCTGCAGCGAATTCCCTCATTCCCTGACTCGAATTTCTTGTTACCTTCTTTACAGCAACTTCACTTCCTGTGGAAAAACAAAATTCCCACAAGT
GTCATAGCAGATAATATTGATATGAGAACGGTAACCAGAGGATTGGATGTCGATGATGAGGATGAAGAATTTTCCGCTTTTGGTGGTGTTGGAAGCCAAGAAACATTCAA
CGCAGGGGCTGAGAAGCCAACAAACATTGTCTCTTCTACAACTGTAGTCAGGTTTATGCGATATGAAATCAATGGCGCGGTTGGTTTCTCCACGTTAGTGGGAGCTATCG
TAACATTTACAACACCTGGAAATTGGGTTGATGGAGCTATAACAAAGGCGAGGCCGTACCCAGCTGGGCCGGAGTTTGATGGGACTATTGCAAACACAAAAGTTGTGCTG
AAAGAAGAAACATCTGAAACTCGTTCAGAACTAAAGAGGAGCAATACAGAAGAGCAATATCAGAAGGCAAGCCAAAGAAGCTGCCATAGCCATGCGAGGAGGCTTAAGAA
GCAAAAGAAATCACAGCTTAAGAAGAAGAAAACAATATCCCTACTTCCTGCAGCCCACTTCTCTGCAGTGGCCCAATCTCAGTGTGTCTATCCACTCGTTTCCTTCCCTC
TCCATGTCCTGCTCTCCGACTCCTCCGCCAGTCCTCCATTTTCATCCTGGTCGAGCAGCTTCTGCTTCTGTCCAAAACCCTACTTTCATTCATTGCCTGCAATCTCAGCA
TCTTACTATCTGATTGACCTTCTCGAAACCAGAGTCTCTATCTCAGTTATGGCGTCAATCAACTTCAACCCGTTCGAAAACTGGTTCTCCAAACGTCCGAACCCCATCCC
TCCCATCAATCTCATTGCATTTAGAGACTCGTTATCTCAGAAATCCTCAACGTCCCCAAACTTTGCTTCTACGAGCCTCTCGAACGTCTTCAGAAAGCCTCAGAAACCAG
AGAAAGCACCCGATGAACCAGGGTATTACGGGAAAATGCTAGAGCAATTTTACTGGGAATGCGACAGCTTACCCGATAACAGACACACCCCCGAGGTTGAGAAAATCTTG
AACGAAGATCCTGTCTTCGAAAAGAAGGAGAATCCAACCGAAGAGGAGCTCGAAAAGAACGAGAAGCTCTGGAAAGCGTTGCGAACCAGCCCCGTTGTGCAATTCCTGGA
ACGTGCGGAGGAAATTGCAGCCAAATACAATGAATTAGAGCTCAAAGAGAATGAGAATCCATACAGAGACGAGGACAAGAAGCTGTGGAGGGCAATTCCTCATGTGCCTG
GGCTTGATGGGCGGCCTATGCCGAGGAAAGCAATAAAGACGAAAAGAGAATCCGATGACAAGTTCTGGGATTTTGCGAGACAGTTCTTTTTTGGGCTTTGGGGCTTCCGG
CAGAGACCATATCCACCTGGTCGCCCGATCGATGTTGCGCAGGCCATTGGGTATAAGCGCCTCGAGAAGCGGTACTATGATTTTAACATGAGGAGTGGAGGTTGGTACTA
TAAGGATCGGTTGGGTCGTACTAGGGGACCATGTGAACTAATTAATCTTAAAACAGCTTGGGGAGGTGGGATTATTGACAAGGACACATTTATTTGGGGGGAGGACATGG
ATGAATGGGCGCCAATTCACATGATTTACGGATTGGAGCGTGCAATTGCTACTTGGGAAGTTAGACTCGGTGCTGCTGCCACTGCTTTCCTTCACAAATTGCAAAAAGGT
ATACCTCCATGGGTTCCTCTTAAGGGACAAGAGAAGAAAACCTACAAGCAACTCCAACAAGAGGCTTTAGAGAGCAAGAGACGAGATCTAGCAGTACTAGCAGCTAATGA
CGGTGTTTGGCCTGGAGTTAGAATTCCTAGTCACGCTCTGTTTCTTTGGGCCAGTGGATCTGAGCTGACATCTACGTTGGAGGCAGATCACATGCCAAACAAGTACATCC
CTAGGGATCTAAGGTACAAGTTAGCTAAAATTATACCCGGGTTAAGACCGTGGGAGGTTCTCAGTGTGGAGCAAGCGATGGAACAGATAACATATAATGGCGAATGGCAT
CGTGAACCTCTGGGGACATACACGACTGGCCCTCCGTACATCAGGCATTGGAATAAAGATGTCAAGAGAATGTTTAGGATATTCTTCAACCTCAGCACCCGAGTTTACAA
TAAAATGGAGCGGACAATTCCTGGTTTCCGTAAAATAATGGAGAAAGTCCAGGCCGATGCCGCTGCCCGGGATGCTAGACGGAAGGAGAGGAGGGAGGCAACGAAGAGAG
CCGAATACGAGAGAGCGATCTTCGGCGAAGTCACAAAGGATCAGTAG
mRNA sequence
ATGATTGTTTTGGCATGGGGTGGAAATGAGAATGACGCTCGATACTTGAATAATGCTGCAGTCTTGATATTATCATGTTCTAGCACATCCGGAGATACTTTGAGAACTAA
CCCAACCGTTGACGGCCGGAAGTGGATCGTCGCCGTGAAGGAACCTCGTGACTTGTCTCATCGTCGGCCGAGCTTCTGGTTTCTGCCGATCGTGCCGACAACGCCGGTCG
TGTGCGATATTTCGTCGTGGTCGTATTGCCTGGCCAGACCAAAATCGCTCAATCGGGCATTCATGTCGGCATCTATTAGAACATTGCTTGGCTTTACATCCCGATTAACC
AAGTTCTTGTGTCTTAATCGTCCCAAGCTTTCGATTTCCGCTGCGAATTCCTTCATTCCTTGCTTTGAGTTTCTTCTTACCCTCTTCACAGCAATCTCAATTCCAGTTGA
ACGACAATCTCTTTCCCAGTCCTCAAGCCTCTCAGTTTGCCACATTCTTTTAAACAAGAAAGCTAAGAAAAGAATTCCCAGAAGGGCCACAACAGATAAAACTGCTATAA
AGACCTTGAACTGGGAATTGGGTGGTGAAGAAGGATCTTGCTCTTTTGGTGGTTTGGGAAGGAGATCGACATTTAGCGGCGGTGCCGGTCCGTTCACTGCAAAGCTCCAT
CCCAAAATGAGTCAGGTCAATCGACTGCGAAAGGAGTGGCTTTGTTGGTTTTGGCTGCTTCAATGGAGCTACACTGACGTTCACAATTCTGTTGGGGCCGTCATATTCAA
TCCAAGCAACGATTGCATCACCAGAATCCATTTGCAAGTCGCCTTCATTAGGATCGCCATAATTGGAATAAGCAGCAGCCAAGGCGAAGGCCAAGCCATAGCCACCTTGG
CCAGGGCTCGATGGATCAATAGCAAACACAAAACTTGTGCTGAAAGACGAAGCATTCGGATATGCTTCAGAACTTTTATCAAACATCTGGAATCGAATCGGATAGAATGC
GTGGCCGACAACGTTTTGCAATCTGTTGGTGAGCCTCAACGCACCAGACGGCTTCACAACGTACGCTCCATCAAGAGCGAAGCTTGTGTCATTGAATCCATGCATGATAA
TATTGATACTTGTGGGATTTGTCCTTTGGCTAAGCAAAAGCGTTTGTCCTTTACTTCAAACAATAGTCTTTCGGCCAAAGCTTTTGATCTTGTGCATGCTGACATTTGGG
GACCATTTTCTAGTCCTACTTATGCTGGTCATTCTTACTTTCTTACACTTGTTGATGATTCTACAAGATATACATGGATTTTCTTGCTTAAACATAAGTCTGATGTGTTA
TCTGTGGTTCCTCAATTTTTTAAACTGGTTGAGACACAATTTGGTTGCTGCATAAAACAATTTCGTTCAGATAATGCACCGGAACTTGCTTTTGGTGATTTCTTTCGTAG
TAAAGGTGTCATTCATCAGTTTTCTTGTGTGGAGCGTCCTCAACAAAATTCGGTTGTAGAAAGGAAACACCAACATATTTTGAATGTTGCTCGTGCTCTTTATTTCCAAT
CTAGAGTACCCATTCGATTTTGGGGTGATTGTGTTCTTACTGCTGTCTATCTGATTAATAGAACTCCTTCTCCTTTGTTAAAATGGTGTACTCCATTTGAGTTGTTGAAT
GGATCTAAGGCAGACTATAGCTCCATCCGTGTGTTTGGATGTTTGTGTTTTGCGTCAACTCTCTCTTCGCATCGCTCAAAGTTTCATCCCCGAGCCATTCCTGCTGTTTT
TCTTGGCTATCCTCCAGGTATGAAAGCTTATAAGTTGTATGACATTGAGCAGCAAAAATTATTCATATCAAGGGATGTTATTTTTCACGAGGAAATATTTCCTTTTCATA
CTGTTACTGAGGCCTCTGTACTCCCAGACCCCTTTCCTGGTTTTGTTTTGCCTAAGCCCTTTAATGTCAGCACTGAAGTTGAGCCACCAAGTCTCTCTTATAACCAGGCT
TCAGCATCTGCACAGGTCTCTTCTCCATCATCCTTTTTGCCTGATATTTCTCCTGTCATTCATACAACTCAGCAGCCTTTGATGGATGCTCATAACATTGTGCCTCAACA
GCCTCCTCTCATTGACCCACCCCTTGTTCGACGTTCTGCACGTATCTCTAAGCCACCTTCTTATTTACATGCTTATCATTGTAGTCTTCTTTCCTCTACCTCTTTGCCTA
CCACCTCTTCTCGTTTTCCCCTACAACAATTTATTTCTTACTCTCGACTTTCCTCAACTCATCGCAATTTGATTCTTAATGTTTCTACACATTTTGAGCCACAATTTTAC
CACCAAGCAGTCGTCTTTGACCATTGGCGGGCTGCTATGGATGCAGAGTTGGCAGCTATGGAGGATAATAAGACTTGGAGCGTTGTTCCTCTCCCTCATGGTAAGCACAC
TATTGGATGTCGGTGGATATATAAGATTAAACATAAAGCTGATGGTTCTATTGAACGTTACAAGGCTCGTTTAGTTGCTAAAGGTTATACACAGCAAGAGGGCTTGGATT
TCCTTGAAACTTTCTCTCCGGTTGCCAAGTTAGTCACTGTTAAGACCCTTCTTACTGTTGCAGTTTCTAAAGGGTGGTCTTTGATGCAATTGGATGTTAACAACGCTTTC
TTACATGGTGATTTGTTTGAGGAGGTTTATATGGATTTACCTTTAGGATACAATTCTCATGTTGATAGTAAGGGGGAGCATTTAGTCTGTAAATTGCATAAATCAATCTA
TGGCCTCAAGCAAGCCTCTAGGCAATGGTTTGCCAAATTTTCTCACTTCTTAATTTCTTTGGGATTCTTCCAGTCGAAGGCCGACTATTCATTGTTTGTCAAAGGGACTG
GGAATTCTTTTGTTGCTCTTTTAGTATACGTCGATGACATCATTATTACTGGGTCTAATGCTTCAACTATTCAGCACTTGAAGGATCTTCTTAATGCACATTTCAAACTC
AAGGATCTTGGCTCTCTTAAATATTTTCTTGGGCTTGAACTTACTCGTTCCTCTACTGGTATTTTTGTATCACAGAGACATTATACTTTACAGCTACTGGAAGACACAGG
TTTCCTTGGCTGTAAGCCCACTAGTGTTCCTATGGATCCTCAGTTGAAGTTGTGTTCTTCTAATGTTGATTTATTGGCTGACCCTTCAATATATCGACGTCTTATTGGCC
GCCTCTTATACTTGACTATATCTCGACCTGACATAACATTTGCGGTTCATAAACTGAGTCAGTTTATGTCGAAGCCCTGTAAATCTCACCTTGCGGCTGCTCATGATTTG
TTGCGCTACTTAAAACGCTCCATCACTGGTTTTTGTGTATTTTTAGGAGATTCATTGGTGTCGTGGAAATCAAAGAAGCAACAAACCATTTCCAGATCATCAGCAGAGGC
AGAGTATCGTGCTCTTGCTGTAACAGCATGTGAAATCATTTGGTTGACTAATCTTTTACGTGATTTACAGGTTCAAGTTAGCCTTCCAGCTCTTTTGTTTTGTGACAATC
AGGCAGCAGTTCATATTGCCTCTAATCCTGTTTTCCATGAGCGCACAAAACACATTGAGTTAGATTGTCATTTTATTCGAGACAAACTCATTGATGGGACCATCAAACTT
CTTCCAGTTAGATCACATTCCCAGCTTGCGGATCCTTTTACAAAACCTCTATCTGCTGCGCTTCTTTTCCCATTGTTGTCCAAGATGGGCGTTTTGGATATACATCGCCA
CTTCAAGGAGCAAAACTCCATAGGCAAACACATCTGTAGTTTTAGATGCTTTTCCAATGCGAGCCAACTCGGGCGAGATGTAGCCGATTGTGCCGACGACACTAGTTTCG
ACCCAAGCTTTCGATCTCTGCAGCGAATTCCCTCATTCCCTGACTCGAATTTCTTGTTACCTTCTTTACAGCAACTTCACTTCCTGTGGAAAAACAAAATTCCCACAAGT
GTCATAGCAGATAATATTGATATGAGAACGGTAACCAGAGGATTGGATGTCGATGATGAGGATGAAGAATTTTCCGCTTTTGGTGGTGTTGGAAGCCAAGAAACATTCAA
CGCAGGGGCTGAGAAGCCAACAAACATTGTCTCTTCTACAACTGTAGTCAGGTTTATGCGATATGAAATCAATGGCGCGGTTGGTTTCTCCACGTTAGTGGGAGCTATCG
TAACATTTACAACACCTGGAAATTGGGTTGATGGAGCTATAACAAAGGCGAGGCCGTACCCAGCTGGGCCGGAGTTTGATGGGACTATTGCAAACACAAAAGTTGTGCTG
AAAGAAGAAACATCTGAAACTCGTTCAGAACTAAAGAGGAGCAATACAGAAGAGCAATATCAGAAGGCAAGCCAAAGAAGCTGCCATAGCCATGCGAGGAGGCTTAAGAA
GCAAAAGAAATCACAGCTTAAGAAGAAGAAAACAATATCCCTACTTCCTGCAGCCCACTTCTCTGCAGTGGCCCAATCTCAGTGTGTCTATCCACTCGTTTCCTTCCCTC
TCCATGTCCTGCTCTCCGACTCCTCCGCCAGTCCTCCATTTTCATCCTGGTCGAGCAGCTTCTGCTTCTGTCCAAAACCCTACTTTCATTCATTGCCTGCAATCTCAGCA
TCTTACTATCTGATTGACCTTCTCGAAACCAGAGTCTCTATCTCAGTTATGGCGTCAATCAACTTCAACCCGTTCGAAAACTGGTTCTCCAAACGTCCGAACCCCATCCC
TCCCATCAATCTCATTGCATTTAGAGACTCGTTATCTCAGAAATCCTCAACGTCCCCAAACTTTGCTTCTACGAGCCTCTCGAACGTCTTCAGAAAGCCTCAGAAACCAG
AGAAAGCACCCGATGAACCAGGGTATTACGGGAAAATGCTAGAGCAATTTTACTGGGAATGCGACAGCTTACCCGATAACAGACACACCCCCGAGGTTGAGAAAATCTTG
AACGAAGATCCTGTCTTCGAAAAGAAGGAGAATCCAACCGAAGAGGAGCTCGAAAAGAACGAGAAGCTCTGGAAAGCGTTGCGAACCAGCCCCGTTGTGCAATTCCTGGA
ACGTGCGGAGGAAATTGCAGCCAAATACAATGAATTAGAGCTCAAAGAGAATGAGAATCCATACAGAGACGAGGACAAGAAGCTGTGGAGGGCAATTCCTCATGTGCCTG
GGCTTGATGGGCGGCCTATGCCGAGGAAAGCAATAAAGACGAAAAGAGAATCCGATGACAAGTTCTGGGATTTTGCGAGACAGTTCTTTTTTGGGCTTTGGGGCTTCCGG
CAGAGACCATATCCACCTGGTCGCCCGATCGATGTTGCGCAGGCCATTGGGTATAAGCGCCTCGAGAAGCGGTACTATGATTTTAACATGAGGAGTGGAGGTTGGTACTA
TAAGGATCGGTTGGGTCGTACTAGGGGACCATGTGAACTAATTAATCTTAAAACAGCTTGGGGAGGTGGGATTATTGACAAGGACACATTTATTTGGGGGGAGGACATGG
ATGAATGGGCGCCAATTCACATGATTTACGGATTGGAGCGTGCAATTGCTACTTGGGAAGTTAGACTCGGTGCTGCTGCCACTGCTTTCCTTCACAAATTGCAAAAAGGT
ATACCTCCATGGGTTCCTCTTAAGGGACAAGAGAAGAAAACCTACAAGCAACTCCAACAAGAGGCTTTAGAGAGCAAGAGACGAGATCTAGCAGTACTAGCAGCTAATGA
CGGTGTTTGGCCTGGAGTTAGAATTCCTAGTCACGCTCTGTTTCTTTGGGCCAGTGGATCTGAGCTGACATCTACGTTGGAGGCAGATCACATGCCAAACAAGTACATCC
CTAGGGATCTAAGGTACAAGTTAGCTAAAATTATACCCGGGTTAAGACCGTGGGAGGTTCTCAGTGTGGAGCAAGCGATGGAACAGATAACATATAATGGCGAATGGCAT
CGTGAACCTCTGGGGACATACACGACTGGCCCTCCGTACATCAGGCATTGGAATAAAGATGTCAAGAGAATGTTTAGGATATTCTTCAACCTCAGCACCCGAGTTTACAA
TAAAATGGAGCGGACAATTCCTGGTTTCCGTAAAATAATGGAGAAAGTCCAGGCCGATGCCGCTGCCCGGGATGCTAGACGGAAGGAGAGGAGGGAGGCAACGAAGAGAG
CCGAATACGAGAGAGCGATCTTCGGCGAAGTCACAAAGGATCAGTAG
Protein sequence
MIVLAWGGNENDARYLNNAAVLILSCSSTSGDTLRTNPTVDGRKWIVAVKEPRDLSHRRPSFWFLPIVPTTPVVCDISSWSYCLARPKSLNRAFMSASIRTLLGFTSRLT
KFLCLNRPKLSISAANSFIPCFEFLLTLFTAISIPVERQSLSQSSSLSVCHILLNKKAKKRIPRRATTDKTAIKTLNWELGGEEGSCSFGGLGRRSTFSGGAGPFTAKLH
PKMSQVNRLRKEWLCWFWLLQWSYTDVHNSVGAVIFNPSNDCITRIHLQVAFIRIAIIGISSSQGEGQAIATLARARWINSKHKTCAERRSIRICFRTFIKHLESNRIEC
VADNVLQSVGEPQRTRRLHNVRSIKSEACVIESMHDNIDTCGICPLAKQKRLSFTSNNSLSAKAFDLVHADIWGPFSSPTYAGHSYFLTLVDDSTRYTWIFLLKHKSDVL
SVVPQFFKLVETQFGCCIKQFRSDNAPELAFGDFFRSKGVIHQFSCVERPQQNSVVERKHQHILNVARALYFQSRVPIRFWGDCVLTAVYLINRTPSPLLKWCTPFELLN
GSKADYSSIRVFGCLCFASTLSSHRSKFHPRAIPAVFLGYPPGMKAYKLYDIEQQKLFISRDVIFHEEIFPFHTVTEASVLPDPFPGFVLPKPFNVSTEVEPPSLSYNQA
SASAQVSSPSSFLPDISPVIHTTQQPLMDAHNIVPQQPPLIDPPLVRRSARISKPPSYLHAYHCSLLSSTSLPTTSSRFPLQQFISYSRLSSTHRNLILNVSTHFEPQFY
HQAVVFDHWRAAMDAELAAMEDNKTWSVVPLPHGKHTIGCRWIYKIKHKADGSIERYKARLVAKGYTQQEGLDFLETFSPVAKLVTVKTLLTVAVSKGWSLMQLDVNNAF
LHGDLFEEVYMDLPLGYNSHVDSKGEHLVCKLHKSIYGLKQASRQWFAKFSHFLISLGFFQSKADYSLFVKGTGNSFVALLVYVDDIIITGSNASTIQHLKDLLNAHFKL
KDLGSLKYFLGLELTRSSTGIFVSQRHYTLQLLEDTGFLGCKPTSVPMDPQLKLCSSNVDLLADPSIYRRLIGRLLYLTISRPDITFAVHKLSQFMSKPCKSHLAAAHDL
LRYLKRSITGFCVFLGDSLVSWKSKKQQTISRSSAEAEYRALAVTACEIIWLTNLLRDLQVQVSLPALLFCDNQAAVHIASNPVFHERTKHIELDCHFIRDKLIDGTIKL
LPVRSHSQLADPFTKPLSAALLFPLLSKMGVLDIHRHFKEQNSIGKHICSFRCFSNASQLGRDVADCADDTSFDPSFRSLQRIPSFPDSNFLLPSLQQLHFLWKNKIPTS
VIADNIDMRTVTRGLDVDDEDEEFSAFGGVGSQETFNAGAEKPTNIVSSTTVVRFMRYEINGAVGFSTLVGAIVTFTTPGNWVDGAITKARPYPAGPEFDGTIANTKVVL
KEETSETRSELKRSNTEEQYQKASQRSCHSHARRLKKQKKSQLKKKKTISLLPAAHFSAVAQSQCVYPLVSFPLHVLLSDSSASPPFSSWSSSFCFCPKPYFHSLPAISA
SYYLIDLLETRVSISVMASINFNPFENWFSKRPNPIPPINLIAFRDSLSQKSSTSPNFASTSLSNVFRKPQKPEKAPDEPGYYGKMLEQFYWECDSLPDNRHTPEVEKIL
NEDPVFEKKENPTEEELEKNEKLWKALRTSPVVQFLERAEEIAAKYNELELKENENPYRDEDKKLWRAIPHVPGLDGRPMPRKAIKTKRESDDKFWDFARQFFFGLWGFR
QRPYPPGRPIDVAQAIGYKRLEKRYYDFNMRSGGWYYKDRLGRTRGPCELINLKTAWGGGIIDKDTFIWGEDMDEWAPIHMIYGLERAIATWEVRLGAAATAFLHKLQKG
IPPWVPLKGQEKKTYKQLQQEALESKRRDLAVLAANDGVWPGVRIPSHALFLWASGSELTSTLEADHMPNKYIPRDLRYKLAKIIPGLRPWEVLSVEQAMEQITYNGEWH
REPLGTYTTGPPYIRHWNKDVKRMFRIFFNLSTRVYNKMERTIPGFRKIMEKVQADAAARDARRKERREATKRAEYERAIFGEVTKDQ