CuGenDBv2

Sgr028805 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr028805
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Integrase catalytic domain-containing protein
Genome location: tig00153206:2240080..2245946
RNA-Seq Expression: Sgr028805
Synteny: Sgr028805
Gene Ontology terms:
    GO:0015074 - DNA integration (biological process)
    GO:0055085 - transmembrane transport (biological process)
    GO:0016020 - membrane (cellular component)
    GO:0003676 - nucleic acid binding (molecular function)
    GO:0015267 - channel activity (molecular function)
    GO:0016772 - transferase activity, transferring phosphorus-containing groups (molecular function)
InterPro domains:
    IPR000425 - Major intrinsic protein
    IPR001584 - Integrase, catalytic core
    IPR012337 - Ribonuclease H-like superfamily
    IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
    IPR022357 - Major intrinsic protein, conserved site
    IPR023271 - Aquaporin-like
    IPR034294 - Aquaporin transporter
    IPR036397 - Ribonuclease H superfamily
    IPR043502 - DNA/RNA polymerase superfamily
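
The genome location in the record above has the layout contig:start..end. The following is a minimal sketch of reading such a string and deriving the span of the gene. It assumes the coordinates are 1-based and inclusive, which this page does not state; the names `Location` and `parse_location` are hypothetical helpers for illustration and are not part of CuGenDB.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A contig name with start and end coordinates."""
    contig: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive,
        # so both endpoints count towards the length.
        return self.end - self.start + 1


# contig, a colon, then start..end as plain integers
_LOCATION_RE = re.compile(r"^(?P<contig>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> Location:
    """Parse a 'contig:start..end' string into a Location."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start = int(match.group("start"))
    end = int(match.group("end"))
    if end < start:
        raise ValueError("end coordinate precedes start coordinate")
    return Location(match.group("contig"), start, end)


loc = parse_location("tig00153206:2240080..2245946")
print(loc.contig, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption the span covers the whole gene region on the contig, introns included; it is not the length of the coding sequence.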


Homology
GenBank top hits (accession and description, e-value, % identity, alignment)
KAA0068025.1 Cysteine-rich RLK (receptor-like protein kinase) 8 [Cucumis melo var. makuwa]   e-value: 0.0e+00   % identity: 64.09
Query:  CGICPLAKQKRLSFTSNNSLSAKAFDLVHADIWGPFSSPTYAGHSYFLTLVDDSTRYTWIFLLKHKSDVLSVVPQFFKLVETQFGCCIKQFRSDNAPELA
        C ICPLAKQ++LSFTSNN LS+ AFDL+H DIWGPFS+ TY+ +SYFLT+VDD+TRYTW+F+LK KSDV+S++P FFKL+ETQ+G  IK+ RSDNA +L 
Subjt:  CGICPLAKQKRLSFTSNNSLSAKAFDLVHADIWGPFSSPTYAGHSYFLTLVDDSTRYTWIFLLKHKSDVLSVVPQFFKLVETQFGCCIKQFRSDNAPELA

Query:  FGDFFRSKGVIHQFSCVERPQQNSVVERKHQHILNVARALYFQSRVPIRFWGDCVLTAVYLINRTPSPLLKWCTPFELLNGSKADYSSIRVFGCLCFAST
        F  FF  KGVIHQ+SCV+ PQQNSVVE+KHQHILN ARALYFQS+VP+ FWGDC++TAVYLI+RTPS LL+W  PF+ LN    DY+S++VFG LC+AS+
Subjt:  FGDFFRSKGVIHQFSCVERPQQNSVVERKHQHILNVARALYFQSRVPIRFWGDCVLTAVYLINRTPSPLLKWCTPFELLNGSKADYSSIRVFGCLCFAST

Query:  LSSHRSKFHPRAIPAVFLGYPPGMKAYKLYDIEQQKLFISRDVIFHEEIFPFHTVTEASVLPDPFPGLVLPKPFNVSTEVEPPSLSSPSPFLPDVAPVID
        L  + SKF PRAIP+VF+GYP GMK YKLYDIE +K+FISRD +  E +     + E ++                                        
Subjt:  LSSHRSKFHPRAIPAVFLGYPPGMKAYKLYDIEQQKLFISRDVIFHEEIFPFHTVTEASVLPDPFPGLVLPKPFNVSTEVEPPSLSSPSPFLPDVAPVID

Query:  TTQQPLMDAHNIVPQQPHLIDPPLVRRSARVSKPPSYLHAYHCSLLSSTSLPT-TSSRFPLQQFISYSRLSSTHRNLVLNVSTHFEPQFYHQAVVFDHWR
         +QQ          +QP+  +   +RRS R++KPPSYL AYHCSLL++ S PT  S+++P+ Q++SY  L  T++  +L  ST  E  FYH+AVV   W 
Subjt:  TTQQPLMDAHNIVPQQPHLIDPPLVRRSARVSKPPSYLHAYHCSLLSSTSLPT-TSSRFPLQQFISYSRLSSTHRNLVLNVSTHFEPQFYHQAVVFDHWR

Query:  AAMDAELAAMEDNKTWSVVPLPHGKHTIGCRWIYKIKHKADGSIERYKARLVAKGYTQQEGLDFLETFSPVAKLVTVKTLLTVAVSKGWSLMQLDVNNAF
         AM AEL AME N+TWS+VPLP GK++IGCRW+YKIKHKADGSIERYK RLVAKGYTQQEGLD+ ETFS V K+VTVKTLLT+AVSK W L+QLDVNNAF
Subjt:  AAMDAELAAMEDNKTWSVVPLPHGKHTIGCRWIYKIKHKADGSIERYKARLVAKGYTQQEGLDFLETFSPVAKLVTVKTLLTVAVSKGWSLMQLDVNNAF

Query:  LHGDLFEEVYMDLPLGYNSHVDSKGEHLVCKLHKSIYGLKQASRQWFAKFSHFLISLGFFQSKADYSLFVKGTGHSFVALLVYVDDIIITGSNASTIQHL
        LHG+LFEEVYMDLPLGY    + +GE LVC+LHKSIYGLKQASRQWF KFS FL+SLGF QSKA+YSLF++G   SF+ALLVYVDDIIITG+NA  IQ L
Subjt:  LHGDLFEEVYMDLPLGYNSHVDSKGEHLVCKLHKSIYGLKQASRQWFAKFSHFLISLGFFQSKADYSLFVKGTGHSFVALLVYVDDIIITGSNASTIQHL

Query:  KDLLNAHFKLKDLGSLKYFLGLELARSSTGIFVSQRHYTLQLLEDTGFLGCKPTSIPMDPQLKLCSSNVDLLADPSIYRRLIGRLLYLTISRPDITFAVH
        K  LN  F LKDLG LK+FLGLELAR+S+G+F+SQ++YTLQL+EDTG LG KPT +PMDP  KL +S+ D+L D + YRRLIGRLLYLTISRPDITFAVH
Subjt:  KDLLNAHFKLKDLGSLKYFLGLELARSSTGIFVSQRHYTLQLLEDTGFLGCKPTSIPMDPQLKLCSSNVDLLADPSIYRRLIGRLLYLTISRPDITFAVH

Query:  KLSQFMTKPCKSHLAAAHDLLRYLKSSPGQGIFLPAVSNFQIRAFADSDWGACLDTRRSITGFCVFLGDSLVSWKSKKQQTISRSSAEAEYRALAVTACE
        KLSQFM KP  +H+ AA+ L++YLK SPG+GI LP V +F I AFAD++WG+ LDTRRS+TGFCVFLG+SLVSWKSKKQQT++RSSAEAEY+AL VT CE
Subjt:  KLSQFMTKPCKSHLAAAHDLLRYLKSSPGQGIFLPAVSNFQIRAFADSDWGACLDTRRSITGFCVFLGDSLVSWKSKKQQTISRSSAEAEYRALAVTACE

Query:  IIWLTNLLRDLQVQVSLPALLFCDNQAAVHIASNPVFHERTKHIELDCHFIRDKLIDGTIKLLPVR
        +IWL +LL +L+++   PALLFCDNQAA++IA+NP+FHE+TKHIELDCHF+RD++IDG+IKLLPVR
Subjt:  IIWLTNLLRDLQVQVSLPALLFCDNQAAVHIASNPVFHERTKHIELDCHFIRDKLIDGTIKLLPVR

KAG7578768.1 GAG-pre-integrase domain [Arabidopsis thaliana x Arabidopsis arenosa]   e-value: 1.8e-279   % identity: 54.78
Query:  CGICPLAKQKRLSFTSNNSLSAKAFDLVHADIWGPFSSPTYAGHSYFLTLVDDSTRYTWIFLLKHKSDVLSVVPQFFKLVETQFGCCIKQFRSDNAPELA
        C  C LAKQKRLSF S N++S + FDLVH D+WGPFS  ++    YFLTLVDD TR TWI+LLK KSDV  + P F   VETQ+   +K  RSDNAPEL+
Subjt:  CGICPLAKQKRLSFTSNNSLSAKAFDLVHADIWGPFSSPTYAGHSYFLTLVDDSTRYTWIFLLKHKSDVLSVVPQFFKLVETQFGCCIKQFRSDNAPELA

Query:  FGDFFRSKGVIHQFSCVERPQQNSVVERKHQHILNVARALYFQSRVPIRFWGDCVLTAVYLINRTPSPLLKWCTPFELLNGSKADYSSIRVFGCLCFAST
        F    +SKG+ H FSCV+ PQQNSVVERKHQHILNVARAL FQS +PI++W DC+ T+VYLINRTPSPLL   TPFELL   K  YS ++ FGCLC+ ST
Subjt:  FGDFFRSKGVIHQFSCVERPQQNSVVERKHQHILNVARALYFQSRVPIRFWGDCVLTAVYLINRTPSPLLKWCTPFELLNGSKADYSSIRVFGCLCFAST

Query:  LSSHRSKFHPRAIPAVFLGYPPGMKAYKLYDIEQQKLFISRDVIFHEEIFPFHTVTEASVLPDPFPGLVLPKPFNVSTEVEPPSLSSPSPFLPDVAPVID
            R KF PRA  +VFLGYP G K YK+ ++E   + ISR+VIFHE+IFPFH+   A    D F   +LP P   +T     S+    PF P    V+ 
Subjt:  LSSHRSKFHPRAIPAVFLGYPPGMKAYKLYDIEQQKLFISRDVIFHEEIFPFHTVTEASVLPDPFPGLVLPKPFNVSTEVEPPSLSSPSPFLPDVAPVID

Query:  TTQQPLMDAHNIVPQQPHLIDPPLVRRSARVSKPPSYLHAYHCSLLSSTSLPTTSSRFPLQQFISYSRLSSTHRNLVLNVSTHFEPQFYHQAVVFDHWRA
              +D+ N +        P    R  R  + PSYL  YHC+L+      + ++  PL   + Y++L+  ++  +LN+S   EP+ + +AV  + W  
Subjt:  TTQQPLMDAHNIVPQQPHLIDPPLVRRSARVSKPPSYLHAYHCSLLSSTSLPTTSSRFPLQQFISYSRLSSTHRNLVLNVSTHFEPQFYHQAVVFDHWRA

Query:  AMDAELAAMEDNKTWSVVPLPHGKHTIGCRWIYKIKHKADGSIERYKARLVAKGYTQQEGLDFLETFSPVAKLVTVKTLLTVAVSKGWSLMQLDVNNAFL
         M+ EL    D  T+SVV LP GK  IGCRW+YKIKH ADG+I+RY+ARLVAKGYTQQEG+D+++TFSPVAKLVTVK LL ++  +GWSL Q+DV NAFL
Subjt:  AMDAELAAMEDNKTWSVVPLPHGKHTIGCRWIYKIKHKADGSIERYKARLVAKGYTQQEGLDFLETFSPVAKLVTVKTLLTVAVSKGWSLMQLDVNNAFL

Query:  HGDLFEEVYMDLPLGYNSHV-DSKGEHLVCKLHKSIYGLKQASRQWFAKFSHFLISLGFFQSKADYSLFVKGTGHSFVALLVYVDDIIITGSNASTIQHL
        HGDL EE+YMDLP GY     ++   + V +LHKS+YGLKQASRQW  KFS  L++ GF QS++D++LFVK   + F+ALLVYVDDI+I  ++ + +  L
Subjt:  HGDLFEEVYMDLPLGYNSHV-DSKGEHLVCKLHKSIYGLKQASRQWFAKFSHFLISLGFFQSKADYSLFVKGTGHSFVALLVYVDDIIITGSNASTIQHL

Query:  KDLLNAHFKLKDLGSLKYFLGLELARSSTGIFVSQRHYTLQLLEDTGFLGCKPTSIPMDPQLKLCSSNVDLLADPSIYRRLIGRLLYLTISRPDITFAVH
        K +L A FKLKDLG  KYFLGLE+AR+ +GI VSQR Y L LLE  G LGCKP S PMD  ++L + + DLL D ++YR LIG+LLYLTI+R DITFAVH
Subjt:  KDLLNAHFKLKDLGSLKYFLGLELARSSTGIFVSQRHYTLQLLEDTGFLGCKPTSIPMDPQLKLCSSNVDLLADPSIYRRLIGRLLYLTISRPDITFAVH

Query:  KLSQFMTKPCKSHLAAAHDLLRYLKSSPGQGIFLPAVSNFQIRAFADSDWGACLDTRRSITGFCVFLGDSLVSWKSKKQQTISRSSAEAEYRALAVTACE
        KLSQF+++P   HL AAH ++RYLK  PG+G+F  A S+ +++AF+D+DWG C DTRRS TGFCVFLG SL+SWKSKKQ T SRSSAE+EYRALA T CE
Subjt:  KLSQFMTKPCKSHLAAAHDLLRYLKSSPGQGIFLPAVSNFQIRAFADSDWGACLDTRRSITGFCVFLGDSLVSWKSKKQQTISRSSAEAEYRALAVTACE

Query:  IIWLTNLLRDLQVQVSLPALLFCDNQAAVHIASNPVFHERTKHIELDCHFIRDKLIDGTIKLLPVRSHSQLADPFTKPLSAALLFPLLSKMGVLDIHRPS
        ++WL+ LL+DL VQV  P  L+CD+ AA+ IASN VFHERTKHIE+DCH +RDK+  G +KL+ V + +QL D FTK L       L+S+     +  P+
Subjt:  IIWLTNLLRDLQVQVSLPALLFCDNQAAVHIASNPVFHERTKHIELDCHFIRDKLIDGTIKLLPVRSHSQLADPFTKPLSAALLFPLLSKMGVLDIHRPS

KZV25004.1 Cysteine-rich RLK (receptor-like protein kinase) 8 [Dorcoceras hygrometricum]   e-value: 5.7e-294   % identity: 56.1
Query:  IDTCGICPLAKQKRLSFTSNNSLSAKAFDLVHADIWGPFSSPTYAGHSYFLTLVDDSTRYTWIFLLKHKSDVLSVVPQFFKLVETQFGCCIKQFRSDNAP
        ++ C  C L+KQ+RL   S N++SA+ F+L+H D WGPFS  +  G  +F T+VDD +RYTW+++LK KSDVLS+ P F ++V TQFG  +K  RSDNAP
Subjt:  IDTCGICPLAKQKRLSFTSNNSLSAKAFDLVHADIWGPFSSPTYAGHSYFLTLVDDSTRYTWIFLLKHKSDVLSVVPQFFKLVETQFGCCIKQFRSDNAP

Query:  ELAFGDFFRSKGVIHQFSCVERPQQNSVVERKHQHILNVARALYFQSRVPIRFWGDCVLTAVYLINRTPSPLLKWCTPFELLNGSKADYSSIRVFGCLCF
        EL F DFF   G+ H  SCVERPQQNSVVERKHQHILNVARAL FQS +P+ +W DC+ T+VYLINRTPSP+L   TPFELL+G    YS ++VFGCLC+
Subjt:  ELAFGDFFRSKGVIHQFSCVERPQQNSVVERKHQHILNVARALYFQSRVPIRFWGDCVLTAVYLINRTPSPLLKWCTPFELLNGSKADYSSIRVFGCLCF

Query:  ASTLSSHRSKFHPRAIPAVFLGYPPGMKAYKLYDIEQQKLFISRDVIFHEEIFPFHTVTEASVLPDPFPGLVLPKPFNVSTEVEPPSLSSPSPFLPDVAP
        ASTL S R KF PRAI  VF+GYPPG K YKL ++E  ++FISRDVIFHE  FP+   +  S+              +++ EV P S  +PS  +P  A 
Subjt:  ASTLSSHRSKFHPRAIPAVFLGYPPGMKAYKLYDIEQQKLFISRDVIFHEEIFPFHTVTEASVLPDPFPGLVLPKPFNVSTEVEPPSLSSPSPFLPDVAP

Query:  VIDTTQQPLMDAHNIVPQQPHLIDPPLVRRSARVSKPPSYLHAYHCSLLSSTSLPTTSSRFPLQQFISYSRLSSTHRNLVLNVSTHFEPQFYHQAVVFDH
                          Q H        R++R    PS+L  YHC  +S+    +TS+  P+   ++YS+LSS+HR  V N+S+  EP  + QAV    
Subjt:  VIDTTQQPLMDAHNIVPQQPHLIDPPLVRRSARVSKPPSYLHAYHCSLLSSTSLPTTSSRFPLQQFISYSRLSSTHRNLVLNVSTHFEPQFYHQAVVFDH

Query:  WRAAMDAELAAMEDNKTWSVVPLPHGKHTIGCRWIYKIKHKADGSIERYKARLVAKGYTQQEGLDFLETFSPVAKLVTVKTLLTVAVSKGWSLMQLDVNN
        WR AMD EL A+E N TWS+V LP GK  +GCRW+YK K  ADGS++RYKARLVAKGYTQQEGLD+LETFSPVAKLVTV+TLL +A  +GW L+QLDVNN
Subjt:  WRAAMDAELAAMEDNKTWSVVPLPHGKHTIGCRWIYKIKHKADGSIERYKARLVAKGYTQQEGLDFLETFSPVAKLVTVKTLLTVAVSKGWSLMQLDVNN

Query:  AFLHGDLFEEVYMDLPLGYNSHVDSKGEHLVCKLHKSIYGLKQASRQWFAKFSHFLISLGFFQSKADYSLFVKGTGHSFVALLVYVDDIIITGSNASTIQ
        AFLHGDL EEVYM LP G+ S  +      VCKLHKSIYGLKQASRQWFAKFS  L+S+GF QS AD SLF++   + F+AL+VYVDDI+I  ++ +   
Subjt:  AFLHGDLFEEVYMDLPLGYNSHVDSKGEHLVCKLHKSIYGLKQASRQWFAKFSHFLISLGFFQSKADYSLFVKGTGHSFVALLVYVDDIIITGSNASTIQ

Query:  HLKDLLNAHFKLKDLGSLKYFLGLELARSSTGIFVSQRHYTLQLLEDTGFLGCKPTSIPMDPQLKLCSSNVDLLADPSIYRRLIGRLLYLTISRPDITFA
         LKD LN+ FKLKDLG+LKYFLG+E+ARS+ G+ + QR+Y + LL + G LGCKP + PM+   KL   + ++L+DP+ YRRLIGRLLYLTI+RPD+ FA
Subjt:  HLKDLLNAHFKLKDLGSLKYFLGLELARSSTGIFVSQRHYTLQLLEDTGFLGCKPTSIPMDPQLKLCSSNVDLLADPSIYRRLIGRLLYLTISRPDITFA

Query:  VHKLSQFMTKPCKSHLAAAHDLLRYLKSSPGQGIFLPAVSNFQIRAFADSDWGACLDTRRSITGFCVFLGDSLVSWKSKKQQTISRSSAEAEYRALAVTA
        V+KLSQ+++ P   H+ AA ++L+Y+K + GQG+F  + S+ ++RAF+D+DWGACLDTRRS+TG+CVFLG+SL+SW++KKQQT+SRSSAEAEYR+LA + 
Subjt:  VHKLSQFMTKPCKSHLAAAHDLLRYLKSSPGQGIFLPAVSNFQIRAFADSDWGACLDTRRSITGFCVFLGDSLVSWKSKKQQTISRSSAEAEYRALAVTA

Query:  CEIIWLTNLLRDLQVQVSLPALLFCDNQAAVHIASNPVFHERTKHIELDCHFIRDKLIDGTIKLLPVRSHSQLADPFTKPLSAALLFPLLSKMGVLDIHR
        CEI+W+  LL DL V  + P +LFCD+QAAVHIASNPVFHERTKHI++DCH +R+K+    +KL+ V S  QLAD FTKPL  +    LL KMGV +IH 
Subjt:  CEIIWLTNLLRDLQVQVSLPALLFCDNQAAVHIASNPVFHERTKHIELDCHFIRDKLIDGTIKLLPVRSHSQLADPFTKPLSAALLFPLLSKMGVLDIHR

Query:  PS
         S
Subjt:  PS

KZV53534.1 hypothetical protein F511_42283 [Dorcoceras hygrometricum]   e-value: 3.8e-290   % identity: 54.4
Query:  YHQVHANIDTCGICPLAKQKRLSFTSNNSLSAKAFDLVHADIWGPFSSPTYAGHSYFLTLVDDSTRYTWIFLLKHKSDVLSVVPQFFKLVETQFGCCIKQ
        +     +I+ C +C ++KQKRL F S+N  +A +F+L+H D+WGPFS  +  G+ +FLT+VDD TR+TW+++L+ KS+V S+ P F ++V+TQFG  IK 
Subjt:  YHQVHANIDTCGICPLAKQKRLSFTSNNSLSAKAFDLVHADIWGPFSSPTYAGHSYFLTLVDDSTRYTWIFLLKHKSDVLSVVPQFFKLVETQFGCCIKQ

Query:  FRSDNAPELAFGDFFRSKGVIHQFSCVERPQQNSVVERKHQHILNVARALYFQSRVPIRFWGDCVLTAVYLINRTPSPLLKWCTPFELLNGSKADYSSIR
         RSDNAP+L F + F   G++H +SCVERPQQNS+VERKHQHILNVARAL FQS VPI +W DC++T++YLINRTPS +L   TPFELL+G    YS ++
Subjt:  FRSDNAPELAFGDFFRSKGVIHQFSCVERPQQNSVVERKHQHILNVARALYFQSRVPIRFWGDCVLTAVYLINRTPSPLLKWCTPFELLNGSKADYSSIR

Query:  VFGCLCFASTLSSHRSKFHPRAIPAVFLGYPPGMKAYKLYDIEQQKLFISRDVIFHEEIFPFHTVTEASVLPD-PFPGLVLPKPFNVSTEVEPPSLSSPS
        +FGCLC+ASTL S R KF PRAI  VFLGYPPG + YKL +++  ++ IS DVIFHE  FPF     +   P   F   +LP    ++           S
Subjt:  VFGCLCFASTLSSHRSKFHPRAIPAVFLGYPPGMKAYKLYDIEQQKLFISRDVIFHEEIFPFHTVTEASVLPD-PFPGLVLPKPFNVSTEVEPPSLSSPS

Query:  PFLPDVAPVIDTTQQPLMDAHNIVPQQPHLIDPPLVRRSARVSKPPSYLHAYHCSLLSSTSLPTTSSRFPLQQFISYSRLSSTHRNLVLNVSTHFEPQFY
          +PD  P+   ++Q                      RS R+ +PP +L  YHC + SS+  P+TS+  PL  F++YS+LS  HRNLV N+S+  EP  +
Subjt:  PFLPDVAPVIDTTQQPLMDAHNIVPQQPHLIDPPLVRRSARVSKPPSYLHAYHCSLLSSTSLPTTSSRFPLQQFISYSRLSSTHRNLVLNVSTHFEPQFY

Query:  HQAVVFDHWRAAMDAELAAMEDNKTWSVVPLPHGKHTIGCRWIYKIKHKADGSIERYKARLVAKGYTQQEGLDFLETFSPVAKLVTVKTLLTVAVSKGWS
         QAV    W+ AM  EL A+E N TWS+V LP GK  +GCRW+YK K  ADGS++RYKARLVAKGYTQQEGLD+LETFSPVAK+VTV+TLL +A ++GWS
Subjt:  HQAVVFDHWRAAMDAELAAMEDNKTWSVVPLPHGKHTIGCRWIYKIKHKADGSIERYKARLVAKGYTQQEGLDFLETFSPVAKLVTVKTLLTVAVSKGWS

Query:  LMQLDVNNAFLHGDLFEEVYMDLPLGYNSHVDSKGEHLVCKLHKSIYGLKQASRQWFAKFSHFLISLGFFQSKADYSLFVKGTGHSFVALLVYVDDIIIT
        L+QL V+NAFLHG+L EEVYM LP GY+S         VCKLHKS+YGLKQASRQWFAKFS  L+S+GF QS AD SLF+K   + F+ LLVYVDDI+I 
Subjt:  LMQLDVNNAFLHGDLFEEVYMDLPLGYNSHVDSKGEHLVCKLHKSIYGLKQASRQWFAKFSHFLISLGFFQSKADYSLFVKGTGHSFVALLVYVDDIIIT

Query:  GSNASTIQHLKDLLNAHFKLKDLGSLKYFLGLELARSSTGIFVSQRHYTLQLLEDTGFLGCKPTSIPMDPQLKLCSSNVDLLADPSIYRRLIGRLLYLTI
         +N      LK  LN  FKLKDLG LKYFLG+E+ARSS GI + QR+Y +  L + G +GC+P S PM+  +K+   + ++L DPS YRRLIGRLLYLT+
Subjt:  GSNASTIQHLKDLLNAHFKLKDLGSLKYFLGLELARSSTGIFVSQRHYTLQLLEDTGFLGCKPTSIPMDPQLKLCSSNVDLLADPSIYRRLIGRLLYLTI

Query:  SRPDITFAVHKLSQFMTKPCKSHLAAAHDLLRYLKSSPGQGIFLPAVSNFQIRAFADSDWGACLDTRRSITGFCVFLGDSLVSWKSKKQQTISRSSAEAE
        +RPD+ FAV+KLSQ+++KP   H+ AA ++LRY+K + GQG++  + S+ +++ F+D+DWGACLDTRRS+TG+CVFLG+S++SW++KKQ T+SRSSAEAE
Subjt:  SRPDITFAVHKLSQFMTKPCKSHLAAAHDLLRYLKSSPGQGIFLPAVSNFQIRAFADSDWGACLDTRRSITGFCVFLGDSLVSWKSKKQQTISRSSAEAE

Query:  YRALAVTACEIIWLTNLLRDLQVQVSLPALLFCDNQAAVHIASNPVFHERTKHIELDCHFIRDKLIDGTIKLLPVRSHSQLADPFTKPLSAALLFPLLSK
        YR++A   CEI+W+ +LL DL V+   PA LFCD+QAA+HIASNPVFHERTKHI++DCH IR+K+  G +KL+ V S  QLAD FTK L  +    LLSK
Subjt:  YRALAVTACEIIWLTNLLRDLQVQVSLPALLFCDNQAAVHIASNPVFHERTKHIELDCHFIRDKLIDGTIKLLPVRSHSQLADPFTKPLSAALLFPLLSK

Query:  MGVLDIHRPS
        MG+ +IH  S
Subjt:  MGVLDIHRPS

TQE07233.1 hypothetical protein C1H46_007143 [Malus baccata]   e-value: 2.0e-278   % identity: 53.31
Query:  LYHQVHANIDTCGICPLAKQKRLSFTSNNSLSAKAFDLVHADIWGPFSSPTYAGHSYFLTLVDDSTRYTWIFLLKHKSDVLSVVPQFFKLVETQFGCCIK
        L H   A+ + C ICPLAKQ RL FT ++  S + F L+H DIWGPF   + +   YFLT+VDD +R+TWIFL++HK D  S++  FF  V+TQF   I+
Subjt:  LYHQVHANIDTCGICPLAKQKRLSFTSNNSLSAKAFDLVHADIWGPFSSPTYAGHSYFLTLVDDSTRYTWIFLLKHKSDVLSVVPQFFKLVETQFGCCIK

Query:  QFRSDNAPE-LAFGDFFRSKGVIHQFSCVERPQQNSVVERKHQHILNVARALYFQSRVPIRFWGDCVLTAVYLINRTPSPLLKWCTPFELLNGSKADYSS
          R DN  E  +  DFF+  GVI+Q SCV  PQQN VVERKH+HIL  ARAL FQ+ +P+ FWG+CVLTAV+LINR P+P+L   TPFE L      +S 
Subjt:  QFRSDNAPE-LAFGDFFRSKGVIHQFSCVERPQQNSVVERKHQHILNVARALYFQSRVPIRFWGDCVLTAVYLINRTPSPLLKWCTPFELLNGSKADYSS

Query:  IRVFGCLCFASTLSSHRSKFHPRAIPAVFLGYPPGMKAYKLYDIEQQKLFISRDVIFHEEIFPFHTVTEAS----------VLPDPFPGLVLPK---PFN
        +RVFGCL +A+ ++    KF PRA   +FLGYP G KAYKLY++  +++F SRDV+FHE IFPFHT   +S          +L  P    VLP    P N
Subjt:  IRVFGCLCFASTLSSHRSKFHPRAIPAVFLGYPPGMKAYKLYDIEQQKLFISRDVIFHEEIFPFHTVTEAS----------VLPDPFPGLVLPK---PFN

Query:  VSTEVEPPSLS-SPSPFLPDVAPVIDTTQQPLMDAHNIVP--QQPHLIDPPLVRRSARVSKPPSYLHAYHCSLLSSTSLPTTSS---------RFPLQQF
            VEP S + + SP  P  AP+      P     ++ P   +P     P++RRS+R S PP  L  Y C  L  T  P   S         R+PL  +
Subjt:  VSTEVEPPSLS-SPSPFLPDVAPVIDTTQQPLMDAHNIVP--QQPHLIDPPLVRRSARVSKPPSYLHAYHCSLLSSTSLPTTSS---------RFPLQQF

Query:  ISYSRLSSTHRNLVLNVSTHFEPQFYHQAVVFDHWRAAMDAELAAMEDNKTWSVVPLPHGKHTIGCRWIYKIKHKADGSIERYKARLVAKGYTQQEGLDF
        ++Y R S T ++ + +++   EP+ Y  A  F HW+AAM +ELAA+E N TWS+  LP GK  IGCRW+YKIKH++DGSIERYKARLVAKGYTQ EG+DF
Subjt:  ISYSRLSSTHRNLVLNVSTHFEPQFYHQAVVFDHWRAAMDAELAAMEDNKTWSVVPLPHGKHTIGCRWIYKIKHKADGSIERYKARLVAKGYTQQEGLDF

Query:  LETFSPVAKLVTVKTLLTVAVSKGWSLMQLDVNNAFLHGDLFEEVYMDLPLGYNSHVDSKGEHLVCKLHKSIYGLKQASRQWFAKFSHFLISLGFFQSKA
         +TFSP AK+VTV+ LL +AVS+ WSL QLDV+NAFLHGDL EE+YM LP G    +  +GE+LVC+L KS+YGLKQASRQWFAKFS  + S G+ QSK+
Subjt:  LETFSPVAKLVTVKTLLTVAVSKGWSLMQLDVNNAFLHGDLFEEVYMDLPLGYNSHVDSKGEHLVCKLHKSIYGLKQASRQWFAKFSHFLISLGFFQSKA

Query:  DYSLFVKGTGHSFVALLVYVDDIIITGSNASTIQHLKDLLNAHFKLKDLGSLKYFLGLELARSSTGIFVSQRHYTLQLLEDTGFLGCKPTSIPMDPQLKL
        DYSLF +  G SF ALL+YVDDI+ITG+++STI  LK  LN  FK+KDLG LKYFLG+E++RS  GIF+SQR Y L++L+D G LG +P   PM+  LKL
Subjt:  DYSLFVKGTGHSFVALLVYVDDIIITGSNASTIQHLKDLLNAHFKLKDLGSLKYFLGLELARSSTGIFVSQRHYTLQLLEDTGFLGCKPTSIPMDPQLKL

Query:  CSSNVDLLADPSIYRRLIGRLLYLTISRPDITFAVHKLSQFMTKPCKSHLAAAHDLLRYLKSSPGQGIFLPAVSNFQIRAFADSDWGACLDTRRSITGFC
         S N  +L DP+ YRRL+GRL+YLTI+RPDIT++VH LS+FM +P   H+ AA  +L+Y+K++PGQG+F  A ++  + A+ DSDW  C  TRRS TG+C
Subjt:  CSSNVDLLADPSIYRRLIGRLLYLTISRPDITFAVHKLSQFMTKPCKSHLAAAHDLLRYLKSSPGQGIFLPAVSNFQIRAFADSDWGACLDTRRSITGFC

Query:  VFLGDSLVSWKSKKQQTISRSSAEAEYRALAVTACEIIWLTNLLRDLQVQVSLPALLFCDNQAAVHIASNPVFHERTKHIELDCHFIRDKLIDGTIKLLP
        +FLG SLVSWKSK+Q+T+S SSAEAEYRA+A   CE+ WL  LL+DL +    PALL CDN+AA+HIA+NPVFHERT+HIE+DCHFIRDK+ DG++    
Subjt:  VFLGDSLVSWKSKKQQTISRSSAEAEYRALAVTACEIIWLTNLLRDLQVQVSLPALLFCDNQAAVHIASNPVFHERTKHIELDCHFIRDKLIDGTIKLLP

Query:  VRSHSQLADPFTKPLSAALLFPLLSKMGVLDIHRPS
        V S +QLAD  TKPL       ++ K+GVL+IH P+
Subjt:  VRSHSQLADPFTKPLSAALLFPLLSKMGVLDIHRPS
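
In the alignments above, the unlabelled line between each Query and Subjt pair is the match line: it repeats the residue where the two sequences are identical, shows `+` for a conservative substitution, and is blank for a mismatch or gap. The following is a minimal sketch of estimating percent identity for one block from the Query line and its match line. The function name `midline_identity` is hypothetical. The figure reported for each hit is calculated over the whole alignment, so the value for a single block will generally differ from it.

```python
def midline_identity(query: str, midline: str) -> float:
    """Percent of alignment columns whose match-line character repeats the query residue.

    Both strings must already have the 'Query:' label and leading
    indentation removed, so that column i of one lines up with
    column i of the other.
    """
    if not query:
        raise ValueError("query block is empty")
    if len(midline) > len(query):
        raise ValueError("match line is longer than the query block")
    # Trailing spaces are often trimmed when a page is saved as text;
    # pad the match line back to the width of the query block.
    midline = midline.ljust(len(query))
    identical = sum(
        1
        for q, m in zip(query, midline)
        if q.isalpha() and m == q  # gaps ('-') never count as identical
    )
    return 100.0 * identical / len(query)


# First twelve columns of the first block of the KAA0068025.1 alignment.
print(midline_identity("CGICPLAKQKRL", "C ICPLAKQ++L"))
```

Counting `+` columns in the same way would give the fraction of conservative substitutions, which is what BLAST reports as "positives" together with the identities.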

TrEMBL top hits (accession and description, e-value, % identity, alignment)
A0A2N9EHN7 Integrase catalytic domain-containing protein   e-value: 2.1e-294   % identity: 57.49
Query:  TCGICPLAKQKRLSFTSNNSLSAKAFDLVHADIWGPFSSPTYAGHSYFLTLVDDSTRYTWIFLLKHKSDVLSVVPQFFKLVETQFGCCIKQFRSDNAPEL
        TC +CPLAKQKRL F + N LS  +FDL+H DIWGP+  PT  G+ YFLTLVDD TR TWI+L++ KSD   ++  F  +++TQF   IKQ RSDN  E 
Subjt:  TCGICPLAKQKRLSFTSNNSLSAKAFDLVHADIWGPFSSPTYAGHSYFLTLVDDSTRYTWIFLLKHKSDVLSVVPQFFKLVETQFGCCIKQFRSDNAPEL

Query:  AFGDFFRSKGVIHQFSCVERPQQNSVVERKHQHILNVARALYFQSRVPIRFWGDCVLTAVYLINRTPSPLLKWCTPFELLNGSKADYSSIRVFGCLCFAS
           +F+ SKG+IHQ SCVE PQQNSVVERKHQHILNVAR+L FQS +P+++WG C+ TAVYLINR P P+L   +PFE L      Y+ ++VFGCLCFAS
Subjt:  AFGDFFRSKGVIHQFSCVERPQQNSVVERKHQHILNVARALYFQSRVPIRFWGDCVLTAVYLINRTPSPLLKWCTPFELLNGSKADYSSIRVFGCLCFAS

Query:  TLSSHRSKFHPRAIPAVFLGYPPGMKAYKLYDIEQQKLFISRDVIFHEEIFPFHT-------VTEASVLPDPFPGL--VLPKPFNVSTEVEP--------
        TLSSHR+KF PRA   VFLGYP G+K YKL D+   K+FISRDV+FHE IFPF T        T  +  P+P       +P    ++ ++ P        
Subjt:  TLSSHRSKFHPRAIPAVFLGYPPGMKAYKLYDIEQQKLFISRDVIFHEEIFPFHT-------VTEASVLPDPFPGL--VLPKPFNVSTEVEP--------

Query:  --PSLS-SPSPFLPDVAPVIDTTQQPLMDAHNIVPQQPHLIDPPLVRRSARVSKPPSYLHAYHCSL---LSSTSLPTTSSR---FPLQQFISYSRLSSTH
          PS+S SP PF  D++P +D T        +I    P       +RRS RV KPP+YL  YHC L   + STS P  +S    +PL   +SY  LS TH
Subjt:  --PSLS-SPSPFLPDVAPVIDTTQQPLMDAHNIVPQQPHLIDPPLVRRSARVSKPPSYLHAYHCSL---LSSTSLPTTSSR---FPLQQFISYSRLSSTH

Query:  RNLVLNVSTHFEPQFYHQAVVFDHWRAAMDAELAAMEDNKTWSVVPLPHGKHTIGCRWIYKIKHKADGSIERYKARLVAKGYTQQEGLDFLETFSPVAKL
        RN  L+V+   EP  +HQA    HW+ AM AELAA+E N TW++ PLP GKH IGC+W+YK+K K+DGS+ERYKARLVAKGYTQQEGLD+ ETFSPVAK 
Subjt:  RNLVLNVSTHFEPQFYHQAVVFDHWRAAMDAELAAMEDNKTWSVVPLPHGKHTIGCRWIYKIKHKADGSIERYKARLVAKGYTQQEGLDFLETFSPVAKL

Query:  VTVKTLLTVAVSKGWSLMQLDVNNAFLHGDLFEEVYMDLPLGYNSHVDSKGE--HLVCKLHKSIYGLKQASRQWFAKFSHFLISLGFFQSKADYSLFVKG
         TV+TLL VA +K WSL QLDVNNAFLHGDL EEVYM LPLG+     SKGE  +LVCKL+KS+YGLKQASRQWFAKFS  +I  GF QS +DYSLF + 
Subjt:  VTVKTLLTVAVSKGWSLMQLDVNNAFLHGDLFEEVYMDLPLGYNSHVDSKGE--HLVCKLHKSIYGLKQASRQWFAKFSHFLISLGFFQSKADYSLFVKG

Query:  TGHSFVALLVYVDDIIITGSNASTIQHLKDLLNAHFKLKDLGSLKYFLGLELARSSTGIFVSQRHYTLQLLEDTGFLGCKPTSIPMDPQLKLCSSNVDLL
         G  F+ALLVYVDDI+I  ++  ++  LKD L+A FKLKDLG+LK+FLGLE+ARS+ GI + QR Y L +L D+G LG KP + PM+  LK+  S  ++L
Subjt:  TGHSFVALLVYVDDIIITGSNASTIQHLKDLLNAHFKLKDLGSLKYFLGLELARSSTGIFVSQRHYTLQLLEDTGFLGCKPTSIPMDPQLKLCSSNVDLL

Query:  ADPSIYRRLIGRLLYLTISRPDITFAVHKLSQFMTKPCKSHLAAAHDLLRYLKSSPGQGIFLPAVSNFQIRAFADSDWGACLDTRRSITGFCVFLGDSLV
        ADPS YRRLIGRLLYLT++RPDI+++V +LSQFM+KP   HL AA+ +LRY+K + GQG+F P+ S+ Q++AF+DSDW  C DTRRSITG+CV++G SL+
Subjt:  ADPSIYRRLIGRLLYLTISRPDITFAVHKLSQFMTKPCKSHLAAAHDLLRYLKSSPGQGIFLPAVSNFQIRAFADSDWGACLDTRRSITGFCVFLGDSLV

Query:  SWKSKKQQTISRSSAEAEYRALAVTACEIIWLTNLLRDLQVQVSLPALLFCDNQAAVHIASNPVFHERTKHIELDCHFIRDKLIDGTIKLLPVRSHSQLA
        SWKSKKQ T+SRSSAEAEYRA+A   CE++WL  LL +LQ      ALLFCD+QAA+HIA+NPV+HERTKHIELDCH IR+K+  G ++ L V S +QLA
Subjt:  SWKSKKQQTISRSSAEAEYRALAVTACEIIWLTNLLRDLQVQVSLPALLFCDNQAAVHIASNPVFHERTKHIELDCHFIRDKLIDGTIKLLPVRSHSQLA

Query:  DPFTKPLS
        D  TK L+
Subjt:  DPFTKPLS

A0A2N9H2Y3 Integrase catalytic domain-containing protein   e-value: 1.6e-297   % identity: 57.72
Query:  TCGICPLAKQKRLSFTSNNSLSAKAFDLVHADIWGPFSSPTYAGHSYFLTLVDDSTRYTWIFLLKHKSDVLSVVPQFFKLVETQFGCCIKQFRSDNAPEL
        TC +CPLAKQ++L F +NN LS K+FDL+H DIWGP+  PT  G+ YFLTLVDD TR TWI+L++ KSD  +++  F  ++ TQF   IKQ RSDN  E 
Subjt:  TCGICPLAKQKRLSFTSNNSLSAKAFDLVHADIWGPFSSPTYAGHSYFLTLVDDSTRYTWIFLLKHKSDVLSVVPQFFKLVETQFGCCIKQFRSDNAPEL

Query:  AFGDFFRSKGVIHQFSCVERPQQNSVVERKHQHILNVARALYFQSRVPIRFWGDCVLTAVYLINRTPSPLLKWCTPFELLNGSKADYSSIRVFGCLCFAS
           DF+ SKG+IHQ SCVE PQQNSVVERKHQHILNVARAL FQS +P+++WG C+ TAVYLINR P P+L   +PFE L      Y+ ++VFGCLCFAS
Subjt:  AFGDFFRSKGVIHQFSCVERPQQNSVVERKHQHILNVARALYFQSRVPIRFWGDCVLTAVYLINRTPSPLLKWCTPFELLNGSKADYSSIRVFGCLCFAS

Query:  TLSSHRSKFHPRAIPAVFLGYPPGMKAYKLYDIEQQKLFISRDVIFHEEIFPFHTVTEASVLPDPFPGL-VLPKPFNVSTEVEPPS---------LSSPS
        TLS HR+KF PRA    FLGYP G+K YKL ++   K+ ISRDV+FHE IFPF   T    LPD    L   P+P + +    PPS          S+P+
Subjt:  TLSSHRSKFHPRAIPAVFLGYPPGMKAYKLYDIEQQKLFISRDVIFHEEIFPFHTVTEASVLPDPFPGL-VLPKPFNVSTEVEPPS---------LSSPS

Query:  PFLPDVAPVIDTTQQPLMDAHN--IVPQQPHL--------IDPPLVRRSARVSKPPSYLHAYHCSL---LSSTSLPTTSSR---FPLQQFISYSRLSSTH
        P  P V+  +       +  HN    P   H+        +  PL RRS RV KPP+YL  YHC L   + STS P  +S    +PL   +SY  LS TH
Subjt:  PFLPDVAPVIDTTQQPLMDAHN--IVPQQPHL--------IDPPLVRRSARVSKPPSYLHAYHCSL---LSSTSLPTTSSR---FPLQQFISYSRLSSTH

Query:  RNLVLNVSTHFEPQFYHQAVVFDHWRAAMDAELAAMEDNKTWSVVPLPHGKHTIGCRWIYKIKHKADGSIERYKARLVAKGYTQQEGLDFLETFSPVAKL
        RN  L+V+   EP F+HQA    HW+ AM AELAA+E N TW++ PLP GKH IGC+W+YK+K K+DGS+ERYKARLVAKGYTQQEGLD+ ETFSPVAK 
Subjt:  RNLVLNVSTHFEPQFYHQAVVFDHWRAAMDAELAAMEDNKTWSVVPLPHGKHTIGCRWIYKIKHKADGSIERYKARLVAKGYTQQEGLDFLETFSPVAKL

Query:  VTVKTLLTVAVSKGWSLMQLDVNNAFLHGDLFEEVYMDLPLGYNSHVDSKGE--HLVCKLHKSIYGLKQASRQWFAKFSHFLISLGFFQSKADYSLFVKG
         TV+TLL VA  K WSL QLDVNNAFLHGDL EEVYM LP G+     SKGE  +LVCKL+KS+YGLKQASRQWFAKFS  +I  GF QSK+DYSLF + 
Subjt:  VTVKTLLTVAVSKGWSLMQLDVNNAFLHGDLFEEVYMDLPLGYNSHVDSKGE--HLVCKLHKSIYGLKQASRQWFAKFSHFLISLGFFQSKADYSLFVKG

Query:  TGHSFVALLVYVDDIIITGSNASTIQHLKDLLNAHFKLKDLGSLKYFLGLELARSSTGIFVSQRHYTLQLLEDTGFLGCKPTSIPMDPQLKLCSSNVDLL
         G +F+ALLVYVDDI+I  ++   +  LKD L+A FKLKDLG+LKYFLGLE+ARS+ GI + QR Y L +L D+G LG KP + PM+  LK+  S  ++L
Subjt:  TGHSFVALLVYVDDIIITGSNASTIQHLKDLLNAHFKLKDLGSLKYFLGLELARSSTGIFVSQRHYTLQLLEDTGFLGCKPTSIPMDPQLKLCSSNVDLL

Query:  ADPSIYRRLIGRLLYLTISRPDITFAVHKLSQFMTKPCKSHLAAAHDLLRYLKSSPGQGIFLPAVSNFQIRAFADSDWGACLDTRRSITGFCVFLGDSLV
         DPS YRRL+GRLLYLT++RPDI+++V KLSQFM+KP   HL+AA+ +LRY+K + GQG+F P+ S  Q++AF+DSDW  CLDTRRSITG+CV++GDSL+
Subjt:  ADPSIYRRLIGRLLYLTISRPDITFAVHKLSQFMTKPCKSHLAAAHDLLRYLKSSPGQGIFLPAVSNFQIRAFADSDWGACLDTRRSITGFCVFLGDSLV

Query:  SWKSKKQQTISRSSAEAEYRALAVTACEIIWLTNLLRDLQVQVSLPALLFCDNQAAVHIASNPVFHERTKHIELDCHFIRDKLIDGTIKLLPVRSHSQLA
        SWKSKKQ T+SRSSAEAEYRA+A   CE++WL  LL +LQ      ALLFCD+QAA+HIA+NPV+HERTKHIELDCH IR+K+  G ++ L V S +QLA
Subjt:  SWKSKKQQTISRSSAEAEYRALAVTACEIIWLTNLLRDLQVQVSLPALLFCDNQAAVHIASNPVFHERTKHIELDCHFIRDKLIDGTIKLLPVRSHSQLA

Query:  DPFTKPLSAALLFPLLSKMG
        D  TK L +A    LL KMG
Subjt:  DPFTKPLSAALLFPLLSKMG

A0A2N9IZK3 Uncharacterized protein   e-value: 3.7e-299   % identity: 56.88
Query:  HANIDT--CGICPLAKQKRLSFTSNNSLSAKAFDLVHADIWGPFSSPTYAGHSYFLTLVDDSTRYTWIFLLKHKSDVLSVVPQFFKLVETQFGCCIKQFR
        H+  DT  C +CPLAKQKRL F +NN +S+ AFD++H DIWGP+  PT  G+ YFLTLVDD TR TW++L+K KS+   ++  F  +++TQFG  +K  R
Subjt:  HANIDT--CGICPLAKQKRLSFTSNNSLSAKAFDLVHADIWGPFSSPTYAGHSYFLTLVDDSTRYTWIFLLKHKSDVLSVVPQFFKLVETQFGCCIKQFR

Query:  SDNAPELAFGDFFRSKGVIHQFSCVERPQQNSVVERKHQHILNVARALYFQSRVPIRFWGDCVLTAVYLINRTPSPLLKWCTPFELLNGSKADYSSIRVF
        SDN  E +  DF+ ++G+IHQ SCVE PQQNSVVERKHQHILNVAR+L FQS +P++FWG  VLTAVYLINR PSP+L   +P+E L      YS +RVF
Subjt:  SDNAPELAFGDFFRSKGVIHQFSCVERPQQNSVVERKHQHILNVARALYFQSRVPIRFWGDCVLTAVYLINRTPSPLLKWCTPFELLNGSKADYSSIRVF

Query:  GCLCFASTLSSHRSKFHPRAIPAVFLGYPPGMKAYKLYDIEQQKLFISRDVIFHEEIFPFHTVTEASV------LPDPFP-----------------GLV
        GCLCFASTLS+HR+KF PRA P VFLGYP G+K YKL D+    + ISRDVIFHE +FPF     A        LP   P                 GL 
Subjt:  GCLCFASTLSSHRSKFHPRAIPAVFLGYPPGMKAYKLYDIEQQKLFISRDVIFHEEIFPFHTVTEASV------LPDPFP-----------------GLV

Query:  LPKPFNVSTEVEPPSLSSPSPFLPDVAPVIDTTQQPLMDAHNIVPQQPHLIDPPLVRRSARVSKPPSYLHAYHCSL------LSSTSLPTTSSRFPLQQF
          +P +VST    P L+SPS   P +              H  VP     +  PL RRS RVSKPP+YL  YHC +       SS+S  +T + +PL   
Subjt:  LPKPFNVSTEVEPPSLSSPSPFLPDVAPVIDTTQQPLMDAHNIVPQQPHLIDPPLVRRSARVSKPPSYLHAYHCSL------LSSTSLPTTSSRFPLQQF

Query:  ISYSRLSSTHRNLVLNVSTHFEPQFYHQAVVFDHWRAAMDAELAAMEDNKTWSVVPLPHGKHTIGCRWIYKIKHKADGSIERYKARLVAKGYTQQEGLDF
        +SY  LS +HR   L+V+   EP  + QA    HWR AM  EL A+E N TWS+  LP GKH IGC+W+YK+K KADGS+ERYKARLVAKGYTQQEGLD+
Subjt:  ISYSRLSSTHRNLVLNVSTHFEPQFYHQAVVFDHWRAAMDAELAAMEDNKTWSVVPLPHGKHTIGCRWIYKIKHKADGSIERYKARLVAKGYTQQEGLDF

Query:  LETFSPVAKLVTVKTLLTVAVSKGWSLMQLDVNNAFLHGDLFEEVYMDLPLGYNSHVDSKGE-HLVCKLHKSIYGLKQASRQWFAKFSHFLISLGFFQSK
         ETFSPVAK  TV+TLL +A ++ WSL QLDVNNAFLHGDL EEVYM LP G+     SKGE +LVCKL KS+YGLKQASRQWFAKFS  LI  GF QSK
Subjt:  LETFSPVAKLVTVKTLLTVAVSKGWSLMQLDVNNAFLHGDLFEEVYMDLPLGYNSHVDSKGE-HLVCKLHKSIYGLKQASRQWFAKFSHFLISLGFFQSK

Query:  ADYSLFVKGTGHSFVALLVYVDDIIITGSNASTIQHLKDLLNAHFKLKDLGSLKYFLGLELARSSTGIFVSQRHYTLQLLEDTGFLGCKPTSIPMDPQLK
        +DYSLF +  G +F+ LLVYVDDI+I  +N + +  LKD L+A FKLKDLG+LKYFLGLE+ARSS GI + QR Y L +L D+G LG KP   PM+ +LK
Subjt:  ADYSLFVKGTGHSFVALLVYVDDIIITGSNASTIQHLKDLLNAHFKLKDLGSLKYFLGLELARSSTGIFVSQRHYTLQLLEDTGFLGCKPTSIPMDPQLK

Query:  LCSSNVDLLADPSIYRRLIGRLLYLTISRPDITFAVHKLSQFMTKPCKSHLAAAHDLLRYLKSSPGQGIFLPAVSNFQIRAFADSDWGACLDTRRSITGF
        L  S+ D L+DPS YRRL+GRLLYLT++RPDI+++V +LSQFM KP  +HLAAA+ +L+Y+K + GQG+F P+ ++  +++F+DSDW +C DTRRS+TG+
Subjt:  LCSSNVDLLADPSIYRRLIGRLLYLTISRPDITFAVHKLSQFMTKPCKSHLAAAHDLLRYLKSSPGQGIFLPAVSNFQIRAFADSDWGACLDTRRSITGF

Query:  CVFLGDSLVSWKSKKQQTISRSSAEAEYRALAVTACEIIWLTNLLRDLQVQVSLPALLFCDNQAAVHIASNPVFHERTKHIELDCHFIRDKLIDGTIKLL
        CVFLG+SL+SWKSKKQ TISRSSAEAEYRA+A   CE++WL  LL++LQV     ALL+CD+QAA+HIA+NPVFHERTKHIELDCH IR+K+ DG ++ L
Subjt:  CVFLGDSLVSWKSKKQQTISRSSAEAEYRALAVTACEIIWLTNLLRDLQVQVSLPALLFCDNQAAVHIASNPVFHERTKHIELDCHFIRDKLIDGTIKLL

Query:  PVRSHSQLADPFTKPLSAALLFPLLSKMGVLDIHRPS
         V S +QLAD  TK L +     LLSKMGV +I+ PS
Subjt:  PVRSHSQLADPFTKPLSAALLFPLLSKMGVLDIHRPS

A0A2Z7AT15 Cysteine-rich RLK (Receptor-like protein kinase) 8   e-value: 2.7e-294   % identity: 56.1
Query:  IDTCGICPLAKQKRLSFTSNNSLSAKAFDLVHADIWGPFSSPTYAGHSYFLTLVDDSTRYTWIFLLKHKSDVLSVVPQFFKLVETQFGCCIKQFRSDNAP
        ++ C  C L+KQ+RL   S N++SA+ F+L+H D WGPFS  +  G  +F T+VDD +RYTW+++LK KSDVLS+ P F ++V TQFG  +K  RSDNAP
Subjt:  IDTCGICPLAKQKRLSFTSNNSLSAKAFDLVHADIWGPFSSPTYAGHSYFLTLVDDSTRYTWIFLLKHKSDVLSVVPQFFKLVETQFGCCIKQFRSDNAP

Query:  ELAFGDFFRSKGVIHQFSCVERPQQNSVVERKHQHILNVARALYFQSRVPIRFWGDCVLTAVYLINRTPSPLLKWCTPFELLNGSKADYSSIRVFGCLCF
        EL F DFF   G+ H  SCVERPQQNSVVERKHQHILNVARAL FQS +P+ +W DC+ T+VYLINRTPSP+L   TPFELL+G    YS ++VFGCLC+
Subjt:  ELAFGDFFRSKGVIHQFSCVERPQQNSVVERKHQHILNVARALYFQSRVPIRFWGDCVLTAVYLINRTPSPLLKWCTPFELLNGSKADYSSIRVFGCLCF

Query:  ASTLSSHRSKFHPRAIPAVFLGYPPGMKAYKLYDIEQQKLFISRDVIFHEEIFPFHTVTEASVLPDPFPGLVLPKPFNVSTEVEPPSLSSPSPFLPDVAP
        ASTL S R KF PRAI  VF+GYPPG K YKL ++E  ++FISRDVIFHE  FP+   +  S+              +++ EV P S  +PS  +P  A 
Subjt:  ASTLSSHRSKFHPRAIPAVFLGYPPGMKAYKLYDIEQQKLFISRDVIFHEEIFPFHTVTEASVLPDPFPGLVLPKPFNVSTEVEPPSLSSPSPFLPDVAP

Query:  VIDTTQQPLMDAHNIVPQQPHLIDPPLVRRSARVSKPPSYLHAYHCSLLSSTSLPTTSSRFPLQQFISYSRLSSTHRNLVLNVSTHFEPQFYHQAVVFDH
                          Q H        R++R    PS+L  YHC  +S+    +TS+  P+   ++YS+LSS+HR  V N+S+  EP  + QAV    
Subjt:  VIDTTQQPLMDAHNIVPQQPHLIDPPLVRRSARVSKPPSYLHAYHCSLLSSTSLPTTSSRFPLQQFISYSRLSSTHRNLVLNVSTHFEPQFYHQAVVFDH

Query:  WRAAMDAELAAMEDNKTWSVVPLPHGKHTIGCRWIYKIKHKADGSIERYKARLVAKGYTQQEGLDFLETFSPVAKLVTVKTLLTVAVSKGWSLMQLDVNN
        WR AMD EL A+E N TWS+V LP GK  +GCRW+YK K  ADGS++RYKARLVAKGYTQQEGLD+LETFSPVAKLVTV+TLL +A  +GW L+QLDVNN
Subjt:  WRAAMDAELAAMEDNKTWSVVPLPHGKHTIGCRWIYKIKHKADGSIERYKARLVAKGYTQQEGLDFLETFSPVAKLVTVKTLLTVAVSKGWSLMQLDVNN

Query:  AFLHGDLFEEVYMDLPLGYNSHVDSKGEHLVCKLHKSIYGLKQASRQWFAKFSHFLISLGFFQSKADYSLFVKGTGHSFVALLVYVDDIIITGSNASTIQ
        AFLHGDL EEVYM LP G+ S  +      VCKLHKSIYGLKQASRQWFAKFS  L+S+GF QS AD SLF++   + F+AL+VYVDDI+I  ++ +   
Subjt:  AFLHGDLFEEVYMDLPLGYNSHVDSKGEHLVCKLHKSIYGLKQASRQWFAKFSHFLISLGFFQSKADYSLFVKGTGHSFVALLVYVDDIIITGSNASTIQ

Query:  HLKDLLNAHFKLKDLGSLKYFLGLELARSSTGIFVSQRHYTLQLLEDTGFLGCKPTSIPMDPQLKLCSSNVDLLADPSIYRRLIGRLLYLTISRPDITFA
         LKD LN+ FKLKDLG+LKYFLG+E+ARS+ G+ + QR+Y + LL + G LGCKP + PM+   KL   + ++L+DP+ YRRLIGRLLYLTI+RPD+ FA
Subjt:  HLKDLLNAHFKLKDLGSLKYFLGLELARSSTGIFVSQRHYTLQLLEDTGFLGCKPTSIPMDPQLKLCSSNVDLLADPSIYRRLIGRLLYLTISRPDITFA

Query:  VHKLSQFMTKPCKSHLAAAHDLLRYLKSSPGQGIFLPAVSNFQIRAFADSDWGACLDTRRSITGFCVFLGDSLVSWKSKKQQTISRSSAEAEYRALAVTA
        V+KLSQ+++ P   H+ AA ++L+Y+K + GQG+F  + S+ ++RAF+D+DWGACLDTRRS+TG+CVFLG+SL+SW++KKQQT+SRSSAEAEYR+LA + 
Subjt:  VHKLSQFMTKPCKSHLAAAHDLLRYLKSSPGQGIFLPAVSNFQIRAFADSDWGACLDTRRSITGFCVFLGDSLVSWKSKKQQTISRSSAEAEYRALAVTA

Query:  CEIIWLTNLLRDLQVQVSLPALLFCDNQAAVHIASNPVFHERTKHIELDCHFIRDKLIDGTIKLLPVRSHSQLADPFTKPLSAALLFPLLSKMGVLDIHR
        CEI+W+  LL DL V  + P +LFCD+QAAVHIASNPVFHERTKHI++DCH +R+K+    +KL+ V S  QLAD FTKPL  +    LL KMGV +IH 
Subjt:  CEIIWLTNLLRDLQVQVSLPALLFCDNQAAVHIASNPVFHERTKHIELDCHFIRDKLIDGTIKLLPVRSHSQLADPFTKPLSAALLFPLLSKMGVLDIHR

Query:  PS
         S
Subjt:  PS

A0A5A7VQN7 Cysteine-rich RLK (Receptor-like protein kinase) 8   e-value: 0.0e+00   % identity: 64.09
Query:  CGICPLAKQKRLSFTSNNSLSAKAFDLVHADIWGPFSSPTYAGHSYFLTLVDDSTRYTWIFLLKHKSDVLSVVPQFFKLVETQFGCCIKQFRSDNAPELA
        C ICPLAKQ++LSFTSNN LS+ AFDL+H DIWGPFS+ TY+ +SYFLT+VDD+TRYTW+F+LK KSDV+S++P FFKL+ETQ+G  IK+ RSDNA +L 
Subjt:  CGICPLAKQKRLSFTSNNSLSAKAFDLVHADIWGPFSSPTYAGHSYFLTLVDDSTRYTWIFLLKHKSDVLSVVPQFFKLVETQFGCCIKQFRSDNAPELA

Query:  FGDFFRSKGVIHQFSCVERPQQNSVVERKHQHILNVARALYFQSRVPIRFWGDCVLTAVYLINRTPSPLLKWCTPFELLNGSKADYSSIRVFGCLCFAST
        F  FF  KGVIHQ+SCV+ PQQNSVVE+KHQHILN ARALYFQS+VP+ FWGDC++TAVYLI+RTPS LL+W  PF+ LN    DY+S++VFG LC+AS+
Subjt:  FGDFFRSKGVIHQFSCVERPQQNSVVERKHQHILNVARALYFQSRVPIRFWGDCVLTAVYLINRTPSPLLKWCTPFELLNGSKADYSSIRVFGCLCFAST

Query:  LSSHRSKFHPRAIPAVFLGYPPGMKAYKLYDIEQQKLFISRDVIFHEEIFPFHTVTEASVLPDPFPGLVLPKPFNVSTEVEPPSLSSPSPFLPDVAPVID
        L  + SKF PRAIP+VF+GYP GMK YKLYDIE +K+FISRD +  E +     + E ++                                        
Subjt:  LSSHRSKFHPRAIPAVFLGYPPGMKAYKLYDIEQQKLFISRDVIFHEEIFPFHTVTEASVLPDPFPGLVLPKPFNVSTEVEPPSLSSPSPFLPDVAPVID

Query:  TTQQPLMDAHNIVPQQPHLIDPPLVRRSARVSKPPSYLHAYHCSLLSSTSLPT-TSSRFPLQQFISYSRLSSTHRNLVLNVSTHFEPQFYHQAVVFDHWR
         +QQ          +QP+  +   +RRS R++KPPSYL AYHCSLL++ S PT  S+++P+ Q++SY  L  T++  +L  ST  E  FYH+AVV   W 
Subjt:  TTQQPLMDAHNIVPQQPHLIDPPLVRRSARVSKPPSYLHAYHCSLLSSTSLPT-TSSRFPLQQFISYSRLSSTHRNLVLNVSTHFEPQFYHQAVVFDHWR

Query:  AAMDAELAAMEDNKTWSVVPLPHGKHTIGCRWIYKIKHKADGSIERYKARLVAKGYTQQEGLDFLETFSPVAKLVTVKTLLTVAVSKGWSLMQLDVNNAF
         AM AEL AME N+TWS+VPLP GK++IGCRW+YKIKHKADGSIERYK RLVAKGYTQQEGLD+ ETFS V K+VTVKTLLT+AVSK W L+QLDVNNAF
Subjt:  AAMDAELAAMEDNKTWSVVPLPHGKHTIGCRWIYKIKHKADGSIERYKARLVAKGYTQQEGLDFLETFSPVAKLVTVKTLLTVAVSKGWSLMQLDVNNAF

Query:  LHGDLFEEVYMDLPLGYNSHVDSKGEHLVCKLHKSIYGLKQASRQWFAKFSHFLISLGFFQSKADYSLFVKGTGHSFVALLVYVDDIIITGSNASTIQHL
        LHG+LFEEVYMDLPLGY    + +GE LVC+LHKSIYGLKQASRQWF KFS FL+SLGF QSKA+YSLF++G   SF+ALLVYVDDIIITG+NA  IQ L
Subjt:  LHGDLFEEVYMDLPLGYNSHVDSKGEHLVCKLHKSIYGLKQASRQWFAKFSHFLISLGFFQSKADYSLFVKGTGHSFVALLVYVDDIIITGSNASTIQHL

Query:  KDLLNAHFKLKDLGSLKYFLGLELARSSTGIFVSQRHYTLQLLEDTGFLGCKPTSIPMDPQLKLCSSNVDLLADPSIYRRLIGRLLYLTISRPDITFAVH
        K  LN  F LKDLG LK+FLGLELAR+S+G+F+SQ++YTLQL+EDTG LG KPT +PMDP  KL +S+ D+L D + YRRLIGRLLYLTISRPDITFAVH
Subjt:  KDLLNAHFKLKDLGSLKYFLGLELARSSTGIFVSQRHYTLQLLEDTGFLGCKPTSIPMDPQLKLCSSNVDLLADPSIYRRLIGRLLYLTISRPDITFAVH

Query:  KLSQFMTKPCKSHLAAAHDLLRYLKSSPGQGIFLPAVSNFQIRAFADSDWGACLDTRRSITGFCVFLGDSLVSWKSKKQQTISRSSAEAEYRALAVTACE
        KLSQFM KP  +H+ AA+ L++YLK SPG+GI LP V +F I AFAD++WG+ LDTRRS+TGFCVFLG+SLVSWKSKKQQT++RSSAEAEY+AL VT CE
Subjt:  KLSQFMTKPCKSHLAAAHDLLRYLKSSPGQGIFLPAVSNFQIRAFADSDWGACLDTRRSITGFCVFLGDSLVSWKSKKQQTISRSSAEAEYRALAVTACE

Query:  IIWLTNLLRDLQVQVSLPALLFCDNQAAVHIASNPVFHERTKHIELDCHFIRDKLIDGTIKLLPVR
        +IWL +LL +L+++   PALLFCDNQAA++IA+NP+FHE+TKHIELDCHF+RD++IDG+IKLLPVR
Subjt:  IIWLTNLLRDLQVQVSLPALLFCDNQAAVHIASNPVFHERTKHIELDCHFIRDKLIDGTIKLLPVR

SwissProt top hits (accession and description, e-value, % identity, alignment)
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94   e-value: 3.7e-139   % identity: 33.07
Query:  IDTCGICPLAKQKRLSFTSNNSLSAKAFDLVHADIWGPFSSPTYAGHSYFLTLVDDSTRYTWIFLLKHKSDVLSVVPQFFKLVETQFGCCIKQFRSDNAP
        +  C  C   KQ R+SF +++       DLV++D+ GP    +  G+ YF+T +DD++R  W+++LK K  V  V  +F  LVE + G  +K+ RSDN  
Subjt:  IDTCGICPLAKQKRLSFTSNNSLSAKAFDLVHADIWGPFSSPTYAGHSYFLTLVDDSTRYTWIFLLKHKSDVLSVVPQFFKLVETQFGCCIKQFRSDNAP

Query:  ELA---FGDFFRSKGVIHQFSCVERPQQNSVVERKHQHILNVARALYFQSRVPIRFWGDCVLTAVYLINRTPSPLLKWCTPFELLNGSKADYSSIRVFGC
        E     F ++  S G+ H+ +    PQ N V ER ++ I+   R++   +++P  FWG+ V TA YLINR+PS  L +  P  +    +  YS ++VFGC
Subjt:  ELA---FGDFFRSKGVIHQFSCVERPQQNSVVERKHQHILNVARALYFQSRVPIRFWGDCVLTAVYLINRTPSPLLKWCTPFELLNGSKADYSSIRVFGC

Query:  LCFASTLSSHRSKFHPRAIPAVFLGYPPGMKAYKLYDIEQQKLFISRDVIFHEEIFPFHTVTEASVLPDPFPGLVLPKPFNVSTEVEPPSLSSPSPFLPD
          FA      R+K   ++IP +F+GY      Y+L+D  ++K+  SRDV+F E       V  A+ + +     ++P    +     P + ++P+     
Subjt:  LCFASTLSSHRSKFHPRAIPAVFLGYPPGMKAYKLYDIEQQKLFISRDVIFHEEIFPFHTVTEASVLPDPFPGLVLPKPFNVSTEVEPPSLSSPSPFLPD

Query:  VAPVIDTTQQPLMDAHNIVPQQPHLIDPPLVRRSARVSKPPSYLHAYHCSLLSSTSLPTTSSRFPLQQFISYSRLSSTHRNLVLNVSTHFEPQFYHQAVV
           V +  +QP       V +Q   +D           + P+     H  L  S      S R+P  +++  S         +  V +H E         
Subjt:  VAPVIDTTQQPLMDAHNIVPQQPHLIDPPLVRRSARVSKPPSYLHAYHCSLLSSTSLPTTSSRFPLQQFISYSRLSSTHRNLVLNVSTHFEPQFYHQAVV

Query:  FDHWRAAMDAELAAMEDNKTWSVVPLPHGKHTIGCRWIYKIKHKADGSIERYKARLVAKGYTQQEGLDFLETFSPVAKLVTVKTLLTVAVSKGWSLMQLD
         +    AM  E+ +++ N T+ +V LP GK  + C+W++K+K   D  + RYKARLV KG+ Q++G+DF E FSPV K+ +++T+L++A S    + QLD
Subjt:  FDHWRAAMDAELAAMEDNKTWSVVPLPHGKHTIGCRWIYKIKHKADGSIERYKARLVAKGYTQQEGLDFLETFSPVAKLVTVKTLLTVAVSKGWSLMQLD

Query:  VNNAFLHGDLFEEVYMDLPLGYNSHVDSKGEHLVCKLHKSIYGLKQASRQWFAKFSHFLISLGFFQSKADYSLFVKG-TGHSFVALLVYVDDIIITGSNA
        V  AFLHGDL EE+YM+ P G+   V  K +H+VCKL+KS+YGLKQA RQW+ KF  F+ S  + ++ +D  ++ K  + ++F+ LL+YVDD++I G + 
Subjt:  VNNAFLHGDLFEEVYMDLPLGYNSHVDSKGEHLVCKLHKSIYGLKQASRQWFAKFSHFLISLGFFQSKADYSLFVKG-TGHSFVALLVYVDDIIITGSNA

Query:  STIQHLKDLLNAHFKLKDLGSLKYFLGLELARSSTG--IFVSQRHYTLQLLEDTGFLGCKPTSIPMDPQLKL----CSSNVDLLADPS--IYRRLIGRLL
          I  LK  L+  F +KDLG  +  LG+++ R  T   +++SQ  Y  ++LE       KP S P+   LKL    C + V+   + +   Y   +G L+
Subjt:  STIQHLKDLLNAHFKLKDLGSLKYFLGLELARSSTG--IFVSQRHYTLQLLEDTGFLGCKPTSIPMDPQLKL----CSSNVDLLADPS--IYRRLIGRLL

Query:  Y-LTISRPDITFAVHKLSQFMTKPCKSHLAAAHDLLRYLKSSPGQGIFLPAVSNFQIRAFADSDWGACLDTRRSITGFCVFLGDSLVSWKSKKQQTISRS
        Y +  +RPDI  AV  +S+F+  P K H  A   +LRYL+ + G  +     S+  ++ + D+D    +D R+S TG+        +SW+SK Q+ ++ S
Subjt:  Y-LTISRPDITFAVHKLSQFMTKPCKSHLAAAHDLLRYLKSSPGQGIFLPAVSNFQIRAFADSDWGACLDTRRSITGFCVFLGDSLVSWKSKKQQTISRS

Query:  SAEAEYRALAVTACEIIWLTNLLRDLQVQVSLPALLFCDNQAAVHIASNPVFHERTKHIELDCHFIRDKLIDGTIKLLPVRSHSQLADPFTK
        + EAEY A   T  E+IWL   L++L +      +++CD+Q+A+ ++ N ++H RTKHI++  H+IR+ + D ++K+L + ++   AD  TK
Subjt:  SAEAEYRALAVTACEIIWLTNLLRDLQVQVSLPALLFCDNQAAVHIASNPVFHERTKHIELDCHFIRDKLIDGTIKLLPVRSHSQLADPFTK

Q39196 Probable aquaporin PIP1-4    9.8e-140    87.63
Query:  MEGKEEDVRLGANKFTERQPIGTAAQSQDDDAKDYKEPPPAPLFEPEELTSWSFYRAGIAEFIATFLFLYITVLTVMGVARSNEANGNKCKTVGIQGIAW
        MEGKEEDVR+GANKF ERQPIGT+AQS D   KDYKEPPPAPLFEP EL+SWSFYRAGIAEFIATFLFLYITVLTVMGV R+     N C +VGIQGIAW
Subjt:  MEGKEEDVRLGANKFTERQPIGTAAQSQDDDAKDYKEPPPAPLFEPEELTSWSFYRAGIAEFIATFLFLYITVLTVMGVARSNEANGNKCKTVGIQGIAW

Query:  AFGGMIFALVYCTAGISGGHINPAVTFGLFLARKLSLTRAIFYMVMQCLGAICGAGVVKGFQPKPYERLGGGANAVNKGYTKGDGLGAEIVGTFILVYTV
        AFGGMIFALVYCTAGISGGHINPAVTFGLFLARKLSLTRA+FYM+MQCLGAICGAGVVKGFQP PY+ LGGGAN V  GYTKG GLGAEI+GTF+LVYTV
Subjt:  AFGGMIFALVYCTAGISGGHINPAVTFGLFLARKLSLTRAIFYMVMQCLGAICGAGVVKGFQPKPYERLGGGANAVNKGYTKGDGLGAEIVGTFILVYTV

Query:  FSATDAKRSARDSHVPILAPLPIGFAVFLVHLATIPITGTGINPARSLGAAIIFNRDEAWDDHWIFWVGPFIGAALAALYHQV
        FSATDAKRSARDSHVPILAPLPIGFAVFLVHLATIPITGTGINPARSLGAAII+N+D +WDDHWIFWVGPFIGAALAALYHQ+
Subjt:  FSATDAKRSARDSHVPILAPLPIGFAVFLVHLATIPITGTGINPARSLGAAIIFNRDEAWDDHWIFWVGPFIGAALAALYHQV

Q7XSQ9 Probable aquaporin PIP1-2    1.4e-138    88.69
Query:  MEGKEEDVRLGANKFTERQPIGTAAQSQDDDAKDYKEPPPAPLFEPEELTSWSFYRAGIAEFIATFLFLYITVLTVMGVARSNEANGNKCKTVGIQGIAW
        MEGKEEDVRLGANKF+ERQPIGTAAQ  DD  KDYKEPPPAPLFEP EL SWSFYRAGIAEF+ATFLFLYITVLTVMGV  S     +KC TVGIQGIAW
Subjt:  MEGKEEDVRLGANKFTERQPIGTAAQSQDDDAKDYKEPPPAPLFEPEELTSWSFYRAGIAEFIATFLFLYITVLTVMGVARSNEANGNKCKTVGIQGIAW

Query:  AFGGMIFALVYCTAGISGGHINPAVTFGLFLARKLSLTRAIFYMVMQCLGAICGAGVVKGFQPKPYERLGGGANAVNKGYTKGDGLGAEIVGTFILVYTV
        +FGGMIFALVYCTAGISGGHINPAVTFGLFLARKLSLTRA+FYMVMQCLGAICGAGVVKGFQ   YE  GGGAN V  GYTKGDGLGAEIVGTFILVYTV
Subjt:  AFGGMIFALVYCTAGISGGHINPAVTFGLFLARKLSLTRAIFYMVMQCLGAICGAGVVKGFQPKPYERLGGGANAVNKGYTKGDGLGAEIVGTFILVYTV

Query:  FSATDAKRSARDSHVPILAPLPIGFAVFLVHLATIPITGTGINPARSLGAAIIFNRDEAWDDHWIFWVGPFIGAALAALYHQV
        FSATDAKR+ARDSHVPILAPLPIGFAVFLVHLATIPITGTGINPARSLGAAII+NR  AWDDHWIFWVGPFIGAALAA+YHQV
Subjt:  FSATDAKRSARDSHVPILAPLPIGFAVFLVHLATIPITGTGINPARSLGAAIIFNRDEAWDDHWIFWVGPFIGAALAALYHQV

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1    2.3e-173    37.35
Query:  TCGICPLAKQKRLSFTSNNSLSAKAFDLVHADIWGPFSSP--TYAGHSYFLTLVDDSTRYTWIFLLKHKSDVLSVVPQFFKLVETQFGCCIKQFRSDNAP
        +C  C + K  ++ F+ +   S +  + +++D+W   SSP  ++  + Y++  VD  TRYTW++ LK KS V      F  L+E +F   I  F SDN  
Subjt:  TCGICPLAKQKRLSFTSNNSLSAKAFDLVHADIWGPFSSP--TYAGHSYFLTLVDDSTRYTWIFLLKHKSDVLSVVPQFFKLVETQFGCCIKQFRSDNAP

Query:  E-LAFGDFFRSKGVIHQFSCVERPQQNSVVERKHQHILNVARALYFQSRVPIRFWGDCVLTAVYLINRTPSPLLKWCTPFELLNGSKADYSSIRVFGCLC
        E +A  ++F   G+ H  S    P+ N + ERKH+HI+     L   + +P  +W      AVYLINR P+PLL+  +PF+ L G+  +Y  +RVFGC C
Subjt:  E-LAFGDFFRSKGVIHQFSCVERPQQNSVVERKHQHILNVARALYFQSRVPIRFWGDCVLTAVYLINRTPSPLLKWCTPFELLNGSKADYSSIRVFGCLC

Query:  FASTLSSHRSKFHPRAIPAVFLGYPPGMKAYKLYDIEQQKLFISRDVIFHEEIFPF-----------------------HTV--TEASVLP-----DPFP
        +      ++ K   ++   VFLGY     AY    ++  +L+ISR V F E  FPF                       HT   T   VLP     DP  
Subjt:  FASTLSSHRSKFHPRAIPAVFLGYPPGMKAYKLYDIEQQKLFISRDVIFHEEIFPF-----------------------HTV--TEASVLP-----DPFP

Query:  GLVLPK----PF--------NVSTEVEPPSLSSPSPFLPDVAPVIDTTQ--QPLMDAH--------NIVPQQPHLIDPPLVRRSARVSKPPSYLHAYHCS
            P     PF        N+ +       SSP P  P       TTQ  Q     H        N   + P  +   L   +   S  PS       S
Subjt:  GLVLPK----PF--------NVSTEVEPPSLSSPSPFLPDVAPVIDTTQ--QPLMDAH--------NIVPQQPHLIDPPLVRRSARVSKPPSYLHAYHCS

Query:  LLSSTSLPTTSSRF-----PLQQFISYSRLS--STHR----------------NLVLNVSTHFEPQFYHQAVVFDHWRAAMDAELAAMEDNKTWSVVPLP
          SS++ PT  S       PL Q ++ +  +  +TH                 +L ++++   EP+   QA+  + WR AM +E+ A   N TW +VP P
Subjt:  LLSSTSLPTTSSRF-----PLQQFISYSRLS--STHR----------------NLVLNVSTHFEPQFYHQAVVFDHWRAAMDAELAAMEDNKTWSVVPLP

Query:  HGKHTI-GCRWIYKIKHKADGSIERYKARLVAKGYTQQEGLDFLETFSPVAKLVTVKTLLTVAVSKGWSLMQLDVNNAFLHGDLFEEVYMDLPLGYNSHV
            TI GCRWI+  K+ +DGS+ RYKARLVAKGY Q+ GLD+ ETFSPV K  +++ +L VAV + W + QLDVNNAFL G L ++VYM  P G+   +
Subjt:  HGKHTI-GCRWIYKIKHKADGSIERYKARLVAKGYTQQEGLDFLETFSPVAKLVTVKTLLTVAVSKGWSLMQLDVNNAFLHGDLFEEVYMDLPLGYNSHV

Query:  DSKGEHLVCKLHKSIYGLKQASRQWFAKFSHFLISLGFFQSKADYSLFVKGTGHSFVALLVYVDDIIITGSNASTIQHLKDLLNAHFKLKDLGSLKYFLG
        D    + VCKL K++YGLKQA R W+ +  ++L+++GF  S +D SLFV   G S V +LVYVDDI+ITG++ + + +  D L+  F +KD   L YFLG
Subjt:  DSKGEHLVCKLHKSIYGLKQASRQWFAKFSHFLISLGFFQSKADYSLFVKGTGHSFVALLVYVDDIIITGSNASTIQHLKDLLNAHFKLKDLGSLKYFLG

Query:  LELARSSTGIFVSQRHYTLQLLEDTGFLGCKPTSIPMDPQLKLCSSNVDLLADPSIYRRLIGRLLYLTISRPDITFAVHKLSQFMTKPCKSHLAAAHDLL
        +E  R  TG+ +SQR Y L LL  T  +  KP + PM P  KL   +   L DP+ YR ++G L YL  +RPDI++AV++LSQFM  P + HL A   +L
Subjt:  LELARSSTGIFVSQRHYTLQLLEDTGFLGCKPTSIPMDPQLKLCSSNVDLLADPSIYRRLIGRLLYLTISRPDITFAVHKLSQFMTKPCKSHLAAAHDLL

Query:  RYLKSSPGQGIFLPAVSNFQIRAFADSDWGACLDTRRSITGFCVFLGDSLVSWKSKKQQTISRSSAEAEYRALAVTACEIIWLTNLLRDLQVQVSLPALL
        RYL  +P  GIFL   +   + A++D+DW    D   S  G+ V+LG   +SW SKKQ+ + RSS EAEYR++A T+ E+ W+ +LL +L ++++ P ++
Subjt:  RYLKSSPGQGIFLPAVSNFQIRAFADSDWGACLDTRRSITGFCVFLGDSLVSWKSKKQQTISRSSAEAEYRALAVTACEIIWLTNLLRDLQVQVSLPALL

Query:  FCDNQAAVHIASNPVFHERTKHIELDCHFIRDKLIDGTIKLLPVRSHSQLADPFTKPLSAALLFPLLSKMGV
        +CDN  A ++ +NPVFH R KHI +D HFIR+++  G ++++ V +H QLAD  TKPLS        SK+GV
Subjt:  FCDNQAAVHIASNPVFHERTKHIELDCHFIRDKLIDGTIKLLPVRSHSQLADPFTKPLSAALLFPLLSKMGV

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2    3.7e-171    37.3
Query:  TCGICPLAKQKRLSFTSNNSLSAKAFDLVHADIWGPFSSPTYA--GHSYFLTLVDDSTRYTWIFLLKHKSDVLSVVPQFFKLVETQFGCCIKQFRSDNAP
        +C  C + K  ++ F+++   S+K  + +++D+W   SSP  +   + Y++  VD  TRYTW++ LK KS V      F  LVE +F   I    SDN  
Subjt:  TCGICPLAKQKRLSFTSNNSLSAKAFDLVHADIWGPFSSPTYA--GHSYFLTLVDDSTRYTWIFLLKHKSDVLSVVPQFFKLVETQFGCCIKQFRSDNAP

Query:  E-LAFGDFFRSKGVIHQFSCVERPQQNSVVERKHQHILNVARALYFQSRVPIRFWGDCVLTAVYLINRTPSPLLKWCTPFELLNGSKADYSSIRVFGCLC
        E +   D+    G+ H  S    P+ N + ERKH+HI+ +   L   + VP  +W      AVYLINR P+PLL+  +PF+ L G   +Y  ++VFGC C
Subjt:  E-LAFGDFFRSKGVIHQFSCVERPQQNSVVERKHQHILNVARALYFQSRVPIRFWGDCVLTAVYLINRTPSPLLKWCTPFELLNGSKADYSSIRVFGCLC

Query:  FASTLSSHRSKFHPRAIPAVFLGYPPGMKAYKLYDIEQQKLFISRDVIFHEEIFPFHTV-----TEASVLPDPFPG-----------LVLPKP----FNV
        +      +R K   ++    F+GY     AY    I   +L+ SR V F E  FPF T      T      D  P            LVLP P     ++
Subjt:  FASTLSSHRSKFHPRAIPAVFLGYPPGMKAYKLYDIEQQKLFISRDVIFHEEIFPFHTV-----TEASVLPDPFPG-----------LVLPKP----FNV

Query:  STEVEPPSLSSP-------SPFLPDVAPVIDTTQQPLMDAHNIVPQ---QPH-----------LIDP--------------PLVR---RSARVSKPPSYL
         T   PPS  SP       S  LP  +    ++ +P   +HN  PQ   QPH           L +P              PL +    S  +  P + +
Subjt:  STEVEPPSLSSP-------SPFLPDVAPVIDTTQQPLMDAHNIVPQ---QPH-----------LIDP--------------PLVR---RSARVSKPPSYL

Query:  HAYHCSLLSSTSLPTTSSRFPLQQFISYSRLS--STHR----------------NLVLNVSTHFEPQFYHQAVVFDHWRAAMDAELAAMEDNKTWSVVPL
           +    SSTS P      P    I  +  +  +TH                 +   +++ + EP+   QA+  D WR AM +E+ A   N TW +VP 
Subjt:  HAYHCSLLSSTSLPTTSSRFPLQQFISYSRLS--STHR----------------NLVLNVSTHFEPQFYHQAVVFDHWRAAMDAELAAMEDNKTWSVVPL

Query:  PHGKHTI-GCRWIYKIKHKADGSIERYKARLVAKGYTQQEGLDFLETFSPVAKLVTVKTLLTVAVSKGWSLMQLDVNNAFLHGDLFEEVYMDLPLGYNSH
        P    TI GCRWI+  K  +DGS+ RYKARLVAKGY Q+ GLD+ ETFSPV K  +++ +L VAV + W + QLDVNNAFL G L +EVYM  P G+   
Subjt:  PHGKHTI-GCRWIYKIKHKADGSIERYKARLVAKGYTQQEGLDFLETFSPVAKLVTVKTLLTVAVSKGWSLMQLDVNNAFLHGDLFEEVYMDLPLGYNSH

Query:  VDSKGEHLVCKLHKSIYGLKQASRQWFAKFSHFLISLGFFQSKADYSLFVKGTGHSFVALLVYVDDIIITGSNASTIQHLKDLLNAHFKLKDLGSLKYFL
        VD      VC+L K+IYGLKQA R W+ +   +L+++GF  S +D SLFV   G S + +LVYVDDI+ITG++   ++H  D L+  F +K+   L YFL
Subjt:  VDSKGEHLVCKLHKSIYGLKQASRQWFAKFSHFLISLGFFQSKADYSLFVKGTGHSFVALLVYVDDIIITGSNASTIQHLKDLLNAHFKLKDLGSLKYFL

Query:  GLELARSSTGIFVSQRHYTLQLLEDTGFLGCKPTSIPMDPQLKLCSSNVDLLADPSIYRRLIGRLLYLTISRPDITFAVHKLSQFMTKPCKSHLAAAHDL
        G+E  R   G+ +SQR YTL LL  T  L  KP + PM    KL   +   L DP+ YR ++G L YL  +RPD+++AV++LSQ+M  P   H  A   +
Subjt:  GLELARSSTGIFVSQRHYTLQLLEDTGFLGCKPTSIPMDPQLKLCSSNVDLLADPSIYRRLIGRLLYLTISRPDITFAVHKLSQFMTKPCKSHLAAAHDL

Query:  LRYLKSSPGQGIFLPAVSNFQIRAFADSDWGACLDTRRSITGFCVFLGDSLVSWKSKKQQTISRSSAEAEYRALAVTACEIIWLTNLLRDLQVQVSLPAL
        LRYL  +P  GIFL   +   + A++D+DW    D   S  G+ V+LG   +SW SKKQ+ + RSS EAEYR++A T+ E+ W+ +LL +L +Q+S P +
Subjt:  LRYLKSSPGQGIFLPAVSNFQIRAFADSDWGACLDTRRSITGFCVFLGDSLVSWKSKKQQTISRSSAEAEYRALAVTACEIIWLTNLLRDLQVQVSLPAL

Query:  LFCDNQAAVHIASNPVFHERTKHIELDCHFIRDKLIDGTIKLLPVRSHSQLADPFTKPLSAALLFPLLSKMGVLDI
        ++CDN  A ++ +NPVFH R KHI LD HFIR+++  G ++++ V +H QLAD  TKPLS         K+GV+ +
Subjt:  LFCDNQAAVHIASNPVFHERTKHIELDCHFIRDKLIDGTIKLLPVRSHSQLADPFTKPLSAALLFPLLSKMGVLDI

Arabidopsis top hits    e value    % identity    Alignment
AT1G01620.1 plasma membrane intrinsic protein 1C    2.2e-139    86.93
Query:  MEGKEEDVRLGANKFTERQPIGTAAQSQDDDAKDYKEPPPAPLFEPEELTSWSFYRAGIAEFIATFLFLYITVLTVMGVARSNEANGNKCKTVGIQGIAW
        MEGKEEDVR+GANKF ERQPIGT+AQ+     KDYKEPPPAP FEP EL+SWSFYRAGIAEFIATFLFLYITVLTVMGV R+     N C +VGIQGIAW
Subjt:  MEGKEEDVRLGANKFTERQPIGTAAQSQDDDAKDYKEPPPAPLFEPEELTSWSFYRAGIAEFIATFLFLYITVLTVMGVARSNEANGNKCKTVGIQGIAW

Query:  AFGGMIFALVYCTAGISGGHINPAVTFGLFLARKLSLTRAIFYMVMQCLGAICGAGVVKGFQPKPYERLGGGANAVNKGYTKGDGLGAEIVGTFILVYTV
        AFGGMIFALVYCTAGISGGHINPAVTFGLFLARKLSLTRA+FY+VMQCLGAICGAGVVKGFQP PY+ LGGGAN V  GYTKG GLGAEI+GTF+LVYTV
Subjt:  AFGGMIFALVYCTAGISGGHINPAVTFGLFLARKLSLTRAIFYMVMQCLGAICGAGVVKGFQPKPYERLGGGANAVNKGYTKGDGLGAEIVGTFILVYTV

Query:  FSATDAKRSARDSHVPILAPLPIGFAVFLVHLATIPITGTGINPARSLGAAIIFNRDEAWDDHWIFWVGPFIGAALAALYHQV
        FSATDAKRSARDSHVPILAPLPIGFAVFLVHLATIPITGTGINPARSLGAAII+N+D AWDDHWIFWVGPFIGAALAALYHQ+
Subjt:  FSATDAKRSARDSHVPILAPLPIGFAVFLVHLATIPITGTGINPARSLGAAIIFNRDEAWDDHWIFWVGPFIGAALAALYHQV

AT2G45960.1 plasma membrane intrinsic protein 1B    2.5e-138    86.22
Query:  MEGKEEDVRLGANKFTERQPIGTAAQSQDDDAKDYKEPPPAPLFEPEELTSWSFYRAGIAEFIATFLFLYITVLTVMGVARSNEANGNKCKTVGIQGIAW
        MEGKEEDVR+GANKF ERQPIGT+AQS     KDYKEPPPAPLFEP EL SWSF+RAGIAEFIATFLFLYITVLTVMGV RS     N C +VGIQGIAW
Subjt:  MEGKEEDVRLGANKFTERQPIGTAAQSQDDDAKDYKEPPPAPLFEPEELTSWSFYRAGIAEFIATFLFLYITVLTVMGVARSNEANGNKCKTVGIQGIAW

Query:  AFGGMIFALVYCTAGISGGHINPAVTFGLFLARKLSLTRAIFYMVMQCLGAICGAGVVKGFQPKPYERLGGGANAVNKGYTKGDGLGAEIVGTFILVYTV
        AFGGMIFALVYCTAGISGGHINPAVTFGLFLARKLSLTRA++Y+VMQCLGAICGAGVVKGFQPK Y+ LGGGAN +  GYTKG GLGAEI+GTF+LVYTV
Subjt:  AFGGMIFALVYCTAGISGGHINPAVTFGLFLARKLSLTRAIFYMVMQCLGAICGAGVVKGFQPKPYERLGGGANAVNKGYTKGDGLGAEIVGTFILVYTV

Query:  FSATDAKRSARDSHVPILAPLPIGFAVFLVHLATIPITGTGINPARSLGAAIIFNRDEAWDDHWIFWVGPFIGAALAALYHQV
        FSATDAKR+ARDSHVPILAPLPIGFAVFLVHLATIPITGTGINPARSLGAAIIFN+D AWDDHW+FWVGPFIGAALAALYH +
Subjt:  FSATDAKRSARDSHVPILAPLPIGFAVFLVHLATIPITGTGINPARSLGAAIIFNRDEAWDDHWIFWVGPFIGAALAALYHQV

AT3G61430.1 plasma membrane intrinsic protein 1A    2.1e-137    85.87
Query:  MEGKEEDVRLGANKFTERQPIGTAAQSQDDDAKDYKEPPPAPLFEPEELTSWSFYRAGIAEFIATFLFLYITVLTVMGVARSNEANGNKCKTVGIQGIAW
        MEGKEEDVR+GANKF ERQPIGT+AQS     KDYKEPPPAP FEP EL+SWSF+RAGIAEFIATFLFLYITVLTVMGV RS     N C +VGIQGIAW
Subjt:  MEGKEEDVRLGANKFTERQPIGTAAQSQDDDAKDYKEPPPAPLFEPEELTSWSFYRAGIAEFIATFLFLYITVLTVMGVARSNEANGNKCKTVGIQGIAW

Query:  AFGGMIFALVYCTAGISGGHINPAVTFGLFLARKLSLTRAIFYMVMQCLGAICGAGVVKGFQPKPYERLGGGANAVNKGYTKGDGLGAEIVGTFILVYTV
        AFGGMIFALVYCTAGISGGHINPAVTFGLFLARKLSLTRA++Y+VMQCLGAICGAGVVKGFQPK Y+ LGGGAN V  GYTKG GLGAEI+GTF+LVYTV
Subjt:  AFGGMIFALVYCTAGISGGHINPAVTFGLFLARKLSLTRAIFYMVMQCLGAICGAGVVKGFQPKPYERLGGGANAVNKGYTKGDGLGAEIVGTFILVYTV

Query:  FSATDAKRSARDSHVPILAPLPIGFAVFLVHLATIPITGTGINPARSLGAAIIFNRDEAWDDHWIFWVGPFIGAALAALYHQV
        FSATDAKR+ARDSHVPILAPLPIGFAVFLVHLATIPITGTGINPARSLGAAII+N+D +WDDHW+FWVGPFIGAALAALYH V
Subjt:  FSATDAKRSARDSHVPILAPLPIGFAVFLVHLATIPITGTGINPARSLGAAIIFNRDEAWDDHWIFWVGPFIGAALAALYHQV

AT4G00430.1 plasma membrane intrinsic protein 1;4    7.0e-141    87.63
Query:  MEGKEEDVRLGANKFTERQPIGTAAQSQDDDAKDYKEPPPAPLFEPEELTSWSFYRAGIAEFIATFLFLYITVLTVMGVARSNEANGNKCKTVGIQGIAW
        MEGKEEDVR+GANKF ERQPIGT+AQS D   KDYKEPPPAPLFEP EL+SWSFYRAGIAEFIATFLFLYITVLTVMGV R+     N C +VGIQGIAW
Subjt:  MEGKEEDVRLGANKFTERQPIGTAAQSQDDDAKDYKEPPPAPLFEPEELTSWSFYRAGIAEFIATFLFLYITVLTVMGVARSNEANGNKCKTVGIQGIAW

Query:  AFGGMIFALVYCTAGISGGHINPAVTFGLFLARKLSLTRAIFYMVMQCLGAICGAGVVKGFQPKPYERLGGGANAVNKGYTKGDGLGAEIVGTFILVYTV
        AFGGMIFALVYCTAGISGGHINPAVTFGLFLARKLSLTRA+FYM+MQCLGAICGAGVVKGFQP PY+ LGGGAN V  GYTKG GLGAEI+GTF+LVYTV
Subjt:  AFGGMIFALVYCTAGISGGHINPAVTFGLFLARKLSLTRAIFYMVMQCLGAICGAGVVKGFQPKPYERLGGGANAVNKGYTKGDGLGAEIVGTFILVYTV

Query:  FSATDAKRSARDSHVPILAPLPIGFAVFLVHLATIPITGTGINPARSLGAAIIFNRDEAWDDHWIFWVGPFIGAALAALYHQV
        FSATDAKRSARDSHVPILAPLPIGFAVFLVHLATIPITGTGINPARSLGAAII+N+D +WDDHWIFWVGPFIGAALAALYHQ+
Subjt:  FSATDAKRSARDSHVPILAPLPIGFAVFLVHLATIPITGTGINPARSLGAAIIFNRDEAWDDHWIFWVGPFIGAALAALYHQV

AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8    5.1e-160    49.74
Query:  PLMDAHNIVPQQPHLIDPPLVRRSARVSKPPSYLHAYHCSLLSSTSLPTTSSRFPLQQFISYSRLSSTHRNLVLNVSTHFEPQFYHQAVVFDHWRAAMDA
        P  +  N VP+       P V  S R ++ P+YL  Y+C  ++S ++        + QF+SY ++S  + + ++ ++   EP  Y++A  F  W  AMD 
Subjt:  PLMDAHNIVPQQPHLIDPPLVRRSARVSKPPSYLHAYHCSLLSSTSLPTTSSRFPLQQFISYSRLSSTHRNLVLNVSTHFEPQFYHQAVVFDHWRAAMDA

Query:  ELAAMEDNKTWSVVPLPHGKHTIGCRWIYKIKHKADGSIERYKARLVAKGYTQQEGLDFLETFSPVAKLVTVKTLLTVAVSKGWSLMQLDVNNAFLHGDL
        E+ AME   TW +  LP  K  IGC+W+YKIK+ +DG+IERYKARLVAKGYTQQEG+DF+ETFSPV KL +VK +L ++    ++L QLD++NAFL+GDL
Subjt:  ELAAMEDNKTWSVVPLPHGKHTIGCRWIYKIKHKADGSIERYKARLVAKGYTQQEGLDFLETFSPVAKLVTVKTLLTVAVSKGWSLMQLDVNNAFLHGDL

Query:  FEEVYMDLPLGYNSHV-DSKGEHLVCKLHKSIYGLKQASRQWFAKFSHFLISLGFFQSKADYSLFVKGTGHSFVALLVYVDDIIITGSNASTIQHLKDLL
         EE+YM LP GY +   DS   + VC L KSIYGLKQASRQWF KFS  LI  GF QS +D++ F+K T   F+ +LVYVDDIII  +N + +  LK  L
Subjt:  FEEVYMDLPLGYNSHV-DSKGEHLVCKLHKSIYGLKQASRQWFAKFSHFLISLGFFQSKADYSLFVKGTGHSFVALLVYVDDIIITGSNASTIQHLKDLL

Query:  NAHFKLKDLGSLKYFLGLELARSSTGIFVSQRHYTLQLLEDTGFLGCKPTSIPMDPQLKLCSSNVDLLADPSIYRRLIGRLLYLTISRPDITFAVHKLSQ
         + FKL+DLG LKYFLGLE+ARS+ GI + QR Y L LL++TG LGCKP+S+PMDP +   + +     D   YRRLIGRL+YL I+R DI+FAV+KLSQ
Subjt:  NAHFKLKDLGSLKYFLGLELARSSTGIFVSQRHYTLQLLEDTGFLGCKPTSIPMDPQLKLCSSNVDLLADPSIYRRLIGRLLYLTISRPDITFAVHKLSQ

Query:  FMTKPCKSHLAAAHDLLRYLKSSPGQGIFLPAVSNFQIRAFADSDWGACLDTRRSITGFCVFLGDSLVSWKSKKQQTISRSSAEAEYRALAVTACEIIWL
        F   P  +H  A   +L Y+K + GQG+F  + +  Q++ F+D+ + +C DTRRS  G+C+FLG SL+SWKSKKQQ +S+SSAEAEYRAL+    E++WL
Subjt:  FMTKPCKSHLAAAHDLLRYLKSSPGQGIFLPAVSNFQIRAFADSDWGACLDTRRSITGFCVFLGDSLVSWKSKKQQTISRSSAEAEYRALAVTACEIIWL

Query:  TNLLRDLQVQVSLPALLFCDNQAAVHIASNPVFHERTKHIELDCHFIRDKLIDGTIKLLPVRSHSQLADPFTKPLSAAL
            R+LQ+ +S P LLFCDN AA+HIA+N VFHERTKHIE DCH +R++ +         +++ +  D FT+ LS  L
Subjt:  TNLLRDLQVQVSLPALLFCDNQAAVHIASNPVFHERTKHIELDCHFIRDKLIDGTIKLLPVRSHSQLADPFTKPLSAAL
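
The unlabeled middle line of each alignment block marks identical residues with a letter, conservative substitutions with "+", and mismatches with a space. As a minimal sketch, the following Python counts identities and positives from such a line. The function name `midline_stats` is hypothetical, and dividing by the number of alignment columns is an assumption about how the "% identity" column is derived, so the result for a short excerpt need not match the tabulated value for the full hit.

```python
def midline_stats(query: str, midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, columns) for one alignment row.

    Assumes the midline uses letters for identical residues, '+' for
    conservative substitutions, and spaces for mismatches.
    """
    # Trailing spaces are often trimmed in page dumps, so pad to the query width.
    midline = midline.ljust(len(query))
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives, len(midline)


# First 26 columns of the AT1G01620.1 alignment listed above.
query = "MEGKEEDVRLGANKFTERQPIGTAAQ"
midline = "MEGKEEDVR+GANKF ERQPIGT+AQ"

identities, positives, columns = midline_stats(query, midline)
percent_identity = 100.0 * identities / columns
print(f"{identities}/{columns} identical, {positives}/{columns} positive, "
      f"{percent_identity:.2f}% identity")
```

Applying the same function to every row of a hit and summing the counts would give an estimate for the whole alignment, provided the rows are copied with their original column spacing intact.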


Sequences
CDS sequence
ATGGAGGGAAAGGAAGAAGATGTAAGATTGGGAGCTAACAAGTTCACCGAGAGGCAGCCCATCGGGACGGCGGCTCAGAGCCAAGACGACGACGCCAAGGACTACAAGGA
GCCACCACCGGCGCCGCTGTTCGAGCCCGAAGAGCTCACTTCCTGGTCCTTTTACAGAGCCGGGATCGCCGAGTTCATCGCCACCTTCCTCTTCCTTTACATCACCGTTT
TGACTGTCATGGGCGTCGCCCGCTCAAACGAAGCCAACGGCAACAAGTGCAAGACCGTCGGGATTCAAGGTATCGCCTGGGCTTTCGGCGGTATGATCTTCGCTCTCGTC
TACTGCACTGCCGGAATCTCCGGTGGCCACATTAACCCGGCGGTGACGTTCGGGCTGTTTCTGGCGAGGAAGCTGTCGTTGACCAGGGCGATTTTCTACATGGTGATGCA
GTGCCTCGGAGCCATCTGCGGCGCCGGCGTGGTGAAGGGCTTCCAACCCAAGCCCTACGAGAGACTCGGCGGCGGAGCTAACGCCGTCAACAAGGGCTACACCAAAGGGG
ACGGCCTCGGCGCCGAGATCGTCGGGACCTTCATCCTTGTTTACACCGTCTTCTCCGCCACCGACGCCAAACGCAGCGCCAGAGACTCCCACGTTCCGATTCTGGCGCCG
CTGCCAATTGGGTTCGCCGTGTTCTTGGTGCACTTGGCCACCATCCCCATCACCGGCACCGGCATCAACCCAGCCCGGAGTTTGGGCGCCGCCATCATCTTCAACAGGGA
CGAAGCCTGGGATGACCACTGGATTTTCTGGGTCGGACCATTCATCGGAGCAGCACTGGCAGCTTTATACCACCAAGTGCATGCTAATATTGATACTTGTGGGATTTGTC
CTTTGGCTAAGCAAAAGCGTTTGTCCTTTACTTCAAACAATAGTCTTTCGGCCAAAGCTTTTGATCTTGTGCATGCTGACATTTGGGGACCATTCTCTAGTCCTACTTAT
GCTGGTCATTCTTACTTTCTTACACTTGTTGATGATTCTACAAGATATACATGGATTTTCTTGCTTAAACATAAGTCTGATGTGTTATCTGTGGTTCCTCAATTTTTTAA
ACTGGTTGAGACACAATTTGGTTGCTGCATAAAACAATTTCGTTCAGATAATGCACCGGAACTTGCTTTTGGTGATTTCTTTCGTAGTAAAGGTGTCATTCATCAGTTTT
CTTGTGTGGAGCGTCCTCAACAAAATTCGGTTGTAGAAAGAAAACACCAACATATTTTGAATGTTGCTCGTGCTCTTTATTTCCAATCTAGAGTACCCATTCGATTTTGG
GGTGATTGTGTTCTTACTGCTGTCTATCTGATTAATAGAACTCCTTCTCCTTTGTTAAAATGGTGTACTCCATTTGAGTTGTTGAATGGATCTAAGGCAGATTATAGCTC
CATTCGTGTGTTTGGATGTTTGTGTTTTGCGTCAACTCTCTCTTCGCATCGCTCAAAGTTTCATCCCCGAGCCATTCCTGCTGTTTTTCTTGGCTATCCTCCAGGTATGA
AAGCTTATAAGTTGTATGACATTGAGCAGCAAAAATTATTCATATCAAGGGATGTTATTTTTCACGAGGAAATATTTCCTTTTCATACTGTTACTGAGGCCTCTGTACTC
CCAGACCCCTTTCCTGGTCTTGTTTTGCCTAAGCCCTTTAATGTCAGCACTGAAGTTGAGCCACCAAGTCTCTCTTCTCCATCACCCTTTTTGCCTGATGTTGCTCCTGT
CATTGATACAACTCAGCAGCCTTTGATGGATGCTCATAACATTGTGCCTCAACAGCCTCATCTCATTGACCCACCCCTTGTTCGACGTTCTGCACGTGTCTCTAAGCCAC
CTTCTTATTTACATGCTTATCATTGTAGTCTTCTTTCCTCTACCTCCTTGCCTACCACCTCTTCTCGTTTTCCCCTACAACAATTTATTTCTTACTCTCGACTTTCCTCA
ACTCATCGCAATTTGGTTCTTAATGTTTCTACACATTTTGAGCCACAATTTTATCACCAAGCAGTCGTCTTTGACCATTGGCGGGCTGCTATGGATGCAGAGTTGGCAGC
TATGGAGGATAATAAGACTTGGAGCGTTGTTCCTCTCCCTCATGGTAAGCACACTATTGGATGTCGGTGGATATATAAGATTAAACATAAAGCTGATGGTTCTATTGAAC
GTTACAAGGCTCGTTTAGTTGCTAAAGGTTATACACAGCAAGAAGGCTTGGATTTCCTTGAAACTTTCTCTCCGGTTGCCAAGTTAGTCACTGTTAAGACCCTTCTTACT
GTTGCAGTTTCTAAAGGGTGGTCTTTGATGCAATTGGATGTTAACAACGCTTTCTTACATGGTGATTTGTTTGAGGAGGTTTACATGGATTTACCTTTAGGATACAATTC
TCATGTTGATAGTAAGGGGGAGCATTTAGTTTGTAAATTGCATAAATCAATCTATGGCCTCAAGCAAGCCTCTAGGCAATGGTTTGCCAAATTTTCTCACTTCTTAATTT
CTTTGGGATTCTTCCAGTCAAAGGCCGACTATTCATTGTTTGTCAAAGGAACTGGGCATTCTTTTGTTGCTCTTTTAGTATACGTCGATGACATCATTATTACTGGGTCT
AATGCTTCAACTATTCAGCACTTGAAGGATCTTCTTAATGCACATTTCAAACTCAAGGATCTTGGCTCTCTTAAATATTTTCTTGGGCTTGAACTTGCTCGTTCCTCTAC
TGGTATTTTTGTATCACAGAGACATTATACTTTACAGCTACTGGAAGACACAGGTTTTCTTGGCTGTAAGCCCACTAGTATTCCTATGGATCCTCAGTTGAAGTTGTGTT
CTTCTAATGTTGATTTATTGGCTGACCCTTCAATATATCGACGTCTTATTGGCCGCCTCTTATACTTGACTATATCTCGACCTGACATAACATTTGCAGTTCATAAACTG
AGTCAGTTTATGACGAAGCCCTGTAAATCTCACCTTGCGGCTGCTCATGATTTGTTGCGCTACTTGAAGTCATCCCCAGGCCAAGGAATTTTCTTACCTGCTGTATCTAA
TTTTCAGATTCGAGCTTTTGCAGATTCTGATTGGGGTGCTTGTCTTGACACCAGACGCTCCATCACTGGTTTTTGTGTATTTTTAGGAGATTCATTGGTGTCGTGGAAAT
CAAAGAAGCAACAAACCATTTCCAGATCATCAGCAGAGGCAGAGTATCGTGCTCTTGCTGTAACAGCATGTGAAATCATTTGGTTGACTAATCTTTTACGTGATTTACAG
GTTCAAGTTAGCCTTCCAGCTCTTTTGTTTTGTGACAATCAGGCAGCAGTTCATATTGCCTCTAATCCTGTTTTTCATGAGCGCACAAAACACATTGAGTTAGATTGTCA
TTTTATTCGAGACAAACTCATTGATGGGACCATCAAACTTCTTCCAGTTAGATCACATTCCCAGCTTGCGGATCCTTTTACAAAACCTCTATCTGCTGCTCTTCTTTTCC
CATTGTTGTCCAAGATGGGCGTTTTGGATATACATCGTCCATCTTGA
mRNA sequence
ATGGAGGGAAAGGAAGAAGATGTAAGATTGGGAGCTAACAAGTTCACCGAGAGGCAGCCCATCGGGACGGCGGCTCAGAGCCAAGACGACGACGCCAAGGACTACAAGGA
GCCACCACCGGCGCCGCTGTTCGAGCCCGAAGAGCTCACTTCCTGGTCCTTTTACAGAGCCGGGATCGCCGAGTTCATCGCCACCTTCCTCTTCCTTTACATCACCGTTT
TGACTGTCATGGGCGTCGCCCGCTCAAACGAAGCCAACGGCAACAAGTGCAAGACCGTCGGGATTCAAGGTATCGCCTGGGCTTTCGGCGGTATGATCTTCGCTCTCGTC
TACTGCACTGCCGGAATCTCCGGTGGCCACATTAACCCGGCGGTGACGTTCGGGCTGTTTCTGGCGAGGAAGCTGTCGTTGACCAGGGCGATTTTCTACATGGTGATGCA
GTGCCTCGGAGCCATCTGCGGCGCCGGCGTGGTGAAGGGCTTCCAACCCAAGCCCTACGAGAGACTCGGCGGCGGAGCTAACGCCGTCAACAAGGGCTACACCAAAGGGG
ACGGCCTCGGCGCCGAGATCGTCGGGACCTTCATCCTTGTTTACACCGTCTTCTCCGCCACCGACGCCAAACGCAGCGCCAGAGACTCCCACGTTCCGATTCTGGCGCCG
CTGCCAATTGGGTTCGCCGTGTTCTTGGTGCACTTGGCCACCATCCCCATCACCGGCACCGGCATCAACCCAGCCCGGAGTTTGGGCGCCGCCATCATCTTCAACAGGGA
CGAAGCCTGGGATGACCACTGGATTTTCTGGGTCGGACCATTCATCGGAGCAGCACTGGCAGCTTTATACCACCAAGTGCATGCTAATATTGATACTTGTGGGATTTGTC
CTTTGGCTAAGCAAAAGCGTTTGTCCTTTACTTCAAACAATAGTCTTTCGGCCAAAGCTTTTGATCTTGTGCATGCTGACATTTGGGGACCATTCTCTAGTCCTACTTAT
GCTGGTCATTCTTACTTTCTTACACTTGTTGATGATTCTACAAGATATACATGGATTTTCTTGCTTAAACATAAGTCTGATGTGTTATCTGTGGTTCCTCAATTTTTTAA
ACTGGTTGAGACACAATTTGGTTGCTGCATAAAACAATTTCGTTCAGATAATGCACCGGAACTTGCTTTTGGTGATTTCTTTCGTAGTAAAGGTGTCATTCATCAGTTTT
CTTGTGTGGAGCGTCCTCAACAAAATTCGGTTGTAGAAAGAAAACACCAACATATTTTGAATGTTGCTCGTGCTCTTTATTTCCAATCTAGAGTACCCATTCGATTTTGG
GGTGATTGTGTTCTTACTGCTGTCTATCTGATTAATAGAACTCCTTCTCCTTTGTTAAAATGGTGTACTCCATTTGAGTTGTTGAATGGATCTAAGGCAGATTATAGCTC
CATTCGTGTGTTTGGATGTTTGTGTTTTGCGTCAACTCTCTCTTCGCATCGCTCAAAGTTTCATCCCCGAGCCATTCCTGCTGTTTTTCTTGGCTATCCTCCAGGTATGA
AAGCTTATAAGTTGTATGACATTGAGCAGCAAAAATTATTCATATCAAGGGATGTTATTTTTCACGAGGAAATATTTCCTTTTCATACTGTTACTGAGGCCTCTGTACTC
CCAGACCCCTTTCCTGGTCTTGTTTTGCCTAAGCCCTTTAATGTCAGCACTGAAGTTGAGCCACCAAGTCTCTCTTCTCCATCACCCTTTTTGCCTGATGTTGCTCCTGT
CATTGATACAACTCAGCAGCCTTTGATGGATGCTCATAACATTGTGCCTCAACAGCCTCATCTCATTGACCCACCCCTTGTTCGACGTTCTGCACGTGTCTCTAAGCCAC
CTTCTTATTTACATGCTTATCATTGTAGTCTTCTTTCCTCTACCTCCTTGCCTACCACCTCTTCTCGTTTTCCCCTACAACAATTTATTTCTTACTCTCGACTTTCCTCA
ACTCATCGCAATTTGGTTCTTAATGTTTCTACACATTTTGAGCCACAATTTTATCACCAAGCAGTCGTCTTTGACCATTGGCGGGCTGCTATGGATGCAGAGTTGGCAGC
TATGGAGGATAATAAGACTTGGAGCGTTGTTCCTCTCCCTCATGGTAAGCACACTATTGGATGTCGGTGGATATATAAGATTAAACATAAAGCTGATGGTTCTATTGAAC
GTTACAAGGCTCGTTTAGTTGCTAAAGGTTATACACAGCAAGAAGGCTTGGATTTCCTTGAAACTTTCTCTCCGGTTGCCAAGTTAGTCACTGTTAAGACCCTTCTTACT
GTTGCAGTTTCTAAAGGGTGGTCTTTGATGCAATTGGATGTTAACAACGCTTTCTTACATGGTGATTTGTTTGAGGAGGTTTACATGGATTTACCTTTAGGATACAATTC
TCATGTTGATAGTAAGGGGGAGCATTTAGTTTGTAAATTGCATAAATCAATCTATGGCCTCAAGCAAGCCTCTAGGCAATGGTTTGCCAAATTTTCTCACTTCTTAATTT
CTTTGGGATTCTTCCAGTCAAAGGCCGACTATTCATTGTTTGTCAAAGGAACTGGGCATTCTTTTGTTGCTCTTTTAGTATACGTCGATGACATCATTATTACTGGGTCT
AATGCTTCAACTATTCAGCACTTGAAGGATCTTCTTAATGCACATTTCAAACTCAAGGATCTTGGCTCTCTTAAATATTTTCTTGGGCTTGAACTTGCTCGTTCCTCTAC
TGGTATTTTTGTATCACAGAGACATTATACTTTACAGCTACTGGAAGACACAGGTTTTCTTGGCTGTAAGCCCACTAGTATTCCTATGGATCCTCAGTTGAAGTTGTGTT
CTTCTAATGTTGATTTATTGGCTGACCCTTCAATATATCGACGTCTTATTGGCCGCCTCTTATACTTGACTATATCTCGACCTGACATAACATTTGCAGTTCATAAACTG
AGTCAGTTTATGACGAAGCCCTGTAAATCTCACCTTGCGGCTGCTCATGATTTGTTGCGCTACTTGAAGTCATCCCCAGGCCAAGGAATTTTCTTACCTGCTGTATCTAA
TTTTCAGATTCGAGCTTTTGCAGATTCTGATTGGGGTGCTTGTCTTGACACCAGACGCTCCATCACTGGTTTTTGTGTATTTTTAGGAGATTCATTGGTGTCGTGGAAAT
CAAAGAAGCAACAAACCATTTCCAGATCATCAGCAGAGGCAGAGTATCGTGCTCTTGCTGTAACAGCATGTGAAATCATTTGGTTGACTAATCTTTTACGTGATTTACAG
GTTCAAGTTAGCCTTCCAGCTCTTTTGTTTTGTGACAATCAGGCAGCAGTTCATATTGCCTCTAATCCTGTTTTTCATGAGCGCACAAAACACATTGAGTTAGATTGTCA
TTTTATTCGAGACAAACTCATTGATGGGACCATCAAACTTCTTCCAGTTAGATCACATTCCCAGCTTGCGGATCCTTTTACAAAACCTCTATCTGCTGCTCTTCTTTTCC
CATTGTTGTCCAAGATGGGCGTTTTGGATATACATCGTCCATCTTGA
Protein sequence
MEGKEEDVRLGANKFTERQPIGTAAQSQDDDAKDYKEPPPAPLFEPEELTSWSFYRAGIAEFIATFLFLYITVLTVMGVARSNEANGNKCKTVGIQGIAWAFGGMIFALV
YCTAGISGGHINPAVTFGLFLARKLSLTRAIFYMVMQCLGAICGAGVVKGFQPKPYERLGGGANAVNKGYTKGDGLGAEIVGTFILVYTVFSATDAKRSARDSHVPILAP
LPIGFAVFLVHLATIPITGTGINPARSLGAAIIFNRDEAWDDHWIFWVGPFIGAALAALYHQVHANIDTCGICPLAKQKRLSFTSNNSLSAKAFDLVHADIWGPFSSPTY
AGHSYFLTLVDDSTRYTWIFLLKHKSDVLSVVPQFFKLVETQFGCCIKQFRSDNAPELAFGDFFRSKGVIHQFSCVERPQQNSVVERKHQHILNVARALYFQSRVPIRFW
GDCVLTAVYLINRTPSPLLKWCTPFELLNGSKADYSSIRVFGCLCFASTLSSHRSKFHPRAIPAVFLGYPPGMKAYKLYDIEQQKLFISRDVIFHEEIFPFHTVTEASVL
PDPFPGLVLPKPFNVSTEVEPPSLSSPSPFLPDVAPVIDTTQQPLMDAHNIVPQQPHLIDPPLVRRSARVSKPPSYLHAYHCSLLSSTSLPTTSSRFPLQQFISYSRLSS
THRNLVLNVSTHFEPQFYHQAVVFDHWRAAMDAELAAMEDNKTWSVVPLPHGKHTIGCRWIYKIKHKADGSIERYKARLVAKGYTQQEGLDFLETFSPVAKLVTVKTLLT
VAVSKGWSLMQLDVNNAFLHGDLFEEVYMDLPLGYNSHVDSKGEHLVCKLHKSIYGLKQASRQWFAKFSHFLISLGFFQSKADYSLFVKGTGHSFVALLVYVDDIIITGS
NASTIQHLKDLLNAHFKLKDLGSLKYFLGLELARSSTGIFVSQRHYTLQLLEDTGFLGCKPTSIPMDPQLKLCSSNVDLLADPSIYRRLIGRLLYLTISRPDITFAVHKL
SQFMTKPCKSHLAAAHDLLRYLKSSPGQGIFLPAVSNFQIRAFADSDWGACLDTRRSITGFCVFLGDSLVSWKSKKQQTISRSSAEAEYRALAVTACEIIWLTNLLRDLQ
VQVSLPALLFCDNQAAVHIASNPVFHERTKHIELDCHFIRDKLIDGTIKLLPVRSHSQLADPFTKPLSAALLFPLLSKMGVLDIHRPS
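
The CDS and protein sequences listed above can be cross-checked by translating the CDS with the standard genetic code. The sketch below is a minimal, self-contained Python example; the helper name `translate` is hypothetical, and only the first and last few codons of the CDS are used for illustration rather than the full sequence.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered over T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from the wrapped listing
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for codons with ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Excerpts copied from the CDS listing: the first 19 codons and the last 5.
cds_start = "ATGGAGGGAAAGGAAGAAGATGTAAGATTGGGAGCTAACAAGTTCACCGAGAGGCAG"
cds_end = "CATCGTCCATCTTGA"

print(translate(cds_start))
print(translate(cds_end))
```

Passing the complete CDS (with the line breaks left in) to `translate` and comparing the result with the protein listing would confirm that the two records agree over their full length.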