; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0035092 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0035092
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionIntegrase catalytic domain-containing protein
Genome locationchr3:14764807..14773801
RNA-Seq ExpressionLag0035092
SyntenyLag0035092
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR005162 - Retrotransposon gag domain
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR025724 - GAG-pre-integrase domain
IPR029472 - Retrotransposon Copia-like, N-terminal
IPR036397 - Ribonuclease H superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KZV25004.1 Cysteine-rich RLK (receptor-like protein kinase) 8 [Dorcoceras hygrometricum]8.9e-21937.53Show/hide
Query:  SIEAQLNPFSLHHSFSSTAVLVTQPLLGAINYSSWSRAMMMALSGKNKSGFIKGTIKKPETQGSLLTAWECNNNVIASWILNSVSKDIAASIVYTGSAKE
        ++E   +P+ LH+       LV+ PL+G+ NY++W RAM++AL+ KNK GFI  +I +P ++  L  +W   N+++ SWILNSV+++IA S++Y  +A+E
Subjt:  SIEAQLNPFSLHHSFSSTAVLVTQPLLGAINYSSWSRAMMMALSGKNKSGFIKGTIKKPETQGSLLTAWECNNNVIASWILNSVSKDIAASIVYTGSAKE

Query:  LIT-----------------------LSQGNMSIEAYYTKLKTIWQDLEDFRPIPECKCEGLKPFLDHLHSEYVMIFLMGLNDSYAAVRAQILLMKPIPS
        + T                       L QG+M + +YYTKL+T+W +L D++P   C C  ++ + ++ + E VM FLMGLNDSYA VRAQ+L+++P+P+
Subjt:  LIT-----------------------LSQGNMSIEAYYTKLKTIWQDLEDFRPIPECKCEGLKPFLDHLHSEYVMIFLMGLNDSYAAVRAQILLMKPIPS

Query:  ITDVFSLLIQEERQRNAG---NSAAIEPIALLASDNSKKNPNSS--STDRTRKKENQRPICTNCGVKGHTIDRCYKLHGYPPGY-KTRNSSSPVPVEGSN
        I  VF+L+IQEERQR+     + A ++   +L++ NS  N  +S  ++  ++     R IC++C  + HT+D+CYKLHGYPPG+ K ++  S        
Subjt:  ITDVFSLLIQEERQRNAG---NSAAIEPIALLASDNSKKNPNSS--STDRTRKKENQRPICTNCGVKGHTIDRCYKLHGYPPGY-KTRNSSSPVPVEGSN

Query:  STKAPGVNQVSQS-DFFSSLNSNQYAQLLNMLSSHLPTAKN-----ETATAASAIT----------------------------------------HAAV
        ++ +   +Q +Q  D   SL  +Q  QL+  LSS L T +N     +  T  S +T                                         + V
Subjt:  STKAPGVNQVSQS-DFFSSLNSNQYAQLLNMLSSHLPTAKN-----ETATAASAIT----------------------------------------HAAV

Query:  TLPTAYRVQVEFIGDVRISDQIILRDVLFIPQFQYNLISVSCLLQTTDVSLTFSTENCLIQDISRLTTIGRADCCYGLYLLNAKVTQSLNTTVAATVKAK
         LP    + V   G V ++  ++L++VL++P FQ+NL+SVS L    + S++F +++C IQDIS++  IG       LY+L  +  + L + +  T    
Subjt:  TLPTAYRVQVEFIGDVRISDQIILRDVLFIPQFQYNLISVSCLLQTTDVSLTFSTENCLIQDISRLTTIGRADCCYGLYLLNAKVTQSLNTTVAATVKAK

Query:  VISYETWHQRLGHLSPKRLSLLKTSLSLPDS-FNSPCYICPLAKQRRLSFPSNNHVASQVFDLIHCDVWGPFKEPTYNGFRYFLTLVDDCSRFTWVYLMK
        V + E WH+R+GH S  +LS LK  L++ ++   + C+ C L+KQRRL   S N++++++F+L+H D WGPF + + +GFR+F T+VDD SR+TWVY++K
Subjt:  VISYETWHQRLGHLSPKRLSLLKTSLSLPDS-FNSPCYICPLAKQRRLSFPSNNHVASQVFDLIHCDVWGPFKEPTYNGFRYFLTLVDDCSRFTWVYLMK

Query:  QKSDAVSIIPRFFQLVETQFSKVIKAFRSDNAPELKFVDFFATKGTIHQFSCVETPQQNSVVERKHQHLLN----------------GDFILTATYLINR
         KSD +SI P F ++V TQF   +K+ RSDNAPEL F DFFA  G  H  SCVE PQQNSVVERKHQH+LN                 D I T+ YLINR
Subjt:  QKSDAVSIIPRFFQLVETQFSKVIKAFRSDNAPELKFVDFFATKGTIHQFSCVETPQQNSVVERKHQHLLN----------------GDFILTATYLINR

Query:  TPTPLLDNKSPFTVLHGSEVDYSQLRVFGCLCYASTLTTHRSKFDPRAVSCVFLGYPPGVKGYKLFDIAKGQVIVSRDVVFFEDSFPFSTRKDDCKSTNM
        TP+P+L +K+PF +LHG    YS L+VFGCLCYASTL + R KF PRA+ CVF+GYPPG KGYKL ++   ++ +SRDV+F E++FP+            
Subjt:  TPTPLLDNKSPFTVLHGSEVDYSQLRVFGCLCYASTLTTHRSKFDPRAVSCVFLGYPPGVKGYKLFDIAKGQVIVSRDVVFFEDSFPFSTRKDDCKSTNM

Query:  FPEFVLPCPIIDAITPDMNVFVEPPSTGDRLNNLSDDPSAMVSNQPAHIGDFPTCSDSNDEHCDTNVVQSPDDNILITVQPDINAEQSRQTPVVTHSSDS
                               P S  D    +S  PS+ ++                                     P I A+  +      HS   
Subjt:  FPEFVLPCPIIDAITPDMNVFVEPPSTGDRLNNLSDDPSAMVSNQPAHIGDFPTCSDSNDEHCDTNVVQSPDDNILITVQPDINAEQSRQTPVVTHSSDS

Query:  LNFEQPVIRRSKRQHHPPGFLKDYHCNLLQHSPISKDKNTPYSLDKYLSYSRLSATHREFILNVSIDSEPSYFHQAVKFDVWRHAMDEEIAAMEQTNTWT
                 R+ R H+ P  L+DYHC  +  +P S   +T + +   ++YS+LS++HR F+ N+S   EP+ F QAV    WR AMDEE+ A+E  +TW+
Subjt:  LNFEQPVIRRSKRQHHPPGFLKDYHCNLLQHSPISKDKNTPYSLDKYLSYSRLSATHREFILNVSIDSEPSYFHQAVKFDVWRHAMDEEIAAMEQTNTWT

Query:  LVPLPK------------------------------------------------EKIVTVKLFLSLTVSYGWSLVQMDINNAFLNGDLFEEVYMSIPLGY
        +V LP+                                                 K+VTV+  L+L    GW L+Q+D+NNAFL+GDL EEVYM++P G+
Subjt:  LVPLPK------------------------------------------------EKIVTVKLFLSLTVSYGWSLVQMDINNAFLNGDLFEEVYMSIPLGY

Query:  YRDRKFPTDIPLVCKLNKSIYGLKQASRQWFTKFSGVLLSNGFTQSKADYSLFTRGDDLSFVALLVYVDDILITGPSTSEIAYVRALLQENFLLKDLGNA
          + + P+    VCKL+KSIYGLKQASRQWF KFS  LLS GF QS AD SLF R D   F+AL+VYVDDI+I     +  + ++  L   F LKDLGN 
Subjt:  YRDRKFPTDIPLVCKLNKSIYGLKQASRQWFTKFSGVLLSNGFTQSKADYSLFTRGDDLSFVALLVYVDDILITGPSTSEIAYVRALLQENFLLKDLGNA

Query:  KYFLGLELSRSSKGM
        KYFLG+E++RS++G+
Subjt:  KYFLGLELSRSSKGM

RVW21404.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]2.1e-21238.99Show/hide
Query:  IEAQLNPFSLHHSFSSTAVLVTQPLLGAINYSSWSRAMMMALSGKNKSGFIKGTIKKPETQGSLLTAWECNNNVIASWILNSVSKDIAASIVYTGSA---
        +E   +P+ LH+       LV+  L GA NY +W RAM+MAL+ KNK GF+ GTI +P +   +  AW   N++I+SWI+N+VS++IA S++Y  SA   
Subjt:  IEAQLNPFSLHHSFSSTAVLVTQPLLGAINYSSWSRAMMMALSGKNKSGFIKGTIKKPETQGSLLTAWECNNNVIASWILNSVSKDIAASIVYTGSA---

Query:  --------------------KELITLSQGNMSIEAYYTKLKTIWQDLEDFRPIPECKCEGLKPFLDHLHSEYVMIFLMGLNDSYAAVRAQILLMKPIPSI
                            K+L  L+QG++ + +Y+TKLK +W +L +F+P+P C C GL+ + D+ H EYV+ FLMGLND YA +R QIL+M P+P+I
Subjt:  --------------------KELITLSQGNMSIEAYYTKLKTIWQDLEDFRPIPECKCEGLKPFLDHLHSEYVMIFLMGLNDSYAAVRAQILLMKPIPSI

Query:  TDVFSLLIQEERQRNAG--NSAAIEPIALLASDNSKKNPNSSSTDRTRKKENQRPICTNCGVKGHTIDRCYKLHGYPPGYKTRNSSSPVPVEGSNSTKAP
          VFSL+IQEER R  G   S +     +    NS     SS   +TR+    R  C+ CG +GH  D+CYKL GYPPG+K +N         +NS    
Subjt:  TDVFSLLIQEERQRNAG--NSAAIEPIALLASDNSKKNPNSSSTDRTRKKENQRPICTNCGVKGHTIDRCYKLHGYPPGYKTRNSSSPVPVEGSNSTKAP

Query:  GVNQVSQSDFFSSLNSNQYAQLLNMLSSHLPTAKNETATAAS-----------------------------------------AITHAAVTLPTAYRVQV
         +N  S     SSL + Q  QL+ +L++ L +  + +   +S                                         A+ +  VTLPT   V +
Subjt:  GVNQVSQSDFFSSLNSNQYAQLLNMLSSHLPTAKNETATAAS-----------------------------------------AITHAAVTLPTAYRVQV

Query:  EFIGDVRISDQIILRDVLFIPQFQYNLISVSCLLQTTDVSLTFSTENCLIQDISRLTTIGRADCCYGLYLLNAKVTQSLNTTVAATVKAKVISYETWHQR
        + +G V +S  + L +VLF+P F+YNL+S                      + SR   IG+          + K ++   + + +           WH R
Subjt:  EFIGDVRISDQIILRDVLFIPQFQYNLISVSCLLQTTDVSLTFSTENCLIQDISRLTTIGRADCCYGLYLLNAKVTQSLNTTVAATVKAKVISYETWHQR

Query:  LGHLSPKRLSLLKTSLSLPDSFN-SPCYICPLAKQRRLSFPSNNHVASQVFDLIHCDVWGPFKEPTYNGFRYFLTLVDDCSRFTWVYLMKQKSDAVSIIP
        LGH S  RL  L++ L    SF+ +PC +CPLAKQR L + S N   S  FDL+H D+WGPF   +  G+++FLT+VDD SR TWVY++K KS+    IP
Subjt:  LGHLSPKRLSLLKTSLSLPDSFN-SPCYICPLAKQRRLSFPSNNHVASQVFDLIHCDVWGPFKEPTYNGFRYFLTLVDDCSRFTWVYLMKQKSDAVSIIP

Query:  RFFQLVETQFSKVIKAFRSDNAPELKFVDFFATKGTIHQFSCVETPQQNSVVERKHQHLLN----------------GDFILTATYLINRTPTPLLDNKS
         FF  V+ QF K +KA RSDNAPEL   +F+ + G IH  SCVETPQQNSVVERKHQH+LN                 D ILTA YLINRTP+P L+NK+
Subjt:  RFFQLVETQFSKVIKAFRSDNAPELKFVDFFATKGTIHQFSCVETPQQNSVVERKHQHLLN----------------GDFILTATYLINRTPTPLLDNKS

Query:  PFTVLHGSEVDYSQLRVFGCLCYASTLTTHRSKFDPRAVSCVFLGYPPGVKGYKLFDIAKGQVIVSRDVVFFEDSFPFSTRKDDCK----STNMFPEFVL
        PF +LH    DYS LRVFGCLCY STL  +R+KF PRA + VFLGYP G KGYKL DI    + +SR+V+F E+ FPFS + + C     S+++F + VL
Subjt:  PFTVLHGSEVDYSQLRVFGCLCYASTLTTHRSKFDPRAVSCVFLGYPPGVKGYKLFDIAKGQVIVSRDVVFFEDSFPFSTRKDDCK----STNMFPEFVL

Query:  PCPIIDAITPDMNVFVEPPSTGDRLNNLSDDPSAMVSNQPAHIGDFPTCSDSNDEHCDTNVVQSPDDNILITVQPDINAEQSRQTPVVTHSSDSLNFEQP
        PC   D                      +D  S+++                        VV  P   +         A  SR T V   SS        
Subjt:  PCPIIDAITPDMNVFVEPPSTGDRLNNLSDDPSAMVSNQPAHIGDFPTCSDSNDEHCDTNVVQSPDDNILITVQPDINAEQSRQTPVVTHSSDSLNFEQP

Query:  VIRRSKRQHHPPGFLKDYHCNLLQHSPISKDKNTPYSLDKYLSYSRLSATHREFILNVSIDSEPSYFHQAVKFDVWRHAMDEEIAAMEQTNTWTLVPLP-
                     +LKDYHC+L+      +  +T + +  +LSY +LS++++ F L+VSI SEP  F +A +   WR AMD E+ A+E   TW++V LP 
Subjt:  VIRRSKRQHHPPGFLKDYHCNLLQHSPISKDKNTPYSLDKYLSYSRLSATHREFILNVSIDSEPSYFHQAVKFDVWRHAMDEEIAAMEQTNTWTLVPLP-

Query:  ---------KEKIVTVKLFLSLTVSYGWSLVQMDINNAFLNGDLFEEVYMSIPLGYYR-DRKFPTDIPLVCKLNKSIYGLKQASRQWFTKFSGVLLSNGF
                   K+VTVKL L++    GW L Q+D+NNAFL+GDL EEVYM +P GY R     P++   VC L+KS+YGLKQASRQWF+KFS  ++  GF
Subjt:  ---------KEKIVTVKLFLSLTVSYGWSLVQMDINNAFLNGDLFEEVYMSIPLGYYR-DRKFPTDIPLVCKLNKSIYGLKQASRQWFTKFSGVLLSNGF

Query:  TQSKADYSLFTRGDDLSFVALLVYVDDILITGPSTSEIAYVRALLQENFLLKDLGNAKYFLGLELSRSSKGM
        +QS +++SLF +  D  F+ALLVYVDD++I   +   IA +++ L + F LKDLG+ KYFLGLE+++SS G+
Subjt:  TQSKADYSLFTRGDDLSFVALLVYVDDILITGPSTSEIAYVRALLQENFLLKDLGNAKYFLGLELSRSSKGM

RVW82526.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]3.6e-21237.93Show/hide
Query:  IEAQLNPFSLHHSFSSTAVLVTQPLLGA-INYSSWSRAMMMALSGKNKSGFIKGTIKKPETQGSLLTAWECNNNVIASWILNSVSKDIAASIVYTGSA--
        +E   +P+ LH+    +  LV+  L G+  NY SW R+M+ AL+ KNK GFI GTI +P     L + W   N+++ SW+ NSV K+IA SI+Y  +A  
Subjt:  IEAQLNPFSLHHSFSSTAVLVTQPLLGA-INYSSWSRAMMMALSGKNKSGFIKGTIKKPETQGSLLTAWECNNNVIASWILNSVSKDIAASIVYTGSA--

Query:  ---------------------KELITLSQGNMSIEAYYTKLKTIWQDLEDFRPIPECKCEGLKPFLDHLHSEYVMIFLMGLNDSYAAVRAQILLMKPIPS
                             ++++  +QG+  +  YYT+LK++W +L +F+ IP C C G++ +++    E VM FL+GLN+S+A ++AQILLM+P P 
Subjt:  ---------------------KELITLSQGNMSIEAYYTKLKTIWQDLEDFRPIPECKCEGLKPFLDHLHSEYVMIFLMGLNDSYAAVRAQILLMKPIPS

Query:  ITDVFSLLIQEERQRNAGNSAAIEPIALLASDNSKKNPNSSSTDRTRKKENQRPICTNCGVKGHTIDRCYKLHGYPPGYKTRNSSSPVPVEGSNSTK---
        +  VFSL++QEE QR+   S +      ++S     +  SS T+ +R +++ RP+CT+C + GHT+DRCYK+HGY PG++ R +  P    GS   +   
Subjt:  ITDVFSLLIQEERQRNAGNSAAIEPIALLASDNSKKNPNSSSTDRTRKKENQRPICTNCGVKGHTIDRCYKLHGYPPGYKTRNSSSPVPVEGSNSTK---

Query:  -APGVNQVSQSDFFSS------LNSNQYAQLLNMLSSH-----------------------------------------LPTAKNETATAASAITHA---
         +   NQ++ +D   +      L  +Q+ QLL +LS H                                         L +         S++ H+   
Subjt:  -APGVNQVSQSDFFSS------LNSNQYAQLLNMLSSH-----------------------------------------LPTAKNETATAASAITHA---

Query:  ----AVTLPTAYRVQVEFIGDVRISDQIILRDVLFIPQFQYNLISVSCLLQTTDVSLTFSTENCLIQDISRLTTIGRADCCYGLYLLNAKVTQSLNTTVA
             VTLPT  ++ +  IG + +S  ++L  VL+IP FQ+NLIS+S L QT   S  F+   C IQD S+   IG       LYLL++ V +S+++   
Subjt:  ----AVTLPTAYRVQVEFIGDVRISDQIILRDVLFIPQFQYNLISVSCLLQTTDVSLTFSTENCLIQDISRLTTIGRADCCYGLYLLNAKVTQSLNTTVA

Query:  ATVKAKVISYETWHQRLGHLSPKRLSLLKTSLSLPDSFNS--PCYICPLAKQRRLSFPSNNHVASQVFDLIHCDVWGPFKEPTYNGFRYFLTLVDDCSRF
                  + WH RL H S  +LS+LK  L L  + N+   C ICPLAKQ+RL F  +N+++S  FDLIHCD+WGPF  PT++GFRYFLT+VDDC+R 
Subjt:  ATVKAKVISYETWHQRLGHLSPKRLSLLKTSLSLPDSFNS--PCYICPLAKQRRLSFPSNNHVASQVFDLIHCDVWGPFKEPTYNGFRYFLTLVDDCSRF

Query:  TWVYLMKQKSDAVSIIPRFFQLVETQFSKVIKAFRSDNAPELKFVDFFATKGTIHQFSCVETPQQNSVVERKHQHLLN----------------GDFILT
        TWV+L++ KSD  +I P+FF +V+T+F   IKA RSDNAPEL   + F     +H FSCVETPQQNSVVERKHQH+LN                GD +LT
Subjt:  TWVYLMKQKSDAVSIIPRFFQLVETQFSKVIKAFRSDNAPELKFVDFFATKGTIHQFSCVETPQQNSVVERKHQHLLN----------------GDFILT

Query:  ATYLINRTPTPLLDNKSPFTVLHGSEVDYSQLRVFGCLCYASTLTTHRSKFDPRAVSCVFLGYPPGVKGYKLFDIAKGQVIVSRDVVFFEDSFPFS-TRK
        + YLINR P+PLL+NK+PF +LH     YS L+ FGCLCY+STL + R KF PRA+ CVFLGYP G KGYK+ D+   ++ VSR+V F E  FPF  ++ 
Subjt:  ATYLINRTPTPLLDNKSPFTVLHGSEVDYSQLRVFGCLCYASTLTTHRSKFDPRAVSCVFLGYPPGVKGYKLFDIAKGQVIVSRDVVFFEDSFPFS-TRK

Query:  DDCKSTNMFPEFVLPCPIIDAITPDMNVFVEPPSTGDRLNNLSDDPSAMVSNQPAHIGDFPTCSDSNDEHCDTNVVQSPDDNILITVQPDINAEQSRQTP
        ++  +++ F + VLP              V P ST          PS    N  +H                     +PD +   T            +P
Subjt:  DDCKSTNMFPEFVLPCPIIDAITPDMNVFVEPPSTGDRLNNLSDDPSAMVSNQPAHIGDFPTCSDSNDEHCDTNVVQSPDDNILITVQPDINAEQSRQTP

Query:  VVTHSSDSLNFEQPVIRRSKRQHHPPGFLKDYHCNLLQHSP-ISKDKNTPYSLDKYLSYSRLSATHREFILNVSIDSEPSYFHQAVKFDVWRHAMDEEIA
          T  + +         RS R   PP +L DYHC+L   +P      +TPY L   +SY++LS + R F +++S  +EP+ + +AV    W+HAM  E+ 
Subjt:  VVTHSSDSLNFEQPVIRRSKRQHHPPGFLKDYHCNLLQHSP-ISKDKNTPYSLDKYLSYSRLSATHREFILNVSIDSEPSYFHQAVKFDVWRHAMDEEIA

Query:  AMEQTNTWTLVPLPKEKIV---------------TVKLFLSLTVSYGWSLVQMDINNAFL------NGDLFEEVYMSIPLGYYRDRKFPTDIPLVCKLNK
        A+E  NTW+L  LP  K                 +++ + +  V+ G++  Q +  + FL           +EV+M +P GY+R+R+      +VCKL+K
Subjt:  AMEQTNTWTLVPLPKEKIV---------------TVKLFLSLTVSYGWSLVQMDINNAFL------NGDLFEEVYMSIPLGYYRDRKFPTDIPLVCKLNK

Query:  SIYGLKQASRQWFTKFSGVLLSNGFTQSKADYSLFTRGDDLSFVALLVYVDDILITGPSTSEIAYVRALLQENFLLKDLGNAKYFLGLELSRSSKGM
        SIYGL+QASRQWF KFSGVL+S GF QS +DYSLF +     F+ALLVYVDDI++   +      ++  L ++F LKDLGN KYFLGLE++RS+KG+
Subjt:  SIYGLKQASRQWFTKFSGVLLSNGFTQSKADYSLFTRGDDLSFVALLVYVDDILITGPSTSEIAYVRALLQENFLLKDLGNAKYFLGLELSRSSKGM

XP_010526678.1 PREDICTED: uncharacterized protein LOC104804180 isoform X1 [Tarenaya hassleriana]2.8e-20436.69Show/hide
Query:  LPFLPTSPTM--DKEASNGSSTTIDASSGSIEAQLNPFSLHHSFSSTAVLVTQPLLGAINYSSWSRAMMMALSGKNKSGFIKGTIKKPETQGSLLTAWEC
        + F+P++P +     AS GS+++  A   S +   +P  LH + SS+  LV++ L G  NY  WSRA+  AL  KNK GFI GTI +P        +W  
Subjt:  LPFLPTSPTM--DKEASNGSSTTIDASSGSIEAQLNPFSLHHSFSSTAVLVTQPLLGAINYSSWSRAMMMALSGKNKSGFIKGTIKKPETQGSLLTAWEC

Query:  NNNVIASWILNSVSKDIAASIVYTGSAKE-----------------------LITLSQGNMSIEAYYTKLKTIWQDLEDFRPIPECKCEGL---------
         N ++ +W+ NSV  DI   I Y   A E                       + +L QG++++ +Y+TKL  +W++L+ F P+P C C+G          
Subjt:  NNNVIASWILNSVSKDIAASIVYTGSAKE-----------------------LITLSQGNMSIEAYYTKLKTIWQDLEDFRPIPECKCEGL---------

Query:  -KPFLDHLHSEYVMIFLMGLNDSYAAVRAQILLMKPIPSITDVFSLLIQEERQRNAGNSAAIEPIALLASDNSKKN----------PNSSSTDRTRKKEN
           +        V+ FLM LNDS++A R QIL+  P+P +T  ++L+ QEE+Q+    S   + +A   + NS ++          P S S   +     
Subjt:  -KPFLDHLHSEYVMIFLMGLNDSYAAVRAQILLMKPIPSITDVFSLLIQEERQRNAGNSAAIEPIALLASDNSKKN----------PNSSSTDRTRKKEN

Query:  QRPICTNCGVKGHTIDRCYKLHGYPPGYKTR---NSSSPVPVEGSNSTKAPGVNQVSQSDFFSSLNS--------------NQYAQLLNMLSSHLPTAKN
         RPICT+CG+ GH + RC++LHGYPPG+K+    NS S  P   S S +    NQV+  D  S+ N+              N  AQ+ ++LS  L   K 
Subjt:  QRPICTNCGVKGHTIDRCYKLHGYPPGYKTR---NSSSPVPVEGSNSTKAPGVNQVSQSDFFSSLNS--------------NQYAQLLNMLSSHLPTAKN

Query:  ETATA----------------------------------------ASAITH----------------AAVTLPTAYRVQVEFIGDVRISDQIILRDVLFI
        + + +                                          A TH                 +V+LP    ++V   G V +S  I L  VLFI
Subjt:  ETATA----------------------------------------ASAITH----------------AAVTLPTAYRVQVEFIGDVRISDQIILRDVLFI

Query:  PQFQYNLISVSCLLQTTDVSLTFSTENCLIQDISRLTTIGRADCCYGLYLL----NAKVTQSLNTTVAATVKAKVISYETWHQRLGHLSPKRL-SLLKTS
        P F YNL+SVSCL Q T  S+ F  ++ +IQD++R   IG+    + LY+L     +  T SL +    T+ A   +++ WH RLGH S  R+ S+ K+S
Subjt:  PQFQYNLISVSCLLQTTDVSLTFSTENCLIQDISRLTTIGRADCCYGLYLL----NAKVTQSLNTTVAATVKAKVISYETWHQRLGHLSPKRL-SLLKTS

Query:  ---LSLPDSFNSPCYICPLAKQRRLSFPSNNHVASQVFDLIHCDVWGPFKEPTYNGFRYFLTLVDDCSRFTWVYLMKQKSDAVSIIPRFFQLVETQFSKV
            S  ++ +  C +CPLAKQRRLSFP ++HV+   F+L+H DVWGP  E + +G R+FL++VDD SR TWVYL+K KSD +   P F   VE QF+  
Subjt:  ---LSLPDSFNSPCYICPLAKQRRLSFPSNNHVASQVFDLIHCDVWGPFKEPTYNGFRYFLTLVDDCSRFTWVYLMKQKSDAVSIIPRFFQLVETQFSKV

Query:  IKAFRSDNAPELKFVDFFATKGTIHQFSCVETPQQNSVVERKHQHLLN----------------GDFILTATYLINRTPTPLLDNKSPFTVLHGSEVDYS
        IK  RSDNAPEL F   FA KG +HQFSC  TPQQNS+VERKHQH+LN                GD ILT+ YLINRTP+PLL NK+PF +L G    YS
Subjt:  IKAFRSDNAPELKFVDFFATKGTIHQFSCVETPQQNSVVERKHQHLLN----------------GDFILTATYLINRTPTPLLDNKSPFTVLHGSEVDYS

Query:  QLRVFGCLCYASTLTTHRSKFDPRAVSCVFLGYPPGVKGYKLFDIAKGQVIVSRDVVFFEDSFPFSTRKDDCKSTNMFPEFVLPCPIIDAITPDMNVFVE
         LRVFGCLCY STLT  R KF+PRA+S VFLGYP GVKGYK+ D+    V++SR+VVF E +FPF +      + + FP+ V P    ++I+P       
Subjt:  QLRVFGCLCYASTLTTHRSKFDPRAVSCVFLGYPPGVKGYKLFDIAKGQVIVSRDVVFFEDSFPFSTRKDDCKSTNMFPEFVLPCPIIDAITPDMNVFVE

Query:  PPSTGDRLNNLSDDPSAMVSNQPAHIGDFPTCSDSNDEHCDTNVVQSPDDNILITVQPDINAEQSRQTPVVTHSSDSLNFEQPVIRRSKRQHHPPGFLKD
                 NLS   +    +Q     +FPT   S+                       + + ++  +  VT SS         + R +RQ   P +L D
Subjt:  PPSTGDRLNNLSDDPSAMVSNQPAHIGDFPTCSDSNDEHCDTNVVQSPDDNILITVQPDINAEQSRQTPVVTHSSDSLNFEQPVIRRSKRQHHPPGFLKD

Query:  YHCNLLQH-SPISKDKNTPYSLDKYLSYSRLSATHREFILNVSIDSEPSYFHQAVKFDVWRHAMDEEIAAMEQTNTWTLVPLPK----------------
        YHC L+ H S    +  TPY L   L+Y  LS ++R F LN++   EP  + QA KF+ WR AM  E+ A+ +TNTW++  LP                 
Subjt:  YHCNLLQH-SPISKDKNTPYSLDKYLSYSRLSATHREFILNVSIDSEPSYFHQAVKFDVWRHAMDEEIAAMEQTNTWTLVPLPK----------------

Query:  --------------------------------EKIVTVKLFLSLTVSYGWSLVQMDINNAFLNGDLFEEVYMSIPLGYYRDRKFPTDIPLVCKLNKSIYG
                                         K+ TV++ L+L   Y W + QMD++NAFLNGDL EE+YM +P GY   +        VCKL +S+YG
Subjt:  --------------------------------EKIVTVKLFLSLTVSYGWSLVQMDINNAFLNGDLFEEVYMSIPLGYYRDRKFPTDIPLVCKLNKSIYG

Query:  LKQASRQWFTKFSGVLLSNGFTQSKADYSLFTRGDDLSFVALLVYVDDILITGPSTSEIAYVRALLQENFLLKDLGNAKYFLGLELSRSSKG
        LKQASRQW  K S VLL+ GF Q ++D SLF + +    VA+LVYVDD+LITG   + I+ ++  L  +F +KDLG   YFLGLE++RS +G
Subjt:  LKQASRQWFTKFSGVLLSNGFTQSKADYSLFTRGDDLSFVALLVYVDDILITGPSTSEIAYVRALLQENFLLKDLGNAKYFLGLELSRSSKG

XP_010526680.1 PREDICTED: uncharacterized protein LOC104804180 isoform X2 [Tarenaya hassleriana]4.3e-20537.83Show/hide
Query:  LPFLPTSPTM--DKEASNGSSTTIDASSGSIEAQLNPFSLHHSFSSTAVLVTQPLLGAINYSSWSRAMMMALSGKNKSGFIKGTIKKPETQGSLLTAWEC
        + F+P++P +     AS GS+++  A   S +   +P  LH + SS+  LV++ L G  NY  WSRA+  AL  KNK GFI GTI +P        +W  
Subjt:  LPFLPTSPTM--DKEASNGSSTTIDASSGSIEAQLNPFSLHHSFSSTAVLVTQPLLGAINYSSWSRAMMMALSGKNKSGFIKGTIKKPETQGSLLTAWEC

Query:  NNNVIASWILNSVSKDIAASIVYTGSAKE-----------------------LITLSQGNMSIEAYYTKLKTIWQDLEDFRPIPECKCEGL---------
         N ++ +W+ NSV  DI   I Y   A E                       + +L QG++++ +Y+TKL  +W++L+ F P+P C C+G          
Subjt:  NNNVIASWILNSVSKDIAASIVYTGSAKE-----------------------LITLSQGNMSIEAYYTKLKTIWQDLEDFRPIPECKCEGL---------

Query:  -KPFLDHLHSEYVMIFLMGLNDSYAAVRAQILLMKPIPSITDVFSLLIQEERQRNAGNSAAIEPIALLASDNSKKN----------PNSSSTDRTRKKEN
           +        V+ FLM LNDS++A R QIL+  P+P +T  ++L+ QEE+Q+    S   + +A   + NS ++          P S S   +     
Subjt:  -KPFLDHLHSEYVMIFLMGLNDSYAAVRAQILLMKPIPSITDVFSLLIQEERQRNAGNSAAIEPIALLASDNSKKN----------PNSSSTDRTRKKEN

Query:  QRPICTNCGVKGHTIDRCYKLHGYPPGYKTR---NSSSPVPVEGSNSTKAPGVNQVSQ--SDFFS--------SLNSN--------------QYAQLLNM
         RPICT+CG+ GH + RC++LHGYPPG+K+    NS S  P   S S +    NQV    S   S        SL+SN               Y  L   
Subjt:  QRPICTNCGVKGHTIDRCYKLHGYPPGYKTR---NSSSPVPVEGSNSTKAPGVNQVSQ--SDFFS--------SLNSN--------------QYAQLLNM

Query:  LSS-HLPTAKNETATAASA--------------ITHAAVTLPTAYRVQVEFIGDVRISDQIILRDVLFIPQFQYNLISVSCLLQTTDVSLTFSTENCLIQ
         S+   PTA     T AS               I   +V+LP    ++V   G V +S  I L  VLFIP F YNL+SVSCL Q T  S+ F  ++ +IQ
Subjt:  LSS-HLPTAKNETATAASA--------------ITHAAVTLPTAYRVQVEFIGDVRISDQIILRDVLFIPQFQYNLISVSCLLQTTDVSLTFSTENCLIQ

Query:  DISRLTTIGRADCCYGLYLL----NAKVTQSLNTTVAATVKAKVISYETWHQRLGHLSPKRL-SLLKTS---LSLPDSFNSPCYICPLAKQRRLSFPSNN
        D++R   IG+    + LY+L     +  T SL +    T+ A   +++ WH RLGH S  R+ S+ K+S    S  ++ +  C +CPLAKQRRLSFP ++
Subjt:  DISRLTTIGRADCCYGLYLL----NAKVTQSLNTTVAATVKAKVISYETWHQRLGHLSPKRL-SLLKTS---LSLPDSFNSPCYICPLAKQRRLSFPSNN

Query:  HVASQVFDLIHCDVWGPFKEPTYNGFRYFLTLVDDCSRFTWVYLMKQKSDAVSIIPRFFQLVETQFSKVIKAFRSDNAPELKFVDFFATKGTIHQFSCVE
        HV+   F+L+H DVWGP  E + +G R+FL++VDD SR TWVYL+K KSD +   P F   VE QF+  IK  RSDNAPEL F   FA KG +HQFSC  
Subjt:  HVASQVFDLIHCDVWGPFKEPTYNGFRYFLTLVDDCSRFTWVYLMKQKSDAVSIIPRFFQLVETQFSKVIKAFRSDNAPELKFVDFFATKGTIHQFSCVE

Query:  TPQQNSVVERKHQHLLN----------------GDFILTATYLINRTPTPLLDNKSPFTVLHGSEVDYSQLRVFGCLCYASTLTTHRSKFDPRAVSCVFL
        TPQQNS+VERKHQH+LN                GD ILT+ YLINRTP+PLL NK+PF +L G    YS LRVFGCLCY STLT  R KF+PRA+S VFL
Subjt:  TPQQNSVVERKHQHLLN----------------GDFILTATYLINRTPTPLLDNKSPFTVLHGSEVDYSQLRVFGCLCYASTLTTHRSKFDPRAVSCVFL

Query:  GYPPGVKGYKLFDIAKGQVIVSRDVVFFEDSFPFSTRKDDCKSTNMFPEFVLPCPIIDAITPDMNVFVEPPSTGDRLNNLSDDPSAMVSNQPAHIGDFPT
        GYP GVKGYK+ D+    V++SR+VVF E +FPF +      + + FP+ V P    ++I+P                NLS   +    +Q     +FPT
Subjt:  GYPPGVKGYKLFDIAKGQVIVSRDVVFFEDSFPFSTRKDDCKSTNMFPEFVLPCPIIDAITPDMNVFVEPPSTGDRLNNLSDDPSAMVSNQPAHIGDFPT

Query:  CSDSNDEHCDTNVVQSPDDNILITVQPDINAEQSRQTPVVTHSSDSLNFEQPVIRRSKRQHHPPGFLKDYHCNLLQH-SPISKDKNTPYSLDKYLSYSRL
           S+                       + + ++  +  VT SS         + R +RQ   P +L DYHC L+ H S    +  TPY L   L+Y  L
Subjt:  CSDSNDEHCDTNVVQSPDDNILITVQPDINAEQSRQTPVVTHSSDSLNFEQPVIRRSKRQHHPPGFLKDYHCNLLQH-SPISKDKNTPYSLDKYLSYSRL

Query:  SATHREFILNVSIDSEPSYFHQAVKFDVWRHAMDEEIAAMEQTNTWTLVPLPK-----------------------------------------------
        S ++R F LN++   EP  + QA KF+ WR AM  E+ A+ +TNTW++  LP                                                
Subjt:  SATHREFILNVSIDSEPSYFHQAVKFDVWRHAMDEEIAAMEQTNTWTLVPLPK-----------------------------------------------

Query:  -EKIVTVKLFLSLTVSYGWSLVQMDINNAFLNGDLFEEVYMSIPLGYYRDRKFPTDIPLVCKLNKSIYGLKQASRQWFTKFSGVLLSNGFTQSKADYSLF
          K+ TV++ L+L   Y W + QMD++NAFLNGDL EE+YM +P GY   +        VCKL +S+YGLKQASRQW  K S VLL+ GF Q ++D SLF
Subjt:  -EKIVTVKLFLSLTVSYGWSLVQMDINNAFLNGDLFEEVYMSIPLGYYRDRKFPTDIPLVCKLNKSIYGLKQASRQWFTKFSGVLLSNGFTQSKADYSLF

Query:  TRGDDLSFVALLVYVDDILITGPSTSEIAYVRALLQENFLLKDLGNAKYFLGLELSRSSKG
         + +    VA+LVYVDD+LITG   + I+ ++  L  +F +KDLG   YFLGLE++RS +G
Subjt:  TRGDDLSFVALLVYVDDILITGPSTSEIAYVRALLQENFLLKDLGNAKYFLGLELSRSSKG

TrEMBL top hitse value%identityAlignment
A0A2N9EHN7 Integrase catalytic domain-containing protein3.5e-22138.48Show/hide
Query:  ASNGSSTTIDASSGS----IEAQLNPFSLHHSFSSTAVLVTQPLLGAINYSSWSRAMMMALSGKNKSGFIKGTIKKPETQGSLL-TAWECNNNVIASWIL
        +S+ SST+   SS S    IE   +P+ L++  +    +V  PL G  NY +W R+M  ALS KNK GF+ G+I +P  +  LL + W+  N+++ SWI 
Subjt:  ASNGSSTTIDASSGS----IEAQLNPFSLHHSFSSTAVLVTQPLLGAINYSSWSRAMMMALSGKNKSGFIKGTIKKPETQGSLL-TAWECNNNVIASWIL

Query:  NSVSKDIAASIVYTGSAKE-----------------------LITLSQGNMSIEAYYTKLKTIWQDLEDFRPIPECKCE-----GL-KPFLDHLHSEYVM
        N +SK I A+++Y  +AKE                       + +L Q NM +  Y+T+LK +W +  ++RPIP C C      GL +  +D+ H +YV 
Subjt:  NSVSKDIAASIVYTGSAKE-----------------------LITLSQGNMSIEAYYTKLKTIWQDLEDFRPIPECKCE-----GL-KPFLDHLHSEYVM

Query:  IFLMGLNDSYAAVRAQILLMKPIPSITDVFSLLIQEERQRNAG-----------NSAAI------EPIALLASDNSKKNPNSSSTDRTR------KKENQ
         FLMGLNDS+A VR QILLM+P+P+I  VFSL+  +E+QR AG            S A+       P   L+  N+  N   + TD ++      +K+  
Subjt:  IFLMGLNDSYAAVRAQILLMKPIPSITDVFSLLIQEERQRNAG-----------NSAAI------EPIALLASDNSKKNPNSSSTDRTR------KKENQ

Query:  RPICTNCGVKGHTIDRCYKLHGYPPGYKTRN---------SSSPVP-VEGSNSTKA-PGV---------------NQVSQSDFFSSLNSNQYAQLLNMLS
          IC++CG KGHT D+CYKLHGYPPG++++          SSS VP  E +N+ ++ P +                Q  Q++  S  +++Q A  ++ +S
Subjt:  RPICTNCGVKGHTIDRCYKLHGYPPGYKTRN---------SSSPVP-VEGSNSTKA-PGV---------------NQVSQSDFFSSLNSNQYAQLLNMLS

Query:  SHLP----------------------------TAK---------------------NETATAASAITHAAVTLPTAYRVQVEFIGDVRISDQIILRDVLF
           P                            T K                      +  T    + + +V LP    V V  IG V+I+  ++L DVL 
Subjt:  SHLP----------------------------TAK---------------------NETATAASAITHAAVTLPTAYRVQVEFIGDVRISDQIILRDVLF

Query:  IPQFQYNLISVSCLLQTTDVSLTFSTENCLIQDISRLTTIGRADCCYGLYLLNAKVTQSLNTTVAA---------------TVKAKVISYETWHQRLGHL
        +P F +NLISVS L  +    + F +  C IQD+     IG      GLYLL+   + S NT  AA               ++K        WH R GH 
Subjt:  IPQFQYNLISVSCLLQTTDVSLTFSTENCLIQDISRLTTIGRADCCYGLYLLNAKVTQSLNTTVAA---------------TVKAKVISYETWHQRLGHL

Query:  SPKRLSLLKT---SLSLPDSFNSPCYICPLAKQRRLSFPSNNHVASQVFDLIHCDVWGPFKEPTYNGFRYFLTLVDDCSRFTWVYLMKQKSDAVSIIPRF
        S  R+  L +   ++SL     S C +CPLAKQ+RL FP+ NH++   FDL+H D+WGP+  PT  G+RYFLTLVDDC+R TW+YLM+ KSD   ++  F
Subjt:  SPKRLSLLKT---SLSLPDSFNSPCYICPLAKQRRLSFPSNNHVASQVFDLIHCDVWGPFKEPTYNGFRYFLTLVDDCSRFTWVYLMKQKSDAVSIIPRF

Query:  FQLVETQFSKVIKAFRSDNAPELKFVDFFATKGTIHQFSCVETPQQNSVVERKHQHLLN----------------GDFILTATYLINRTPTPLLDNKSPF
          +++TQF  +IK  RSDN  E    +F+A+KG IHQ SCVETPQQNSVVERKHQH+LN                G  I TA YLINR P P+L NKSPF
Subjt:  FQLVETQFSKVIKAFRSDNAPELKFVDFFATKGTIHQFSCVETPQQNSVVERKHQHLLN----------------GDFILTATYLINRTPTPLLDNKSPF

Query:  TVLHGSEVDYSQLRVFGCLCYASTLTTHRSKFDPRAVSCVFLGYPPGVKGYKLFDIAKGQVIVSRDVVFFEDSFPFSTRKDDCKSTNMF----------P
          L      Y+ L+VFGCLC+ASTL++HR+KFDPRA SCVFLGYP GVKGYKL D+   +V +SRDVVF E  FPF T+      T             P
Subjt:  TVLHGSEVDYSQLRVFGCLCYASTLTTHRSKFDPRAVSCVFLGYPPGVKGYKLFDIAKGQVIVSRDVVFFEDSFPFSTRKDDCKSTNMF----------P

Query:  EFVLPCPII-DAITPDMNVFVEPPSTGDRLNNLSDDPSAMVSNQPAHIGDFPTCSDSNDEHCDTNVVQSPD-DNILITVQPDINAEQSRQTPVVTHSSDS
         F+  C II D I P   +   PPS           P   +S  P    D          H D  +  SP  D+I +      +  QS  +P        
Subjt:  EFVLPCPII-DAITPDMNVFVEPPSTGDRLNNLSDDPSAMVSNQPAHIGDFPTCSDSNDEHCDTNVVQSPD-DNILITVQPDINAEQSRQTPVVTHSSDS

Query:  LNFEQPVIRRSKRQHHPPGFLKDYHCNLLQ-----HSPISKDKNTPYSLDKYLSYSRLSATHREFILNVSIDSEPSYFHQAVKFDVWRHAMDEEIAAMEQ
               +RRS R H PP +L+DYHC L        SP      TPY L   LSY  LS THR F L+V+  SEPS FHQA +   W+ AM  E+AA+E 
Subjt:  LNFEQPVIRRSKRQHHPPGFLKDYHCNLLQ-----HSPISKDKNTPYSLDKYLSYSRLSATHREFILNVSIDSEPSYFHQAVKFDVWRHAMDEEIAAMEQ

Query:  TNTWTLVPLPK------------------------------------------------EKIVTVKLFLSLTVSYGWSLVQMDINNAFLNGDLFEEVYMS
         NTWTL PLP                                                  K  TV+  L++  +  WSL Q+D+NNAFL+GDL EEVYM+
Subjt:  TNTWTLVPLPK------------------------------------------------EKIVTVKLFLSLTVSYGWSLVQMDINNAFLNGDLFEEVYMS

Query:  IPLGYYRDRKFPTDIPLVCKLNKSIYGLKQASRQWFTKFSGVLLSNGFTQSKADYSLFTRGDDLSFVALLVYVDDILITGPSTSEIAYVRALLQENFLLK
        +PLG+    K  T   LVCKLNKS+YGLKQASRQWF KFS  ++  GF QS +DYSLFTR   + F+ALLVYVDDILI       +  ++  L   F LK
Subjt:  IPLGYYRDRKFPTDIPLVCKLNKSIYGLKQASRQWFTKFSGVLLSNGFTQSKADYSLFTRGDDLSFVALLVYVDDILITGPSTSEIAYVRALLQENFLLK

Query:  DLGNAKYFLGLELSRSSKGM
        DLGN K+FLGLE++RS+KG+
Subjt:  DLGNAKYFLGLELSRSSKGM

A0A2N9G1Y1 Integrase catalytic domain-containing protein2.3e-22038.41Show/hide
Query:  ASNGSSTTIDASSGS----IEAQLNPFSLHHSFSSTAVLVTQPLLGAINYSSWSRAMMMALSGKNKSGFIKGTIKKPETQGSLL-TAWECNNNVIASWIL
        +S+ SST+   SS S    IE   +P+ L++  +    +V  PL G  NY +W R+M  ALS KNK GF+ G+I +P  +  LL + W+  N+++ SWI 
Subjt:  ASNGSSTTIDASSGS----IEAQLNPFSLHHSFSSTAVLVTQPLLGAINYSSWSRAMMMALSGKNKSGFIKGTIKKPETQGSLL-TAWECNNNVIASWIL

Query:  NSVSKDIAASIVYTGSAKE-----------------------LITLSQGNMSIEAYYTKLKTIWQDLEDFRPIPECKCE-----GL-KPFLDHLHSEYVM
        N +SK I A+++Y  +AKE                       + +L Q NM +  Y+T+LK +W +  ++RPIP C C      GL +  +D+ H +YV 
Subjt:  NSVSKDIAASIVYTGSAKE-----------------------LITLSQGNMSIEAYYTKLKTIWQDLEDFRPIPECKCE-----GL-KPFLDHLHSEYVM

Query:  IFLMGLNDSYAAVRAQILLMKPIPSITDVFSLLIQEERQRNAG-----------NSAAI------EPIALLASDNSKKNPNSSSTDRTR------KKENQ
         FLMGLNDS+A VR QILLM+P+P+I  VFSL+  +E+QR AG            S A+       P   L+  N+  N   + TD ++      +K+  
Subjt:  IFLMGLNDSYAAVRAQILLMKPIPSITDVFSLLIQEERQRNAG-----------NSAAI------EPIALLASDNSKKNPNSSSTDRTR------KKENQ

Query:  RPICTNCGVKGHTIDRCYKLHGYPPGYKTRN---------SSSPVP-VEGSNSTKA-PGV---------------NQVSQSDFFSSLNSNQYAQLLNMLS
          IC++CG KGHT D+CYKLHGYPPG++++          SSS VP  E +N+ ++ P +                Q  Q++  S  +++Q A  ++ +S
Subjt:  RPICTNCGVKGHTIDRCYKLHGYPPGYKTRN---------SSSPVP-VEGSNSTKA-PGV---------------NQVSQSDFFSSLNSNQYAQLLNMLS

Query:  SHLP----------------------------TAK---------------------NETATAASAITHAAVTLPTAYRVQVEFIGDVRISDQIILRDVLF
           P                            T K                      +  T    + + +V LP    V V  IG V+I+  ++L DVL 
Subjt:  SHLP----------------------------TAK---------------------NETATAASAITHAAVTLPTAYRVQVEFIGDVRISDQIILRDVLF

Query:  IPQFQYNLISVSCLLQTTDVSLTFSTENCLIQDISRLTTIGRADCCYGLYLLNAKVTQSLNTTVAA---------------TVKAKVISYETWHQRLGHL
        +P F +NLISVS L  +    + F +  C IQD+     IG      GLYLL+   + S NT  AA               ++K        WH R GH 
Subjt:  IPQFQYNLISVSCLLQTTDVSLTFSTENCLIQDISRLTTIGRADCCYGLYLLNAKVTQSLNTTVAA---------------TVKAKVISYETWHQRLGHL

Query:  SPKRLSLLKT---SLSLPDSFNSPCYICPLAKQRRLSFPSNNHVASQVFDLIHCDVWGPFKEPTYNGFRYFLTLVDDCSRFTWVYLMKQKSDAVSIIPRF
        S  R+  L +   ++SL     S C +CPLAKQ+RL FP+ NH++   FDL+H D+WGP+  PT  G+RYFLTLVDDC+R TW+YLM+ KSD   ++  F
Subjt:  SPKRLSLLKT---SLSLPDSFNSPCYICPLAKQRRLSFPSNNHVASQVFDLIHCDVWGPFKEPTYNGFRYFLTLVDDCSRFTWVYLMKQKSDAVSIIPRF

Query:  FQLVETQFSKVIKAFRSDNAPELKFVDFFATKGTIHQFSCVETPQQNSVVERKHQHLLN----------------GDFILTATYLINRTPTPLLDNKSPF
          +++TQF  +IK  RSDN  E    +F+A+KG IHQ SCVETPQQNSVVERKHQH+LN                G  I TA YLINR P P+L NKSPF
Subjt:  FQLVETQFSKVIKAFRSDNAPELKFVDFFATKGTIHQFSCVETPQQNSVVERKHQHLLN----------------GDFILTATYLINRTPTPLLDNKSPF

Query:  TVLHGSEVDYSQLRVFGCLCYASTLTTHRSKFDPRAVSCVFLGYPPGVKGYKLFDIAKGQVIVSRDVVFFEDSFPFSTRKDDCKSTNMF----------P
          L      Y+ L+VFGCLC+ASTL++HR+KFDPRA SCVFLGYP GVKGYKL D+   +V +SRDVVF E  FPF T+      T             P
Subjt:  TVLHGSEVDYSQLRVFGCLCYASTLTTHRSKFDPRAVSCVFLGYPPGVKGYKLFDIAKGQVIVSRDVVFFEDSFPFSTRKDDCKSTNMF----------P

Query:  EFVLPCPII-DAITPDMNVFVEPPSTGDRLNNLSDDPSAMVSNQPAHIGDFPTCSDSNDEHCDTNVVQSPD-DNILITVQPDINAEQSRQTPVVTHSSDS
         F+  C II D I P   +   PPS           P   +S  P    D          H D  +  SP  D+I +      +  QS  +P        
Subjt:  EFVLPCPII-DAITPDMNVFVEPPSTGDRLNNLSDDPSAMVSNQPAHIGDFPTCSDSNDEHCDTNVVQSPD-DNILITVQPDINAEQSRQTPVVTHSSDS

Query:  LNFEQPVIRRSKRQHHPPGFLKDYHCNLLQ-----HSPISKDKNTPYSLDKYLSYSRLSATHREFILNVSIDSEPSYFHQAVKFDVWRHAMDEEIAAMEQ
               +RRS R H P  +L+DYHC L        SP +    TPY L   LSY  LS THR F L+V+  SEPS FHQA +   W+ AM  E+AA+E 
Subjt:  LNFEQPVIRRSKRQHHPPGFLKDYHCNLLQ-----HSPISKDKNTPYSLDKYLSYSRLSATHREFILNVSIDSEPSYFHQAVKFDVWRHAMDEEIAAMEQ

Query:  TNTWTLVPLPK------------------------------------------------EKIVTVKLFLSLTVSYGWSLVQMDINNAFLNGDLFEEVYMS
         NTWTL PLP                                                  K  TV+  L++  +  WSL Q+D+NNAFL+GDL EEVYM+
Subjt:  TNTWTLVPLPK------------------------------------------------EKIVTVKLFLSLTVSYGWSLVQMDINNAFLNGDLFEEVYMS

Query:  IPLGYYRDRKFPTDIPLVCKLNKSIYGLKQASRQWFTKFSGVLLSNGFTQSKADYSLFTRGDDLSFVALLVYVDDILITGPSTSEIAYVRALLQENFLLK
        +PLG+    K  T   LVCKLNKS+YGLKQASRQWF KFS  ++  GF QS +DYSLFTR   + F+ALLVYVDDILI       +  ++  L   F LK
Subjt:  IPLGYYRDRKFPTDIPLVCKLNKSIYGLKQASRQWFTKFSGVLLSNGFTQSKADYSLFTRGDDLSFVALLVYVDDILITGPSTSEIAYVRALLQENFLLK

Query:  DLGNAKYFLGLELSRSSKGM
        DLGN K+FLGLE++RS+KG+
Subjt:  DLGNAKYFLGLELSRSSKGM

A0A2N9GZW3 Integrase catalytic domain-containing protein4.2e-23040.11Show/hide
Query:  FSLHHSFSSTAVLVTQPLLGAINYSSWSRAMMMALSGKNKSGFIKGTIKKPETQGS-LLTAWECNNNVIASWILNSVSKDIAASIVYTGSAKE-------
        + LHH  S  A+LV+Q L+G  NY +WSR+M+MAL+ KNK GF+ G I++P+ + S    AW   N ++ SW+LNS+SK+IA+S++Y  +AKE       
Subjt:  FSLHHSFSSTAVLVTQPLLGAINYSSWSRAMMMALSGKNKSGFIKGTIKKPETQGS-LLTAWECNNNVIASWILNSVSKDIAASIVYTGSAKE-------

Query:  ----------------LITLSQGNMSIEAYYTKLKTIWQDLEDFRPIPECKCEGLKPFLDHLHSEYVMIFLMGLNDSYAAVRAQILLMKPIPSITDVFSL
                        +  LSQ N S+ +YYT+LK++W +L +FRPIP+C C  +K  LD+   EYVM FLMGLNDS++ VRAQIL+  P+PSIT  F+L
Subjt:  ----------------LITLSQGNMSIEAYYTKLKTIWQDLEDFRPIPECKCEGLKPFLDHLHSEYVMIFLMGLNDSYAAVRAQILLMKPIPSITDVFSL

Query:  LIQEERQRNAGNSAAIEPIALLASDNSKKNPNSSSTDRTRKKENQRPICTNCGVKGHTIDRCYKLHGYPPGYK-------TRNSSSPVPVEGSNSTKAPG
        +IQEERQRN  N  ++ P A   +  ++      +  + +  +  RPIC++CG+ GHT+D+CYKLHGYPPGYK          SS+ V       T+A  
Subjt:  LIQEERQRNAGNSAAIEPIALLASDNSKKNPNSSSTDRTRKKENQRPICTNCGVKGHTIDRCYKLHGYPPGYK-------TRNSSSPVPVEGSNSTKAPG

Query:  ---VNQVSQSDFFSSLNSNQYAQLLNMLSSHLPTAKNETATAASAITH----------------------------------------------------
           ++ +S     +SL S+Q+     ++S       +    AASAI+H                                                    
Subjt:  ---VNQVSQSDFFSSLNSNQYAQLLNMLSSHLPTAKNETATAASAITH----------------------------------------------------

Query:  ------AAVTLPTAYRVQVEFIGDVRISDQIILRDVLFIPQFQYNLISVSCLLQTTDVSLTFSTENCLIQDISRLTTIGRADCCYGLYLL----NAKVTQ
                + LP   +V    IG V+++  ++L DVL +P F +NLIS+S L  T    + F +  C IQD+     IG      GLY L    +A  + 
Subjt:  ------AAVTLPTAYRVQVEFIGDVRISDQIILRDVLFIPQFQYNLISVSCLLQTTDVSLTFSTENCLIQDISRLTTIGRADCCYGLYLL----NAKVTQ

Query:  SLNTTVAATVKAKVISYETWHQRLGHLSPKRLSLLKTSLS--LPDSFNSPCYICPLAKQRRLSFPSNNHVASQVFDLIHCDVWGPFKEPTYNGFRYFLTL
        S     A T       ++ WH RLGH S  RLSLLK  +S  +  S N  C +C ++KQ+RL F +  H A   FDLIHCD+WGP+  PT +  RYFLT+
Subjt:  SLNTTVAATVKAKVISYETWHQRLGHLSPKRLSLLKTSLS--LPDSFNSPCYICPLAKQRRLSFPSNNHVASQVFDLIHCDVWGPFKEPTYNGFRYFLTL

Query:  VDDCSRFTWVYLMKQKSDAVSIIPRFFQLVETQFSKVIKAFRSDNAPELKFVDFFATKGTIHQFSCVETPQQNSVVERKHQHLLN---------------
        VDDC+R TWV+LMKQKS+   +I  FF L++TQFS  IK  RSDN PE K   F+A  GT+HQ SCV TPQQN+ VERKHQHLL                
Subjt:  VDDCSRFTWVYLMKQKSDAVSIIPRFFQLVETQFSKVIKAFRSDNAPELKFVDFFATKGTIHQFSCVETPQQNSVVERKHQHLLN---------------

Query:  -GDFILTATYLINRTPTPLLDNKSPFTVLHGSEVDYSQLRVFGCLCYASTLTTHRSKFDPRAVSCVFLGYPPGVKGYKLFDIAKGQVIVSRDVVFFEDSF
         G  +LTAT+LINR PTPLL NKSPF +L     +YS LRVFGCLCYA+TL+ +R KF PR+  CV LGYP G+KGY+L D+   QV VSRDV+F+E+SF
Subjt:  -GDFILTATYLINRTPTPLLDNKSPFTVLHGSEVDYSQLRVFGCLCYASTLTTHRSKFDPRAVSCVFLGYPPGVKGYKLFDIAKGQVIVSRDVVFFEDSF

Query:  PFSTRKDDCKSTNMFPEFVLPCPIIDAITPDMNVFVEPPSTGDRLNNLSDDPSAMVSNQPAHIGDFPTCSDSNDEHCDTNVVQSP-DDNILITVQPDINA
        PF T +    + +M    VLP PI D           P S      + +   S+ + N P H       S  +  H  T+   SP   N  +   PDI +
Subjt:  PFSTRKDDCKSTNMFPEFVLPCPIIDAITPDMNVFVEPPSTGDRLNNLSDDPSAMVSNQPAHIGDFPTCSDSNDEHCDTNVVQSP-DDNILITVQPDINA

Query:  EQSRQTPVVTHSSDSLNFEQPVIRRSKRQHHPPGFLKDYHCNLLQ----HSPISKDKN-----TPYSLDKYLSYSRLSATHREFILNVSIDSEPSYFHQA
        +Q         ++   N     +R+S R H PP +L+ +HCN       HSP S   N     T + L  Y+SYS+L+  +  F+L+ S   EP+ FH+A
Subjt:  EQSRQTPVVTHSSDSLNFEQPVIRRSKRQHHPPGFLKDYHCNLLQ----HSPISKDKN-----TPYSLDKYLSYSRLSATHREFILNVSIDSEPSYFHQA

Query:  VKFDVWRHAMDEEIAAMEQTNTWTLVPLPKEKI------------------------------------------------VTVKLFLSLTVSYGWSLVQ
         K   W  AM  E+AA+E  +TW+L PLP  K+                                                VTV+  L++    GW+L Q
Subjt:  VKFDVWRHAMDEEIAAMEQTNTWTLVPLPKEKI------------------------------------------------VTVKLFLSLTVSYGWSLVQ

Query:  MDINNAFLNGDLFEEVYMSIPLGYYRDRKFPTDIPLVCKLNKSIYGLKQASRQWFTKFSGVLLSNGFTQSKADYSLFTRGDDLSFVALLVYVDDILITGP
        +D+NNAFL+G+L EEVYM++P G +  +  P++I  VCKL KS+YGLKQASRQWF+KFS  LL++GF QSKADYSLFTR D  SF+ALLVYVDDILI   
Subjt:  MDINNAFLNGDLFEEVYMSIPLGYYRDRKFPTDIPLVCKLNKSIYGLKQASRQWFTKFSGVLLSNGFTQSKADYSLFTRGDDLSFVALLVYVDDILITGP

Query:  STSEIAYVRALLQENFLLKDLGNAKYFLGLELSRSSKGM
            +  ++  L   F LKDLG  +YFLGLE++RSS+G+
Subjt:  STSEIAYVRALLQENFLLKDLGNAKYFLGLELSRSSKGM

A0A2N9HYD2 Integrase catalytic domain-containing protein2.6e-22438.06Show/hide
Query:  TSPTMDKEASNGSSTTIDASSGSIEAQLNPFSLHHSFSSTAVLVTQPLLGAINYSSWSRAMMMALSGKNKSGFIKGTIKKPETQG-SLLTAWECNNNVIA
        + PTM   A++ S+     SS   +   N F LHH  S   +LV+QPL G  NY +WSR+M+MAL+ KNK GFI GTI  P+ Q       W   N ++ 
Subjt:  TSPTMDKEASNGSSTTIDASSGSIEAQLNPFSLHHSFSSTAVLVTQPLLGAINYSSWSRAMMMALSGKNKSGFIKGTIKKPETQG-SLLTAWECNNNVIA

Query:  SWILNSVSKDIAASIVYTGSAKE-----------------------LITLSQGNMSIEAYYTKLKTIWQDLEDFRPIPECKCEGLKPFLDHLHSEYVMIF
        SW+LNSVSK+IA+S++Y  +A+E                       + +L+Q   ++ AY+TKLK++W +L ++R  P C C  LK  +D+   E VM F
Subjt:  SWILNSVSKDIAASIVYTGSAKE-----------------------LITLSQGNMSIEAYYTKLKTIWQDLEDFRPIPECKCEGLKPFLDHLHSEYVMIF

Query:  LMGLNDSYAAVRAQILLMKPIPSITDVFSLLIQEERQRNAGNSAAIEPIALLASDNSKKNPNSSSTDRTRKKENQRPICTNCGVKGHTIDRCYKLHGYPP
        LMGLNDS+A VRAQIL+M+P+P+I   FSL++QEERQR+ G +A       +A     + P ++   R+   + +RP+C++CG+ GH +D+CYKLHG+PP
Subjt:  LMGLNDSYAAVRAQILLMKPIPSITDVFSLLIQEERQRNAGNSAAIEPIALLASDNSKKNPNSSSTDRTRKKENQRPICTNCGVKGHTIDRCYKLHGYPP

Query:  GYKTRN----------------------------------------SSSPVPVEGSN----STKAPGVNQVSQSDFFSSLNSNQYAQLLNMLSSHLPTA-
        G+K RN                                        +SS VP+  +     S         S S+  SS N +Q A + +   S +  A 
Subjt:  GYKTRN----------------------------------------SSSPVPVEGSN----STKAPGVNQVSQSDFFSSLNSNQYAQLLNMLSSHLPTA-

Query:  ----------------------KNE----------------TATAASAITHAAVTLPTAYRVQVEFIGDVRISDQIILRDVLFIPQFQYNLISVSCLLQT
                              +NE                + T+ ++  H+ + LP   +V    IG V+IS  + L +VL +P F +NLIS++ L  +
Subjt:  ----------------------KNE----------------TATAASAITHAAVTLPTAYRVQVEFIGDVRISDQIILRDVLFIPQFQYNLISVSCLLQT

Query:  TDVSLTFSTENCLIQDISRLTTIGRADCCYGLYLLNAKVTQ--SLNTTVAATVKAK----VISYETWHQRLGHLSPKRLSLLKTSLS--LPDSFNSPCYI
            + FS+  C IQD+     IG A   +GLY+L     Q  SL     A +  K    V  +  WH RLGH S  RL+LL   +S     S +  C +
Subjt:  TDVSLTFSTENCLIQDISRLTTIGRADCCYGLYLLNAKVTQ--SLNTTVAATVKAK----VISYETWHQRLGHLSPKRLSLLKTSLS--LPDSFNSPCYI

Query:  CPLAKQRRLSFPSNNHVASQVFDLIHCDVWGPFKEPTYNGFRYFLTLVDDCSRFTWVYLMKQKSDAVSIIPRFFQLVETQFSKVIKAFRSDNAPELKFVD
        C L+K RRL FP++ H++   FDLIHCD+WGPF  PT N  +YFLT+VDDC+R TW++LMK KS+  +++  FF LV+TQFS  IK  RSDN  E    +
Subjt:  CPLAKQRRLSFPSNNHVASQVFDLIHCDVWGPFKEPTYNGFRYFLTLVDDCSRFTWVYLMKQKSDAVSIIPRFFQLVETQFSKVIKAFRSDNAPELKFVD

Query:  FFATKGTIHQFSCVETPQQNSVVERKHQHLLN----------------GDFILTATYLINRTPTPLLDNKSPFTVLHGSEVDYSQLRVFGCLCYASTLTT
        F+A  GTIHQ SC+ TPQQNS VERKHQHLL                 G  +LTA YLINR P+PLL+NKSP+ +L  +   YS LRVFG LCYA+TL+ 
Subjt:  FFATKGTIHQFSCVETPQQNSVVERKHQHLLN----------------GDFILTATYLINRTPTPLLDNKSPFTVLHGSEVDYSQLRVFGCLCYASTLTT

Query:  HRSKFDPRAVSCVFLGYPPGVKGYKLFDIAKGQVIVSRDVVFFEDSFPFSTRK--------------DDCKSTNMFPEFVLPCPIIDAITP--DMNVFVE
        +R KF PR+  C+ LGYP G KGY+L D+   QV VSRDV+F E  FPF   K                   +  FP  V+P    D+  P  D     E
Subjt:  HRSKFDPRAVSCVFLGYPPGVKGYKLFDIAKGQVIVSRDVVFFEDSFPFSTRK--------------DDCKSTNMFPEFVLPCPIIDAITP--DMNVFVE

Query:  PPSTGDRLNNLS---DDPSAMV---SNQPAHIGDFPTCSDSNDEHCDTNVVQSPDDNILITVQPDINAEQSRQTPVVTHSSDSLNFEQPVIRRSKRQHHP
        PP T     + +    D SA     ++   H   FP+   S + H  T    SP   IL         +QS   PV            P +R+S R H  
Subjt:  PPSTGDRLNNLS---DDPSAMV---SNQPAHIGDFPTCSDSNDEHCDTNVVQSPDDNILITVQPDINAEQSRQTPVVTHSSDSLNFEQPVIRRSKRQHHP

Query:  PGFLKDYHCNLLQHSP------ISKDKNTPYSLDKYLSYSRLSATHREFILNVSIDSEPSYFHQAVKFDVWRHAMDEEIAAMEQTNTWTLVPLPKEKI--
        P +L++YHC+     P      +    +T + L + LSYS LS  ++ F+LN S   EPS +++A K   W  AM  EI A+E   TW+L  LP  KI  
Subjt:  PGFLKDYHCNLLQHSP------ISKDKNTPYSLDKYLSYSRLSATHREFILNVSIDSEPSYFHQAVKFDVWRHAMDEEIAAMEQTNTWTLVPLPKEKI--

Query:  ----------------------------------------------VTVKLFLSLTVSYGWSLVQMDINNAFLNGDLFEEVYMSIPLGYYR--DRKFPTD
                                                      VTV+  L++    GW L Q+D+NNAFL+G L EEVYMS+P G++   +  +P+D
Subjt:  ----------------------------------------------VTVKLFLSLTVSYGWSLVQMDINNAFLNGDLFEEVYMSIPLGYYR--DRKFPTD

Query:  IPLVCKLNKSIYGLKQASRQWFTKFSGVLLSNGFTQSKADYSLFTRGDDLSFVALLVYVDDILITGPSTSEIAYVRALLQENFLLKDLGNAKYFLGLELS
           VCKL KSIYGLKQASRQWF+KFS  LL++GFTQSKADYSLFT+     F+ALLVYVDDILI   + + +  ++  L   F LKDLG  +YFLGLE++
Subjt:  IPLVCKLNKSIYGLKQASRQWFTKFSGVLLSNGFTQSKADYSLFTRGDDLSFVALLVYVDDILITGPSTSEIAYVRALLQENFLLKDLGNAKYFLGLELS

Query:  RSSKGM
        RSSKG+
Subjt:  RSSKGM

A0A2Z7AT15 Cysteine-rich RLK (Receptor-like protein kinase) 84.3e-21937.53Show/hide
Query:  SIEAQLNPFSLHHSFSSTAVLVTQPLLGAINYSSWSRAMMMALSGKNKSGFIKGTIKKPETQGSLLTAWECNNNVIASWILNSVSKDIAASIVYTGSAKE
        ++E   +P+ LH+       LV+ PL+G+ NY++W RAM++AL+ KNK GFI  +I +P ++  L  +W   N+++ SWILNSV+++IA S++Y  +A+E
Subjt:  SIEAQLNPFSLHHSFSSTAVLVTQPLLGAINYSSWSRAMMMALSGKNKSGFIKGTIKKPETQGSLLTAWECNNNVIASWILNSVSKDIAASIVYTGSAKE

Query:  LIT-----------------------LSQGNMSIEAYYTKLKTIWQDLEDFRPIPECKCEGLKPFLDHLHSEYVMIFLMGLNDSYAAVRAQILLMKPIPS
        + T                       L QG+M + +YYTKL+T+W +L D++P   C C  ++ + ++ + E VM FLMGLNDSYA VRAQ+L+++P+P+
Subjt:  LIT-----------------------LSQGNMSIEAYYTKLKTIWQDLEDFRPIPECKCEGLKPFLDHLHSEYVMIFLMGLNDSYAAVRAQILLMKPIPS

Query:  ITDVFSLLIQEERQRNAG---NSAAIEPIALLASDNSKKNPNSS--STDRTRKKENQRPICTNCGVKGHTIDRCYKLHGYPPGY-KTRNSSSPVPVEGSN
        I  VF+L+IQEERQR+     + A ++   +L++ NS  N  +S  ++  ++     R IC++C  + HT+D+CYKLHGYPPG+ K ++  S        
Subjt:  ITDVFSLLIQEERQRNAG---NSAAIEPIALLASDNSKKNPNSS--STDRTRKKENQRPICTNCGVKGHTIDRCYKLHGYPPGY-KTRNSSSPVPVEGSN

Query:  STKAPGVNQVSQS-DFFSSLNSNQYAQLLNMLSSHLPTAKN-----ETATAASAIT----------------------------------------HAAV
        ++ +   +Q +Q  D   SL  +Q  QL+  LSS L T +N     +  T  S +T                                         + V
Subjt:  STKAPGVNQVSQS-DFFSSLNSNQYAQLLNMLSSHLPTAKN-----ETATAASAIT----------------------------------------HAAV

Query:  TLPTAYRVQVEFIGDVRISDQIILRDVLFIPQFQYNLISVSCLLQTTDVSLTFSTENCLIQDISRLTTIGRADCCYGLYLLNAKVTQSLNTTVAATVKAK
         LP    + V   G V ++  ++L++VL++P FQ+NL+SVS L    + S++F +++C IQDIS++  IG       LY+L  +  + L + +  T    
Subjt:  TLPTAYRVQVEFIGDVRISDQIILRDVLFIPQFQYNLISVSCLLQTTDVSLTFSTENCLIQDISRLTTIGRADCCYGLYLLNAKVTQSLNTTVAATVKAK

Query:  VISYETWHQRLGHLSPKRLSLLKTSLSLPDS-FNSPCYICPLAKQRRLSFPSNNHVASQVFDLIHCDVWGPFKEPTYNGFRYFLTLVDDCSRFTWVYLMK
        V + E WH+R+GH S  +LS LK  L++ ++   + C+ C L+KQRRL   S N++++++F+L+H D WGPF + + +GFR+F T+VDD SR+TWVY++K
Subjt:  VISYETWHQRLGHLSPKRLSLLKTSLSLPDS-FNSPCYICPLAKQRRLSFPSNNHVASQVFDLIHCDVWGPFKEPTYNGFRYFLTLVDDCSRFTWVYLMK

Query:  QKSDAVSIIPRFFQLVETQFSKVIKAFRSDNAPELKFVDFFATKGTIHQFSCVETPQQNSVVERKHQHLLN----------------GDFILTATYLINR
         KSD +SI P F ++V TQF   +K+ RSDNAPEL F DFFA  G  H  SCVE PQQNSVVERKHQH+LN                 D I T+ YLINR
Subjt:  QKSDAVSIIPRFFQLVETQFSKVIKAFRSDNAPELKFVDFFATKGTIHQFSCVETPQQNSVVERKHQHLLN----------------GDFILTATYLINR

Query:  TPTPLLDNKSPFTVLHGSEVDYSQLRVFGCLCYASTLTTHRSKFDPRAVSCVFLGYPPGVKGYKLFDIAKGQVIVSRDVVFFEDSFPFSTRKDDCKSTNM
        TP+P+L +K+PF +LHG    YS L+VFGCLCYASTL + R KF PRA+ CVF+GYPPG KGYKL ++   ++ +SRDV+F E++FP+            
Subjt:  TPTPLLDNKSPFTVLHGSEVDYSQLRVFGCLCYASTLTTHRSKFDPRAVSCVFLGYPPGVKGYKLFDIAKGQVIVSRDVVFFEDSFPFSTRKDDCKSTNM

Query:  FPEFVLPCPIIDAITPDMNVFVEPPSTGDRLNNLSDDPSAMVSNQPAHIGDFPTCSDSNDEHCDTNVVQSPDDNILITVQPDINAEQSRQTPVVTHSSDS
                               P S  D    +S  PS+ ++                                     P I A+  +      HS   
Subjt:  FPEFVLPCPIIDAITPDMNVFVEPPSTGDRLNNLSDDPSAMVSNQPAHIGDFPTCSDSNDEHCDTNVVQSPDDNILITVQPDINAEQSRQTPVVTHSSDS

Query:  LNFEQPVIRRSKRQHHPPGFLKDYHCNLLQHSPISKDKNTPYSLDKYLSYSRLSATHREFILNVSIDSEPSYFHQAVKFDVWRHAMDEEIAAMEQTNTWT
                 R+ R H+ P  L+DYHC  +  +P S   +T + +   ++YS+LS++HR F+ N+S   EP+ F QAV    WR AMDEE+ A+E  +TW+
Subjt:  LNFEQPVIRRSKRQHHPPGFLKDYHCNLLQHSPISKDKNTPYSLDKYLSYSRLSATHREFILNVSIDSEPSYFHQAVKFDVWRHAMDEEIAAMEQTNTWT

Query:  LVPLPK------------------------------------------------EKIVTVKLFLSLTVSYGWSLVQMDINNAFLNGDLFEEVYMSIPLGY
        +V LP+                                                 K+VTV+  L+L    GW L+Q+D+NNAFL+GDL EEVYM++P G+
Subjt:  LVPLPK------------------------------------------------EKIVTVKLFLSLTVSYGWSLVQMDINNAFLNGDLFEEVYMSIPLGY

Query:  YRDRKFPTDIPLVCKLNKSIYGLKQASRQWFTKFSGVLLSNGFTQSKADYSLFTRGDDLSFVALLVYVDDILITGPSTSEIAYVRALLQENFLLKDLGNA
          + + P+    VCKL+KSIYGLKQASRQWF KFS  LLS GF QS AD SLF R D   F+AL+VYVDDI+I     +  + ++  L   F LKDLGN 
Subjt:  YRDRKFPTDIPLVCKLNKSIYGLKQASRQWFTKFSGVLLSNGFTQSKADYSLFTRGDDLSFVALLVYVDDILITGPSTSEIAYVRALLQENFLLKDLGNA

Query:  KYFLGLELSRSSKGM
        KYFLG+E++RS++G+
Subjt:  KYFLGLELSRSSKGM

SwissProt top hitse value%identityAlignment
P04146 Copia protein5.7e-5124.16Show/hide
Query:  LLIQEERQRNAGNSAAIEPIALLASDNSKKNPNSSSTDRTRK-------KENQRPICTNCGVKGHTIDRCYKLHGYPPGYKTRNSSSPVPVEGSNSTKAP
        LL QE + +N  N  + + +  +  +N+    N+   +R  K           +  C +CG +GH    C+    Y      +N  +   V+ + S    
Subjt:  LLIQEERQRNAGNSAAIEPIALLASDNSKKNPNSSSTDRTRK-------KENQRPICTNCGVKGHTIDRCYKLHGYPPGYKTRNSSSPVPVEGSNSTKAP

Query:  GVNQVSQSDFFSSLNSNQYAQLLNMLSSHLPTAKNETATAASAITHAAVTLPTAYRVQVEFI-----GDVRISD--QIILRDVLFIPQFQYNLISVSCLL
        G+  + +    +S+  N    L +  S HL   ++    +   +      L  A   Q EFI     G VR+ +  +I L DVLF  +   NL+SV   L
Subjt:  GVNQVSQSDFFSSLNSNQYAQLLNMLSSHLPTAKNETATAASAITHAAVTLPTAYRVQVEFI-----GDVRISD--QIILRDVLFIPQFQYNLISVSCLL

Query:  QTTDVSLTFSTENCLIQDISRLTTIGRADCCYGLYLLNAKVTQSLNTTVAATVKAK-VISYETWHQRLGHLSPKRLSLLK-----TSLSLPDSFNSPCYI
        Q   +S+ F      I   + L  +  +     + ++N +         A ++ AK   ++  WH+R GH+S  +L  +K     +  SL ++    C I
Subjt:  QTTDVSLTFSTENCLIQDISRLTTIGRADCCYGLYLLNAKVTQSLNTTVAATVKAK-VISYETWHQRLGHLSPKRLSLLK-----TSLSLPDSFNSPCYI

Query:  CP---LAKQRRLSF---PSNNHVASQVFDLIHCDVWGPFKEPTYNGFRYFLTLVDDCSRFTWVYLMKQKSDAVSIIPRFFQLVETQFSKVIKAFRSDNAP
        C      KQ RL F       H+   +F ++H DV GP    T +   YF+  VD  + +   YL+K KSD  S+   F    E  F+  +     DN  
Subjt:  CP---LAKQRRLSF---PSNNHVASQVFDLIHCDVWGPFKEPTYNGFRYFLTLVDDCSRFTWVYLMKQKSDAVSIIPRFFQLVETQFSKVIKAFRSDNAP

Query:  EL---KFVDFFATKGTIHQFSCVETPQQNSVVERKHQHLLN----------------GDFILTATYLINRTPTPLL--DNKSPFTVLHGSEVDYSQLRVF
        E    +   F   KG  +  +   TPQ N V ER  + +                  G+ +LTATYLINR P+  L   +K+P+ + H  +     LRVF
Subjt:  EL---KFVDFFATKGTIHQFSCVETPQQNSVVERKHQHLLN----------------GDFILTATYLINRTPTPLL--DNKSPFTVLHGSEVDYSQLRVF

Query:  GCLCYASTLTTHRSKFDPRAVSCVFLGYPPGVKGYKLFDIAKGQVIVSRDVVF------------FEDSFPFSTRKDDCKSTNMFPEFVLPCPIIDAITP
        G   Y   +   + KFD ++   +F+GY P   G+KL+D    + IV+RDVV             FE  F   +++ + K+       ++     +    
Subjt:  GCLCYASTLTTHRSKFDPRAVSCVFLGYPPGVKGYKLFDIAKGQVIVSRDVVF------------FEDSFPFSTRKDDCKSTNMFPEFVLPCPIIDAITP

Query:  DMNVFVEPPSTGDRLNNLSDDPSAMV-------SNQPAHIGDFPTCSDSNDEHCDTNVVQSPDDNILITVQPDINAEQSRQTPVVTHSSDSLNFEQP---
          N+     S      N  +D   ++       S +  +I       +SN    + +  +  DD+ L   +   N  +SR++    H  + +  + P   
Subjt:  DMNVFVEPPSTGDRLNNLSDDPSAMV-------SNQPAHIGDFPTCSDSNDEHCDTNVVQSPDDNILITVQPDINAEQSRQTPVVTHSSDSLNFEQP---

Query:  -----VIRRSKRQHHPPGFLKDYHCNLLQHSPISKDKNTPYSLDKYLSYSRLSATHREFILNVS--IDSEPSYFHQAVKFD---VWRHAMDEEIAAMEQT
             + RRS+R    P                             +SY+    +  + +LN     +  P+ F +    D    W  A++ E+ A +  
Subjt:  -----VIRRSKRQHHPPGFLKDYHCNLLQHSPISKDKNTPYSLDKYLSYSRLSATHREFILNVS--IDSEPSYFHQAVKFD---VWRHAMDEEIAAMEQT

Query:  NTWTLVPLPKEK------------------------------------------------IVTVKLFLSLTVSYGWSLVQMDINNAFLNGDLFEEVYMSI
        NTWT+   P+ K                                                I + +  LSL + Y   + QMD+  AFLNG L EE+YM +
Subjt:  NTWTLVPLPKEK------------------------------------------------IVTVKLFLSLTVSYGWSLVQMDINNAFLNGDLFEEVYMSI

Query:  PLGYYRDRKFPTDIPLVCKLNKSIYGLKQASRQWFTKFSGVLLSNGFTQSKADYSLF--TRGDDLSFVALLVYVDDILITGPSTSEIAYVRALLQENFLL
        P G         +   VCKLNK+IYGLKQA+R WF  F   L    F  S  D  ++   +G+    + +L+YVDD++I     + +   +  L E F +
Subjt:  PLGYYRDRKFPTDIPLVCKLNKSIYGLKQASRQWFTKFSGVLLSNGFTQSKADYSLF--TRGDDLSFVALLVYVDDILITGPSTSEIAYVRALLQENFLL

Query:  KDLGNAKYFLGLEL
         DL   K+F+G+ +
Subjt:  KDLGNAKYFLGLEL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-943.2e-7826Show/hide
Query:  GAINYSSWSRAMMMALSGKNKSGFIKGTIKKPETQGSLLTAWECNNNVIASWILNSVSKDIAASIVYTGSAKELITLSQGNMSIEAYYTKLKTIWQDLED
        G   +S+W R M   L  +     +    KKP+T  +    W   +   AS I   +S D+  +I+   +A+ + T       +E+ Y   KT+   L  
Subjt:  GAINYSSWSRAMMMALSGKNKSGFIKGTIKKPETQGSLLTAWECNNNVIASWILNSVSKDIAASIVYTGSAKELITLSQGNMSIEAYYTKLKTIWQDLED

Query:  FRPIPECKCEGLKPFLDHLH------------------SEYVMIFLMGLNDSYAAVRAQILLMKPIPSITDVFSLLIQEERQR----NAGNSAAIEPIAL
         + +          FL HL+                   +  ++ L  L  SY  +   IL  K    + DV S L+  E+ R    N G +   E    
Subjt:  FRPIPECKCEGLKPFLDHLH------------------SEYVMIFLMGLNDSYAAVRAQILLMKPIPSITDVFSLLIQEERQR----NAGNSAAIEPIAL

Query:  LASDNSKKNPNSSSTDRTRKKENQR-PICTNCGVKGHTIDRC---YKLHGYPPGYKTRNSSSPVPVEGSNSTKAPGVNQVSQSDFFSSLNSNQYAQLLNM
            +S     S +  +++ +   R   C NC   GH    C    K  G   G K  ++++ +     N      V  +++ +    L+  +   +++ 
Subjt:  LASDNSKKNPNSSSTDRTRKKENQR-PICTNCGVKGHTIDRC---YKLHGYPPGYKTRNSSSPVPVEGSNSTKAPGVNQVSQSDFFSSLNSNQYAQLLNM

Query:  LSSHLPTAKNETATAASAITHAAVTLPTAYRVQVEFIGDVRISDQI----ILRDVLFIPQFQYNLISVSCLLQTTDVSLTFSTENCLIQDISRLTTIGRA
         +SH  T   +      A     V +      ++  IGD+ I   +    +L+DV  +P  + NLIS    L        F+ +   +   S +   G A
Subjt:  LSSHLPTAKNETATAASAITHAAVTLPTAYRVQVEFIGDVRISDQI----ILRDVLFIPQFQYNLISVSCLLQTTDVSLTFSTENCLIQDISRLTTIGRA

Query:  DCCYGLYLLNAKVTQSLNTTVAATVKAKVISYETWHQRLGHLSPKRLSLL--KTSLSLPDSFN-SPCYICPLAKQRRLSFPSNNHVASQVFDLIHCDVWG
             LY  NA++ Q              IS + WH+R+GH+S K L +L  K+ +S        PC  C   KQ R+SF +++     + DL++ DV G
Subjt:  DCCYGLYLLNAKVTQSLNTTVAATVKAKVISYETWHQRLGHLSPKRLSLL--KTSLSLPDSFN-SPCYICPLAKQRRLSFPSNNHVASQVFDLIHCDVWG

Query:  PFKEPTYNGFRYFLTLVDDCSRFTWVYLMKQKSDAVSIIPRFFQLVETQFSKVIKAFRSDNAPEL---KFVDFFATKGTIHQFSCVETPQQNSVVERKHQ
        P +  +  G +YF+T +DD SR  WVY++K K     +  +F  LVE +  + +K  RSDN  E    +F ++ ++ G  H+ +   TPQ N V ER ++
Subjt:  PFKEPTYNGFRYFLTLVDDCSRFTWVYLMKQKSDAVSIIPRFFQLVETQFSKVIKAFRSDNAPEL---KFVDFFATKGTIHQFSCVETPQQNSVVERKHQ

Query:  HLLN----------------GDFILTATYLINRTPTPLLDNKSPFTVLHGSEVDYSQLRVFGCLCYASTLTTHRSKFDPRAVSCVFLGYPPGVKGYKLFD
         ++                 G+ + TA YLINR+P+  L  + P  V    EV YS L+VFGC  +A      R+K D +++ C+F+GY     GY+L+D
Subjt:  HLLN----------------GDFILTATYLINRTPTPLLDNKSPFTVLHGSEVDYSQLRVFGCLCYASTLTTHRSKFDPRAVSCVFLGYPPGVKGYKLFD

Query:  IAKGQVIVSRDVVFFEDSFPFSTRKDDCKSTNMFPEFVLPCPIIDAITPDMNVFVEPPSTGDRLNNLSDDPSAMVSNQPAHIGDFPTCSDSNDEHCDTNV
          K +VI SRDVVF E     +    +     + P FV        I    N      ST D ++   + P  ++       G+         EH     
Subjt:  IAKGQVIVSRDVVFFEDSFPFSTRKDDCKSTNMFPEFVLPCPIIDAITPDMNVFVEPPSTGDRLNNLSDDPSAMVSNQPAHIGDFPTCSDSNDEHCDTNV

Query:  VQSPDDNILITVQPDINAEQSRQTPVVTHSSDSLNFEQPVIRRSKRQHHPPGFLKDYHCNLLQHSPISKDKNTPYSLDKYLSYSRLSATHREFILNVSID
         +     +  + +P + + +   T  V  S D     +P   +    H      K+     +Q    S  KN  Y L +     R      +++  +  D
Subjt:  VQSPDDNILITVQPDINAEQSRQTPVVTHSSDSLNFEQPVIRRSKRQHHPPGFLKDYHCNLLQHSPISKDKNTPYSLDKYLSYSRLSATHREFILNVSID

Query:  SEPSYFHQAVKFDVWRHAMDEEIAAMEQTNTWTLVPLPKEKIVTVKLFLSLTVSYGWSLVQMDINNAFLNGDLFEEVYMSIPLGYYRDRKFPTDIPLVCK
         +        +  V      + I   E       +  P  K+ +++  LSL  S    + Q+D+  AFL+GDL EE+YM  P G+    +      +VCK
Subjt:  SEPSYFHQAVKFDVWRHAMDEEIAAMEQTNTWTLVPLPKEKIVTVKLFLSLTVSYGWSLVQMDINNAFLNGDLFEEVYMSIPLGYYRDRKFPTDIPLVCK

Query:  LNKSIYGLKQASRQWFTKFSGVLLSNGFTQSKADYSL-FTRGDDLSFVALLVYVDDILITGPSTSEIAYVRALLQENFLLKDLGNAKYFLGLELSR
        LNKS+YGLKQA RQW+ KF   + S  + ++ +D  + F R  + +F+ LL+YVDD+LI G     IA ++  L ++F +KDLG A+  LG+++ R
Subjt:  LNKSIYGLKQASRQWFTKFSGVLLSNGFTQSKADYSL-FTRGDDLSFVALLVYVDDILITGPSTSEIAYVRALLQENFLLKDLGNAKYFLGLELSR

P25600 Putative transposon Ty5-1 protein YCL074W4.3e-1433.33Show/hide
Query:  MDINNAFLNGDLFEEVYMSIPLGYYRDRKFPTDIPLVCKLNKSIYGLKQASRQWFTKFSGVLLSNGFTQSKADYSLFTRGDDLSFVALLVYVDDILITGP
        MD++ AFLN  + E +Y+  P G+  +R    +   V +L   +YGLKQA   W    +  L   GF + + ++ L+ R      + + VYVDD+L+  P
Subjt:  MDINNAFLNGDLFEEVYMSIPLGYYRDRKFPTDIPLVCKLNKSIYGLKQASRQWFTKFSGVLLSNGFTQSKADYSLFTRGDDLSFVALLVYVDDILITGP

Query:  STSEIAYVRALLQENFLLKDLGNAKYFLGLELSRSSKG
        S      V+  L + + +KDLG    FLGL + +SS G
Subjt:  STSEIAYVRALLQENFLLKDLGNAKYFLGLELSRSSKG

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE13.6e-9026.75Show/hide
Query:  LGAINYSSWSRAMMMALSGKNKSGFIKG-TIKKPETQGS--------LLTAWECNNNVIASWILNSVSKDIAASIVYTGSAKELITLSQGNMSIEAY--Y
        L + NY  WSR +     G   +GF+ G T   P T G+          T W+  + +I S +L ++S  +  ++    +A ++    +   +  +Y   
Subjt:  LGAINYSSWSRAMMMALSGKNKSGFIKG-TIKKPETQGS--------LLTAWECNNNVIASWILNSVSKDIAASIVYTGSAKELITLSQGNMSIEAY--Y

Query:  TKLKTIWQDLEDFRPIPECKCEGLKPFLDHL--------HSEYVMIFLMGLNDSYAAVRAQILLMKPIPSITDVFSLLIQEERQRNAGNSAAIEPIALLA
        T+L+T  +         +   +GL    D L        H E V   L  L + Y  V  QI      P++T++   L+  E +  A +SA + PI   A
Subjt:  TKLKTIWQDLEDFRPIPECKCEGLKPFLDHL--------HSEYVMIFLMGLNDSYAAVRAQILLMKPIPSITDVFSLLIQEERQRNAGNSAAIEPIALLA

Query:  --------------------SDNSKKNPNS-----SSTDRTRKKENQRPI---CTNCGVKGHTIDRCYKLHGYPPGYKTRNSSSPVPVEGSNSTKAPGVN
                             DN   N NS     SST+        +P    C  CGV+GH+  RC +L  +     ++   SP       +  A G  
Subjt:  --------------------SDNSKKNPNS-----SSTDRTRKKENQRPI---CTNCGVKGHTIDRCYKLHGYPPGYKTRNSSSPVPVEGSNSTKAPGVN

Query:  QVSQSDFFSSLNSNQYAQLLNMLSSHLPTAKNETATAASAITHAAVTLPTAYRVQVEFIGDVRISDQ---IILRDVLFIPQFQYNLISVSCLLQTTDVSL
          S +    S  ++      N LS H P    +    A   T           + +   G   +S +   + L ++L++P    NLISV  L     VS+
Subjt:  QVSQSDFFSSLNSNQYAQLLNMLSSHLPTAKNETATAASAITHAAVTLPTAYRVQVEFIGDVRISDQ---IILRDVLFIPQFQYNLISVSCLLQTTDVSL

Query:  TFSTENCLIQDISRLTTIGRADCCYGLYLLNAKVTQSLNTTVAATVKAKVISYETWHQRLGHLSPKRLSLLKTSLSL----PDSFNSPCYICPLAKQRRL
         F   +  ++D++    + +      LY      +Q ++   + + KA   ++ +WH RLGH +P  L+ + ++ SL    P      C  C + K  ++
Subjt:  TFSTENCLIQDISRLTTIGRADCCYGLYLLNAKVTQSLNTTVAATVKAKVISYETWHQRLGHLSPKRLSLLKTSLSL----PDSFNSPCYICPLAKQRRL

Query:  SFPSNNHVASQVFDLIHCDVWGPFKEPTYNGFRYFLTLVDDCSRFTWVYLMKQKSDAVSIIPRFFQLVETQFSKVIKAFRSDNAPE-LKFVDFFATKGTI
         F  +   +++  + I+ DVW      +++ +RY++  VD  +R+TW+Y +KQKS        F  L+E +F   I  F SDN  E +   ++F+  G  
Subjt:  SFPSNNHVASQVFDLIHCDVWGPFKEPTYNGFRYFLTLVDDCSRFTWVYLMKQKSDAVSIIPRFFQLVETQFSKVIKAFRSDNAPE-LKFVDFFATKGTI

Query:  HQFSCVETPQQNSVVERKHQHLLNGDFIL----------------TATYLINRTPTPLLDNKSPFTVLHGSEVDYSQLRVFGCLCYASTLTTHRSKFDPR
        H  S   TP+ N + ERKH+H++     L                 A YLINR PTPLL  +SPF  L G+  +Y +LRVFGC CY      ++ K D +
Subjt:  HQFSCVETPQQNSVVERKHQHLLNGDFIL----------------TATYLINRTPTPLLDNKSPFTVLHGSEVDYSQLRVFGCLCYASTLTTHRSKFDPR

Query:  AVSCVFLGYPPGVKGYKLFDIAKGQVIVSRDVVFFEDSFPFST------------RKDDCKSTNMFPEFVLP--CPIIDAIT-PDMNVFVEPPSTGD---
        +  CVFLGY      Y    +   ++ +SR V F E+ FPFS             R+  C  +   P   LP   P++ A +  D +    PPS+     
Subjt:  AVSCVFLGYPPGVKGYKLFDIAKGQVIVSRDVVFFEDSFPFST------------RKDDCKSTNMFPEFVLP--CPIIDAIT-PDMNVFVEPPSTGD---

Query:  -----RLNNL-----SDDPSAMVSNQPAHIGDFPTCSDSNDE---HCDTNVVQSPDDN-----ILITVQPDINAEQSRQTPVVTHSSDSLNFEQPVIRRS
               +NL     S  PS+     P   G  PT   +  +   H   N  Q+   N     +  ++     +  S  +P  + SS S +   P I   
Subjt:  -----RLNNL-----SDDPSAMVSNQPAHIGDFPTCSDSNDE---HCDTNVVQSPDDN-----ILITVQPDINAEQSRQTPVVTHSSDSLNFEQPVIRRS

Query:  KRQHHPPGFLKDYHCN----LLQHSPISKDK------NTPYSLDKYLSYSRLSATHREFILNVSIDSEPSYFHQAVKFDVWRHAMDEEIAAMEQTNTWTL
           H PP   +  + N    L  HS  ++ K      N  YSL                 ++++ +SEP    QA+K + WR+AM  EI A    +TW L
Subjt:  KRQHHPPGFLKDYHCN----LLQHSPISKDK------NTPYSLDKYLSYSRLSATHREFILNVSIDSEPSYFHQAVKFDVWRHAMDEEIAAMEQTNTWTL

Query:  VPLPKE-------------------------------------------------KIVTVKLFLSLTVSYGWSLVQMDINNAFLNGDLFEEVYMSIPLGY
        VP P                                                   K  ++++ L + V   W + Q+D+NNAFL G L ++VYMS P G+
Subjt:  VPLPKE-------------------------------------------------KIVTVKLFLSLTVSYGWSLVQMDINNAFLNGDLFEEVYMSIPLGY

Query:  YRDRKFPTDIPLVCKLNKSIYGLKQASRQWFTKFSGVLLSNGFTQSKADYSLFTRGDDLSFVALLVYVDDILITGPSTSEIAYVRALLQENFLLKDLGNA
          D+  P     VCKL K++YGLKQA R W+ +    LL+ GF  S +D SLF      S V +LVYVDDILITG   + +      L + F +KD    
Subjt:  YRDRKFPTDIPLVCKLNKSIYGLKQASRQWFTKFSGVLLSNGFTQSKADYSLFTRGDDLSFVALLVYVDDILITGPSTSEIAYVRALLQENFLLKDLGNA

Query:  KYFLGLELSRSSKGM
         YFLG+E  R   G+
Subjt:  KYFLGLELSRSSKGM

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE23.9e-9227.35Show/hide
Query:  LGAINYSSWSRAMMMALSGKNKSGFIKG-TIKKPETQGS--------LLTAWECNNNVIASWILNSVSKDIAASIVYTGSAKELITLSQGNMSIEAY--Y
        L + NY  WSR +     G   +GF+ G T   P T G+          T W   + +I S IL ++S  +  ++    +A ++    +   +  +Y   
Subjt:  LGAINYSSWSRAMMMALSGKNKSGFIKG-TIKKPETQGS--------LLTAWECNNNVIASWILNSVSKDIAASIVYTGSAKELITLSQGNMSIEAY--Y

Query:  TKLKTIWQDLEDFRPIPECKCEGLKPFLDHLHSEYVMIFLMGLNDSYAAVRAQILLMKPIPSITDVFSLLIQEERQRNAGNSAAIEPIALLA----SDNS
        T+L+ I +         +    G KP +D  H E V   L  L D Y  V  QI      PS+T++   LI  E +  A NSA + PI        + N+
Subjt:  TKLKTIWQDLEDFRPIPECKCEGLKPFLDHLHSEYVMIFLMGLNDSYAAVRAQILLMKPIPSITDVFSLLIQEERQRNAGNSAAIEPIALLA----SDNS

Query:  KKNPNSSSTDRTRKKENQRP----------------------ICTNCGVKGHTIDRCYKLHGYPPGYKTRNSSSPVPVEGSNSTKAPGVNQVSQSDFFSS
         +N N+   +R     N R                        C  C V+GH+  RC +LH +      + S+SP       +  A      + +    S
Subjt:  KKNPNSSSTDRTRKKENQRP----------------------ICTNCGVKGHTIDRCYKLHGYPPGYKTRNSSSPVPVEGSNSTKAPGVNQVSQSDFFSS

Query:  LNSNQYAQLLNMLSSHLPTAKNETATAASA----ITH-AAVTLPTAYRVQVEFIGDVRISDQIILRDVLFIPQFQYNLISVSCLLQTTDVSLTFSTENCL
          ++      N LS H P    +    A      ITH  + +LPT+ R              + L  VL++P    NLISV  L  T  VS+ F   +  
Subjt:  LNSNQYAQLLNMLSSHLPTAKNETATAASA----ITH-AAVTLPTAYRVQVEFIGDVRISDQIILRDVLFIPQFQYNLISVSCLLQTTDVSLTFSTENCL

Query:  IQDISRLTTIGRADCCYGLYLLNAKVTQSLNTTVAATVKAKVISYETWHQRLGHLSPKRLSLLKTSLSLPDSFNS----PCYICPLAKQRRLSFPSNNHV
        ++D++    + +      LY      +Q+++   +   KA   ++ +WH RLGH S   L+ + ++ SLP    S     C  C + K  ++ F ++   
Subjt:  IQDISRLTTIGRADCCYGLYLLNAKVTQSLNTTVAATVKAKVISYETWHQRLGHLSPKRLSLLKTSLSLPDSFNS----PCYICPLAKQRRLSFPSNNHV

Query:  ASQVFDLIHCDVWGPFKEPTYNGFRYFLTLVDDCSRFTWVYLMKQKSDAVSIIPRFFQLVETQFSKVIKAFRSDNAPELKFV-DFFATKGTIHQFSCVET
        +S+  + I+ DVW      + + +RY++  VD  +R+TW+Y +KQKS        F  LVE +F   I    SDN  E   + D+ +  G  H  S   T
Subjt:  ASQVFDLIHCDVWGPFKEPTYNGFRYFLTLVDDCSRFTWVYLMKQKSDAVSIIPRFFQLVETQFSKVIKAFRSDNAPELKFV-DFFATKGTIHQFSCVET

Query:  PQQNSVVERKHQHLLNGDFIL----------------TATYLINRTPTPLLDNKSPFTVLHGSEVDYSQLRVFGCLCYASTLTTHRSKFDPRAVSCVFLG
        P+ N + ERKH+H++     L                 A YLINR PTPLL  +SPF  L G   +Y +L+VFGC CY      +R K + ++  C F+G
Subjt:  PQQNSVVERKHQHLLNGDFIL----------------TATYLINRTPTPLLDNKSPFTVLHGSEVDYSQLRVFGCLCYASTLTTHRSKFDPRAVSCVFLG

Query:  YPPGVKGYKLFDIAKGQVIVSRDVVFFEDSFPFST-----------RKDDC---KSTNMFPEFVLPCPIIDAITPDMNVFVEPPSTGDRL-------NNL
        Y      Y    I  G++  SR V F E  FPFST           R D      S    P   L  P    + P ++    PPS+   L       +NL
Subjt:  YPPGVKGYKLFDIAKGQVIVSRDVVFFEDSFPFST-----------RKDDC---KSTNMFPEFVLPCPIIDAITPDMNVFVEPPSTGDRL-------NNL

Query:  S----DDPSAMVSNQPAHIGDFPTCSDSNDEHCDTN--VVQSPDDNILITVQPDINA---EQSRQTPVVTHSSDSLNF-EQPVIRRSKRQHHPPGFLKDY
               PS+     P+H G  PT      ++ ++N  ++ +P+ N      P+ N+   +    +P +   S S++    P    +     PP      
Subjt:  S----DDPSAMVSNQPAHIGDFPTCSDSNDEHCDTN--VVQSPDDNILITVQPDINA---EQSRQTPVVTHSSDSLNF-EQPVIRRSKRQHHPPGFLKDY

Query:  HCNLLQHSPISKDKNTPYSLDKYLSYSRLSATHREFILNVSIDSEPSYFHQAVKFDVWRHAMDEEIAAMEQTNTWTLVPLPKE-----------------
           +   +P+    NT     +     R       +  +++ +SEP    QA+K D WR AM  EI A    +TW LVP P                   
Subjt:  HCNLLQHSPISKDKNTPYSLDKYLSYSRLSATHREFILNVSIDSEPSYFHQAVKFDVWRHAMDEEIAAMEQTNTWTLVPLPKE-----------------

Query:  --------------------------------KIVTVKLFLSLTVSYGWSLVQMDINNAFLNGDLFEEVYMSIPLGYYRDRKFPTDIPLVCKLNKSIYGL
                                        K  ++++ L + V   W + Q+D+NNAFL G L +EVYMS P G+  D+  P     VC+L K+IYGL
Subjt:  --------------------------------KIVTVKLFLSLTVSYGWSLVQMDINNAFLNGDLFEEVYMSIPLGYYRDRKFPTDIPLVCKLNKSIYGL

Query:  KQASRQWFTKFSGVLLSNGFTQSKADYSLFTRGDDLSFVALLVYVDDILITGPSTSEIAYVRALLQENFLLKDLGNAKYFLGLELSRSSKGM
        KQA R W+ +    LL+ GF  S +D SLF      S + +LVYVDDILITG  T  + +    L + F +K+  +  YFLG+E  R  +G+
Subjt:  KQASRQWFTKFSGVLLSNGFTQSKADYSLFTRGDDLSFVALLVYVDDILITGPSTSEIAYVRALLQENFLLKDLGNAKYFLGLELSRSSKGM

Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).9.1e-2031.22Show/hide
Query:  NYSSWSRAMMMALSGKNKSGFIKGTIKKPETQGSLLTAWECNNNVIASWILNSVSKDIAASIVYTGSA-----------------------KELITLSQG
        NY +W       L    K GFI GT+ KP+    L   WE  N ++  W++NS++  +  S++Y  +A                       + L TL QG
Subjt:  NYSSWSRAMMMALSGKNKSGFIKGTIKKPETQGSLLTAWECNNNVIASWILNSVSKDIAASIVYTGSA-----------------------KELITLSQG

Query:  NMSIEAYYTKLKTIWQDLEDFRPIPECKCEG-----LKPFLDHLHSEYVMIFLMG--LNDSYAAVRAQILLMKPIPSITDVFSLLIQEE
          S+E Y+ KL  +W +L ++ PIPECKC G      K   +    E    FLMG  LN  + AV  +I+  KP PS+ + F+++   E
Subjt:  NMSIEAYYTKLKTIWQDLEDFRPIPECKCEG-----LKPFLDHLHSEYVMIFLMG--LNDSYAAVRAQILLMKPIPSITDVFSLLIQEE

AT1G55250.1 histone mono-ubiquitination 23.6e-0850.77Show/hide
Query:  RRFELQDACEDNVTLSNQFQDIE----DDKCAHSSHLYILLNDQLQHLTAEMERYKSLTETLQEE
        R  ELQ + E N++LS Q QDIE    DD+  +SS LY L+ND++ H  AE++RYK LTE +Q E
Subjt:  RRFELQDACEDNVTLSNQFQDIE----DDKCAHSSHLYILLNDQLQHLTAEMERYKSLTETLQEE

AT2G38630.1 Transducin/WD40 repeat-like superfamily protein3.0e-1558.33Show/hide
Query:  DQLQHLTAEMERYKSLTETLQEEYDTHDPKGHCLPVLPFLMKRTKVIKIVAARDIVFALAHSGVCATFSKGT
        D+   ++   ER++++   L E+YDTHDPKG+CL  LP L+KR+KVI+IVAARDIVFAL  SGVCA+FS+ T
Subjt:  DQLQHLTAEMERYKSLTETLQEEYDTHDPKGHCLPVLPFLMKRTKVIKIVAARDIVFALAHSGVCATFSKGT

AT3G54190.1 Transducin/WD40 repeat-like superfamily protein2.6e-1967.14Show/hide
Query:  DQLQHLTAEMERYKSLTETLQEEYDTHDPKGHCLPVLPFLMKRTKVIKIVAARDIVFALAHSGVCATFSK
        D+   ++   ER++++   L E+YDTHDPKGHCL  LPFLMKRTKVI+IVAARDIVFALAHSGVCA FS+
Subjt:  DQLQHLTAEMERYKSLTETLQEEYDTHDPKGHCLPVLPFLMKRTKVIKIVAARDIVFALAHSGVCATFSK

AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 83.2e-4935.52Show/hide
Query:  DINAEQSRQTPVVTHSSDSLN-FEQPVIRRSKRQHHPPGFLKDYHCNLLQHSPISKDKNTPYSLDKYLSYSRLSATHREFILNVSIDSEPSYFHQAVKFD
        D +A  S  +  +  S++  N   +P +  S R+   P +L+DY+C    HS  S    T + + ++LSY ++S  +  F++ ++   EPS +++A +F 
Subjt:  DINAEQSRQTPVVTHSSDSLN-FEQPVIRRSKRQHHPPGFLKDYHCNLLQHSPISKDKNTPYSLDKYLSYSRLSATHREFILNVSIDSEPSYFHQAVKFD

Query:  VWRHAMDEEIAAMEQTNTWTLVPLPKE------------------------------------------------KIVTVKLFLSLTVSYGWSLVQMDIN
        VW  AMD+EI AME T+TW +  LP                                                  K+ +VKL L+++  Y ++L Q+DI+
Subjt:  VWRHAMDEEIAAMEQTNTWTLVPLPKE------------------------------------------------KIVTVKLFLSLTVSYGWSLVQMDIN

Query:  NAFLNGDLFEEVYMSIPLGYYRDRKFPTDIPLVCKLNKSIYGLKQASRQWFTKFSGVLLSNGFTQSKADYSLFTRGDDLSFVALLVYVDDILITGPSTSE
        NAFLNGDL EE+YM +P GY   +        VC L KSIYGLKQASRQWF KFS  L+  GF QS +D++ F +     F+ +LVYVDDI+I   + + 
Subjt:  NAFLNGDLFEEVYMSIPLGYYRDRKFPTDIPLVCKLNKSIYGLKQASRQWFTKFSGVLLSNGFTQSKADYSLFTRGDDLSFVALLVYVDDILITGPSTSE

Query:  IAYVRALLQENFLLKDLGNAKYFLGLELSRSSKGM
        +  +++ L+  F L+DLG  KYFLGLE++RS+ G+
Subjt:  IAYVRALLQENFLLKDLGNAKYFLGLELSRSSKGM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGGGAGAAGCTCACTAGAGGCGACTCGAAGCATTACATCTCTGGCGCGGTTCCCTTCTCTCGTCTTCAAAACGAGTGGGTTTTTTTGTTCGGCGCGGTGACTTGCCC
ACACACATCTTTCGACGGCGACTTCATCTCTTCTCCTCATCGTCGATCACTGAACTCTTCATCTCTAGCACGACGTGGGTTTTTTTGTTCATGGGTTTCTGATCGTCGTT
TTGAGCTTCAAGATGCTTGTGAAGATAATGTAACCTTGTCAAATCAATTCCAAGATATTGAGGATGATAAGTGTGCACATTCATCCCACTTATACATTTTACTAAACGAC
CAACTCCAACATTTGACTGCTGAAATGGAGCGTTACAAATCATTGACCGAAACTTTACAGGAAGAATATGATACTCATGATCCAAAGGGGCATTGCTTACCAGTTTTACC
TTTTCTAATGAAGAGGACAAAAGTAATAAAAATTGTTGCTGCACGGGATATTGTTTTTGCACTTGCACATTCTGGTGTTTGTGCAACTTTTAGTAAAGGAACAAAACCAG
GAACTTCTCCAATATTTAATATCTCATCAGTGACTCCAGCCCCTACACGCAAGACTGGTGCCGATGATTTTCTTAATTCTGACAATGATAAAAATGATTATGATTGGTTC
GTACAAAATGAAGAAATAGTATTGTTGAGCGGTGATTTTACACCTCTTCCTTGCGCGCCCTTTTTTTTGTTGCCCTTCTCTCCATTGCCTTTTCTTCCGACTTCTCCAAC
TATGGACAAGGAAGCCAGTAATGGATCTTCAACAACAATCGACGCAAGTTCTGGATCAATTGAAGCTCAATTGAATCCGTTTTCTCTTCATCACTCCTTCTCCTCAACGG
CTGTTCTTGTCACGCAGCCATTGCTTGGAGCGATTAACTACAGTTCCTGGAGTCGTGCGATGATGATGGCTTTATCAGGCAAAAATAAGTCTGGCTTCATCAAAGGTACG
ATAAAGAAACCAGAAACTCAAGGAAGTCTTCTAACAGCGTGGGAATGCAATAATAATGTTATTGCATCTTGGATCTTAAATTCAGTATCTAAAGATATTGCTGCAAGTAT
TGTGTATACTGGATCCGCAAAGGAATTGATTACTCTGAGTCAAGGCAACATGTCTATCGAAGCATACTACACTAAACTCAAGACTATTTGGCAAGATCTTGAAGATTTTA
GGCCCATACCTGAATGCAAATGTGAAGGTCTAAAGCCCTTTCTAGATCATCTACATTCTGAATATGTAATGATTTTTTTAATGGGACTTAATGACTCGTATGCTGCTGTG
AGGGCTCAGATTCTGCTGATGAAACCAATTCCTTCAATAACAGATGTTTTCTCACTTCTTATTCAAGAAGAGCGACAACGTAATGCTGGTAATTCTGCTGCAATTGAACC
GATTGCCCTGCTTGCATCTGATAACTCCAAGAAGAATCCTAATTCTTCTTCAACTGATCGCACTCGGAAAAAGGAGAATCAACGTCCAATTTGCACAAATTGTGGAGTTA
AAGGTCATACTATTGACCGATGCTACAAGTTGCATGGTTATCCTCCTGGTTACAAAACCAGGAACTCATCTTCTCCTGTGCCAGTAGAAGGATCCAACTCTACCAAAGCA
CCTGGAGTAAATCAAGTCAGTCAATCTGATTTTTTCTCCAGCTTAAACTCTAACCAGTATGCACAACTCTTAAACATGCTCTCATCTCATCTGCCTACTGCCAAGAATGA
AACTGCCACCGCGGCCTCAGCAATCACTCATGCTGCAGTTACACTACCAACTGCTTATCGTGTTCAAGTTGAATTCATTGGTGATGTGAGGATTTCAGATCAGATAATTT
TACGAGATGTTTTATTCATACCTCAATTTCAATACAATCTCATTTCCGTCAGCTGTTTATTGCAAACCACAGATGTGTCCCTCACATTTTCTACTGAAAATTGTCTCATT
CAGGACATATCACGATTGACGACGATTGGCAGGGCTGATTGTTGCTATGGCCTTTATCTGTTGAATGCAAAGGTCACACAGTCCCTTAATACTACTGTTGCAGCAACCGT
GAAAGCCAAAGTCATTTCATATGAGACTTGGCATCAACGTTTGGGACACCTTTCTCCTAAACGATTATCATTGTTGAAGACCTCTTTGTCTTTGCCTGATTCTTTTAATT
CCCCTTGTTACATTTGCCCCCTAGCGAAACAAAGACGTTTATCATTTCCTTCGAATAATCATGTAGCTTCACAAGTTTTTGATTTGATACATTGTGATGTCTGGGGACCG
TTTAAGGAACCTACTTATAATGGTTTTCGTTATTTTCTAACTCTTGTCGATGACTGCTCAAGGTTCACTTGGGTTTACTTGATGAAACAAAAATCAGATGCTGTCTCTAT
CATTCCCAGATTTTTTCAACTTGTTGAAACACAGTTCTCAAAAGTGATAAAGGCATTTAGATCCGACAATGCTCCGGAACTAAAATTTGTTGATTTTTTCGCTACGAAGG
GAACAATCCACCAATTCTCCTGTGTAGAAACACCTCAACAGAACTCAGTGGTGGAGAGAAAACATCAACATCTCCTTAATGGAGACTTCATATTGACAGCAACGTATCTC
ATCAATCGTACTCCTACACCTCTCTTGGACAACAAGTCCCCTTTCACTGTATTACATGGCAGTGAGGTTGATTACTCTCAATTAAGAGTATTTGGTTGCCTTTGCTATGC
GTCTACCTTAACAACTCATCGATCTAAATTTGATCCTCGAGCTGTTTCTTGCGTGTTTTTGGGCTATCCACCAGGTGTAAAAGGATACAAATTGTTTGACATTGCCAAAG
GACAGGTTATAGTCTCAAGAGATGTCGTGTTCTTTGAAGATTCCTTTCCATTTTCTACCAGGAAGGATGACTGTAAAAGCACAAATATGTTTCCCGAGTTTGTTCTACCT
TGTCCGATCATAGATGCCATAACACCTGATATGAATGTTTTCGTTGAACCACCTTCCACAGGTGATCGTCTAAATAACCTTTCAGATGATCCTTCTGCCATGGTTTCCAA
TCAGCCTGCACATATTGGAGACTTTCCAACCTGTAGTGATTCAAATGATGAGCATTGTGATACAAATGTTGTGCAATCTCCTGATGACAATATTCTTATCACTGTTCAGC
CTGATATCAATGCTGAACAATCTAGACAAACTCCAGTGGTGACTCACTCTTCTGACAGCTTGAATTTTGAGCAACCTGTTATACGGCGATCCAAAAGACAACATCATCCA
CCAGGATTCCTTAAGGACTATCATTGTAACTTGCTTCAACATAGTCCTATTTCTAAGGACAAAAATACTCCTTATTCACTCGATAAATATCTGTCCTATAGTCGTCTTTC
AGCTACTCATCGAGAGTTTATTCTTAATGTCTCGATAGATTCTGAACCATCTTACTTCCATCAAGCAGTCAAGTTTGATGTCTGGCGTCATGCTATGGATGAAGAGATCG
CCGCTATGGAACAAACCAACACATGGACTCTTGTTCCTCTTCCAAAAGAAAAAATAGTCACCGTCAAACTGTTTTTGTCTCTTACTGTCTCTTATGGCTGGTCTTTGGTT
CAAATGGACATTAACAACGCTTTTTTGAATGGAGATTTATTCGAAGAAGTTTACATGTCCATACCTCTCGGATATTATCGAGATCGCAAGTTCCCTACTGATATCCCTCT
AGTTTGCAAACTAAACAAGTCCATTTATGGACTCAAGCAGGCTTCAAGACAATGGTTCACAAAATTCTCAGGTGTTTTACTTTCTAATGGCTTCACCCAATCCAAGGCTG
ACTACTCTCTCTTTACTCGGGGAGATGACCTTTCATTTGTGGCCTTATTAGTCTATGTAGACGATATCTTGATCACTGGCCCATCCACATCAGAAATTGCTTATGTTAGA
GCCTTACTCCAAGAGAATTTTTTGTTGAAGGATTTAGGCAATGCCAAATACTTCCTTGGACTTGAACTCTCTCGTTCTTCCAAAGGAATGTAG
mRNA sequenceShow/hide mRNA sequence
ATGAGGGAGAAGCTCACTAGAGGCGACTCGAAGCATTACATCTCTGGCGCGGTTCCCTTCTCTCGTCTTCAAAACGAGTGGGTTTTTTTGTTCGGCGCGGTGACTTGCCC
ACACACATCTTTCGACGGCGACTTCATCTCTTCTCCTCATCGTCGATCACTGAACTCTTCATCTCTAGCACGACGTGGGTTTTTTTGTTCATGGGTTTCTGATCGTCGTT
TTGAGCTTCAAGATGCTTGTGAAGATAATGTAACCTTGTCAAATCAATTCCAAGATATTGAGGATGATAAGTGTGCACATTCATCCCACTTATACATTTTACTAAACGAC
CAACTCCAACATTTGACTGCTGAAATGGAGCGTTACAAATCATTGACCGAAACTTTACAGGAAGAATATGATACTCATGATCCAAAGGGGCATTGCTTACCAGTTTTACC
TTTTCTAATGAAGAGGACAAAAGTAATAAAAATTGTTGCTGCACGGGATATTGTTTTTGCACTTGCACATTCTGGTGTTTGTGCAACTTTTAGTAAAGGAACAAAACCAG
GAACTTCTCCAATATTTAATATCTCATCAGTGACTCCAGCCCCTACACGCAAGACTGGTGCCGATGATTTTCTTAATTCTGACAATGATAAAAATGATTATGATTGGTTC
GTACAAAATGAAGAAATAGTATTGTTGAGCGGTGATTTTACACCTCTTCCTTGCGCGCCCTTTTTTTTGTTGCCCTTCTCTCCATTGCCTTTTCTTCCGACTTCTCCAAC
TATGGACAAGGAAGCCAGTAATGGATCTTCAACAACAATCGACGCAAGTTCTGGATCAATTGAAGCTCAATTGAATCCGTTTTCTCTTCATCACTCCTTCTCCTCAACGG
CTGTTCTTGTCACGCAGCCATTGCTTGGAGCGATTAACTACAGTTCCTGGAGTCGTGCGATGATGATGGCTTTATCAGGCAAAAATAAGTCTGGCTTCATCAAAGGTACG
ATAAAGAAACCAGAAACTCAAGGAAGTCTTCTAACAGCGTGGGAATGCAATAATAATGTTATTGCATCTTGGATCTTAAATTCAGTATCTAAAGATATTGCTGCAAGTAT
TGTGTATACTGGATCCGCAAAGGAATTGATTACTCTGAGTCAAGGCAACATGTCTATCGAAGCATACTACACTAAACTCAAGACTATTTGGCAAGATCTTGAAGATTTTA
GGCCCATACCTGAATGCAAATGTGAAGGTCTAAAGCCCTTTCTAGATCATCTACATTCTGAATATGTAATGATTTTTTTAATGGGACTTAATGACTCGTATGCTGCTGTG
AGGGCTCAGATTCTGCTGATGAAACCAATTCCTTCAATAACAGATGTTTTCTCACTTCTTATTCAAGAAGAGCGACAACGTAATGCTGGTAATTCTGCTGCAATTGAACC
GATTGCCCTGCTTGCATCTGATAACTCCAAGAAGAATCCTAATTCTTCTTCAACTGATCGCACTCGGAAAAAGGAGAATCAACGTCCAATTTGCACAAATTGTGGAGTTA
AAGGTCATACTATTGACCGATGCTACAAGTTGCATGGTTATCCTCCTGGTTACAAAACCAGGAACTCATCTTCTCCTGTGCCAGTAGAAGGATCCAACTCTACCAAAGCA
CCTGGAGTAAATCAAGTCAGTCAATCTGATTTTTTCTCCAGCTTAAACTCTAACCAGTATGCACAACTCTTAAACATGCTCTCATCTCATCTGCCTACTGCCAAGAATGA
AACTGCCACCGCGGCCTCAGCAATCACTCATGCTGCAGTTACACTACCAACTGCTTATCGTGTTCAAGTTGAATTCATTGGTGATGTGAGGATTTCAGATCAGATAATTT
TACGAGATGTTTTATTCATACCTCAATTTCAATACAATCTCATTTCCGTCAGCTGTTTATTGCAAACCACAGATGTGTCCCTCACATTTTCTACTGAAAATTGTCTCATT
CAGGACATATCACGATTGACGACGATTGGCAGGGCTGATTGTTGCTATGGCCTTTATCTGTTGAATGCAAAGGTCACACAGTCCCTTAATACTACTGTTGCAGCAACCGT
GAAAGCCAAAGTCATTTCATATGAGACTTGGCATCAACGTTTGGGACACCTTTCTCCTAAACGATTATCATTGTTGAAGACCTCTTTGTCTTTGCCTGATTCTTTTAATT
CCCCTTGTTACATTTGCCCCCTAGCGAAACAAAGACGTTTATCATTTCCTTCGAATAATCATGTAGCTTCACAAGTTTTTGATTTGATACATTGTGATGTCTGGGGACCG
TTTAAGGAACCTACTTATAATGGTTTTCGTTATTTTCTAACTCTTGTCGATGACTGCTCAAGGTTCACTTGGGTTTACTTGATGAAACAAAAATCAGATGCTGTCTCTAT
CATTCCCAGATTTTTTCAACTTGTTGAAACACAGTTCTCAAAAGTGATAAAGGCATTTAGATCCGACAATGCTCCGGAACTAAAATTTGTTGATTTTTTCGCTACGAAGG
GAACAATCCACCAATTCTCCTGTGTAGAAACACCTCAACAGAACTCAGTGGTGGAGAGAAAACATCAACATCTCCTTAATGGAGACTTCATATTGACAGCAACGTATCTC
ATCAATCGTACTCCTACACCTCTCTTGGACAACAAGTCCCCTTTCACTGTATTACATGGCAGTGAGGTTGATTACTCTCAATTAAGAGTATTTGGTTGCCTTTGCTATGC
GTCTACCTTAACAACTCATCGATCTAAATTTGATCCTCGAGCTGTTTCTTGCGTGTTTTTGGGCTATCCACCAGGTGTAAAAGGATACAAATTGTTTGACATTGCCAAAG
GACAGGTTATAGTCTCAAGAGATGTCGTGTTCTTTGAAGATTCCTTTCCATTTTCTACCAGGAAGGATGACTGTAAAAGCACAAATATGTTTCCCGAGTTTGTTCTACCT
TGTCCGATCATAGATGCCATAACACCTGATATGAATGTTTTCGTTGAACCACCTTCCACAGGTGATCGTCTAAATAACCTTTCAGATGATCCTTCTGCCATGGTTTCCAA
TCAGCCTGCACATATTGGAGACTTTCCAACCTGTAGTGATTCAAATGATGAGCATTGTGATACAAATGTTGTGCAATCTCCTGATGACAATATTCTTATCACTGTTCAGC
CTGATATCAATGCTGAACAATCTAGACAAACTCCAGTGGTGACTCACTCTTCTGACAGCTTGAATTTTGAGCAACCTGTTATACGGCGATCCAAAAGACAACATCATCCA
CCAGGATTCCTTAAGGACTATCATTGTAACTTGCTTCAACATAGTCCTATTTCTAAGGACAAAAATACTCCTTATTCACTCGATAAATATCTGTCCTATAGTCGTCTTTC
AGCTACTCATCGAGAGTTTATTCTTAATGTCTCGATAGATTCTGAACCATCTTACTTCCATCAAGCAGTCAAGTTTGATGTCTGGCGTCATGCTATGGATGAAGAGATCG
CCGCTATGGAACAAACCAACACATGGACTCTTGTTCCTCTTCCAAAAGAAAAAATAGTCACCGTCAAACTGTTTTTGTCTCTTACTGTCTCTTATGGCTGGTCTTTGGTT
CAAATGGACATTAACAACGCTTTTTTGAATGGAGATTTATTCGAAGAAGTTTACATGTCCATACCTCTCGGATATTATCGAGATCGCAAGTTCCCTACTGATATCCCTCT
AGTTTGCAAACTAAACAAGTCCATTTATGGACTCAAGCAGGCTTCAAGACAATGGTTCACAAAATTCTCAGGTGTTTTACTTTCTAATGGCTTCACCCAATCCAAGGCTG
ACTACTCTCTCTTTACTCGGGGAGATGACCTTTCATTTGTGGCCTTATTAGTCTATGTAGACGATATCTTGATCACTGGCCCATCCACATCAGAAATTGCTTATGTTAGA
GCCTTACTCCAAGAGAATTTTTTGTTGAAGGATTTAGGCAATGCCAAATACTTCCTTGGACTTGAACTCTCTCGTTCTTCCAAAGGAATGTAG
Protein sequenceShow/hide protein sequence
MREKLTRGDSKHYISGAVPFSRLQNEWVFLFGAVTCPHTSFDGDFISSPHRRSLNSSSLARRGFFCSWVSDRRFELQDACEDNVTLSNQFQDIEDDKCAHSSHLYILLND
QLQHLTAEMERYKSLTETLQEEYDTHDPKGHCLPVLPFLMKRTKVIKIVAARDIVFALAHSGVCATFSKGTKPGTSPIFNISSVTPAPTRKTGADDFLNSDNDKNDYDWF
VQNEEIVLLSGDFTPLPCAPFFLLPFSPLPFLPTSPTMDKEASNGSSTTIDASSGSIEAQLNPFSLHHSFSSTAVLVTQPLLGAINYSSWSRAMMMALSGKNKSGFIKGT
IKKPETQGSLLTAWECNNNVIASWILNSVSKDIAASIVYTGSAKELITLSQGNMSIEAYYTKLKTIWQDLEDFRPIPECKCEGLKPFLDHLHSEYVMIFLMGLNDSYAAV
RAQILLMKPIPSITDVFSLLIQEERQRNAGNSAAIEPIALLASDNSKKNPNSSSTDRTRKKENQRPICTNCGVKGHTIDRCYKLHGYPPGYKTRNSSSPVPVEGSNSTKA
PGVNQVSQSDFFSSLNSNQYAQLLNMLSSHLPTAKNETATAASAITHAAVTLPTAYRVQVEFIGDVRISDQIILRDVLFIPQFQYNLISVSCLLQTTDVSLTFSTENCLI
QDISRLTTIGRADCCYGLYLLNAKVTQSLNTTVAATVKAKVISYETWHQRLGHLSPKRLSLLKTSLSLPDSFNSPCYICPLAKQRRLSFPSNNHVASQVFDLIHCDVWGP
FKEPTYNGFRYFLTLVDDCSRFTWVYLMKQKSDAVSIIPRFFQLVETQFSKVIKAFRSDNAPELKFVDFFATKGTIHQFSCVETPQQNSVVERKHQHLLNGDFILTATYL
INRTPTPLLDNKSPFTVLHGSEVDYSQLRVFGCLCYASTLTTHRSKFDPRAVSCVFLGYPPGVKGYKLFDIAKGQVIVSRDVVFFEDSFPFSTRKDDCKSTNMFPEFVLP
CPIIDAITPDMNVFVEPPSTGDRLNNLSDDPSAMVSNQPAHIGDFPTCSDSNDEHCDTNVVQSPDDNILITVQPDINAEQSRQTPVVTHSSDSLNFEQPVIRRSKRQHHP
PGFLKDYHCNLLQHSPISKDKNTPYSLDKYLSYSRLSATHREFILNVSIDSEPSYFHQAVKFDVWRHAMDEEIAAMEQTNTWTLVPLPKEKIVTVKLFLSLTVSYGWSLV
QMDINNAFLNGDLFEEVYMSIPLGYYRDRKFPTDIPLVCKLNKSIYGLKQASRQWFTKFSGVLLSNGFTQSKADYSLFTRGDDLSFVALLVYVDDILITGPSTSEIAYVR
ALLQENFLLKDLGNAKYFLGLELSRSSKGM