; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0034648 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0034648
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionIntegrase catalytic domain-containing protein
Genome locationchr3:9264412..9268196
RNA-Seq ExpressionLag0034648
SyntenyLag0034648
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR005162 - Retrotransposon gag domain
IPR012337 - Ribonuclease H-like superfamily
IPR025724 - GAG-pre-integrase domain
IPR029472 - Retrotransposon Copia-like, N-terminal
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0065480.1 Cysteine-rich RLK (receptor-like protein kinase) 8 [Cucumis melo var. makuwa]6.8e-20856.72Show/hide
Query:  DVQLNPFHLHHSYSSSVVLVTQPLTGASNYGSWRKAMLIALSGKNKEGFIDGSLKKP-EGRKSIAWKCNNDIITSWILNSVSKEIAASINYTGSAKAVWD
        D QLNP+ +HHS   +  +VTQPLTGA NY SW +AML+A+SG+NK GFI G ++KP +G    AW CNNDI+ SWILNSVSKEIAASI Y GS K +WD
Subjt:  DVQLNPFHLHHSYSSSVVLVTQPLTGASNYGSWRKAMLIALSGKNKEGFIDGSLKKP-EGRKSIAWKCNNDIITSWILNSVSKEIAASINYTGSAKAVWD

Query:  ELHGRFKQSNGPHVYQLRKELVTSSQGSLSIEAYYTKLKTIWQELCDYRPTCDCTCGGIKPFTEHMESEFVMIFLMGLSDSYAAVRAQILLMNPIPDITK
        EL  RFKQSNGP +YQLRKE VT  QG+L+IE YYTKLKTIWQ L +YR T DCTCGG+KPF +H+ESE++M FLMGL+DSYAAVRAQILLM P+P I  
Subjt:  ELHGRFKQSNGPHVYQLRKELVTSSQGSLSIEAYYTKLKTIWQELCDYRPTCDCTCGGIKPFTEHMESEFVMIFLMGLSDSYAAVRAQILLMNPIPDITK

Query:  VFALIVQEEHQRTAGNIPTSSTQENITLLVAEASKKQSNDRFRRTNQRPICSHCGVKGHVVDKCYKIHGYPLGYKFRNSNLSGSDTTGPKPAETNAVSQP
        VF+L++QEE QR+AG +  +   + + L +A +S   S DR  R  +RP CS+CG+KGH+ DKCYK HGYP GYK RNSN   S TT P  ++TN V+  
Subjt:  VFALIVQEEHQRTAGNIPTSSTQENITLLVAEASKKQSNDRFRRTNQRPICSHCGVKGHVVDKCYKIHGYPLGYKFRNSNLSGSDTTGPKPAETNAVSQP

Query:  QSG-------FFSSLNSSQYTQLMELLNTHIQSAKTEPITAASSLTRTAGICSTLHSSRMPKSDIWILDSGASRHICHRRDSFQNWRQVYGVSVILPTGF
         S        FFSSLNS QY+QLM LLN H+Q+A T PIT A+++T T+GI + L S      D WI+DSGASRHICH +  F+NW     + V+LP G 
Subjt:  QSG-------FFSSLNSSQYTQLMELLNTHIQSAKTEPITAASSLTRTAGICSTLHSSRMPKSDIWILDSGASRHICHRRDSFQNWRQVYGVSVILPTGF

Query:  ---------------------------------LSCLLKNNNLSIDFSGSSCRIQDKSLLMTIGKAESDNGLYIL-LQPDKLPFTVAESACFVSISTWHS
                                         +SCLL   N+S+DF  + C IQD S  M IGKA   NGLY+L  + +            +S+ TWH 
Subjt:  ---------------------------------LSCLLKNNNLSIDFSGSSCRIQDKSLLMTIGKAESDNGLYIL-LQPDKLPFTVAESACFVSISTWHS

Query:  RLGHLSAPRLSLLKSTLQFQGSHTECSPCHICPLAKQRRLSFSSNNHVASTMFELVHCDIWGPFKAPTYAGYKYFLTIVDDHSRYTWTYLMKTKSDALHI
        RLGHLS   LS L STL         S CH+CPLAKQ+RLSF SNN+VAS+ F+LVH DIWGPFK P+Y GYKYFLT+VDD  R+TW Y+++ KSD LHI
Subjt:  RLGHLSAPRLSLLKSTLQFQGSHTECSPCHICPLAKQRRLSFSSNNHVASTMFELVHCDIWGPFKAPTYAGYKYFLTIVDDHSRYTWTYLMKTKSDALHI

Query:  IPRFFKLINTQFSRVTKVFRSDNAPELQFKEFFASIGTIHQFSCVERPQQNSVAERKHQHLLNVARALFF
        +P+FF+LI TQFS+V K FRSDNAPEL+  EFFA  GT+HQFSCVE+PQQNSV ERKHQHLLNVARALFF
Subjt:  IPRFFKLINTQFSRVTKVFRSDNAPELQFKEFFASIGTIHQFSCVERPQQNSVAERKHQHLLNVARALFF

KZV25004.1 Cysteine-rich RLK (receptor-like protein kinase) 8 [Dorcoceras hygrometricum]7.3e-19439.74Show/hide
Query:  NPFHLHHSYSSSVVLVTQPLTGASNYGSWRKAMLIALSGKNKEGFIDGSLKKPEGRKSI--AWKCNNDIITSWILNSVSKEIAASINYTGSAKAVWDELH
        +P++LH+     + LV+ PL G SNY +WR+AM++AL+ KNK GFID S+ +P     +  +W   N ++ SWILNSV++ IA S+ Y  +A+ +W +L+
Subjt:  NPFHLHHSYSSSVVLVTQPLTGASNYGSWRKAMLIALSGKNKEGFIDGSLKKPEGRKSI--AWKCNNDIITSWILNSVSKEIAASINYTGSAKAVWDELH

Query:  GRFKQSNGPHVYQLRKELVTSSQGSLSIEAYYTKLKTIWQELCDYRPTCDCTCGGIKPFTEHMESEFVMIFLMGLSDSYAAVRAQILLMNPIPDITKVFA
         RF +SN P +YQ++K L    QGS+ + +YYTKL+T+W EL DY+PT  CTCG ++ +  +   E VM FLMGL+DSYA VRAQ+L++ P+P I KVFA
Subjt:  GRFKQSNGPHVYQLRKELVTSSQGSLSIEAYYTKLKTIWQELCDYRPTCDCTCGGIKPFTEHMESEFVMIFLMGLSDSYAAVRAQILLMNPIPDITKVFA

Query:  LIVQEEHQR------TAGNIPTSSTQENITLLVAEA-SKKQSNDRFRRTNQRPICSHCGVKGHVVDKCYKIHGYPLGY-KFRNSNLSGSDTT--GPKPAE
        L++QEE QR      +   +  S    N+      A S + S +       R ICSHC  + H VDKCYK+HGYP G+ KF++    GS         +E
Subjt:  LIVQEEHQR------TAGNIPTSSTQENITLLVAEA-SKKQSNDRFRRTNQRPICSHCGVKGHVVDKCYKIHGYPLGY-KFRNSNLSGSDTT--GPKPAE

Query:  TNAVSQPQSGFFSSLNSSQYTQLMELLNTHIQSAKT-----EPITAASSLTRTAGICS-TLHSSRMPKSDIWILDSGASRHICHRRDSFQNWRQVYGVSV
        T+  +Q Q     SL  SQ  QL+E L++ +Q+ +      +P T  S LT   GICS T H   + + D WI+D+GA+ HIC     F++ R +    V
Subjt:  TNAVSQPQSGFFSSLNSSQYTQLMELLNTHIQSAKT-----EPITAASSLTRTAGICS-TLHSSRMPKSDIWILDSGASRHICHRRDSFQNWRQVYGVSV

Query:  ILPTGF---------------------------------LSCLLKNNNLSIDFSGSSCRIQDKSLLMTIGKAESDNGLYILLQPDK-LPFTVAESACFVS
        +LP                                    +S L  N+N S+ F   SC+IQD S +  IG  +    LY+L QPD+ LP  +  +  FVS
Subjt:  ILPTGF---------------------------------LSCLLKNNNLSIDFSGSSCRIQDKSLLMTIGKAESDNGLYILLQPDK-LPFTVAESACFVS

Query:  IS-TWHSRLGHLSAPRLSLLKSTLQFQGSHTECSPCHICPLAKQRRLSFSSNNHVASTMFELVHCDIWGPFKAPTYAGYKYFLTIVDDHSRYTWTYLMKT
         S  WH R+GH S  +LS LK+ L  + +    + CH C L+KQRRL  +S N++++ +FEL+H D WGPF   +  G+++F TIVDDHSRYTW Y++K+
Subjt:  IS-TWHSRLGHLSAPRLSLLKSTLQFQGSHTECSPCHICPLAKQRRLSFSSNNHVASTMFELVHCDIWGPFKAPTYAGYKYFLTIVDDHSRYTWTYLMKT

Query:  KSDALHIIPRFFKLINTQFSRVTKVFRSDNAPELQFKEFFASIGTIHQFSCVERPQQNSVAERKHQHLLNVARALFFQARIPIRLWGDCILTASYLINRT
        KSD L I P F ++++TQF    K  RSDNAPEL F +FFA  G  H  SCVERPQQNSV ERKHQH+LNVARAL FQ+ IP+  W DCI T+ YLINRT
Subjt:  KSDALHIIPRFFKLINTQFSRVTKVFRSDNAPELQFKEFFASIGTIHQFSCVERPQQNSVAERKHQHLLNVARALFFQARIPIRLWGDCILTASYLINRT

Query:  PAPLLHNKTPFELLYGHEVDYSNLRIFGCLCYASTRGASKNQKTDKPNRTKPIQWASLCVFIGYPPGVKGYRLYDIVKKQVIISRDVIFFEDTFPFHNQG
        P+P+L +KTPFELL+G    YS+L++FGCLCYAST  +S+++ + +  R         CVFIGYPPG KGY+L ++   ++ ISRDVIF E+TFP+ N  
Subjt:  PAPLLHNKTPFELLYGHEVDYSNLRIFGCLCYASTRGASKNQKTDKPNRTKPIQWASLCVFIGYPPGVKGYRLYDIVKKQVIISRDVIFFEDTFPFHNQG

Query:  TISDDDVCNLFSDHVLPCPIIDPLASSDAMNPTAGAHVDQAPSLTPDVPTSIDHNLESSVVPTMEPDSSFEPDAVVEDSLTNVPLVETVELDVLVVSGHK
         +S  D+                              V  +  +TP +P     +                                             
Subjt:  TISDDDVCNLFSDHVLPCPIIDPLASSDAMNPTAGAHVDQAPSLTPDVPTSIDHNLESSVVPTMEPDSSFEPDAVVEDSLTNVPLVETVELDVLVVSGHK

Query:  PIANTSGAPVRRSSRAHICPKFLQDYHCGSLN-HTADRPVYPIENYLSYDNLSPSHRQFILNVSAIHEPSFFHQAVKLEPWRRAMDAEIAAMNV
                   R+SR H  P  L+DYHC S++   +    +PI   ++Y  LS SHR F+ N+S+I EP+ F QAV L  WR+AMD E+ A+ +
Subjt:  PIANTSGAPVRRSSRAHICPKFLQDYHCGSLN-HTADRPVYPIENYLSYDNLSPSHRQFILNVSAIHEPSFFHQAVKLEPWRRAMDAEIAAMNV

MCH80704.1 retrovirus-related Pol polyprotein from transposon TNT 1-94 [Trifolium medium]1.0e-18739.13Show/hide
Query:  LNPFHLHHSYSSSVVLVTQPLTGASNYGSWRKAMLIALSGKNKEGFIDGSLKKPEGR--KSIAWKCNNDIITSWILNSVSKEIAASINYTGSAKAVWDEL
        LNPF +HHS    +VLV+  LT   NY SWR+AMLIAL  KNK GF+DGS+ +P     +   W  N++++ SW+LNSVSKE+ A+I Y+ +A A+W +L
Subjt:  LNPFHLHHSYSSSVVLVTQPLTGASNYGSWRKAMLIALSGKNKEGFIDGSLKKPEGR--KSIAWKCNNDIITSWILNSVSKEIAASINYTGSAKAVWDEL

Query:  HGRFKQSNGPHVYQLRKELVTSSQGSLSIEAYYTKLKTIWQELCDYRPTCDCTCGGIKPFTEHMESEFVMIFLMGLSDSYAAVRAQILLMNPIPDITKVF
          RF Q+NGP ++QLRK+ +T +QG+LS+ AYY+K+K +W+EL ++RP   C CGG+ P  EH+  E+V+ FL+                          
Subjt:  HGRFKQSNGPHVYQLRKELVTSSQGSLSIEAYYTKLKTIWQELCDYRPTCDCTCGGIKPFTEHMESEFVMIFLMGLSDSYAAVRAQILLMNPIPDITKVF

Query:  ALIVQEEHQRTAGNIPTSSTQENITLLVAEASKKQSNDRFRRTNQRPICSHCGVKGHVVDKCYKIHGYPLGYKFRNSNLSGSDTTGPKPAETNAVSQPQS
             EE QR   N  +S   E++   V +    +S         +P+C+HCG+ GHV DKCYK+HGYP GY+   S           P   NAV+    
Subjt:  ALIVQEEHQRTAGNIPTSSTQENITLLVAEASKKQSNDRFRRTNQRPICSHCGVKGHVVDKCYKIHGYPLGYKFRNSNLSGSDTTGPKPAETNAVSQPQS

Query:  GFFSSLNSSQYTQLMELLNTHIQSAKTEPITAASSLTRTAGICSTLHSSRMPKSDIWILDSGASRHICHRRDSFQNWRQVYGVSVILP-----------T
             LN+ QY QL+  +    Q A++     +++      + ++ H +  P +  WI+DSGA+ HI +  D F +++ +    V LP           T
Subjt:  GFFSSLNSSQYTQLMELLNTHIQSAKTEPITAASSLTRTAGICSTLHSSRMPKSDIWILDSGASRHICHRRDSFQNWRQVYGVSVILP-----------T

Query:  GFLSC----------------------LLKNNNLSIDFSGSSCRIQDKSLLMTIGKAESDNGLYIL-LQPDK------LPFTVA-------ESACFVSIS
         +L+                       LL+N++LS+  + +   IQD      IGK++  +GLY L   P+       LPF  +        S    S+S
Subjt:  GFLSC----------------------LLKNNNLSIDFSGSSCRIQDKSLLMTIGKAESDNGLYIL-LQPDK------LPFTVA-------ESACFVSIS

Query:  TWHSRLGHLSAPRLSLLKSTLQ--FQGSHTECSPCHICPLAKQRRLSFSSNNHVASTMFELVHCDIWGPFKAPTYAGYKYFLTIVDDHSRYTWTYLMKTK
         WH+RLGHLS P L  + + +   F  + +    C +CPLAK  RL F SNN+ A   F+L+HCD+WGPFK PTY GY YFLTIVDD +RYTWT+LMK K
Subjt:  TWHSRLGHLSAPRLSLLKSTLQ--FQGSHTECSPCHICPLAKQRRLSFSSNNHVASTMFELVHCDIWGPFKAPTYAGYKYFLTIVDDHSRYTWTYLMKTK

Query:  SDALHIIPRFFKLINTQFSRVTKVFRSDNAPELQFKEFFASIGTIHQFSCVERPQQNSVAERKHQHLLNVARALFFQARIPIRLWGDCILTASYLINRTP
        ++A  +I  FFK + TQF++  K   +DNA ELQ   F  + GT+HQFSC  RPQQNSV ERKHQHLLNVARAL FQ+++P++ WGDCI TA+YLINR P
Subjt:  SDALHIIPRFFKLINTQFSRVTKVFRSDNAPELQFKEFFASIGTIHQFSCVERPQQNSVAERKHQHLLNVARALFFQARIPIRLWGDCILTASYLINRTP

Query:  APLLHNKTPFELLYGHEVDYSNLRIFGCLCYASTRGASKNQKTDKPNRTKPIQWASLCVFIGYPPGVKGYRLYDIVKKQVIISRDVIFFEDTFPFHNQGT
        +  L N +P+ELL+G   DY++LR+FGC+C+AST  A +++ T +         A  CV +GYPPGVKGY+L  I  ++++I+RDV F E  FPFH+   
Subjt:  APLLHNKTPFELLYGHEVDYSNLRIFGCLCYASTRGASKNQKTDKPNRTKPIQWASLCVFIGYPPGVKGYRLYDIVKKQVIISRDVIFFEDTFPFHNQGT

Query:  ISDDDVCNLFSDHVLPCPIIDPLASSD----AMNPTAGAHVDQAPSLTPDVPTSIDHNLESSVVPTMEPDSSFEPDAVVEDSLTNVPLVETVELDVLVVS
           D + N+FSD VLP P+ +P +  D     +NP   +H        P     +  +  SS V    P                            +V+
Subjt:  ISDDDVCNLFSDHVLPCPIIDPLASSD----AMNPTAGAHVDQAPSLTPDVPTSIDHNLESSVVPTMEPDSSFEPDAVVEDSLTNVPLVETVELDVLVVS

Query:  GHKPIANTSGAPVRRSSRAHICPKFLQDYHCGSLNHTADRPVYPIENYLSYDNLSPSHRQFILNVSAIHEPSFFHQAVKLEPWRRAMDAEIAAM
         H  +        RRS+R    P +LQDY C           YPI+N+LSYD L PS++QFI  VS I+EPSF+HQAV    WR+AM  E+AA+
Subjt:  GHKPIANTSGAPVRRSSRAHICPKFLQDYHCGSLNHTADRPVYPIENYLSYDNLSPSHRQFILNVSAIHEPSFFHQAVKLEPWRRAMDAEIAAM

RVW21404.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]4.9e-19039.63Show/hide
Query:  VIDVQLNPFHLHHSYSSSVVLVTQPLTGASNYGSWRKAMLIALSGKNKEGFIDGSLKKPEGRKSI--AWKCNNDIITSWILNSVSKEIAASINYTGSAKA
        V++   +P+ LH+     + LV+  LTGA NY +W +AML+AL+ KNK GF+DG++ +P     I  AW   N +I+SWI+N+VS+EIA S+ Y  SA  
Subjt:  VIDVQLNPFHLHHSYSSSVVLVTQPLTGASNYGSWRKAMLIALSGKNKEGFIDGSLKKPEGRKSI--AWKCNNDIITSWILNSVSKEIAASINYTGSAKA

Query:  VWDELHGRFKQSNGPHVYQLRKELVTSSQGSLSIEAYYTKLKTIWQELCDYRPTCDCTCGGIKPFTEHMESEFVMIFLMGLSDSYAAVRAQILLMNPIPD
        +W +L+ RF Q NGP ++Q++K+L   +QGSL + +Y+TKLK +W EL +++P   C CGG++ +T++   E+V+ FLMGL+D YA +R QIL+M+P+P 
Subjt:  VWDELHGRFKQSNGPHVYQLRKELVTSSQGSLSIEAYYTKLKTIWQELCDYRPTCDCTCGGIKPFTEHMESEFVMIFLMGLSDSYAAVRAQILLMNPIPD

Query:  ITKVFALIVQEEHQRTAG-NIPTSSTQENITLLVAEASKKQSNDRFRRTNQRPICSHCGVKGHVVDKCYKIHGYPLGYKFRNSNLSGSDTTGPKPAETNA
        I KVF+L++QEE  RT G +   S   + +T      +   S+   +    R  CS+CG +GH+ DKCYK+ GYP G+KF+N   + S          + 
Subjt:  ITKVFALIVQEEHQRTAG-NIPTSSTQENITLLVAEASKKQSNDRFRRTNQRPICSHCGVKGHVVDKCYKIHGYPLGYKFRNSNLSGSDTTGPKPAETNA

Query:  VSQPQSGFFSSLNSSQYTQLMELLNTHI---QSAKTEPITAASSLTRTAGICSTLHSSRMPKSDIWILDSGASRHICHRRDSFQNWRQVYGVSVILPTGF
         +       SSL + Q  QL++LL   +    SA TE  +   S++  AG    + +        WI++SGA+ H+C+    F +   V  V V LPTG 
Subjt:  VSQPQSGFFSSLNSSQYTQLMELLNTHI---QSAKTEPITAASSLTRTAGICSTLHSSRMPKSDIWILDSGASRHICHRRDSFQNWRQVYGVSVILPTGF

Query:  LSCLLKNNNLSIDFSGSSCRIQDKSLLMTIGKAESDNGLYILLQPDKLPFTVAESACFVS----ISTWHSRLGHLSAPRLSLLKSTLQFQGSHTECSPCH
                 + ID  GS    +D  LL  +        L       K+    +  A  +     +S WHSRLGH S  RL  L+S L F  S  + +PC+
Subjt:  LSCLLKNNNLSIDFSGSSCRIQDKSLLMTIGKAESDNGLYILLQPDKLPFTVAESACFVS----ISTWHSRLGHLSAPRLSLLKSTLQFQGSHTECSPCH

Query:  ICPLAKQRRLSFSSNNHVASTMFELVHCDIWGPFKAPTYAGYKYFLTIVDDHSRYTWTYLMKTKSDALHIIPRFFKLINTQFSRVTKVFRSDNAPELQFK
        +CPLAKQR L + S N   S+ F+L+H DIWGPF   +  GYK+FLTIVDDHSR TW Y++K KS+    IP FF  +  QF +  K  RSDNAPEL   
Subjt:  ICPLAKQRRLSFSSNNHVASTMFELVHCDIWGPFKAPTYAGYKYFLTIVDDHSRYTWTYLMKTKSDALHIIPRFFKLINTQFSRVTKVFRSDNAPELQFK

Query:  EFFASIGTIHQFSCVERPQQNSVAERKHQHLLNVARALFFQARIPIRLWGDCILTASYLINRTPAPLLHNKTPFELLYGHEVDYSNLRIFGCLCYASTRG
         F+ S+G IH  SCVE PQQNSV ERKHQH+LNVARAL FQ+ +PI  W DCILTA YLINRTP+P L+NKTPFE+L+    DYS+LR+FGCLCY ST  
Subjt:  EFFASIGTIHQFSCVERPQQNSVAERKHQHLLNVARALFFQARIPIRLWGDCILTASYLINRTPAPLLHNKTPFELLYGHEVDYSNLRIFGCLCYASTRG

Query:  ASKNQKTDKPNRTKPIQWASLCVFIGYPPGVKGYRLYDIVKKQVIISRDVIFFEDTFPFHNQGTISDDDV-CNLFSDHVLPCPIIDPLASSDAMNPTAGA
                K NRTK    A   VF+GYP G KGY+L DI  + + ISR+VIF E+ FPF      S  D+  +LF D VLPC                  
Subjt:  ASKNQKTDKPNRTKPIQWASLCVFIGYPPGVKGYRLYDIVKKQVIISRDVIFFEDTFPFHNQGTISDDDV-CNLFSDHVLPCPIIDPLASSDAMNPTAGA

Query:  HVDQAPSLTPDVPTSIDHNLESSVVPTMEPDSSFEPDAVVEDSLTNVPLVETVELDVLVVSGHKPIANTSGAPVRRSSRAHICPKFLQDYHCGSLNHTA-
                      + D++  SSV+P +               ++  PL                      AP  R +R      +L+DYHC  +N  A 
Subjt:  HVDQAPSLTPDVPTSIDHNLESSVVPTMEPDSSFEPDAVVEDSLTNVPLVETVELDVLVVSGHKPIANTSGAPVRRSSRAHICPKFLQDYHCGSLNHTA-

Query:  ---DRPVYPIENYLSYDNLSPSHRQFILNVSAIHEPSFFHQAVKLEPWRRAMDAEIAAMNVLRPGVLFLFRLGSTQWGANGFIK
               +PI+++LSYD LS S++ F L+VS I EP  F +A ++  WR AMD E+ A+   +   +    +GST    NG  K
Subjt:  ---DRPVYPIENYLSYDNLSPSHRQFILNVSAIHEPSFFHQAVKLEPWRRAMDAEIAAMNVLRPGVLFLFRLGSTQWGANGFIK

XP_012857659.1 PREDICTED: uncharacterized protein LOC105976934 [Erythranthe guttata]6.0e-18838.49Show/hide
Query:  IDVQLNPFHLHHSYSSSVVLVTQPLTGASNYGSWRKAMLIALSGKNKEGFIDGSLKKPEGRKSI---AWKCNNDIITSWILNSVSKEIAASINYTGSAKA
        ID   +P++LH S    +VLV+  L    NY +W +AM+I+L+ KNK GFIDGS+ KP   +++   AW  NN I+ SWILN++S +I AS+ Y+ SA  
Subjt:  IDVQLNPFHLHHSYSSSVVLVTQPLTGASNYGSWRKAMLIALSGKNKEGFIDGSLKKPEGRKSI---AWKCNNDIITSWILNSVSKEIAASINYTGSAKA

Query:  VWDELHGRFKQSNGPHVYQLRKELVTSSQGSLSIEAYYTKLKTIWQELCDYRPTCD---CTCGGIKPFTEHMESEFVMIFLMGLSDSYAAVRAQILLMNP
        +W++L  RF Q+NGP ++QLR+EL   +Q   S+  Y+TKLK IW EL ++RPTC    C+CGG+    +H   E VM FLMGL+DS A+ R QILLM+P
Subjt:  VWDELHGRFKQSNGPHVYQLRKELVTSSQGSLSIEAYYTKLKTIWQELCDYRPTCD---CTCGGIKPFTEHMESEFVMIFLMGLSDSYAAVRAQILLMNP

Query:  IPDITKVFALIVQEEHQRTAGNIPTSSTQENITLLVAE------ASKKQSNDRFRRTNQRP---ICSHCGVKGHVVDKCYKIHGYPLGYKFRNSNLSGSD
        +P I KVFAL+ QEE  R+     +S  Q ++              + Q+N  +  T+QR     C+HC   GH V+KCY++HG+P GY+ R      S+
Subjt:  IPDITKVFALIVQEEHQRTAGNIPTSSTQENITLLVAE------ASKKQSNDRFRRTNQRP---ICSHCGVKGHVVDKCYKIHGYPLGYKFRNSNLSGSD

Query:  TTGPKPAETNAVSQ-----------------PQS-GFFSSLNSSQYTQLMELLNTHIQSAKTEP-------ITAASSLTRTAGIC--STLHS-SRMPKSD
         +       N VS                  P S  F  ++ +SQ  QL+  +++H+ +   +P       I   S ++R  GIC  + LH+ S MP   
Subjt:  TTGPKPAETNAVSQ-----------------PQS-GFFSSLNSSQYTQLMELLNTHIQSAKTEP-------ITAASSLTRTAGIC--STLHS-SRMPKSD

Query:  IWILDSGASRHICHRRDSFQNWRQVYGVSVILPTGFL---------------------------------SCLLKNNNLSIDFSGSSCRIQDKSLLMTIG
         WILDSGASRHICH +  F N + V    V+LP   +                                 S LL  ++  + F   S  IQD+ L+  IG
Subjt:  IWILDSGASRHICHRRDSFQNWRQVYGVSVILPTGFL---------------------------------SCLLKNNNLSIDFSGSSCRIQDKSLLMTIG

Query:  KAESDNGLYILLQPDKLPFTVAESAC-FVSISTWHSRLGHLSAPRLSLLKSTLQFQ-GSHTECSPCHICPLAKQRRLSFSSNNHVASTMFELVHCDIWGP
        K     GLY+L      P  +  + C  +S + WH RLGH+  P+L+ L           +E S C++CPLAKQ+RL FS+++ V++ MF+L+HCDIWGP
Subjt:  KAESDNGLYILLQPDKLPFTVAESAC-FVSISTWHSRLGHLSAPRLSLLKSTLQFQ-GSHTECSPCHICPLAKQRRLSFSSNNHVASTMFELVHCDIWGP

Query:  FKAPTYAGYKYFLTIVDDHSRYTWTYLMKTKSDALHIIPRFFKLINTQFSRVTKVFRSDNAPELQFKEFFASIGTIHQFSCVERPQQNSVAERKHQHLLN
        FK P+Y+G+ YF+T+VDD+SR+TW +L+KTKS+ + ++PRF K++  QF +  KVFRSDNA ELQFK  F  +G IHQFSCV  PQQN++ ERKHQH+LN
Subjt:  FKAPTYAGYKYFLTIVDDHSRYTWTYLMKTKSDALHIIPRFFKLINTQFSRVTKVFRSDNAPELQFKEFFASIGTIHQFSCVERPQQNSVAERKHQHLLN

Query:  VARALFFQARIPIRLWGDCILTASYLINRTPAPLLHNKTPFELLY-GHEVDYSNLRIFGCLCYASTRGASKNQKTDKPNRTKPIQWASLCVFIGYPPGVK
        VAR+LFFQ+ IPI  W +CILTA +LINR PA  L++ +P+ELLY     DY +L+ FGCL +A+     K++   +         A+ CVF+GYP G+K
Subjt:  VARALFFQARIPIRLWGDCILTASYLINRTPAPLLHNKTPFELLY-GHEVDYSNLRIFGCLCYASTRGASKNQKTDKPNRTKPIQWASLCVFIGYPPGVK

Query:  GYRLYDIVKKQVIISRDVIFFEDTFPFHNQGTISDDDVCNLFSDHVLPCPIIDPLASSDAMNPTAGAHVDQAPSLTPDVPTSIDHNLESSVVPTMEPDSS
        GY+L D+V  +V ISRDVIF E+ +PF N+ + S+       S    P P                                       SV+P+   DS 
Subjt:  GYRLYDIVKKQVIISRDVIFFEDTFPFHNQGTISDDDVCNLFSDHVLPCPIIDPLASSDAMNPTAGAHVDQAPSLTPDVPTSIDHNLESSVVPTMEPDSS

Query:  FEPDAVVEDSL------TNVPLVETVELDVLVVSGHKPIANTSGAPVRRSSRAHICPKFLQDYHCGSLNHTADRPVYPIENYLSYDNLSPSHRQFILNVS
         EP   V  S+      +N+P+V+ +                   P+R+SSR    P +L D+ C ++  T   P+YPI  +     LSPS++ FILN+S
Subjt:  FEPDAVVEDSL------TNVPLVETVELDVLVVSGHKPIANTSGAPVRRSSRAHICPKFLQDYHCGSLNHTADRPVYPIENYLSYDNLSPSHRQFILNVS

Query:  AIHEPSFF
           EP+ +
Subjt:  AIHEPSFF

TrEMBL top hitse value%identityAlignment
A0A2N9GZW3 Integrase catalytic domain-containing protein4.2e-18738.02Show/hide
Query:  DVQLNPFHLHHSYSSSVVLVTQPLTGASNYGSWRKAMLIALSGKNKEGFIDGSLKKPEGRKS---IAWKCNNDIITSWILNSVSKEIAASINYTGSAKAV
        DV  + ++LHH  S   +LV+Q L G  NY +W ++M++AL+ KNK GF++G +++P+   S    AW   N ++ SW+LNS+SKEIA+S+ Y  +AK +
Subjt:  DVQLNPFHLHHSYSSSVVLVTQPLTGASNYGSWRKAMLIALSGKNKEGFIDGSLKKPEGRKS---IAWKCNNDIITSWILNSVSKEIAASINYTGSAKAV

Query:  WDELHGRFKQSNGPHVYQLRKELVTSSQGSLSIEAYYTKLKTIWQELCDYRPTCDCTCGGIKPFTEHMESEFVMIFLMGLSDSYAAVRAQILLMNPIPDI
        W++L  RF Q NGP +++++K +   SQ + S+ +YYT+LK++W EL ++RP  DC+CG +K   ++ + E+VM FLMGL+DS++ VRAQIL+ +P+P I
Subjt:  WDELHGRFKQSNGPHVYQLRKELVTSSQGSLSIEAYYTKLKTIWQELCDYRPTCDCTCGGIKPFTEHMESEFVMIFLMGLSDSYAAVRAQILLMNPIPDI

Query:  TKVFALIVQEEHQRTAGNIPT-SSTQENITLLV-AEASKKQSNDRFRRTNQRPICSHCGVKGHVVDKCYKIHGYPLGYKFR----NSNLSGSDTTGPKPA
        TK FAL++QEE QR   NIP+ +   +++ L    EA++            RPICSHCG+ GH VDKCYK+HGYP GYKF+    +++ S +    P   
Subjt:  TKVFALIVQEEHQRTAGNIPT-SSTQENITLLV-AEASKKQSNDRFRRTNQRPICSHCGVKGHVVDKCYKIHGYPLGYKFR----NSNLSGSDTTGPKPA

Query:  ETNAVSQPQSGFFS------SLNSSQYTQLMELLNTHIQSAKTEPITAASSLTR-TAGICSTLHSSRMPKSDI---------------WILDSGASRHIC
         T A  Q      S      SL SSQ+    ++++       + P  AAS+++   +GI S  H+  +PK  I               WILD+GA+ H+ 
Subjt:  ETNAVSQPQSGFFS------SLNSSQYTQLMELLNTHIQSAKTEPITAASSLTR-TAGICSTLHSSRMPKSDI---------------WILDSGASRHIC

Query:  HRRDSFQNWRQVYGVSVILPTG---------------------------------FLSCLLKNNNLSIDFSGSSCRIQDKSLLMTIGKAESDNGLYIL--
        H    F +        + LP G                                  +S L    +  + F    C IQD      IG     NGLY L  
Subjt:  HRRDSFQNWRQVYGVSVILPTG---------------------------------FLSCLLKNNNLSIDFSGSSCRIQDKSLLMTIGKAESDNGLYIL--

Query:  ----LQPDKLPFTVAESACFVS--ISTWHSRLGHLSAPRLSLLKSTLQFQGSHTECSPCHICPLAKQRRLSFSSNNHVASTMFELVHCDIWGPFKAPTYA
            +     P   A +A   +     WH RLGH S  RLSLLK+ +      +    C +C ++KQ+RL F +  H A   F+L+HCDIWGP+  PT  
Subjt:  ----LQPDKLPFTVAESACFVS--ISTWHSRLGHLSAPRLSLLKSTLQFQGSHTECSPCHICPLAKQRRLSFSSNNHVASTMFELVHCDIWGPFKAPTYA

Query:  GYKYFLTIVDDHSRYTWTYLMKTKSDALHIIPRFFKLINTQFSRVTKVFRSDNAPELQFKEFFASIGTIHQFSCVERPQQNSVAERKHQHLLNVARALFF
          +YFLTIVDD +R TW +LMK KS+   +I  FF LI TQFS   K+ RSDN PE +   F+A  GT+HQ SCV  PQQN+  ERKHQHLL VARAL F
Subjt:  GYKYFLTIVDDHSRYTWTYLMKTKSDALHIIPRFFKLINTQFSRVTKVFRSDNAPELQFKEFFASIGTIHQFSCVERPQQNSVAERKHQHLLNVARALFF

Query:  QARIPIRLWGDCILTASYLINRTPAPLLHNKTPFELLYGHEVDYSNLRIFGCLCYASTRGASKNQKTDKPNRTKPIQWASLCVFIGYPPGVKGYRLYDIV
        QA +P+  WG C+LTA++LINR P PLL NK+PFELL+    +YS LR+FGCLCYA+T            NR K    +  CV +GYP G+KGYRL D+ 
Subjt:  QARIPIRLWGDCILTASYLINRTPAPLLHNKTPFELLYGHEVDYSNLRIFGCLCYASTRGASKNQKTDKPNRTKPIQWASLCVFIGYPPGVKGYRLYDIV

Query:  KKQVIISRDVIFFEDTFPFHNQGTISDDDVCNLFSDHVLPCPIID-PLASSDA---MNPTAGAHVDQAPSLTPDVPTSIDHNLESSVVPTMEPDSSFEPD
         KQV +SRDV+F+E++FPFH     +        +  VLP PI D P++ S      N +  + +  +P  +P  P+   H+  SS +P     +  +P 
Subjt:  KKQVIISRDVIFFEDTFPFHNQGTISDDDVCNLFSDHVLPCPIID-PLASSDA---MNPTAGAHVDQAPSLTPDVPTSIDHNLESSVVPTMEPDSSFEPD

Query:  AVVEDSLTNVPLVETVELDVLVVSGHKPIANTSGAPVRRSSRAHICPKFLQDYHCGSLN----HTADRP---------VYPIENYLSYDNLSPSHRQFIL
         +V D                      P  N     +R+S+R H  P +LQ +HC + +    H+   P         V+P+ NY+SY  L+P +  F+L
Subjt:  AVVEDSLTNVPLVETVELDVLVVSGHKPIANTSGAPVRRSSRAHICPKFLQDYHCGSLN----HTADRP---------VYPIENYLSYDNLSPSHRQFIL

Query:  NVSAIHEPSFFHQAVKLEPWRRAMDAEIAAM
        + SAI EP+ FH+A K   W +AM  E+AA+
Subjt:  NVSAIHEPSFFHQAVKLEPWRRAMDAEIAAM

A0A2Z7AT15 Cysteine-rich RLK (Receptor-like protein kinase) 83.5e-19439.74Show/hide
Query:  NPFHLHHSYSSSVVLVTQPLTGASNYGSWRKAMLIALSGKNKEGFIDGSLKKPEGRKSI--AWKCNNDIITSWILNSVSKEIAASINYTGSAKAVWDELH
        +P++LH+     + LV+ PL G SNY +WR+AM++AL+ KNK GFID S+ +P     +  +W   N ++ SWILNSV++ IA S+ Y  +A+ +W +L+
Subjt:  NPFHLHHSYSSSVVLVTQPLTGASNYGSWRKAMLIALSGKNKEGFIDGSLKKPEGRKSI--AWKCNNDIITSWILNSVSKEIAASINYTGSAKAVWDELH

Query:  GRFKQSNGPHVYQLRKELVTSSQGSLSIEAYYTKLKTIWQELCDYRPTCDCTCGGIKPFTEHMESEFVMIFLMGLSDSYAAVRAQILLMNPIPDITKVFA
         RF +SN P +YQ++K L    QGS+ + +YYTKL+T+W EL DY+PT  CTCG ++ +  +   E VM FLMGL+DSYA VRAQ+L++ P+P I KVFA
Subjt:  GRFKQSNGPHVYQLRKELVTSSQGSLSIEAYYTKLKTIWQELCDYRPTCDCTCGGIKPFTEHMESEFVMIFLMGLSDSYAAVRAQILLMNPIPDITKVFA

Query:  LIVQEEHQR------TAGNIPTSSTQENITLLVAEA-SKKQSNDRFRRTNQRPICSHCGVKGHVVDKCYKIHGYPLGY-KFRNSNLSGSDTT--GPKPAE
        L++QEE QR      +   +  S    N+      A S + S +       R ICSHC  + H VDKCYK+HGYP G+ KF++    GS         +E
Subjt:  LIVQEEHQR------TAGNIPTSSTQENITLLVAEA-SKKQSNDRFRRTNQRPICSHCGVKGHVVDKCYKIHGYPLGY-KFRNSNLSGSDTT--GPKPAE

Query:  TNAVSQPQSGFFSSLNSSQYTQLMELLNTHIQSAKT-----EPITAASSLTRTAGICS-TLHSSRMPKSDIWILDSGASRHICHRRDSFQNWRQVYGVSV
        T+  +Q Q     SL  SQ  QL+E L++ +Q+ +      +P T  S LT   GICS T H   + + D WI+D+GA+ HIC     F++ R +    V
Subjt:  TNAVSQPQSGFFSSLNSSQYTQLMELLNTHIQSAKT-----EPITAASSLTRTAGICS-TLHSSRMPKSDIWILDSGASRHICHRRDSFQNWRQVYGVSV

Query:  ILPTGF---------------------------------LSCLLKNNNLSIDFSGSSCRIQDKSLLMTIGKAESDNGLYILLQPDK-LPFTVAESACFVS
        +LP                                    +S L  N+N S+ F   SC+IQD S +  IG  +    LY+L QPD+ LP  +  +  FVS
Subjt:  ILPTGF---------------------------------LSCLLKNNNLSIDFSGSSCRIQDKSLLMTIGKAESDNGLYILLQPDK-LPFTVAESACFVS

Query:  IS-TWHSRLGHLSAPRLSLLKSTLQFQGSHTECSPCHICPLAKQRRLSFSSNNHVASTMFELVHCDIWGPFKAPTYAGYKYFLTIVDDHSRYTWTYLMKT
         S  WH R+GH S  +LS LK+ L  + +    + CH C L+KQRRL  +S N++++ +FEL+H D WGPF   +  G+++F TIVDDHSRYTW Y++K+
Subjt:  IS-TWHSRLGHLSAPRLSLLKSTLQFQGSHTECSPCHICPLAKQRRLSFSSNNHVASTMFELVHCDIWGPFKAPTYAGYKYFLTIVDDHSRYTWTYLMKT

Query:  KSDALHIIPRFFKLINTQFSRVTKVFRSDNAPELQFKEFFASIGTIHQFSCVERPQQNSVAERKHQHLLNVARALFFQARIPIRLWGDCILTASYLINRT
        KSD L I P F ++++TQF    K  RSDNAPEL F +FFA  G  H  SCVERPQQNSV ERKHQH+LNVARAL FQ+ IP+  W DCI T+ YLINRT
Subjt:  KSDALHIIPRFFKLINTQFSRVTKVFRSDNAPELQFKEFFASIGTIHQFSCVERPQQNSVAERKHQHLLNVARALFFQARIPIRLWGDCILTASYLINRT

Query:  PAPLLHNKTPFELLYGHEVDYSNLRIFGCLCYASTRGASKNQKTDKPNRTKPIQWASLCVFIGYPPGVKGYRLYDIVKKQVIISRDVIFFEDTFPFHNQG
        P+P+L +KTPFELL+G    YS+L++FGCLCYAST  +S+++ + +  R         CVFIGYPPG KGY+L ++   ++ ISRDVIF E+TFP+ N  
Subjt:  PAPLLHNKTPFELLYGHEVDYSNLRIFGCLCYASTRGASKNQKTDKPNRTKPIQWASLCVFIGYPPGVKGYRLYDIVKKQVIISRDVIFFEDTFPFHNQG

Query:  TISDDDVCNLFSDHVLPCPIIDPLASSDAMNPTAGAHVDQAPSLTPDVPTSIDHNLESSVVPTMEPDSSFEPDAVVEDSLTNVPLVETVELDVLVVSGHK
         +S  D+                              V  +  +TP +P     +                                             
Subjt:  TISDDDVCNLFSDHVLPCPIIDPLASSDAMNPTAGAHVDQAPSLTPDVPTSIDHNLESSVVPTMEPDSSFEPDAVVEDSLTNVPLVETVELDVLVVSGHK

Query:  PIANTSGAPVRRSSRAHICPKFLQDYHCGSLN-HTADRPVYPIENYLSYDNLSPSHRQFILNVSAIHEPSFFHQAVKLEPWRRAMDAEIAAMNV
                   R+SR H  P  L+DYHC S++   +    +PI   ++Y  LS SHR F+ N+S+I EP+ F QAV L  WR+AMD E+ A+ +
Subjt:  PIANTSGAPVRRSSRAHICPKFLQDYHCGSLN-HTADRPVYPIENYLSYDNLSPSHRQFILNVSAIHEPSFFHQAVKLEPWRRAMDAEIAAMNV

A0A392M266 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (Fragment)4.9e-18839.13Show/hide
Query:  LNPFHLHHSYSSSVVLVTQPLTGASNYGSWRKAMLIALSGKNKEGFIDGSLKKPEGR--KSIAWKCNNDIITSWILNSVSKEIAASINYTGSAKAVWDEL
        LNPF +HHS    +VLV+  LT   NY SWR+AMLIAL  KNK GF+DGS+ +P     +   W  N++++ SW+LNSVSKE+ A+I Y+ +A A+W +L
Subjt:  LNPFHLHHSYSSSVVLVTQPLTGASNYGSWRKAMLIALSGKNKEGFIDGSLKKPEGR--KSIAWKCNNDIITSWILNSVSKEIAASINYTGSAKAVWDEL

Query:  HGRFKQSNGPHVYQLRKELVTSSQGSLSIEAYYTKLKTIWQELCDYRPTCDCTCGGIKPFTEHMESEFVMIFLMGLSDSYAAVRAQILLMNPIPDITKVF
          RF Q+NGP ++QLRK+ +T +QG+LS+ AYY+K+K +W+EL ++RP   C CGG+ P  EH+  E+V+ FL+                          
Subjt:  HGRFKQSNGPHVYQLRKELVTSSQGSLSIEAYYTKLKTIWQELCDYRPTCDCTCGGIKPFTEHMESEFVMIFLMGLSDSYAAVRAQILLMNPIPDITKVF

Query:  ALIVQEEHQRTAGNIPTSSTQENITLLVAEASKKQSNDRFRRTNQRPICSHCGVKGHVVDKCYKIHGYPLGYKFRNSNLSGSDTTGPKPAETNAVSQPQS
             EE QR   N  +S   E++   V +    +S         +P+C+HCG+ GHV DKCYK+HGYP GY+   S           P   NAV+    
Subjt:  ALIVQEEHQRTAGNIPTSSTQENITLLVAEASKKQSNDRFRRTNQRPICSHCGVKGHVVDKCYKIHGYPLGYKFRNSNLSGSDTTGPKPAETNAVSQPQS

Query:  GFFSSLNSSQYTQLMELLNTHIQSAKTEPITAASSLTRTAGICSTLHSSRMPKSDIWILDSGASRHICHRRDSFQNWRQVYGVSVILP-----------T
             LN+ QY QL+  +    Q A++     +++      + ++ H +  P +  WI+DSGA+ HI +  D F +++ +    V LP           T
Subjt:  GFFSSLNSSQYTQLMELLNTHIQSAKTEPITAASSLTRTAGICSTLHSSRMPKSDIWILDSGASRHICHRRDSFQNWRQVYGVSVILP-----------T

Query:  GFLSC----------------------LLKNNNLSIDFSGSSCRIQDKSLLMTIGKAESDNGLYIL-LQPDK------LPFTVA-------ESACFVSIS
         +L+                       LL+N++LS+  + +   IQD      IGK++  +GLY L   P+       LPF  +        S    S+S
Subjt:  GFLSC----------------------LLKNNNLSIDFSGSSCRIQDKSLLMTIGKAESDNGLYIL-LQPDK------LPFTVA-------ESACFVSIS

Query:  TWHSRLGHLSAPRLSLLKSTLQ--FQGSHTECSPCHICPLAKQRRLSFSSNNHVASTMFELVHCDIWGPFKAPTYAGYKYFLTIVDDHSRYTWTYLMKTK
         WH+RLGHLS P L  + + +   F  + +    C +CPLAK  RL F SNN+ A   F+L+HCD+WGPFK PTY GY YFLTIVDD +RYTWT+LMK K
Subjt:  TWHSRLGHLSAPRLSLLKSTLQ--FQGSHTECSPCHICPLAKQRRLSFSSNNHVASTMFELVHCDIWGPFKAPTYAGYKYFLTIVDDHSRYTWTYLMKTK

Query:  SDALHIIPRFFKLINTQFSRVTKVFRSDNAPELQFKEFFASIGTIHQFSCVERPQQNSVAERKHQHLLNVARALFFQARIPIRLWGDCILTASYLINRTP
        ++A  +I  FFK + TQF++  K   +DNA ELQ   F  + GT+HQFSC  RPQQNSV ERKHQHLLNVARAL FQ+++P++ WGDCI TA+YLINR P
Subjt:  SDALHIIPRFFKLINTQFSRVTKVFRSDNAPELQFKEFFASIGTIHQFSCVERPQQNSVAERKHQHLLNVARALFFQARIPIRLWGDCILTASYLINRTP

Query:  APLLHNKTPFELLYGHEVDYSNLRIFGCLCYASTRGASKNQKTDKPNRTKPIQWASLCVFIGYPPGVKGYRLYDIVKKQVIISRDVIFFEDTFPFHNQGT
        +  L N +P+ELL+G   DY++LR+FGC+C+AST  A +++ T +         A  CV +GYPPGVKGY+L  I  ++++I+RDV F E  FPFH+   
Subjt:  APLLHNKTPFELLYGHEVDYSNLRIFGCLCYASTRGASKNQKTDKPNRTKPIQWASLCVFIGYPPGVKGYRLYDIVKKQVIISRDVIFFEDTFPFHNQGT

Query:  ISDDDVCNLFSDHVLPCPIIDPLASSD----AMNPTAGAHVDQAPSLTPDVPTSIDHNLESSVVPTMEPDSSFEPDAVVEDSLTNVPLVETVELDVLVVS
           D + N+FSD VLP P+ +P +  D     +NP   +H        P     +  +  SS V    P                            +V+
Subjt:  ISDDDVCNLFSDHVLPCPIIDPLASSD----AMNPTAGAHVDQAPSLTPDVPTSIDHNLESSVVPTMEPDSSFEPDAVVEDSLTNVPLVETVELDVLVVS

Query:  GHKPIANTSGAPVRRSSRAHICPKFLQDYHCGSLNHTADRPVYPIENYLSYDNLSPSHRQFILNVSAIHEPSFFHQAVKLEPWRRAMDAEIAAM
         H  +        RRS+R    P +LQDY C           YPI+N+LSYD L PS++QFI  VS I+EPSF+HQAV    WR+AM  E+AA+
Subjt:  GHKPIANTSGAPVRRSSRAHICPKFLQDYHCGSLNHTADRPVYPIENYLSYDNLSPSHRQFILNVSAIHEPSFFHQAVKLEPWRRAMDAEIAAM

A0A438CDX5 Retrovirus-related Pol polyprotein from transposon TNT 1-942.4e-19039.63Show/hide
Query:  VIDVQLNPFHLHHSYSSSVVLVTQPLTGASNYGSWRKAMLIALSGKNKEGFIDGSLKKPEGRKSI--AWKCNNDIITSWILNSVSKEIAASINYTGSAKA
        V++   +P+ LH+     + LV+  LTGA NY +W +AML+AL+ KNK GF+DG++ +P     I  AW   N +I+SWI+N+VS+EIA S+ Y  SA  
Subjt:  VIDVQLNPFHLHHSYSSSVVLVTQPLTGASNYGSWRKAMLIALSGKNKEGFIDGSLKKPEGRKSI--AWKCNNDIITSWILNSVSKEIAASINYTGSAKA

Query:  VWDELHGRFKQSNGPHVYQLRKELVTSSQGSLSIEAYYTKLKTIWQELCDYRPTCDCTCGGIKPFTEHMESEFVMIFLMGLSDSYAAVRAQILLMNPIPD
        +W +L+ RF Q NGP ++Q++K+L   +QGSL + +Y+TKLK +W EL +++P   C CGG++ +T++   E+V+ FLMGL+D YA +R QIL+M+P+P 
Subjt:  VWDELHGRFKQSNGPHVYQLRKELVTSSQGSLSIEAYYTKLKTIWQELCDYRPTCDCTCGGIKPFTEHMESEFVMIFLMGLSDSYAAVRAQILLMNPIPD

Query:  ITKVFALIVQEEHQRTAG-NIPTSSTQENITLLVAEASKKQSNDRFRRTNQRPICSHCGVKGHVVDKCYKIHGYPLGYKFRNSNLSGSDTTGPKPAETNA
        I KVF+L++QEE  RT G +   S   + +T      +   S+   +    R  CS+CG +GH+ DKCYK+ GYP G+KF+N   + S          + 
Subjt:  ITKVFALIVQEEHQRTAG-NIPTSSTQENITLLVAEASKKQSNDRFRRTNQRPICSHCGVKGHVVDKCYKIHGYPLGYKFRNSNLSGSDTTGPKPAETNA

Query:  VSQPQSGFFSSLNSSQYTQLMELLNTHI---QSAKTEPITAASSLTRTAGICSTLHSSRMPKSDIWILDSGASRHICHRRDSFQNWRQVYGVSVILPTGF
         +       SSL + Q  QL++LL   +    SA TE  +   S++  AG    + +        WI++SGA+ H+C+    F +   V  V V LPTG 
Subjt:  VSQPQSGFFSSLNSSQYTQLMELLNTHI---QSAKTEPITAASSLTRTAGICSTLHSSRMPKSDIWILDSGASRHICHRRDSFQNWRQVYGVSVILPTGF

Query:  LSCLLKNNNLSIDFSGSSCRIQDKSLLMTIGKAESDNGLYILLQPDKLPFTVAESACFVS----ISTWHSRLGHLSAPRLSLLKSTLQFQGSHTECSPCH
                 + ID  GS    +D  LL  +        L       K+    +  A  +     +S WHSRLGH S  RL  L+S L F  S  + +PC+
Subjt:  LSCLLKNNNLSIDFSGSSCRIQDKSLLMTIGKAESDNGLYILLQPDKLPFTVAESACFVS----ISTWHSRLGHLSAPRLSLLKSTLQFQGSHTECSPCH

Query:  ICPLAKQRRLSFSSNNHVASTMFELVHCDIWGPFKAPTYAGYKYFLTIVDDHSRYTWTYLMKTKSDALHIIPRFFKLINTQFSRVTKVFRSDNAPELQFK
        +CPLAKQR L + S N   S+ F+L+H DIWGPF   +  GYK+FLTIVDDHSR TW Y++K KS+    IP FF  +  QF +  K  RSDNAPEL   
Subjt:  ICPLAKQRRLSFSSNNHVASTMFELVHCDIWGPFKAPTYAGYKYFLTIVDDHSRYTWTYLMKTKSDALHIIPRFFKLINTQFSRVTKVFRSDNAPELQFK

Query:  EFFASIGTIHQFSCVERPQQNSVAERKHQHLLNVARALFFQARIPIRLWGDCILTASYLINRTPAPLLHNKTPFELLYGHEVDYSNLRIFGCLCYASTRG
         F+ S+G IH  SCVE PQQNSV ERKHQH+LNVARAL FQ+ +PI  W DCILTA YLINRTP+P L+NKTPFE+L+    DYS+LR+FGCLCY ST  
Subjt:  EFFASIGTIHQFSCVERPQQNSVAERKHQHLLNVARALFFQARIPIRLWGDCILTASYLINRTPAPLLHNKTPFELLYGHEVDYSNLRIFGCLCYASTRG

Query:  ASKNQKTDKPNRTKPIQWASLCVFIGYPPGVKGYRLYDIVKKQVIISRDVIFFEDTFPFHNQGTISDDDV-CNLFSDHVLPCPIIDPLASSDAMNPTAGA
                K NRTK    A   VF+GYP G KGY+L DI  + + ISR+VIF E+ FPF      S  D+  +LF D VLPC                  
Subjt:  ASKNQKTDKPNRTKPIQWASLCVFIGYPPGVKGYRLYDIVKKQVIISRDVIFFEDTFPFHNQGTISDDDV-CNLFSDHVLPCPIIDPLASSDAMNPTAGA

Query:  HVDQAPSLTPDVPTSIDHNLESSVVPTMEPDSSFEPDAVVEDSLTNVPLVETVELDVLVVSGHKPIANTSGAPVRRSSRAHICPKFLQDYHCGSLNHTA-
                      + D++  SSV+P +               ++  PL                      AP  R +R      +L+DYHC  +N  A 
Subjt:  HVDQAPSLTPDVPTSIDHNLESSVVPTMEPDSSFEPDAVVEDSLTNVPLVETVELDVLVVSGHKPIANTSGAPVRRSSRAHICPKFLQDYHCGSLNHTA-

Query:  ---DRPVYPIENYLSYDNLSPSHRQFILNVSAIHEPSFFHQAVKLEPWRRAMDAEIAAMNVLRPGVLFLFRLGSTQWGANGFIK
               +PI+++LSYD LS S++ F L+VS I EP  F +A ++  WR AMD E+ A+   +   +    +GST    NG  K
Subjt:  ---DRPVYPIENYLSYDNLSPSHRQFILNVSAIHEPSFFHQAVKLEPWRRAMDAEIAAMNVLRPGVLFLFRLGSTQWGANGFIK

A0A5A7VE66 Cysteine-rich RLK (Receptor-like protein kinase) 83.3e-20856.72Show/hide
Query:  DVQLNPFHLHHSYSSSVVLVTQPLTGASNYGSWRKAMLIALSGKNKEGFIDGSLKKP-EGRKSIAWKCNNDIITSWILNSVSKEIAASINYTGSAKAVWD
        D QLNP+ +HHS   +  +VTQPLTGA NY SW +AML+A+SG+NK GFI G ++KP +G    AW CNNDI+ SWILNSVSKEIAASI Y GS K +WD
Subjt:  DVQLNPFHLHHSYSSSVVLVTQPLTGASNYGSWRKAMLIALSGKNKEGFIDGSLKKP-EGRKSIAWKCNNDIITSWILNSVSKEIAASINYTGSAKAVWD

Query:  ELHGRFKQSNGPHVYQLRKELVTSSQGSLSIEAYYTKLKTIWQELCDYRPTCDCTCGGIKPFTEHMESEFVMIFLMGLSDSYAAVRAQILLMNPIPDITK
        EL  RFKQSNGP +YQLRKE VT  QG+L+IE YYTKLKTIWQ L +YR T DCTCGG+KPF +H+ESE++M FLMGL+DSYAAVRAQILLM P+P I  
Subjt:  ELHGRFKQSNGPHVYQLRKELVTSSQGSLSIEAYYTKLKTIWQELCDYRPTCDCTCGGIKPFTEHMESEFVMIFLMGLSDSYAAVRAQILLMNPIPDITK

Query:  VFALIVQEEHQRTAGNIPTSSTQENITLLVAEASKKQSNDRFRRTNQRPICSHCGVKGHVVDKCYKIHGYPLGYKFRNSNLSGSDTTGPKPAETNAVSQP
        VF+L++QEE QR+AG +  +   + + L +A +S   S DR  R  +RP CS+CG+KGH+ DKCYK HGYP GYK RNSN   S TT P  ++TN V+  
Subjt:  VFALIVQEEHQRTAGNIPTSSTQENITLLVAEASKKQSNDRFRRTNQRPICSHCGVKGHVVDKCYKIHGYPLGYKFRNSNLSGSDTTGPKPAETNAVSQP

Query:  QSG-------FFSSLNSSQYTQLMELLNTHIQSAKTEPITAASSLTRTAGICSTLHSSRMPKSDIWILDSGASRHICHRRDSFQNWRQVYGVSVILPTGF
         S        FFSSLNS QY+QLM LLN H+Q+A T PIT A+++T T+GI + L S      D WI+DSGASRHICH +  F+NW     + V+LP G 
Subjt:  QSG-------FFSSLNSSQYTQLMELLNTHIQSAKTEPITAASSLTRTAGICSTLHSSRMPKSDIWILDSGASRHICHRRDSFQNWRQVYGVSVILPTGF

Query:  ---------------------------------LSCLLKNNNLSIDFSGSSCRIQDKSLLMTIGKAESDNGLYIL-LQPDKLPFTVAESACFVSISTWHS
                                         +SCLL   N+S+DF  + C IQD S  M IGKA   NGLY+L  + +            +S+ TWH 
Subjt:  ---------------------------------LSCLLKNNNLSIDFSGSSCRIQDKSLLMTIGKAESDNGLYIL-LQPDKLPFTVAESACFVSISTWHS

Query:  RLGHLSAPRLSLLKSTLQFQGSHTECSPCHICPLAKQRRLSFSSNNHVASTMFELVHCDIWGPFKAPTYAGYKYFLTIVDDHSRYTWTYLMKTKSDALHI
        RLGHLS   LS L STL         S CH+CPLAKQ+RLSF SNN+VAS+ F+LVH DIWGPFK P+Y GYKYFLT+VDD  R+TW Y+++ KSD LHI
Subjt:  RLGHLSAPRLSLLKSTLQFQGSHTECSPCHICPLAKQRRLSFSSNNHVASTMFELVHCDIWGPFKAPTYAGYKYFLTIVDDHSRYTWTYLMKTKSDALHI

Query:  IPRFFKLINTQFSRVTKVFRSDNAPELQFKEFFASIGTIHQFSCVERPQQNSVAERKHQHLLNVARALFF
        +P+FF+LI TQFS+V K FRSDNAPEL+  EFFA  GT+HQFSCVE+PQQNSV ERKHQHLLNVARALFF
Subjt:  IPRFFKLINTQFSRVTKVFRSDNAPELQFKEFFASIGTIHQFSCVERPQQNSVAERKHQHLLNVARALFF

SwissProt top hitse value%identityAlignment
P04146 Copia protein3.3e-3227.78Show/hide
Query:  LKNNNLSIDFSGSSCRIQDKSLLMTIGKAESDNGLYILLQPDKLPFTVAESACFVSISTWHSRLGHLSAPRLSLLKSTLQFQGS------HTECSPCHIC
        L+   +SI+F  S   I    L++       +N   I  Q   +      +    +   WH R GH+S  +L  +K    F            C  C  C
Subjt:  LKNNNLSIDFSGSSCRIQDKSLLMTIGKAESDNGLYILLQPDKLPFTVAESACFVSISTWHSRLGHLSAPRLSLLKSTLQFQGS------HTECSPCHIC

Query:  PLAKQRRLSF---SSNNHVASTMFELVHCDIWGPFKAPTYAGYKYFLTIVDDHSRYTWTYLMKTKSDALHIIPRFFKLINTQFSRVTKVFRSDNAPEL--
           KQ RL F       H+   +F +VH D+ GP    T     YF+  VD  + Y  TYL+K KSD   +   F       F+        DN  E   
Subjt:  PLAKQRRLSF---SSNNHVASTMFELVHCDIWGPFKAPTYAGYKYFLTIVDDHSRYTWTYLMKTKSDALHIIPRFFKLINTQFSRVTKVFRSDNAPEL--

Query:  -QFKEFFASIGTIHQFSCVERPQQNSVAERKHQHLLNVARALFFQARIPIRLWGDCILTASYLINRTPAPLL--HNKTPFELLYGHEVDYSNLRIFGCLC
         + ++F    G  +  +    PQ N V+ER  + +   AR +   A++    WG+ +LTA+YLINR P+  L   +KTP+E+ +  +    +LR+FG   
Subjt:  -QFKEFFASIGTIHQFSCVERPQQNSVAERKHQHLLNVARALFFQARIPIRLWGDCILTASYLINRTPAPLL--HNKTPFELLYGHEVDYSNLRIFGCLC

Query:  YASTRGASKNQKTDKPNRTKPIQWASLCVFIGYPPGVKGYRLYDIVKKQVIISRDVIFFE
        Y   +  +K  K D  +           +F+GY P   G++L+D V ++ I++RDV+  E
Subjt:  YASTRGASKNQKTDKPNRTKPIQWASLCVFIGYPPGVKGYRLYDIVKKQVIISRDVIFFE

P0C2J7 Transposon Ty4-H Gag-Pol polyprotein1.2e-1025.23Show/hide
Query:  VSISTWHSRLGHLSAPR-------------LSLLKSTLQFQGSHTECSPCHICPLAKQRRLSFSSNNHVASTMFELVHC-DIWGPFKAPTYAGYKYFLTI
        +++   H R+GH    +             L L+K   +F      C  C I    K+   + S NNH          C DI+GP  +      +Y L +
Subjt:  VSISTWHSRLGHLSAPR-------------LSLLKSTLQFQGSHTECSPCHICPLAKQRRLSFSSNNHVASTMFELVHC-DIWGPFKAPTYAGYKYFLTI

Query:  VDDHSRY--TWTYLMKTKSDALHIIPRFFKLINTQFSRVTKVFRSDNAPEL---QFKEFFASIGTIHQFSCVERPQQNSVAERKHQHLLNVARALFFQAR
        VD+++RY  T T+  K     L  I +  + + TQF R  +   SD   E    Q +E+F S G  H  +  +    N  AER  + ++  A  L  Q+ 
Subjt:  VDDHSRY--TWTYLMKTKSDALHIIPRFFKLINTQFSRVTKVFRSDNAPEL---QFKEFFASIGTIHQFSCVERPQQNSVAERKHQHLLNVARALFFQAR

Query:  IPIRLWGDCILTASYLIN
        + ++ W   + +A+ + N
Subjt:  IPIRLWGDCILTASYLIN

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-949.0e-5424.93Show/hide
Query:  SSVVLVTQPLTGASNYGSWRKAMLIALSGKNKEGFIDGSLKKPEGRKSIAWKCNNDIITSWILNSVSKEIAASINYTGSAKAVWDELHGRFKQSNGPHVY
        S V        G + + +W++ M   L  +     +D   KKP+  K+  W   ++   S I   +S ++  +I    +A+ +W  L   +      +  
Subjt:  SSVVLVTQPLTGASNYGSWRKAMLIALSGKNKEGFIDGSLKKPEGRKSIAWKCNNDIITSWILNSVSKEIAASINYTGSAKAVWDELHGRFKQSNGPHVY

Query:  QLRKEL--VTSSQGSLSIEAYYTKLKTIWQELCDYRPTCDCTCGGIKPFTEHMESEFVMIFLMGLSDSYAAVRAQILLMNPIPDITKVFALIVQEEHQRT
         L+K+L  +  S+G+ +  ++      +  +L +          G+K      E +  ++ L  L  SY  +   IL      ++  V + ++  E  R 
Subjt:  QLRKEL--VTSSQGSLSIEAYYTKLKTIWQELCDYRPTCDCTCGGIKPFTEHMESEFVMIFLMGLSDSYAAVRAQILLMNPIPDITKVFALIVQEEHQRT

Query:  AGNIPTSSTQENITLLVAEASKKQSND----------RFRRTNQRPICSHCGVKGHVVDKCYKIHGYPLGYKFRNSNLSGSDTTGP-------------K
            P +  Q  IT     + ++ SN+          + R  ++   C +C   GH    C      P   K   S     D T               +
Subjt:  AGNIPTSSTQENITLLVAEASKKQSND----------RFRRTNQRPICSHCGVKGHVVDKCYKIHGYPLGYKFRNSNLSGSDTTGP-------------K

Query:  PAETNAVSQPQSGFFSSLNSSQY-TQLMELLNTHIQSAKTEPITAASSLTRTAGICSTLHSSRMPKSDIWILDSGASRHICHRRDSFQNWRQVYGVSVIL
          E   +S P+S +     +S + T + +L   ++           +S ++ AGI            DI I  +     +       ++ R V  + + L
Subjt:  PAETNAVSQPQSGFFSSLNSSQY-TQLMELLNTHIQSAKTEPITAASSLTRTAGICSTLHSSRMPKSDIWILDSGASRHICHRRDSFQNWRQVYGVSVIL

Query:  PTGFLSCLLKNNNLSIDFSGSSCRIQDKSLLMTIGKAESDNGLYILLQPDKLPFTVAESACFVSISTWHSRLGHLSAPRLSLL-KSTLQFQGSHTECSPC
         +G     L  +     F+    R+   SL++  G A    G       +     +  +   +S+  WH R+GH+S   L +L K +L      T   PC
Subjt:  PTGFLSCLLKNNNLSIDFSGSSCRIQDKSLLMTIGKAESDNGLYILLQPDKLPFTVAESACFVSISTWHSRLGHLSAPRLSLL-KSTLQFQGSHTECSPC

Query:  HICPLAKQRRLSFSSNNHVASTMFELVHCDIWGPFKAPTYAGYKYFLTIVDDHSRYTWTYLMKTKSDALHIIPRFFKLINTQFSRVTKVFRSDNAPEL--
          C   KQ R+SF +++     + +LV+ D+ GP +  +  G KYF+T +DD SR  W Y++KTK     +  +F  L+  +  R  K  RSDN  E   
Subjt:  HICPLAKQRRLSFSSNNHVASTMFELVHCDIWGPFKAPTYAGYKYFLTIVDDHSRYTWTYLMKTKSDALHIIPRFFKLINTQFSRVTKVFRSDNAPEL--

Query:  -QFKEFFASIGTIHQFSCVERPQQNSVAERKHQHLLNVARALFFQARIPIRLWGDCILTASYLINRTPAPLLHNKTPFELLYGHEVDYSNLRIFGCLCYA
         +F+E+ +S G  H+ +    PQ N VAER ++ ++   R++   A++P   WG+ + TA YLINR+P+  L  + P  +    EV YS+L++FGC  +A
Subjt:  -QFKEFFASIGTIHQFSCVERPQQNSVAERKHQHLLNVARALFFQARIPIRLWGDCILTASYLINRTPAPLLHNKTPFELLYGHEVDYSNLRIFGCLCYA

Query:  STRGASKNQKTDKPNRTKPIQWASLCVFIGYPPGVKGYRLYDIVKKQVIISRDVIFFE
              K Q+T   +++ P      C+FIGY     GYRL+D VKK+VI SRDV+F E
Subjt:  STRGASKNQKTDKPNRTKPIQWASLCVFIGYPPGVKGYRLYDIVKKQVIISRDVIFFE

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.5e-5623.9Show/hide
Query:  ASNYGSWRKAMLIALSGKNKEGFIDGSLKKPEGR-----------KSIAWKCNNDIITSWILNSVSKEIAASINYTGSAKAVWDELHGRFKQSNGPHVYQ
        ++NY  W + +     G    GF+DGS   P                  WK  + +I S +L ++S  +  +++   +A  +W+ L   +   +  HV Q
Subjt:  ASNYGSWRKAMLIALSGKNKEGFIDGSLKKPEGR-----------KSIAWKCNNDIITSWILNSVSKEIAASINYTGSAKAVWDELHGRFKQSNGPHVYQ

Query:  LRKELVTSSQGSLSIEAYYTKLKTIWQELCDYRPTCDCTCGGIKPFTEHMESEFVMIFLMGLSDSYAAVRAQILLMNPIPDITKVFALIVQEEHQ----R
        LR +L   ++G+ +I+ Y   L T + +L       D               E V   L  L + Y  V  QI   +  P +T++   ++  E +     
Subjt:  LRKELVTSSQGSLSIEAYYTKLKTIWQELCDYRPTCDCTCGGIKPFTEHMESEFVMIFLMGLSDSYAAVRAQILLMNPIPDITKVFALIVQEEHQ----R

Query:  TAGNIPTSS---TQENITLLVAEASKKQSN---DRFRRTNQRP--------------------ICSHCGVKGHVVDKCYKIHGYPLGYKFRNSNLSGSDT
        +A  IP ++   +  N T      +  ++N   +R    N +P                     C  CGV+GH   +C ++  +       NS    S  
Subjt:  TAGNIPTSS---TQENITLLVAEASKKQSN---DRFRRTNQRP--------------------ICSHCGVKGHVVDKCYKIHGYPLGYKFRNSNLSGSDT

Query:  TGPKPAETNAVSQPQSGFFSSLNSSQYTQLMELLNTHIQSAKTEPITAASSLTRTAGICSTLHSSRMPKSDIWILDSGASRH-ICHRRDSFQNWRQVYGV
        T  +P    A+  P S     L+S     +    N     +  +P T    +    G  ST+  S    + +       + H I +  +  +N   VY  
Subjt:  TGPKPAETNAVSQPQSGFFSSLNSSQYTQLMELLNTHIQSAKTEPITAASSLTRTAGICSTLHSSRMPKSDIWILDSGASRH-ICHRRDSFQNWRQVYGV

Query:  SVILPTGFLSCLLKNNNLSIDFSGSSCRIQDKSLLMTIGKAESDNGLYILLQPDKLPFTV-AESACFVSISTWHSRLGHLSAPRLSLLKSTLQ------F
                   L   N +S++F  +S +++D +  + + + ++ + LY        P ++ A  +   + S+WH+RLGH   P  S+L S +        
Subjt:  SVILPTGFLSCLLKNNNLSIDFSGSSCRIQDKSLLMTIGKAESDNGLYILLQPDKLPFTV-AESACFVSISTWHSRLGHLSAPRLSLLKSTLQ------F

Query:  QGSHTECSPCHICPLAKQRRLSFSSNNHVASTMFELVHCDIWGPFKAPTYAGYKYFLTIVDDHSRYTWTYLMKTKSDALHIIPRFFKLINTQFSRVTKVF
          SH   S C  C + K  ++ FS +   ++   E ++ D+W      ++  Y+Y++  VD  +RYTW Y +K KS        F  L+  +F      F
Subjt:  QGSHTECSPCHICPLAKQRRLSFSSNNHVASTMFELVHCDIWGPFKAPTYAGYKYFLTIVDDHSRYTWTYLMKTKSDALHIIPRFFKLINTQFSRVTKVF

Query:  RSDNAPE-LQFKEFFASIGTIHQFSCVERPQQNSVAERKHQHLLNVARALFFQARIPIRLWGDCILTASYLINRTPAPLLHNKTPFELLYGHEVDYSNLR
         SDN  E +   E+F+  G  H  S    P+ N ++ERKH+H++     L   A IP   W      A YLINR P PLL  ++PF+ L+G   +Y  LR
Subjt:  RSDNAPE-LQFKEFFASIGTIHQFSCVERPQQNSVAERKHQHLLNVARALFFQARIPIRLWGDCILTASYLINRTPAPLLHNKTPFELLYGHEVDYSNLR

Query:  IFGCLCYASTRGASKNQKTDKPNRTKPIQWASLCVFIGYPPGVKGYRLYDIVKKQVIISRDVIFFEDTFPFHNQ-GTISD------DDVCNLFSDH----
        +FGC CY   R  ++++  DK  +         CVF+GY      Y    +   ++ ISR V F E+ FPF N   T+S       +  C ++S H    
Subjt:  IFGCLCYASTRGASKNQKTDKPNRTKPIQWASLCVFIGYPPGVKGYRLYDIVKKQVIISRDVIFFEDTFPFHNQ-GTISD------DDVCNLFSDH----

Query:  ----VLPCP-IIDP-LASSDAMNPTAGAHVDQAPSLTPDVPTSIDHNLESSVVPTMEPDSSFEPDAVVEDSLTNVPLVETVELDVLVVSGHKPIANTSGA
            VLP P   DP  A++   +P+A     Q  S   D  +S   +  SS  PT    +  +P      + T     +    +         +A +   
Subjt:  ----VLPCP-IIDP-LASSDAMNPTAGAHVDQAPSLTPDVPTSIDHNLESSVVPTMEPDSSFEPDAVVEDSLTNVPLVETVELDVLVVSGHKPIANTSGA

Query:  PVRRSSRAHICPKFLQDYHCGSLNHTAD----RPVYPIENYLSYDNLSPSHR------------------QFILNVSAIHEPSFFHQAVKLEPWRRAMDA
        P + SS +   P         S + T       P  P+   ++ +N +P +                      ++++A  EP    QA+K E WR AM +
Subjt:  PVRRSSRAHICPKFLQDYHCGSLNHTAD----RPVYPIENYLSYDNLSPSHR------------------QFILNVSAIHEPSFFHQAVKLEPWRRAMDA

Query:  EIAA
        EI A
Subjt:  EIAA

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.0e-4923.58Show/hide
Query:  ASNYGSWRKAMLIALSGKNKEGFIDGSLKKPE---GRKSI--------AWKCNNDIITSWILNSVSKEIAASINYTGSAKAVWDELHGRFKQSNGPHVYQ
        ++NY  W + +     G    GF+DGS   P    G  ++         W+  + +I S IL ++S  +  +++   +A  +W+ L   +   +  HV Q
Subjt:  ASNYGSWRKAMLIALSGKNKEGFIDGSLKKPE---GRKSI--------AWKCNNDIITSWILNSVSKEIAASINYTGSAKAVWDELHGRFKQSNGPHVYQ

Query:  LRKELVTSSQGSLSIEAYYTKLKTIWQELCDYRPTCDCTCGGIKPFTEHMESEFVMIFLMGLSDSYAAVRAQILLMNPIPDITKVFALIVQEEHQRTAGN
        LR               + T+   +               G  KP     + E V   L  L D Y  V  QI   +  P +T++   ++  E +  A N
Subjt:  LRKELVTSSQGSLSIEAYYTKLKTIWQELCDYRPTCDCTCGGIKPFTEHMESEFVMIFLMGLSDSYAAVRAQILLMNPIPDITKVFALIVQEEHQRTAGN

Query:  ----IP---------TSSTQENITLLVAEASKKQSNDR----------FRRTNQRP-----ICSHCGVKGHVVDKCYKIHGYPLGYKFRNSNLSGSDTTG
            +P          ++T  N        +   +N+R           R  N++P      C  C V+GH   +C ++H +       N   S S  T 
Subjt:  ----IP---------TSSTQENITLLVAEASKKQSNDR----------FRRTNQRP-----ICSHCGVKGHVVDKCYKIHGYPLGYKFRNSNLSGSDTTG

Query:  PKPAETNAVSQPQSGFFSSLNSSQYTQLMELLNTHIQSAKTEPITAASSLTRTAGICSTLHSSRMPKSDIWILDSGASRHICHRRDSFQNWRQVYGVSVI
         +P    AV+ P +     L+S     +    N     +  +P T    +    G            S I I  +G++      R    N + +Y  ++ 
Subjt:  PKPAETNAVSQPQSGFFSSLNSSQYTQLMELLNTHIQSAKTEPITAASSLTRTAGICSTLHSSRMPKSDIWILDSGASRHICHRRDSFQNWRQVYGVSVI

Query:  LPTGFLSCLLKNNNLSIDFSGSSCRIQDKSLLMTIGKAESDNGLYILLQPDKLPFTVAESACF-VSISTWHSRLGHLSAPRLSLLKSTLQ------FQGS
             +  L   N +S++F  +S +++D +  + + + ++ + LY          ++  S C   + S+WHSRLGH   P L++L S +          S
Subjt:  LPTGFLSCLLKNNNLSIDFSGSSCRIQDKSLLMTIGKAESDNGLYILLQPDKLPFTVAESACF-VSISTWHSRLGHLSAPRLSLLKSTLQ------FQGS

Query:  HTECSPCHICPLAKQRRLSFSSNNHVASTMFELVHCDIWGPFKAPTYAGYKYFLTIVDDHSRYTWTYLMKTKSDALHIIPRFFKLINTQFSRVTKVFRSD
        H   S C  C + K  ++ FS++   +S   E ++ D+W      +   Y+Y++  VD  +RYTW Y +K KS        F  L+  +F        SD
Subjt:  HTECSPCHICPLAKQRRLSFSSNNHVASTMFELVHCDIWGPFKAPTYAGYKYFLTIVDDHSRYTWTYLMKTKSDALHIIPRFFKLINTQFSRVTKVFRSD

Query:  NAPE-LQFKEFFASIGTIHQFSCVERPQQNSVAERKHQHLLNVARALFFQARIPIRLWGDCILTASYLINRTPAPLLHNKTPFELLYGHEVDYSNLRIFG
        N  E +  +++ +  G  H  S    P+ N ++ERKH+H++ +   L   A +P   W      A YLINR P PLL  ++PF+ L+G   +Y  L++FG
Subjt:  NAPE-LQFKEFFASIGTIHQFSCVERPQQNSVAERKHQHLLNVARALFFQARIPIRLWGDCILTASYLINRTPAPLLHNKTPFELLYGHEVDYSNLRIFG

Query:  CLCYASTRGASKNQKTDKPNRTKPIQWASLCVFIGYPPGVKGYRLYDIVKKQVIISRDVIFFEDTFPFH--NQGTIS-----DDDVCNLFSDHVLP-CPI
        C CY   R  ++++  DK  +         C F+GY      Y    I   ++  SR V F E  FPF   N G  +      D   N  S   LP  P+
Subjt:  CLCYASTRGASKNQKTDKPNRTKPIQWASLCVFIGYPPGVKGYRLYDIVKKQVIISRDVIFFEDTFPFH--NQGTIS-----DDDVCNLFSDHVLP-CPI

Query:  IDPLASSDAMNPTAGAHVDQAP----SLTPDVPTSI-DHNLESSVV-------PTMEPDSSFEPDA---VVEDSLTNVPLVETVELD-------------
        + P        P  G H+D +P    S +P   T +   NL SS +       PT    +  +P A     ++S +N P++     +             
Subjt:  IDPLASSDAMNPTAGAHVDQAP----SLTPDVPTSI-DHNLESSVV-------PTMEPDSSFEPDA---VVEDSLTNVPLVETVELD-------------

Query:  --VLVVSGHKPIANTS----GAPVRRSSRAHICPKFLQDYHCGSLNHTADRPVYPIENYLSYDNLSPSHR-QFILNVSAIHEPSFFHQAVKLEPWRRAMD
            + S H P  +TS     +P   S+     P  L       +N  A    + +          P+ +  +  +++A  EP    QA+K + WR+AM 
Subjt:  --VLVVSGHKPIANTS----GAPVRRSSRAHICPKFLQDYHCGSLNHTADRPVYPIENYLSYDNLSPSHR-QFILNVSAIHEPSFFHQAVKLEPWRRAMD

Query:  AEIAA
        +EI A
Subjt:  AEIAA

Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).4.1e-2531.86Show/hide
Query:  TSSDDSPVIDVQLNPFHLHHSYSSSVVLVTQPLTGASNYGSWRKAMLIALSGKNKEGFIDGSLKKPEGRKSI--AWKCNNDIITSWILNSVSKEIAASIN
        TS  DSP       P  +HH    S+  +++      NY +W+      L    K GFIDG+L KP+    +   W+  N ++  W++NS++ ++  S+ 
Subjt:  TSSDDSPVIDVQLNPFHLHHSYSSSVVLVTQPLTGASNYGSWRKAMLIALSGKNKEGFIDGSLKKPEGRKSI--AWKCNNDIITSWILNSVSKEIAASIN

Query:  YTGSAKAVWDELHGRFKQSNGPHVYQLRKELVTSSQGSLSIEAYYTKLKTIWQELCDYRPTCDCTCGG-----IKPFTEHMESEFVMIFLMG--LSDSYA
        Y  +A  +W++L   F       +YQLR+ L T  QG  S+E Y+ KL  +W EL +Y P  +C CGG      K   E  E E    FLMG  L+  + 
Subjt:  YTGSAKAVWDELHGRFKQSNGPHVYQLRKELVTSSQGSLSIEAYYTKLKTIWQELCDYRPTCDCTCGG-----IKPFTEHMESEFVMIFLMG--LSDSYA

Query:  AVRAQILLMNPIPDITKVFALIVQEE
        AV  +I+   P P + + FA++   E
Subjt:  AVRAQILLMNPIPDITKVFALIVQEE

AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 83.5e-0836.59Show/hide
Query:  VRRSSRAHICPKFLQDYHCGSLNHTADRPVYPIENYLSYDNLSPSHRQFILNVSAIHEPSFFHQAVKLEPWRRAMDAEIAAM
        V  S R    P +LQDY+C S+   A   ++ I  +LSY+ +SP +  F++ ++   EPS +++A +   W  AMD EI AM
Subjt:  VRRSSRAHICPKFLQDYHCGSLNHTADRPVYPIENYLSYDNLSPSHRQFILNVSAIHEPSFFHQAVKLEPWRRAMDAEIAAM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGACAAGGAGACTTCAAGCGATGATTCACCAGTGATCGATGTGCAACTCAACCCCTTCCATCTCCACCACTCATATAGTTCTTCAGTCGTCCTTGTCACACAACC
TTTAACCGGTGCAAGCAACTATGGATCGTGGAGGAAGGCAATGTTGATTGCACTCTCAGGCAAAAACAAAGAAGGTTTCATTGATGGATCACTCAAGAAACCAGAAGGAA
GGAAGTCAATTGCCTGGAAGTGTAATAACGACATAATCACGTCCTGGATCCTCAATTCCGTATCCAAAGAGATTGCTGCTAGTATAAACTACACCGGATCTGCCAAGGCT
GTCTGGGATGAACTTCATGGAAGATTCAAGCAAAGCAATGGGCCACACGTTTATCAACTTCGGAAAGAGCTTGTTACTTCAAGTCAAGGAAGCTTGTCGATCGAAGCCTA
CTACACCAAGCTGAAAACGATTTGGCAAGAGCTTTGCGACTACCGTCCGACCTGCGATTGCACATGCGGAGGAATAAAGCCTTTCACGGAGCATATGGAGTCTGAATTCG
TCATGATTTTCCTCATGGGACTTAGCGACTCATATGCTGCTGTTCGCGCCCAAATATTGCTGATGAATCCTATTCCTGATATCACCAAGGTCTTTGCCCTAATCGTGCAA
GAAGAGCACCAACGAACAGCTGGTAATATACCGACATCGAGCACGCAAGAAAATATCACTCTTCTTGTTGCTGAAGCTTCCAAGAAGCAGAGCAATGATCGGTTTCGAAG
GACAAATCAGCGTCCTATCTGCTCTCATTGTGGAGTTAAAGGACACGTCGTGGACAAGTGTTACAAGATCCACGGATATCCTCTGGGATACAAGTTTCGGAACTCAAATT
TGTCAGGCAGTGACACCACAGGGCCTAAACCTGCTGAAACAAACGCAGTCTCTCAGCCCCAATCTGGCTTCTTCTCCAGCTTGAATTCAAGTCAGTACACGCAGTTGATG
GAGCTTCTTAATACACACATTCAATCGGCCAAAACTGAACCAATAACTGCAGCGTCTTCACTCACTCGTACTGCAGGTATTTGTTCGACATTACATTCATCTCGTATGCC
TAAGTCAGATATTTGGATCTTAGATTCTGGAGCATCTAGGCATATTTGTCATCGTCGTGATTCATTTCAGAATTGGCGTCAAGTGTATGGAGTTAGCGTTATTCTACCAA
CTGGTTTTCTGAGTTGTTTACTTAAGAACAACAATCTTTCAATTGATTTCTCTGGTTCATCTTGCCGGATACAGGACAAGTCCCTATTGATGACGATTGGCAAGGCTGAG
TCAGATAATGGCCTTTATATCCTGCTGCAACCAGATAAGCTGCCCTTTACTGTTGCTGAATCTGCTTGTTTTGTTAGTATATCTACATGGCATAGTCGCCTTGGCCATTT
GTCTGCACCTAGATTGTCTTTGCTTAAGAGTACTTTACAATTTCAAGGTTCACATACTGAATGTTCCCCTTGTCATATATGCCCATTGGCCAAGCAGCGTAGATTGTCAT
TTTCCTCTAATAATCATGTTGCTTCTACTATGTTTGAACTTGTACATTGTGACATTTGGGGCCCCTTTAAGGCCCCAACATATGCTGGTTATAAGTATTTTTTGACCATT
GTTGACGACCACTCCCGTTACACTTGGACCTACTTGATGAAAACTAAGTCCGATGCATTACATATAATTCCTCGGTTCTTCAAGTTAATCAATACTCAGTTCTCAAGGGT
CACCAAAGTTTTCCGGTCTGATAACGCCCCAGAACTGCAGTTCAAAGAGTTTTTTGCATCCATAGGAACAATCCACCAATTTTCTTGTGTTGAGAGACCTCAACAAAATT
CGGTGGCTGAAAGAAAGCACCAACACCTGTTAAACGTGGCTCGTGCTCTTTTCTTTCAAGCTCGCATCCCCATTCGTTTATGGGGTGATTGCATTCTAACTGCATCATAT
CTCATAAATAGAACGCCTGCACCTTTGCTTCATAATAAGACGCCCTTTGAATTGTTATATGGTCATGAGGTTGATTACTCTAATCTCAGAATTTTTGGGTGCTTATGCTA
TGCGTCTACTAGGGGTGCAAGCAAAAACCAAAAAACCGACAAACCGAACCGAACCAAACCGATCCAATGGGCCTCTCTGTGTGTTTTCATTGGATATCCACCGGGCGTCA
AAGGTTATCGCCTTTATGACATTGTTAAGAAACAGGTCATCATTTCCAGGGATGTGATCTTCTTTGAGGACACTTTTCCTTTTCATAATCAAGGGACAATATCAGATGAC
GATGTTTGCAACCTATTCTCTGATCATGTTCTCCCTTGCCCTATTATTGATCCTCTAGCCTCGTCTGATGCCATGAATCCAACTGCTGGTGCTCACGTGGATCAGGCTCC
CAGCCTCACTCCTGACGTCCCGACTTCTATTGATCACAACCTTGAATCCTCTGTGGTTCCTACCATGGAACCCGACAGCTCTTTTGAGCCTGATGCGGTTGTTGAGGACT
CTCTCACAAATGTTCCTTTGGTTGAAACTGTTGAGCTTGACGTTCTTGTTGTTTCAGGGCATAAGCCTATCGCAAATACCTCAGGTGCTCCCGTCAGGCGCTCTTCCAGG
GCACATATTTGCCCAAAATTTTTGCAAGACTACCATTGTGGTTCCTTAAATCACACAGCTGACAGACCCGTCTATCCGATTGAGAATTATTTGTCCTATGACAATTTAAG
CCCTAGTCACAGGCAATTCATTCTTAATGTCTCCGCGATACATGAGCCTTCCTTTTTCCATCAGGCTGTGAAATTAGAACCTTGGCGCAGAGCAATGGATGCTGAAATTG
CAGCAATGAACGTACTTCGACCTGGAGTCTTGTTCCTCTTCCGCCTGGGAAGCACACAGTGGGGTGCAAATGGATTTATAAAGTCAAATATAATGCTGATGGTTCTATTG
ACCGATACAAAGCGCGTCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCGGACAAGGAGACTTCAAGCGATGATTCACCAGTGATCGATGTGCAACTCAACCCCTTCCATCTCCACCACTCATATAGTTCTTCAGTCGTCCTTGTCACACAACC
TTTAACCGGTGCAAGCAACTATGGATCGTGGAGGAAGGCAATGTTGATTGCACTCTCAGGCAAAAACAAAGAAGGTTTCATTGATGGATCACTCAAGAAACCAGAAGGAA
GGAAGTCAATTGCCTGGAAGTGTAATAACGACATAATCACGTCCTGGATCCTCAATTCCGTATCCAAAGAGATTGCTGCTAGTATAAACTACACCGGATCTGCCAAGGCT
GTCTGGGATGAACTTCATGGAAGATTCAAGCAAAGCAATGGGCCACACGTTTATCAACTTCGGAAAGAGCTTGTTACTTCAAGTCAAGGAAGCTTGTCGATCGAAGCCTA
CTACACCAAGCTGAAAACGATTTGGCAAGAGCTTTGCGACTACCGTCCGACCTGCGATTGCACATGCGGAGGAATAAAGCCTTTCACGGAGCATATGGAGTCTGAATTCG
TCATGATTTTCCTCATGGGACTTAGCGACTCATATGCTGCTGTTCGCGCCCAAATATTGCTGATGAATCCTATTCCTGATATCACCAAGGTCTTTGCCCTAATCGTGCAA
GAAGAGCACCAACGAACAGCTGGTAATATACCGACATCGAGCACGCAAGAAAATATCACTCTTCTTGTTGCTGAAGCTTCCAAGAAGCAGAGCAATGATCGGTTTCGAAG
GACAAATCAGCGTCCTATCTGCTCTCATTGTGGAGTTAAAGGACACGTCGTGGACAAGTGTTACAAGATCCACGGATATCCTCTGGGATACAAGTTTCGGAACTCAAATT
TGTCAGGCAGTGACACCACAGGGCCTAAACCTGCTGAAACAAACGCAGTCTCTCAGCCCCAATCTGGCTTCTTCTCCAGCTTGAATTCAAGTCAGTACACGCAGTTGATG
GAGCTTCTTAATACACACATTCAATCGGCCAAAACTGAACCAATAACTGCAGCGTCTTCACTCACTCGTACTGCAGGTATTTGTTCGACATTACATTCATCTCGTATGCC
TAAGTCAGATATTTGGATCTTAGATTCTGGAGCATCTAGGCATATTTGTCATCGTCGTGATTCATTTCAGAATTGGCGTCAAGTGTATGGAGTTAGCGTTATTCTACCAA
CTGGTTTTCTGAGTTGTTTACTTAAGAACAACAATCTTTCAATTGATTTCTCTGGTTCATCTTGCCGGATACAGGACAAGTCCCTATTGATGACGATTGGCAAGGCTGAG
TCAGATAATGGCCTTTATATCCTGCTGCAACCAGATAAGCTGCCCTTTACTGTTGCTGAATCTGCTTGTTTTGTTAGTATATCTACATGGCATAGTCGCCTTGGCCATTT
GTCTGCACCTAGATTGTCTTTGCTTAAGAGTACTTTACAATTTCAAGGTTCACATACTGAATGTTCCCCTTGTCATATATGCCCATTGGCCAAGCAGCGTAGATTGTCAT
TTTCCTCTAATAATCATGTTGCTTCTACTATGTTTGAACTTGTACATTGTGACATTTGGGGCCCCTTTAAGGCCCCAACATATGCTGGTTATAAGTATTTTTTGACCATT
GTTGACGACCACTCCCGTTACACTTGGACCTACTTGATGAAAACTAAGTCCGATGCATTACATATAATTCCTCGGTTCTTCAAGTTAATCAATACTCAGTTCTCAAGGGT
CACCAAAGTTTTCCGGTCTGATAACGCCCCAGAACTGCAGTTCAAAGAGTTTTTTGCATCCATAGGAACAATCCACCAATTTTCTTGTGTTGAGAGACCTCAACAAAATT
CGGTGGCTGAAAGAAAGCACCAACACCTGTTAAACGTGGCTCGTGCTCTTTTCTTTCAAGCTCGCATCCCCATTCGTTTATGGGGTGATTGCATTCTAACTGCATCATAT
CTCATAAATAGAACGCCTGCACCTTTGCTTCATAATAAGACGCCCTTTGAATTGTTATATGGTCATGAGGTTGATTACTCTAATCTCAGAATTTTTGGGTGCTTATGCTA
TGCGTCTACTAGGGGTGCAAGCAAAAACCAAAAAACCGACAAACCGAACCGAACCAAACCGATCCAATGGGCCTCTCTGTGTGTTTTCATTGGATATCCACCGGGCGTCA
AAGGTTATCGCCTTTATGACATTGTTAAGAAACAGGTCATCATTTCCAGGGATGTGATCTTCTTTGAGGACACTTTTCCTTTTCATAATCAAGGGACAATATCAGATGAC
GATGTTTGCAACCTATTCTCTGATCATGTTCTCCCTTGCCCTATTATTGATCCTCTAGCCTCGTCTGATGCCATGAATCCAACTGCTGGTGCTCACGTGGATCAGGCTCC
CAGCCTCACTCCTGACGTCCCGACTTCTATTGATCACAACCTTGAATCCTCTGTGGTTCCTACCATGGAACCCGACAGCTCTTTTGAGCCTGATGCGGTTGTTGAGGACT
CTCTCACAAATGTTCCTTTGGTTGAAACTGTTGAGCTTGACGTTCTTGTTGTTTCAGGGCATAAGCCTATCGCAAATACCTCAGGTGCTCCCGTCAGGCGCTCTTCCAGG
GCACATATTTGCCCAAAATTTTTGCAAGACTACCATTGTGGTTCCTTAAATCACACAGCTGACAGACCCGTCTATCCGATTGAGAATTATTTGTCCTATGACAATTTAAG
CCCTAGTCACAGGCAATTCATTCTTAATGTCTCCGCGATACATGAGCCTTCCTTTTTCCATCAGGCTGTGAAATTAGAACCTTGGCGCAGAGCAATGGATGCTGAAATTG
CAGCAATGAACGTACTTCGACCTGGAGTCTTGTTCCTCTTCCGCCTGGGAAGCACACAGTGGGGTGCAAATGGATTTATAAAGTCAAATATAATGCTGATGGTTCTATTG
ACCGATACAAAGCGCGTCTAG
Protein sequenceShow/hide protein sequence
MADKETSSDDSPVIDVQLNPFHLHHSYSSSVVLVTQPLTGASNYGSWRKAMLIALSGKNKEGFIDGSLKKPEGRKSIAWKCNNDIITSWILNSVSKEIAASINYTGSAKA
VWDELHGRFKQSNGPHVYQLRKELVTSSQGSLSIEAYYTKLKTIWQELCDYRPTCDCTCGGIKPFTEHMESEFVMIFLMGLSDSYAAVRAQILLMNPIPDITKVFALIVQ
EEHQRTAGNIPTSSTQENITLLVAEASKKQSNDRFRRTNQRPICSHCGVKGHVVDKCYKIHGYPLGYKFRNSNLSGSDTTGPKPAETNAVSQPQSGFFSSLNSSQYTQLM
ELLNTHIQSAKTEPITAASSLTRTAGICSTLHSSRMPKSDIWILDSGASRHICHRRDSFQNWRQVYGVSVILPTGFLSCLLKNNNLSIDFSGSSCRIQDKSLLMTIGKAE
SDNGLYILLQPDKLPFTVAESACFVSISTWHSRLGHLSAPRLSLLKSTLQFQGSHTECSPCHICPLAKQRRLSFSSNNHVASTMFELVHCDIWGPFKAPTYAGYKYFLTI
VDDHSRYTWTYLMKTKSDALHIIPRFFKLINTQFSRVTKVFRSDNAPELQFKEFFASIGTIHQFSCVERPQQNSVAERKHQHLLNVARALFFQARIPIRLWGDCILTASY
LINRTPAPLLHNKTPFELLYGHEVDYSNLRIFGCLCYASTRGASKNQKTDKPNRTKPIQWASLCVFIGYPPGVKGYRLYDIVKKQVIISRDVIFFEDTFPFHNQGTISDD
DVCNLFSDHVLPCPIIDPLASSDAMNPTAGAHVDQAPSLTPDVPTSIDHNLESSVVPTMEPDSSFEPDAVVEDSLTNVPLVETVELDVLVVSGHKPIANTSGAPVRRSSR
AHICPKFLQDYHCGSLNHTADRPVYPIENYLSYDNLSPSHRQFILNVSAIHEPSFFHQAVKLEPWRRAMDAEIAAMNVLRPGVLFLFRLGSTQWGANGFIKSNIMLMVLL
TDTKRV