; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI01G22780 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI01G22780
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionGag-pol polyprotein
Genome locationChr1:18356189..18359686
RNA-Seq ExpressionCSPI01G22780
SyntenyCSPI01G22780
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR036397 - Ribonuclease H superfamily
IPR036875 - Zinc finger, CCHC-type superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
AAP12935.1 retrotransposon protein, putative, unclassified [Oryza sativa Japonica Group]2.2e-16233.82Show/hide
Query:  LYPMLTPHNYTVWTIKPEAILDAQGVGEAIEPARGAEVNVKKDKNAHAYILQCILEDVLLQIAKKKTAKEIWDSLKIRYFGSERVKMAR-----------
        LYP LT  NYT W I+ +A+++ QGV EAIEPA GA V+ +KDK   +++LQ + ED+L+Q+AKK+TAKE+WD LK R+ G+++V+ AR           
Subjt:  LYPMLTPHNYTVWTIKPEAILDAQGVGEAIEPARGAEVNVKKDKNAHAYILQCILEDVLLQIAKKKTAKEIWDSLKIRYFGSERVKMAR-----------

Query:  ---ETETIDEFAGKISGLVSKFTTLRVALEDSSLVKKLLDFVLDKYLSIIARIEQFQDLDTMPFEEGADLVVSIDE--------------DGEDVVVAL-
             E++D++AG+I+ +  + + L   L D+++VKKL D V +K++S++  IEQF ++D MPFEE    + + DE              DG+ ++    
Subjt:  ---ETETIDEFAGKISGLVSKFTTLRVALEDSSLVKKLLDFVLDKYLSIIARIEQFQDLDTMPFEEGADLVVSIDE--------------DGEDVVVAL-

Query:  -SVKIVRGTSNTENDT--------------------------------------RDKSHIKCFTCNKMGHYMSKC-RGKDRDDEAHLTCVFEEEPALM--
           +  +G   T +                                        RDKSHIKCF   + GHY ++C   K +  EAHL    +  PAL+  
Subjt:  -SVKIVRGTSNTENDT--------------------------------------RDKSHIKCFTCNKMGHYMSKC-RGKDRDDEAHLTCVFEEEPALM--

Query:  ----------------MVVSQEGTHTR-------------------------------------------------------------CDQEDVILLSEE
                        +VV +E    R                                                             C   D  LL + 
Subjt:  ----------------MVVSQEGTHTR-------------------------------------------------------------CDQEDVILLSEE

Query:  RLLPEIYAN-------------------------------------------------------------------------------------------
          +P +  N                                                                                           
Subjt:  RLLPEIYAN-------------------------------------------------------------------------------------------

Query:  RFHH---------------------------------------------------------------------AKSDAFESFKKFKLLMENKTKYKIRTL
        + HH                                                                      K  A E F KFK L EN   + I+TL
Subjt:  RFHH---------------------------------------------------------------------AKSDAFESFKKFKLLMENKTKYKIRTL

Query:  RTDRG---------------------------------ERRNLTVMAIARSLLKSMHVPAKFWEEAVRHAVYLLNRLPTKVLGERTPFEVWMGRKPHLAY
        RTD                                   E RN +VMA+ARSLLK M VP + W EAVRHAV+LLNRLPTK +GE TPFE W G+KPHL +
Subjt:  RTDRG---------------------------------ERRNLTVMAIARSLLKSMHVPAKFWEEAVRHAVYLLNRLPTKVLGERTPFEVWMGRKPHLAY

Query:  LSTWLCGIGRGKLQISEMLFFKRILNGLVTKSLVTVRRLRSFSPSPPSMNT-----------------PVHLRSLTDIYANRDEVVGGDEQENE------
        L  + C                     + T +L  +      +P+P S +T                 P+  RSL DI      V   DE++ E      
Subjt:  LSTWLCGIGRGKLQISEMLFFKRILNGLVTKSLVTVRRLRSFSPSPPSMNT-----------------PVHLRSLTDIYANRDEVVGGDEQENE------

Query:  --------------------NELKSIEKNNTWSLTELPPGHKSI-------------------------------------EVFAPVARLDTVRVILALA
                             EL++IEKN+TWSL  LP GHK+I                                     EVFAPVARLDTVR ILA+A
Subjt:  --------------------NELKSIEKNNTWSLTELPPGHKSI-------------------------------------EVFAPVARLDTVRVILALA

Query:  ANQSWEVHHLDVKSAFLNGELEEEVYVTQPEGFEVPNEKHKMYKLSKALYGLRQAPRAWNIRLDRSLKELDFGKCTQEQAVYTRREEEECVFVVVYVNDL
        A++ W VHHLDVKSAFLNGELEEEVYV Q EGF    ++H + KLSKALYGLRQAPRAWNI LD+SLKEL F +C QEQAVYTR    E + V VYV+DL
Subjt:  ANQSWEVHHLDVKSAFLNGELEEEVYVTQPEGFEVPNEKHKMYKLSKALYGLRQAPRAWNIRLDRSLKELDFGKCTQEQAVYTRREEEECVFVVVYVNDL

Query:  IVTGSSTEKVKKFKQQMMAEFEMSDLGLLSYYLGIEVEQQKSRILLKQPTYAKRILSKFGMTNCNATKYPMEP----------KILTEHKPNLSYAVGMA
        IVTG +  +V+ FK+QMM EFEM+DLGLLSYYLGIEV Q         P   K  L K      +A  +P++           + L   +P+LSYAVG+A
Subjt:  IVTGSSTEKVKKFKQQMMAEFEMSDLGLLSYYLGIEVEQQKSRILLKQPTYAKRILSKFGMTNCNATKYPMEP----------KILTEHKPNLSYAVGMA

Query:  SRYMERPITMHYKVVKQILRYLRGTIHFGFTYTKGPREFNIFGYSDSYLAGDLDGRKGTSGMTFYLNESLVSWNSQKQKTVAVSSCEVEFIAATTATCQA
        SR+ME P   H K VKQILRYL+GTI  G  YT G  E  I GY+DS LAGD+D R+ T GM FY+N SLV+W+SQKQKTV +SSCE EF+AAT A C A
Subjt:  SRYMERPITMHYKVVKQILRYLRGTIHFGFTYTKGPREFNIFGYSDSYLAGDLDGRKGTSGMTFYLNESLVSWNSQKQKTVAVSSCEVEFIAATTATCQA

Query:  LWLRCLVNEIVRMEPRPVTLFVDNKSAIALMKNLVF------------------------------HEQRANVMTKALTGVKLVAMHELLGVRNL
        LWLR L+ E++  E + V LFVDNKSAIALMKN VF                               EQRA+ +TK L   KLV    LL VR+L
Subjt:  LWLRCLVNEIVRMEPRPVTLFVDNKSAIALMKNLVF------------------------------HEQRANVMTKALTGVKLVAMHELLGVRNL

ABF94034.1 retrotransposon protein, putative, unclassified [Oryza sativa Japonica Group]1.2e-17633.02Show/hide
Query:  YPMLTPHNYTVWTIKPEAILDAQGVGEAIEPARGAEVNVKKDKNAHAYILQCILEDVLLQIAKKKTAKEIWDSLKIRYFGSERVKMAR------------
        YP LT  NYT W I+ +A+++ QGV +AIEPA G  V+ ++DK A +++LQ   ED+L+Q+AKK++AKE+WD LK R+ G++RV+ AR            
Subjt:  YPMLTPHNYTVWTIKPEAILDAQGVGEAIEPARGAEVNVKKDKNAHAYILQCILEDVLLQIAKKKTAKEIWDSLKIRYFGSERVKMAR------------

Query:  --ETETIDEFAGKISGLVSKFTTLRVALEDSSLVKKLLDFVLDKYLSIIARIEQFQDLDTMPFEEGADLVVSIDE--------------DGEDVVVALSV
            ET+D++AG+I+ +  + + L   L DS++VKKL D V +K++S++A IEQF D+DTMPFEE    + + +E              DG+ ++     
Subjt:  --ETETIDEFAGKISGLVSKFTTLRVALEDSSLVKKLLDFVLDKYLSIIARIEQFQDLDTMPFEEGADLVVSIDE--------------DGEDVVVALSV

Query:  K-------------------------------------------IVRGTS--NTENDTRDKSHIKCFTCNKMGHYMSKC-RGKDRDDEAHLTCVFEEEPA
        +                                             RG S   +    RDKSHIKCF C + GHY ++C   K +  EAHL    +  PA
Subjt:  K-------------------------------------------IVRGTS--NTENDTRDKSHIKCFTCNKMGHYMSKC-RGKDRDDEAHLTCVFEEEPA

Query:  LMMVVSQE-------------------------GTHT--------------------------------------------------RCDQEDVILLSEE
        L++ V+++                         G H                                                    C  +D  LL + 
Subjt:  LMMVVSQE-------------------------GTHT--------------------------------------------------RCDQEDVILLSEE

Query:  RLLPEIYANRFH----------------------------------------------------------------------------------------
          +P +  N                                                                                           
Subjt:  RLLPEIYANRFH----------------------------------------------------------------------------------------

Query:  ------------------------------------------------HAKSDAFESFKKFKLLMENKTKYKIRTLRTDRG-------------------
                                                          K  + ++F KFK L EN    +++TLR+DRG                   
Subjt:  ------------------------------------------------HAKSDAFESFKKFKLLMENKTKYKIRTLRTDRG-------------------

Query:  --------------ERRNLTVMAIARSLLKSMHVPAKFWEEAVRHAVYLLNRLPTKVLGERTPFEVWMGRKPHLAYLSTWLC------------------
                      ERRN TVMA+ARSLLK M VP +FW EAVRHAV+LLNRLPTK +G+RTPFE W GRKP L +L  + C                  
Subjt:  --------------ERRNLTVMAIARSLLKSMHVPAKFWEEAVRHAVYLLNRLPTKVLGERTPFEVWMGRKPHLAYLSTWLC------------------

Query:  ------GI------------GRGKLQISEMLFFKR-------------------------------ILNGLV-------------TKSLVTVRRLRS---
              G+             RG++ +S  + F+                                   G+V              +++V    + S   
Subjt:  ------GI------------GRGKLQISEMLFFKR-------------------------------ILNGLV-------------TKSLVTVRRLRS---

Query:  -----------FSPSPPSMNT------------------PVHLRSLTDIY--ANRDEVVGGD-------------------------EQENENELKSIEK
                    +P  PS N+                  P   RSL D+   A R ++V  +                         E+  + E+++IEK
Subjt:  -----------FSPSPPSMNT------------------PVHLRSLTDIY--ANRDEVVGGD-------------------------EQENENELKSIEK

Query:  NNTWSLTELPPGHKSI-------------------------------------EVFAPVARLDTVRVILALAANQSWEVHHLDVKSAFLNGELEEEVYVT
        N TW L  LP GH++I                                     EVFAPVARLDTVRV+LA+AA++ W+VHHLDVKSAFLNGELEEEVYV 
Subjt:  NNTWSLTELPPGHKSI-------------------------------------EVFAPVARLDTVRVILALAANQSWEVHHLDVKSAFLNGELEEEVYVT

Query:  QPEGFEVPNEKHKMYKLSKALYGLRQAPRAWNIRLDRSLKELDFGKCTQEQAVYTRREEEECVFVVVYVNDLIVTGSSTEKVKKFKQQMMAEFEMSDLGL
        QPEGF    ++H + KL KALYGLRQAPRAWNIRLDRSL+EL F +CTQEQAVYTR    + + V VYV+DLIVTG +  ++K FK+QMM EFEMSDLGL
Subjt:  QPEGFEVPNEKHKMYKLSKALYGLRQAPRAWNIRLDRSLKELDFGKCTQEQAVYTRREEEECVFVVVYVNDLIVTGSSTEKVKKFKQQMMAEFEMSDLGL

Query:  LSYYLGIEVEQQKSRILLKQPTYAKRILSKFGMTNCNATKYPMEP-------------------------KILTEHKPNLSYAVGMASRYMERPITMHYK
        L+YYLGIEV+Q +S   LKQ  YAK++LS+FGM  CN+   P++P                         + L   +P+LSYAVG+ASR+MERP  MH+K
Subjt:  LSYYLGIEVEQQKSRILLKQPTYAKRILSKFGMTNCNATKYPMEP-------------------------KILTEHKPNLSYAVGMASRYMERPITMHYK

Query:  VVKQILRYLRGTIHFGFTYTKGPREFNIFGYSDSYLAGDLDGRKGTSGMTFYLNESLVSWNSQKQKTVAVSSCEVEFIAATTATCQALWLRCLVNEIVRM
         VKQILRY++GT+ +G  Y  G     I GY+DS LAGDLD R+ T GM FY+N+SLV+W+SQKQKTVA+SSCE EF+AATTA CQALWLR L+ E+  +
Subjt:  VVKQILRYLRGTIHFGFTYTKGPREFNIFGYSDSYLAGDLDGRKGTSGMTFYLNESLVSWNSQKQKTVAVSSCEVEFIAATTATCQALWLRCLVNEIVRM

Query:  EPRPVTLFVDNKSAIALMKNLVFH------------------------------EQRANVMTKALTGVKLVAMHELLGVRNLESCQD
        E + V LFVDN+SAIALMKN VFH                              EQRA+ +TK L   KLV    LLGVR+L S Q+
Subjt:  EPRPVTLFVDNKSAIALMKNLVFH------------------------------EQRANVMTKALTGVKLVAMHELLGVRNLESCQD

ABF96448.1 retrotransposon protein, putative, Ty1-copia subclass [Oryza sativa Japonica Group]4.7e-16534.11Show/hide
Query:  LYPMLTPHNYTVWTIKPEAILDAQGVGEAIEPARGAEVNVKKDKNAHAYILQCILEDVLLQIAKKKTAKEIWDSLKIRYFGSERVKMAR-----------
        LYP LT  NYT W I+ +A+++ QGV EAIEPA GA V+ +KDK   +++LQ + ED+L+Q+AKK+TAKE+WD LK R+ G+++V+ AR           
Subjt:  LYPMLTPHNYTVWTIKPEAILDAQGVGEAIEPARGAEVNVKKDKNAHAYILQCILEDVLLQIAKKKTAKEIWDSLKIRYFGSERVKMAR-----------

Query:  ---ETETIDEFAGKISGLVSKFTTLRVALEDSSLVKKLLDFVLDKYLSIIARIEQFQDLDTMPFEEGADLVVSIDE--------------DGEDVVVAL-
             E++D++AG+I+ +  + + L   L D+++VKKL D V +K++S++  IEQF ++D MPFEE    + + DE              DG+ ++    
Subjt:  ---ETETIDEFAGKISGLVSKFTTLRVALEDSSLVKKLLDFVLDKYLSIIARIEQFQDLDTMPFEEGADLVVSIDE--------------DGEDVVVAL-

Query:  -SVKIVRGTSNTENDT--------------------------------------RDKSHIKCFTCNKMGHYMSKC-RGKDRDDEAHLTCVFEEEPALM--
           +  +G   T +                                        RDKSHIKCF   + GHY ++C   K +  EAHL    +  PAL+  
Subjt:  -SVKIVRGTSNTENDT--------------------------------------RDKSHIKCFTCNKMGHYMSKC-RGKDRDDEAHLTCVFEEEPALM--

Query:  ----------------MVVSQEGTHTR-------------------------------------------------------------CDQEDVILLSEE
                        +VV +E    R                                                             C   D  LL + 
Subjt:  ----------------MVVSQEGTHTR-------------------------------------------------------------CDQEDVILLSEE

Query:  RLLPEIYAN-------------------------------------------------------------------------------------------
          +P +  N                                                                                           
Subjt:  RLLPEIYAN-------------------------------------------------------------------------------------------

Query:  RFHH---------------------------------------------------------------------AKSDAFESFKKFKLLMENKTKYKIRTL
        + HH                                                                      K  A E F KFK L EN   + I+TL
Subjt:  RFHH---------------------------------------------------------------------AKSDAFESFKKFKLLMENKTKYKIRTL

Query:  RTDRG---------------------------------ERRNLTVMAIARSLLKSMHVPAKFWEEAVRHAVYLLNRLPTKVLGERTPFEVWMGRKPHLAY
        RTD                                   E RN +VMA+ARSLLK M VP + W EAVRHAV+LLNRLPTK +GE TPFE W G+KPHL +
Subjt:  RTDRG---------------------------------ERRNLTVMAIARSLLKSMHVPAKFWEEAVRHAVYLLNRLPTKVLGERTPFEVWMGRKPHLAY

Query:  LSTWLCGI-------------GRGKLQISEMLFFKRI--LNGLVTKSLVTVRRLRSFSPSPPSMNTPVHLRSLTDIYANRDEVVGGDEQENE--------
        L  + C                R +   +E +  + +      + +   TV    S + S    + P+  RSL DI      V   DE++ E        
Subjt:  LSTWLCGI-------------GRGKLQISEMLFFKRI--LNGLVTKSLVTVRRLRSFSPSPPSMNTPVHLRSLTDIYANRDEVVGGDEQENE--------

Query:  ------------------NELKSIEKNNTWSLTELPPGHKSI-------------------------------------EVFAPVARLDTVRVILALAAN
                           EL++IEKN+TWSL  LP GHK+I                                     EVFAPVARLDTVR ILA+AA+
Subjt:  ------------------NELKSIEKNNTWSLTELPPGHKSI-------------------------------------EVFAPVARLDTVRVILALAAN

Query:  QSWEVHHLDVKSAFLNGELEEEVYVTQPEGFEVPNEKHKMYKLSKALYGLRQAPRAWNIRLDRSLKELDFGKCTQEQAVYTRREEEECVFVVVYVNDLIV
        + W VHHLDVKSAFLNGELEEEVYV Q EGF    ++H + KLSKALYGLRQAPRAWNI LD+SLKEL F +C QEQAVYTR    E + V VYV+DLIV
Subjt:  QSWEVHHLDVKSAFLNGELEEEVYVTQPEGFEVPNEKHKMYKLSKALYGLRQAPRAWNIRLDRSLKELDFGKCTQEQAVYTRREEEECVFVVVYVNDLIV

Query:  TGSSTEKVKKFKQQMMAEFEMSDLGLLSYYLGIEVEQQKSRILLKQPTYAKRILSKFGMTNCNATKYPMEP----------KILTEHKPNLSYAVGMASR
        TG +  +V+ FK+QMM EFEM+DLGLLSYYLGIEV Q  + I LKQ  YAK+          +A  +P++           + L   +P+LSYAVG+ASR
Subjt:  TGSSTEKVKKFKQQMMAEFEMSDLGLLSYYLGIEVEQQKSRILLKQPTYAKRILSKFGMTNCNATKYPMEP----------KILTEHKPNLSYAVGMASR

Query:  YMERPITMHYKVVKQILRYLRGTIHFGFTYTKGPREFNIFGYSDSYLAGDLDGRKGTSGMTFYLNESLVSWNSQKQKTVAVSSCEVEFIAATTATCQALW
        +ME P   H K VKQILRYL+GTI  G  YT G  E  I GY+DS LAGD+D R+ T GM FY+N SLV+W+SQKQKTV +SSCE EF+AAT A C ALW
Subjt:  YMERPITMHYKVVKQILRYLRGTIHFGFTYTKGPREFNIFGYSDSYLAGDLDGRKGTSGMTFYLNESLVSWNSQKQKTVAVSSCEVEFIAATTATCQALW

Query:  LRCLVNEIVRMEPRPVTLFVDNKSAIALMKNLVF------------------------------HEQRANVMTKALTGVKLVAMHELLGVRNL
        LR L+ E++  E + V LFVDNKSAIALMKN VF                               EQRA+ +TK L   KLV    LL VR+L
Subjt:  LRCLVNEIVRMEPRPVTLFVDNKSAIALMKNLVF------------------------------HEQRANVMTKALTGVKLVAMHELLGVRNL

EEC84282.1 hypothetical protein OsI_30754 [Oryza sativa Indica Group]1.6e-17633.67Show/hide
Query:  YPMLTPHNYTVWTIKPEAILDAQGVGEAIEPARGAEVNVKKDKNAHAYILQCILEDVLLQIAKKKTAKEIWDSLKIRYFGSERVKMAR------------
        YP LT  NYT W I+ +A+++ QGV +AIEPA G  V+ ++DK A +++LQ + ED+L+Q+AKK++AKE+WD LK R+ G++RV+ AR            
Subjt:  YPMLTPHNYTVWTIKPEAILDAQGVGEAIEPARGAEVNVKKDKNAHAYILQCILEDVLLQIAKKKTAKEIWDSLKIRYFGSERVKMAR------------

Query:  --ETETIDEFAGKISGLVSKFTTLRVALEDSSLVKKLLDFVLDKYLSIIARIEQFQDLDTMPFEEGADLVVSIDE--------------DGEDVVVAL--
            ET+D++AG+I+ +  + + L   L DS++VKKL D V +K++S++A IEQF ++D MPFEE    + + +E              DG+ ++     
Subjt:  --ETETIDEFAGKISGLVSKFTTLRVALEDSSLVKKLLDFVLDKYLSIIARIEQFQDLDTMPFEEGADLVVSIDE--------------DGEDVVVAL--

Query:  --------------------------------------------SVKIVRGTSNTENDTRDKSHIKCFTCNKMGHYMSKC-RGKDRDDEAHLT-------
                                                    S     G   +    RDKSHIKC+ C + GHY ++C   K +  EAHL        
Subjt:  --------------------------------------------SVKIVRGTSNTENDTRDKSHIKCFTCNKMGHYMSKC-RGKDRDDEAHLT-------

Query:  ----CVFEEEPALMMVVSQE-----------GTHT--------------------------------------------------RCDQEDVILLSEERL
             V E+EPA  +VV +E           G  T                                                   C   D  LL +   
Subjt:  ----CVFEEEPALMMVVSQE-----------GTHT--------------------------------------------------RCDQEDVILLSEERL

Query:  LPEIYAN--------------------------------------------------------------------RFHH---------------------
        +P +  N                                                                    R  H                     
Subjt:  LPEIYAN--------------------------------------------------------------------RFHH---------------------

Query:  -----------------------------------------------------------------------AKSDAFESFKKFKLLMENKTKYKIRTLRT
                                                                                K  A E+F KFK L EN    +I+TLR+
Subjt:  -----------------------------------------------------------------------AKSDAFESFKKFKLLMENKTKYKIRTLRT

Query:  DRG---------------------------------ERRNLTVMAIARSLLKSMHVPAKFWEEAVRHAVYLLNRLPTKVLGERTPFEVWMGRKPHLAYLS
        DRG                                 ERRN +VMA+ARSL+K M VP +FW EAVRHAVYLLNRLPTK +G+RTPFE W GRKP L +L 
Subjt:  DRG---------------------------------ERRNLTVMAIARSLLKSMHVPAKFWEEAVRHAVYLLNRLPTKVLGERTPFEVWMGRKPHLAYLS

Query:  TWLC------------------------GIGR------------GKLQISEMLFFKR-------ILNGLVTKSLVTVR-----------------RLRSF
         + C                        G+              G++ +S  + F+        ++ G    +  TV                  R R+ 
Subjt:  TWLC------------------------GIGR------------GKLQISEMLFFKR-------ILNGLVTKSLVTVR-----------------RLRSF

Query:  SPSPP------------------------------------------------SMNTPVHLRSLTDIY-------------------------ANRDEVV
        SP+ P                                                S   PV  RSL DI                          ++  E  
Subjt:  SPSPP------------------------------------------------SMNTPVHLRSLTDIY-------------------------ANRDEVV

Query:  GGDEQEN--ENELKSIEKNNTWSLTELPPGHKSI-------------------------------------EVFAPVARLDTVRVILALAANQSWEVHHL
        G    EN    EL++IEKN+TW+LT LP GHK I                                     EVFAPVARLDTVRVILA+AA++ WEVHHL
Subjt:  GGDEQEN--ENELKSIEKNNTWSLTELPPGHKSI-------------------------------------EVFAPVARLDTVRVILALAANQSWEVHHL

Query:  DVKSAFLNGELEEEVYVTQPEGFEVPNEKHKMYKLSKALYGLRQAPRAWNIRLDRSLKELDFGKCTQEQAVYTRREEEECVFVVVYVNDLIVTGSSTEKV
        DVKSAFLNG+LEEEVYV QPEGF    E+H + +LSKALYGLRQAPRAWN RLD+ LKEL F +CTQEQAVYTR + +  V V VYV+DLIVTG + +++
Subjt:  DVKSAFLNGELEEEVYVTQPEGFEVPNEKHKMYKLSKALYGLRQAPRAWNIRLDRSLKELDFGKCTQEQAVYTRREEEECVFVVVYVNDLIVTGSSTEKV

Query:  KKFKQQMMAEFEMSDLGLLSYYLGIEVEQQKSRILLKQPTYAKRILSKFGMTNCNATKYPMEPKIL------------TEH-------------KPNLSY
          FKQQMM EFEMSDLGLLSYYLGIEV Q ++ I +KQ  YAK+ILS+FGM  CN T  PMEP+ L            TE+             +P+LSY
Subjt:  KKFKQQMMAEFEMSDLGLLSYYLGIEVEQQKSRILLKQPTYAKRILSKFGMTNCNATKYPMEPKIL------------TEH-------------KPNLSY

Query:  AVGMASRYMERPITMHYKVVKQILRYLRGTIHFGFTYTKGPREFNIFGYSDSYLAGDLDGRKGTSGMTFYLNESLVSWNSQKQKTVAVSSCEVEFIAATT
        AVG+ASR+MERP TMH K VK ILRYL+GT+  G  +  G    +I G++DS LAGD+D R+ T GM FY+N SLVSW SQKQKTVA+SSCE EF+AAT 
Subjt:  AVGMASRYMERPITMHYKVVKQILRYLRGTIHFGFTYTKGPREFNIFGYSDSYLAGDLDGRKGTSGMTFYLNESLVSWNSQKQKTVAVSSCEVEFIAATT

Query:  ATCQALWLRCLVNEIVRMEPRPVTLFVDNKSAIALMKNLVFH------------------------------EQRANVMTKALTGVKLVAMHELLGVRNL
        A C ALWLR L++E++  E +PV LFVDNKSAIALMKN VFH                              EQRA+ MTK L   KL     LLGVR+L
Subjt:  ATCQALWLRCLVNEIVRMEPRPVTLFVDNKSAIALMKNLVFH------------------------------EQRANVMTKALTGVKLVAMHELLGVRNL

Query:  ESCQD
           QD
Subjt:  ESCQD

KAB8107251.1 hypothetical protein EE612_041900 [Oryza sativa]5.0e-17533.67Show/hide
Query:  YPMLTPHNYTVWTIKPEAILDAQGVGEAIEPARGAEVNVKKDKNAHAYILQCILEDVLLQIAKKKTAKEIWDSLKIRYFGSERVKMAR------------
        YP LT  NYT W I+ +A+++ QGV +AIEPA G  V+ ++DK + +++LQ + ED+L+Q+AKK++AKE+WD LK R+ G++RV+ AR            
Subjt:  YPMLTPHNYTVWTIKPEAILDAQGVGEAIEPARGAEVNVKKDKNAHAYILQCILEDVLLQIAKKKTAKEIWDSLKIRYFGSERVKMAR------------

Query:  --ETETIDEFAGKISGLVSKFTTLRVALEDSSLVKKLLDFVLDKYLSIIARIEQFQDLDTMPFEEGADLVVSIDE--------------DGEDVVVAL--
            ET+D++AG+I+ +  + + L   L DS++VKKL D V +K++S++A IEQF ++D MPFEE    + + +E              DG+ ++     
Subjt:  --ETETIDEFAGKISGLVSKFTTLRVALEDSSLVKKLLDFVLDKYLSIIARIEQFQDLDTMPFEEGADLVVSIDE--------------DGEDVVVAL--

Query:  --------------------------------------------SVKIVRGTSNTENDTRDKSHIKCFTCNKMGHYMSKC-RGKDRDDEAHLT-------
                                                    S     G   +    RDKSHIKC+ C + GHY ++C   K +  EAHL        
Subjt:  --------------------------------------------SVKIVRGTSNTENDTRDKSHIKCFTCNKMGHYMSKC-RGKDRDDEAHLT-------

Query:  ----CVFEEEPALMMVVSQE-----------GTHT--------------------------------------------------RCDQEDVILLSEERL
             V E+EPA  +VV +E           G  T                                                   C   D  LL +   
Subjt:  ----CVFEEEPALMMVVSQE-----------GTHT--------------------------------------------------RCDQEDVILLSEERL

Query:  LPEIYAN--------------------------------------------------------------------RFHH---------------------
        +P +  N                                                                    R  H                     
Subjt:  LPEIYAN--------------------------------------------------------------------RFHH---------------------

Query:  -----------------------------------------------------------------------AKSDAFESFKKFKLLMENKTKYKIRTLRT
                                                                                K  A E+F KFK L EN    +I+TLR+
Subjt:  -----------------------------------------------------------------------AKSDAFESFKKFKLLMENKTKYKIRTLRT

Query:  DRG---------------------------------ERRNLTVMAIARSLLKSMHVPAKFWEEAVRHAVYLLNRLPTKVLGERTPFEVWMGRKPHLAYLS
        DRG                                 ERRN +VMA+ARSL+K M VP +FW EAVRHAVYLLNRLPTK +G+RTPFE W GRKP L +L 
Subjt:  DRG---------------------------------ERRNLTVMAIARSLLKSMHVPAKFWEEAVRHAVYLLNRLPTKVLGERTPFEVWMGRKPHLAYLS

Query:  TWLC------------------------GIGR------------GKLQISEMLFFKR-------ILNGLVTKSLVTVR-----------------RLRSF
         + C                        G+              G++ +S  + F+        ++ G    +  TV                  R R+ 
Subjt:  TWLC------------------------GIGR------------GKLQISEMLFFKR-------ILNGLVTKSLVTVR-----------------RLRSF

Query:  SP----SPP--------------------------------------------SMNTPVHLRSLTDIY-------------------------ANRDEVV
        SP    SPP                                            S   PV  RSL DI                          ++  E  
Subjt:  SP----SPP--------------------------------------------SMNTPVHLRSLTDIY-------------------------ANRDEVV

Query:  GGDEQEN--ENELKSIEKNNTWSLTELPPGHKSI-------------------------------------EVFAPVARLDTVRVILALAANQSWEVHHL
        G    EN    EL++IEKN+TW+LT LP GHK I                                     EVFAPVARLDTVRVILA+AA++ WEVHHL
Subjt:  GGDEQEN--ENELKSIEKNNTWSLTELPPGHKSI-------------------------------------EVFAPVARLDTVRVILALAANQSWEVHHL

Query:  DVKSAFLNGELEEEVYVTQPEGFEVPNEKHKMYKLSKALYGLRQAPRAWNIRLDRSLKELDFGKCTQEQAVYTRREEEECVFVVVYVNDLIVTGSSTEKV
        DVKSAFLNG+LEEEVYV QPEGF    E+H + +LSKALYGLRQAPRAWN RLD+ LKEL F +CTQEQAVYTR + +  V V VYV+DLIVTG +  ++
Subjt:  DVKSAFLNGELEEEVYVTQPEGFEVPNEKHKMYKLSKALYGLRQAPRAWNIRLDRSLKELDFGKCTQEQAVYTRREEEECVFVVVYVNDLIVTGSSTEKV

Query:  KKFKQQMMAEFEMSDLGLLSYYLGIEVEQQKSRILLKQPTYAKRILSKFGMTNCNATKYPMEPKIL------------TEH-------------KPNLSY
          FKQQMM EFEMSDLGLLSYYLGIEV Q ++ I +KQ  YAK+ILS+FGM  CN T  PMEP+ L            TE+             +P+LSY
Subjt:  KKFKQQMMAEFEMSDLGLLSYYLGIEVEQQKSRILLKQPTYAKRILSKFGMTNCNATKYPMEPKIL------------TEH-------------KPNLSY

Query:  AVGMASRYMERPITMHYKVVKQILRYLRGTIHFGFTYTKGPREFNIFGYSDSYLAGDLDGRKGTSGMTFYLNESLVSWNSQKQKTVAVSSCEVEFIAATT
        AVG+ASR+MERP TMH K VK ILRYL+GT+  G  +  G    +I G++DS LAGD+D R+ T GM FY+N SLVSW SQKQKTVA+SSCE EF+AAT 
Subjt:  AVGMASRYMERPITMHYKVVKQILRYLRGTIHFGFTYTKGPREFNIFGYSDSYLAGDLDGRKGTSGMTFYLNESLVSWNSQKQKTVAVSSCEVEFIAATT

Query:  ATCQALWLRCLVNEIVRMEPRPVTLFVDNKSAIALMKNLVFH------------------------------EQRANVMTKALTGVKLVAMHELLGVRNL
        A C ALWLR L++E++  E + V LFVDNKSAIALMKN VFH                              EQRA+ MTK L   KL     LLGVR+L
Subjt:  ATCQALWLRCLVNEIVRMEPRPVTLFVDNKSAIALMKNLVFH------------------------------EQRANVMTKALTGVKLVAMHELLGVRNL

Query:  ESCQD
           QD
Subjt:  ESCQD

TrEMBL top hitse value%identityAlignment
A0A0P0XB91 Os08g0125300 protein2.4e-17533.67Show/hide
Query:  YPMLTPHNYTVWTIKPEAILDAQGVGEAIEPARGAEVNVKKDKNAHAYILQCILEDVLLQIAKKKTAKEIWDSLKIRYFGSERVKMAR------------
        YP LT  NYT W I+ +A+++ QGV +AIEPA G  V+ ++DK + +++LQ + ED+L+Q+AKK++AKE+WD LK R+ G++RV+ AR            
Subjt:  YPMLTPHNYTVWTIKPEAILDAQGVGEAIEPARGAEVNVKKDKNAHAYILQCILEDVLLQIAKKKTAKEIWDSLKIRYFGSERVKMAR------------

Query:  --ETETIDEFAGKISGLVSKFTTLRVALEDSSLVKKLLDFVLDKYLSIIARIEQFQDLDTMPFEEGADLVVSIDE--------------DGEDVVVAL--
            ET+D++AG+I+ +  + + L   L DS++VKKL D V +K++S++A IEQF ++D MPFEE    + + +E              DG+ ++     
Subjt:  --ETETIDEFAGKISGLVSKFTTLRVALEDSSLVKKLLDFVLDKYLSIIARIEQFQDLDTMPFEEGADLVVSIDE--------------DGEDVVVAL--

Query:  --------------------------------------------SVKIVRGTSNTENDTRDKSHIKCFTCNKMGHYMSKC-RGKDRDDEAHLT-------
                                                    S     G   +    RDKSHIKC+ C + GHY ++C   K +  EAHL        
Subjt:  --------------------------------------------SVKIVRGTSNTENDTRDKSHIKCFTCNKMGHYMSKC-RGKDRDDEAHLT-------

Query:  ----CVFEEEPALMMVVSQE-----------GTHT--------------------------------------------------RCDQEDVILLSEERL
             V E+EPA  +VV +E           G  T                                                   C   D  LL +   
Subjt:  ----CVFEEEPALMMVVSQE-----------GTHT--------------------------------------------------RCDQEDVILLSEERL

Query:  LPEIYAN--------------------------------------------------------------------RFHH---------------------
        +P +  N                                                                    R  H                     
Subjt:  LPEIYAN--------------------------------------------------------------------RFHH---------------------

Query:  -----------------------------------------------------------------------AKSDAFESFKKFKLLMENKTKYKIRTLRT
                                                                                K  A E+F KFK L EN    +I+TLR+
Subjt:  -----------------------------------------------------------------------AKSDAFESFKKFKLLMENKTKYKIRTLRT

Query:  DRG---------------------------------ERRNLTVMAIARSLLKSMHVPAKFWEEAVRHAVYLLNRLPTKVLGERTPFEVWMGRKPHLAYLS
        DRG                                 ERRN +VMA+ARSL+K M VP +FW EAVRHAVYLLNRLPTK +G+RTPFE W GRKP L +L 
Subjt:  DRG---------------------------------ERRNLTVMAIARSLLKSMHVPAKFWEEAVRHAVYLLNRLPTKVLGERTPFEVWMGRKPHLAYLS

Query:  TWLC------------------------GIGR------------GKLQISEMLFFKR-------ILNGLVTKSLVTVR-----------------RLRSF
         + C                        G+              G++ +S  + F+        ++ G    +  TV                  R R+ 
Subjt:  TWLC------------------------GIGR------------GKLQISEMLFFKR-------ILNGLVTKSLVTVR-----------------RLRSF

Query:  SP----SPP--------------------------------------------SMNTPVHLRSLTDIY-------------------------ANRDEVV
        SP    SPP                                            S   PV  RSL DI                          ++  E  
Subjt:  SP----SPP--------------------------------------------SMNTPVHLRSLTDIY-------------------------ANRDEVV

Query:  GGDEQEN--ENELKSIEKNNTWSLTELPPGHKSI-------------------------------------EVFAPVARLDTVRVILALAANQSWEVHHL
        G    EN    EL++IEKN+TW+LT LP GHK I                                     EVFAPVARLDTVRVILA+AA++ WEVHHL
Subjt:  GGDEQEN--ENELKSIEKNNTWSLTELPPGHKSI-------------------------------------EVFAPVARLDTVRVILALAANQSWEVHHL

Query:  DVKSAFLNGELEEEVYVTQPEGFEVPNEKHKMYKLSKALYGLRQAPRAWNIRLDRSLKELDFGKCTQEQAVYTRREEEECVFVVVYVNDLIVTGSSTEKV
        DVKSAFLNG+LEEEVYV QPEGF    E+H + +LSKALYGLRQAPRAWN RLD+ LKEL F +CTQEQAVYTR + +  V V VYV+DLIVTG +  ++
Subjt:  DVKSAFLNGELEEEVYVTQPEGFEVPNEKHKMYKLSKALYGLRQAPRAWNIRLDRSLKELDFGKCTQEQAVYTRREEEECVFVVVYVNDLIVTGSSTEKV

Query:  KKFKQQMMAEFEMSDLGLLSYYLGIEVEQQKSRILLKQPTYAKRILSKFGMTNCNATKYPMEPKIL------------TEH-------------KPNLSY
          FKQQMM EFEMSDLGLLSYYLGIEV Q ++ I +KQ  YAK+ILS+FGM  CN T  PMEP+ L            TE+             +P+LSY
Subjt:  KKFKQQMMAEFEMSDLGLLSYYLGIEVEQQKSRILLKQPTYAKRILSKFGMTNCNATKYPMEPKIL------------TEH-------------KPNLSY

Query:  AVGMASRYMERPITMHYKVVKQILRYLRGTIHFGFTYTKGPREFNIFGYSDSYLAGDLDGRKGTSGMTFYLNESLVSWNSQKQKTVAVSSCEVEFIAATT
        AVG+ASR+MERP TMH K VK ILRYL+GT+  G  +  G    +I G++DS LAGD+D R+ T GM FY+N SLVSW SQKQKTVA+SSCE EF+AAT 
Subjt:  AVGMASRYMERPITMHYKVVKQILRYLRGTIHFGFTYTKGPREFNIFGYSDSYLAGDLDGRKGTSGMTFYLNESLVSWNSQKQKTVAVSSCEVEFIAATT

Query:  ATCQALWLRCLVNEIVRMEPRPVTLFVDNKSAIALMKNLVFH------------------------------EQRANVMTKALTGVKLVAMHELLGVRNL
        A C ALWLR L++E++  E + V LFVDNKSAIALMKN VFH                              EQRA+ MTK L   KL     LLGVR+L
Subjt:  ATCQALWLRCLVNEIVRMEPRPVTLFVDNKSAIALMKNLVFH------------------------------EQRANVMTKALTGVKLVAMHELLGVRNL

Query:  ESCQD
           QD
Subjt:  ESCQD

B8BDZ6 Uncharacterized protein7.5e-17733.67Show/hide
Query:  YPMLTPHNYTVWTIKPEAILDAQGVGEAIEPARGAEVNVKKDKNAHAYILQCILEDVLLQIAKKKTAKEIWDSLKIRYFGSERVKMAR------------
        YP LT  NYT W I+ +A+++ QGV +AIEPA G  V+ ++DK A +++LQ + ED+L+Q+AKK++AKE+WD LK R+ G++RV+ AR            
Subjt:  YPMLTPHNYTVWTIKPEAILDAQGVGEAIEPARGAEVNVKKDKNAHAYILQCILEDVLLQIAKKKTAKEIWDSLKIRYFGSERVKMAR------------

Query:  --ETETIDEFAGKISGLVSKFTTLRVALEDSSLVKKLLDFVLDKYLSIIARIEQFQDLDTMPFEEGADLVVSIDE--------------DGEDVVVAL--
            ET+D++AG+I+ +  + + L   L DS++VKKL D V +K++S++A IEQF ++D MPFEE    + + +E              DG+ ++     
Subjt:  --ETETIDEFAGKISGLVSKFTTLRVALEDSSLVKKLLDFVLDKYLSIIARIEQFQDLDTMPFEEGADLVVSIDE--------------DGEDVVVAL--

Query:  --------------------------------------------SVKIVRGTSNTENDTRDKSHIKCFTCNKMGHYMSKC-RGKDRDDEAHLT-------
                                                    S     G   +    RDKSHIKC+ C + GHY ++C   K +  EAHL        
Subjt:  --------------------------------------------SVKIVRGTSNTENDTRDKSHIKCFTCNKMGHYMSKC-RGKDRDDEAHLT-------

Query:  ----CVFEEEPALMMVVSQE-----------GTHT--------------------------------------------------RCDQEDVILLSEERL
             V E+EPA  +VV +E           G  T                                                   C   D  LL +   
Subjt:  ----CVFEEEPALMMVVSQE-----------GTHT--------------------------------------------------RCDQEDVILLSEERL

Query:  LPEIYAN--------------------------------------------------------------------RFHH---------------------
        +P +  N                                                                    R  H                     
Subjt:  LPEIYAN--------------------------------------------------------------------RFHH---------------------

Query:  -----------------------------------------------------------------------AKSDAFESFKKFKLLMENKTKYKIRTLRT
                                                                                K  A E+F KFK L EN    +I+TLR+
Subjt:  -----------------------------------------------------------------------AKSDAFESFKKFKLLMENKTKYKIRTLRT

Query:  DRG---------------------------------ERRNLTVMAIARSLLKSMHVPAKFWEEAVRHAVYLLNRLPTKVLGERTPFEVWMGRKPHLAYLS
        DRG                                 ERRN +VMA+ARSL+K M VP +FW EAVRHAVYLLNRLPTK +G+RTPFE W GRKP L +L 
Subjt:  DRG---------------------------------ERRNLTVMAIARSLLKSMHVPAKFWEEAVRHAVYLLNRLPTKVLGERTPFEVWMGRKPHLAYLS

Query:  TWLC------------------------GIGR------------GKLQISEMLFFKR-------ILNGLVTKSLVTVR-----------------RLRSF
         + C                        G+              G++ +S  + F+        ++ G    +  TV                  R R+ 
Subjt:  TWLC------------------------GIGR------------GKLQISEMLFFKR-------ILNGLVTKSLVTVR-----------------RLRSF

Query:  SPSPP------------------------------------------------SMNTPVHLRSLTDIY-------------------------ANRDEVV
        SP+ P                                                S   PV  RSL DI                          ++  E  
Subjt:  SPSPP------------------------------------------------SMNTPVHLRSLTDIY-------------------------ANRDEVV

Query:  GGDEQEN--ENELKSIEKNNTWSLTELPPGHKSI-------------------------------------EVFAPVARLDTVRVILALAANQSWEVHHL
        G    EN    EL++IEKN+TW+LT LP GHK I                                     EVFAPVARLDTVRVILA+AA++ WEVHHL
Subjt:  GGDEQEN--ENELKSIEKNNTWSLTELPPGHKSI-------------------------------------EVFAPVARLDTVRVILALAANQSWEVHHL

Query:  DVKSAFLNGELEEEVYVTQPEGFEVPNEKHKMYKLSKALYGLRQAPRAWNIRLDRSLKELDFGKCTQEQAVYTRREEEECVFVVVYVNDLIVTGSSTEKV
        DVKSAFLNG+LEEEVYV QPEGF    E+H + +LSKALYGLRQAPRAWN RLD+ LKEL F +CTQEQAVYTR + +  V V VYV+DLIVTG + +++
Subjt:  DVKSAFLNGELEEEVYVTQPEGFEVPNEKHKMYKLSKALYGLRQAPRAWNIRLDRSLKELDFGKCTQEQAVYTRREEEECVFVVVYVNDLIVTGSSTEKV

Query:  KKFKQQMMAEFEMSDLGLLSYYLGIEVEQQKSRILLKQPTYAKRILSKFGMTNCNATKYPMEPKIL------------TEH-------------KPNLSY
          FKQQMM EFEMSDLGLLSYYLGIEV Q ++ I +KQ  YAK+ILS+FGM  CN T  PMEP+ L            TE+             +P+LSY
Subjt:  KKFKQQMMAEFEMSDLGLLSYYLGIEVEQQKSRILLKQPTYAKRILSKFGMTNCNATKYPMEPKIL------------TEH-------------KPNLSY

Query:  AVGMASRYMERPITMHYKVVKQILRYLRGTIHFGFTYTKGPREFNIFGYSDSYLAGDLDGRKGTSGMTFYLNESLVSWNSQKQKTVAVSSCEVEFIAATT
        AVG+ASR+MERP TMH K VK ILRYL+GT+  G  +  G    +I G++DS LAGD+D R+ T GM FY+N SLVSW SQKQKTVA+SSCE EF+AAT 
Subjt:  AVGMASRYMERPITMHYKVVKQILRYLRGTIHFGFTYTKGPREFNIFGYSDSYLAGDLDGRKGTSGMTFYLNESLVSWNSQKQKTVAVSSCEVEFIAATT

Query:  ATCQALWLRCLVNEIVRMEPRPVTLFVDNKSAIALMKNLVFH------------------------------EQRANVMTKALTGVKLVAMHELLGVRNL
        A C ALWLR L++E++  E +PV LFVDNKSAIALMKN VFH                              EQRA+ MTK L   KL     LLGVR+L
Subjt:  ATCQALWLRCLVNEIVRMEPRPVTLFVDNKSAIALMKNLVFH------------------------------EQRANVMTKALTGVKLVAMHELLGVRNL

Query:  ESCQD
           QD
Subjt:  ESCQD

Q0J8A6 Os08g0125300 protein2.4e-17533.67Show/hide
Query:  YPMLTPHNYTVWTIKPEAILDAQGVGEAIEPARGAEVNVKKDKNAHAYILQCILEDVLLQIAKKKTAKEIWDSLKIRYFGSERVKMAR------------
        YP LT  NYT W I+ +A+++ QGV +AIEPA G  V+ ++DK + +++LQ + ED+L+Q+AKK++AKE+WD LK R+ G++RV+ AR            
Subjt:  YPMLTPHNYTVWTIKPEAILDAQGVGEAIEPARGAEVNVKKDKNAHAYILQCILEDVLLQIAKKKTAKEIWDSLKIRYFGSERVKMAR------------

Query:  --ETETIDEFAGKISGLVSKFTTLRVALEDSSLVKKLLDFVLDKYLSIIARIEQFQDLDTMPFEEGADLVVSIDE--------------DGEDVVVAL--
            ET+D++AG+I+ +  + + L   L DS++VKKL D V +K++S++A IEQF ++D MPFEE    + + +E              DG+ ++     
Subjt:  --ETETIDEFAGKISGLVSKFTTLRVALEDSSLVKKLLDFVLDKYLSIIARIEQFQDLDTMPFEEGADLVVSIDE--------------DGEDVVVAL--

Query:  --------------------------------------------SVKIVRGTSNTENDTRDKSHIKCFTCNKMGHYMSKC-RGKDRDDEAHLT-------
                                                    S     G   +    RDKSHIKC+ C + GHY ++C   K +  EAHL        
Subjt:  --------------------------------------------SVKIVRGTSNTENDTRDKSHIKCFTCNKMGHYMSKC-RGKDRDDEAHLT-------

Query:  ----CVFEEEPALMMVVSQE-----------GTHT--------------------------------------------------RCDQEDVILLSEERL
             V E+EPA  +VV +E           G  T                                                   C   D  LL +   
Subjt:  ----CVFEEEPALMMVVSQE-----------GTHT--------------------------------------------------RCDQEDVILLSEERL

Query:  LPEIYAN--------------------------------------------------------------------RFHH---------------------
        +P +  N                                                                    R  H                     
Subjt:  LPEIYAN--------------------------------------------------------------------RFHH---------------------

Query:  -----------------------------------------------------------------------AKSDAFESFKKFKLLMENKTKYKIRTLRT
                                                                                K  A E+F KFK L EN    +I+TLR+
Subjt:  -----------------------------------------------------------------------AKSDAFESFKKFKLLMENKTKYKIRTLRT

Query:  DRG---------------------------------ERRNLTVMAIARSLLKSMHVPAKFWEEAVRHAVYLLNRLPTKVLGERTPFEVWMGRKPHLAYLS
        DRG                                 ERRN +VMA+ARSL+K M VP +FW EAVRHAVYLLNRLPTK +G+RTPFE W GRKP L +L 
Subjt:  DRG---------------------------------ERRNLTVMAIARSLLKSMHVPAKFWEEAVRHAVYLLNRLPTKVLGERTPFEVWMGRKPHLAYLS

Query:  TWLC------------------------GIGR------------GKLQISEMLFFKR-------ILNGLVTKSLVTVR-----------------RLRSF
         + C                        G+              G++ +S  + F+        ++ G    +  TV                  R R+ 
Subjt:  TWLC------------------------GIGR------------GKLQISEMLFFKR-------ILNGLVTKSLVTVR-----------------RLRSF

Query:  SP----SPP--------------------------------------------SMNTPVHLRSLTDIY-------------------------ANRDEVV
        SP    SPP                                            S   PV  RSL DI                          ++  E  
Subjt:  SP----SPP--------------------------------------------SMNTPVHLRSLTDIY-------------------------ANRDEVV

Query:  GGDEQEN--ENELKSIEKNNTWSLTELPPGHKSI-------------------------------------EVFAPVARLDTVRVILALAANQSWEVHHL
        G    EN    EL++IEKN+TW+LT LP GHK I                                     EVFAPVARLDTVRVILA+AA++ WEVHHL
Subjt:  GGDEQEN--ENELKSIEKNNTWSLTELPPGHKSI-------------------------------------EVFAPVARLDTVRVILALAANQSWEVHHL

Query:  DVKSAFLNGELEEEVYVTQPEGFEVPNEKHKMYKLSKALYGLRQAPRAWNIRLDRSLKELDFGKCTQEQAVYTRREEEECVFVVVYVNDLIVTGSSTEKV
        DVKSAFLNG+LEEEVYV QPEGF    E+H + +LSKALYGLRQAPRAWN RLD+ LKEL F +CTQEQAVYTR + +  V V VYV+DLIVTG +  ++
Subjt:  DVKSAFLNGELEEEVYVTQPEGFEVPNEKHKMYKLSKALYGLRQAPRAWNIRLDRSLKELDFGKCTQEQAVYTRREEEECVFVVVYVNDLIVTGSSTEKV

Query:  KKFKQQMMAEFEMSDLGLLSYYLGIEVEQQKSRILLKQPTYAKRILSKFGMTNCNATKYPMEPKIL------------TEH-------------KPNLSY
          FKQQMM EFEMSDLGLLSYYLGIEV Q ++ I +KQ  YAK+ILS+FGM  CN T  PMEP+ L            TE+             +P+LSY
Subjt:  KKFKQQMMAEFEMSDLGLLSYYLGIEVEQQKSRILLKQPTYAKRILSKFGMTNCNATKYPMEPKIL------------TEH-------------KPNLSY

Query:  AVGMASRYMERPITMHYKVVKQILRYLRGTIHFGFTYTKGPREFNIFGYSDSYLAGDLDGRKGTSGMTFYLNESLVSWNSQKQKTVAVSSCEVEFIAATT
        AVG+ASR+MERP TMH K VK ILRYL+GT+  G  +  G    +I G++DS LAGD+D R+ T GM FY+N SLVSW SQKQKTVA+SSCE EF+AAT 
Subjt:  AVGMASRYMERPITMHYKVVKQILRYLRGTIHFGFTYTKGPREFNIFGYSDSYLAGDLDGRKGTSGMTFYLNESLVSWNSQKQKTVAVSSCEVEFIAATT

Query:  ATCQALWLRCLVNEIVRMEPRPVTLFVDNKSAIALMKNLVFH------------------------------EQRANVMTKALTGVKLVAMHELLGVRNL
        A C ALWLR L++E++  E + V LFVDNKSAIALMKN VFH                              EQRA+ MTK L   KL     LLGVR+L
Subjt:  ATCQALWLRCLVNEIVRMEPRPVTLFVDNKSAIALMKNLVFH------------------------------EQRANVMTKALTGVKLVAMHELLGVRNL

Query:  ESCQD
           QD
Subjt:  ESCQD

Q10K27 Retrotransposon protein, putative, Ty1-copia subclass2.3e-16534.11Show/hide
Query:  LYPMLTPHNYTVWTIKPEAILDAQGVGEAIEPARGAEVNVKKDKNAHAYILQCILEDVLLQIAKKKTAKEIWDSLKIRYFGSERVKMAR-----------
        LYP LT  NYT W I+ +A+++ QGV EAIEPA GA V+ +KDK   +++LQ + ED+L+Q+AKK+TAKE+WD LK R+ G+++V+ AR           
Subjt:  LYPMLTPHNYTVWTIKPEAILDAQGVGEAIEPARGAEVNVKKDKNAHAYILQCILEDVLLQIAKKKTAKEIWDSLKIRYFGSERVKMAR-----------

Query:  ---ETETIDEFAGKISGLVSKFTTLRVALEDSSLVKKLLDFVLDKYLSIIARIEQFQDLDTMPFEEGADLVVSIDE--------------DGEDVVVAL-
             E++D++AG+I+ +  + + L   L D+++VKKL D V +K++S++  IEQF ++D MPFEE    + + DE              DG+ ++    
Subjt:  ---ETETIDEFAGKISGLVSKFTTLRVALEDSSLVKKLLDFVLDKYLSIIARIEQFQDLDTMPFEEGADLVVSIDE--------------DGEDVVVAL-

Query:  -SVKIVRGTSNTENDT--------------------------------------RDKSHIKCFTCNKMGHYMSKC-RGKDRDDEAHLTCVFEEEPALM--
           +  +G   T +                                        RDKSHIKCF   + GHY ++C   K +  EAHL    +  PAL+  
Subjt:  -SVKIVRGTSNTENDT--------------------------------------RDKSHIKCFTCNKMGHYMSKC-RGKDRDDEAHLTCVFEEEPALM--

Query:  ----------------MVVSQEGTHTR-------------------------------------------------------------CDQEDVILLSEE
                        +VV +E    R                                                             C   D  LL + 
Subjt:  ----------------MVVSQEGTHTR-------------------------------------------------------------CDQEDVILLSEE

Query:  RLLPEIYAN-------------------------------------------------------------------------------------------
          +P +  N                                                                                           
Subjt:  RLLPEIYAN-------------------------------------------------------------------------------------------

Query:  RFHH---------------------------------------------------------------------AKSDAFESFKKFKLLMENKTKYKIRTL
        + HH                                                                      K  A E F KFK L EN   + I+TL
Subjt:  RFHH---------------------------------------------------------------------AKSDAFESFKKFKLLMENKTKYKIRTL

Query:  RTDRG---------------------------------ERRNLTVMAIARSLLKSMHVPAKFWEEAVRHAVYLLNRLPTKVLGERTPFEVWMGRKPHLAY
        RTD                                   E RN +VMA+ARSLLK M VP + W EAVRHAV+LLNRLPTK +GE TPFE W G+KPHL +
Subjt:  RTDRG---------------------------------ERRNLTVMAIARSLLKSMHVPAKFWEEAVRHAVYLLNRLPTKVLGERTPFEVWMGRKPHLAY

Query:  LSTWLCGI-------------GRGKLQISEMLFFKRI--LNGLVTKSLVTVRRLRSFSPSPPSMNTPVHLRSLTDIYANRDEVVGGDEQENE--------
        L  + C                R +   +E +  + +      + +   TV    S + S    + P+  RSL DI      V   DE++ E        
Subjt:  LSTWLCGI-------------GRGKLQISEMLFFKRI--LNGLVTKSLVTVRRLRSFSPSPPSMNTPVHLRSLTDIYANRDEVVGGDEQENE--------

Query:  ------------------NELKSIEKNNTWSLTELPPGHKSI-------------------------------------EVFAPVARLDTVRVILALAAN
                           EL++IEKN+TWSL  LP GHK+I                                     EVFAPVARLDTVR ILA+AA+
Subjt:  ------------------NELKSIEKNNTWSLTELPPGHKSI-------------------------------------EVFAPVARLDTVRVILALAAN

Query:  QSWEVHHLDVKSAFLNGELEEEVYVTQPEGFEVPNEKHKMYKLSKALYGLRQAPRAWNIRLDRSLKELDFGKCTQEQAVYTRREEEECVFVVVYVNDLIV
        + W VHHLDVKSAFLNGELEEEVYV Q EGF    ++H + KLSKALYGLRQAPRAWNI LD+SLKEL F +C QEQAVYTR    E + V VYV+DLIV
Subjt:  QSWEVHHLDVKSAFLNGELEEEVYVTQPEGFEVPNEKHKMYKLSKALYGLRQAPRAWNIRLDRSLKELDFGKCTQEQAVYTRREEEECVFVVVYVNDLIV

Query:  TGSSTEKVKKFKQQMMAEFEMSDLGLLSYYLGIEVEQQKSRILLKQPTYAKRILSKFGMTNCNATKYPMEP----------KILTEHKPNLSYAVGMASR
        TG +  +V+ FK+QMM EFEM+DLGLLSYYLGIEV Q  + I LKQ  YAK+          +A  +P++           + L   +P+LSYAVG+ASR
Subjt:  TGSSTEKVKKFKQQMMAEFEMSDLGLLSYYLGIEVEQQKSRILLKQPTYAKRILSKFGMTNCNATKYPMEP----------KILTEHKPNLSYAVGMASR

Query:  YMERPITMHYKVVKQILRYLRGTIHFGFTYTKGPREFNIFGYSDSYLAGDLDGRKGTSGMTFYLNESLVSWNSQKQKTVAVSSCEVEFIAATTATCQALW
        +ME P   H K VKQILRYL+GTI  G  YT G  E  I GY+DS LAGD+D R+ T GM FY+N SLV+W+SQKQKTV +SSCE EF+AAT A C ALW
Subjt:  YMERPITMHYKVVKQILRYLRGTIHFGFTYTKGPREFNIFGYSDSYLAGDLDGRKGTSGMTFYLNESLVSWNSQKQKTVAVSSCEVEFIAATTATCQALW

Query:  LRCLVNEIVRMEPRPVTLFVDNKSAIALMKNLVF------------------------------HEQRANVMTKALTGVKLVAMHELLGVRNL
        LR L+ E++  E + V LFVDNKSAIALMKN VF                               EQRA+ +TK L   KLV    LL VR+L
Subjt:  LRCLVNEIVRMEPRPVTLFVDNKSAIALMKNLVF------------------------------HEQRANVMTKALTGVKLVAMHELLGVRNL

Q10RM4 Retrotransposon protein, putative, unclassified5.8e-17733.02Show/hide
Query:  YPMLTPHNYTVWTIKPEAILDAQGVGEAIEPARGAEVNVKKDKNAHAYILQCILEDVLLQIAKKKTAKEIWDSLKIRYFGSERVKMAR------------
        YP LT  NYT W I+ +A+++ QGV +AIEPA G  V+ ++DK A +++LQ   ED+L+Q+AKK++AKE+WD LK R+ G++RV+ AR            
Subjt:  YPMLTPHNYTVWTIKPEAILDAQGVGEAIEPARGAEVNVKKDKNAHAYILQCILEDVLLQIAKKKTAKEIWDSLKIRYFGSERVKMAR------------

Query:  --ETETIDEFAGKISGLVSKFTTLRVALEDSSLVKKLLDFVLDKYLSIIARIEQFQDLDTMPFEEGADLVVSIDE--------------DGEDVVVALSV
            ET+D++AG+I+ +  + + L   L DS++VKKL D V +K++S++A IEQF D+DTMPFEE    + + +E              DG+ ++     
Subjt:  --ETETIDEFAGKISGLVSKFTTLRVALEDSSLVKKLLDFVLDKYLSIIARIEQFQDLDTMPFEEGADLVVSIDE--------------DGEDVVVALSV

Query:  K-------------------------------------------IVRGTS--NTENDTRDKSHIKCFTCNKMGHYMSKC-RGKDRDDEAHLTCVFEEEPA
        +                                             RG S   +    RDKSHIKCF C + GHY ++C   K +  EAHL    +  PA
Subjt:  K-------------------------------------------IVRGTS--NTENDTRDKSHIKCFTCNKMGHYMSKC-RGKDRDDEAHLTCVFEEEPA

Query:  LMMVVSQE-------------------------GTHT--------------------------------------------------RCDQEDVILLSEE
        L++ V+++                         G H                                                    C  +D  LL + 
Subjt:  LMMVVSQE-------------------------GTHT--------------------------------------------------RCDQEDVILLSEE

Query:  RLLPEIYANRFH----------------------------------------------------------------------------------------
          +P +  N                                                                                           
Subjt:  RLLPEIYANRFH----------------------------------------------------------------------------------------

Query:  ------------------------------------------------HAKSDAFESFKKFKLLMENKTKYKIRTLRTDRG-------------------
                                                          K  + ++F KFK L EN    +++TLR+DRG                   
Subjt:  ------------------------------------------------HAKSDAFESFKKFKLLMENKTKYKIRTLRTDRG-------------------

Query:  --------------ERRNLTVMAIARSLLKSMHVPAKFWEEAVRHAVYLLNRLPTKVLGERTPFEVWMGRKPHLAYLSTWLC------------------
                      ERRN TVMA+ARSLLK M VP +FW EAVRHAV+LLNRLPTK +G+RTPFE W GRKP L +L  + C                  
Subjt:  --------------ERRNLTVMAIARSLLKSMHVPAKFWEEAVRHAVYLLNRLPTKVLGERTPFEVWMGRKPHLAYLSTWLC------------------

Query:  ------GI------------GRGKLQISEMLFFKR-------------------------------ILNGLV-------------TKSLVTVRRLRS---
              G+             RG++ +S  + F+                                   G+V              +++V    + S   
Subjt:  ------GI------------GRGKLQISEMLFFKR-------------------------------ILNGLV-------------TKSLVTVRRLRS---

Query:  -----------FSPSPPSMNT------------------PVHLRSLTDIY--ANRDEVVGGD-------------------------EQENENELKSIEK
                    +P  PS N+                  P   RSL D+   A R ++V  +                         E+  + E+++IEK
Subjt:  -----------FSPSPPSMNT------------------PVHLRSLTDIY--ANRDEVVGGD-------------------------EQENENELKSIEK

Query:  NNTWSLTELPPGHKSI-------------------------------------EVFAPVARLDTVRVILALAANQSWEVHHLDVKSAFLNGELEEEVYVT
        N TW L  LP GH++I                                     EVFAPVARLDTVRV+LA+AA++ W+VHHLDVKSAFLNGELEEEVYV 
Subjt:  NNTWSLTELPPGHKSI-------------------------------------EVFAPVARLDTVRVILALAANQSWEVHHLDVKSAFLNGELEEEVYVT

Query:  QPEGFEVPNEKHKMYKLSKALYGLRQAPRAWNIRLDRSLKELDFGKCTQEQAVYTRREEEECVFVVVYVNDLIVTGSSTEKVKKFKQQMMAEFEMSDLGL
        QPEGF    ++H + KL KALYGLRQAPRAWNIRLDRSL+EL F +CTQEQAVYTR    + + V VYV+DLIVTG +  ++K FK+QMM EFEMSDLGL
Subjt:  QPEGFEVPNEKHKMYKLSKALYGLRQAPRAWNIRLDRSLKELDFGKCTQEQAVYTRREEEECVFVVVYVNDLIVTGSSTEKVKKFKQQMMAEFEMSDLGL

Query:  LSYYLGIEVEQQKSRILLKQPTYAKRILSKFGMTNCNATKYPMEP-------------------------KILTEHKPNLSYAVGMASRYMERPITMHYK
        L+YYLGIEV+Q +S   LKQ  YAK++LS+FGM  CN+   P++P                         + L   +P+LSYAVG+ASR+MERP  MH+K
Subjt:  LSYYLGIEVEQQKSRILLKQPTYAKRILSKFGMTNCNATKYPMEP-------------------------KILTEHKPNLSYAVGMASRYMERPITMHYK

Query:  VVKQILRYLRGTIHFGFTYTKGPREFNIFGYSDSYLAGDLDGRKGTSGMTFYLNESLVSWNSQKQKTVAVSSCEVEFIAATTATCQALWLRCLVNEIVRM
         VKQILRY++GT+ +G  Y  G     I GY+DS LAGDLD R+ T GM FY+N+SLV+W+SQKQKTVA+SSCE EF+AATTA CQALWLR L+ E+  +
Subjt:  VVKQILRYLRGTIHFGFTYTKGPREFNIFGYSDSYLAGDLDGRKGTSGMTFYLNESLVSWNSQKQKTVAVSSCEVEFIAATTATCQALWLRCLVNEIVRM

Query:  EPRPVTLFVDNKSAIALMKNLVFH------------------------------EQRANVMTKALTGVKLVAMHELLGVRNLESCQD
        E + V LFVDN+SAIALMKN VFH                              EQRA+ +TK L   KLV    LLGVR+L S Q+
Subjt:  EPRPVTLFVDNKSAIALMKNLVFH------------------------------EQRANVMTKALTGVKLVAMHELLGVRNLESCQD

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.4e-5831.44Show/hide
Query:  RDEVVGGDEQENENELKSIEKNNTWSLTELPPGHKSI-------------------------------------EVFAPVARLDTVRVILALAANQSWEV
        RD+    +E  N  EL + + NNTW++T+ P     +                                     E FAPVAR+ + R IL+L    + +V
Subjt:  RDEVVGGDEQENENELKSIEKNNTWSLTELPPGHKSI-------------------------------------EVFAPVARLDTVRVILALAANQSWEV

Query:  HHLDVKSAFLNGELEEEVYVTQPEGFEVPNEKHKMYKLSKALYGLRQAPRAWNIRLDRSLKELDFGKCTQEQAVY--TRREEEECVFVVVYVNDLIVTGS
        H +DVK+AFLNG L+EE+Y+  P+G    ++   + KL+KA+YGL+QA R W    +++LKE +F   + ++ +Y   +    E ++V++YV+D+++   
Subjt:  HHLDVKSAFLNGELEEEVYVTQPEGFEVPNEKHKMYKLSKALYGLRQAPRAWNIRLDRSLKELDFGKCTQEQAVY--TRREEEECVFVVVYVNDLIVTGS

Query:  STEKVKKFKQQMMAEFEMSDLGLLSYYLGIEVEQQKSRILLKQPTYAKRILSKFGMTNCNATKYPMEPKILTE-------------------------HK
           ++  FK+ +M +F M+DL  + +++GI +E Q+ +I L Q  Y K+ILSKF M NCNA   P+  KI  E                          +
Subjt:  STEKVKKFKQQMMAEFEMSDLGLLSYYLGIEVEQQKSRILLKQPTYAKRILSKFGMTNCNATKYPMEPKILTE-------------------------HK

Query:  PNLSYAVGMASRYMERPITMHYKVVKQILRYLRGTIHFGFTYTKG-PREFNIFGYSDSYLAGDLDGRKGTSGMTFYLNE-SLVSWNSQKQKTVAVSSCEV
        P+L+ AV + SRY  +  +  ++ +K++LRYL+GTI     + K    E  I GY DS  AG    RK T+G  F + + +L+ WN+++Q +VA SS E 
Subjt:  PNLSYAVGMASRYMERPITMHYKVVKQILRYLRGTIHFGFTYTKG-PREFNIFGYSDSYLAGDLDGRKGTSGMTFYLNE-SLVSWNSQKQKTVAVSSCEV

Query:  EFIAATTATCQALWLRCLVNEIVRMEPRPVTLFVDNKSAIALMKNLVFHEQRANVMTK
        E++A   A  +ALWL+ L+  I      P+ ++ DN+  I++  N   H++  ++  K
Subjt:  EFIAATTATCQALWLRCLVNEIVRMEPRPVTLFVDNKSAIALMKNLVFHEQRANVMTK

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.7e-6926.78Show/hide
Query:  KSDAFESFKKFKLLMENKTKYKIRTLRTDRG---------------------------------ERRNLTVMAIARSLLKSMHVPAKFWEEAVRHAVYLL
        K   F+ F+KF  L+E +T  K++ LR+D G                                 ER N T++   RS+L+   +P  FW EAV+ A YL+
Subjt:  KSDAFESFKKFKLLMENKTKYKIRTLRTDRG---------------------------------ERRNLTVMAIARSLLKSMHVPAKFWEEAVRHAVYLL

Query:  NRLPTKVLGERTPFEVWMGRKPHLAYLSTWLCG----------------------IGRGKLQISEMLF---FKRILNG---LVTKSLVTVRR--------
        NR P+  L    P  VW  ++   ++L  + C                       IG G  +    L+    K+++     +  +S V            
Subjt:  NRLPTKVLGERTPFEVWMGRKPHLAYLSTWLCG----------------------IGRGKLQISEMLF---FKRILNG---LVTKSLVTVRR--------

Query:  --LRSFSPSPPSMNTPVHLRSLTDIYANRDEVVG---------------------GDEQEN---------------------------------------
          + +F   P + N P    S TD  + + E  G                     G+EQ                                         
Subjt:  --LRSFSPSPPSMNTPVHLRSLTDIYANRDEVVG---------------------GDEQEN---------------------------------------

Query:  ---------ENELKSIEKNNTWSLTELPPGHKSI-------------------------------------EVFAPVARLDTVRVILALAANQSWEVHHL
                 + E++S++KN T+ L ELP G + +                                     E+F+PV ++ ++R IL+LAA+   EV  L
Subjt:  ---------ENELKSIEKNNTWSLTELPPGHKSI-------------------------------------EVFAPVARLDTVRVILALAANQSWEVHHL

Query:  DVKSAFLNGELEEEVYVTQPEGFEVPNEKHKMYKLSKALYGLRQAPRAWNIRLDRSLKELDFGKCTQEQAVYTRR-EEEECVFVVVYVNDLIVTGSSTEK
        DVK+AFL+G+LEEE+Y+ QPEGFEV  +KH + KL+K+LYGL+QAPR W ++ D  +K   + K   +  VY +R  E   + +++YV+D+++ G     
Subjt:  DVKSAFLNGELEEEVYVTQPEGFEVPNEKHKMYKLSKALYGLRQAPRAWNIRLDRSLKELDFGKCTQEQAVYTRR-EEEECVFVVVYVNDLIVTGSSTEK

Query:  VKKFKQQMMAEFEMSDLGLLSYYLGIEV--EQQKSRILLKQPTYAKRILSKFGMTNCNATKYP----------MEPKILTEH------------------
        + K K  +   F+M DLG     LG+++  E+   ++ L Q  Y +R+L +F M N      P          M P  + E                   
Subjt:  VKKFKQQMMAEFEMSDLGLLSYYLGIEV--EQQKSRILLKQPTYAKRILSKFGMTNCNATKYP----------MEPKILTEH------------------

Query:  ----KPNLSYAVGMASRYMERPITMHYKVVKQILRYLRGTIHFGFTYTKGPREFNIFGYSDSYLAGDLDGRKGTSGMTFYLNESLVSWNSQKQKTVAVSS
            +P++++AVG+ SR++E P   H++ VK ILRYLRGT   G     G  +  + GY+D+ +AGD+D RK ++G  F  +   +SW S+ QK VA+S+
Subjt:  ----KPNLSYAVGMASRYMERPITMHYKVVKQILRYLRGTIHFGFTYTKGPREFNIFGYSDSYLAGDLDGRKGTSGMTFYLNESLVSWNSQKQKTVAVSS

Query:  CEVEFIAATTATCQALWLRCLVNEIVRMEPRPVTLFVDNKSAIALMKNLVFHEQRANV
         E E+IAAT    + +WL+  + E+  +  +   ++ D++SAI L KN ++H +  ++
Subjt:  CEVEFIAATTATCQALWLRCLVNEIVRMEPRPVTLFVDNKSAIALMKNLVFHEQRANV

P25600 Putative transposon Ty5-1 protein YCL074W2.8e-2727.72Show/hide
Query:  LDVKSAFLNGELEEEVYVTQPEGFEVPNEKHKMYKLSKALYGLRQAPRAWNIRLDRSLKELDFGKCTQEQAVYTRREEEECVFVVVYVNDLIVTGSSTEK
        +DV +AFLN  ++E +YV QP GF        +++L   +YGL+QAP  WN  ++ +LK++ F +   E  +Y R   +  +++ VYV+DL+V   S + 
Subjt:  LDVKSAFLNGELEEEVYVTQPEGFEVPNEKHKMYKLSKALYGLRQAPRAWNIRLDRSLKELDFGKCTQEQAVYTRREEEECVFVVVYVNDLIVTGSSTEK

Query:  VKKFKQQMMAEFEMSDLGLLSYYLGIEVEQQKS-RILLKQPTYAKRILSKFGMTNCNATKYPM------------EPKILTEH--------------KPN
          + KQ++   + M DLG +  +LG+ + Q  +  I L    Y  +  S+  +     T+ P+              K +T +              +P+
Subjt:  VKKFKQQMMAEFEMSDLGLLSYYLGIEVEQQKS-RILLKQPTYAKRILSKFGMTNCNATKYPM------------EPKILTEH--------------KPN

Query:  LSYAVGMASRYMERPITMHYKVVKQILRYLRGTIHFGFTYTKGPREFNIFGYSDSYLAGDLDGRKGTSGMTFYLNESLVSWNSQKQK-TVAVSSCEVEFI
        +SY V + SR++  P  +H +  +++LRYL  T      Y  G  +  +  Y D+      D    T G    L  + V+W+S+K K  + V S E E+I
Subjt:  LSYAVGMASRYMERPITMHYKVVKQILRYLRGTIHFGFTYTKGPREFNIFGYSDSYLAGDLDGRKGTSGMTFYLNESLVSWNSQKQK-TVAVSSCEVEFI

Query:  AAT
         A+
Subjt:  AAT

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE15.0e-5331.59Show/hide
Query:  PGHKSIEVFAPVARLDTVRVILALAANQSWEVHHLDVKSAFLNGELEEEVYVTQPEGFEVPNEKHKMYKLSKALYGLRQAPRAWNIRLDRSLKELDFGKC
        PG    E F+PV +  ++R++L +A ++SW +  LDV +AFL G L ++VY++QP GF   +  + + KL KALYGL+QAPRAW + L   L  + F   
Subjt:  PGHKSIEVFAPVARLDTVRVILALAANQSWEVHHLDVKSAFLNGELEEEVYVTQPEGFEVPNEKHKMYKLSKALYGLRQAPRAWNIRLDRSLKELDFGKC

Query:  TQEQAVYTRREEEECVFVVVYVNDLIVTGSSTEKVKKFKQQMMAEFEMSDLGLLSYYLGIEVEQQKSRILLKQPTYAKRILSKFGMTNCNATKYPMEP--
          + +++  +  +  V+++VYV+D+++TG+    +      +   F + D   L Y+LGIE ++  + + L Q  Y   +L++  M        PM P  
Subjt:  TQEQAVYTRREEEECVFVVVYVNDLIVTGSSTEKVKKFKQQMMAEFEMSDLGLLSYYLGIEVEQQKSRILLKQPTYAKRILSKFGMTNCNATKYPMEP--

Query:  -----------------------KILTEHKPNLSYAVGMASRYMERPITMHYKVVKQILRYLRGTIHFGFTYTKGPREFNIFGYSDSYLAGDLDGRKGTS
                               + L   +P++SYAV   S++M  P   H + +K+ILRYL GT + G    KG    ++  YSD+  AGD D    T+
Subjt:  -----------------------KILTEHKPNLSYAVGMASRYMERPITMHYKVVKQILRYLRGTIHFGFTYTKGPREFNIFGYSDSYLAGDLDGRKGTS

Query:  GMTFYLNESLVSWNSQKQKTVAVSSCEVEFIAATTATCQALWLRCLVNEIVRMEPRPVTLFVDNKSAIALMKNLVFHEQRANV
        G   YL    +SW+S+KQK V  SS E E+ +    + +  W+  L+ E+     RP  ++ DN  A  L  N VFH +  ++
Subjt:  GMTFYLNESLVSWNSQKQKTVAVSSCEVEFIAATTATCQALWLRCLVNEIVRMEPRPVTLFVDNKSAIALMKNLVFHEQRANV

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE13.9e-0527.69Show/hide
Query:  KSDAFESFKKFKLLMENKTKYKIRTLRTDRG-------------------------------ERRNLTVMAIARSLLKSMHVPAKFWEEAVRHAVYLLNR
        KS   E+F  FK L+EN+ + +I T  +D G                               ER++  ++    +LL    +P  +W  A   AVYL+NR
Subjt:  KSDAFESFKKFKLLMENKTKYKIRTLRTDRG-------------------------------ERRNLTVMAIARSLLKSMHVPAKFWEEAVRHAVYLLNR

Query:  LPTKVLGERTPFEVWMGRKPHLAYLSTWLC
        LPT +L   +PF+   G  P+   L  + C
Subjt:  LPTKVLGERTPFEVWMGRKPHLAYLSTWLC

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.9e-5231.85Show/hide
Query:  PGHKSIEVFAPVARLDTVRVILALAANQSWEVHHLDVKSAFLNGELEEEVYVTQPEGFEVPNEKHKMYKLSKALYGLRQAPRAWNIRLDRSLKELDFGKC
        PG    E F+PV +  ++R++L +A ++SW +  LDV +AFL G L +EVY++QP GF   +    + +L KA+YGL+QAPRAW + L   L  + F   
Subjt:  PGHKSIEVFAPVARLDTVRVILALAANQSWEVHHLDVKSAFLNGELEEEVYVTQPEGFEVPNEKHKMYKLSKALYGLRQAPRAWNIRLDRSLKELDFGKC

Query:  TQEQAVYTRREEEECVFVVVYVNDLIVTGSSTEKVKKFKQQMMAEFEMSDLGLLSYYLGIEVEQQKSRILLKQPTYAKRILSKFGMTNCNATKYPM--EP
          + +++  +     ++++VYV+D+++TG+ T  +K     +   F + +   L Y+LGIE ++    + L Q  Y   +L++  M        PM   P
Subjt:  TQEQAVYTRREEEECVFVVVYVNDLIVTGSSTEKVKKFKQQMMAEFEMSDLGLLSYYLGIEVEQQKSRILLKQPTYAKRILSKFGMTNCNATKYPM--EP

Query:  KIL----------TEH-------------KPNLSYAVGMASRYMERPITMHYKVVKQILRYLRGTIHFGFTYTKGPREFNIFGYSDSYLAGDLDGRKGTS
        K+           TE+             +P+LSYAV   S+YM  P   H+  +K++LRYL GT   G    KG    ++  YSD+  AGD D    T+
Subjt:  KIL----------TEH-------------KPNLSYAVGMASRYMERPITMHYKVVKQILRYLRGTIHFGFTYTKGPREFNIFGYSDSYLAGDLDGRKGTS

Query:  GMTFYLNESLVSWNSQKQKTVAVSSCEVEFIAATTATCQALWLRCLVNEIVRMEPRPVTLFVDNKSAIALMKNLVFHEQRANV
        G   YL    +SW+S+KQK V  SS E E+ +    + +  W+  L+ E+      P  ++ DN  A  L  N VFH +  ++
Subjt:  GMTFYLNESLVSWNSQKQKTVAVSSCEVEFIAATTATCQALWLRCLVNEIVRMEPRPVTLFVDNKSAIALMKNLVFHEQRANV

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE27.9e-0628.46Show/hide
Query:  KSDAFESFKKFKLLMENKTKYKIRTLRTDRG-------------------------------ERRNLTVMAIARSLLKSMHVPAKFWEEAVRHAVYLLNR
        KS   ++F  FK L+EN+ + +I TL +D G                               ER++  ++ +  +LL    VP  +W  A   AVYL+NR
Subjt:  KSDAFESFKKFKLLMENKTKYKIRTLRTDRG-------------------------------ERRNLTVMAIARSLLKSMHVPAKFWEEAVRHAVYLLNR

Query:  LPTKVLGERTPFEVWMGRKPHLAYLSTWLC
        LPT +L  ++PF+   G+ P+   L  + C
Subjt:  LPTKVLGERTPFEVWMGRKPHLAYLSTWLC

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 82.0e-5228.44Show/hide
Query:  ENELKSIEKNNTWSLTELPPGHKS-------------------------------------IEVFAPVARLDTVRVILALAANQSWEVHHLDVKSAFLNG
        ++E+ ++E  +TW +  LPP  K                                      IE F+PV +L +V++ILA++A  ++ +H LD+ +AFLNG
Subjt:  ENELKSIEKNNTWSLTELPPGHKS-------------------------------------IEVFAPVARLDTVRVILALAANQSWEVHHLDVKSAFLNG

Query:  ELEEEVYVTQPEGFEVPN----EKHKMYKLSKALYGLRQAPRAWNIRLDRSLKELDFGKCTQEQAVYTRREEEECVFVVVYVNDLIVTGSSTEKVKKFKQ
        +L+EE+Y+  P G+          + +  L K++YGL+QA R W ++   +L    F +   +   + +      + V+VYV+D+I+  ++   V + K 
Subjt:  ELEEEVYVTQPEGFEVPN----EKHKMYKLSKALYGLRQAPRAWNIRLDRSLKELDFGKCTQEQAVYTRREEEECVFVVVYVNDLIVTGSSTEKVKKFKQ

Query:  QMMAEFEMSDLGLLSYYLGIEVEQQKSRILLKQPTYAKRILSKFGMTNCNATKYPMEPKI-------------------------LTEHKPNLSYAVGMA
        Q+ + F++ DLG L Y+LG+E+ +  + I + Q  YA  +L + G+  C  +  PM+P +                         L   + ++S+AV   
Subjt:  QMMAEFEMSDLGLLSYYLGIEVEQQKSRILLKQPTYAKRILSKFGMTNCNATKYPMEPKI-------------------------LTEHKPNLSYAVGMA

Query:  SRYMERPITMHYKVVKQILRYLRGTIHFGFTYTKGPREFNIFGYSDSYLAGDLDGRKGTSGMTFYLNESLVSWNSQKQKTVAVSSCEVEFIAATTATCQA
        S++ E P   H + V +IL Y++GT+  G  Y+    E  +  +SD+      D R+ T+G   +L  SL+SW S+KQ+ V+ SS E E+ A + AT + 
Subjt:  SRYMERPITMHYKVVKQILRYLRGTIHFGFTYTKGPREFNIFGYSDSYLAGDLDGRKGTSGMTFYLNESLVSWNSQKQKTVAVSSCEVEFIAATTATCQA

Query:  LWLRCLVNEIVRMEPRPVTLFVDNKSAIALMKNLVFHEQRANV
        +WL     E+     +P  LF DN +AI +  N VFHE+  ++
Subjt:  LWLRCLVNEIVRMEPRPVTLFVDNKSAIALMKNLVFHEQRANV

ATMG00810.1 DNA/RNA polymerases superfamily protein1.1e-2629.65Show/hide
Query:  VFVVVYVNDLIVTGSSTEKVKKFKQQMMAEFEMSDLGLLSYYLGIEVEQQKSRILLKQPTYAKRILSKFGMTNCNATKYPMEPKI---------------
        +++++YV+D+++TGSS   +     Q+ + F M DLG + Y+LGI+++   S + L Q  YA++IL+  GM +C     P+  K+               
Subjt:  VFVVVYVNDLIVTGSSTEKVKKFKQQMMAEFEMSDLGLLSYYLGIEVEQQKSRILLKQPTYAKRILSKFGMTNCNATKYPMEPKI---------------

Query:  ---------LTEHKPNLSYAVGMASRYMERPITMHYKVVKQILRYLRGTIHFGFTYTKGPREFNIFGYSDSYLAGDLDGRKGTSGMTFYLNESLVSWNSQ
                 LT  +P++SYAV +  + M  P    + ++K++LRY++GTI  G  Y     + N+  + DS  AG    R+ T+G   +L  +++SW+++
Subjt:  ---------LTEHKPNLSYAVGMASRYMERPITMHYKVVKQILRYLRGTIHFGFTYTKGPREFNIFGYSDSYLAGDLDGRKGTSGMTFYLNESLVSWNSQ

Query:  KQKTVAVSSCEVEFIAATTATCQALW
        +Q TV+ SS E E+ A      +  W
Subjt:  KQKTVAVSSCEVEFIAATTATCQALW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAAGCATCTCGCAGCAAACAAAGGAGAATGGCATAATGCCGATTCTCTACCCGATGTTAACTCCACACAACTACACGGTGTGGACGATAAAGCCAGAAGCAATCCT
CGATGCCCAAGGAGTTGGGGAAGCAATAGAGCCAGCGAGAGGAGCCGAAGTGAATGTGAAGAAAGACAAGAATGCGCATGCATACATTCTCCAATGCATCTTGGAAGACG
TGCTTCTACAAATCGCAAAGAAGAAGACTGCAAAGGAAATATGGGATAGTCTTAAGATAAGGTACTTCGGCAGTGAGCGAGTGAAGATGGCGAGAGAAACTGAGACGATC
GATGAGTTCGCAGGAAAAATTAGCGGATTAGTAAGTAAGTTCACCACCCTCAGAGTTGCACTTGAGGATTCATCATTGGTCAAGAAGCTCCTCGATTTCGTCCTCGACAA
ATATCTTTCCATTATCGCTAGAATCGAGCAATTCCAAGATCTCGATACCATGCCGTTTGAAGAAGGTGCAGATTTGGTGGTATCGATCGACGAAGATGGCGAGGACGTGG
TCGTGGCACTGAGCGTCAAAATAGTGCGAGGCACTAGCAACACTGAAAATGACACTCGTGATAAAAGTCACATTAAGTGTTTCACTTGCAACAAGATGGGACATTACATG
TCGAAATGTCGTGGAAAAGATCGTGACGATGAAGCTCATCTAACTTGTGTCTTCGAAGAAGAACCAGCTTTGATGATGGTCGTGTCCCAGGAGGGGACACACACTAGATG
TGATCAGGAGGATGTCATACTACTCAGCGAAGAGCGGTTGTTGCCAGAGATATATGCGAACCGATTTCACCATGCAAAAAGTGACGCATTTGAATCATTCAAGAAATTCA
AACTCTTAATGGAGAACAAGACAAAGTACAAGATCAGAACGCTCCGGACGGATCGAGGTGAGCGTCGTAACCTCACCGTAATGGCGATAGCGAGATCACTCCTCAAAAGC
ATGCACGTGCCTGCAAAATTTTGGGAAGAGGCAGTGAGACACGCGGTTTATTTGTTAAATCGTCTTCCAACGAAGGTCCTTGGAGAACGCACACCATTTGAAGTTTGGAT
GGGGAGAAAGCCACATCTTGCATACTTGAGTACTTGGTTGTGTGGCATAGGCCGTGGAAAACTACAAATTAGTGAGATGTTGTTTTTCAAGAGAATTTTGAATGGACTTG
TAACGAAGTCGTTAGTGACAGTAAGGAGATTACGGAGTTTCAGTCCATCTCCTCCATCGATGAACACACCGGTCCATCTAAGATCTCTCACTGACATCTACGCCAACAGA
GATGAAGTTGTAGGTGGTGATGAACAAGAGAATGAGAACGAGTTAAAATCCATTGAGAAAAACAACACATGGAGTCTGACCGAGCTTCCACCAGGACACAAATCCATTGA
AGTTTTTGCACCGGTTGCAAGACTTGACACCGTTCGAGTCATTCTTGCACTCGCTGCAAACCAAAGTTGGGAGGTACACCATCTAGATGTGAAGTCGGCATTTCTCAATG
GAGAATTGGAAGAGGAAGTATATGTTACTCAACCGGAGGGTTTTGAGGTTCCAAATGAAAAACACAAGATGTATAAATTGTCAAAGGCTCTCTACGGATTGAGACAAGCT
CCACGAGCTTGGAACATTCGACTTGATAGGAGTCTCAAAGAGCTTGATTTTGGAAAATGCACTCAAGAGCAAGCAGTCTACACAAGAAGAGAAGAAGAGGAATGTGTTTT
TGTTGTTGTGTATGTTAACGATCTCATTGTAACAGGAAGTAGCACTGAAAAGGTTAAGAAGTTCAAGCAGCAAATGATGGCAGAATTTGAAATGAGCGACTTAGGCCTTC
TCTCTTACTACTTAGGAATTGAAGTTGAACAACAGAAGAGTCGAATCTTGCTTAAACAACCAACTTATGCCAAAAGAATTTTGTCCAAGTTTGGAATGACTAATTGCAAT
GCCACAAAGTACCCGATGGAACCCAAGATACTTACTGAACACAAGCCAAATCTCTCATATGCTGTTGGGATGGCGAGTAGGTATATGGAAAGGCCTATAACCATGCATTA
CAAGGTGGTCAAGCAAATACTTAGGTATTTGAGAGGGACAATTCACTTTGGGTTCACTTATACGAAAGGTCCCAGAGAATTCAATATATTCGGTTACTCTGACAGTTATT
TAGCCGGTGATCTTGATGGGAGAAAAGGCACAAGTGGAATGACATTCTACTTAAACGAAAGCTTGGTTTCATGGAATTCGCAAAAGCAAAAGACGGTGGCAGTCTCATCT
TGCGAAGTTGAGTTCATTGCAGCCACTACTGCAACTTGCCAAGCATTATGGTTAAGATGCCTTGTTAACGAGATAGTCAGAATGGAGCCAAGGCCGGTAACATTATTTGT
GGACAACAAATCTGCGATAGCTCTCATGAAGAATCTTGTATTTCATGAACAACGAGCCAATGTTATGACTAAAGCATTGACAGGAGTAAAGTTAGTTGCTATGCATGAGC
TACTTGGTGTTCGTAATTTAGAATCATGTCAGGATTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCAAGCATCTCGCAGCAAACAAAGGAGAATGGCATAATGCCGATTCTCTACCCGATGTTAACTCCACACAACTACACGGTGTGGACGATAAAGCCAGAAGCAATCCT
CGATGCCCAAGGAGTTGGGGAAGCAATAGAGCCAGCGAGAGGAGCCGAAGTGAATGTGAAGAAAGACAAGAATGCGCATGCATACATTCTCCAATGCATCTTGGAAGACG
TGCTTCTACAAATCGCAAAGAAGAAGACTGCAAAGGAAATATGGGATAGTCTTAAGATAAGGTACTTCGGCAGTGAGCGAGTGAAGATGGCGAGAGAAACTGAGACGATC
GATGAGTTCGCAGGAAAAATTAGCGGATTAGTAAGTAAGTTCACCACCCTCAGAGTTGCACTTGAGGATTCATCATTGGTCAAGAAGCTCCTCGATTTCGTCCTCGACAA
ATATCTTTCCATTATCGCTAGAATCGAGCAATTCCAAGATCTCGATACCATGCCGTTTGAAGAAGGTGCAGATTTGGTGGTATCGATCGACGAAGATGGCGAGGACGTGG
TCGTGGCACTGAGCGTCAAAATAGTGCGAGGCACTAGCAACACTGAAAATGACACTCGTGATAAAAGTCACATTAAGTGTTTCACTTGCAACAAGATGGGACATTACATG
TCGAAATGTCGTGGAAAAGATCGTGACGATGAAGCTCATCTAACTTGTGTCTTCGAAGAAGAACCAGCTTTGATGATGGTCGTGTCCCAGGAGGGGACACACACTAGATG
TGATCAGGAGGATGTCATACTACTCAGCGAAGAGCGGTTGTTGCCAGAGATATATGCGAACCGATTTCACCATGCAAAAAGTGACGCATTTGAATCATTCAAGAAATTCA
AACTCTTAATGGAGAACAAGACAAAGTACAAGATCAGAACGCTCCGGACGGATCGAGGTGAGCGTCGTAACCTCACCGTAATGGCGATAGCGAGATCACTCCTCAAAAGC
ATGCACGTGCCTGCAAAATTTTGGGAAGAGGCAGTGAGACACGCGGTTTATTTGTTAAATCGTCTTCCAACGAAGGTCCTTGGAGAACGCACACCATTTGAAGTTTGGAT
GGGGAGAAAGCCACATCTTGCATACTTGAGTACTTGGTTGTGTGGCATAGGCCGTGGAAAACTACAAATTAGTGAGATGTTGTTTTTCAAGAGAATTTTGAATGGACTTG
TAACGAAGTCGTTAGTGACAGTAAGGAGATTACGGAGTTTCAGTCCATCTCCTCCATCGATGAACACACCGGTCCATCTAAGATCTCTCACTGACATCTACGCCAACAGA
GATGAAGTTGTAGGTGGTGATGAACAAGAGAATGAGAACGAGTTAAAATCCATTGAGAAAAACAACACATGGAGTCTGACCGAGCTTCCACCAGGACACAAATCCATTGA
AGTTTTTGCACCGGTTGCAAGACTTGACACCGTTCGAGTCATTCTTGCACTCGCTGCAAACCAAAGTTGGGAGGTACACCATCTAGATGTGAAGTCGGCATTTCTCAATG
GAGAATTGGAAGAGGAAGTATATGTTACTCAACCGGAGGGTTTTGAGGTTCCAAATGAAAAACACAAGATGTATAAATTGTCAAAGGCTCTCTACGGATTGAGACAAGCT
CCACGAGCTTGGAACATTCGACTTGATAGGAGTCTCAAAGAGCTTGATTTTGGAAAATGCACTCAAGAGCAAGCAGTCTACACAAGAAGAGAAGAAGAGGAATGTGTTTT
TGTTGTTGTGTATGTTAACGATCTCATTGTAACAGGAAGTAGCACTGAAAAGGTTAAGAAGTTCAAGCAGCAAATGATGGCAGAATTTGAAATGAGCGACTTAGGCCTTC
TCTCTTACTACTTAGGAATTGAAGTTGAACAACAGAAGAGTCGAATCTTGCTTAAACAACCAACTTATGCCAAAAGAATTTTGTCCAAGTTTGGAATGACTAATTGCAAT
GCCACAAAGTACCCGATGGAACCCAAGATACTTACTGAACACAAGCCAAATCTCTCATATGCTGTTGGGATGGCGAGTAGGTATATGGAAAGGCCTATAACCATGCATTA
CAAGGTGGTCAAGCAAATACTTAGGTATTTGAGAGGGACAATTCACTTTGGGTTCACTTATACGAAAGGTCCCAGAGAATTCAATATATTCGGTTACTCTGACAGTTATT
TAGCCGGTGATCTTGATGGGAGAAAAGGCACAAGTGGAATGACATTCTACTTAAACGAAAGCTTGGTTTCATGGAATTCGCAAAAGCAAAAGACGGTGGCAGTCTCATCT
TGCGAAGTTGAGTTCATTGCAGCCACTACTGCAACTTGCCAAGCATTATGGTTAAGATGCCTTGTTAACGAGATAGTCAGAATGGAGCCAAGGCCGGTAACATTATTTGT
GGACAACAAATCTGCGATAGCTCTCATGAAGAATCTTGTATTTCATGAACAACGAGCCAATGTTATGACTAAAGCATTGACAGGAGTAAAGTTAGTTGCTATGCATGAGC
TACTTGGTGTTCGTAATTTAGAATCATGTCAGGATTAG
Protein sequenceShow/hide protein sequence
MASISQQTKENGIMPILYPMLTPHNYTVWTIKPEAILDAQGVGEAIEPARGAEVNVKKDKNAHAYILQCILEDVLLQIAKKKTAKEIWDSLKIRYFGSERVKMARETETI
DEFAGKISGLVSKFTTLRVALEDSSLVKKLLDFVLDKYLSIIARIEQFQDLDTMPFEEGADLVVSIDEDGEDVVVALSVKIVRGTSNTENDTRDKSHIKCFTCNKMGHYM
SKCRGKDRDDEAHLTCVFEEEPALMMVVSQEGTHTRCDQEDVILLSEERLLPEIYANRFHHAKSDAFESFKKFKLLMENKTKYKIRTLRTDRGERRNLTVMAIARSLLKS
MHVPAKFWEEAVRHAVYLLNRLPTKVLGERTPFEVWMGRKPHLAYLSTWLCGIGRGKLQISEMLFFKRILNGLVTKSLVTVRRLRSFSPSPPSMNTPVHLRSLTDIYANR
DEVVGGDEQENENELKSIEKNNTWSLTELPPGHKSIEVFAPVARLDTVRVILALAANQSWEVHHLDVKSAFLNGELEEEVYVTQPEGFEVPNEKHKMYKLSKALYGLRQA
PRAWNIRLDRSLKELDFGKCTQEQAVYTRREEEECVFVVVYVNDLIVTGSSTEKVKKFKQQMMAEFEMSDLGLLSYYLGIEVEQQKSRILLKQPTYAKRILSKFGMTNCN
ATKYPMEPKILTEHKPNLSYAVGMASRYMERPITMHYKVVKQILRYLRGTIHFGFTYTKGPREFNIFGYSDSYLAGDLDGRKGTSGMTFYLNESLVSWNSQKQKTVAVSS
CEVEFIAATTATCQALWLRCLVNEIVRMEPRPVTLFVDNKSAIALMKNLVFHEQRANVMTKALTGVKLVAMHELLGVRNLESCQD