; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG07G007380 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG07G007380
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationCG_Chr07:18660052..18665048
RNA-Seq ExpressionClCG07G007380
SyntenyClCG07G007380
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003824 - catalytic activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR001878 - Zinc finger, CCHC-type
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR027417 - P-loop containing nucleoside triphosphate hydrolase
IPR036397 - Ribonuclease H superfamily
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN64335.1 hypothetical protein VITISV_001808 [Vitis vinifera]2.0e-23340.52Show/hide
Query:  MDANESITDMFTRFTNIINALKGLGKDYTTSENVRKILRSLPKTWEAKVTAIQEAKDLTKFSLEELIGLLMTHEITM-KEHLEDESKKEKSFALKTIALK
        M   E+I +M TRFT I+N L+ LG+    SE V KILRSLP  W  KVTAIQEAKDLTK  +EEL+G LMT+EI++ K+  E E KK+K+ ALK     
Subjt:  MDANESITDMFTRFTNIINALKGLGKDYTTSENVRKILRSLPKTWEAKVTAIQEAKDLTKFSLEELIGLLMTHEITM-KEHLEDESKKEKSFALKTIALK

Query:  TISFAVDSEDEDALDQ-DDIAYFSRKYKNFIK----RKKQFKKQLSTQR---ESKGEKSK---KDEVICYECKKPGHIRTDCPFLK-SSKKSKKKAMKAT
        T     D E+E   ++ DD+A  +RK   +++    R K+F  + +  R    S G+K K   K ++IC++CK PGHI+ DCP  K  +K+  KKAM AT
Subjt:  TISFAVDSEDEDALDQ-DDIAYFSRKYKNFIK----RKKQFKKQLSTQR---ESKGEKSK---KDEVICYECKKPGHIRTDCPFLK-SSKKSKKKAMKAT

Query:  WDDSSESESGSDVEEVARICLMALSDKDGVLDDEVNLEPPSYDELFETFESMQNDLEKLSSKYVVLKKKYNALSSENKSLLDKIACFKNNEHDVQIDEM-
        W +S ES      +EVA +C MA+ D      DEVN    S +++   FE +  D EKLS K   LKK+   L  E + + +K +          IDE+ 
Subjt:  WDDSSESESGSDVEEVARICLMALSDKDGVLDDEVNLEPPSYDELFETFESMQNDLEKLSSKYVVLKKKYNALSSENKSLLDKIACFKNNEHDVQIDEM-

Query:  KVSCDKHACVCDEKNALLDK-----VRFLEHDSCEKDNLIKLLKEKELNALQELDKAKESIKKLTIGAQKFDKIIEVGKPYGDKRGLGYIDECSTPSSKT
        K   +K   +   +N +L K        L   SC + +   +L  ++         +KE    L  G  +     E    +  KR  GY+        + 
Subjt:  KVSCDKHACVCDEKNALLDK-----VRFLEHDSCEKDNLIKLLKEKELNALQELDKAKESIKKLTIGAQKFDKIIEVGKPYGDKRGLGYIDECSTPSSKT

Query:  IFVKASSIVPNMPNVVSNDAKSSFV--PICHNCGVEGHIRPKCFKLKYAHNTSSRRNFSQRAKFYNAPR-ENFSKKTYSCYFSRSNAFNANAKMKWIPKF
        I         N+ N  S+  +S  +   + HN      +  K FK+ +  +    ++       +   R EN      S Y      F++     W+   
Subjt:  IFVKASSIVPNMPNVVSNDAKSSFV--PICHNCGVEGHIRPKCFKLKYAHNTSSRRNFSQRAKFYNAPR-ENFSKKTYSCYFSRSNAFNANAKMKWIPKF

Query:  VNANTLGPKQVCLKASKKNRWYLDNGCSRHMTRDRSKFISFSRKDGGMDKVCDACQMGKQTKSSFKSKNVISTTRPLQLLHMDLFVPSRIASFGGNYYAF
             LG   + L  S+ N+  L  G  +         I+F +     DK+C+ACQMGKQ K+SFK+KN IST+RPL+LLHMDLF PSR  S GG  YA+
Subjt:  VNANTLGPKQVCLKASKKNRWYLDNGCSRHMTRDRSKFISFSRKDGGMDKVCDACQMGKQTKSSFKSKNVISTTRPLQLLHMDLFVPSRIASFGGNYYAF

Query:  VIVDDFSRFTWVLMIKHKDDALKSFISLAKRVQNENGFFISTIRSDHGGEFDNDVFKAYCEKKNGFSHNFSSPRTPQQNGVVERKNRTLQEFARSMLHEY
        VIVDDFSR+TWVL +  K +A   F     +VQNE GF I+ IRSDHG EF+N  F+ YC  K G +HNF +PRT QQNGVVERKNRTLQE AR+ML+E 
Subjt:  VIVDDFSRFTWVLMIKHKDDALKSFISLAKRVQNENGFFISTIRSDHGGEFDNDVFKAYCEKKNGFSHNFSSPRTPQQNGVVERKNRTLQEFARSMLHEY

Query:  GLPNYFWAEAVNTACYVLNRVLIKPSLDKTPYELWHDKIPNIGYFKVFGCKCFILNNKEKLGKFDSKTDIGIFLGYSSTSKAYRVFNKRTLLIEESMH--
         LP YFWAEA+NT+CYVLNR+L++P L KTPYELW +K PNI YFKVFGCKCFILN K+ LGKFD+K+D+GIFLGYS++SKA+RVFNKRT+++EES+H  
Subjt:  GLPNYFWAEAVNTACYVLNRVLIKPSLDKTPYELWHDKIPNIGYFKVFGCKCFILNNKEKLGKFDSKTDIGIFLGYSSTSKAYRVFNKRTLLIEESMH--

Query:  VVFDESCNIVSNESIYSDDLEKDFEDLLVSEKGKEIVPSMEEVSINEKKEDGSSSMPKEWRYAPSHPKDLILCDPEQGVKTRSSL-NLFSNLAFVSQVEP
         +  E+C + + +                ++K  E +P        +K       +  + ++  +HP+D I+ +P  GV+TRSSL N+ +NLAF+SQ+EP
Subjt:  VVFDESCNIVSNESIYSDDLEKDFEDLLVSEKGKEIVPSMEEVSINEKKEDGSSSMPKEWRYAPSHPKDLILCDPEQGVKTRSSL-NLFSNLAFVSQVEP

Query:  KSFKDAECDEFWILGMQEELNQFERNNVLELVPKPSNASIIGTKWVFRNKMDEHGNIIRNKARLVAQGYCQEEGIDSEETFAPVARLEAIRMLLAFASYK
        K+ KDA  DE W++ MQ+ELNQFER+ V ELVP+PSN S+IGTKWVFRNKMDE+G I+RNKARLVAQGY QEEGID EETFAPVARLEAIRMLLAFA +K
Subjt:  KSFKDAECDEFWILGMQEELNQFERNNVLELVPKPSNASIIGTKWVFRNKMDEHGNIIRNKARLVAQGYCQEEGIDSEETFAPVARLEAIRMLLAFASYK

Query:  NFILYQMDVKSAFLNGYIIEQVYVEQPPGFENFELPSHVYKLKKALYGLKQAPRAWYDRLS---------------------------------------
        +FILYQMDVKSAFLNG+I E++YVEQPPGF++F  P+HV+KLKKALYGLKQAPRAWY+RLS                                       
Subjt:  NFILYQMDVKSAFLNGYIIEQVYVEQPPGFENFELPSHVYKLKKALYGLKQAPRAWYDRLS---------------------------------------

Query:  -------------------------------KIAKTPMRTTTKLDKDEKGKCVDIKIYR-----------------------------------------
                                       K+ KTPM ++ KLD DEKGK +D  +YR                                         
Subjt:  -------------------------------KIAKTPMRTTTKLDKDEKGKCVDIKIYR-----------------------------------------

Query:  -----------------------GYSDADFAGSLLDRKSTSGTCQFLGSSYSVKATGRIAQIPVSKAFLGRVINALA-------KPIDGQGEISSSDSRL
                               G+SDADFAG  ++RKSTSGTC  LG S     + +   I +S A       +L        K       +S     +
Subjt:  -----------------------GYSDADFAGSLLDRKSTSGTCQFLGSSYSVKATGRIAQIPVSKAFLGRVINALA-------KPIDGQGEISSSDSRL

Query:  IESPALGIILRRSVYEPLQTGLIAIDSIIPIRRGQR-ELIIGERQTGKTLADIFTKLLNEESFCKNRLELGI
               I + ++  +  +T  I I         Q+ ++ +    T   LADIFTK L+EE F   R +LGI
Subjt:  IESPALGIILRRSVYEPLQTGLIAIDSIIPIRRGQR-ELIIGERQTGKTLADIFTKLLNEESFCKNRLELGI

CAN74951.1 hypothetical protein VITISV_030567, partial [Vitis vinifera]4.1e-20741.31Show/hide
Query:  MFTRFTNIINALKGLGKDYTTSENVRKILRSLPKTWEAKVTAIQEAKDLTKFSLEELIGLLMTHEITMKEHL-EDESKKEKSFALKTIALKTISFAVDSE
        +  RFT I+N L+ LG+    SE V KILRSLP  W  KVTAIQEAKDLTK  +EEL+G LMT+EI++ + L E E KK+KS A  + +  T       E
Subjt:  MFTRFTNIINALKGLGKDYTTSENVRKILRSLPKTWEAKVTAIQEAKDLTKFSLEELIGLLMTHEITMKEHL-EDESKKEKSFALKTIALKTISFAVDSE

Query:  DEDALDQDDIAYFSRKYKNFIK----RKKQFKKQLSTQR---ESKGEKSK---KDEVICYECKKPGHIRTDCPFLK-SSKKSKKKAMKATWDDSSESESG
        ++ + + DD+A  +RK   +++    R K+F  + +  R    S G+K K   K ++IC++CKKPGHI+ DCP  K  +K+  KKAM ATW +S ES   
Subjt:  DEDALDQDDIAYFSRKYKNFIK----RKKQFKKQLSTQR---ESKGEKSK---KDEVICYECKKPGHIRTDCPFLK-SSKKSKKKAMKATWDDSSESESG

Query:  SDVEEVARICLMALSDKDGVLDDEVNLEPPSYDELFETFESMQNDLEKLSSKYVVLKKKYNALSSENKSLLDKIACFKNNEHDVQIDEMKVSCDKHACVC
           +EVA +C MA+ D D                                             S E+K  LD                    C +H    
Subjt:  SDVEEVARICLMALSDKDGVLDDEVNLEPPSYDELFETFESMQNDLEKLSSKYVVLKKKYNALSSENKSLLDKIACFKNNEHDVQIDEMKVSCDKHACVC

Query:  DEKNALLDKVRFLEHDSCEKDNLIKLLKEKELNALQELDKAKESIKKLTIGAQKFDKIIEVGKPYGDKRGLGYIDECSTPSSKTIFVKASSIVPNMPNVV
        + K A L K          K   +                        T G     +II          G G+         K IF  +  I+ ++ N  
Subjt:  DEKNALLDKVRFLEHDSCEKDNLIKLLKEKELNALQELDKAKESIKKLTIGAQKFDKIIEVGKPYGDKRGLGYIDECSTPSSKTIFVKASSIVPNMPNVV

Query:  SNDAKSSFVPICHNCGVEGHIRPKCFKLKYAHNTSSRRNFSQRAKFYNAPRENFSKKTYSCYFSRSNAFNANAKMKWIPKFVNANTLGPKQVCLKASKKN
            K+ F+   H C                                    EN      S Y      F++     W+        LG   + L  S+ N
Subjt:  SNDAKSSFVPICHNCGVEGHIRPKCFKLKYAHNTSSRRNFSQRAKFYNAPRENFSKKTYSCYFSRSNAFNANAKMKWIPKFVNANTLGPKQVCLKASKKN

Query:  RWYLDNGCSRHMTRDRSKFISFSRKDGGMDKVCDACQMGKQTKSSFKSKNVISTTRPLQLLHMDLFVPSRIASFGGNYYAFVIVDDFSRFTWVLMIKHKD
        +  L  G  +         I+F +     DK+C+ACQMGKQ K+SFK+KN IST+RPL+LLHMDLF PSR  S GG  YA+VIVDDFSR+TWVL +  K 
Subjt:  RWYLDNGCSRHMTRDRSKFISFSRKDGGMDKVCDACQMGKQTKSSFKSKNVISTTRPLQLLHMDLFVPSRIASFGGNYYAFVIVDDFSRFTWVLMIKHKD

Query:  DALKSFISLAKRVQNENGFFISTIRSDHGGEFDNDVFKAYCEKKNGFSHNFSSPRTPQQNGVVERKNRTLQEFARSMLHEYGLPNYFWAEAVNTACYVLN
        +A   F     +VQNE GF I+ IRSDHG EF+N  F+ YC  K+G +HNFS+PRTPQQNGVVERKNRTLQE AR+ML+E  LP YFWAEAVNT+CYVLN
Subjt:  DALKSFISLAKRVQNENGFFISTIRSDHGGEFDNDVFKAYCEKKNGFSHNFSSPRTPQQNGVVERKNRTLQEFARSMLHEYGLPNYFWAEAVNTACYVLN

Query:  RVLIKPSLDKTPYELWHDKIPNIGYFKVFGCKCFILNNKEKLGKFDSKTDIGIFLGYSSTSKAYRVFNKRTLLIEESMHVVFDESCNIVSNESIYSDD--
        R+L++P L KTPYELW +K PNI YFKVFGCKCFILN K+ LGKFD+K+D+GIFLGYS++SKA+RVFNKRT+++EES+HV+FDES N +       DD  
Subjt:  RVLIKPSLDKTPYELWHDKIPNIGYFKVFGCKCFILNNKEKLGKFDSKTDIGIFLGYSSTSKAYRVFNKRTLLIEESMHVVFDESCNIVSNESIYSDD--

Query:  LEKDFEDLLVSEKGKEIVPSMEEVSINEKKEDGSSSMPKEWRYAPSHPKDLILCDPEQGVKTRSSLNLFSNLAFVSQVEPKSFKDAECDEFWILGMQEEL
        LE     L + +K ++     EE   + KKED   ++P            L     E   KT   +   S+            K  +C            
Subjt:  LEKDFEDLLVSEKGKEIVPSMEEVSINEKKEDGSSSMPKEWRYAPSHPKDLILCDPEQGVKTRSSLNLFSNLAFVSQVEPKSFKDAECDEFWILGMQEEL

Query:  NQFERNNVLELVPKPSNASIIGTKWVFRNKMDEHGNIIRNKARLVAQGYCQEEGIDSEETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIIE
                 ELVP+PSN S+IGTKWVFRNKMDE+G I+RNKARLVAQGY QEEGID EETFAPVARLEAIRMLLAFA +K+FILYQMDVKSAFLNG+I E
Subjt:  NQFERNNVLELVPKPSNASIIGTKWVFRNKMDEHGNIIRNKARLVAQGYCQEEGIDSEETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIIE

Query:  QVYVEQPPGFENFELPSHVYKLKKALYGLKQAPRAWYDRL----------------------------------------------------SKIAKTPM
        +VYVEQPPGF++F  P+HV+KLKKALYGLKQAPRAWY+RL                                                    +K+ KTPM
Subjt:  QVYVEQPPGFENFELPSHVYKLKKALYGLKQAPRAWYDRL----------------------------------------------------SKIAKTPM

Query:  RTTTKLDKDEKGKCVDIKIYR--------------------------GYSDADFAGSLLDRKSTSGTCQFLGSS
         ++ KLD DEKGK +D  +YR                          G+SDADFAG  ++RKSTSGTC FLG S
Subjt:  RTTTKLDKDEKGKCVDIKIYR--------------------------GYSDADFAGSLLDRKSTSGTCQFLGSS

CAN76317.1 hypothetical protein VITISV_012360 [Vitis vinifera]6.1e-21138.06Show/hide
Query:  MDANESITDMFTRFTNIINALKGLGKDYTTSENVRKILRSLPKTWEAKVTAIQEAKDLTKFSLEELIGLLMTHEITMKEHL-EDESKKEKSFALKTIALK
        +D  E+I +M TRF +I+N L+ LGK Y  SE V KILRSLP  W  KVTAIQEAKDLTK  +EELIG LMT+EI + + L E E K +K+  LK     
Subjt:  MDANESITDMFTRFTNIINALKGLGKDYTTSENVRKILRSLPKTWEAKVTAIQEAKDLTKFSLEELIGLLMTHEITMKEHL-EDESKKEKSFALKTIALK

Query:  TISFAVDSEDEDALDQD-DIAYFSRKYKNFIKRKKQFKKQLSTQRE-------SKGEKSK---KDEVICYECKKPGHIRTDCPFLKS-SKKSKKKAMKAT
        T     D E+E   D+D D+A  +RK   F++ ++  +++ +++R+       S G+  K   K E+ C++CKK GHI+ DCP  KS +K+  KKAM AT
Subjt:  TISFAVDSEDEDALDQD-DIAYFSRKYKNFIKRKKQFKKQLSTQRE-------SKGEKSK---KDEVICYECKKPGHIRTDCPFLKS-SKKSKKKAMKAT

Query:  WDDSSESESGSDVEEVARICLMALSDKDGVLDDEVNLEPPSYDELFETFESMQNDLEKLSSKYVVLKKKYNALSSENKSLLDKIACFKNNEHDVQIDEM-
        W++S ES    + +EVA +C MA+ D      DEVN    S + + + FE +  D EKLS K   L KK   L  E + + +K +          IDE+ 
Subjt:  WDDSSESESGSDVEEVARICLMALSDKDGVLDDEVNLEPPSYDELFETFESMQNDLEKLSSKYVVLKKKYNALSSENKSLLDKIACFKNNEHDVQIDEM-

Query:  KVSCDKHACVCDEKNALLDKVRFLEHDSCEKDNLIKLLKEKELNALQELDKAKESIKKLTIGAQKFDKIIEVGKPYGDKRGLGYIDECSTPSSKTIFVKA
        K   +K   +   +N +L K                  K + L +         S+   + G + F+ I+   K   DK+GLG+    +    K  FVK 
Subjt:  KVSCDKHACVCDEKNALLDKVRFLEHDSCEKDNLIKLLKEKELNALQELDKAKESIKKLTIGAQKFDKIIEVGKPYGDKRGLGYIDECSTPSSKTIFVKA

Query:  SSIVPNMPNVVSNDAKSSFVPICHNCGVEGHIRPKCF-----------KLK---------------YAHNTSSRRNFSQRAKFYNAPRE-----NFSKKT
        S             A +S    C+ CG  GHI   C            K K               +  +  SR      +KF    +       F + +
Subjt:  SSIVPNMPNVVSNDAKSSFVPICHNCGVEGHIRPKCF-----------KLK---------------YAHNTSSRRNFSQRAKFYNAPRE-----NFSKKT

Query:  YSCYFSRSNAFNANAKMKWIPKFVNA---NTLGPKQVCLKASK---------------KNRWYLDNGCSRHMTRDRSKF---------------------
              + N  N  + +      V+    N L   Q+C K  K                   ++ + C      + SK+                     
Subjt:  YSCYFSRSNAFNANAKMKWIPKFVNA---NTLGPKQVCLKASK---------------KNRWYLDNGCSRHMTRDRSKF---------------------

Query:  -------ISFSRKDG----------GMDKVCDACQMGKQTKSSFKSKNVISTTRPLQLLHMDLFVPSRIASFGGNYYAFVIVDDFSRFTWVLMIKHKDDA
               IS   KD             DKVC+ACQMGKQ K+SFK+KN I TTRPL+LLHMDLF  SR  S GG  YAFVIVDDFSR+TWVL +  K++A
Subjt:  -------ISFSRKDG----------GMDKVCDACQMGKQTKSSFKSKNVISTTRPLQLLHMDLFVPSRIASFGGNYYAFVIVDDFSRFTWVLMIKHKDDA

Query:  LKSFISLAKRVQNENGFFISTIRSDHGGEFDNDVFKAYCEKKNGFSHNFSSPRTPQQNGVVERKNRTLQEFARSMLHEYGLPNYFWAEAVNTACYVLNRV
           F     +VQNE GF I+ IRSDHG EF+N  F+ YC  ++G +HNFS+PRTPQQNGVVERKNRTLQE AR+ML+E  LP YFWAEAVNT+CYVLNR+
Subjt:  LKSFISLAKRVQNENGFFISTIRSDHGGEFDNDVFKAYCEKKNGFSHNFSSPRTPQQNGVVERKNRTLQEFARSMLHEYGLPNYFWAEAVNTACYVLNRV

Query:  LIKPSLDKTPYELWHDKIPNIGYFKVFGCKCFILNNKEKLGKFDSKTDIGIFLGYSSTSKAYRVFNKRTLLIEESMHVVFDESCNIVSNESIYSDD--LE
        L++P L KTP ELW +K PNI YFKVFGCKCFILN K+ LGKFD+K+D+GIFLGYS++SKA+RVFNKRT+++EES+HV+FDES N +     + DD  LE
Subjt:  LIKPSLDKTPYELWHDKIPNIGYFKVFGCKCFILNNKEKLGKFDSKTDIGIFLGYSSTSKAYRVFNKRTLLIEESMHVVFDESCNIVSNESIYSDD--LE

Query:  KDFEDLLVSEKGKEIVPSMEEVSINEKKE--------------DGSSSMPKEWRYAPSHPKDLILCDPEQGVKTRSSL-NLFSNLAFVSQVEPKSFKDAE
             L + ++ ++     EEV  + KKE              + S  +PK+W++  +HP+D I+ +P  GV+TRSSL N+ +NLAF+SQ+EPK+ KDA 
Subjt:  KDFEDLLVSEKGKEIVPSMEEVSINEKKE--------------DGSSSMPKEWRYAPSHPKDLILCDPEQGVKTRSSL-NLFSNLAFVSQVEPKSFKDAE

Query:  CDEFWILGMQEELNQFERNNVLELVPKPSNASIIGTKWVFRNKMDEHGNIIRNKARLVAQGYCQEEGIDSEETFAPVARLEAIRMLLAFAS---------
         DE WI+ MQEELNQFER+ V EL+P+P N S+IGT+WVFRNKMDE+G IIRNKARLVAQG+ QEEGID EETFAP+ARLEAIRMLLAFAS         
Subjt:  CDEFWILGMQEELNQFERNNVLELVPKPSNASIIGTKWVFRNKMDEHGNIIRNKARLVAQGYCQEEGIDSEETFAPVARLEAIRMLLAFAS---------

Query:  -----------YKNFILYQMDVKSAFL----NGYIIEQVYVE-------------------------QPPGFENFELPSHVYKLKK--------------
                    K F + ++D  + F+    N  ++ Q+YV+                            G   F L   + +LK+              
Subjt:  -----------YKNFILYQMDVKSAFL----NGYIIEQVYVE-------------------------QPPGFENFELPSHVYKLKK--------------

Query:  -------ALYGLKQAPRAWYD-----RLSKIAKTPMRTTTKLDKDEKGKCVDIKIYR---------GYSDADFAGSLLDRKSTSGTCQFLGSSYSVKATG
                LY     P   Y      R     K    +  K         +DI ++          GYSDADFAG  ++RK+TSGTC FLG S     + 
Subjt:  -------ALYGLKQAPRAWYD-----RLSKIAKTPMRTTTKLDKDEKGKCVDIKIYR---------GYSDADFAGSLLDRKSTSGTCQFLGSSYSVKATG

Query:  RIAQIPVSKAFLGRVINALAKPIDGQGEISSSDSRLIE-------SPALGIILRRSVYEPLQTGLIAIDSIIPIRRGQR-ELIIGERQTGKTLADIFTKL
        +   + +S A    +  +L        + + SD  LI        +    I + +++ +  +T  I I         Q+ ++ +    T   LADIFTK 
Subjt:  RIAQIPVSKAFLGRVINALAKPIDGQGEISSSDSRLIE-------SPALGIILRRSVYEPLQTGLIAIDSIIPIRRGQR-ELIIGERQTGKTLADIFTKL

Query:  LNEESFCKNRLELGII
        L+EE F   R +LG+I
Subjt:  LNEESFCKNRLELGII

XP_024024455.1 uncharacterized protein LOC112092461 [Morus notabilis]4.7e-21944.26Show/hide
Query:  MDANESITDMFTRFTNIINALKGLGKDYTTSENVRKILRSLPKTWEAKVTAIQEAKDLTKFSLEELIGLLMTHEITMKEHLEDESKKEKSFALKTIALKT
        M  +ESIT M+TRFT+I+N L  L + +  SE V KI+RSLPK+W  K T ++E K L+  SL++LIG LMTHE++  +  E++ KK+K  ALK    + 
Subjt:  MDANESITDMFTRFTNIINALKGLGKDYTTSENVRKILRSLPKTWEAKVTAIQEAKDLTKFSLEELIGLLMTHEITMKEHLEDESKKEKSFALKTIALKT

Query:  ISFAVDSEDEDALDQDDIAYFSRKYKNFIKRKKQFKKQLSTQRESKGEKSKKDEVICYECKKPGHIRTDCPFLKSSKKSK-----KKAMKATWDDSSESE
        ++   + ++ D LD++ I+ F++KYK F  +  Q K   S + E +G+K K+DE+ICY+CKKPGH RT+CP  +S+KKS      KK ++ATWD+    E
Subjt:  ISFAVDSEDEDALDQDDIAYFSRKYKNFIKRKKQFKKQLSTQRESKGEKSKKDEVICYECKKPGHIRTDCPFLKSSKKSK-----KKAMKATWDDSSESE

Query:  SGSD-VEEVARICLMALSDKDGVLDDEVNLEPPSYDELFETFESMQNDLEKLSSKYVVLKKKYNALSSENKSLLDKIACFKNNEHDVQIDEMKVSCDKHA
        S SD  EE+A +C MA+ D +   + ++++E  S             D+E   S+   L ++Y  L+  N S            H   I+ +K    +  
Subjt:  SGSD-VEEVARICLMALSDKDGVLDDEVNLEPPSYDELFETFESMQNDLEKLSSKYVVLKKKYNALSSENKSLLDKIACFKNNEHDVQIDEMKVSCDKHA

Query:  CVCDEKNALLDKVRFLEHDSCEKDNLIKLLKEKELNALQELDKAKESIKKLTIGAQKFDKIIEVGKPYGDKRGLGYIDECSTPSSKTIFVKA--SSIVPN
         + +EK+ +L          CE  +    LKEK       +D   + I K T G + FD+++   +   +K G+GY            F+ +  S  +  
Subjt:  CVCDEKNALLDKVRFLEHDSCEKDNLIKLLKEKELNALQELDKAKESIKKLTIGAQKFDKIIEVGKPYGDKRGLGYIDECSTPSSKTIFVKA--SSIVPN

Query:  MPNVVSNDAKSSFVPICHNCGVEGHIRPKCFKLKYAHNTSSRRNFSQRAKFYNAPRENFSKKTYSCYFSRSNAFNANAKMKWIPKF---------VNANT
          +++ N  + +   +      +G I         A             K           K Y   F  S     N     +            ++   
Subjt:  MPNVVSNDAKSSFVPICHNCGVEGHIRPKCFKLKYAHNTSSRRNFSQRAKFYNAPRENFSKKTYSCYFSRSNAFNANAKMKWIPKF---------VNANT

Query:  LGPKQVCLKASKKNRWYLDNGCSRHMTRDRSKFISFSRKD--GGM-------DKVCDACQMGKQTKSSFKSKNVISTTRPLQLLHMDLFVPSRIASFGGN
        L  K  CL     + W              SK    SRK+   G+       DK+CDACQ GKQ ++SFKS   ISTTRPLQLLHMDLF PSR  S GG 
Subjt:  LGPKQVCLKASKKNRWYLDNGCSRHMTRDRSKFISFSRKD--GGM-------DKVCDACQMGKQTKSSFKSKNVISTTRPLQLLHMDLFVPSRIASFGGN

Query:  YYAFVIVDDFSRFTWVLMIKHKDDALKSFISLAKRVQNENGFFISTIRSDHGGEFDNDVFKAYCEKKNGFSHNFSSPRTPQQNGVVERKNRTLQEFARSM
        +YAFVIVDDFSR+TWV+ +  KD+ALK+F+   KRVQNE G+ I+TI+SDHGGEFDND F+  C  +NG+ HNFS+PRTPQQNGVVERKNR LQE ARSM
Subjt:  YYAFVIVDDFSRFTWVLMIKHKDDALKSFISLAKRVQNENGFFISTIRSDHGGEFDNDVFKAYCEKKNGFSHNFSSPRTPQQNGVVERKNRTLQEFARSM

Query:  LHEYGLPNYFWAEAVNTACYVLNRVLIKPSLDKTPYELWHDKIPNIGYFKVFGCKCFILNNKEKLGKFDSKTDIGIFLGYSSTSKAYRVFNKRTLLIEES
        L+E GLP YFWAEAVNT+CY++NRVLI+P +DKTPYELW  + PNIGYF+VFGCKCFILN K+ LGKFD+K+D+GIFLGYS+TSKAYRVFNKRTL++EES
Subjt:  LHEYGLPNYFWAEAVNTACYVLNRVLIKPSLDKTPYELWHDKIPNIGYFKVFGCKCFILNNKEKLGKFDSKTDIGIFLGYSSTSKAYRVFNKRTLLIEES

Query:  MHVVFDESCNIVSNESIYSDDL------------EKDFEDLLVSEKGKEIVPSMEEVSINEKKEDGSSS-MPKEWRYAPSHPKDLILCDPEQGVKTRSSL
        MHVVFDE+ N    +S+  DDL            +K+  D    E      P  +E  +N+ K+D +S+ +PKEWRY+ SHPKD                
Subjt:  MHVVFDESCNIVSNESIYSDDL------------EKDFEDLLVSEKGKEIVPSMEEVSINEKKEDGSSS-MPKEWRYAPSHPKDLILCDPEQGVKTRSSL

Query:  NLFSNLAFVSQVEPKSFKDAECDEFWILGMQEELNQFERNNVLELVPKPSNASIIGTKWVFRNKMDEHGNIIRNKARLVAQGYCQEEGIDSEETFAPVAR
                                  IL MQEELNQFER++V +LVP+PS  S+IGTKWVFRNK DEHG I+RNKARLVAQG+ QEEGID EETFAPVAR
Subjt:  NLFSNLAFVSQVEPKSFKDAECDEFWILGMQEELNQFERNNVLELVPKPSNASIIGTKWVFRNKMDEHGNIIRNKARLVAQGYCQEEGIDSEETFAPVAR

Query:  LEAIRMLLAFASYKNFILYQMDVKSAFLNGYIIEQVYVEQPPGFENFELPSHVYKLKKALYG
        LE+IRMLLAFAS+K F LYQMDVKSAFLNGYI+E+VYVEQPPGF++ + P H+++ K  + G
Subjt:  LEAIRMLLAFASYKNFILYQMDVKSAFLNGYIIEQVYVEQPPGFENFELPSHVYKLKKALYG

XP_031741720.1 uncharacterized protein LOC116403915 [Cucumis sativus]1.8e-20778.03Show/hide
Query:  MDANESITDMFTRFTNIINALKGLGKDYTTSENVRKILRSLPKTWEAKVTAIQEAKDLTKFSLEELIGLLMTHEITMKEHLEDESKKEKSFALKTIALKT
        MDANE+ITDMFTRFTNIINALKGLGK YTTSENVRKILRSLPKTWEAKVTAIQEAKDLTK  LEELIG LMTHEI MKEHLEDESKK+KS     IALKT
Subjt:  MDANESITDMFTRFTNIINALKGLGKDYTTSENVRKILRSLPKTWEAKVTAIQEAKDLTKFSLEELIGLLMTHEITMKEHLEDESKKEKSFALKTIALKT

Query:  ISFAVDSEDEDALDQDDIAYFSRKYKNFIKRKKQFKKQLSTQRESKGEKSKKDEVICYECKKPGHIRTDCPFLKSSKKSKKKAMKATWDDSSESESGSDV
        IS  VD EDED LD+DDIAYFSRKYKNFIKRKK FKK LSTQ+ESKGEKSKKDEVICYECK+ GHIRTDCP LKSSKKSKKKAMKATWDDSSESE  S+V
Subjt:  ISFAVDSEDEDALDQDDIAYFSRKYKNFIKRKKQFKKQLSTQRESKGEKSKKDEVICYECKKPGHIRTDCPFLKSSKKSKKKAMKATWDDSSESESGSDV

Query:  EEVARICLMALSDKDGVLDDEVNLEPPSYDELFETFESMQNDLEKLSSKYVVLKKKYNALSSENKSLLDKIACFKNNEHDVQIDEMKVSCDKHACVCDEK
        EE+A + LMA SDKD   DD+V LEP S DELFE FESMQNDLEKLSSKYVVLKKKYN L SENKSLLD IACFK NE+  QI+E+ VS DKH C+  EK
Subjt:  EEVARICLMALSDKDGVLDDEVNLEPPSYDELFETFESMQNDLEKLSSKYVVLKKKYNALSSENKSLLDKIACFKNNEHDVQIDEMKVSCDKHACVCDEK

Query:  NALLDKVRFLEHDSCEKDNLIKLLKEKELNALQELDKAKESIKKLTIGAQKFDKIIEVGKPYGDKRGLGYIDECSTP-SSKTIFVKASSIVP--NMPNVV
        +ALLDKVRFLEHDSCEKDNLIK+LKE EL+ LQELDKAKE+IKKLTIGAQ+ DKIIEVGK YGDKRGLGYIDE STP SSKT FVKAS IVP  NM N V
Subjt:  NALLDKVRFLEHDSCEKDNLIKLLKEKELNALQELDKAKESIKKLTIGAQKFDKIIEVGKPYGDKRGLGYIDECSTP-SSKTIFVKASSIVP--NMPNVV

Query:  SNDAKSSFVPICHNCGVEGHIRPKCFKLKYAHNTSSRRNFSQRAKFYNAPRENFSKKT---------------------------YSCYFSRSNAFNANA
        SN  KSSFVPICHNCGVEGHIRPKCFKLKYA NT SRRNFSQRAKFY APR+NFS K+                           YSCY SRS+A N   
Subjt:  SNDAKSSFVPICHNCGVEGHIRPKCFKLKYAHNTSSRRNFSQRAKFYNAPRENFSKKT---------------------------YSCYFSRSNAFNANA

Query:  KMKWIPKFVNANTLGPKQV
        KMKWIPK+VNAN LGPKQV
Subjt:  KMKWIPKFVNANTLGPKQV

TrEMBL top hitse value%identityAlignment
A0A151RY83 Retrovirus-related Pol polyprotein from transposon TNT 1-942.6e-20740.2Show/hide
Query:  MDANESITDMFTRFTNIINALKGLGKDYTTSENVRKILRSLPKTWEAKVTAIQEAKDLTKFSLEELIGLLMTHEITMKEHLEDESKKEKSFALKTIALKT
        M   ESI+D   RFT++IN L  LG+ +   E   K+L+ L +TW+AKVTAI+E+KDLT  +L  L G L  HE   K H+ DE +++            
Subjt:  MDANESITDMFTRFTNIINALKGLGKDYTTSENVRKILRSLPKTWEAKVTAIQEAKDLTKFSLEELIGLLMTHEITMKEHLEDESKKEKSFALKTIALKT

Query:  ISFAVDSEDEDALDQDDIAYFSRKYKNFIKRKKQFKKQLSTQRESKGEKSKKDEVICYECKKPGHIRTDCPFL--------KSSKKSKKKAMKATWDDS-
                              +K++ FI+RK+    + + ++ +K E++      C+EC KPGH++ DCP +        K  K +K K     W+D+ 
Subjt:  ISFAVDSEDEDALDQDDIAYFSRKYKNFIKRKKQFKKQLSTQRESKGEKSKKDEVICYECKKPGHIRTDCPFL--------KSSKKSKKKAMKATWDDS-

Query:  SESESGSDVEEVARICLMALSDKDGVLDDEVNLEPPSYDELFETFESMQNDLEKLSSKYVVLKKKYNALSSENKSLLDKIACFKNNEHDVQIDEMKVSCD
        S + SG    E   +CLMA + +D ++ D      P YD+L E F  +  +  KL +  + LK K     +       K    K+   D+Q   M    +
Subjt:  SESESGSDVEEVARICLMALSDKDGVLDDEVNLEPPSYDELFETFESMQNDLEKLSSKYVVLKKKYNALSSENKSLLDKIACFKNNEHDVQIDEMKVSCD

Query:  KHACVCDEKNALLDKVRFLEHDSCEKDNLIKLLKEKELNALQELDKAKESIKKLTIGAQKFDKIIEVGKPYGDKRGLGYIDECSTP--SSKTIFVKASSI
         H+                  D CE   +     EKE NA       K ++ + T G    + ++   +   D+ GLG+    S P  S K  FVK+   
Subjt:  KHACVCDEKNALLDKVRFLEHDSCEKDNLIKLLKEKELNALQELDKAKESIKKLTIGAQKFDKIIEVGKPYGDKRGLGYIDECSTP--SSKTIFVKASSI

Query:  VPNMPNVVSNDAKSSFVPICHNCGVEGHIRPKCFKLKYAHNTSSRRNFSQRAKFYNAPRENFSKKTYSCYFSRSNAFNANAK--MKWIPKFVNANTLGPK
          N   + SN +K +    C  C   GH    C+  K+   +                           +  +S     N K    W+PKF         
Subjt:  VPNMPNVVSNDAKSSFVPICHNCGVEGHIRPKCFKLKYAHNTSSRRNFSQRAKFYNAPRENFSKKTYSCYFSRSNAFNANAK--MKWIPKFVNANTLGPK

Query:  QVCLKASKKNRWYLDNGCSRHMTRDRSKFISFSRK------------------------------------DG---------------------------
           L  +K   WYLD+GCSRHMT D SKF S   K                                    DG                           
Subjt:  QVCLKASKKNRWYLDNGCSRHMTRDRSKFISFSRK------------------------------------DG---------------------------

Query:  --------------------------------------------------------------------------GMDKVCDACQMGKQTKSSFKSKNVIS
                                                                                  G DK+CDACQ GKQ K+SFKSKN IS
Subjt:  --------------------------------------------------------------------------GMDKVCDACQMGKQTKSSFKSKNVIS

Query:  TTRPLQLLHMDLFVPSRIASFGGNYYAFVIVDDFSRFTWVLMIKHKDDALKSFISLAKRVQNENGFFISTIRSDHGGEFDNDVFKAYCEKKNGFSHNFSS
        T+RPLQL+HMDLF PSR  S GGNYY  VIVDD+SR+TWV+ + +K+DA  +F   AK VQNE    I++IRSDHGGEF N +F+ +CE ++G +HNFS+
Subjt:  TTRPLQLLHMDLFVPSRIASFGGNYYAFVIVDDFSRFTWVLMIKHKDDALKSFISLAKRVQNENGFFISTIRSDHGGEFDNDVFKAYCEKKNGFSHNFSS

Query:  PRTPQQNGVVERKNRTLQEFARSMLHEYGLPNYFWAEAVNTACYVLNRVLIKPSLDKTPYELWHDKIPNIGYFKVFGCKCFILNN-KEKLGKFDSKTDIG
        PRTPQQNGVVERKNR+L+E AR+ML+E  LP YFWA+A+NTAC+VLNRVLI+P L KTPYE++  K PNI YF+VFGCKC++LNN KE+LGKFD+K D  
Subjt:  PRTPQQNGVVERKNRTLQEFARSMLHEYGLPNYFWAEAVNTACYVLNRVLIKPSLDKTPYELWHDKIPNIGYFKVFGCKCFILNN-KEKLGKFDSKTDIG

Query:  IFLGYSSTSKAYRVFNKRTLLIEESMHVVFDESCNIVSNESIYSDDLEKDFEDLLVSEKGKEIVPSMEEVSINEKKEDGSSSMPKEWRYAPSHPKDLILC
        IFLGYS+ SK+YR++NKRTL++EES+HVVFDES    + ++   D  E   + LL SE  ++   S E VS +EK++     +PKEW+ +     D I+ 
Subjt:  IFLGYSSTSKAYRVFNKRTLLIEESMHVVFDESCNIVSNESIYSDDLEKDFEDLLVSEKGKEIVPSMEEVSINEKKEDGSSSMPKEWRYAPSHPKDLILC

Query:  DPEQGVKTRSSL-NLFSNLAFVSQVEPKSFKDAECDEFWILGMQEELNQFERNNVLELVPKPSNASIIGTKWVFRNKMDEHGNIIRNKARLVAQGYCQEE
        +  +GV TRS++ N+ + +AFVSQVEPK+  +A  DE W++ MQEELNQFERN V +LVP P +  IIGTKWVFRNK+DE G I+RNKARLVA+GY QEE
Subjt:  DPEQGVKTRSSL-NLFSNLAFVSQVEPKSFKDAECDEFWILGMQEELNQFERNNVLELVPKPSNASIIGTKWVFRNKMDEHGNIIRNKARLVAQGYCQEE

Query:  GIDSEETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIIEQVYVEQPPGFENFELPSHVYKLKKALYGLKQAPRAWYDRLSK
        GID +ETFAPVAR+EAIR+LLA++S KNF LYQMDVKSAFLNG I E+VYVEQPPGF +F+ P+HVYKLKKALYGLKQAPR+WYDRLSK
Subjt:  GIDSEETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIIEQVYVEQPPGFENFELPSHVYKLKKALYGLKQAPRAWYDRLSK

A0A151SK87 Retrovirus-related Pol polyprotein from transposon TNT 1-942.9e-20641.76Show/hide
Query:  MDANESITDMFTRFTNIINALKGLGKDYTTSENVRKILRSLPKTWEAKVTAIQEAKDLTKFSLEELIGLLMTHEITMKEHLEDESKKEKSFALKTIALKT
        M+  E++ DM  RF +IIN LKGLGKD+   E   KIL+SL + W+  VTAI E+K+L + +  EL G L  +E+ +    ++E K +K+   K +ALK 
Subjt:  MDANESITDMFTRFTNIINALKGLGKDYTTSENVRKILRSLPKTWEAKVTAIQEAKDLTKFSLEELIGLLMTHEITMKEHLEDESKKEKSFALKTIALKT

Query:  ISFAVDSED--EDALDQDDIAYFSRKYKNFIKRKKQFKKQLSTQRESKGEKSKKDEVICYECKKPGHIRTDCPFLKSSK------KSKKKAMKATWDD-S
           + D ED   ++ +Q+++    + ++ F++RK   K+  S ++  K  ++   +  C+EC K GH+R DCP LK ++      +SKKK     W++ +
Subjt:  ISFAVDSED--EDALDQDDIAYFSRKYKNFIKRKKQFKKQLSTQRESKGEKSKKDEVICYECKKPGHIRTDCPFLKSSK------KSKKKAMKATWDD-S

Query:  SESESGSDVEEVARICLMALSDKD-GVLDDEVNLEPPSYDELFETFESMQNDLEKLSSKYVVLKKKYNALSSENKSLLDKIACFKNNEHDV--QIDEMKV
        S + S SD ++ A +CLM   + D  V D + +++  SYDEL   F  +  + +KL     V KKK          + DKI+  +N+   +   I ++K 
Subjt:  SESESGSDVEEVARICLMALSDKD-GVLDDEVNLEPPSYDELFETFESMQNDLEKLSSKYVVLKKKYNALSSENKSLLDKIACFKNNEHDV--QIDEMKV

Query:  SCDKHACVCDEKNALLDKVRFLEHDSCEKDNLIKLLKEKELNALQELDKAKESIKKLTIGAQKFDKIIEVGKPYGDKRGLGYIDECST--PSSKTIFVKA
         C+     C+   A       L+    EK N + ++K    ++     K+  +  +  I  +KF      G P G  + +  +D   T  P  K  +   
Subjt:  SCDKHACVCDEKNALLDKVRFLEHDSCEKDNLIKLLKEKELNALQELDKAKESIKKLTIGAQKFDKIIEVGKPYGDKRGLGYIDECST--PSSKTIFVKA

Query:  SSIVPNMPNVVSNDAKSSFVPICHNCG--VEGHIRPKCFKLK--------YAHNTSSR-----------RNFSQRAKFYNAPRENFSKKTYSCYFSRSNA
           VP +   V   AK S   +   C   + G  + K   L+        Y  N   R               +   +    + N    +  C  + + +
Subjt:  SSIVPNMPNVVSNDAKSSFVPICHNCG--VEGHIRPKCFKLK--------YAHNTSSR-----------RNFSQRAKFYNAPRENFSKKTYSCYFSRSNA

Query:  FNANAKMKWIPKFVNANTLGPKQV---------------CLKASKKNRWYLDNGCSRHMTRDRSKFISFSRKDGGMDKV-------CDACQMGKQTKSSF
        FN    M    K  N   L  K+V               CL +     W      + H+  D+   ++      G+ K+       CDACQ GKQT++SF
Subjt:  FNANAKMKWIPKFVNANTLGPKQV---------------CLKASKKNRWYLDNGCSRHMTRDRSKFISFSRKDGGMDKV-------CDACQMGKQTKSSF

Query:  KSKNVISTTRPLQLLHMDLFVPSRIASFGGNYYAFVIVDDFSRFTWVLMIKHKDDALKSFISLAKRVQNENGFFISTIRSDHGGEFDNDVFKAYCEKKNG
        KSK V+ST+RPL+LLHMDLF PSR  S GGN YA V++DD+SRFTWV  I HKD+A K+F  ++KR+QNE    I ++RSDHGGEF N+ F+ +CE +NG
Subjt:  KSKNVISTTRPLQLLHMDLFVPSRIASFGGNYYAFVIVDDFSRFTWVLMIKHKDDALKSFISLAKRVQNENGFFISTIRSDHGGEFDNDVFKAYCEKKNG

Query:  FSHNFSSPRTPQQNGVVERKNRTLQEFARSMLHEYGLPNYFWAEAVNTACYVLNRVLIKPSLDKTPYELWHDKIPNIGYFKVFGCKCFILNN-KEKLGKF
         +HNFS PRTPQQNGVVERKNR+L E AR+ML+E GLP YFWA+AVNTACYVLNR+LI+P L KTPYEL+  + P+I +FKVFGCKCF+LNN K+ LGKF
Subjt:  FSHNFSSPRTPQQNGVVERKNRTLQEFARSMLHEYGLPNYFWAEAVNTACYVLNRVLIKPSLDKTPYELWHDKIPNIGYFKVFGCKCFILNN-KEKLGKF

Query:  DSKTDIGIFLGYSSTSKAYRVFNKRTLLIEESMHVVFDESCNIVSNESIYSDDLEKDFEDLLVSEKGKEIVPSMEEVSINEKKEDGSSSMPKEWRYAPSH
        D+K D G+F+GYS+ SKAYRVFNK TL +EES+HV FDE+  +   +S+  +D           E G  I    E++ + +K     +S+PKEWR     
Subjt:  DSKTDIGIFLGYSSTSKAYRVFNKRTLLIEESMHVVFDESCNIVSNESIYSDDLEKDFEDLLVSEKGKEIVPSMEEVSINEKKEDGSSSMPKEWRYAPSH

Query:  PKDLILCDPEQGVKTRSSLNLFS-NLAFVSQVEPKSFKDAECDEFWILGMQEELNQFERNNVLELVPKPSNASIIGTKWVFRNKMDEHGNIIRNKARLVA
          D IL D  +GV TR S NL S ++AFVSQ+EP   + A  DEFWI+ M +ELNQF+RN+V  LVP   N +IIGTKWVFRNK++E G I++NKARLVA
Subjt:  PKDLILCDPEQGVKTRSSLNLFS-NLAFVSQVEPKSFKDAECDEFWILGMQEELNQFERNNVLELVPKPSNASIIGTKWVFRNKMDEHGNIIRNKARLVA

Query:  QGYCQEEGIDSEETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIIEQVYVEQPPGFENFELPSHVYKLKKALYGLKQAPRAWYDRLSKIAKT
        +GY Q+EGID  ET+APVARLEA+R+LLAFA   +F LYQMDVKSAFLNG I E+VYV QPPGF + +LP HVYKLKKALYGLKQAPR WY+RLSK   T
Subjt:  QGYCQEEGIDSEETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIIEQVYVEQPPGFENFELPSHVYKLKKALYGLKQAPRAWYDRLSKIAKT

Query:  PMRTTTKLDKD--EKGKCVDIKIYRGYSDADFAGS
               +DK    K K  DI + + Y D    GS
Subjt:  PMRTTTKLDKD--EKGKCVDIKIYRGYSDADFAGS

A5B1P9 Uncharacterized protein3.0e-21138.06Show/hide
Query:  MDANESITDMFTRFTNIINALKGLGKDYTTSENVRKILRSLPKTWEAKVTAIQEAKDLTKFSLEELIGLLMTHEITMKEHL-EDESKKEKSFALKTIALK
        +D  E+I +M TRF +I+N L+ LGK Y  SE V KILRSLP  W  KVTAIQEAKDLTK  +EELIG LMT+EI + + L E E K +K+  LK     
Subjt:  MDANESITDMFTRFTNIINALKGLGKDYTTSENVRKILRSLPKTWEAKVTAIQEAKDLTKFSLEELIGLLMTHEITMKEHL-EDESKKEKSFALKTIALK

Query:  TISFAVDSEDEDALDQD-DIAYFSRKYKNFIKRKKQFKKQLSTQRE-------SKGEKSK---KDEVICYECKKPGHIRTDCPFLKS-SKKSKKKAMKAT
        T     D E+E   D+D D+A  +RK   F++ ++  +++ +++R+       S G+  K   K E+ C++CKK GHI+ DCP  KS +K+  KKAM AT
Subjt:  TISFAVDSEDEDALDQD-DIAYFSRKYKNFIKRKKQFKKQLSTQRE-------SKGEKSK---KDEVICYECKKPGHIRTDCPFLKS-SKKSKKKAMKAT

Query:  WDDSSESESGSDVEEVARICLMALSDKDGVLDDEVNLEPPSYDELFETFESMQNDLEKLSSKYVVLKKKYNALSSENKSLLDKIACFKNNEHDVQIDEM-
        W++S ES    + +EVA +C MA+ D      DEVN    S + + + FE +  D EKLS K   L KK   L  E + + +K +          IDE+ 
Subjt:  WDDSSESESGSDVEEVARICLMALSDKDGVLDDEVNLEPPSYDELFETFESMQNDLEKLSSKYVVLKKKYNALSSENKSLLDKIACFKNNEHDVQIDEM-

Query:  KVSCDKHACVCDEKNALLDKVRFLEHDSCEKDNLIKLLKEKELNALQELDKAKESIKKLTIGAQKFDKIIEVGKPYGDKRGLGYIDECSTPSSKTIFVKA
        K   +K   +   +N +L K                  K + L +         S+   + G + F+ I+   K   DK+GLG+    +    K  FVK 
Subjt:  KVSCDKHACVCDEKNALLDKVRFLEHDSCEKDNLIKLLKEKELNALQELDKAKESIKKLTIGAQKFDKIIEVGKPYGDKRGLGYIDECSTPSSKTIFVKA

Query:  SSIVPNMPNVVSNDAKSSFVPICHNCGVEGHIRPKCF-----------KLK---------------YAHNTSSRRNFSQRAKFYNAPRE-----NFSKKT
        S             A +S    C+ CG  GHI   C            K K               +  +  SR      +KF    +       F + +
Subjt:  SSIVPNMPNVVSNDAKSSFVPICHNCGVEGHIRPKCF-----------KLK---------------YAHNTSSRRNFSQRAKFYNAPRE-----NFSKKT

Query:  YSCYFSRSNAFNANAKMKWIPKFVNA---NTLGPKQVCLKASK---------------KNRWYLDNGCSRHMTRDRSKF---------------------
              + N  N  + +      V+    N L   Q+C K  K                   ++ + C      + SK+                     
Subjt:  YSCYFSRSNAFNANAKMKWIPKFVNA---NTLGPKQVCLKASK---------------KNRWYLDNGCSRHMTRDRSKF---------------------

Query:  -------ISFSRKDG----------GMDKVCDACQMGKQTKSSFKSKNVISTTRPLQLLHMDLFVPSRIASFGGNYYAFVIVDDFSRFTWVLMIKHKDDA
               IS   KD             DKVC+ACQMGKQ K+SFK+KN I TTRPL+LLHMDLF  SR  S GG  YAFVIVDDFSR+TWVL +  K++A
Subjt:  -------ISFSRKDG----------GMDKVCDACQMGKQTKSSFKSKNVISTTRPLQLLHMDLFVPSRIASFGGNYYAFVIVDDFSRFTWVLMIKHKDDA

Query:  LKSFISLAKRVQNENGFFISTIRSDHGGEFDNDVFKAYCEKKNGFSHNFSSPRTPQQNGVVERKNRTLQEFARSMLHEYGLPNYFWAEAVNTACYVLNRV
           F     +VQNE GF I+ IRSDHG EF+N  F+ YC  ++G +HNFS+PRTPQQNGVVERKNRTLQE AR+ML+E  LP YFWAEAVNT+CYVLNR+
Subjt:  LKSFISLAKRVQNENGFFISTIRSDHGGEFDNDVFKAYCEKKNGFSHNFSSPRTPQQNGVVERKNRTLQEFARSMLHEYGLPNYFWAEAVNTACYVLNRV

Query:  LIKPSLDKTPYELWHDKIPNIGYFKVFGCKCFILNNKEKLGKFDSKTDIGIFLGYSSTSKAYRVFNKRTLLIEESMHVVFDESCNIVSNESIYSDD--LE
        L++P L KTP ELW +K PNI YFKVFGCKCFILN K+ LGKFD+K+D+GIFLGYS++SKA+RVFNKRT+++EES+HV+FDES N +     + DD  LE
Subjt:  LIKPSLDKTPYELWHDKIPNIGYFKVFGCKCFILNNKEKLGKFDSKTDIGIFLGYSSTSKAYRVFNKRTLLIEESMHVVFDESCNIVSNESIYSDD--LE

Query:  KDFEDLLVSEKGKEIVPSMEEVSINEKKE--------------DGSSSMPKEWRYAPSHPKDLILCDPEQGVKTRSSL-NLFSNLAFVSQVEPKSFKDAE
             L + ++ ++     EEV  + KKE              + S  +PK+W++  +HP+D I+ +P  GV+TRSSL N+ +NLAF+SQ+EPK+ KDA 
Subjt:  KDFEDLLVSEKGKEIVPSMEEVSINEKKE--------------DGSSSMPKEWRYAPSHPKDLILCDPEQGVKTRSSL-NLFSNLAFVSQVEPKSFKDAE

Query:  CDEFWILGMQEELNQFERNNVLELVPKPSNASIIGTKWVFRNKMDEHGNIIRNKARLVAQGYCQEEGIDSEETFAPVARLEAIRMLLAFAS---------
         DE WI+ MQEELNQFER+ V EL+P+P N S+IGT+WVFRNKMDE+G IIRNKARLVAQG+ QEEGID EETFAP+ARLEAIRMLLAFAS         
Subjt:  CDEFWILGMQEELNQFERNNVLELVPKPSNASIIGTKWVFRNKMDEHGNIIRNKARLVAQGYCQEEGIDSEETFAPVARLEAIRMLLAFAS---------

Query:  -----------YKNFILYQMDVKSAFL----NGYIIEQVYVE-------------------------QPPGFENFELPSHVYKLKK--------------
                    K F + ++D  + F+    N  ++ Q+YV+                            G   F L   + +LK+              
Subjt:  -----------YKNFILYQMDVKSAFL----NGYIIEQVYVE-------------------------QPPGFENFELPSHVYKLKK--------------

Query:  -------ALYGLKQAPRAWYD-----RLSKIAKTPMRTTTKLDKDEKGKCVDIKIYR---------GYSDADFAGSLLDRKSTSGTCQFLGSSYSVKATG
                LY     P   Y      R     K    +  K         +DI ++          GYSDADFAG  ++RK+TSGTC FLG S     + 
Subjt:  -------ALYGLKQAPRAWYD-----RLSKIAKTPMRTTTKLDKDEKGKCVDIKIYR---------GYSDADFAGSLLDRKSTSGTCQFLGSSYSVKATG

Query:  RIAQIPVSKAFLGRVINALAKPIDGQGEISSSDSRLIE-------SPALGIILRRSVYEPLQTGLIAIDSIIPIRRGQR-ELIIGERQTGKTLADIFTKL
        +   + +S A    +  +L        + + SD  LI        +    I + +++ +  +T  I I         Q+ ++ +    T   LADIFTK 
Subjt:  RIAQIPVSKAFLGRVINALAKPIDGQGEISSSDSRLIE-------SPALGIILRRSVYEPLQTGLIAIDSIIPIRRGQR-ELIIGERQTGKTLADIFTKL

Query:  LNEESFCKNRLELGII
        L+EE F   R +LG+I
Subjt:  LNEESFCKNRLELGII

A5BSC1 Uncharacterized protein (Fragment)2.0e-20741.31Show/hide
Query:  MFTRFTNIINALKGLGKDYTTSENVRKILRSLPKTWEAKVTAIQEAKDLTKFSLEELIGLLMTHEITMKEHL-EDESKKEKSFALKTIALKTISFAVDSE
        +  RFT I+N L+ LG+    SE V KILRSLP  W  KVTAIQEAKDLTK  +EEL+G LMT+EI++ + L E E KK+KS A  + +  T       E
Subjt:  MFTRFTNIINALKGLGKDYTTSENVRKILRSLPKTWEAKVTAIQEAKDLTKFSLEELIGLLMTHEITMKEHL-EDESKKEKSFALKTIALKTISFAVDSE

Query:  DEDALDQDDIAYFSRKYKNFIK----RKKQFKKQLSTQR---ESKGEKSK---KDEVICYECKKPGHIRTDCPFLK-SSKKSKKKAMKATWDDSSESESG
        ++ + + DD+A  +RK   +++    R K+F  + +  R    S G+K K   K ++IC++CKKPGHI+ DCP  K  +K+  KKAM ATW +S ES   
Subjt:  DEDALDQDDIAYFSRKYKNFIK----RKKQFKKQLSTQR---ESKGEKSK---KDEVICYECKKPGHIRTDCPFLK-SSKKSKKKAMKATWDDSSESESG

Query:  SDVEEVARICLMALSDKDGVLDDEVNLEPPSYDELFETFESMQNDLEKLSSKYVVLKKKYNALSSENKSLLDKIACFKNNEHDVQIDEMKVSCDKHACVC
           +EVA +C MA+ D D                                             S E+K  LD                    C +H    
Subjt:  SDVEEVARICLMALSDKDGVLDDEVNLEPPSYDELFETFESMQNDLEKLSSKYVVLKKKYNALSSENKSLLDKIACFKNNEHDVQIDEMKVSCDKHACVC

Query:  DEKNALLDKVRFLEHDSCEKDNLIKLLKEKELNALQELDKAKESIKKLTIGAQKFDKIIEVGKPYGDKRGLGYIDECSTPSSKTIFVKASSIVPNMPNVV
        + K A L K          K   +                        T G     +II          G G+         K IF  +  I+ ++ N  
Subjt:  DEKNALLDKVRFLEHDSCEKDNLIKLLKEKELNALQELDKAKESIKKLTIGAQKFDKIIEVGKPYGDKRGLGYIDECSTPSSKTIFVKASSIVPNMPNVV

Query:  SNDAKSSFVPICHNCGVEGHIRPKCFKLKYAHNTSSRRNFSQRAKFYNAPRENFSKKTYSCYFSRSNAFNANAKMKWIPKFVNANTLGPKQVCLKASKKN
            K+ F+   H C                                    EN      S Y      F++     W+        LG   + L  S+ N
Subjt:  SNDAKSSFVPICHNCGVEGHIRPKCFKLKYAHNTSSRRNFSQRAKFYNAPRENFSKKTYSCYFSRSNAFNANAKMKWIPKFVNANTLGPKQVCLKASKKN

Query:  RWYLDNGCSRHMTRDRSKFISFSRKDGGMDKVCDACQMGKQTKSSFKSKNVISTTRPLQLLHMDLFVPSRIASFGGNYYAFVIVDDFSRFTWVLMIKHKD
        +  L  G  +         I+F +     DK+C+ACQMGKQ K+SFK+KN IST+RPL+LLHMDLF PSR  S GG  YA+VIVDDFSR+TWVL +  K 
Subjt:  RWYLDNGCSRHMTRDRSKFISFSRKDGGMDKVCDACQMGKQTKSSFKSKNVISTTRPLQLLHMDLFVPSRIASFGGNYYAFVIVDDFSRFTWVLMIKHKD

Query:  DALKSFISLAKRVQNENGFFISTIRSDHGGEFDNDVFKAYCEKKNGFSHNFSSPRTPQQNGVVERKNRTLQEFARSMLHEYGLPNYFWAEAVNTACYVLN
        +A   F     +VQNE GF I+ IRSDHG EF+N  F+ YC  K+G +HNFS+PRTPQQNGVVERKNRTLQE AR+ML+E  LP YFWAEAVNT+CYVLN
Subjt:  DALKSFISLAKRVQNENGFFISTIRSDHGGEFDNDVFKAYCEKKNGFSHNFSSPRTPQQNGVVERKNRTLQEFARSMLHEYGLPNYFWAEAVNTACYVLN

Query:  RVLIKPSLDKTPYELWHDKIPNIGYFKVFGCKCFILNNKEKLGKFDSKTDIGIFLGYSSTSKAYRVFNKRTLLIEESMHVVFDESCNIVSNESIYSDD--
        R+L++P L KTPYELW +K PNI YFKVFGCKCFILN K+ LGKFD+K+D+GIFLGYS++SKA+RVFNKRT+++EES+HV+FDES N +       DD  
Subjt:  RVLIKPSLDKTPYELWHDKIPNIGYFKVFGCKCFILNNKEKLGKFDSKTDIGIFLGYSSTSKAYRVFNKRTLLIEESMHVVFDESCNIVSNESIYSDD--

Query:  LEKDFEDLLVSEKGKEIVPSMEEVSINEKKEDGSSSMPKEWRYAPSHPKDLILCDPEQGVKTRSSLNLFSNLAFVSQVEPKSFKDAECDEFWILGMQEEL
        LE     L + +K ++     EE   + KKED   ++P            L     E   KT   +   S+            K  +C            
Subjt:  LEKDFEDLLVSEKGKEIVPSMEEVSINEKKEDGSSSMPKEWRYAPSHPKDLILCDPEQGVKTRSSLNLFSNLAFVSQVEPKSFKDAECDEFWILGMQEEL

Query:  NQFERNNVLELVPKPSNASIIGTKWVFRNKMDEHGNIIRNKARLVAQGYCQEEGIDSEETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIIE
                 ELVP+PSN S+IGTKWVFRNKMDE+G I+RNKARLVAQGY QEEGID EETFAPVARLEAIRMLLAFA +K+FILYQMDVKSAFLNG+I E
Subjt:  NQFERNNVLELVPKPSNASIIGTKWVFRNKMDEHGNIIRNKARLVAQGYCQEEGIDSEETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIIE

Query:  QVYVEQPPGFENFELPSHVYKLKKALYGLKQAPRAWYDRL----------------------------------------------------SKIAKTPM
        +VYVEQPPGF++F  P+HV+KLKKALYGLKQAPRAWY+RL                                                    +K+ KTPM
Subjt:  QVYVEQPPGFENFELPSHVYKLKKALYGLKQAPRAWYDRL----------------------------------------------------SKIAKTPM

Query:  RTTTKLDKDEKGKCVDIKIYR--------------------------GYSDADFAGSLLDRKSTSGTCQFLGSS
         ++ KLD DEKGK +D  +YR                          G+SDADFAG  ++RKSTSGTC FLG S
Subjt:  RTTTKLDKDEKGKCVDIKIYR--------------------------GYSDADFAGSLLDRKSTSGTCQFLGSS

A5C8K0 Uncharacterized protein9.5e-23440.52Show/hide
Query:  MDANESITDMFTRFTNIINALKGLGKDYTTSENVRKILRSLPKTWEAKVTAIQEAKDLTKFSLEELIGLLMTHEITM-KEHLEDESKKEKSFALKTIALK
        M   E+I +M TRFT I+N L+ LG+    SE V KILRSLP  W  KVTAIQEAKDLTK  +EEL+G LMT+EI++ K+  E E KK+K+ ALK     
Subjt:  MDANESITDMFTRFTNIINALKGLGKDYTTSENVRKILRSLPKTWEAKVTAIQEAKDLTKFSLEELIGLLMTHEITM-KEHLEDESKKEKSFALKTIALK

Query:  TISFAVDSEDEDALDQ-DDIAYFSRKYKNFIK----RKKQFKKQLSTQR---ESKGEKSK---KDEVICYECKKPGHIRTDCPFLK-SSKKSKKKAMKAT
        T     D E+E   ++ DD+A  +RK   +++    R K+F  + +  R    S G+K K   K ++IC++CK PGHI+ DCP  K  +K+  KKAM AT
Subjt:  TISFAVDSEDEDALDQ-DDIAYFSRKYKNFIK----RKKQFKKQLSTQR---ESKGEKSK---KDEVICYECKKPGHIRTDCPFLK-SSKKSKKKAMKAT

Query:  WDDSSESESGSDVEEVARICLMALSDKDGVLDDEVNLEPPSYDELFETFESMQNDLEKLSSKYVVLKKKYNALSSENKSLLDKIACFKNNEHDVQIDEM-
        W +S ES      +EVA +C MA+ D      DEVN    S +++   FE +  D EKLS K   LKK+   L  E + + +K +          IDE+ 
Subjt:  WDDSSESESGSDVEEVARICLMALSDKDGVLDDEVNLEPPSYDELFETFESMQNDLEKLSSKYVVLKKKYNALSSENKSLLDKIACFKNNEHDVQIDEM-

Query:  KVSCDKHACVCDEKNALLDK-----VRFLEHDSCEKDNLIKLLKEKELNALQELDKAKESIKKLTIGAQKFDKIIEVGKPYGDKRGLGYIDECSTPSSKT
        K   +K   +   +N +L K        L   SC + +   +L  ++         +KE    L  G  +     E    +  KR  GY+        + 
Subjt:  KVSCDKHACVCDEKNALLDK-----VRFLEHDSCEKDNLIKLLKEKELNALQELDKAKESIKKLTIGAQKFDKIIEVGKPYGDKRGLGYIDECSTPSSKT

Query:  IFVKASSIVPNMPNVVSNDAKSSFV--PICHNCGVEGHIRPKCFKLKYAHNTSSRRNFSQRAKFYNAPR-ENFSKKTYSCYFSRSNAFNANAKMKWIPKF
        I         N+ N  S+  +S  +   + HN      +  K FK+ +  +    ++       +   R EN      S Y      F++     W+   
Subjt:  IFVKASSIVPNMPNVVSNDAKSSFV--PICHNCGVEGHIRPKCFKLKYAHNTSSRRNFSQRAKFYNAPR-ENFSKKTYSCYFSRSNAFNANAKMKWIPKF

Query:  VNANTLGPKQVCLKASKKNRWYLDNGCSRHMTRDRSKFISFSRKDGGMDKVCDACQMGKQTKSSFKSKNVISTTRPLQLLHMDLFVPSRIASFGGNYYAF
             LG   + L  S+ N+  L  G  +         I+F +     DK+C+ACQMGKQ K+SFK+KN IST+RPL+LLHMDLF PSR  S GG  YA+
Subjt:  VNANTLGPKQVCLKASKKNRWYLDNGCSRHMTRDRSKFISFSRKDGGMDKVCDACQMGKQTKSSFKSKNVISTTRPLQLLHMDLFVPSRIASFGGNYYAF

Query:  VIVDDFSRFTWVLMIKHKDDALKSFISLAKRVQNENGFFISTIRSDHGGEFDNDVFKAYCEKKNGFSHNFSSPRTPQQNGVVERKNRTLQEFARSMLHEY
        VIVDDFSR+TWVL +  K +A   F     +VQNE GF I+ IRSDHG EF+N  F+ YC  K G +HNF +PRT QQNGVVERKNRTLQE AR+ML+E 
Subjt:  VIVDDFSRFTWVLMIKHKDDALKSFISLAKRVQNENGFFISTIRSDHGGEFDNDVFKAYCEKKNGFSHNFSSPRTPQQNGVVERKNRTLQEFARSMLHEY

Query:  GLPNYFWAEAVNTACYVLNRVLIKPSLDKTPYELWHDKIPNIGYFKVFGCKCFILNNKEKLGKFDSKTDIGIFLGYSSTSKAYRVFNKRTLLIEESMH--
         LP YFWAEA+NT+CYVLNR+L++P L KTPYELW +K PNI YFKVFGCKCFILN K+ LGKFD+K+D+GIFLGYS++SKA+RVFNKRT+++EES+H  
Subjt:  GLPNYFWAEAVNTACYVLNRVLIKPSLDKTPYELWHDKIPNIGYFKVFGCKCFILNNKEKLGKFDSKTDIGIFLGYSSTSKAYRVFNKRTLLIEESMH--

Query:  VVFDESCNIVSNESIYSDDLEKDFEDLLVSEKGKEIVPSMEEVSINEKKEDGSSSMPKEWRYAPSHPKDLILCDPEQGVKTRSSL-NLFSNLAFVSQVEP
         +  E+C + + +                ++K  E +P        +K       +  + ++  +HP+D I+ +P  GV+TRSSL N+ +NLAF+SQ+EP
Subjt:  VVFDESCNIVSNESIYSDDLEKDFEDLLVSEKGKEIVPSMEEVSINEKKEDGSSSMPKEWRYAPSHPKDLILCDPEQGVKTRSSL-NLFSNLAFVSQVEP

Query:  KSFKDAECDEFWILGMQEELNQFERNNVLELVPKPSNASIIGTKWVFRNKMDEHGNIIRNKARLVAQGYCQEEGIDSEETFAPVARLEAIRMLLAFASYK
        K+ KDA  DE W++ MQ+ELNQFER+ V ELVP+PSN S+IGTKWVFRNKMDE+G I+RNKARLVAQGY QEEGID EETFAPVARLEAIRMLLAFA +K
Subjt:  KSFKDAECDEFWILGMQEELNQFERNNVLELVPKPSNASIIGTKWVFRNKMDEHGNIIRNKARLVAQGYCQEEGIDSEETFAPVARLEAIRMLLAFASYK

Query:  NFILYQMDVKSAFLNGYIIEQVYVEQPPGFENFELPSHVYKLKKALYGLKQAPRAWYDRLS---------------------------------------
        +FILYQMDVKSAFLNG+I E++YVEQPPGF++F  P+HV+KLKKALYGLKQAPRAWY+RLS                                       
Subjt:  NFILYQMDVKSAFLNGYIIEQVYVEQPPGFENFELPSHVYKLKKALYGLKQAPRAWYDRLS---------------------------------------

Query:  -------------------------------KIAKTPMRTTTKLDKDEKGKCVDIKIYR-----------------------------------------
                                       K+ KTPM ++ KLD DEKGK +D  +YR                                         
Subjt:  -------------------------------KIAKTPMRTTTKLDKDEKGKCVDIKIYR-----------------------------------------

Query:  -----------------------GYSDADFAGSLLDRKSTSGTCQFLGSSYSVKATGRIAQIPVSKAFLGRVINALA-------KPIDGQGEISSSDSRL
                               G+SDADFAG  ++RKSTSGTC  LG S     + +   I +S A       +L        K       +S     +
Subjt:  -----------------------GYSDADFAGSLLDRKSTSGTCQFLGSSYSVKATGRIAQIPVSKAFLGRVINALA-------KPIDGQGEISSSDSRL

Query:  IESPALGIILRRSVYEPLQTGLIAIDSIIPIRRGQR-ELIIGERQTGKTLADIFTKLLNEESFCKNRLELGI
               I + ++  +  +T  I I         Q+ ++ +    T   LADIFTK L+EE F   R +LGI
Subjt:  IESPALGIILRRSVYEPLQTGLIAIDSIIPIRRGQR-ELIIGERQTGKTLADIFTKLLNEESFCKNRLELGI

SwissProt top hitse value%identityAlignment
P04146 Copia protein3.4e-6329.84Show/hide
Query:  KVCDACQMGKQTKSSFKS-KNVISTTRPLQLLHMDLFVPSRIASFGGNYYAFVIVDDFSRFTWVLMIKHKDDALKSFISLAKRVQNENGFFISTIRSDHG
        ++C+ C  GKQ +  FK  K+     RPL ++H D+  P    +     Y  + VD F+ +    +IK+K D    F     + +      +  +  D+G
Subjt:  KVCDACQMGKQTKSSFKS-KNVISTTRPLQLLHMDLFVPSRIASFGGNYYAFVIVDDFSRFTWVLMIKHKDDALKSFISLAKRVQNENGFFISTIRSDHG

Query:  GEFDNDVFKAYCEKKNGFSHNFSSPRTPQQNGVVERKNRTLQEFARSMLHEYGLPNYFWAEAVNTACYVLNRVLIKPSLD--KTPYELWHDKIPNIGYFK
         E+ ++  + +C KK G S++ + P TPQ NGV ER  RT+ E AR+M+    L   FW EAV TA Y++NR+  +  +D  KTPYE+WH+K P + + +
Subjt:  GEFDNDVFKAYCEKKNGFSHNFSSPRTPQQNGVVERKNRTLQEFARSMLHEYGLPNYFWAEAVNTACYVLNRVLIKPSLD--KTPYELWHDKIPNIGYFK

Query:  VFGCKCFILNNKEKLGKFDSKTDIGIFLGYSSTS-KAYRVFNK-----RTLLIEES---------MHVVF-------------DESCNIVS----NESIY
        VFG   ++ + K K GKFD K+   IF+GY     K +   N+     R ++++E+            VF             ++S  I+     NES  
Subjt:  VFGCKCFILNNKEKLGKFDSKTDIGIFLGYSSTS-KAYRVFNK-----RTLLIEES---------MHVVF-------------DESCNIVS----NESIY

Query:  SDDLE-----KDFEDLLVSEKGKEIVPS--------------------MEEVSINEKK---------EDGSSSMPKEWRYAPS--HPKDLILCDP-----
         D+++     K+ E+       ++I+ +                      +  +NE K         E   S  P E R + +  H K++ + +P     
Subjt:  SDDLE-----KDFEDLLVSEKGKEIVPS--------------------MEEVSINEKK---------EDGSSSMPKEWRYAPS--HPKDLILCDP-----

Query:  -------EQGVKTR---------SSLN-LFSNLAFVSQVEPKSFKDAECDE---FWILGMQEELNQFERNNVLELVPKPSNASIIGTKWVFRNKMDEHGN
                + +KT+         +SLN +  N   +    P SF + +  +    W   +  ELN  + NN   +  +P N +I+ ++WVF  K +E GN
Subjt:  -------EQGVKTR---------SSLN-LFSNLAFVSQVEPKSFKDAECDE---FWILGMQEELNQFERNNVLELVPKPSNASIIGTKWVFRNKMDEHGN

Query:  IIRNKARLVAQGYCQEEGIDSEETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIIEQVYVEQPPGFENFELPSHVYKLKKALYGLKQAPRAW
         IR KARLVA+G+ Q+  ID EETFAPVAR+ + R +L+     N  ++QMDVK+AFLNG + E++Y+  P G        +V KL KA+YGLKQA R W
Subjt:  IIRNKARLVAQGYCQEEGIDSEETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIIEQVYVEQPPGFENFELPSHVYKLKKALYGLKQAPRAW

Query:  YDRLSKIAKTPMRTTTKLDK
        ++   +  K      + +D+
Subjt:  YDRLSKIAKTPMRTTTKLDK

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.0e-7533.75Show/hide
Query:  RSKFISFSRKDGGMDKVCDACQMGKQTKSSFKSKNVISTTRPLQLLHMDLFVPSRIASFGGNYYAFVIVDDFSRFTWVLMIKHKDDALKSFISLAKRVQN
        +   IS+++  G   K CD C  GKQ + SF++ +       L L++ D+  P  I S GGN Y    +DD SR  WV ++K KD   + F      V+ 
Subjt:  RSKFISFSRKDGGMDKVCDACQMGKQTKSSFKSKNVISTTRPLQLLHMDLFVPSRIASFGGNYYAFVIVDDFSRFTWVLMIKHKDDALKSFISLAKRVQN

Query:  ENGFFISTIRSDHGGEFDNDVFKAYCEKKNGFSHNFSSPRTPQQNGVVERKNRTLQEFARSMLHEYGLPNYFWAEAVNTACYVLNRVLIKPSLDKTPYEL
        E G  +  +RSD+GGE+ +  F+ YC   +G  H  + P TPQ NGV ER NRT+ E  RSML    LP  FW EAV TACY++NR    P   + P  +
Subjt:  ENGFFISTIRSDHGGEFDNDVFKAYCEKKNGFSHNFSSPRTPQQNGVVERKNRTLQEFARSMLHEYGLPNYFWAEAVNTACYVLNRVLIKPSLDKTPYEL

Query:  WHDKIPNIGYFKVFGCKCFILNNKEKLGKFDSKTDIGIFLGYSSTSKAYRVFNKRTLLIEESMHVVFDES-----------------CNIVSNESIYSDD
        W +K  +  + KVFGC+ F    KE+  K D K+   IF+GY      YR+++     +  S  VVF ES                  N V+  S  ++ 
Subjt:  WHDKIPNIGYFKVFGCKCFILNNKEKLGKFDSKTDIGIFLGYSSTSKAYRVFNKRTLLIEESMHVVFDES-----------------CNIVSNESIYSDD

Query:  LEKDFEDLLVSEKGKEIVPSMEE-VSINEKKEDGSSSMPKEWRYAPSHPKDLILCDPEQGVKTRSSLNLFSNLAFVSQVEPKSFKDA----ECDEFWILG
           +     VSE+G++    +E+   ++E  E+       E ++ P    +    +  +   T   L            EP+S K+     E ++  +  
Subjt:  LEKDFEDLLVSEKGKEIVPSMEE-VSINEKKEDGSSSMPKEWRYAPSHPKDLILCDPEQGVKTRSSLNLFSNLAFVSQVEPKSFKDA----ECDEFWILG

Query:  MQEELNQFERNNVLELVPKPSNASIIGTKWVFRNKMDEHGNIIRNKARLVAQGYCQEEGIDSEETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLN
        MQEE+   ++N   +LV  P     +  KWVF+ K D    ++R KARLV +G+ Q++GID +E F+PV ++ +IR +L+ A+  +  + Q+DVK+AFL+
Subjt:  MQEELNQFERNNVLELVPKPSNASIIGTKWVFRNKMDEHGNIIRNKARLVAQGYCQEEGIDSEETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLN

Query:  GYIIEQVYVEQPPGFENFELPSHVYKLKKALYGLKQAPRAWYDRLSKIAKTPMRTTTKLD
        G + E++Y+EQP GFE       V KL K+LYGLKQAPR WY +     K+     T  D
Subjt:  GYIIEQVYVEQPPGFENFELPSHVYKLKKALYGLKQAPRAWYDRLSKIAKTPMRTTTKLD

Q2QDA3 ATP synthase subunit alpha, chloroplastic5.0e-3884.31Show/hide
Query:  SVKATGRIAQIPVSKAFLGRVINALAKPIDGQGEISSSDSRLIESPALGIILRRSVYEPLQTGLIAIDSIIPIRRGQRELIIGERQTGKTLADIFTKLLN
        SVKATGRIAQIPVS+A+LGRVINALAKPIDG+GEISSSDSRLIESPA GIILRRSVYEPLQTGLIAIDS+IPI RGQRELIIG+RQTGKT     T +LN
Subjt:  SVKATGRIAQIPVSKAFLGRVINALAKPIDGQGEISSSDSRLIESPALGIILRRSVYEPLQTGLIAIDSIIPIRRGQRELIIGERQTGKTLADIFTKLLN

Query:  EE
        ++
Subjt:  EE

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.2e-6228.09Show/hide
Query:  CDACQMGKQTKSSFKSKNVISTTRPLQLLHMDLFVPSRIASFGGNYYAFVIVDDFSRFTWVLMIKHKDDALKSFISLAKRVQNENGFFISTIRSDHGGEF
        C  C + K  K  F S++ I++TRPL+ ++ D++  S I S     Y  + VD F+R+TW+  +K K    ++FI+    ++N     I T  SD+GGEF
Subjt:  CDACQMGKQTKSSFKSKNVISTTRPLQLLHMDLFVPSRIASFGGNYYAFVIVDDFSRFTWVLMIKHKDDALKSFISLAKRVQNENGFFISTIRSDHGGEF

Query:  DNDVFKAYCEKKNGFSHNFSSPRTPQQNGVVERKNRTLQEFARSMLHEYGLPNYFWAEAVNTACYVLNRVLIKPSLDKTPYELWHDKIPNIGYFKVFGCK
           V       ++G SH  S P TP+ NG+ ERK+R + E   ++L    +P  +W  A   A Y++NR+       ++P++      PN    +VFGC 
Subjt:  DNDVFKAYCEKKNGFSHNFSSPRTPQQNGVVERKNRTLQEFARSMLHEYGLPNYFWAEAVNTACYVLNRVLIKPSLDKTPYELWHDKIPNIGYFKVFGCK

Query:  CFILNNKEKLGKFDSKTDIGIFLGYSSTSKAYRVFNKRTLLIEESMHVVFDESCNIVS------------------------------------------
        C+         K D K+   +FLGYS T  AY   + +T  +  S HV FDE+C   S                                          
Subjt:  CFILNNKEKLGKFDSKTDIGIFLGYSSTSKAYRVFNKRTLLIEESMHVVFDESCNIVS------------------------------------------

Query:  --------------NESIYSDDLEKDFEDLLVS--------------------------------------EKGKEIVPSMEEVSINEKKEDGSSSMPKE
                      N  + S +L+  F     S                                      E   ++  S+   + +       ++    
Subjt:  --------------NESIYSDDLEKDFEDLLVS--------------------------------------EKGKEIVPSMEEVSINEKKEDGSSSMPKE

Query:  WRYAPSHPKDLILCDP---------------EQGVKTRSSLNLFS-------NLAFVSQVEPKSFKDAECDEFWILGMQEELNQFERNNVLELV-PKPSN
           +P+ P  LI   P                  + TR+   +          ++  ++ EP++   A  DE W   M  E+N    N+  +LV P PS+
Subjt:  WRYAPSHPKDLILCDP---------------EQGVKTRSSLNLFS-------NLAFVSQVEPKSFKDAECDEFWILGMQEELNQFERNNVLELV-PKPSN

Query:  ASIIGTKWVFRNKMDEHGNIIRNKARLVAQGYCQEEGIDSEETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIIEQVYVEQPPGFENFELPS
         +I+G +W+F  K +  G++ R KARLVA+GY Q  G+D  ETF+PV +  +IR++L  A  +++ + Q+DV +AFL G + + VY+ QPPGF + + P+
Subjt:  ASIIGTKWVFRNKMDEHGNIIRNKARLVAQGYCQEEGIDSEETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIIEQVYVEQPPGFENFELPS

Query:  HVYKLKKALYGLKQAPRAWYDRL
        +V KL+KALYGLKQAPRAWY  L
Subjt:  HVYKLKKALYGLKQAPRAWYDRL

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE26.6e-5927.11Show/hide
Query:  CDACQMGKQTKSSFKSKNVISTTRPLQLLHMDLFVPSRIASFGGNYYAFVIVDDFSRFTWVLMIKHKDDALKSFISLAKRVQNENGFFISTIRSDHGGEF
        C  C + K  K  F S + I++++PL+ ++ D++  S I S     Y  + VD F+R+TW+  +K K     +FI     V+N     I T+ SD+GGEF
Subjt:  CDACQMGKQTKSSFKSKNVISTTRPLQLLHMDLFVPSRIASFGGNYYAFVIVDDFSRFTWVLMIKHKDDALKSFISLAKRVQNENGFFISTIRSDHGGEF

Query:  DNDVFKAYCEKKNGFSHNFSSPRTPQQNGVVERKNRTLQEFARSMLHEYGLPNYFWAEAVNTACYVLNRVLIKPSLDKTPYELWHDKIPNIGYFKVFGCK
           V + Y   ++G SH  S P TP+ NG+ ERK+R + E   ++L    +P  +W  A + A Y++NR+       ++P++    + PN    KVFGC 
Subjt:  DNDVFKAYCEKKNGFSHNFSSPRTPQQNGVVERKNRTLQEFARSMLHEYGLPNYFWAEAVNTACYVLNRVLIKPSLDKTPYELWHDKIPNIGYFKVFGCK

Query:  CFILNNKEKLGKFDSKTDIGIFLGYSSTSKAYRVFNKRTLLIEESMHVVFDESCNIVSNESIYSDDLEKDFEDLLVSEKGKEIVPS--------------
        C+         K + K+    F+GYS T  AY   +  T  +  S HV FDE C   S  +      ++   D   +      +P+              
Subjt:  CFILNNKEKLGKFDSKTDIGIFLGYSSTSKAYRVFNKRTLLIEESMHVVFDESCNIVSNESIYSDDLEKDFEDLLVSEKGKEIVPS--------------

Query:  ---------------MEEVSINEKKEDGSSSMPKEWRYAPSHPKDLILCDPEQGVKTRSSLNLFSN----------------------------------
                         +VS +       SS       APSH        P Q   + S+  + +N                                  
Subjt:  ---------------MEEVSINEKKEDGSSSMPKEWRYAPSHPKDLILCDPEQGVKTRSSLNLFSN----------------------------------

Query:  ---------------------------------------------------------LAFVSQVEPKSFKDAECDEFWILGMQEELNQFERNNVLELV-P
                                                                  +  +  EP++   A  D+ W   M  E+N    N+  +LV P
Subjt:  ---------------------------------------------------------LAFVSQVEPKSFKDAECDEFWILGMQEELNQFERNNVLELV-P

Query:  KPSNASIIGTKWVFRNKMDEHGNIIRNKARLVAQGYCQEEGIDSEETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIIEQVYVEQPPGFENF
         P + +I+G +W+F  K +  G++ R KARLVA+GY Q  G+D  ETF+PV +  +IR++L  A  +++ + Q+DV +AFL G + ++VY+ QPPGF + 
Subjt:  KPSNASIIGTKWVFRNKMDEHGNIIRNKARLVAQGYCQEEGIDSEETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIIEQVYVEQPPGFENF

Query:  ELPSHVYKLKKALYGLKQAPRAWYDRL
        + P +V +L+KA+YGLKQAPRAWY  L
Subjt:  ELPSHVYKLKKALYGLKQAPRAWYDRL

Arabidopsis top hitse value%identityAlignment
AT2G07698.1 ATPase, F1 complex, alpha subunit protein3.8e-2549.57Show/hide
Query:  VKATGRIAQIPVSKAFLGRVINALAKPIDGQGEISSSDSRLIESPALGIILRRSVYEPLQTGLIAIDSIIPIRRGQRELIIGERQTGKTLADIFTKLLNE
        VK TG I  +P  KA LGRV++A+  PIDG+G +S  + R +E  A GI+ R+SV+EP+QTGL A+DS++PI RGQREL+IG+RQTGKT   I T L  +
Subjt:  VKATGRIAQIPVSKAFLGRVINALAKPIDGQGEISSSDSRLIESPALGIILRRSVYEPLQTGLIAIDSIIPIRRGQRELIIGERQTGKTLADIFTKLLNE

Query:  ESFCKNRLELGIIHC
        +   +   E   ++C
Subjt:  ESFCKNRLELGIIHC

AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.4e-3241.32Show/hide
Query:  EPKSFKDAECDEFWILGMQEELNQFERNNVLELVPKPSNASIIGTKWVFRNKMDEHGNIIRNKARLVAQGYCQEEGIDSEETFAPVARLEAIRMLLAFAS
        EP ++ +A+    W   M +E+   E  +  E+   P N   IG KWV++ K +  G I R KARLVA+GY Q+EGID  ETF+PV +L +++++LA ++
Subjt:  EPKSFKDAECDEFWILGMQEELNQFERNNVLELVPKPSNASIIGTKWVFRNKMDEHGNIIRNKARLVAQGYCQEEGIDSEETFAPVARLEAIRMLLAFAS

Query:  YKNFILYQMDVKSAFLNGYIIEQVYVEQPPGFENFE----LPSHVYKLKKALYGLKQAPRAWYDRLS
          NF L+Q+D+ +AFLNG + E++Y++ PPG+   +     P+ V  LKK++YGLKQA R W+ + S
Subjt:  YKNFILYQMDVKSAFLNGYIIEQVYVEQPPGFENFE----LPSHVYKLKKALYGLKQAPRAWYDRLS

ATCG00120.1 ATP synthase subunit alpha2.8e-3678.43Show/hide
Query:  SVKATGRIAQIPVSKAFLGRVINALAKPIDGQGEISSSDSRLIESPALGIILRRSVYEPLQTGLIAIDSIIPIRRGQRELIIGERQTGKTLADIFTKLLN
        SVKATG+IAQIPVS+A+LGRVINALA PIDG+G+IS+S+SRLIESPA GII RRSVYEPLQTGLIAIDS+IPI RGQRELIIG+RQTGKT     T +LN
Subjt:  SVKATGRIAQIPVSKAFLGRVINALAKPIDGQGEISSSDSRLIESPALGIILRRSVYEPLQTGLIAIDSIIPIRRGQRELIIGERQTGKTLADIFTKLLN

Query:  EE
        ++
Subjt:  EE

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)5.8e-1849.49Show/hide
Query:  EPKSFKDAECDEFWILGMQEELNQFERNNVLELVPKPSNASIIGTKWVFRNKMDEHGNIIRNKARLVAQGYCQEEGIDSEETFAPVARLEAIRMLLAFA
        EPKS   A  D  W   MQEEL+   RN    LVP P N +I+G KWVF+ K+   G + R KARLVA+G+ QEEGI   ET++PV R   IR +L  A
Subjt:  EPKSFKDAECDEFWILGMQEELNQFERNNVLELVPKPSNASIIGTKWVFRNKMDEHGNIIRNKARLVAQGYCQEEGIDSEETFAPVARLEAIRMLLAFA

ATMG01190.1 ATP synthase subunit 11.1e-2449.57Show/hide
Query:  VKATGRIAQIPVSKAFLGRVINALAKPIDGQGEISSSDSRLIESPALGIILRRSVYEPLQTGLIAIDSIIPIRRGQRELIIGERQTGKTLADIFTKLLNE
        VK TG I  +P  KA LGRV++A+  PIDG+G +S  + R +E  A GI+ R+SV+EP+QTGL A+DS++PI RGQREL+IG RQTGKT   I T L  +
Subjt:  VKATGRIAQIPVSKAFLGRVINALAKPIDGQGEISSSDSRLIESPALGIILRRSVYEPLQTGLIAIDSIIPIRRGQRELIIGERQTGKTLADIFTKLLNE

Query:  ESFCKNRLELGIIHC
        +   +   E   ++C
Subjt:  ESFCKNRLELGIIHC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATGCTAATGAGTCTATAACGGATATGTTTACTAGATTTACTAACATCATAAATGCTTTGAAGGGCCTTGGTAAAGACTATACAACTTCTGAAAATGTTAGAAAAAT
TCTGAGGTCTCTACCTAAAACTTGGGAAGCAAAGGTGACGGCGATCCAAGAAGCAAAGGATCTCACTAAATTTTCGTTGGAGGAACTCATTGGTTTGCTCATGACCCATG
AGATCACTATGAAGGAGCACTTGGAGGATGAGTCCAAAAAGGAGAAGAGTTTTGCATTAAAGACTATTGCATTAAAGACTATCTCCTTCGCAGTTGACTCCGAAGATGAG
GATGCCCTTGATCAAGATGACATTGCCTATTTTTCACGTAAGTACAAAAATTTCATTAAGAGGAAGAAGCAATTCAAGAAACAGCTATCAACCCAAAGAGAGTCAAAAGG
TGAGAAGAGCAAAAAGGATGAGGTGATTTGTTATGAATGCAAAAAGCCGGGTCATATAAGAACGGATTGTCCTTTCCTCAAATCATCTAAGAAATCCAAGAAGAAGGCAA
TGAAGGCTACTTGGGATGATAGTAGTGAAAGTGAAAGTGGAAGTGATGTTGAAGAAGTGGCACGCATTTGCCTCATGGCTCTTAGTGACAAAGATGGTGTACTTGATGAT
GAGGTAAATCTAGAACCCCCTTCTTACGATGAATTGTTTGAAACTTTTGAGAGTATGCAAAATGATTTAGAAAAACTTAGCTCTAAATATGTTGTGCTTAAAAAAAAATA
CAATGCATTGTCTAGTGAAAATAAGTCTTTACTTGACAAAATTGCTTGTTTTAAAAATAATGAGCATGATGTGCAAATTGATGAAATGAAGGTCTCTTGTGATAAGCATG
CTTGTGTTTGTGATGAGAAAAATGCTTTGCTTGATAAAGTTAGATTTCTTGAGCATGATAGTTGTGAGAAAGATAATTTGATTAAATTGCTTAAGGAAAAAGAATTAAAT
GCTTTACAAGAACTTGATAAAGCTAAAGAGTCCATTAAAAAGTTGACAATAGGTGCTCAAAAGTTCGACAAAATAATTGAAGTAGGCAAGCCTTATGGTGATAAGAGAGG
TTTAGGCTATATTGATGAATGCTCTACTCCAAGTTCTAAAACTATATTTGTTAAAGCATCTTCTATTGTGCCTAATATGCCTAATGTTGTGTCTAATGATGCTAAATCTA
GTTTTGTGCCTATATGTCATAATTGTGGTGTTGAAGGTCATATTAGACCAAAGTGCTTTAAATTGAAGTATGCTCATAATACTTCTTCAAGAAGAAATTTTTCTCAAAGA
GCAAAGTTCTACAATGCTCCAAGAGAGAATTTCTCCAAGAAAACTTATTCTTGTTACTTTTCTAGATCCAATGCTTTTAATGCAAATGCAAAAATGAAATGGATCCCAAA
GTTTGTAAATGCTAACACTCTTGGACCCAAACAAGTTTGTTTGAAAGCCTCCAAGAAAAATAGATGGTACTTGGATAATGGTTGCTCAAGGCACATGACCAGAGACCGAT
CCAAGTTTATTTCTTTCTCTAGAAAGGACGGAGGCATGGATAAAGTTTGTGATGCTTGTCAAATGGGTAAGCAAACTAAGTCCTCTTTCAAATCTAAAAATGTGATCTCT
ACTACTAGACCCCTACAACTATTACACATGGACTTATTTGTTCCTTCTAGAATTGCTAGCTTTGGAGGAAATTATTATGCATTCGTGATAGTTGATGATTTTTCAAGATT
TACTTGGGTTTTGATGATAAAACATAAGGACGATGCTTTGAAAAGTTTTATTAGTCTTGCAAAAAGAGTTCAAAATGAAAACGGATTTTTTATTTCTACAATTAGAAGTG
ATCATGGAGGAGAATTTGATAATGATGTTTTTAAAGCTTATTGTGAAAAAAAAAATGGTTTTTCTCATAATTTTTCCTCTCCAAGAACTCCTCAACAAAATGGTGTGGTT
GAAAGGAAAAATCGTACTTTGCAAGAATTTGCTAGGTCAATGTTGCATGAGTATGGTTTACCTAATTATTTTTGGGCGGAAGCCGTTAACACCGCTTGCTATGTTTTAAA
TAGAGTTTTAATTAAACCATCTTTAGATAAAACTCCTTATGAGCTTTGGCATGACAAAATTCCAAATATTGGGTATTTCAAAGTTTTTGGTTGTAAATGTTTTATTTTGA
ACAATAAAGAAAAACTTGGAAAATTTGATTCTAAGACGGATATTGGTATTTTCCTTGGATATTCCTCTACTAGTAAAGCTTATAGAGTTTTCAATAAGAGAACTTTACTT
ATTGAGGAATCTATGCATGTTGTGTTTGATGAATCTTGTAATATTGTTTCTAATGAGTCTATTTATAGTGATGATTTAGAAAAAGATTTTGAAGATTTACTTGTAAGTGA
AAAAGGCAAAGAAATTGTTCCAAGTATGGAAGAAGTGAGCATCAATGAAAAGAAGGAAGATGGTTCTTCATCTATGCCCAAAGAATGGAGATATGCTCCATCCCATCCCA
AGGATTTGATTCTTTGTGATCCCGAACAAGGTGTGAAAACTCGTTCTTCTCTCAATTTATTTAGTAATCTTGCTTTTGTTTCTCAAGTTGAACCTAAAAGTTTTAAGGAT
GCCGAATGTGATGAGTTTTGGATTTTAGGTATGCAAGAAGAATTAAATCAATTTGAAAGGAACAATGTTTTGGAATTAGTCCCTAAGCCTTCTAATGCTTCTATAATTGG
GACTAAATGGGTTTTTAGAAACAAGATGGATGAACATGGAAATATCATTAGAAATAAAGCTAGACTTGTAGCTCAAGGTTATTGTCAAGAAGAAGGTATAGATTCTGAAG
AAACTTTTGCACCGGTTGCTAGATTAGAAGCTATTAGAATGTTGCTTGCTTTTGCCTCTTATAAGAATTTCATTTTGTATCAAATGGATGTGAAAAGTGCATTTTTAAAT
GGTTATATTATCGAGCAAGTTTATGTAGAACAACCTCCGGGCTTTGAAAATTTTGAATTGCCTAGTCATGTTTATAAGTTGAAAAAGGCTCTTTATGGCTTAAAACAAGC
TCCAAGAGCTTGGTATGATAGGCTTAGTAAAATTGCAAAAACTCCTATGAGAACTACCACTAAGCTTGACAAAGATGAAAAAGGTAAATGTGTGGACATAAAGATTTATC
GAGGTTATTCCGATGCGGATTTTGCCGGTAGCTTACTTGACCGTAAGAGTACTAGTGGGACTTGCCAATTTCTTGGTAGTTCTTATTCTGTAAAAGCAACGGGAAGAATT
GCTCAGATACCAGTAAGCAAGGCTTTTTTAGGTCGTGTTATAAATGCCCTGGCTAAACCTATTGATGGCCAAGGTGAAATTTCATCTTCTGACTCTCGGTTAATTGAATC
CCCCGCTCTTGGTATTATTTTGAGACGTTCCGTATACGAACCTCTTCAAACAGGCCTTATTGCTATTGATTCGATAATTCCTATCAGACGTGGTCAGCGAGAATTAATTA
TTGGGGAGAGGCAGACCGGTAAAACATTAGCGGATATTTTTACCAAGCTATTAAATGAAGAAAGTTTTTGCAAAAATAGGCTTGAACTTGGTATCATTCATTGTGATGCA
TCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGATGCTAATGAGTCTATAACGGATATGTTTACTAGATTTACTAACATCATAAATGCTTTGAAGGGCCTTGGTAAAGACTATACAACTTCTGAAAATGTTAGAAAAAT
TCTGAGGTCTCTACCTAAAACTTGGGAAGCAAAGGTGACGGCGATCCAAGAAGCAAAGGATCTCACTAAATTTTCGTTGGAGGAACTCATTGGTTTGCTCATGACCCATG
AGATCACTATGAAGGAGCACTTGGAGGATGAGTCCAAAAAGGAGAAGAGTTTTGCATTAAAGACTATTGCATTAAAGACTATCTCCTTCGCAGTTGACTCCGAAGATGAG
GATGCCCTTGATCAAGATGACATTGCCTATTTTTCACGTAAGTACAAAAATTTCATTAAGAGGAAGAAGCAATTCAAGAAACAGCTATCAACCCAAAGAGAGTCAAAAGG
TGAGAAGAGCAAAAAGGATGAGGTGATTTGTTATGAATGCAAAAAGCCGGGTCATATAAGAACGGATTGTCCTTTCCTCAAATCATCTAAGAAATCCAAGAAGAAGGCAA
TGAAGGCTACTTGGGATGATAGTAGTGAAAGTGAAAGTGGAAGTGATGTTGAAGAAGTGGCACGCATTTGCCTCATGGCTCTTAGTGACAAAGATGGTGTACTTGATGAT
GAGGTAAATCTAGAACCCCCTTCTTACGATGAATTGTTTGAAACTTTTGAGAGTATGCAAAATGATTTAGAAAAACTTAGCTCTAAATATGTTGTGCTTAAAAAAAAATA
CAATGCATTGTCTAGTGAAAATAAGTCTTTACTTGACAAAATTGCTTGTTTTAAAAATAATGAGCATGATGTGCAAATTGATGAAATGAAGGTCTCTTGTGATAAGCATG
CTTGTGTTTGTGATGAGAAAAATGCTTTGCTTGATAAAGTTAGATTTCTTGAGCATGATAGTTGTGAGAAAGATAATTTGATTAAATTGCTTAAGGAAAAAGAATTAAAT
GCTTTACAAGAACTTGATAAAGCTAAAGAGTCCATTAAAAAGTTGACAATAGGTGCTCAAAAGTTCGACAAAATAATTGAAGTAGGCAAGCCTTATGGTGATAAGAGAGG
TTTAGGCTATATTGATGAATGCTCTACTCCAAGTTCTAAAACTATATTTGTTAAAGCATCTTCTATTGTGCCTAATATGCCTAATGTTGTGTCTAATGATGCTAAATCTA
GTTTTGTGCCTATATGTCATAATTGTGGTGTTGAAGGTCATATTAGACCAAAGTGCTTTAAATTGAAGTATGCTCATAATACTTCTTCAAGAAGAAATTTTTCTCAAAGA
GCAAAGTTCTACAATGCTCCAAGAGAGAATTTCTCCAAGAAAACTTATTCTTGTTACTTTTCTAGATCCAATGCTTTTAATGCAAATGCAAAAATGAAATGGATCCCAAA
GTTTGTAAATGCTAACACTCTTGGACCCAAACAAGTTTGTTTGAAAGCCTCCAAGAAAAATAGATGGTACTTGGATAATGGTTGCTCAAGGCACATGACCAGAGACCGAT
CCAAGTTTATTTCTTTCTCTAGAAAGGACGGAGGCATGGATAAAGTTTGTGATGCTTGTCAAATGGGTAAGCAAACTAAGTCCTCTTTCAAATCTAAAAATGTGATCTCT
ACTACTAGACCCCTACAACTATTACACATGGACTTATTTGTTCCTTCTAGAATTGCTAGCTTTGGAGGAAATTATTATGCATTCGTGATAGTTGATGATTTTTCAAGATT
TACTTGGGTTTTGATGATAAAACATAAGGACGATGCTTTGAAAAGTTTTATTAGTCTTGCAAAAAGAGTTCAAAATGAAAACGGATTTTTTATTTCTACAATTAGAAGTG
ATCATGGAGGAGAATTTGATAATGATGTTTTTAAAGCTTATTGTGAAAAAAAAAATGGTTTTTCTCATAATTTTTCCTCTCCAAGAACTCCTCAACAAAATGGTGTGGTT
GAAAGGAAAAATCGTACTTTGCAAGAATTTGCTAGGTCAATGTTGCATGAGTATGGTTTACCTAATTATTTTTGGGCGGAAGCCGTTAACACCGCTTGCTATGTTTTAAA
TAGAGTTTTAATTAAACCATCTTTAGATAAAACTCCTTATGAGCTTTGGCATGACAAAATTCCAAATATTGGGTATTTCAAAGTTTTTGGTTGTAAATGTTTTATTTTGA
ACAATAAAGAAAAACTTGGAAAATTTGATTCTAAGACGGATATTGGTATTTTCCTTGGATATTCCTCTACTAGTAAAGCTTATAGAGTTTTCAATAAGAGAACTTTACTT
ATTGAGGAATCTATGCATGTTGTGTTTGATGAATCTTGTAATATTGTTTCTAATGAGTCTATTTATAGTGATGATTTAGAAAAAGATTTTGAAGATTTACTTGTAAGTGA
AAAAGGCAAAGAAATTGTTCCAAGTATGGAAGAAGTGAGCATCAATGAAAAGAAGGAAGATGGTTCTTCATCTATGCCCAAAGAATGGAGATATGCTCCATCCCATCCCA
AGGATTTGATTCTTTGTGATCCCGAACAAGGTGTGAAAACTCGTTCTTCTCTCAATTTATTTAGTAATCTTGCTTTTGTTTCTCAAGTTGAACCTAAAAGTTTTAAGGAT
GCCGAATGTGATGAGTTTTGGATTTTAGGTATGCAAGAAGAATTAAATCAATTTGAAAGGAACAATGTTTTGGAATTAGTCCCTAAGCCTTCTAATGCTTCTATAATTGG
GACTAAATGGGTTTTTAGAAACAAGATGGATGAACATGGAAATATCATTAGAAATAAAGCTAGACTTGTAGCTCAAGGTTATTGTCAAGAAGAAGGTATAGATTCTGAAG
AAACTTTTGCACCGGTTGCTAGATTAGAAGCTATTAGAATGTTGCTTGCTTTTGCCTCTTATAAGAATTTCATTTTGTATCAAATGGATGTGAAAAGTGCATTTTTAAAT
GGTTATATTATCGAGCAAGTTTATGTAGAACAACCTCCGGGCTTTGAAAATTTTGAATTGCCTAGTCATGTTTATAAGTTGAAAAAGGCTCTTTATGGCTTAAAACAAGC
TCCAAGAGCTTGGTATGATAGGCTTAGTAAAATTGCAAAAACTCCTATGAGAACTACCACTAAGCTTGACAAAGATGAAAAAGGTAAATGTGTGGACATAAAGATTTATC
GAGGTTATTCCGATGCGGATTTTGCCGGTAGCTTACTTGACCGTAAGAGTACTAGTGGGACTTGCCAATTTCTTGGTAGTTCTTATTCTGTAAAAGCAACGGGAAGAATT
GCTCAGATACCAGTAAGCAAGGCTTTTTTAGGTCGTGTTATAAATGCCCTGGCTAAACCTATTGATGGCCAAGGTGAAATTTCATCTTCTGACTCTCGGTTAATTGAATC
CCCCGCTCTTGGTATTATTTTGAGACGTTCCGTATACGAACCTCTTCAAACAGGCCTTATTGCTATTGATTCGATAATTCCTATCAGACGTGGTCAGCGAGAATTAATTA
TTGGGGAGAGGCAGACCGGTAAAACATTAGCGGATATTTTTACCAAGCTATTAAATGAAGAAAGTTTTTGCAAAAATAGGCTTGAACTTGGTATCATTCATTGTGATGCA
TCTTGA
Protein sequenceShow/hide protein sequence
MDANESITDMFTRFTNIINALKGLGKDYTTSENVRKILRSLPKTWEAKVTAIQEAKDLTKFSLEELIGLLMTHEITMKEHLEDESKKEKSFALKTIALKTISFAVDSEDE
DALDQDDIAYFSRKYKNFIKRKKQFKKQLSTQRESKGEKSKKDEVICYECKKPGHIRTDCPFLKSSKKSKKKAMKATWDDSSESESGSDVEEVARICLMALSDKDGVLDD
EVNLEPPSYDELFETFESMQNDLEKLSSKYVVLKKKYNALSSENKSLLDKIACFKNNEHDVQIDEMKVSCDKHACVCDEKNALLDKVRFLEHDSCEKDNLIKLLKEKELN
ALQELDKAKESIKKLTIGAQKFDKIIEVGKPYGDKRGLGYIDECSTPSSKTIFVKASSIVPNMPNVVSNDAKSSFVPICHNCGVEGHIRPKCFKLKYAHNTSSRRNFSQR
AKFYNAPRENFSKKTYSCYFSRSNAFNANAKMKWIPKFVNANTLGPKQVCLKASKKNRWYLDNGCSRHMTRDRSKFISFSRKDGGMDKVCDACQMGKQTKSSFKSKNVIS
TTRPLQLLHMDLFVPSRIASFGGNYYAFVIVDDFSRFTWVLMIKHKDDALKSFISLAKRVQNENGFFISTIRSDHGGEFDNDVFKAYCEKKNGFSHNFSSPRTPQQNGVV
ERKNRTLQEFARSMLHEYGLPNYFWAEAVNTACYVLNRVLIKPSLDKTPYELWHDKIPNIGYFKVFGCKCFILNNKEKLGKFDSKTDIGIFLGYSSTSKAYRVFNKRTLL
IEESMHVVFDESCNIVSNESIYSDDLEKDFEDLLVSEKGKEIVPSMEEVSINEKKEDGSSSMPKEWRYAPSHPKDLILCDPEQGVKTRSSLNLFSNLAFVSQVEPKSFKD
AECDEFWILGMQEELNQFERNNVLELVPKPSNASIIGTKWVFRNKMDEHGNIIRNKARLVAQGYCQEEGIDSEETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLN
GYIIEQVYVEQPPGFENFELPSHVYKLKKALYGLKQAPRAWYDRLSKIAKTPMRTTTKLDKDEKGKCVDIKIYRGYSDADFAGSLLDRKSTSGTCQFLGSSYSVKATGRI
AQIPVSKAFLGRVINALAKPIDGQGEISSSDSRLIESPALGIILRRSVYEPLQTGLIAIDSIIPIRRGQRELIIGERQTGKTLADIFTKLLNEESFCKNRLELGIIHCDA
S