; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0042168 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0042168
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr13:37733660..37751307
RNA-Seq ExpressionLag0042168
SyntenyLag0042168
Gene Ontology termsGO:0006396 - RNA processing (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR003107 - HAT (Half-A-TPR) repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR036397 - Ribonuclease H superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN72141.1 hypothetical protein VITISV_017108 [Vitis vinifera]2.4e-25945.2Show/hide
Query:  AVAQTGNYPQAFSCG-TSSPWIIDSGASDHMTSTSHFFNSYSPSYSNEKIRIADGSFSSVAGKGTIKLTEHISLQSVLHVPKLTCNLLFVSKLSRDSSCR
        ++A TGN   A SC   S+PWIIDSGASDHMT++S+ F SYSP   N+K++IADG+FS +AGKG IK++E I L+ VLHVPKLTCNLLFVSKLSRD +C 
Subjt:  AVAQTGNYPQAFSCG-TSSPWIIDSGASDHMTSTSHFFNSYSPSYSNEKIRIADGSFSSVAGKGTIKLTEHISLQSVLHVPKLTCNLLFVSKLSRDSSCR

Query:  VIFFESHCILQDQKSEKMIGSARMLDGLYYFDEGSFSNKVTQGFSSMSSL--------------------------------------------------
        VIF+ESHCI QD+ S K IGSARM++GLYYF++   SNK+ QG SS+SSL                                                  
Subjt:  VIFFESHCILQDQKSEKMIGSARMLDGLYYFDEGSFSNKVTQGFSSMSSL--------------------------------------------------

Query:  ------------------------------------------------------------------------------------------------SDKG
                                                                                                        + KG
Subjt:  ------------------------------------------------------------------------------------------------SDKG

Query:  IVHQSTCRDTPQQNGVAERKNRHLLEVARAIMFSMHVPKYLWGEAILTTTYLINRMP-------------------------------------------
        I+HQS+C DTPQQNG+A+RKN+HLLEVARA+MF M++PKYLWG+AILT +YLINRMP                                           
Subjt:  IVHQSTCRDTPQQNGVAERKNRHLLEVARAIMFSMHVPKYLWGEAILTTTYLINRMP-------------------------------------------

Query:  ------------------------------------------KNQPYFTQSSLQGEKPNLEDNFWDVSTPLPNVICP-------DDISPLVPSIETSHSG
                                                  +N PYFT++ LQGEK  +E NFW++  P P+VI          +  P+    E   S 
Subjt:  ------------------------------------------KNQPYFTQSSLQGEKPNLEDNFWDVSTPLPNVICP-------DDISPLVPSIETSHSG

Query:  GEILHNDPTDQVPEFKFYTKRKFNQRKQDQIVDPSQNQSET------------PTIETGRSGNSSSIPTSTIQNTL---PVVS-----------------
         EIL         E   Y+++K + R +DQ + P+  Q +             P+I T    +SSS+   ++ +     P +S                 
Subjt:  GEILHNDPTDQVPEFKFYTKRKFNQRKQDQIVDPSQNQSET------------PTIETGRSGNSSSIPTSTIQNTL---PVVS-----------------

Query:  DLDMPIAIRK-------------------------------------------EDSNWKVAVMEEMNALKQNGTWEIVELPKDKKPVGCKWVFTIKCNAD
        DLD+PIA+RK                                           ++ +WK+AV +EMNALK+NGTWE V+LP++KK VGCKWVFTIK  AD
Subjt:  DLDMPIAIRK-------------------------------------------EDSNWKVAVMEEMNALKQNGTWEIVELPKDKKPVGCKWVFTIKCNAD

Query:  -------------------------------------------VNSNWSLYQLDVKNAFLNGDLEEEVFMDLPPGFEEDLGCSKICKLKKSLYGLKQSPR
                                                   VNSNW L+QLDVKNAFLNGDLEEEVFM  PP FEE  G  K+CKLKKSLY LKQSPR
Subjt:  -------------------------------------------VNSNWSLYQLDVKNAFLNGDLEEEVFMDLPPGFEEDLGCSKICKLKKSLYGLKQSPR

Query:  AWFERFGNVVSSYGFIQSQADHTIFYKHSENNKISILIVYVDDIILTGDDEAGLADLKKNLAREFQIKDLGPLKYFLGMEFARSKKGIFVNQRKYVLDLL
        AWFERFG V+  YG+ QSQA+HT+FYKHS   K+ ILIVYVDDI+LTGDD   L  +K  LA EF+IKDLG LKYFLGMEFARSK+GIFVNQRKYV +LL
Subjt:  AWFERFGNVVSSYGFIQSQADHTIFYKHSENNKISILIVYVDDIILTGDDEAGLADLKKNLAREFQIKDLGPLKYFLGMEFARSKKGIFVNQRKYVLDLL

Query:  EETGLLGCKTAETPVEPNLKLQVAKVEEIKDKERYQRLVGRLIYLSHTRPDIAYAVSMVSQFMHAPGPTHFEAVYRILKYLKGTPGKGILFGRHNHFQIE
        +E  +LGCK AE P+EPN+KLQ  K + +KD++RYQRLVGRLIYLSHT PDIA++VSMVSQFMH P P HFE VYRIL+YLKGTPG+G+LF    H QIE
Subjt:  EETGLLGCKTAETPVEPNLKLQVAKVEEIKDKERYQRLVGRLIYLSHTRPDIAYAVSMVSQFMHAPGPTHFEAVYRILKYLKGTPGKGILFGRHNHFQIE

Query:  IYTDADWVGSTTDRRSTSSYCSFVGGNLVTWRSKKQNVVARSSAEAKFRALAHGICEGIWIKRILEELKIHLNTPIRIYCDNKAAISIAHNPVLHDRTKH
         YT+ADW GS  DRRSTS YCSFV GNLVTWRSKKQNVVARSSAEA+FR +AHG CE +WI+R+LEELK+  ++P+++YCDNKAAIS+AHNPVLHDRTKH
Subjt:  IYTDADWVGSTTDRRSTSSYCSFVGGNLVTWRSKKQNVVARSSAEAKFRALAHGICEGIWIKRILEELKIHLNTPIRIYCDNKAAISIAHNPVLHDRTKH

Query:  IEVDKHFIKEKIETNTICIPYLPTSEQIADVLTKGLPKKQFDKLIDKLAMEDIFKPA
        +EVDK FIKEKI+   +C+ Y+PT EQ+A+V TK L K+QFD L+ KLAMEDIFK A
Subjt:  IEVDKHFIKEKIETNTICIPYLPTSEQIADVLTKGLPKKQFDKLIDKLAMEDIFKPA

CAN76494.1 hypothetical protein VITISV_024984 [Vitis vinifera]5.7e-23753.58Show/hide
Query:  AVAQTGNYPQAFSCG-TSSPWIIDSGASDHMTSTSHFFNSYSPSYSNEKIRIADGSFSSVAGKGTIKLTEHISLQSVLHVPKLTCNLLFVSKLSRDSSCR
        ++A TGN   A SC   S+PWIIDS ASDHMT++S+ F SYSP   N+K+RIA G+FS +A KG IK++E I L+SVLHVPKLTCNLL VSKLSRDS+C 
Subjt:  AVAQTGNYPQAFSCG-TSSPWIIDSGASDHMTSTSHFFNSYSPSYSNEKIRIADGSFSSVAGKGTIKLTEHISLQSVLHVPKLTCNLLFVSKLSRDSSCR

Query:  VIFFESHCILQDQKSEKMIGSARMLDGLYYFDEGSFSNKVTQGFSSMSSLSDKGIVHQSTCRDTPQQNGVAE------RKNRHLLEVARAIMFSMHVPKY
        VIF+ESHCI QDQ S K IGSA+M++GLYYF++   SNK+ QG SS+SSLS +  + ++   ++  + G++E      +KNR+ LE    +++S    K 
Subjt:  VIFFESHCILQDQKSEKMIGSARMLDGLYYFDEGSFSNKVTQGFSSMSSLSDKGIVHQSTCRDTPQQNGVAE------RKNRHLLEVARAIMFSMHVPKY

Query:  LWGEAILTTTYLINRMPKNQPYFTQSSLQGEKPNLEDNFWDVSTPLPNVICPDDISPLVPSIETSHSGGEILHNDPTDQVPEFKFYTKRKFNQRKQDQIV
        + G +   +  L +  PK        ++ G  P++  +    S+ + ++  P    P +  I     G  ++   P   +        RK          
Subjt:  LWGEAILTTTYLINRMPKNQPYFTQSSLQGEKPNLEDNFWDVSTPLPNVICPDDISPLVPSIETSHSGGEILHNDPTDQVPEFKFYTKRKFNQRKQDQIV

Query:  DPSQNQSETPTIETGRSGNSSSIPTSTIQNTLPVVSDLDMPIAIRK--EDSNWKVAVMEEMNALKQNGTWEIVELPKDKKPVGCKWVFTIKCNAD-----
          ++  ++ P ++     N S    +   N    +S L +P  I++  ++ +WK+AV EEMNALK+NGTWE+V+LP++KK VGCKWVFTIK  AD     
Subjt:  DPSQNQSETPTIETGRSGNSSSIPTSTIQNTLPVVSDLDMPIAIRK--EDSNWKVAVMEEMNALKQNGTWEIVELPKDKKPVGCKWVFTIKCNAD-----

Query:  --------------------------------------VNSNWSLYQLDVKNAFLNGDLEEEVFMDLPPGFEEDLGCSKICKLKKSLYGLKQSPRAWFER
                                              VNSNW L+QLDVKN FLNGDLEEEVFM  PPGFEE  G  K+CKLKKSLYGLK+SPRAWFER
Subjt:  --------------------------------------VNSNWSLYQLDVKNAFLNGDLEEEVFMDLPPGFEEDLGCSKICKLKKSLYGLKQSPRAWFER

Query:  FGNVVSSYGFIQSQADHTIFYKHSENNKISILIVYVDDIILTGDDEAGLADLKKNLAREFQIKDLGPLKYFLGMEFARSKKGIFVNQRKYVLDLLEETGL
        FG V+  YG+ QSQADHT+FYKHS   K++ILIVYVDDI+LTGDD   L  LK  LA EF+IKDLG LKYFL MEFARSK+GIFVNQ KYVLDLL+ETG+
Subjt:  FGNVVSSYGFIQSQADHTIFYKHSENNKISILIVYVDDIILTGDDEAGLADLKKNLAREFQIKDLGPLKYFLGMEFARSKKGIFVNQRKYVLDLLEETGL

Query:  LGCKTAETPVEPNLKLQVAKVEEIKDKERYQRLVGRLIYLSHTRPDIAYAVSMVSQFMHAPGPTHFEAVYRILKYLKGTPGKGILFGRHNHFQIEIYTDA
        LGCK AETP+E N+KL   K + +KD++RYQRLVGRLIYLSHT PDIA++VSMVSQFMHAPGP HFEA+YRIL+YLKGTPG+G+LF    H QIE YTDA
Subjt:  LGCKTAETPVEPNLKLQVAKVEEIKDKERYQRLVGRLIYLSHTRPDIAYAVSMVSQFMHAPGPTHFEAVYRILKYLKGTPGKGILFGRHNHFQIEIYTDA

Query:  DWVGSTTDRRSTSSYCSFVGGNLVTWRSKKQNVVARSSAEAKFRALAHGICEGIWIKRILEELKIHLNTPIRIYCDNKAAISIAHNPVLHDRTKHIEVDK
        DWVGS  DRRST  YCSFVGGNLV WRSKKQNVVARSSA+A+FRA+AHGICE +WI+R+LEELK+              +   +   VLHDRTKH+EVDK
Subjt:  DWVGSTTDRRSTSSYCSFVGGNLVTWRSKKQNVVARSSAEAKFRALAHGICEGIWIKRILEELKIHLNTPIRIYCDNKAAISIAHNPVLHDRTKHIEVDK

Query:  HFIKEKIETNTICIPYLPTSEQIADVLTKGLPKKQFDKLIDKLAMEDIFKP
        HFIKEKI+   +C+ Y+PT EQ+ DV TKGL K+QFD L+ KLAMEDIFKP
Subjt:  HFIKEKIETNTICIPYLPTSEQIADVLTKGLPKKQFDKLIDKLAMEDIFKP

GAU39772.1 hypothetical protein TSUD_220160 [Trifolium subterraneum]1.4e-22243.17Show/hide
Query:  VAQTGNYPQAFSC-GTSSPWIIDSGASDHMTSTSHFFNSYSPSYSNEKIRIADGSFSSVAGKGTIKLTEHISLQSVLHVPKLTCNLLFVSKLSRDSSCRV
        VAQTG    A S    S+PWIIDSGAS+HMT+ SH F+SY  S  +EK+RIADGS+SS+AGKG IK++EHI+LQSVLHVPK  CNLL V KLS+D++C V
Subjt:  VAQTGNYPQAFSC-GTSSPWIIDSGASDHMTSTSHFFNSYSPSYSNEKIRIADGSFSSVAGKGTIKLTEHISLQSVLHVPKLTCNLLFVSKLSRDSSCRV

Query:  IFFESHCILQDQKSEKMIGSARMLDGLYYFDEGSFSNKVTQGFSSMS---SLSD----------------------------------------------
        +F  S C+ QDQ S KMIG+AR ++GLYY DE    NK      S S   S+SD                                              
Subjt:  IFFESHCILQDQKSEKMIGSARMLDGLYYFDEGSFSNKVTQGFSSMS---SLSD----------------------------------------------

Query:  --------------------------------------------------------------------------------------------------KG
                                                                                                          KG
Subjt:  --------------------------------------------------------------------------------------------------KG

Query:  IVHQSTCRDTPQQNGVAERKNRHLLEVARAIMFSMHVPKYLWGEAILTTTYLINRMPKN-----------QPYFTQS-------------SLQGE---KP
        I+HQSTCRDTPQQNG+AERKNRHLLEV RAIM SM+VPKYLWG AILT  YLINRMP             Q  F  S             S QGE     
Subjt:  IVHQSTCRDTPQQNGVAERKNRHLLEVARAIMFSMHVPKYLWGEAILTTTYLINRMPKN-----------QPYFTQS-------------SLQGE---KP

Query:  NLEDNFWDVSTPLPNVICPDDI----SPLVPSIETSHSGGEILHNDPTD---------QVPEFKFYTKRKFNQRKQDQIVDPSQNQSETPTIETGRSGNS
        N EDNFW+   PLP +   DD+     P    +E  +   E+L N  ++         +  E K Y +++F++     I+ P+  QS++P+     + + 
Subjt:  NLEDNFWDVSTPLPNVICPDDI----SPLVPSIETSHSGGEILHNDPTD---------QVPEFKFYTKRKFNQRKQDQIVDPSQNQSETPTIETGRSGNS

Query:  SSIP--TSTIQNTLPVVS---------------DLDMPIAIRK-------------------------------------------EDSNWKVAVMEEMN
        +S P  +S   N LP +S               DLD+PIA RK                                            D NWK+AV EEM+
Subjt:  SSIP--TSTIQNTLPVVS---------------DLDMPIAIRK-------------------------------------------EDSNWKVAVMEEMN

Query:  ALKQNGTWEIVE-LPKDKKPVGCKWVFTIKCNAD-------------------------------------------VNSNWSLYQLDVKNAFLNGDLEE
        AL++N TW I + LPK KK VGCKWVFT+KC AD                                           VN NW+L+Q DVKNAFLNG+L E
Subjt:  ALKQNGTWEIVE-LPKDKKPVGCKWVFTIKCNAD-------------------------------------------VNSNWSLYQLDVKNAFLNGDLEE

Query:  EVFMDLPPGFEEDLGCSKICKLKKSLYGLKQSPRAWFERFGNVVSSYGFIQSQADHTIFYKHSENNKISILIVYVDDIILTGDDEAGLADLKKNLAREFQ
        EV+M LPPGFEE+ G  +IC+                         +GF QSQADHT+F+KHS   KI+ILIVYVDDII+TGDD   + DLK+ L  EF 
Subjt:  EVFMDLPPGFEEDLGCSKICKLKKSLYGLKQSPRAWFERFGNVVSSYGFIQSQADHTIFYKHSENNKISILIVYVDDIILTGDDEAGLADLKKNLAREFQ

Query:  IKDLGPLKYFLGMEFARSKKGIFVNQRKYVLDLLEETGLLGCKTAETPVEPNLKLQVAKVEEIKDKERYQRLVGRLIYLSHTRPDIAYAVSMVSQFMHAP
        IKDLG LKYFLGMEF RSK+G F+NQRKY+LDLL+ETG++GC+  ETP++PN+KL+     E+ D+ERYQRL                            
Subjt:  IKDLGPLKYFLGMEFARSKKGIFVNQRKYVLDLLEETGLLGCKTAETPVEPNLKLQVAKVEEIKDKERYQRLVGRLIYLSHTRPDIAYAVSMVSQFMHAP

Query:  GPTHFEAVYRILKYLKGTPGKGILFGRHNHFQIEIYTDADWVGSTTDRRSTSSYCSFVGGNLVTWRSKKQNVVARSSAEAKFRALAHGICEGIWIKRILE
        GP HFEA++RIL+YLKGTPGKG+LF    H Q+E YTDADW G+  DRRSTS YC+FVGGN VTWRSKKQ+VVARSSAEA+FR++AHG CE +WIK+ +E
Subjt:  GPTHFEAVYRILKYLKGTPGKGILFGRHNHFQIEIYTDADWVGSTTDRRSTSSYCSFVGGNLVTWRSKKQNVVARSSAEAKFRALAHGICEGIWIKRILE

Query:  ELKIHLNTPIRIYCDNKAAISIAHNPVLHDRTKHIEVDKHFIKEKIETNTICIPYLPTSEQIADVLTKGLPKKQFDKLIDKLAMEDIFKPA
        ELKI   TPI++YCDNKAAISIAHNPV HDRTKH+EVDKHFIKEKI++  IC+ Y+PT+ Q+ADVLTK LPK+QFD ++ KLAM DIF PA
Subjt:  ELKIHLNTPIRIYCDNKAAISIAHNPVLHDRTKHIEVDKHFIKEKIETNTICIPYLPTSEQIADVLTKGLPKKQFDKLIDKLAMEDIFKPA

XP_024044151.1 uncharacterized protein LOC18046468 isoform X1 [Citrus clementina]6.5e-23342.9Show/hide
Query:  AVAQTGNYPQAFSC--GTSSPWIIDSGASDHMTSTSHFFNSYSPSYSNEKIRIADGSFSSVAGKGTIKLTEHISLQSVLHVPKLTCNLLFVSKLSRDSSC
        ++AQ GN   A         PWIIDSGA+DHMTS S  F+SY P   ++KI+IADGS SSVAGKG+I ++ ++ L SVLHVP L+CNLL VSK+++D  C
Subjt:  AVAQTGNYPQAFSC--GTSSPWIIDSGASDHMTSTSHFFNSYSPSYSNEKIRIADGSFSSVAGKGTIKLTEHISLQSVLHVPKLTCNLLFVSKLSRDSSC

Query:  RVIFFESHCILQDQKSEKMIGSARMLDGLYYFDEG--------SFSNKVT--------------------------------------------------
           F  S+C  QD  S K IGSAR +DGLYYF+E         + +N+VT                                                  
Subjt:  RVIFFESHCILQDQKSEKMIGSARMLDGLYYFDEG--------SFSNKVT--------------------------------------------------

Query:  ---------------------QGFSSMSSLS-------------------------------------------------------------------DK
                              G S ++++S                                                                   + 
Subjt:  ---------------------QGFSSMSSLS-------------------------------------------------------------------DK

Query:  GIVHQSTCRDTPQQNGVAERKNRHLLEVARAIMFSMHVPKYLWGEAILTTTYLINRMP------------------------------------------
        GIVHQS+C DTPQQNGVAERKNRHLLEVAR++MF+  VPK  WGEAILT +YLINRMP                                          
Subjt:  GIVHQSTCRDTPQQNGVAERKNRHLLEVARAIMFSMHVPKYLWGEAILTTTYLINRMP------------------------------------------

Query:  -------------------------------------------KNQPYFTQSSLQGEKPNLEDNFWDVSTPLPNVICPDDISPLVPSIETSHSGGEILHN
                                                   +N+ +F ++SLQGE    ED+FW++S P+P + C   +   +PSI  +    E +  
Subjt:  -------------------------------------------KNQPYFTQSSLQGEKPNLEDNFWDVSTPLPNVICPDDISPLVPSIETSHSGGEILHN

Query:  DPTDQVPEFKFYTKRKFNQRKQDQIVDPSQNQSETPTIETGRSGNSSSIPTSTIQNTLPVVSDLDMPIAIRK----------------------------
              PE + YT+R  N ++ +    P    S+  T     +GN  S P S  ++ L   +DLD+PIA RK                            
Subjt:  DPTDQVPEFKFYTKRKFNQRKQDQIVDPSQNQSETPTIETGRSGNSSSIPTSTIQNTLPVVSDLDMPIAIRK----------------------------

Query:  -----------EDS----NWKVAVMEEMNALKQNGTWEIVELPKDKKPVGCKWVFTIKCNAD--------------------------------VNS---
                   +D+     W+ AV  EM AL++N TWE+V+LP++KKPVGCKW+FT+K  AD                                +NS   
Subjt:  -----------EDS----NWKVAVMEEMNALKQNGTWEIVELPKDKKPVGCKWVFTIKCNAD--------------------------------VNS---

Query:  --------NWSLYQLDVKNAFLNGDLEEEVFMDLPPGFEEDLGCSKICKLKKSLYGLKQSPRAWFERFGNVVSSYGFIQSQADHTIFYKHSENNKISILI
                 W L QLDVKNAFLNG+LEEEV+MDLPPGFE + G  K+CKLK+SLYGLKQSPRAWF+RF   V S+G++QSQADHT+F+KH  +  + I+I
Subjt:  --------NWSLYQLDVKNAFLNGDLEEEVFMDLPPGFEEDLGCSKICKLKKSLYGLKQSPRAWFERFGNVVSSYGFIQSQADHTIFYKHSENNKISILI

Query:  VYVDDIILTGDDEAGLADLKKNLAREFQIKDLGPLKYFLGMEFARSKKGIFVNQRKYVLDLLEETGLLGCKTAETPVEPNLKLQVAKVEEIKDKERYQRL
        VYVDDIILTG +   + +LKK LAREF++KDLGPL+YFLGME ARS KGIFV+QRKY LDLL+ETG+LGCK  +TP++P  KL  A   +  D+E YQRL
Subjt:  VYVDDIILTGDDEAGLADLKKNLAREFQIKDLGPLKYFLGMEFARSKKGIFVNQRKYVLDLLEETGLLGCKTAETPVEPNLKLQVAKVEEIKDKERYQRL

Query:  VGRLIYLSHTRPDIAYAVSMVSQFMHAPGPTHFEAVYRILKYLKGTPGKGILFGRHNHFQIEIYTDADWVGSTTDRRSTSSYCSFVGGNLVTWRSKKQNV
        VG+LIYLSHTRPDIA+AVS++SQFMH+P   H EAVY++LKYLK TPGKG+LF ++N+ Q+E+YTDADW GS  DRRSTS YC+FVGGNLVTWRSKKQ+V
Subjt:  VGRLIYLSHTRPDIAYAVSMVSQFMHAPGPTHFEAVYRILKYLKGTPGKGILFGRHNHFQIEIYTDADWVGSTTDRRSTSSYCSFVGGNLVTWRSKKQNV

Query:  VARSSAEAKFRALAHGICEGIWIKRILEELKIHLNTPIRIYCDNKAAISIAHNPVLHDRTKHIEVDKHFIKEKIETNTICIPYLPTSEQIADVLTKGLPK
        VARSSAEA+FRA+AHGICE +W+K++LEELKI ++ P+++YCDNKAAI+IAHNPV HDRTKH+EVD+HFIKEKIE+  IC+ ++PT++Q+AD+LTKGL K
Subjt:  VARSSAEAKFRALAHGICEGIWIKRILEELKIHLNTPIRIYCDNKAAISIAHNPVLHDRTKHIEVDKHFIKEKIETNTICIPYLPTSEQIADVLTKGLPK

Query:  KQFDKLIDKLAMEDIFKPA
          F+ L  KL M DIF+PA
Subjt:  KQFDKLIDKLAMEDIFKPA

XP_024044152.1 uncharacterized protein LOC18046468 isoform X2 [Citrus clementina]6.5e-23342.9Show/hide
Query:  AVAQTGNYPQAFSC--GTSSPWIIDSGASDHMTSTSHFFNSYSPSYSNEKIRIADGSFSSVAGKGTIKLTEHISLQSVLHVPKLTCNLLFVSKLSRDSSC
        ++AQ GN   A         PWIIDSGA+DHMTS S  F+SY P   ++KI+IADGS SSVAGKG+I ++ ++ L SVLHVP L+CNLL VSK+++D  C
Subjt:  AVAQTGNYPQAFSC--GTSSPWIIDSGASDHMTSTSHFFNSYSPSYSNEKIRIADGSFSSVAGKGTIKLTEHISLQSVLHVPKLTCNLLFVSKLSRDSSC

Query:  RVIFFESHCILQDQKSEKMIGSARMLDGLYYFDEG--------SFSNKVT--------------------------------------------------
           F  S+C  QD  S K IGSAR +DGLYYF+E         + +N+VT                                                  
Subjt:  RVIFFESHCILQDQKSEKMIGSARMLDGLYYFDEG--------SFSNKVT--------------------------------------------------

Query:  ---------------------QGFSSMSSLS-------------------------------------------------------------------DK
                              G S ++++S                                                                   + 
Subjt:  ---------------------QGFSSMSSLS-------------------------------------------------------------------DK

Query:  GIVHQSTCRDTPQQNGVAERKNRHLLEVARAIMFSMHVPKYLWGEAILTTTYLINRMP------------------------------------------
        GIVHQS+C DTPQQNGVAERKNRHLLEVAR++MF+  VPK  WGEAILT +YLINRMP                                          
Subjt:  GIVHQSTCRDTPQQNGVAERKNRHLLEVARAIMFSMHVPKYLWGEAILTTTYLINRMP------------------------------------------

Query:  -------------------------------------------KNQPYFTQSSLQGEKPNLEDNFWDVSTPLPNVICPDDISPLVPSIETSHSGGEILHN
                                                   +N+ +F ++SLQGE    ED+FW++S P+P + C   +   +PSI  +    E +  
Subjt:  -------------------------------------------KNQPYFTQSSLQGEKPNLEDNFWDVSTPLPNVICPDDISPLVPSIETSHSGGEILHN

Query:  DPTDQVPEFKFYTKRKFNQRKQDQIVDPSQNQSETPTIETGRSGNSSSIPTSTIQNTLPVVSDLDMPIAIRK----------------------------
              PE + YT+R  N ++ +    P    S+  T     +GN  S P S  ++ L   +DLD+PIA RK                            
Subjt:  DPTDQVPEFKFYTKRKFNQRKQDQIVDPSQNQSETPTIETGRSGNSSSIPTSTIQNTLPVVSDLDMPIAIRK----------------------------

Query:  -----------EDS----NWKVAVMEEMNALKQNGTWEIVELPKDKKPVGCKWVFTIKCNAD--------------------------------VNS---
                   +D+     W+ AV  EM AL++N TWE+V+LP++KKPVGCKW+FT+K  AD                                +NS   
Subjt:  -----------EDS----NWKVAVMEEMNALKQNGTWEIVELPKDKKPVGCKWVFTIKCNAD--------------------------------VNS---

Query:  --------NWSLYQLDVKNAFLNGDLEEEVFMDLPPGFEEDLGCSKICKLKKSLYGLKQSPRAWFERFGNVVSSYGFIQSQADHTIFYKHSENNKISILI
                 W L QLDVKNAFLNG+LEEEV+MDLPPGFE + G  K+CKLK+SLYGLKQSPRAWF+RF   V S+G++QSQADHT+F+KH  +  + I+I
Subjt:  --------NWSLYQLDVKNAFLNGDLEEEVFMDLPPGFEEDLGCSKICKLKKSLYGLKQSPRAWFERFGNVVSSYGFIQSQADHTIFYKHSENNKISILI

Query:  VYVDDIILTGDDEAGLADLKKNLAREFQIKDLGPLKYFLGMEFARSKKGIFVNQRKYVLDLLEETGLLGCKTAETPVEPNLKLQVAKVEEIKDKERYQRL
        VYVDDIILTG +   + +LKK LAREF++KDLGPL+YFLGME ARS KGIFV+QRKY LDLL+ETG+LGCK  +TP++P  KL  A   +  D+E YQRL
Subjt:  VYVDDIILTGDDEAGLADLKKNLAREFQIKDLGPLKYFLGMEFARSKKGIFVNQRKYVLDLLEETGLLGCKTAETPVEPNLKLQVAKVEEIKDKERYQRL

Query:  VGRLIYLSHTRPDIAYAVSMVSQFMHAPGPTHFEAVYRILKYLKGTPGKGILFGRHNHFQIEIYTDADWVGSTTDRRSTSSYCSFVGGNLVTWRSKKQNV
        VG+LIYLSHTRPDIA+AVS++SQFMH+P   H EAVY++LKYLK TPGKG+LF ++N+ Q+E+YTDADW GS  DRRSTS YC+FVGGNLVTWRSKKQ+V
Subjt:  VGRLIYLSHTRPDIAYAVSMVSQFMHAPGPTHFEAVYRILKYLKGTPGKGILFGRHNHFQIEIYTDADWVGSTTDRRSTSSYCSFVGGNLVTWRSKKQNV

Query:  VARSSAEAKFRALAHGICEGIWIKRILEELKIHLNTPIRIYCDNKAAISIAHNPVLHDRTKHIEVDKHFIKEKIETNTICIPYLPTSEQIADVLTKGLPK
        VARSSAEA+FRA+AHGICE +W+K++LEELKI ++ P+++YCDNKAAI+IAHNPV HDRTKH+EVD+HFIKEKIE+  IC+ ++PT++Q+AD+LTKGL K
Subjt:  VARSSAEAKFRALAHGICEGIWIKRILEELKIHLNTPIRIYCDNKAAISIAHNPVLHDRTKHIEVDKHFIKEKIETNTICIPYLPTSEQIADVLTKGLPK

Query:  KQFDKLIDKLAMEDIFKPA
          F+ L  KL M DIF+PA
Subjt:  KQFDKLIDKLAMEDIFKPA

TrEMBL top hitse value%identityAlignment
A0A2N9G304 Reverse transcriptase Ty1/copia-type domain-containing protein1.4e-22553.6Show/hide
Query:  ILQDQKSEKMIGSARMLDGLYYFDEGSFSNKVTQGFSSMSSLSDKGIVHQSTCRDTPQQNGVAERKNRHLLEVARAIMFSMHVPKYLWGEAILTTTYLIN
        I +DQ S + IGSARM++GLYYFD+   S+K  QGFSS+SS+S                                       V + + GE I        
Subjt:  ILQDQKSEKMIGSARMLDGLYYFDEGSFSNKVTQGFSSMSSLSDKGIVHQSTCRDTPQQNGVAERKNRHLLEVARAIMFSMHVPKYLWGEAILTTTYLIN

Query:  RMPKNQPYFTQSSLQGEKPNLEDNFWDVSTPLPNVI----------CPDDISPLVPSIETSHSG--------GEILHNDPTDQVPEFKFYTKRKFNQRKQ
                       GE    EDNFW+ S PLPN I           P     +  SI+ + SG         EIL  D  +   E   YT+++  +R +
Subjt:  RMPKNQPYFTQSSLQGEKPNLEDNFWDVSTPLPNVI----------CPDDISPLVPSIETSHSG--------GEILHNDPTDQVPEFKFYTKRKFNQRKQ

Query:  DQIVDPSQNQSETPTIETGR---SGNSSSIPTSTIQNTLPVV------------------SDLDMPIAIRK-----------------------------
        D  + P QNQSE  ++  G    SGNSS IP   + N++P++                  SDLD+PIAIRK                             
Subjt:  DQIVDPSQNQSETPTIETGR---SGNSSSIPTSTIQNTLPVV------------------SDLDMPIAIRK-----------------------------

Query:  ---------------EDSNWKVAVMEEMNALKQNGTWEIVELPKDKKPVGCKWVFTIKCNAD--------------------------------------
                       +D NWK+AV+EEMNAL++NGTWE+V+LP+DKK VGCKWVF++K  AD                                      
Subjt:  ---------------EDSNWKVAVMEEMNALKQNGTWEIVELPKDKKPVGCKWVFTIKCNAD--------------------------------------

Query:  -----VNSNWSLYQLDVKNAFLNGDLEEEVFMDLPPGFEEDLGCSKICKLKKSLYGLKQSPRAWFERFGNVVSSYGFIQSQADHTIFYKHSENNKISILI
             +NSNW L+QLDVKNAFLNGDLEEEVFM LPPGFEE LG  K+C+L+KSLYGLKQSPRAWFERFG  V  +G+ QSQADHT+FYKHS+  KI+ILI
Subjt:  -----VNSNWSLYQLDVKNAFLNGDLEEEVFMDLPPGFEEDLGCSKICKLKKSLYGLKQSPRAWFERFGNVVSSYGFIQSQADHTIFYKHSENNKISILI

Query:  VYVDDIILTGDDEAGLADLKKNLAREFQIKDLGPLKYFLGMEFARSKKGIFVNQRKYVLDLLEETGLLGCKTAETPVEPNLKLQVAKVEEIKDKERYQRL
        VYVDDIILTGDD A L  LK+ LA EF+IKDLG LKYFLGMEFA+SK+GIFVNQRKYVLDLL ETG+LGCK AETP+EPN KLQ A+ + + +KE+YQRL
Subjt:  VYVDDIILTGDDEAGLADLKKNLAREFQIKDLGPLKYFLGMEFARSKKGIFVNQRKYVLDLLEETGLLGCKTAETPVEPNLKLQVAKVEEIKDKERYQRL

Query:  VGRLIYLSHTRPDIAYAVSMVSQFMHAPGPTHFEAVYRILKYLKGTPGKGILFGRHNHFQIEIYTDADWVGSTTDRRSTSSYCSFVGGNLVTWRSKKQNV
        VGRLIYLSHTRPDIA++VSMVSQFMH+PGP HFEAVYRIL+YLKGTPGKG+LF +  H QIE YTDADW GS TDRRSTS YCSFVGGNLVTWRSKKQNV
Subjt:  VGRLIYLSHTRPDIAYAVSMVSQFMHAPGPTHFEAVYRILKYLKGTPGKGILFGRHNHFQIEIYTDADWVGSTTDRRSTSSYCSFVGGNLVTWRSKKQNV

Query:  VARSSAEAKFRALAHGICEGIWIKRILEELKIHLNTPIRIYCDNKAAISIAHNPVLHDRTKHIEVDKHFIKEKIETNTICIPYLPTSEQIADVLTKGLPK
        VARSSAEA+FR +AHGICE +WIKR+LEELK   + P+++YCDNKAAISIAHNPVLHDRTKH+EVDKHFIKEK+E+  IC+PY+PT+EQIAD+ TKGL K
Subjt:  VARSSAEAKFRALAHGICEGIWIKRILEELKIHLNTPIRIYCDNKAAISIAHNPVLHDRTKHIEVDKHFIKEKIETNTICIPYLPTSEQIADVLTKGLPK

Query:  KQFDKLIDKLAMEDIFKPA
        KQFD L+ KL MEDIFKPA
Subjt:  KQFDKLIDKLAMEDIFKPA

A0A2N9GKJ5 Uncharacterized protein6.5e-23953.18Show/hide
Query:  EKIRIADGSFSSVAGKGTIKLTEHISLQSVLHVPKLTCNLLFVSKLSRDSSCRVIFFESHCILQDQKSEKMIGSARMLDGLYYFDEGSFSNKVTQGFSSM
        EK+RIADGSFSS+AGKG IK++E I L+SVLHVPKL CNL                      L DQ S + IG+ARM++GLYYFD+   S+K  QGFSS+
Subjt:  EKIRIADGSFSSVAGKGTIKLTEHISLQSVLHVPKLTCNLLFVSKLSRDSSCRVIFFESHCILQDQKSEKMIGSARMLDGLYYFDEGSFSNKVTQGFSSM

Query:  SSLSDKGIVHQSTCRDTPQQNGVAERKNRHLLEVARAIMFSMHVPKYLWGEAILTTTYLINRMPKNQPYFTQSSLQGEKPNLEDNFWDVSTPLPNVI---
        SS+S                                       V + + GE I                       GE    EDNFW+ S PLPN I   
Subjt:  SSLSDKGIVHQSTCRDTPQQNGVAERKNRHLLEVARAIMFSMHVPKYLWGEAILTTTYLINRMPKNQPYFTQSSLQGEKPNLEDNFWDVSTPLPNVI---

Query:  -------CPDDISPLVPSIETSHSG--------GEILHNDPTDQVPEFKFYTKRKFNQRKQDQIVDPSQNQSETPTIETG---RSGNSSSIPTSTIQNTL
                P     +  SI+ + SG         EIL  D  +   E   YT+++  +R +D  + P QNQSE  ++  G    SGNSS IP   + N++
Subjt:  -------CPDDISPLVPSIETSHSG--------GEILHNDPTDQVPEFKFYTKRKFNQRKQDQIVDPSQNQSETPTIETG---RSGNSSSIPTSTIQNTL

Query:  PVV------------------SDLDMPIAIRK--------------------------------------------EDSNWKVAVMEEMNALKQNGTWEI
        P++                  SDLD+PIAIRK                                            +D NWK+AV+EEMNAL++NGTWE+
Subjt:  PVV------------------SDLDMPIAIRK--------------------------------------------EDSNWKVAVMEEMNALKQNGTWEI

Query:  VELPKDKKPVGCKWVFTIKCNAD-------------------------------------------VNSNWSLYQLDVKNAFLNGDLEEEVFMDLPPGFE
        V+LP+DKK VGCKWVF++K  AD                                           VNSNW L+QLDVKNAFLNGDLEEEVFM LPPGFE
Subjt:  VELPKDKKPVGCKWVFTIKCNAD-------------------------------------------VNSNWSLYQLDVKNAFLNGDLEEEVFMDLPPGFE

Query:  EDLGCSKICKLKKSLYGLKQSPRAWFERFGNVVSSYGFIQSQADHTIFYKHSENNKISILIVYVDDIILTGDDEAGLADLKKNLAREFQIKDLGPLKYFL
        E LG  K+C+L+KSLYGLKQSPRAWFERFG  V  +G+ QSQADHT+FYKHS+  KI+ILIVYVDDIILTGDD A L  LK+ LA EF+IKDLG LKYFL
Subjt:  EDLGCSKICKLKKSLYGLKQSPRAWFERFGNVVSSYGFIQSQADHTIFYKHSENNKISILIVYVDDIILTGDDEAGLADLKKNLAREFQIKDLGPLKYFL

Query:  GMEFARSKKGIFVNQRKYVLDLLEETGLLGCKTAETPVEPNLKLQVAKVEEIKDKERYQRLVGRLIYLSHTRPDIAYAVSMVSQFMHAPGPTHFEAVYRI
        GMEFA+SK+GIFVNQ KYVLDLL ETG+LGCK AETP+EPN KLQ A+ + + +KE+YQRLVGRLIYLSHTRPDIA++VSMVSQFMH+PGP HFEAVYRI
Subjt:  GMEFARSKKGIFVNQRKYVLDLLEETGLLGCKTAETPVEPNLKLQVAKVEEIKDKERYQRLVGRLIYLSHTRPDIAYAVSMVSQFMHAPGPTHFEAVYRI

Query:  LKYLKGTPGKGILFGRHNHFQIEIYTDADWVGSTTDRRSTSSYCSFVGGNLVTWRSKKQNVVARSSAEAKFRALAHGICEGIWIKRILEELKIHLNTPIR
        L+YLKGTPGKG+ F +  H QIE YTDADW GS TDRRSTS YCSFVGGNLVTWRSKKQNVVARSSAEA+FRA+AHGICE +WIKR+LEELK   + P++
Subjt:  LKYLKGTPGKGILFGRHNHFQIEIYTDADWVGSTTDRRSTSSYCSFVGGNLVTWRSKKQNVVARSSAEAKFRALAHGICEGIWIKRILEELKIHLNTPIR

Query:  IYCDNKAAISIAHNPVLHDRTKHIEVDKHFIKEKIETNTICIPYLPTSEQIADVLTKGLPKKQFDKLIDKLAMEDIFKPA
        +YCDNKAAISI HNPVLHDRTKH+EVDKHFIKEK+E+  IC+PY+PT+EQIAD+ TKGL KKQFD L+ KL MEDIFKPA
Subjt:  IYCDNKAAISIAHNPVLHDRTKHIEVDKHFIKEKIETNTICIPYLPTSEQIADVLTKGLPKKQFDKLIDKLAMEDIFKPA

A0A2Z6NTX3 Integrase catalytic domain-containing protein6.5e-22343.17Show/hide
Query:  VAQTGNYPQAFSC-GTSSPWIIDSGASDHMTSTSHFFNSYSPSYSNEKIRIADGSFSSVAGKGTIKLTEHISLQSVLHVPKLTCNLLFVSKLSRDSSCRV
        VAQTG    A S    S+PWIIDSGAS+HMT+ SH F+SY  S  +EK+RIADGS+SS+AGKG IK++EHI+LQSVLHVPK  CNLL V KLS+D++C V
Subjt:  VAQTGNYPQAFSC-GTSSPWIIDSGASDHMTSTSHFFNSYSPSYSNEKIRIADGSFSSVAGKGTIKLTEHISLQSVLHVPKLTCNLLFVSKLSRDSSCRV

Query:  IFFESHCILQDQKSEKMIGSARMLDGLYYFDEGSFSNKVTQGFSSMS---SLSD----------------------------------------------
        +F  S C+ QDQ S KMIG+AR ++GLYY DE    NK      S S   S+SD                                              
Subjt:  IFFESHCILQDQKSEKMIGSARMLDGLYYFDEGSFSNKVTQGFSSMS---SLSD----------------------------------------------

Query:  --------------------------------------------------------------------------------------------------KG
                                                                                                          KG
Subjt:  --------------------------------------------------------------------------------------------------KG

Query:  IVHQSTCRDTPQQNGVAERKNRHLLEVARAIMFSMHVPKYLWGEAILTTTYLINRMPKN-----------QPYFTQS-------------SLQGE---KP
        I+HQSTCRDTPQQNG+AERKNRHLLEV RAIM SM+VPKYLWG AILT  YLINRMP             Q  F  S             S QGE     
Subjt:  IVHQSTCRDTPQQNGVAERKNRHLLEVARAIMFSMHVPKYLWGEAILTTTYLINRMPKN-----------QPYFTQS-------------SLQGE---KP

Query:  NLEDNFWDVSTPLPNVICPDDI----SPLVPSIETSHSGGEILHNDPTD---------QVPEFKFYTKRKFNQRKQDQIVDPSQNQSETPTIETGRSGNS
        N EDNFW+   PLP +   DD+     P    +E  +   E+L N  ++         +  E K Y +++F++     I+ P+  QS++P+     + + 
Subjt:  NLEDNFWDVSTPLPNVICPDDI----SPLVPSIETSHSGGEILHNDPTD---------QVPEFKFYTKRKFNQRKQDQIVDPSQNQSETPTIETGRSGNS

Query:  SSIP--TSTIQNTLPVVS---------------DLDMPIAIRK-------------------------------------------EDSNWKVAVMEEMN
        +S P  +S   N LP +S               DLD+PIA RK                                            D NWK+AV EEM+
Subjt:  SSIP--TSTIQNTLPVVS---------------DLDMPIAIRK-------------------------------------------EDSNWKVAVMEEMN

Query:  ALKQNGTWEIVE-LPKDKKPVGCKWVFTIKCNAD-------------------------------------------VNSNWSLYQLDVKNAFLNGDLEE
        AL++N TW I + LPK KK VGCKWVFT+KC AD                                           VN NW+L+Q DVKNAFLNG+L E
Subjt:  ALKQNGTWEIVE-LPKDKKPVGCKWVFTIKCNAD-------------------------------------------VNSNWSLYQLDVKNAFLNGDLEE

Query:  EVFMDLPPGFEEDLGCSKICKLKKSLYGLKQSPRAWFERFGNVVSSYGFIQSQADHTIFYKHSENNKISILIVYVDDIILTGDDEAGLADLKKNLAREFQ
        EV+M LPPGFEE+ G  +IC+                         +GF QSQADHT+F+KHS   KI+ILIVYVDDII+TGDD   + DLK+ L  EF 
Subjt:  EVFMDLPPGFEEDLGCSKICKLKKSLYGLKQSPRAWFERFGNVVSSYGFIQSQADHTIFYKHSENNKISILIVYVDDIILTGDDEAGLADLKKNLAREFQ

Query:  IKDLGPLKYFLGMEFARSKKGIFVNQRKYVLDLLEETGLLGCKTAETPVEPNLKLQVAKVEEIKDKERYQRLVGRLIYLSHTRPDIAYAVSMVSQFMHAP
        IKDLG LKYFLGMEF RSK+G F+NQRKY+LDLL+ETG++GC+  ETP++PN+KL+     E+ D+ERYQRL                            
Subjt:  IKDLGPLKYFLGMEFARSKKGIFVNQRKYVLDLLEETGLLGCKTAETPVEPNLKLQVAKVEEIKDKERYQRLVGRLIYLSHTRPDIAYAVSMVSQFMHAP

Query:  GPTHFEAVYRILKYLKGTPGKGILFGRHNHFQIEIYTDADWVGSTTDRRSTSSYCSFVGGNLVTWRSKKQNVVARSSAEAKFRALAHGICEGIWIKRILE
        GP HFEA++RIL+YLKGTPGKG+LF    H Q+E YTDADW G+  DRRSTS YC+FVGGN VTWRSKKQ+VVARSSAEA+FR++AHG CE +WIK+ +E
Subjt:  GPTHFEAVYRILKYLKGTPGKGILFGRHNHFQIEIYTDADWVGSTTDRRSTSSYCSFVGGNLVTWRSKKQNVVARSSAEAKFRALAHGICEGIWIKRILE

Query:  ELKIHLNTPIRIYCDNKAAISIAHNPVLHDRTKHIEVDKHFIKEKIETNTICIPYLPTSEQIADVLTKGLPKKQFDKLIDKLAMEDIFKPA
        ELKI   TPI++YCDNKAAISIAHNPV HDRTKH+EVDKHFIKEKI++  IC+ Y+PT+ Q+ADVLTK LPK+QFD ++ KLAM DIF PA
Subjt:  ELKIHLNTPIRIYCDNKAAISIAHNPVLHDRTKHIEVDKHFIKEKIETNTICIPYLPTSEQIADVLTKGLPKKQFDKLIDKLAMEDIFKPA

A5B683 Uncharacterized protein2.7e-23753.58Show/hide
Query:  AVAQTGNYPQAFSCG-TSSPWIIDSGASDHMTSTSHFFNSYSPSYSNEKIRIADGSFSSVAGKGTIKLTEHISLQSVLHVPKLTCNLLFVSKLSRDSSCR
        ++A TGN   A SC   S+PWIIDS ASDHMT++S+ F SYSP   N+K+RIA G+FS +A KG IK++E I L+SVLHVPKLTCNLL VSKLSRDS+C 
Subjt:  AVAQTGNYPQAFSCG-TSSPWIIDSGASDHMTSTSHFFNSYSPSYSNEKIRIADGSFSSVAGKGTIKLTEHISLQSVLHVPKLTCNLLFVSKLSRDSSCR

Query:  VIFFESHCILQDQKSEKMIGSARMLDGLYYFDEGSFSNKVTQGFSSMSSLSDKGIVHQSTCRDTPQQNGVAE------RKNRHLLEVARAIMFSMHVPKY
        VIF+ESHCI QDQ S K IGSA+M++GLYYF++   SNK+ QG SS+SSLS +  + ++   ++  + G++E      +KNR+ LE    +++S    K 
Subjt:  VIFFESHCILQDQKSEKMIGSARMLDGLYYFDEGSFSNKVTQGFSSMSSLSDKGIVHQSTCRDTPQQNGVAE------RKNRHLLEVARAIMFSMHVPKY

Query:  LWGEAILTTTYLINRMPKNQPYFTQSSLQGEKPNLEDNFWDVSTPLPNVICPDDISPLVPSIETSHSGGEILHNDPTDQVPEFKFYTKRKFNQRKQDQIV
        + G +   +  L +  PK        ++ G  P++  +    S+ + ++  P    P +  I     G  ++   P   +        RK          
Subjt:  LWGEAILTTTYLINRMPKNQPYFTQSSLQGEKPNLEDNFWDVSTPLPNVICPDDISPLVPSIETSHSGGEILHNDPTDQVPEFKFYTKRKFNQRKQDQIV

Query:  DPSQNQSETPTIETGRSGNSSSIPTSTIQNTLPVVSDLDMPIAIRK--EDSNWKVAVMEEMNALKQNGTWEIVELPKDKKPVGCKWVFTIKCNAD-----
          ++  ++ P ++     N S    +   N    +S L +P  I++  ++ +WK+AV EEMNALK+NGTWE+V+LP++KK VGCKWVFTIK  AD     
Subjt:  DPSQNQSETPTIETGRSGNSSSIPTSTIQNTLPVVSDLDMPIAIRK--EDSNWKVAVMEEMNALKQNGTWEIVELPKDKKPVGCKWVFTIKCNAD-----

Query:  --------------------------------------VNSNWSLYQLDVKNAFLNGDLEEEVFMDLPPGFEEDLGCSKICKLKKSLYGLKQSPRAWFER
                                              VNSNW L+QLDVKN FLNGDLEEEVFM  PPGFEE  G  K+CKLKKSLYGLK+SPRAWFER
Subjt:  --------------------------------------VNSNWSLYQLDVKNAFLNGDLEEEVFMDLPPGFEEDLGCSKICKLKKSLYGLKQSPRAWFER

Query:  FGNVVSSYGFIQSQADHTIFYKHSENNKISILIVYVDDIILTGDDEAGLADLKKNLAREFQIKDLGPLKYFLGMEFARSKKGIFVNQRKYVLDLLEETGL
        FG V+  YG+ QSQADHT+FYKHS   K++ILIVYVDDI+LTGDD   L  LK  LA EF+IKDLG LKYFL MEFARSK+GIFVNQ KYVLDLL+ETG+
Subjt:  FGNVVSSYGFIQSQADHTIFYKHSENNKISILIVYVDDIILTGDDEAGLADLKKNLAREFQIKDLGPLKYFLGMEFARSKKGIFVNQRKYVLDLLEETGL

Query:  LGCKTAETPVEPNLKLQVAKVEEIKDKERYQRLVGRLIYLSHTRPDIAYAVSMVSQFMHAPGPTHFEAVYRILKYLKGTPGKGILFGRHNHFQIEIYTDA
        LGCK AETP+E N+KL   K + +KD++RYQRLVGRLIYLSHT PDIA++VSMVSQFMHAPGP HFEA+YRIL+YLKGTPG+G+LF    H QIE YTDA
Subjt:  LGCKTAETPVEPNLKLQVAKVEEIKDKERYQRLVGRLIYLSHTRPDIAYAVSMVSQFMHAPGPTHFEAVYRILKYLKGTPGKGILFGRHNHFQIEIYTDA

Query:  DWVGSTTDRRSTSSYCSFVGGNLVTWRSKKQNVVARSSAEAKFRALAHGICEGIWIKRILEELKIHLNTPIRIYCDNKAAISIAHNPVLHDRTKHIEVDK
        DWVGS  DRRST  YCSFVGGNLV WRSKKQNVVARSSA+A+FRA+AHGICE +WI+R+LEELK+              +   +   VLHDRTKH+EVDK
Subjt:  DWVGSTTDRRSTSSYCSFVGGNLVTWRSKKQNVVARSSAEAKFRALAHGICEGIWIKRILEELKIHLNTPIRIYCDNKAAISIAHNPVLHDRTKHIEVDK

Query:  HFIKEKIETNTICIPYLPTSEQIADVLTKGLPKKQFDKLIDKLAMEDIFKP
        HFIKEKI+   +C+ Y+PT EQ+ DV TKGL K+QFD L+ KLAMEDIFKP
Subjt:  HFIKEKIETNTICIPYLPTSEQIADVLTKGLPKKQFDKLIDKLAMEDIFKP

A5B9Y8 Integrase catalytic domain-containing protein1.1e-25945.2Show/hide
Query:  AVAQTGNYPQAFSCG-TSSPWIIDSGASDHMTSTSHFFNSYSPSYSNEKIRIADGSFSSVAGKGTIKLTEHISLQSVLHVPKLTCNLLFVSKLSRDSSCR
        ++A TGN   A SC   S+PWIIDSGASDHMT++S+ F SYSP   N+K++IADG+FS +AGKG IK++E I L+ VLHVPKLTCNLLFVSKLSRD +C 
Subjt:  AVAQTGNYPQAFSCG-TSSPWIIDSGASDHMTSTSHFFNSYSPSYSNEKIRIADGSFSSVAGKGTIKLTEHISLQSVLHVPKLTCNLLFVSKLSRDSSCR

Query:  VIFFESHCILQDQKSEKMIGSARMLDGLYYFDEGSFSNKVTQGFSSMSSL--------------------------------------------------
        VIF+ESHCI QD+ S K IGSARM++GLYYF++   SNK+ QG SS+SSL                                                  
Subjt:  VIFFESHCILQDQKSEKMIGSARMLDGLYYFDEGSFSNKVTQGFSSMSSL--------------------------------------------------

Query:  ------------------------------------------------------------------------------------------------SDKG
                                                                                                        + KG
Subjt:  ------------------------------------------------------------------------------------------------SDKG

Query:  IVHQSTCRDTPQQNGVAERKNRHLLEVARAIMFSMHVPKYLWGEAILTTTYLINRMP-------------------------------------------
        I+HQS+C DTPQQNG+A+RKN+HLLEVARA+MF M++PKYLWG+AILT +YLINRMP                                           
Subjt:  IVHQSTCRDTPQQNGVAERKNRHLLEVARAIMFSMHVPKYLWGEAILTTTYLINRMP-------------------------------------------

Query:  ------------------------------------------KNQPYFTQSSLQGEKPNLEDNFWDVSTPLPNVICP-------DDISPLVPSIETSHSG
                                                  +N PYFT++ LQGEK  +E NFW++  P P+VI          +  P+    E   S 
Subjt:  ------------------------------------------KNQPYFTQSSLQGEKPNLEDNFWDVSTPLPNVICP-------DDISPLVPSIETSHSG

Query:  GEILHNDPTDQVPEFKFYTKRKFNQRKQDQIVDPSQNQSET------------PTIETGRSGNSSSIPTSTIQNTL---PVVS-----------------
         EIL         E   Y+++K + R +DQ + P+  Q +             P+I T    +SSS+   ++ +     P +S                 
Subjt:  GEILHNDPTDQVPEFKFYTKRKFNQRKQDQIVDPSQNQSET------------PTIETGRSGNSSSIPTSTIQNTL---PVVS-----------------

Query:  DLDMPIAIRK-------------------------------------------EDSNWKVAVMEEMNALKQNGTWEIVELPKDKKPVGCKWVFTIKCNAD
        DLD+PIA+RK                                           ++ +WK+AV +EMNALK+NGTWE V+LP++KK VGCKWVFTIK  AD
Subjt:  DLDMPIAIRK-------------------------------------------EDSNWKVAVMEEMNALKQNGTWEIVELPKDKKPVGCKWVFTIKCNAD

Query:  -------------------------------------------VNSNWSLYQLDVKNAFLNGDLEEEVFMDLPPGFEEDLGCSKICKLKKSLYGLKQSPR
                                                   VNSNW L+QLDVKNAFLNGDLEEEVFM  PP FEE  G  K+CKLKKSLY LKQSPR
Subjt:  -------------------------------------------VNSNWSLYQLDVKNAFLNGDLEEEVFMDLPPGFEEDLGCSKICKLKKSLYGLKQSPR

Query:  AWFERFGNVVSSYGFIQSQADHTIFYKHSENNKISILIVYVDDIILTGDDEAGLADLKKNLAREFQIKDLGPLKYFLGMEFARSKKGIFVNQRKYVLDLL
        AWFERFG V+  YG+ QSQA+HT+FYKHS   K+ ILIVYVDDI+LTGDD   L  +K  LA EF+IKDLG LKYFLGMEFARSK+GIFVNQRKYV +LL
Subjt:  AWFERFGNVVSSYGFIQSQADHTIFYKHSENNKISILIVYVDDIILTGDDEAGLADLKKNLAREFQIKDLGPLKYFLGMEFARSKKGIFVNQRKYVLDLL

Query:  EETGLLGCKTAETPVEPNLKLQVAKVEEIKDKERYQRLVGRLIYLSHTRPDIAYAVSMVSQFMHAPGPTHFEAVYRILKYLKGTPGKGILFGRHNHFQIE
        +E  +LGCK AE P+EPN+KLQ  K + +KD++RYQRLVGRLIYLSHT PDIA++VSMVSQFMH P P HFE VYRIL+YLKGTPG+G+LF    H QIE
Subjt:  EETGLLGCKTAETPVEPNLKLQVAKVEEIKDKERYQRLVGRLIYLSHTRPDIAYAVSMVSQFMHAPGPTHFEAVYRILKYLKGTPGKGILFGRHNHFQIE

Query:  IYTDADWVGSTTDRRSTSSYCSFVGGNLVTWRSKKQNVVARSSAEAKFRALAHGICEGIWIKRILEELKIHLNTPIRIYCDNKAAISIAHNPVLHDRTKH
         YT+ADW GS  DRRSTS YCSFV GNLVTWRSKKQNVVARSSAEA+FR +AHG CE +WI+R+LEELK+  ++P+++YCDNKAAIS+AHNPVLHDRTKH
Subjt:  IYTDADWVGSTTDRRSTSSYCSFVGGNLVTWRSKKQNVVARSSAEAKFRALAHGICEGIWIKRILEELKIHLNTPIRIYCDNKAAISIAHNPVLHDRTKH

Query:  IEVDKHFIKEKIETNTICIPYLPTSEQIADVLTKGLPKKQFDKLIDKLAMEDIFKPA
        +EVDK FIKEKI+   +C+ Y+PT EQ+A+V TK L K+QFD L+ KLAMEDIFK A
Subjt:  IEVDKHFIKEKIETNTICIPYLPTSEQIADVLTKGLPKKQFDKLIDKLAMEDIFKPA

SwissProt top hitse value%identityAlignment
B3DJT0 Squamous cell carcinoma antigen recognized by T-cells 32.3e-5534.06Show/hide
Query:  LPTISDEGNTVNGDELMPDLPQNPTSPASDSSSSDSGSEDDESDQKLQLQTLQSELSTNPSYYDAHVQYIKLLRKIGDIDNLRKAREAMSEIFPLTPPMW
        LP I +E   +  +    D  +            +  + +DE + + ++Q L+ +LS N   Y+ HV  IKLLR+ G +  LRKAR+ MSE+FPLT  +W
Subjt:  LPTISDEGNTVNGDELMPDLPQNPTSPASDSSSSDSGSEDDESDQKLQLQTLQSELSTNPSYYDAHVQYIKLLRKIGDIDNLRKAREAMSEIFPLTPPMW

Query:  QEWAKDEASISTGPEALAAIERLYERGVFDYLSVSLWCDYLNFVQEYDPLVQDRATSGVKKARDLFERALTAAGLHFTEAEKLWEAYREFEKTIYQSIE-
         +W KDE  I+        +  L+ER + DY+   +W +Y+ +      +    A  G+++ R +FERALTA GLH T+   +WEAYREFE  I  +++ 
Subjt:  QEWAKDEASISTGPEALAAIERLYERGVFDYLSVSLWCDYLNFVQEYDPLVQDRATSGVKKARDLFERALTAAGLHFTEAEKLWEAYREFEKTIYQSIE-

Query:  -------ETDTQAKEKQVQLIRSIFHRQLSLPLSNMNSTLEAYKAWEMEVKQGCILDVKSNDSDGVPSQVASAYQRALDMYNARVQLEEQISKQDLTDTE
               +   +    Q++ I ++F RQL++PL +M  T   Y  W                 DGVP  V   Y+RAL         EE +    +++  
Subjt:  -------ETDTQAKEKQVQLIRSIFHRQLSLPLSNMNSTLEAYKAWEMEVKQGCILDVKSNDSDGVPSQVASAYQRALDMYNARVQLEEQISKQDLTDTE

Query:  RLHQYIIYLKFEQSAGDPARVQILFERAVADFPVSVDLWLDYTHYMDKTLKVGNIVRNVYSRATRNC
        +L +Y  Y+ FE   GDPARVQI+FERA+A+  +  DLW+ YT Y+D+ LK+ ++V + + RA RNC
Subjt:  RLHQYIIYLKFEQSAGDPARVQILFERAVADFPVSVDLWLDYTHYMDKTLKVGNIVRNVYSRATRNC

P04146 Copia protein8.0e-8533.58Show/hide
Query:  ETGRSGNSSSIPTSTIQNTLPVVSDLDMPIAIRKEDSNWKVAVMEEMNALKQNGTWEIVELPKDKKPVGCKWVFTIKCN---------------------
        E   S N   +   TI N +P   D    I  R + S+W+ A+  E+NA K N TW I + P++K  V  +WVF++K N                     
Subjt:  ETGRSGNSSSIPTSTIQNTLPVVSDLDMPIAIRKEDSNWKVAVMEEMNALKQNGTWEIVELPKDKKPVGCKWVFTIKCN---------------------

Query:  -----------ADVNS-----------NWSLYQLDVKNAFLNGDLEEEVFMDLPPGFEEDLGCSKICKLKKSLYGLKQSPRAWFERFGNVVSSYGFIQSQ
                   A ++S           N  ++Q+DVK AFLNG L+EE++M LP G         +CKL K++YGLKQ+ R WFE F   +    F+ S 
Subjt:  -----------ADVNS-----------NWSLYQLDVKNAFLNGDLEEEVFMDLPPGFEEDLGCSKICKLKKSLYGLKQSPRAWFERFGNVVSSYGFIQSQ

Query:  ADHTIFYKHSEN-NKISILIVYVDDIILTGDDEAGLADLKKNLAREFQIKDLGPLKYFLGMEFARSKKGIFVNQRKYVLDLLEETGLLGCKTAETPVEPN
         D  I+     N N+   +++YVDD+++   D   + + K+ L  +F++ DL  +K+F+G+     +  I+++Q  YV  +L +  +  C    TP+   
Subjt:  ADHTIFYKHSEN-NKISILIVYVDDIILTGDDEAGLADLKKNLAREFQIKDLGPLKYFLGMEFARSKKGIFVNQRKYVLDLLEETGLLGCKTAETPVEPN

Query:  LKLQVAKVEEIKDKERYQRLVGRLIYLS-HTRPDIAYAVSMVSQFMHAPGPTHFEAVYRILKYLKGTPGKGILFGRHNHFQIEI--YTDADWVGSTTDRR
        +  ++   +E       + L+G L+Y+   TRPD+  AV+++S++        ++ + R+L+YLKGT    ++F ++  F+ +I  Y D+DW GS  DR+
Subjt:  LKLQVAKVEEIKDKERYQRLVGRLIYLS-HTRPDIAYAVSMVSQFMHAPGPTHFEAVYRILKYLKGTPGKGILFGRHNHFQIEI--YTDADWVGSTTDRR

Query:  STSSYC-SFVGGNLVTWRSKKQNVVARSSAEAKFRALAHGICEGIWIKRILEELKIHLNTPIRIYCDNKAAISIAHNPVLHDRTKHIEVDKHFIKEKIET
        ST+ Y       NL+ W +K+QN VA SS EA++ AL   + E +W+K +L  + I L  PI+IY DN+  ISIA+NP  H R KHI++  HF +E+++ 
Subjt:  STSSYC-SFVGGNLVTWRSKKQNVVARSSAEAKFRALAHGICEGIWIKRILEELKIHLNTPIRIYCDNKAAISIAHNPVLHDRTKHIEVDKHFIKEKIET

Query:  NTICIPYLPTSEQIADVLTKGLPKKQFDKLIDKLAM
        N IC+ Y+PT  Q+AD+ TK LP  +F +L DKL +
Subjt:  NTICIPYLPTSEQIADVLTKGLPKKQFDKLIDKLAM

P04146 Copia protein1.4e-0750.85Show/hide
Query:  KGIVHQSTCRDTPQQNGVAERKNRHLLEVARAIMFSMHVPKYLWGEAILTTTYLINRMP
        KGI +  T   TPQ NGV+ER  R + E AR ++    + K  WGEA+LT TYLINR+P
Subjt:  KGIVHQSTCRDTPQQNGVAERKNRHLLEVARAIMFSMHVPKYLWGEAILTTTYLINRMP

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.3e-9831.59Show/hide
Query:  SDKGIVHQSTCRDTPQQNGVAERKNRHLLEVARAIMFSMHVPKYLWGEAILTTTYLINRMP--------------KNQPYFTQSSLQG----------EK
        S  GI H+ T   TPQ NGVAER NR ++E  R+++    +PK  WGEA+ T  YLINR P                +  ++   + G          ++
Subjt:  SDKGIVHQSTCRDTPQQNGVAERKNRHLLEVARAIMFSMHVPKYLWGEAILTTTYLINRMP--------------KNQPYFTQSSLQG----------EK

Query:  PNLED-----------------NFW------------------------DVSTPLPNVICPDDISPLVPSIETSHSGGEILHNDPTDQVPEFKFYTKRKF
          L+D                   W                        D+S  + N I P+ ++  +PS   + +  E      TD+V E         
Subjt:  PNLED-----------------NFW------------------------DVSTPLPNVICPDDISPLVPSIETSHSGGEILHNDPTDQVPEFKFYTKRKF

Query:  NQRKQ-----DQIVDPSQNQ--------SETPTIETGRSGNSSSIPTSTIQNTLPVVSDLDMPIAIRKEDSNWKVAVMEEMNALKQNGTWEIVELPKDKK
         Q +Q     +++  P+Q +        SE P +E+ R  ++  +  S  +    +   L  P     E +    A+ EEM +L++NGT+++VELPK K+
Subjt:  NQRKQ-----DQIVDPSQNQ--------SETPTIETGRSGNSSSIPTSTIQNTLPVVSDLDMPIAIRKEDSNWKVAVMEEMNALKQNGTWEIVELPKDKK

Query:  PVGCKWVFTIKCNAD-------------------------------------------VNSNWSLYQLDVKNAFLNGDLEEEVFMDLPPGFEEDLGCSKI
        P+ CKWVF +K + D                                            + +  + QLDVK AFL+GDLEEE++M+ P GFE       +
Subjt:  PVGCKWVFTIKCNAD-------------------------------------------VNSNWSLYQLDVKNAFLNGDLEEEVFMDLPPGFEEDLGCSKI

Query:  CKLKKSLYGLKQSPRAWFERFGNVVSSYGFIQSQADHTIFYKHSENNKISILIVYVDDIILTGDDEAGLADLKKNLAREFQIKDLGPLKYFLGMEFARSK
        CKL KSLYGLKQ+PR W+ +F + + S  ++++ +D  +++K    N   IL++YVDD+++ G D+  +A LK +L++ F +KDLGP +  LGM+  R +
Subjt:  CKLKKSLYGLKQSPRAWFERFGNVVSSYGFIQSQADHTIFYKHSENNKISILIVYVDDIILTGDDEAGLADLKKNLAREFQIKDLGPLKYFLGMEFARSK

Query:  --KGIFVNQRKYVLDLLEETGLLGCKTAETPVEPNLKLQ----VAKVEEIKD--KERYQRLVGRLIY-LSHTRPDIAYAVSMVSQFMHAPGPTHFEAVYR
          + ++++Q KY+  +LE   +   K   TP+  +LKL        VEE  +  K  Y   VG L+Y +  TRPDIA+AV +VS+F+  PG  H+EAV  
Subjt:  --KGIFVNQRKYVLDLLEETGLLGCKTAETPVEPNLKLQ----VAKVEEIKD--KERYQRLVGRLIY-LSHTRPDIAYAVSMVSQFMHAPGPTHFEAVYR

Query:  ILKYLKGTPGKGILFGRHNHFQIEIYTDADWVGSTTDRRSTSSYCSFVGGNLVTWRSKKQNVVARSSAEAKFRALAHGICEGIWIKRILEELKIHLNTPI
        IL+YL+GT G  + FG  +   ++ YTDAD  G   +R+S++ Y     G  ++W+SK Q  VA S+ EA++ A      E IW+KR L+EL +H    +
Subjt:  ILKYLKGTPGKGILFGRHNHFQIEIYTDADWVGSTTDRRSTSSYCSFVGGNLVTWRSKKQNVVARSSAEAKFRALAHGICEGIWIKRILEELKIHLNTPI

Query:  RIYCDNKAAISIAHNPVLHDRTKHIEVDKHFIKEKIETNTICIPYLPTSEQIADVLTKGLPKKQFD
         +YCD+++AI ++ N + H RTKHI+V  H+I+E ++  ++ +  + T+E  AD+LTK +P+ +F+
Subjt:  RIYCDNKAAISIAHNPVLHDRTKHIEVDKHFIKEKIETNTICIPYLPTSEQIADVLTKGLPKKQFD

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE15.7e-9932.94Show/hide
Query:  PKNQPYFTQSSLQGEKPNLEDNFWD--VSTPLPNVICPDDISPLVPSIET---SHSGGEILHNDPTDQVPEFKFYTKRKFNQRKQDQIVDPSQNQSETPT
        P + P     + Q    NL+ +F     S+P P     +   P     +T   +HS      N+PT++ P            +    +  P+Q+ S +P+
Subjt:  PKNQPYFTQSSLQGEKPNLEDNFWD--VSTPLPNVICPDDISPLVPSIET---SHSGGEILHNDPTDQVPEFKFYTKRKFNQRKQDQIVDPSQNQSETPT

Query:  IETGRSGNSSS-IPTSTIQNTLPVVS-------------------------------DLDMPIAIRKE---------DSNWKVAVMEEMNALKQNGTWEI
          T  S +S+S  P S + +  P ++                                L + +A   E         D  W+ A+  E+NA   N TW++
Subjt:  IETGRSGNSSS-IPTSTIQNTLPVVS-------------------------------DLDMPIAIRKE---------DSNWKVAVMEEMNALKQNGTWEI

Query:  VELPKDKKP-VGCKWVFTIKCNAD-------------------------------------------VNSNWSLYQLDVKNAFLNGDLEEEVFMDLPPGF
        V  P      VGC+W+FT K N+D                                           V+ +W + QLDV NAFL G L ++V+M  PPGF
Subjt:  VELPKDKKP-VGCKWVFTIKCNAD-------------------------------------------VNSNWSLYQLDVKNAFLNGDLEEEVFMDLPPGF

Query:  EEDLGCSKICKLKKSLYGLKQSPRAWFERFGNVVSSYGFIQSQADHTIFYKHSENNKISILIVYVDDIILTGDDEAGLADLKKNLAREFQIKDLGPLKYF
         +    + +CKL+K+LYGLKQ+PRAW+    N + + GF+ S +D ++F        I  ++VYVDDI++TG+D   L +   NL++ F +KD   L YF
Subjt:  EEDLGCSKICKLKKSLYGLKQSPRAWFERFGNVVSSYGFIQSQADHTIFYKHSENNKISILIVYVDDIILTGDDEAGLADLKKNLAREFQIKDLGPLKYF

Query:  LGMEFARSKKGIFVNQRKYVLDLLEETGLLGCKTAETPVEPNLKLQVAKVEEIKDKERYQRLVGRLIYLSHTRPDIAYAVSMVSQFMHAPGPTHFEAVYR
        LG+E  R   G+ ++QR+Y+LDLL  T ++  K   TP+ P+ KL +    ++ D   Y+ +VG L YL+ TRPDI+YAV+ +SQFMH P   H +A+ R
Subjt:  LGMEFARSKKGIFVNQRKYVLDLLEETGLLGCKTAETPVEPNLKLQVAKVEEIKDKERYQRLVGRLIYLSHTRPDIAYAVSMVSQFMHAPGPTHFEAVYR

Query:  ILKYLKGTPGKGILFGRHNHFQIEIYTDADWVGSTTDRRSTSSYCSFVGGNLVTWRSKKQNVVARSSAEAKFRALAHGICEGIWIKRILEELKIHLNTPI
        IL+YL GTP  GI   + N   +  Y+DADW G   D  ST+ Y  ++G + ++W SKKQ  V RSS EA++R++A+   E  WI  +L EL I L  P 
Subjt:  ILKYLKGTPGKGILFGRHNHFQIEIYTDADWVGSTTDRRSTSSYCSFVGGNLVTWRSKKQNVVARSSAEAKFRALAHGICEGIWIKRILEELKIHLNTPI

Query:  RIYCDNKAAISIAHNPVLHDRTKHIEVDKHFIKEKIETNTICIPYLPTSEQIADVLTKGLPKKQFDKLIDKLAM
         IYCDN  A  +  NPV H R KHI +D HFI+ ++++  + + ++ T +Q+AD LTK L +  F     K+ +
Subjt:  RIYCDNKAAISIAHNPVLHDRTKHIEVDKHFIKEKIETNTICIPYLPTSEQIADVLTKGLPKKQFDKLIDKLAM

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE13.3e-0630.39Show/hide
Query:  QDQKSEKMIGSARMLDGLYYFDEGSF-SNKVTQGFSSMSSLSDKGIVHQSTCRDTPQQNGVAERKNRHLLEVARAIMFSMHVPKYLWGEAILTTTYLINR
        + Q  E  I    +L+  +    G+F S+   +  +     S  GI H ++   TP+ NG++ERK+RH++E    ++    +PK  W  A     YLINR
Subjt:  QDQKSEKMIGSARMLDGLYYFDEGSF-SNKVTQGFSSMSSLSDKGIVHQSTCRDTPQQNGVAERKNRHLLEVARAIMFSMHVPKYLWGEAILTTTYLINR

Query:  MP
        +P
Subjt:  MP

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE26.5e-9533.28Show/hide
Query:  PKNQPYFTQSSLQGEKPNLEDNFWDVSTPLPNVICPDDISPLVPSIETSHSGGEILHNDPTDQVPEFKFYTKRKFNQRKQDQIVDPSQNQSETPTIETGR
        P + P+    S    +PN   +    + PLP V+      P  P I+ +       H+  T                R +D I  P+Q  S   ++    
Subjt:  PKNQPYFTQSSLQGEKPNLEDNFWDVSTPLPNVICPDDISPLVPSIETSHSGGEILHNDPTDQVPEFKFYTKRKFNQRKQDQIVDPSQNQSETPTIETGR

Query:  SGNSSSIPTSTIQNTLPVVSDLDMPIAIRKEDSNWKVAVMEEMNALKQNGTWEIVELPKDKKP-VGCKWVFTIKCNAD----------------------
           ++S P + IQ                 +D  W+ A+  E+NA   N TW++V  P      VGC+W+FT K N+D                      
Subjt:  SGNSSSIPTSTIQNTLPVVSDLDMPIAIRKEDSNWKVAVMEEMNALKQNGTWEIVELPKDKKP-VGCKWVFTIKCNAD----------------------

Query:  ---------------------VNSNWSLYQLDVKNAFLNGDLEEEVFMDLPPGFEEDLGCSKICKLKKSLYGLKQSPRAWFERFGNVVSSYGFIQSQADH
                             V+ +W + QLDV NAFL G L +EV+M  PPGF +      +C+L+K++YGLKQ+PRAW+      + + GF+ S +D 
Subjt:  ---------------------VNSNWSLYQLDVKNAFLNGDLEEEVFMDLPPGFEEDLGCSKICKLKKSLYGLKQSPRAWFERFGNVVSSYGFIQSQADH

Query:  TIFYKHSENNKISILIVYVDDIILTGDDEAGLADLKKNLAREFQIKDLGPLKYFLGMEFARSKKGIFVNQRKYVLDLLEETGLLGCKTAETPVEPNLKLQ
        ++F        I  ++VYVDDI++TG+D   L      L++ F +K+   L YFLG+E  R  +G+ ++QR+Y LDLL  T +L  K   TP+  + KL 
Subjt:  TIFYKHSENNKISILIVYVDDIILTGDDEAGLADLKKNLAREFQIKDLGPLKYFLGMEFARSKKGIFVNQRKYVLDLLEETGLLGCKTAETPVEPNLKLQ

Query:  VAKVEEIKDKERYQRLVGRLIYLSHTRPDIAYAVSMVSQFMHAPGPTHFEAVYRILKYLKGTPGKGILFGRHNHFQIEIYTDADWVGSTTDRRSTSSYCS
        +    ++ D   Y+ +VG L YL+ TRPD++YAV+ +SQ+MH P   H+ A+ R+L+YL GTP  GI   + N   +  Y+DADW G T D  ST+ Y  
Subjt:  VAKVEEIKDKERYQRLVGRLIYLSHTRPDIAYAVSMVSQFMHAPGPTHFEAVYRILKYLKGTPGKGILFGRHNHFQIEIYTDADWVGSTTDRRSTSSYCS

Query:  FVGGNLVTWRSKKQNVVARSSAEAKFRALAHGICEGIWIKRILEELKIHLNTPIRIYCDNKAAISIAHNPVLHDRTKHIEVDKHFIKEKIETNTICIPYL
        ++G + ++W SKKQ  V RSS EA++R++A+   E  WI  +L EL I L+ P  IYCDN  A  +  NPV H R KHI +D HFI+ ++++  + + ++
Subjt:  FVGGNLVTWRSKKQNVVARSSAEAKFRALAHGICEGIWIKRILEELKIHLNTPIRIYCDNKAAISIAHNPVLHDRTKHIEVDKHFIKEKIETNTICIPYL

Query:  PTSEQIADVLTKGLPKKQFDKLIDKLAM
         T +Q+AD LTK L +  F     K+ +
Subjt:  PTSEQIADVLTKGLPKKQFDKLIDKLAM

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE26.7e-0738.55Show/hide
Query:  LSDKGIVHQSTCRDTPQQNGVAERKNRHLLEVARAIMFSMHVPKYLWGEAILTTTYLINRMPKN--QPYFTQSSLQGEKPNLE
        LS  GI H ++   TP+ NG++ERK+RH++E+   ++    VPK  W  A     YLINR+P    Q       L G+ PN E
Subjt:  LSDKGIVHQSTCRDTPQQNGVAERKNRHLLEVARAIMFSMHVPKYLWGEAILTTTYLINRMPKN--QPYFTQSSLQGEKPNLE

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.6e-10942.98Show/hide
Query:  KEDSNWKVAVMEEMNALKQNGTWEIVELPKDKKPVGCKWVFTIKCNAD-------------------------------------------VNSNWSLYQ
        KE   W  A+ +E+ A++   TWEI  LP +KKP+GCKWV+ IK N+D                                              N++L+Q
Subjt:  KEDSNWKVAVMEEMNALKQNGTWEIVELPKDKKPVGCKWVFTIKCNAD-------------------------------------------VNSNWSLYQ

Query:  LDVKNAFLNGDLEEEVFMDLPPGFE----EDLGCSKICKLKKSLYGLKQSPRAWFERFGNVVSSYGFIQSQADHTIFYKHSENNKISILIVYVDDIILTG
        LD+ NAFLNGDL+EE++M LPPG+     + L  + +C LKKS+YGLKQ+ R WF +F   +  +GF+QS +DHT F K +    + +L VYVDDII+  
Subjt:  LDVKNAFLNGDLEEEVFMDLPPGFE----EDLGCSKICKLKKSLYGLKQSPRAWFERFGNVVSSYGFIQSQADHTIFYKHSENNKISILIVYVDDIILTG

Query:  DDEAGLADLKKNLAREFQIKDLGPLKYFLGMEFARSKKGIFVNQRKYVLDLLEETGLLGCKTAETPVEPNLKLQVAKVEEIKDKERYQRLVGRLIYLSHT
        +++A + +LK  L   F+++DLGPLKYFLG+E ARS  GI + QRKY LDLL+ETGLLGCK +  P++P++        +  D + Y+RL+GRL+YL  T
Subjt:  DDEAGLADLKKNLAREFQIKDLGPLKYFLGMEFARSKKGIFVNQRKYVLDLLEETGLLGCKTAETPVEPNLKLQVAKVEEIKDKERYQRLVGRLIYLSHT

Query:  RPDIAYAVSMVSQFMHAPGPTHFEAVYRILKYLKGTPGKGILFGRHNHFQIEIYTDADWVGSTTDRRSTSSYCSFVGGNLVTWRSKKQNVVARSSAEAKF
        R DI++AV+ +SQF  AP   H +AV +IL Y+KGT G+G+ +      Q+++++DA +      RRST+ YC F+G +L++W+SKKQ VV++SSAEA++
Subjt:  RPDIAYAVSMVSQFMHAPGPTHFEAVYRILKYLKGTPGKGILFGRHNHFQIEIYTDADWVGSTTDRRSTSSYCSFVGGNLVTWRSKKQNVVARSSAEAKF

Query:  RALAHGICEGIWIKRILEELKIHLNTPIRIYCDNKAAISIAHNPVLHDRTKHIEVDKHFIKEK
        RAL+    E +W+ +   EL++ L+ P  ++CDN AAI IA N V H+RTKHIE D H ++E+
Subjt:  RALAHGICEGIWIKRILEELKIHLNTPIRIYCDNKAAISIAHNPVLHDRTKHIEVDKHFIKEK

AT4G24270.1 EMBRYO DEFECTIVE 1403.6e-10453.98Show/hide
Query:  NGDELMPDL----PQNPTSPASDSSSSDSGSEDDESDQKLQLQTLQSELSTNPSYYDAHVQYIKLLRKIGDIDNLRKAREAMSEIFPLTPPMWQEWAKDE
        N D+ M D     P     P+SD S       +DE++   Q+ TL+SELS NP  YDA+VQYIKLLRK  +++ LR+AREAMS IFPL+P +W EWA+DE
Subjt:  NGDELMPDL----PQNPTSPASDSSSSDSGSEDDESDQKLQLQTLQSELSTNPSYYDAHVQYIKLLRKIGDIDNLRKAREAMSEIFPLTPPMWQEWAKDE

Query:  ASISTGPEALAAIERLYERGVFDYLSVSLWCDYLNFVQEYDPLVQDRATSGVKKARDLFERALTAAGLHFTEAEKLWEAYREFEKTIYQSIEETDTQAKE
        AS++   E +  I  LYERG+ DY SVSLWCDYL+F+ E+DP V+   + G+ K R LFERA+ AAG H TE  ++WE YREFE+ +  +I+E D + + 
Subjt:  ASISTGPEALAAIERLYERGVFDYLSVSLWCDYLNFVQEYDPLVQDRATSGVKKARDLFERALTAAGLHFTEAEKLWEAYREFEKTIYQSIEETDTQAKE

Query:  KQVQLIRSIFHRQLSLPLSNMNSTLEAYKAWEMEVKQGCILDVKSNDSDGVPSQVASAYQRALDMYNARVQLEEQISKQDLTDTERLHQYIIYLKFEQSA
        KQ+Q IRSIFHR LS+PL N++STL AYK WE+E  QG  LD+ S+D   V  QVA A ++A  MY+ R  LEE ISKQDL+DTE+  +++ Y+KFE+++
Subjt:  KQVQLIRSIFHRQLSLPLSNMNSTLEAYKAWEMEVKQGCILDVKSNDSDGVPSQVASAYQRALDMYNARVQLEEQISKQDLTDTERLHQYIIYLKFEQSA

Query:  GDPARVQILFERAVADFPVSVDLWLDYTHYMDKTLKVGNIVRNVYSRATRNC
        GDP RVQ ++ERAVA++PVS DLW+DYT Y+DKTLKVG  + + YSRATR+C
Subjt:  GDPARVQILFERAVADFPVSVDLWLDYTHYMDKTLKVGNIVRNVYSRATRNC

AT4G24270.2 EMBRYO DEFECTIVE 1403.6e-10453.98Show/hide
Query:  NGDELMPDL----PQNPTSPASDSSSSDSGSEDDESDQKLQLQTLQSELSTNPSYYDAHVQYIKLLRKIGDIDNLRKAREAMSEIFPLTPPMWQEWAKDE
        N D+ M D     P     P+SD S       +DE++   Q+ TL+SELS NP  YDA+VQYIKLLRK  +++ LR+AREAMS IFPL+P +W EWA+DE
Subjt:  NGDELMPDL----PQNPTSPASDSSSSDSGSEDDESDQKLQLQTLQSELSTNPSYYDAHVQYIKLLRKIGDIDNLRKAREAMSEIFPLTPPMWQEWAKDE

Query:  ASISTGPEALAAIERLYERGVFDYLSVSLWCDYLNFVQEYDPLVQDRATSGVKKARDLFERALTAAGLHFTEAEKLWEAYREFEKTIYQSIEETDTQAKE
        AS++   E +  I  LYERG+ DY SVSLWCDYL+F+ E+DP V+   + G+ K R LFERA+ AAG H TE  ++WE YREFE+ +  +I+E D + + 
Subjt:  ASISTGPEALAAIERLYERGVFDYLSVSLWCDYLNFVQEYDPLVQDRATSGVKKARDLFERALTAAGLHFTEAEKLWEAYREFEKTIYQSIEETDTQAKE

Query:  KQVQLIRSIFHRQLSLPLSNMNSTLEAYKAWEMEVKQGCILDVKSNDSDGVPSQVASAYQRALDMYNARVQLEEQISKQDLTDTERLHQYIIYLKFEQSA
        KQ+Q IRSIFHR LS+PL N++STL AYK WE+E  QG  LD+ S+D   V  QVA A ++A  MY+ R  LEE ISKQDL+DTE+  +++ Y+KFE+++
Subjt:  KQVQLIRSIFHRQLSLPLSNMNSTLEAYKAWEMEVKQGCILDVKSNDSDGVPSQVASAYQRALDMYNARVQLEEQISKQDLTDTERLHQYIIYLKFEQSA

Query:  GDPARVQILFERAVADFPVSVDLWLDYTHYMDKTLKVGNIVRNVYSRATRNC
        GDP RVQ ++ERAVA++PVS DLW+DYT Y+DKTLKVG  + + YSRATR+C
Subjt:  GDPARVQILFERAVADFPVSVDLWLDYTHYMDKTLKVGNIVRNVYSRATRNC

ATMG00240.1 Gag-Pol-related retrotransposon family protein5.8e-1439.02Show/hide
Query:  IYLSHTRPDIAYAVSMVSQFMHAPGPTHFEAVYRILKYLKGTPGKGILFGRHNHFQIEIYTDADWVGSTTDRRSTSSYCSFV
        +YL+ TRPD+ +AV+ +SQF  A      +AVY++L Y+KGT G+G+ +   +  Q++ + D+DW      RRS + +CS V
Subjt:  IYLSHTRPDIAYAVSMVSQFMHAPGPTHFEAVYRILKYLKGTPGKGILFGRHNHFQIEIYTDADWVGSTTDRRSTSSYCSFV

ATMG00810.1 DNA/RNA polymerases superfamily protein3.4e-4640.18Show/hide
Query:  LIVYVDDIILTGDDEAGLADLKKNLAREFQIKDLGPLKYFLGMEFARSKKGIFVNQRKYVLDLLEETGLLGCKTAETPVEPNLKLQVAKVEEIKDKERYQ
        L++YVDDI+LTG     L  L   L+  F +KDLGP+ YFLG++      G+F++Q KY   +L   G+L CK   TP+   L   V+   +  D   ++
Subjt:  LIVYVDDIILTGDDEAGLADLKKNLAREFQIKDLGPLKYFLGMEFARSKKGIFVNQRKYVLDLLEETGLLGCKTAETPVEPNLKLQVAKVEEIKDKERYQ

Query:  RLVGRLIYLSHTRPDIAYAVSMVSQFMHAPGPTHFEAVYRILKYLKGTPGKGILFGRHNHFQIEIYTDADWVGSTTDRRSTSSYCSFVGGNLVTWRSKKQ
         +VG L YL+ TRPDI+YAV++V Q MH P    F+ + R+L+Y+KGT   G+   +++   ++ + D+DW G T+ RRST+ +C+F+G N+++W +K+Q
Subjt:  RLVGRLIYLSHTRPDIAYAVSMVSQFMHAPGPTHFEAVYRILKYLKGTPGKGILFGRHNHFQIEIYTDADWVGSTTDRRSTSSYCSFVGGNLVTWRSKKQ

Query:  NVVARSSAEAKFRALAHGICEGIW
          V+RSS E ++RALA    E  W
Subjt:  NVVARSSAEAKFRALAHGICEGIW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACAAAACCATAGAATCCAACGCCGAAGTTCCACTTCCAACCATTTCAGACGAAGGAAATACGGTTAATGGAGATGAACTAATGCCCGACCTTCCTCAGAACCCCAC
CTCCCCCGCCTCCGATTCTTCTTCCTCTGATTCCGGTTCTGAAGACGACGAATCCGACCAAAAGCTACAACTCCAGACCCTTCAATCTGAGCTCTCCACCAACCCTTCCT
ACTATGATGCTCATGTCCAATATATTAAGCTTTTGAGGAAAATTGGTGACATTGACAACCTGAGGAAAGCAAGGGAAGCGATGAGTGAGATATTCCCTTTGACCCCTCCA
ATGTGGCAGGAATGGGCTAAGGACGAAGCATCTATCAGTACTGGACCAGAGGCTCTTGCTGCAATTGAAAGGCTATATGAACGAGGGGTGTTTGATTATCTGTCGGTTTC
TCTTTGGTGTGACTACCTAAATTTCGTTCAAGAATATGATCCATTGGTGCAAGATCGTGCAACTTCTGGAGTTAAAAAGGCTAGAGATTTATTTGAACGTGCTCTCACAG
CTGCTGGTTTGCATTTTACTGAGGCTGAAAAATTATGGGAAGCATATAGAGAATTTGAGAAAACCATATATCAGAGCATTGAAGAAACTGATACTCAGGCGAAGGAAAAA
CAAGTCCAGCTAATACGGAGCATTTTCCATCGACAATTGTCACTTCCTCTATCTAACATGAATTCAACACTTGAGGCCTATAAGGCATGGGAGATGGAAGTGAAACAAGG
TTGCATTCTTGATGTGAAATCTAATGACTCGGATGGGGTTCCTTCTCAAGTTGCCTCAGCTTATCAAAGAGCATTGGATATGTATAATGCCCGTGTTCAATTGGAAGAAC
AAATATCCAAACAGGATTTAACTGATACTGAAAGGCTTCACCAGTACATCATTTACCTGAAATTTGAACAATCTGCTGGAGATCCAGCAAGGGTCCAAATCCTGTTTGAG
CGTGCAGTTGCTGATTTTCCAGTGTCGGTTGATCTTTGGCTTGATTATACCCATTACATGGACAAAACTTTGAAGGTGGGCAATATTGTGAGGAACGTTTACTCCAGGGC
AACTAGGAATTGTTTAAACCCTAGCCACCATACTTTTCTTGAAACCCTAGCCGCCGCCGCCGTCGGAACCCTAGCCGCCATTCTTGATTGGTGCATCCTACTTCCGACAT
CGGCGGTCTTACGCCGCCGCCGCCTCCGTCGACTACCTCAGGCGCCTCCGTCGACCATCTCAGACGCCGCCGTCGACCACCTCAGACGCCGCCGTCGTTTACCCAAACCT
TTCCATCGGTTACGTCGTGATCTGGAGACCCGTGAGAAAACAGCTGCGACCCACGCGACCAGCGACCTTGCAAACGCGACCCACGACCTCGTGAAAACTCGCCGCTGCAA
GTCACCGCTGTCAATCAATCCAGCGCCACATTCTCTCTCCCAGTCGATTCTCAACACACCATTGTTCTCTCTGGTTCCCTTTGCATACACATCGCACCCTCGCTGGCAAC
CACCTGATCAGAACCCTCGCCGCCGACCGGAACCCTTGCCGTCGACTGCAAACGGCGCATCCTTCTTCCGATCTGAGTTCTCGTGTTCGCAGGTGAAGTTGTTTTGGGCT
TGTGTGAGCTTCCCGATAGCAGTAAACTTTTTGGGTTGGTTCGCTGTGGCACAAACAGGGAATTATCCTCAAGCCTTCTCTTGTGGCACTTCTTCTCCGTGGATCATTGA
TTCCGGAGCTTCTGACCATATGACTAGTACCTCTCACTTTTTTAACTCATATTCTCCTTCTTATTCTAATGAAAAAATTCGAATTGCAGATGGCAGTTTTTCTTCCGTTG
CAGGAAAAGGAACTATTAAATTGACTGAACATATTTCTTTACAGTCTGTCTTACATGTTCCGAAATTAACCTGTAATCTCTTATTTGTCAGCAAACTTTCTAGAGATTCT
AGTTGTCGTGTTATATTCTTTGAATCTCATTGTATTCTTCAGGACCAAAAATCGGAGAAGATGATTGGGAGTGCTAGGATGCTTGATGGCCTTTACTACTTTGATGAAGG
TTCTTTCAGTAATAAAGTAACTCAGGGTTTCAGTAGTATGAGTTCTCTTTCTGATAAAGGTATTGTTCATCAATCTACTTGTCGAGATACTCCGCAACAGAATGGTGTTG
CTGAAAGAAAAAATAGGCATTTACTTGAAGTAGCTCGTGCTATAATGTTTTCCATGCATGTTCCAAAATATTTATGGGGCGAAGCAATTCTCACAACTACTTATCTAATA
AATAGAATGCCTAAAAATCAACCATATTTTACCCAAAGTTCTCTTCAGGGGGAGAAACCAAATTTGGAAGATAACTTTTGGGATGTTTCTACTCCTCTTCCTAACGTTAT
TTGTCCTGATGATATTAGTCCTTTGGTGCCAAGTATAGAAACTTCTCATTCAGGGGGAGAAATACTACATAATGATCCTACTGATCAAGTTCCTGAGTTTAAGTTTTATA
CTAAAAGAAAATTCAATCAGAGGAAGCAAGACCAAATAGTTGATCCTTCACAGAACCAATCTGAGACTCCGACGATTGAAACTGGAAGATCTGGTAATTCTTCTTCCATT
CCTACTTCAACTATTCAAAATACTTTGCCAGTTGTATCTGATCTTGATATGCCCATAGCTATTAGAAAAGAAGATTCAAATTGGAAAGTAGCAGTTATGGAGGAGATGAA
TGCTCTAAAACAAAATGGGACTTGGGAAATAGTAGAGTTGCCAAAAGATAAGAAACCAGTGGGTTGTAAATGGGTGTTCACCATAAAATGCAATGCAGATGTCAATTCAA
ATTGGTCACTTTACCAACTTGATGTAAAAAATGCATTTCTGAATGGTGATCTCGAGGAAGAGGTTTTTATGGACTTACCCCCAGGTTTTGAAGAAGATCTCGGATGCAGT
AAAATTTGTAAATTAAAAAAATCCCTCTATGGCTTGAAACAGTCACCAAGAGCTTGGTTTGAGCGGTTTGGAAATGTTGTGTCCAGCTATGGATTTATTCAGAGTCAAGC
AGATCATACTATATTTTATAAACACTCTGAGAATAATAAAATCTCAATTTTGATTGTCTATGTAGATGATATTATTCTCACAGGTGATGATGAAGCCGGTCTAGCTGATC
TCAAGAAAAACCTTGCAAGAGAGTTTCAAATCAAGGATCTAGGACCGTTGAAATATTTTCTAGGGATGGAATTTGCAAGATCGAAGAAGGGCATCTTTGTGAATCAGAGA
AAGTATGTTCTTGACTTACTTGAGGAAACTGGATTACTTGGATGCAAGACTGCAGAAACTCCAGTTGAACCCAATCTAAAGTTACAAGTAGCAAAAGTAGAAGAAATAAA
GGACAAGGAACGATATCAGAGACTTGTTGGAAGATTGATTTATCTATCTCATACACGTCCTGATATTGCATATGCAGTAAGCATGGTAAGTCAGTTTATGCATGCACCAG
GACCAACTCATTTTGAAGCTGTTTACAGAATTTTGAAGTATCTAAAAGGGACTCCTGGGAAAGGCATTCTATTTGGAAGACATAATCACTTTCAGATAGAAATTTATACT
GATGCAGACTGGGTAGGAAGTACTACAGATAGAAGATCTACCTCTAGTTATTGTTCCTTTGTTGGAGGGAACTTGGTTACTTGGAGAAGTAAAAAACAAAATGTGGTAGC
TAGAAGCAGTGCTGAAGCAAAATTTAGAGCATTGGCGCATGGAATTTGTGAGGGTATATGGATCAAAAGGATACTTGAAGAATTAAAGATTCATTTGAACACACCTATAC
GGATTTATTGTGATAATAAAGCTGCCATCTCTATAGCTCATAACCCAGTTCTACATGATAGAACAAAACACATTGAGGTTGATAAACATTTTATCAAGGAGAAGATTGAA
ACGAATACAATATGTATTCCTTACCTTCCTACATCAGAACAAATTGCCGATGTACTAACTAAAGGTCTCCCAAAGAAGCAATTCGATAAGTTGATTGACAAGCTGGCTAT
GGAAGACATCTTTAAACCAGCTTTGAGGGGGAGTGTTGGAATTTCTAATATTTGTGTGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGACAAAACCATAGAATCCAACGCCGAAGTTCCACTTCCAACCATTTCAGACGAAGGAAATACGGTTAATGGAGATGAACTAATGCCCGACCTTCCTCAGAACCCCAC
CTCCCCCGCCTCCGATTCTTCTTCCTCTGATTCCGGTTCTGAAGACGACGAATCCGACCAAAAGCTACAACTCCAGACCCTTCAATCTGAGCTCTCCACCAACCCTTCCT
ACTATGATGCTCATGTCCAATATATTAAGCTTTTGAGGAAAATTGGTGACATTGACAACCTGAGGAAAGCAAGGGAAGCGATGAGTGAGATATTCCCTTTGACCCCTCCA
ATGTGGCAGGAATGGGCTAAGGACGAAGCATCTATCAGTACTGGACCAGAGGCTCTTGCTGCAATTGAAAGGCTATATGAACGAGGGGTGTTTGATTATCTGTCGGTTTC
TCTTTGGTGTGACTACCTAAATTTCGTTCAAGAATATGATCCATTGGTGCAAGATCGTGCAACTTCTGGAGTTAAAAAGGCTAGAGATTTATTTGAACGTGCTCTCACAG
CTGCTGGTTTGCATTTTACTGAGGCTGAAAAATTATGGGAAGCATATAGAGAATTTGAGAAAACCATATATCAGAGCATTGAAGAAACTGATACTCAGGCGAAGGAAAAA
CAAGTCCAGCTAATACGGAGCATTTTCCATCGACAATTGTCACTTCCTCTATCTAACATGAATTCAACACTTGAGGCCTATAAGGCATGGGAGATGGAAGTGAAACAAGG
TTGCATTCTTGATGTGAAATCTAATGACTCGGATGGGGTTCCTTCTCAAGTTGCCTCAGCTTATCAAAGAGCATTGGATATGTATAATGCCCGTGTTCAATTGGAAGAAC
AAATATCCAAACAGGATTTAACTGATACTGAAAGGCTTCACCAGTACATCATTTACCTGAAATTTGAACAATCTGCTGGAGATCCAGCAAGGGTCCAAATCCTGTTTGAG
CGTGCAGTTGCTGATTTTCCAGTGTCGGTTGATCTTTGGCTTGATTATACCCATTACATGGACAAAACTTTGAAGGTGGGCAATATTGTGAGGAACGTTTACTCCAGGGC
AACTAGGAATTGTTTAAACCCTAGCCACCATACTTTTCTTGAAACCCTAGCCGCCGCCGCCGTCGGAACCCTAGCCGCCATTCTTGATTGGTGCATCCTACTTCCGACAT
CGGCGGTCTTACGCCGCCGCCGCCTCCGTCGACTACCTCAGGCGCCTCCGTCGACCATCTCAGACGCCGCCGTCGACCACCTCAGACGCCGCCGTCGTTTACCCAAACCT
TTCCATCGGTTACGTCGTGATCTGGAGACCCGTGAGAAAACAGCTGCGACCCACGCGACCAGCGACCTTGCAAACGCGACCCACGACCTCGTGAAAACTCGCCGCTGCAA
GTCACCGCTGTCAATCAATCCAGCGCCACATTCTCTCTCCCAGTCGATTCTCAACACACCATTGTTCTCTCTGGTTCCCTTTGCATACACATCGCACCCTCGCTGGCAAC
CACCTGATCAGAACCCTCGCCGCCGACCGGAACCCTTGCCGTCGACTGCAAACGGCGCATCCTTCTTCCGATCTGAGTTCTCGTGTTCGCAGGTGAAGTTGTTTTGGGCT
TGTGTGAGCTTCCCGATAGCAGTAAACTTTTTGGGTTGGTTCGCTGTGGCACAAACAGGGAATTATCCTCAAGCCTTCTCTTGTGGCACTTCTTCTCCGTGGATCATTGA
TTCCGGAGCTTCTGACCATATGACTAGTACCTCTCACTTTTTTAACTCATATTCTCCTTCTTATTCTAATGAAAAAATTCGAATTGCAGATGGCAGTTTTTCTTCCGTTG
CAGGAAAAGGAACTATTAAATTGACTGAACATATTTCTTTACAGTCTGTCTTACATGTTCCGAAATTAACCTGTAATCTCTTATTTGTCAGCAAACTTTCTAGAGATTCT
AGTTGTCGTGTTATATTCTTTGAATCTCATTGTATTCTTCAGGACCAAAAATCGGAGAAGATGATTGGGAGTGCTAGGATGCTTGATGGCCTTTACTACTTTGATGAAGG
TTCTTTCAGTAATAAAGTAACTCAGGGTTTCAGTAGTATGAGTTCTCTTTCTGATAAAGGTATTGTTCATCAATCTACTTGTCGAGATACTCCGCAACAGAATGGTGTTG
CTGAAAGAAAAAATAGGCATTTACTTGAAGTAGCTCGTGCTATAATGTTTTCCATGCATGTTCCAAAATATTTATGGGGCGAAGCAATTCTCACAACTACTTATCTAATA
AATAGAATGCCTAAAAATCAACCATATTTTACCCAAAGTTCTCTTCAGGGGGAGAAACCAAATTTGGAAGATAACTTTTGGGATGTTTCTACTCCTCTTCCTAACGTTAT
TTGTCCTGATGATATTAGTCCTTTGGTGCCAAGTATAGAAACTTCTCATTCAGGGGGAGAAATACTACATAATGATCCTACTGATCAAGTTCCTGAGTTTAAGTTTTATA
CTAAAAGAAAATTCAATCAGAGGAAGCAAGACCAAATAGTTGATCCTTCACAGAACCAATCTGAGACTCCGACGATTGAAACTGGAAGATCTGGTAATTCTTCTTCCATT
CCTACTTCAACTATTCAAAATACTTTGCCAGTTGTATCTGATCTTGATATGCCCATAGCTATTAGAAAAGAAGATTCAAATTGGAAAGTAGCAGTTATGGAGGAGATGAA
TGCTCTAAAACAAAATGGGACTTGGGAAATAGTAGAGTTGCCAAAAGATAAGAAACCAGTGGGTTGTAAATGGGTGTTCACCATAAAATGCAATGCAGATGTCAATTCAA
ATTGGTCACTTTACCAACTTGATGTAAAAAATGCATTTCTGAATGGTGATCTCGAGGAAGAGGTTTTTATGGACTTACCCCCAGGTTTTGAAGAAGATCTCGGATGCAGT
AAAATTTGTAAATTAAAAAAATCCCTCTATGGCTTGAAACAGTCACCAAGAGCTTGGTTTGAGCGGTTTGGAAATGTTGTGTCCAGCTATGGATTTATTCAGAGTCAAGC
AGATCATACTATATTTTATAAACACTCTGAGAATAATAAAATCTCAATTTTGATTGTCTATGTAGATGATATTATTCTCACAGGTGATGATGAAGCCGGTCTAGCTGATC
TCAAGAAAAACCTTGCAAGAGAGTTTCAAATCAAGGATCTAGGACCGTTGAAATATTTTCTAGGGATGGAATTTGCAAGATCGAAGAAGGGCATCTTTGTGAATCAGAGA
AAGTATGTTCTTGACTTACTTGAGGAAACTGGATTACTTGGATGCAAGACTGCAGAAACTCCAGTTGAACCCAATCTAAAGTTACAAGTAGCAAAAGTAGAAGAAATAAA
GGACAAGGAACGATATCAGAGACTTGTTGGAAGATTGATTTATCTATCTCATACACGTCCTGATATTGCATATGCAGTAAGCATGGTAAGTCAGTTTATGCATGCACCAG
GACCAACTCATTTTGAAGCTGTTTACAGAATTTTGAAGTATCTAAAAGGGACTCCTGGGAAAGGCATTCTATTTGGAAGACATAATCACTTTCAGATAGAAATTTATACT
GATGCAGACTGGGTAGGAAGTACTACAGATAGAAGATCTACCTCTAGTTATTGTTCCTTTGTTGGAGGGAACTTGGTTACTTGGAGAAGTAAAAAACAAAATGTGGTAGC
TAGAAGCAGTGCTGAAGCAAAATTTAGAGCATTGGCGCATGGAATTTGTGAGGGTATATGGATCAAAAGGATACTTGAAGAATTAAAGATTCATTTGAACACACCTATAC
GGATTTATTGTGATAATAAAGCTGCCATCTCTATAGCTCATAACCCAGTTCTACATGATAGAACAAAACACATTGAGGTTGATAAACATTTTATCAAGGAGAAGATTGAA
ACGAATACAATATGTATTCCTTACCTTCCTACATCAGAACAAATTGCCGATGTACTAACTAAAGGTCTCCCAAAGAAGCAATTCGATAAGTTGATTGACAAGCTGGCTAT
GGAAGACATCTTTAAACCAGCTTTGAGGGGGAGTGTTGGAATTTCTAATATTTGTGTGTAA
Protein sequenceShow/hide protein sequence
MDKTIESNAEVPLPTISDEGNTVNGDELMPDLPQNPTSPASDSSSSDSGSEDDESDQKLQLQTLQSELSTNPSYYDAHVQYIKLLRKIGDIDNLRKAREAMSEIFPLTPP
MWQEWAKDEASISTGPEALAAIERLYERGVFDYLSVSLWCDYLNFVQEYDPLVQDRATSGVKKARDLFERALTAAGLHFTEAEKLWEAYREFEKTIYQSIEETDTQAKEK
QVQLIRSIFHRQLSLPLSNMNSTLEAYKAWEMEVKQGCILDVKSNDSDGVPSQVASAYQRALDMYNARVQLEEQISKQDLTDTERLHQYIIYLKFEQSAGDPARVQILFE
RAVADFPVSVDLWLDYTHYMDKTLKVGNIVRNVYSRATRNCLNPSHHTFLETLAAAAVGTLAAILDWCILLPTSAVLRRRRLRRLPQAPPSTISDAAVDHLRRRRRLPKP
FHRLRRDLETREKTAATHATSDLANATHDLVKTRRCKSPLSINPAPHSLSQSILNTPLFSLVPFAYTSHPRWQPPDQNPRRRPEPLPSTANGASFFRSEFSCSQVKLFWA
CVSFPIAVNFLGWFAVAQTGNYPQAFSCGTSSPWIIDSGASDHMTSTSHFFNSYSPSYSNEKIRIADGSFSSVAGKGTIKLTEHISLQSVLHVPKLTCNLLFVSKLSRDS
SCRVIFFESHCILQDQKSEKMIGSARMLDGLYYFDEGSFSNKVTQGFSSMSSLSDKGIVHQSTCRDTPQQNGVAERKNRHLLEVARAIMFSMHVPKYLWGEAILTTTYLI
NRMPKNQPYFTQSSLQGEKPNLEDNFWDVSTPLPNVICPDDISPLVPSIETSHSGGEILHNDPTDQVPEFKFYTKRKFNQRKQDQIVDPSQNQSETPTIETGRSGNSSSI
PTSTIQNTLPVVSDLDMPIAIRKEDSNWKVAVMEEMNALKQNGTWEIVELPKDKKPVGCKWVFTIKCNADVNSNWSLYQLDVKNAFLNGDLEEEVFMDLPPGFEEDLGCS
KICKLKKSLYGLKQSPRAWFERFGNVVSSYGFIQSQADHTIFYKHSENNKISILIVYVDDIILTGDDEAGLADLKKNLAREFQIKDLGPLKYFLGMEFARSKKGIFVNQR
KYVLDLLEETGLLGCKTAETPVEPNLKLQVAKVEEIKDKERYQRLVGRLIYLSHTRPDIAYAVSMVSQFMHAPGPTHFEAVYRILKYLKGTPGKGILFGRHNHFQIEIYT
DADWVGSTTDRRSTSSYCSFVGGNLVTWRSKKQNVVARSSAEAKFRALAHGICEGIWIKRILEELKIHLNTPIRIYCDNKAAISIAHNPVLHDRTKHIEVDKHFIKEKIE
TNTICIPYLPTSEQIADVLTKGLPKKQFDKLIDKLAMEDIFKPALRGSVGISNICV