; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0029180 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0029180
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr8:36135806..36147215
RNA-Seq ExpressionLag0029180
SyntenyLag0029180
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003824 - catalytic activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR001878 - Zinc finger, CCHC-type
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0050719.1 putative gag-pol polyprotein [Cucumis melo var. makuwa]1.7e-29844.1Show/hide
Query:  MALARFDIEKFDGKGDFDLWKAKIKAILGQQKALYALADPDKLPATISAEDKESMNMTAYGTIILNLSNNVLRQVIDEETPLKIWAKLNKLYETKDAHNK
        MA  RF++ KF+G GDF LW+ KI+AIL Q K    L D ++LP  I+  +K  M+  AY TI+L LS+ VLR V +  T  ++W KL  LY TK   NK
Subjt:  MALARFDIEKFDGKGDFDLWKAKIKAILGQQKALYALADPDKLPATISAEDKESMNMTAYGTIILNLSNNVLRQVIDEETPLKIWAKLNKLYETKDAHNK

Query:  MYLREKFFTFKMDAGKPLSENLDEFKKMTSEFKNLGEKIGDENEAFVLLNSLPEAYKDVKNALKYGRENITTDGIISAIRTKELELLSVKKESSEG--LF
        +Y++EKFF +KMD  K L ENLDEF+K+  +  N+GEK+ DEN+A +LLNSLPE Y++VK A+KYGR+++T   ++ A++T+ LE   +KKE  +G  L 
Subjt:  MYLREKFFTFKMDAGKPLSENLDEFKKMTSEFKNLGEKIGDENEAFVLLNSLPEAYKDVKNALKYGRENITTDGIISAIRTKELELLSVKKESSEG--LF

Query:  VKGKGKNKETKGQS---EEKSKPKVR-CNYCHKEGHIKRECYSLKRKNQYHKSKKNRQSEAAV--GENTITYSDALATSDKSCTQNETAD---------K
         +G+ + K  KG+      KSK K R C  CHKEGH K+ C         +KS++   SEA V  G N+   +D     D + T  E+A+         +
Subjt:  VKGKGKNKETKGQS---EEKSKPKVR-CNYCHKEGHIKRECYSLKRKNQYHKSKKNRQSEAAV--GENTITYSDALATSDKSCTQNETAD---------K

Query:  FDWVIDSGCSFHMTPSKGWFSTYKEWDGGIVYMGNNHTCRVAGIGSVSLRLKDGSVILLRNVRLVPNLKRNLISLGMLDSIGCTYGGSSGTIEIRRESRV
          W++DSGC+FHMTP + + + +++ DGG V +G+N TC V G GSV +   DG V +L NVR VP LKRNLISLG LD  GCT    +G +++ + S V
Subjt:  FDWVIDSGCSFHMTPSKGWFSTYKEWDGGIVYMGNNHTCRVAGIGSVSLRLKDGSVILLRNVRLVPNLKRNLISLGMLDSIGCTYGGSSGTIEIRRESRV

Query:  VLTGNKINGLYVVKDVQMMESALVVSSDGPTESDLWHKRLSHISSRGLQELAKQGILPTGTGNDLSFCEQCVIGKAKRQNFSKSQQTTKGILEYVHSDLW
         L G   +GLYV++   +  SA + S      S LWHKRL+H+S RGLQ L++QG+L      +L FCE C++GK+ R  F K + TTKGIL+YVHSDLW
Subjt:  VLTGNKINGLYVVKDVQMMESALVVSSDGPTESDLWHKRLSHISSRGLQELAKQGILPTGTGNDLSFCEQCVIGKAKRQNFSKSQQTTKGILEYVHSDLW

Query:  GPASTNSLSNSRYFLTFIDDYSRKSWVYFLKTKDQVFEKFKEWKALVEKQTNKVIKCLRTDNGLEFCNEEFNAFCKSQGILRHRTVRYTPQQNGVAERLY
        GP    S+  SRYF++ IDD+SRK W+Y LK KD+ F KF EWK  VE QT + +K LRTDNGLEF N +FN FCKS+GI RH TV YTPQQNG+AER  
Subjt:  GPASTNSLSNSRYFLTFIDDYSRKSWVYFLKTKDQVFEKFKEWKALVEKQTNKVIKCLRTDNGLEFCNEEFNAFCKSQGILRHRTVRYTPQQNGVAERLY

Query:  RTIMERVRCQLSDAMLPERYWAEAASYTVYTLNRCPHTSLNFLTPEEKWSGKPPKLQHLKVFGCVSFVHQNQGKLKARAIKCMFLGFTEGVKGYRLWHPV
        RTIMER RC L++A LP ++W EAA    Y +NR P T+LN  TP+E W+GK P L+HL+VFGC ++ H   GKL  RA+KCMF+G+ +GVKGY+LW  +
Subjt:  RTIMERVRCQLSDAMLPERYWAEAASYTVYTLNRCPHTSLNFLTPEEKWSGKPPKLQHLKVFGCVSFVHQNQGKLKARAIKCMFLGFTEGVKGYRLWHPV

Query:  EK---KCINSRDVFFREQEM-YMLQKQSVEKELIEKETQIEVEHELLPS---------DSALEPETSLERGGGSEEGEE------DTVEVSQEVQDLQNY
        EK   KCI SRDV F E EM Y +++Q  ++      T++ +  E+ PS          S +E     E  G   + E         +E S    DLQNY
Subjt:  EK---KCINSRDVFFREQEM-YMLQKQSVEKELIEKETQIEVEHELLPS---------DSALEPETSLERGGGSEEGEE------DTVEVSQEVQDLQNY

Query:  SLARDRQRRQIVPPARFSEADCISMALNVADSISSEEPKNFEEAVNGPNGRQWIEAMNEEMRSLEENDTWTLKSLPNGYKPIASKWIYKIKEGITGILKP
         L RDR +R+   P R+  AD ++ AL  A      EP  FEEA+   + +QW +AM EE+ SL +N TW+L   P   K I SKWIYKIK G  G  KP
Subjt:  SLARDRQRRQIVPPARFSEADCISMALNVADSISSEEPKNFEEAVNGPNGRQWIEAMNEEMRSLEENDTWTLKSLPNGYKPIASKWIYKIKEGITGILKP

Query:  RFKARLVAKGYTQREGIDYSEIFSPVVKMTSIRMLLSIVVQEDLELDQLDVKTAFLHGFLSENIYMTQPQGYVRKGQENLVCYLNKSIYGL---------
        R+KARLVAKGYTQ+EG+D+ EIFSPVV+ +SIR++LSI V  D+ ++Q+DV TAFLHG L E IYM QP+GY  KG+E++VC L+KS+YGL         
Subjt:  RFKARLVAKGYTQREGIDYSEIFSPVVKMTSIRMLLSIVVQEDLELDQLDVKTAFLHGFLSENIYMTQPQGYVRKGQENLVCYLNKSIYGL---------

Query:  -----------------------NNHQGAGI----------------------------------KGSMK--LSLRLVSRKVPSTVVCAKEVQTIR----
                                + +G  I                                   G +K  L + +   K    +  ++E   I+    
Subjt:  -----------------------NNHQGAGI----------------------------------KGSMK--LSLRLVSRKVPSTVVCAKEVQTIR----

Query:  -----SKTSFNPLAHHFKLSAMNSP--KENELEHQAFMSKIPYSQAVGSLMYLMVSTRPDLSYATSLVSRYMSNPGKRHWEAVKWILR---------LRF
             SK    PLA HF+LS+   P  K+  +E    MS IPY  AVGS+MYLM+ TRPDL YA S++SR+MSNPGK HW+AVKW+LR         L +
Subjt:  -----SKTSFNPLAHHFKLSAMNSP--KENELEHQAFMSKIPYSQAVGSLMYLMVSTRPDLSYATSLVSRYMSNPGKRHWEAVKWILR---------LRF

Query:  RR---------GFR---------PKKVPIGLFFLLGQNLLSWKATLQSVVALSTTEAEFIALSEAIKEGIWLKGLLNDFGIKQTTVRIYCDNQSTIFLTK
         R         GF           ++   G  F L  N++SWK  LQ VVALSTTE+E+I+L EA+KE +WLK ++ +   ++    I+CD+QS I L K
Subjt:  RR---------GFR---------PKKVPIGLFFLLGQNLLSWKATLQSVVALSTTEAEFIALSEAIKEGIWLKGLLNDFGIKQTTVRIYCDNQSTIFLTK

Query:  HPQFNNRTKHIDIKYHFVREEIEKGEVEVLKVHTSDNAADLLTKPLPQLKFQHCLE
        +P  + R+KHID+K+H++R  I + +VE++KVHT +N +D+LTK L   +F++ L+
Subjt:  HPQFNNRTKHIDIKYHFVREEIEKGEVEVLKVHTSDNAADLLTKPLPQLKFQHCLE

KAG8484780.1 hypothetical protein CXB51_021428 [Gossypium anomalum]2.0e-28341.75Show/hide
Query:  MALARFDIEKFDGKGDFDLWKAKIKAILGQQKALYALADPDKLPATISAEDKESMNMTAYGTIILNLSNNVLRQVIDEETPLKIWAKLNKLYETKDAHNK
        MA  +++I   D    F LW+ K++A+L Q     AL   DK+P+T++ E+K+  +  A   + L+LSN +L+ V+ E+T   +W +L ++  +K   +K
Subjt:  MALARFDIEKFDGKGDFDLWKAKIKAILGQQKALYALADPDKLPATISAEDKESMNMTAYGTIILNLSNNVLRQVIDEETPLKIWAKLNKLYETKDAHNK

Query:  MYLREKFFTFKMDAGKPLSENLDEFKKMTSEFKNLGEKIGDENEAFVLLNSLPEAYKDVKNALKYGRENITTDGIISAIRT-KELELLSVKKES-SEGLF
        ++++++ +  +++ G  + E+L  FK++ S  + +  +   E+   +LL SLP +Y   ++ + Y RE++T D +  ++ +  +++ L VK +S  EGL 
Subjt:  MYLREKFFTFKMDAGKPLSENLDEFKKMTSEFKNLGEKIGDENEAFVLLNSLPEAYKDVKNALKYGRENITTDGIISAIRT-KELELLSVKKES-SEGLF

Query:  VKG---------KGKNKET--KGQSEEKSKPKVR---CNYCHKEGHIKRECYSLKRK------NQYHKSKKNRQSEAAVGENTITYSDALATSDKSCTQN
        V+G         +G+ +E   +G+S+ +SK   R   CN+C K+GHIK ECY L+ K      NQ  K  +N   EA V E+   YSD         + N
Subjt:  VKG---------KGKNKET--KGQSEEKSKPKVR---CNYCHKEGHIKRECYSLKRK------NQYHKSKKNRQSEAAVGENTITYSDALATSDKSCTQN

Query:  ETADKFDWVIDSGCSFHMTPSKGWFSTYKEWDGGIVYMGNNHTCRVAGIGSVSLRLKDGSVILLRNVRLVPNLKRNLISLGMLDSIGCTYGGSSGTIEIR
        ++    +W++DSGC+FHM+P++ WF+TY+    G+V MGNN +C++AG+G++ +++ DG V  L +VR VP LKRNLISL  LDS G  Y   SG ++I 
Subjt:  ETADKFDWVIDSGCSFHMTPSKGWFSTYKEWDGGIVYMGNNHTCRVAGIGSVSLRLKDGSVILLRNVRLVPNLKRNLISLGMLDSIGCTYGGSSGTIEIR

Query:  RESRVVLTG-NKINGLYVVKDVQMMESALVVSSDGPTE--SDLWHKRLSHISSRGLQELAKQGILPTGTGNDLSFCEQCVIGKAKRQNFSKSQQTTKGIL
        + S VV+ G  K   LYV++   +   A V SS    +  + LWH RL H+S  G+ EL+K+G+L       L+FCE CV GK KR  F++    TK  L
Subjt:  RESRVVLTG-NKINGLYVVKDVQMMESALVVSSDGPTE--SDLWHKRLSHISSRGLQELAKQGILPTGTGNDLSFCEQCVIGKAKRQNFSKSQQTTKGIL

Query:  EYVHSDLWGPASTNSLSNSRYFLTFIDDYSRKSWVYFLKTKDQVFEKFKEWKALVEKQTNKVIKCLRTDNGLEFCNEEFNAFCKSQGILRHRTVRYTPQQ
        EY+HSDLWGP+   S   + Y LTFIDD+SRK W +FLK K  VF  FK WK ++EKQT K IK LRTDNGLEFC++EFN FCKS+GI+RH TVR+TPQQ
Subjt:  EYVHSDLWGPASTNSLSNSRYFLTFIDDYSRKSWVYFLKTKDQVFEKFKEWKALVEKQTNKVIKCLRTDNGLEFCNEEFNAFCKSQGILRHRTVRYTPQQ

Query:  NGVAERLYRTIMERVRCQLSDAMLPERYWAEAASYTVYTLNRCPHTSLNFLTPEEKWSGKPPKLQHLKVFGCVSFVHQNQGKLKARAIKCMFLGFTEGVK
        NGVAER+ RTIME+VRC LS+A LP+ +WAEAAS   + +NR P  ++   TP+E WSG P     LK+FGC ++ H N GKL+ R+IKC+FLG+  GVK
Subjt:  NGVAERLYRTIMERVRCQLSDAMLPERYWAEAASYTVYTLNRCPHTSLNFLTPEEKWSGKPPKLQHLKVFGCVSFVHQNQGKLKARAIKCMFLGFTEGVK

Query:  GYRLWHPVEKKCINSRDVFFREQEMYMLQKQSVEKELIEKETQIEVEHELLPSDSALEPETSLERGGGSEEGEEDTVEVSQEVQDLQNYSLARDRQRRQI
        GY+LW P  +K + SRDV F   E  ML   S+ K+   KE Q +VEH++                  +E   + + ++   V     YS+A++R +R+I
Subjt:  GYRLWHPVEKKCINSRDVFFREQEMYMLQKQSVEKELIEKETQIEVEHELLPSDSALEPETSLERGGGSEEGEEDTVEVSQEVQDLQNYSLARDRQRRQI

Query:  VPPARFSEADCISMALNVADSI-SSEEPKNFEEAVNGPNGRQWIEAMNEEMRSLEENDTWTLKSLPNGYKPIASKWIYKIKEGITGILKPRFKARLVAKG
         PP +++EAD ++ ALNVA+ I +++EP N+ EA++  +  +W+ AM EEM SL +N TW L  LP G K +  KW++K KEG  G+ +P++KARLVAKG
Subjt:  VPPARFSEADCISMALNVADSI-SSEEPKNFEEAVNGPNGRQWIEAMNEEMRSLEENDTWTLKSLPNGYKPIASKWIYKIKEGITGILKPRFKARLVAKG

Query:  YTQREGIDYSEIFSPVVKMTSIRMLLSIVVQEDLELDQLDVKTAFLHGFLSENIYMTQPQGYVRKGQENLVCYLNKSIYGL-------------------
        Y+Q  G+D++++FSPVVK +SIR LL IV   DLEL+QLDVKTAFLHG L E+IYM QP+G+    +E+ VC L KS+YGL                   
Subjt:  YTQREGIDYSEIFSPVVKMTSIRMLLSIVVQEDLELDQLDVKTAFLHGFLSENIYMTQPQGYVRKGQENLVCYLNKSIYGL-------------------

Query:  -------------NNHQGAGI---------------KGSMKLSLRLVSRKV------PSTVVCAKEVQTIRSKTS----------------FN-------
                      N+ G+ +               KG ++     +S +       P+  +   E+ T R KTS                FN       
Subjt:  -------------NNHQGAGI---------------KGSMKLSLRLVSRKV------PSTVVCAKEVQTIRSKTS----------------FN-------

Query:  --PLAHHFKLSAMNSPK-ENELEHQAFMSKIPYSQAVGSLMYLMVSTRPDLSYATSLVSRYMSNPGKRHWEAVKWILR---------LRFRR--------
          PLA HF+LS+  SP+ ++E+E   +MS +PYS AVGSLMY MV +RPDLSYA S VSRYM+NPGK HW+AV+WILR         L+F R        
Subjt:  --PLAHHFKLSAMNSPK-ENELEHQAFMSKIPYSQAVGSLMYLMVSTRPDLSYATSLVSRYMSNPGKRHWEAVKWILR---------LRFRR--------

Query:  -------GFRPKKVPIGLFFLLGQNLLSWKATLQSVVALSTTEAEFIALSEAIKEGIWLKGLLNDFGIKQTTVRIYCDNQSTIFLTKHPQFNNRTKHIDI
                   ++   G  F +G   +SWKATLQ+ VALSTTEAE++A++EA KE IWLKGL ++         ++CD+QS IFLTK   F+ RTKHID+
Subjt:  -------GFRPKKVPIGLFFLLGQNLLSWKATLQSVVALSTTEAEFIALSEAIKEGIWLKGLLNDFGIKQTTVRIYCDNQSTIFLTKHPQFNNRTKHIDI

Query:  KYHFVREEIEKGEVEVLKVHTSDNAADLLTKPLPQLKFQHCLEMVG
        +YHFVR+ I +G++ V K+ T +N AD++TK LP  KF+HCL++VG
Subjt:  KYHFVREEIEKGEVEVLKVHTSDNAADLLTKPLPQLKFQHCLEMVG

PNX96445.1 copia LTR rider [Trifolium pratense]5.1e-28742.21Show/hide
Query:  MALARFDIEKFDGKGDFDLWKAKIKAILGQQKALYALADPDKLPATISAEDKESMNMTAYGTIILNLSNNVLRQVIDEETPLKIWAKLNKLYETKDAHNK
        M   +++IEKF G  DF LW+ K+KA+L QQ  L AL     + A ++A +K +M   A+  I+L+L + VLRQV  E T   +WAKL  LY TK   N+
Subjt:  MALARFDIEKFDGKGDFDLWKAKIKAILGQQKALYALADPDKLPATISAEDKESMNMTAYGTIILNLSNNVLRQVIDEETPLKIWAKLNKLYETKDAHNK

Query:  MYLREKFFTFKMDAGKPLSENLDEFKKMTSEFKNLGEKIGDENEAFVLLNSLPEAYKDVKNALKYGRENITTDGIISAIRTKELELLSVKKESS--EGLF
        +YL++  ++FKM   K L+E LD F K+  + +N+  KI DE++A +LL +LP ++   K  L YGRE++T + + SA+ +K+L      K S+  EGL 
Subjt:  MYLREKFFTFKMDAGKPLSENLDEFKKMTSEFKNLGEKIGDENEAFVLLNSLPEAYKDVKNALKYGRENITTDGIISAIRTKELELLSVKKESS--EGLF

Query:  VKGKGKNK----ETKGQSEEKSKPK----VRCNYCHKEGHIKRECYSLKRKNQYHKSKKNRQSEAAVGENTITYSDALATSDKSCTQNETADKFDWVIDS
        VKGK   K    + KG+S+ KS       +RC +C KEGH ++ C     + + H    N    AA+ ++    SD L  S           + +W++DS
Subjt:  VKGKGKNK----ETKGQSEEKSKPK----VRCNYCHKEGHIKRECYSLKRKNQYHKSKKNRQSEAAVGENTITYSDALATSDKSCTQNETADKFDWVIDS

Query:  GCSFHMTPSKGWFSTYKEWDGGIVYMGNNHTCRVAGIGSVSLRLKDGSVILLRNVRLVPNLKRNLISLGMLDSIGCTYGGSSGTIEIRRESRVVLTGNKI
        GC++HMTP+K  F    + DGG V +GNN  C++AG+GSV  +L D S+ LL  VR VP+LKRNL+SLG  D  G  + G    + + + S+ VL G K 
Subjt:  GCSFHMTPSKGWFSTYKEWDGGIVYMGNNHTCRVAGIGSVSLRLKDGSVILLRNVRLVPNLKRNLISLGMLDSIGCTYGGSSGTIEIRRESRVVLTGNKI

Query:  NGLYVVKDVQMMESALVVSSDGPTESDLWHKRLSHISSRGLQELAKQGILPTGTGNDLSFCEQCVIGKAKRQNFSKSQQTTKGILEYVHSDLWGPASTNS
         GLY ++   +  S  VVS+   +++++WH RL H+S RGL EL KQ +L       L FCE CV GK+ R  F+K +Q T G L+Y+H+DLWGPA   S
Subjt:  NGLYVVKDVQMMESALVVSSDGPTESDLWHKRLSHISSRGLQELAKQGILPTGTGNDLSFCEQCVIGKAKRQNFSKSQQTTKGILEYVHSDLWGPASTNS

Query:  LSNSRYFLTFIDDYSRKSWVYFLKTKDQVFEKFKEWKALVEKQTNKVIKCLRTDNGLEFCNEEFNAFCKSQGILRHRTVRYTPQQNGVAERLYRTIMERV
         S +RYFL+ +DDYSRK WV+  KTKD+ FE FK WK LVE QT + +K LRTDNGLEFCNE F+ FC + GI RHRT   TPQQNG+AER  RTI+ERV
Subjt:  LSNSRYFLTFIDDYSRKSWVYFLKTKDQVFEKFKEWKALVEKQTNKVIKCLRTDNGLEFCNEEFNAFCKSQGILRHRTVRYTPQQNGVAERLYRTIMERV

Query:  RCQLSDAMLPERYWAEAASYTVYTLNRCPHTSLNFLTPEEKWSGKPPKLQHLKVFGCVSFVHQNQGKLKARAIKCMFLGFTEGVKGYRLW--HPVEKKCI
        RC L+ A L + +WAEA S   Y +NRCP T+L+  TPEE WSG PP L  L+VFGCV++ H  Q K++ RA+KCMF+G+ EGVK YRLW   P  K+CI
Subjt:  RCQLSDAMLPERYWAEAASYTVYTLNRCPHTSLNFLTPEEKWSGKPPKLQHLKVFGCVSFVHQNQGKLKARAIKCMFLGFTEGVKGYRLW--HPVEKKCI

Query:  NSRDVFFREQEMYMLQKQSV-------EKELIEKETQIEVEHELLPSDSALEPETSLERGGGSEEGEEDTVEVSQEVQDLQNYSLARDRQRRQIVPPARF
         SRDV F E EM   +   V       ++EL + E  +EVEH     D+ L     +E        E +  E  +E  D  +Y L+RDR RR I  P R 
Subjt:  NSRDVFFREQEMYMLQKQSV-------EKELIEKETQIEVEHELLPSDSALEPETSLERGGGSEEGEEDTVEVSQEVQDLQNYSLARDRQRRQIVPPARF

Query:  SEADCISMALNVADSISSEEPKNFEEAVNGPNGRQWIEAMNEEMRSLEENDTWTLKSLPNGYKPIASKWIYKIKEGITGILKPRFKARLVAKGYTQREGI
          AD I+ AL  A  +  EEP++++E +   N  +W++AM++EM+SL +N TW L   P G + ++ KWI+K+KEGI G+   R+KARLVA+G+TQ+EG+
Subjt:  SEADCISMALNVADSISSEEPKNFEEAVNGPNGRQWIEAMNEEMRSLEENDTWTLKSLPNGYKPIASKWIYKIKEGITGILKPRFKARLVAKGYTQREGI

Query:  DYSEIFSPVVKMTSIRMLLSIVVQEDLELDQLDVKTAFLHGFLSENIYMTQPQGYVRKGQENLVCYLNKSIYGL--------------------------
        D++++FSPVVK  SIRMLL++V Q DLEL+Q+DVKTAFL+G L E I M QP+GYV +G+E+ VC L +S+YGL                          
Subjt:  DYSEIFSPVVKMTSIRMLLSIVVQEDLELDQLDVKTAFLHGFLSENIYMTQPQGYVRKGQENLVCYLNKSIYGL--------------------------

Query:  ----------------------------NNHQGAGIKGSMKLSLRL-------------VSRKVPSTVVCAKEVQTIR----------SKTSFNPLAHHF
                                    N      +K  +     +             + R    + +C  +   +R          SK    P    F
Subjt:  ----------------------------NNHQGAGIKGSMKLSLRL-------------VSRKVPSTVVCAKEVQTIR----------SKTSFNPLAHHF

Query:  KLSAMNSPKENELEHQAFMSKIPYSQAVGSLMYLMVSTRPDLSYATSLVSRYMSNPGKRHWEAVKWILRL-----------------------------R
        KLS    P  +    +A+M+ IPY+  VGSLMY MV TRPD++YA SLVSRYM+NPGK HW+A+KWILR                               
Subjt:  KLSAMNSPKENELEHQAFMSKIPYSQAVGSLMYLMVSTRPDLSYATSLVSRYMSNPGKRHWEAVKWILRL-----------------------------R

Query:  FRRGFRPKKVPIGLFFLLGQNLLSWKATLQSVVALSTTEAEFIALSEAIKEGIWLKGLLNDFGIKQTTVRIYCDNQSTIFLTKHPQFNNRTKHIDIKYHF
        +      +K   G  F +    +SWKATLQ VVALSTTEAE+IAL+EA+KE +WL+G   +  ++   + + CD+QS I L+K+  ++ RTKHID++ HF
Subjt:  FRRGFRPKKVPIGLFFLLGQNLLSWKATLQSVVALSTTEAEFIALSEAIKEGIWLKGLLNDFGIKQTTVRIYCDNQSTIFLTKHPQFNNRTKHIDIKYHF

Query:  VREEIEKGEVEVLKVHTSDNAADLLTKPLPQLKFQHCLEMV
        VR  IE+GEV+VLKV T DNAAD++TK LP  KF HC++++
Subjt:  VREEIEKGEVEVLKVHTSDNAADLLTKPLPQLKFQHCLEMV

PPR84446.1 hypothetical protein GOBAR_AA36262 [Gossypium barbadense]8.8e-28742.32Show/hide
Query:  RTQILKHSMALARFDIEKFDGKGDFDLWKAKIKAILGQQKALYALADPDKLPATISAEDKESMNMTAYGTIILNLSNNVLRQVIDEETPLKIWAKLNKLY
        R QI+  S++  ++D+EKF GK  F LW+ K++A+L QQ  L AL+  DKLP+T+S E K+ M   A+  I+L L + VLR+V DE+T   +W +L   Y
Subjt:  RTQILKHSMALARFDIEKFDGKGDFDLWKAKIKAILGQQKALYALADPDKLPATISAEDKESMNMTAYGTIILNLSNNVLRQVIDEETPLKIWAKLNKLY

Query:  ETKDAHNKMYLREKFFTFKMDAGKPLSENLDEFKKMTSEFKNLGEKIGDENEAFVLLNSLPEAYKDVKNALKYGRENITTDGIISAIRTKELEL----LS
         TK   N++YL+++ +  KM+ G P+S++LD+F  +  +  N+  KI DE++A ++L SLP +Y++  + + YGR+++T + + +A+ + EL        
Subjt:  ETKDAHNKMYLREKFFTFKMDAGKPLSENLDEFKKMTSEFKNLGEKIGDENEAFVLLNSLPEAYKDVKNALKYGRENITTDGIISAIRTKELEL----LS

Query:  VKKESSEGLFVKGKGKNK---ETKGQSEEKSKPKVRCNYCHKEGHIKRECYSLKRKNQYHKSKKNRQSEAAVGENTITYSDALATSDKSCTQNETADKFD
        V+    EGL  +G+ K K    +K     +SK +++C YC K GH+K +C   K K++  + ++N ++  A  +++      LA SD             
Subjt:  VKKESSEGLFVKGKGKNK---ETKGQSEEKSKPKVRCNYCHKEGHIKRECYSLKRKNQYHKSKKNRQSEAAVGENTITYSDALATSDKSCTQNETADKFD

Query:  WVIDSGCSFHMTPSKGWFSTYKEWDGGIVYMGNNHTCRVAGIGSVSLRLKDGSVILLRNVRLVPNLKRNLISLGMLDSIGCTYGGSSGTIEIRRESRVVL
        W++D+G +FH++ SK  FSTY++  G ++ MGN+H C+V GIG+V +++ DG V  L +VR +P +K+NLISL  LD  G  Y    G +++   +  V+
Subjt:  WVIDSGCSFHMTPSKGWFSTYKEWDGGIVYMGNNHTCRVAGIGSVSLRLKDGSVILLRNVRLVPNLKRNLISLGMLDSIGCTYGGSSGTIEIRRESRVVL

Query:  TGNKINGLYVVKDVQMMESALVVSSDGPTESD---LWHKRLSHISSRGLQELAKQGILPTGTGNDLSFCEQCVIGKAKRQNFSKSQQTTKGILEYVHSDL
         GN   GLY + D   +     VSS    +SD   LWH RL H+S RGL  L+K+G+L       L+FCE CV GK  R  FS     TKG ++Y HSDL
Subjt:  TGNKINGLYVVKDVQMMESALVVSSDGPTESD---LWHKRLSHISSRGLQELAKQGILPTGTGNDLSFCEQCVIGKAKRQNFSKSQQTTKGILEYVHSDL

Query:  WGPASTNSLSNSRYFLTFIDDYSRKSWVYFLKTKDQVFEKFKEWKALVEKQTNKVIKCLRTDNGLEFCNEEFNAFCKSQGILRHRTVRYTPQQNGVAERL
        WGP+ T S    RY LTFIDDYSRK WVYFLK+K +V   FK++KAL+E QT K IK  RTDNGLEFC+ EFN FCK++GI+RHRTVR TPQQNGVAER+
Subjt:  WGPASTNSLSNSRYFLTFIDDYSRKSWVYFLKTKDQVFEKFKEWKALVEKQTNKVIKCLRTDNGLEFCNEEFNAFCKSQGILRHRTVRYTPQQNGVAERL

Query:  YRTIMERVRCQLSDAMLPERYWAEAASYTVYTLNRCPHTSLNFLTPEEKWSGKPPKLQHLKVFGCVSFVHQNQGKLKARAIKCMFLGFTEGVKGYRLW--
         RT++ER RC  S+A L E +WAEA +   Y +NR P T++   TPEE WSG P     L+VFGC ++ H N+GKLK RA KC+FLG+ +GVKGYRLW  
Subjt:  YRTIMERVRCQLSDAMLPERYWAEAASYTVYTLNRCPHTSLNFLTPEEKWSGKPPKLQHLKVFGCVSFVHQNQGKLKARAIKCMFLGFTEGVKGYRLW--

Query:  HPVEKKCINSRDVFFREQEMYMLQKQSVEKELIEKETQIEVEHELLPSDSALEPETSLERGGGSEEGEEDTVEVSQEVQDLQN--YSLARDRQRRQIVPP
         PV  K I SRDV F E  M      S EKE  ++     VE ++       + +  +   G S + ++  V+ S E +  Q   YS+A  R +RQI P 
Subjt:  HPVEKKCINSRDVFFREQEMYMLQKQSVEKELIEKETQIEVEHELLPSDSALEPETSLERGGGSEEGEEDTVEVSQEVQDLQN--YSLARDRQRRQIVPP

Query:  ARFSEADCISMALNVADSISSEEPKNFEEAVNGPNGRQWIEAMNEEMRSLEENDTWTLKSLPNGYKPIASKWIYKIKEGITGILKPRFKARLVAKGYTQR
         R+  A+ +S AL+VA+SI   EP ++ EAV      QW  AM+EE+ SL +N TW L   P+  K +  KW++K KEGI G+   RFKARLVAKG+TQ+
Subjt:  ARFSEADCISMALNVADSISSEEPKNFEEAVNGPNGRQWIEAMNEEMRSLEENDTWTLKSLPNGYKPIASKWIYKIKEGITGILKPRFKARLVAKGYTQR

Query:  EGIDYSEIFSPVVKMTSIRMLLSIVVQEDLELDQLDVKTAFLHGFLSENIYMTQPQGYVRKGQENLVCYLNKSIYGLNNHQGAGIKGSMKLSLR------
        EGIDY+E+FSPVVK +SIR+LL++V + DLEL+QLDVKTAFLHG L E IYM QP+G+   G+E+ VC L KS+YGL        K      ++      
Subjt:  EGIDYSEIFSPVVKMTSIRMLLSIVVQEDLELDQLDVKTAFLHGFLSENIYMTQPQGYVRKGQENLVCYLNKSIYGLNNHQGAGIKGSMKLSLR------

Query:  -----LVSRK------------VPSTVVCAKEVQTI------------------------------------------------------RSKTSFNPLA
             +  RK            V   ++ +K +  I                                                      ++KT   PLA
Subjt:  -----LVSRK------------VPSTVVCAKEVQTI------------------------------------------------------RSKTSFNPLA

Query:  HHFKLSAMNSPKENELEHQAFMSKIPYSQAVGSLMYLMVSTRPDLSYATSLVSRYMSNPGKRHWEAVKWILR-LR-------------------------
         HFKLSA  SP+ +E + Q  MS IPYS AVGS+MY MV TRPD+S+A S+VSRYMS PGK HW+AVKWILR LR                         
Subjt:  HHFKLSAMNSPKENELEHQAFMSKIPYSQAVGSLMYLMVSTRPDLSYATSLVSRYMSNPGKRHWEAVKWILR-LR-------------------------

Query:  FRRGFRPKKVPIGLFFLLGQNLLSWKATLQSVVALSTTEAEFIALSEAIKEGIWLKGLLNDFGIKQTTVRIYCDNQSTIFLTKHPQFNNRTKHIDIKYHF
        +      ++   G  F      +SWKA LQS VALSTTEAE++AL+EA+KE +W+KGL++  G++Q    ++CD+QS I LTK+  F+ RTKHID++YHF
Subjt:  FRRGFRPKKVPIGLFFLLGQNLLSWKATLQSVVALSTTEAEFIALSEAIKEGIWLKGLLNDFGIKQTTVRIYCDNQSTIFLTKHPQFNNRTKHIDIKYHF

Query:  VREEIEKGEVEVLKVHTSDNAADLLTKPLPQLKFQHCLEMVG
        VRE + +G++ + KV T  N AD+LTK +P  KF+HCL+++G
Subjt:  VREEIEKGEVEVLKVHTSDNAADLLTKPLPQLKFQHCLEMVG

PPS20553.1 hypothetical protein GOBAR_AA00016 [Gossypium barbadense]5.7e-28642.28Show/hide
Query:  RTQILKHSMALARFDIEKFDGKGDFDLWKAKIKAILGQQKALYALADPDKLPATISAEDKESMNMTAYGTIILNLSNNVLRQVIDEETPLKIWAKLNKLY
        R QI+  S++  ++D+EKF GK  F LW+ K++A+L QQ  L AL+  DKLP+T+S E K+ M   A+  I+L L + VLR+V DE+T   +W +L   Y
Subjt:  RTQILKHSMALARFDIEKFDGKGDFDLWKAKIKAILGQQKALYALADPDKLPATISAEDKESMNMTAYGTIILNLSNNVLRQVIDEETPLKIWAKLNKLY

Query:  ETKDAHNKMYLREKFFTFKMDAGKPLSENLDEFKKMTSEFKNLGEKIGDENEAFVLLNSLPEAYKDVKNALKYGRENITTDGIISAIRTKELEL----LS
         TK   N++YL+++ +  KM+ G P+S++LD+F  +  +  N+  KI DE++A ++L SLP +Y++  + + YGR+++T + + +A+ + EL        
Subjt:  ETKDAHNKMYLREKFFTFKMDAGKPLSENLDEFKKMTSEFKNLGEKIGDENEAFVLLNSLPEAYKDVKNALKYGRENITTDGIISAIRTKELEL----LS

Query:  VKKESSEGLFVKGKGKNK---ETKGQSEEKSKPKVRCNYCHKEGHIKRECYSLKRKNQYHKSKKNRQSEAAVGENTITYSDALATSDKSCTQNETADKFD
        V+    EGL  +G+ K K    +K     +SK +++C YC K GH+K +C   K K++  + ++N ++  A  +++      LA SD             
Subjt:  VKKESSEGLFVKGKGKNK---ETKGQSEEKSKPKVRCNYCHKEGHIKRECYSLKRKNQYHKSKKNRQSEAAVGENTITYSDALATSDKSCTQNETADKFD

Query:  WVIDSGCSFHMTPSKGWFSTYKEWDGGIVYMGNNHTCRVAGIGSVSLRLKDGSVILLRNVRLVPNLKRNLISLGMLDSIGCTYGGSSGTIEIRRESRVVL
        W++D+G +FH++ SK  FSTY++  G ++ MGN+H C+V GIG+V +++ DG V  L +VR +P +K+NLISL  LD  G  Y    G +++   +  V+
Subjt:  WVIDSGCSFHMTPSKGWFSTYKEWDGGIVYMGNNHTCRVAGIGSVSLRLKDGSVILLRNVRLVPNLKRNLISLGMLDSIGCTYGGSSGTIEIRRESRVVL

Query:  TGNKINGLYVVKDVQMMESALVVSSDGPTESD---LWHKRLSHISSRGLQELAKQGILPTGTGNDLSFCEQCVIGKAKRQNFSKSQQTTKGILEYVHSDL
         GN   GLY + D   +     VSS    +SD   LWH RL H+S RGL  L+K+G+L       L+FCE CV GK  R  FS     TKG ++Y HSDL
Subjt:  TGNKINGLYVVKDVQMMESALVVSSDGPTESD---LWHKRLSHISSRGLQELAKQGILPTGTGNDLSFCEQCVIGKAKRQNFSKSQQTTKGILEYVHSDL

Query:  WGPASTNSLSNSRYFLTFIDDYSRKSWVYFLKTKDQVFEKFKEWKALVEKQTNKVIKCLRTDNGLEFCNEEFNAFCKSQGILRHRTVRYTPQQNGVAERL
        WGP+ T S    RY LTFIDDYSRK WVYFLK+K +V   FK++KAL+E QT K IK  RTDNGLEFC+ EFN FCK++GI+RHRTVR TPQQNGVAER+
Subjt:  WGPASTNSLSNSRYFLTFIDDYSRKSWVYFLKTKDQVFEKFKEWKALVEKQTNKVIKCLRTDNGLEFCNEEFNAFCKSQGILRHRTVRYTPQQNGVAERL

Query:  YRTIMERVRCQLSDAMLPERYWAEAASYTVYTLNRCPHTSLNFLTPEEKWSGKPPKLQHLKVFGCVSFVHQNQGKLKARAIKCMFLGFTEGVKGYRLW--
         RT++ER RC  S+A L E +WAEA +   Y +NR P T++   TPEE WSG P     L+VFGC ++ H N+GKLK RA KC+FLG+ +GVKGYRLW  
Subjt:  YRTIMERVRCQLSDAMLPERYWAEAASYTVYTLNRCPHTSLNFLTPEEKWSGKPPKLQHLKVFGCVSFVHQNQGKLKARAIKCMFLGFTEGVKGYRLW--

Query:  HPVEKKCINSRDVFFREQEMYMLQKQSVEKELIEKETQIEVEHELLPSDSALEPETSLERGGGSEEGEEDTVEVSQEVQDLQN--YSLARDRQRRQIVPP
         PV  K I SRDV F E  M      S EKE  ++     VE ++       + +  +   G S + ++  V+ S E +  Q   YS+A  R +RQI P 
Subjt:  HPVEKKCINSRDVFFREQEMYMLQKQSVEKELIEKETQIEVEHELLPSDSALEPETSLERGGGSEEGEEDTVEVSQEVQDLQN--YSLARDRQRRQIVPP

Query:  ARFSEADCISMALNVADSISSEEPKNFEEAVNGPNGRQWIEAMNEEMRSLEENDTWTLKSLPNGYKPIASKWIYKIKEGITGILKPRFKARLVAKGYTQR
         R+  A+ +S AL+VA+SI   EP ++ EAV      QW  AM+EE+ SL +N TW L   P+  K +  KW++K KEGI G+   RFKARLVAKG+TQ+
Subjt:  ARFSEADCISMALNVADSISSEEPKNFEEAVNGPNGRQWIEAMNEEMRSLEENDTWTLKSLPNGYKPIASKWIYKIKEGITGILKPRFKARLVAKGYTQR

Query:  EGIDYSEIFSPVVKMTSIRMLLSIVVQEDLELDQLDVKTAFLHGFLSENIYMTQPQGYVRKGQENLVCYLNKSIYGLNNHQGAGIKGSMKLSLR------
        EGIDY+E+FSPVVK +SIR+LL++V + DLEL+QLDVKTAFLHG L E IYM QP+G+   G+E+ VC L KS+YGL        K      ++      
Subjt:  EGIDYSEIFSPVVKMTSIRMLLSIVVQEDLELDQLDVKTAFLHGFLSENIYMTQPQGYVRKGQENLVCYLNKSIYGLNNHQGAGIKGSMKLSLR------

Query:  -----LVSRK------------VPSTVVCAKEVQTI------------------------------------------------------RSKTSFNPLA
             +  RK            V   ++ +K +  I                                                      ++KT   PLA
Subjt:  -----LVSRK------------VPSTVVCAKEVQTI------------------------------------------------------RSKTSFNPLA

Query:  HHFKLSAMNSPKENELEHQAFMSKIPYSQAVGSLMYLMVSTRPDLSYATSLVSRYMSNPGKRHWEAVKWILR-LR-------------------------
         HFKLSA  SP+ +E + Q  MS IPYS AVGS+MY MV TRPD+S+A S+VSRYMS PGK HW+AVKWILR LR                         
Subjt:  HHFKLSAMNSPKENELEHQAFMSKIPYSQAVGSLMYLMVSTRPDLSYATSLVSRYMSNPGKRHWEAVKWILR-LR-------------------------

Query:  FRRGFRPKKVPIGLFFLLGQNLLSWKATLQSVVALSTTEAEFIALSEAIKEGIWLKGLLNDFGIKQTTVRIYCDNQSTIFLTKHPQFNNRTKHIDIKYHF
        +      ++   G  F      +SWKA LQS VALSTTEAE++AL+EA+KE +W+KGL++  G++Q    ++CD+QS I LTK+  F+ RTKHID++YHF
Subjt:  FRRGFRPKKVPIGLFFLLGQNLLSWKATLQSVVALSTTEAEFIALSEAIKEGIWLKGLLNDFGIKQTTVRIYCDNQSTIFLTKHPQFNNRTKHIDIKYHF

Query:  VREEIEKGEVEVLKVHTSDNAADLLTKPLPQLKFQHCLEMV
        VRE + +G++ + KV T  N AD+LTK +P  KF+HCL+++
Subjt:  VREEIEKGEVEVLKVHTSDNAADLLTKPLPQLKFQHCLEMV

TrEMBL top hitse value%identityAlignment
A0A2K3N065 Copia LTR rider2.5e-28742.21Show/hide
Query:  MALARFDIEKFDGKGDFDLWKAKIKAILGQQKALYALADPDKLPATISAEDKESMNMTAYGTIILNLSNNVLRQVIDEETPLKIWAKLNKLYETKDAHNK
        M   +++IEKF G  DF LW+ K+KA+L QQ  L AL     + A ++A +K +M   A+  I+L+L + VLRQV  E T   +WAKL  LY TK   N+
Subjt:  MALARFDIEKFDGKGDFDLWKAKIKAILGQQKALYALADPDKLPATISAEDKESMNMTAYGTIILNLSNNVLRQVIDEETPLKIWAKLNKLYETKDAHNK

Query:  MYLREKFFTFKMDAGKPLSENLDEFKKMTSEFKNLGEKIGDENEAFVLLNSLPEAYKDVKNALKYGRENITTDGIISAIRTKELELLSVKKESS--EGLF
        +YL++  ++FKM   K L+E LD F K+  + +N+  KI DE++A +LL +LP ++   K  L YGRE++T + + SA+ +K+L      K S+  EGL 
Subjt:  MYLREKFFTFKMDAGKPLSENLDEFKKMTSEFKNLGEKIGDENEAFVLLNSLPEAYKDVKNALKYGRENITTDGIISAIRTKELELLSVKKESS--EGLF

Query:  VKGKGKNK----ETKGQSEEKSKPK----VRCNYCHKEGHIKRECYSLKRKNQYHKSKKNRQSEAAVGENTITYSDALATSDKSCTQNETADKFDWVIDS
        VKGK   K    + KG+S+ KS       +RC +C KEGH ++ C     + + H    N    AA+ ++    SD L  S           + +W++DS
Subjt:  VKGKGKNK----ETKGQSEEKSKPK----VRCNYCHKEGHIKRECYSLKRKNQYHKSKKNRQSEAAVGENTITYSDALATSDKSCTQNETADKFDWVIDS

Query:  GCSFHMTPSKGWFSTYKEWDGGIVYMGNNHTCRVAGIGSVSLRLKDGSVILLRNVRLVPNLKRNLISLGMLDSIGCTYGGSSGTIEIRRESRVVLTGNKI
        GC++HMTP+K  F    + DGG V +GNN  C++AG+GSV  +L D S+ LL  VR VP+LKRNL+SLG  D  G  + G    + + + S+ VL G K 
Subjt:  GCSFHMTPSKGWFSTYKEWDGGIVYMGNNHTCRVAGIGSVSLRLKDGSVILLRNVRLVPNLKRNLISLGMLDSIGCTYGGSSGTIEIRRESRVVLTGNKI

Query:  NGLYVVKDVQMMESALVVSSDGPTESDLWHKRLSHISSRGLQELAKQGILPTGTGNDLSFCEQCVIGKAKRQNFSKSQQTTKGILEYVHSDLWGPASTNS
         GLY ++   +  S  VVS+   +++++WH RL H+S RGL EL KQ +L       L FCE CV GK+ R  F+K +Q T G L+Y+H+DLWGPA   S
Subjt:  NGLYVVKDVQMMESALVVSSDGPTESDLWHKRLSHISSRGLQELAKQGILPTGTGNDLSFCEQCVIGKAKRQNFSKSQQTTKGILEYVHSDLWGPASTNS

Query:  LSNSRYFLTFIDDYSRKSWVYFLKTKDQVFEKFKEWKALVEKQTNKVIKCLRTDNGLEFCNEEFNAFCKSQGILRHRTVRYTPQQNGVAERLYRTIMERV
         S +RYFL+ +DDYSRK WV+  KTKD+ FE FK WK LVE QT + +K LRTDNGLEFCNE F+ FC + GI RHRT   TPQQNG+AER  RTI+ERV
Subjt:  LSNSRYFLTFIDDYSRKSWVYFLKTKDQVFEKFKEWKALVEKQTNKVIKCLRTDNGLEFCNEEFNAFCKSQGILRHRTVRYTPQQNGVAERLYRTIMERV

Query:  RCQLSDAMLPERYWAEAASYTVYTLNRCPHTSLNFLTPEEKWSGKPPKLQHLKVFGCVSFVHQNQGKLKARAIKCMFLGFTEGVKGYRLW--HPVEKKCI
        RC L+ A L + +WAEA S   Y +NRCP T+L+  TPEE WSG PP L  L+VFGCV++ H  Q K++ RA+KCMF+G+ EGVK YRLW   P  K+CI
Subjt:  RCQLSDAMLPERYWAEAASYTVYTLNRCPHTSLNFLTPEEKWSGKPPKLQHLKVFGCVSFVHQNQGKLKARAIKCMFLGFTEGVKGYRLW--HPVEKKCI

Query:  NSRDVFFREQEMYMLQKQSV-------EKELIEKETQIEVEHELLPSDSALEPETSLERGGGSEEGEEDTVEVSQEVQDLQNYSLARDRQRRQIVPPARF
         SRDV F E EM   +   V       ++EL + E  +EVEH     D+ L     +E        E +  E  +E  D  +Y L+RDR RR I  P R 
Subjt:  NSRDVFFREQEMYMLQKQSV-------EKELIEKETQIEVEHELLPSDSALEPETSLERGGGSEEGEEDTVEVSQEVQDLQNYSLARDRQRRQIVPPARF

Query:  SEADCISMALNVADSISSEEPKNFEEAVNGPNGRQWIEAMNEEMRSLEENDTWTLKSLPNGYKPIASKWIYKIKEGITGILKPRFKARLVAKGYTQREGI
          AD I+ AL  A  +  EEP++++E +   N  +W++AM++EM+SL +N TW L   P G + ++ KWI+K+KEGI G+   R+KARLVA+G+TQ+EG+
Subjt:  SEADCISMALNVADSISSEEPKNFEEAVNGPNGRQWIEAMNEEMRSLEENDTWTLKSLPNGYKPIASKWIYKIKEGITGILKPRFKARLVAKGYTQREGI

Query:  DYSEIFSPVVKMTSIRMLLSIVVQEDLELDQLDVKTAFLHGFLSENIYMTQPQGYVRKGQENLVCYLNKSIYGL--------------------------
        D++++FSPVVK  SIRMLL++V Q DLEL+Q+DVKTAFL+G L E I M QP+GYV +G+E+ VC L +S+YGL                          
Subjt:  DYSEIFSPVVKMTSIRMLLSIVVQEDLELDQLDVKTAFLHGFLSENIYMTQPQGYVRKGQENLVCYLNKSIYGL--------------------------

Query:  ----------------------------NNHQGAGIKGSMKLSLRL-------------VSRKVPSTVVCAKEVQTIR----------SKTSFNPLAHHF
                                    N      +K  +     +             + R    + +C  +   +R          SK    P    F
Subjt:  ----------------------------NNHQGAGIKGSMKLSLRL-------------VSRKVPSTVVCAKEVQTIR----------SKTSFNPLAHHF

Query:  KLSAMNSPKENELEHQAFMSKIPYSQAVGSLMYLMVSTRPDLSYATSLVSRYMSNPGKRHWEAVKWILRL-----------------------------R
        KLS    P  +    +A+M+ IPY+  VGSLMY MV TRPD++YA SLVSRYM+NPGK HW+A+KWILR                               
Subjt:  KLSAMNSPKENELEHQAFMSKIPYSQAVGSLMYLMVSTRPDLSYATSLVSRYMSNPGKRHWEAVKWILRL-----------------------------R

Query:  FRRGFRPKKVPIGLFFLLGQNLLSWKATLQSVVALSTTEAEFIALSEAIKEGIWLKGLLNDFGIKQTTVRIYCDNQSTIFLTKHPQFNNRTKHIDIKYHF
        +      +K   G  F +    +SWKATLQ VVALSTTEAE+IAL+EA+KE +WL+G   +  ++   + + CD+QS I L+K+  ++ RTKHID++ HF
Subjt:  FRRGFRPKKVPIGLFFLLGQNLLSWKATLQSVVALSTTEAEFIALSEAIKEGIWLKGLLNDFGIKQTTVRIYCDNQSTIFLTKHPQFNNRTKHIDIKYHF

Query:  VREEIEKGEVEVLKVHTSDNAADLLTKPLPQLKFQHCLEMV
        VR  IE+GEV+VLKV T DNAAD++TK LP  KF HC++++
Subjt:  VREEIEKGEVEVLKVHTSDNAADLLTKPLPQLKFQHCLEMV

A0A2P5W031 Uncharacterized protein4.2e-28742.32Show/hide
Query:  RTQILKHSMALARFDIEKFDGKGDFDLWKAKIKAILGQQKALYALADPDKLPATISAEDKESMNMTAYGTIILNLSNNVLRQVIDEETPLKIWAKLNKLY
        R QI+  S++  ++D+EKF GK  F LW+ K++A+L QQ  L AL+  DKLP+T+S E K+ M   A+  I+L L + VLR+V DE+T   +W +L   Y
Subjt:  RTQILKHSMALARFDIEKFDGKGDFDLWKAKIKAILGQQKALYALADPDKLPATISAEDKESMNMTAYGTIILNLSNNVLRQVIDEETPLKIWAKLNKLY

Query:  ETKDAHNKMYLREKFFTFKMDAGKPLSENLDEFKKMTSEFKNLGEKIGDENEAFVLLNSLPEAYKDVKNALKYGRENITTDGIISAIRTKELEL----LS
         TK   N++YL+++ +  KM+ G P+S++LD+F  +  +  N+  KI DE++A ++L SLP +Y++  + + YGR+++T + + +A+ + EL        
Subjt:  ETKDAHNKMYLREKFFTFKMDAGKPLSENLDEFKKMTSEFKNLGEKIGDENEAFVLLNSLPEAYKDVKNALKYGRENITTDGIISAIRTKELEL----LS

Query:  VKKESSEGLFVKGKGKNK---ETKGQSEEKSKPKVRCNYCHKEGHIKRECYSLKRKNQYHKSKKNRQSEAAVGENTITYSDALATSDKSCTQNETADKFD
        V+    EGL  +G+ K K    +K     +SK +++C YC K GH+K +C   K K++  + ++N ++  A  +++      LA SD             
Subjt:  VKKESSEGLFVKGKGKNK---ETKGQSEEKSKPKVRCNYCHKEGHIKRECYSLKRKNQYHKSKKNRQSEAAVGENTITYSDALATSDKSCTQNETADKFD

Query:  WVIDSGCSFHMTPSKGWFSTYKEWDGGIVYMGNNHTCRVAGIGSVSLRLKDGSVILLRNVRLVPNLKRNLISLGMLDSIGCTYGGSSGTIEIRRESRVVL
        W++D+G +FH++ SK  FSTY++  G ++ MGN+H C+V GIG+V +++ DG V  L +VR +P +K+NLISL  LD  G  Y    G +++   +  V+
Subjt:  WVIDSGCSFHMTPSKGWFSTYKEWDGGIVYMGNNHTCRVAGIGSVSLRLKDGSVILLRNVRLVPNLKRNLISLGMLDSIGCTYGGSSGTIEIRRESRVVL

Query:  TGNKINGLYVVKDVQMMESALVVSSDGPTESD---LWHKRLSHISSRGLQELAKQGILPTGTGNDLSFCEQCVIGKAKRQNFSKSQQTTKGILEYVHSDL
         GN   GLY + D   +     VSS    +SD   LWH RL H+S RGL  L+K+G+L       L+FCE CV GK  R  FS     TKG ++Y HSDL
Subjt:  TGNKINGLYVVKDVQMMESALVVSSDGPTESD---LWHKRLSHISSRGLQELAKQGILPTGTGNDLSFCEQCVIGKAKRQNFSKSQQTTKGILEYVHSDL

Query:  WGPASTNSLSNSRYFLTFIDDYSRKSWVYFLKTKDQVFEKFKEWKALVEKQTNKVIKCLRTDNGLEFCNEEFNAFCKSQGILRHRTVRYTPQQNGVAERL
        WGP+ T S    RY LTFIDDYSRK WVYFLK+K +V   FK++KAL+E QT K IK  RTDNGLEFC+ EFN FCK++GI+RHRTVR TPQQNGVAER+
Subjt:  WGPASTNSLSNSRYFLTFIDDYSRKSWVYFLKTKDQVFEKFKEWKALVEKQTNKVIKCLRTDNGLEFCNEEFNAFCKSQGILRHRTVRYTPQQNGVAERL

Query:  YRTIMERVRCQLSDAMLPERYWAEAASYTVYTLNRCPHTSLNFLTPEEKWSGKPPKLQHLKVFGCVSFVHQNQGKLKARAIKCMFLGFTEGVKGYRLW--
         RT++ER RC  S+A L E +WAEA +   Y +NR P T++   TPEE WSG P     L+VFGC ++ H N+GKLK RA KC+FLG+ +GVKGYRLW  
Subjt:  YRTIMERVRCQLSDAMLPERYWAEAASYTVYTLNRCPHTSLNFLTPEEKWSGKPPKLQHLKVFGCVSFVHQNQGKLKARAIKCMFLGFTEGVKGYRLW--

Query:  HPVEKKCINSRDVFFREQEMYMLQKQSVEKELIEKETQIEVEHELLPSDSALEPETSLERGGGSEEGEEDTVEVSQEVQDLQN--YSLARDRQRRQIVPP
         PV  K I SRDV F E  M      S EKE  ++     VE ++       + +  +   G S + ++  V+ S E +  Q   YS+A  R +RQI P 
Subjt:  HPVEKKCINSRDVFFREQEMYMLQKQSVEKELIEKETQIEVEHELLPSDSALEPETSLERGGGSEEGEEDTVEVSQEVQDLQN--YSLARDRQRRQIVPP

Query:  ARFSEADCISMALNVADSISSEEPKNFEEAVNGPNGRQWIEAMNEEMRSLEENDTWTLKSLPNGYKPIASKWIYKIKEGITGILKPRFKARLVAKGYTQR
         R+  A+ +S AL+VA+SI   EP ++ EAV      QW  AM+EE+ SL +N TW L   P+  K +  KW++K KEGI G+   RFKARLVAKG+TQ+
Subjt:  ARFSEADCISMALNVADSISSEEPKNFEEAVNGPNGRQWIEAMNEEMRSLEENDTWTLKSLPNGYKPIASKWIYKIKEGITGILKPRFKARLVAKGYTQR

Query:  EGIDYSEIFSPVVKMTSIRMLLSIVVQEDLELDQLDVKTAFLHGFLSENIYMTQPQGYVRKGQENLVCYLNKSIYGLNNHQGAGIKGSMKLSLR------
        EGIDY+E+FSPVVK +SIR+LL++V + DLEL+QLDVKTAFLHG L E IYM QP+G+   G+E+ VC L KS+YGL        K      ++      
Subjt:  EGIDYSEIFSPVVKMTSIRMLLSIVVQEDLELDQLDVKTAFLHGFLSENIYMTQPQGYVRKGQENLVCYLNKSIYGLNNHQGAGIKGSMKLSLR------

Query:  -----LVSRK------------VPSTVVCAKEVQTI------------------------------------------------------RSKTSFNPLA
             +  RK            V   ++ +K +  I                                                      ++KT   PLA
Subjt:  -----LVSRK------------VPSTVVCAKEVQTI------------------------------------------------------RSKTSFNPLA

Query:  HHFKLSAMNSPKENELEHQAFMSKIPYSQAVGSLMYLMVSTRPDLSYATSLVSRYMSNPGKRHWEAVKWILR-LR-------------------------
         HFKLSA  SP+ +E + Q  MS IPYS AVGS+MY MV TRPD+S+A S+VSRYMS PGK HW+AVKWILR LR                         
Subjt:  HHFKLSAMNSPKENELEHQAFMSKIPYSQAVGSLMYLMVSTRPDLSYATSLVSRYMSNPGKRHWEAVKWILR-LR-------------------------

Query:  FRRGFRPKKVPIGLFFLLGQNLLSWKATLQSVVALSTTEAEFIALSEAIKEGIWLKGLLNDFGIKQTTVRIYCDNQSTIFLTKHPQFNNRTKHIDIKYHF
        +      ++   G  F      +SWKA LQS VALSTTEAE++AL+EA+KE +W+KGL++  G++Q    ++CD+QS I LTK+  F+ RTKHID++YHF
Subjt:  FRRGFRPKKVPIGLFFLLGQNLLSWKATLQSVVALSTTEAEFIALSEAIKEGIWLKGLLNDFGIKQTTVRIYCDNQSTIFLTKHPQFNNRTKHIDIKYHF

Query:  VREEIEKGEVEVLKVHTSDNAADLLTKPLPQLKFQHCLEMVG
        VRE + +G++ + KV T  N AD+LTK +P  KF+HCL+++G
Subjt:  VREEIEKGEVEVLKVHTSDNAADLLTKPLPQLKFQHCLEMVG

A0A2P5YYC3 Uncharacterized protein2.7e-28642.28Show/hide
Query:  RTQILKHSMALARFDIEKFDGKGDFDLWKAKIKAILGQQKALYALADPDKLPATISAEDKESMNMTAYGTIILNLSNNVLRQVIDEETPLKIWAKLNKLY
        R QI+  S++  ++D+EKF GK  F LW+ K++A+L QQ  L AL+  DKLP+T+S E K+ M   A+  I+L L + VLR+V DE+T   +W +L   Y
Subjt:  RTQILKHSMALARFDIEKFDGKGDFDLWKAKIKAILGQQKALYALADPDKLPATISAEDKESMNMTAYGTIILNLSNNVLRQVIDEETPLKIWAKLNKLY

Query:  ETKDAHNKMYLREKFFTFKMDAGKPLSENLDEFKKMTSEFKNLGEKIGDENEAFVLLNSLPEAYKDVKNALKYGRENITTDGIISAIRTKELEL----LS
         TK   N++YL+++ +  KM+ G P+S++LD+F  +  +  N+  KI DE++A ++L SLP +Y++  + + YGR+++T + + +A+ + EL        
Subjt:  ETKDAHNKMYLREKFFTFKMDAGKPLSENLDEFKKMTSEFKNLGEKIGDENEAFVLLNSLPEAYKDVKNALKYGRENITTDGIISAIRTKELEL----LS

Query:  VKKESSEGLFVKGKGKNK---ETKGQSEEKSKPKVRCNYCHKEGHIKRECYSLKRKNQYHKSKKNRQSEAAVGENTITYSDALATSDKSCTQNETADKFD
        V+    EGL  +G+ K K    +K     +SK +++C YC K GH+K +C   K K++  + ++N ++  A  +++      LA SD             
Subjt:  VKKESSEGLFVKGKGKNK---ETKGQSEEKSKPKVRCNYCHKEGHIKRECYSLKRKNQYHKSKKNRQSEAAVGENTITYSDALATSDKSCTQNETADKFD

Query:  WVIDSGCSFHMTPSKGWFSTYKEWDGGIVYMGNNHTCRVAGIGSVSLRLKDGSVILLRNVRLVPNLKRNLISLGMLDSIGCTYGGSSGTIEIRRESRVVL
        W++D+G +FH++ SK  FSTY++  G ++ MGN+H C+V GIG+V +++ DG V  L +VR +P +K+NLISL  LD  G  Y    G +++   +  V+
Subjt:  WVIDSGCSFHMTPSKGWFSTYKEWDGGIVYMGNNHTCRVAGIGSVSLRLKDGSVILLRNVRLVPNLKRNLISLGMLDSIGCTYGGSSGTIEIRRESRVVL

Query:  TGNKINGLYVVKDVQMMESALVVSSDGPTESD---LWHKRLSHISSRGLQELAKQGILPTGTGNDLSFCEQCVIGKAKRQNFSKSQQTTKGILEYVHSDL
         GN   GLY + D   +     VSS    +SD   LWH RL H+S RGL  L+K+G+L       L+FCE CV GK  R  FS     TKG ++Y HSDL
Subjt:  TGNKINGLYVVKDVQMMESALVVSSDGPTESD---LWHKRLSHISSRGLQELAKQGILPTGTGNDLSFCEQCVIGKAKRQNFSKSQQTTKGILEYVHSDL

Query:  WGPASTNSLSNSRYFLTFIDDYSRKSWVYFLKTKDQVFEKFKEWKALVEKQTNKVIKCLRTDNGLEFCNEEFNAFCKSQGILRHRTVRYTPQQNGVAERL
        WGP+ T S    RY LTFIDDYSRK WVYFLK+K +V   FK++KAL+E QT K IK  RTDNGLEFC+ EFN FCK++GI+RHRTVR TPQQNGVAER+
Subjt:  WGPASTNSLSNSRYFLTFIDDYSRKSWVYFLKTKDQVFEKFKEWKALVEKQTNKVIKCLRTDNGLEFCNEEFNAFCKSQGILRHRTVRYTPQQNGVAERL

Query:  YRTIMERVRCQLSDAMLPERYWAEAASYTVYTLNRCPHTSLNFLTPEEKWSGKPPKLQHLKVFGCVSFVHQNQGKLKARAIKCMFLGFTEGVKGYRLW--
         RT++ER RC  S+A L E +WAEA +   Y +NR P T++   TPEE WSG P     L+VFGC ++ H N+GKLK RA KC+FLG+ +GVKGYRLW  
Subjt:  YRTIMERVRCQLSDAMLPERYWAEAASYTVYTLNRCPHTSLNFLTPEEKWSGKPPKLQHLKVFGCVSFVHQNQGKLKARAIKCMFLGFTEGVKGYRLW--

Query:  HPVEKKCINSRDVFFREQEMYMLQKQSVEKELIEKETQIEVEHELLPSDSALEPETSLERGGGSEEGEEDTVEVSQEVQDLQN--YSLARDRQRRQIVPP
         PV  K I SRDV F E  M      S EKE  ++     VE ++       + +  +   G S + ++  V+ S E +  Q   YS+A  R +RQI P 
Subjt:  HPVEKKCINSRDVFFREQEMYMLQKQSVEKELIEKETQIEVEHELLPSDSALEPETSLERGGGSEEGEEDTVEVSQEVQDLQN--YSLARDRQRRQIVPP

Query:  ARFSEADCISMALNVADSISSEEPKNFEEAVNGPNGRQWIEAMNEEMRSLEENDTWTLKSLPNGYKPIASKWIYKIKEGITGILKPRFKARLVAKGYTQR
         R+  A+ +S AL+VA+SI   EP ++ EAV      QW  AM+EE+ SL +N TW L   P+  K +  KW++K KEGI G+   RFKARLVAKG+TQ+
Subjt:  ARFSEADCISMALNVADSISSEEPKNFEEAVNGPNGRQWIEAMNEEMRSLEENDTWTLKSLPNGYKPIASKWIYKIKEGITGILKPRFKARLVAKGYTQR

Query:  EGIDYSEIFSPVVKMTSIRMLLSIVVQEDLELDQLDVKTAFLHGFLSENIYMTQPQGYVRKGQENLVCYLNKSIYGLNNHQGAGIKGSMKLSLR------
        EGIDY+E+FSPVVK +SIR+LL++V + DLEL+QLDVKTAFLHG L E IYM QP+G+   G+E+ VC L KS+YGL        K      ++      
Subjt:  EGIDYSEIFSPVVKMTSIRMLLSIVVQEDLELDQLDVKTAFLHGFLSENIYMTQPQGYVRKGQENLVCYLNKSIYGLNNHQGAGIKGSMKLSLR------

Query:  -----LVSRK------------VPSTVVCAKEVQTI------------------------------------------------------RSKTSFNPLA
             +  RK            V   ++ +K +  I                                                      ++KT   PLA
Subjt:  -----LVSRK------------VPSTVVCAKEVQTI------------------------------------------------------RSKTSFNPLA

Query:  HHFKLSAMNSPKENELEHQAFMSKIPYSQAVGSLMYLMVSTRPDLSYATSLVSRYMSNPGKRHWEAVKWILR-LR-------------------------
         HFKLSA  SP+ +E + Q  MS IPYS AVGS+MY MV TRPD+S+A S+VSRYMS PGK HW+AVKWILR LR                         
Subjt:  HHFKLSAMNSPKENELEHQAFMSKIPYSQAVGSLMYLMVSTRPDLSYATSLVSRYMSNPGKRHWEAVKWILR-LR-------------------------

Query:  FRRGFRPKKVPIGLFFLLGQNLLSWKATLQSVVALSTTEAEFIALSEAIKEGIWLKGLLNDFGIKQTTVRIYCDNQSTIFLTKHPQFNNRTKHIDIKYHF
        +      ++   G  F      +SWKA LQS VALSTTEAE++AL+EA+KE +W+KGL++  G++Q    ++CD+QS I LTK+  F+ RTKHID++YHF
Subjt:  FRRGFRPKKVPIGLFFLLGQNLLSWKATLQSVVALSTTEAEFIALSEAIKEGIWLKGLLNDFGIKQTTVRIYCDNQSTIFLTKHPQFNNRTKHIDIKYHF

Query:  VREEIEKGEVEVLKVHTSDNAADLLTKPLPQLKFQHCLEMV
        VRE + +G++ + KV T  N AD+LTK +P  KF+HCL+++
Subjt:  VREEIEKGEVEVLKVHTSDNAADLLTKPLPQLKFQHCLEMV

A0A5A7UB25 Putative gag-pol polyprotein8.2e-29944.1Show/hide
Query:  MALARFDIEKFDGKGDFDLWKAKIKAILGQQKALYALADPDKLPATISAEDKESMNMTAYGTIILNLSNNVLRQVIDEETPLKIWAKLNKLYETKDAHNK
        MA  RF++ KF+G GDF LW+ KI+AIL Q K    L D ++LP  I+  +K  M+  AY TI+L LS+ VLR V +  T  ++W KL  LY TK   NK
Subjt:  MALARFDIEKFDGKGDFDLWKAKIKAILGQQKALYALADPDKLPATISAEDKESMNMTAYGTIILNLSNNVLRQVIDEETPLKIWAKLNKLYETKDAHNK

Query:  MYLREKFFTFKMDAGKPLSENLDEFKKMTSEFKNLGEKIGDENEAFVLLNSLPEAYKDVKNALKYGRENITTDGIISAIRTKELELLSVKKESSEG--LF
        +Y++EKFF +KMD  K L ENLDEF+K+  +  N+GEK+ DEN+A +LLNSLPE Y++VK A+KYGR+++T   ++ A++T+ LE   +KKE  +G  L 
Subjt:  MYLREKFFTFKMDAGKPLSENLDEFKKMTSEFKNLGEKIGDENEAFVLLNSLPEAYKDVKNALKYGRENITTDGIISAIRTKELELLSVKKESSEG--LF

Query:  VKGKGKNKETKGQS---EEKSKPKVR-CNYCHKEGHIKRECYSLKRKNQYHKSKKNRQSEAAV--GENTITYSDALATSDKSCTQNETAD---------K
         +G+ + K  KG+      KSK K R C  CHKEGH K+ C         +KS++   SEA V  G N+   +D     D + T  E+A+         +
Subjt:  VKGKGKNKETKGQS---EEKSKPKVR-CNYCHKEGHIKRECYSLKRKNQYHKSKKNRQSEAAV--GENTITYSDALATSDKSCTQNETAD---------K

Query:  FDWVIDSGCSFHMTPSKGWFSTYKEWDGGIVYMGNNHTCRVAGIGSVSLRLKDGSVILLRNVRLVPNLKRNLISLGMLDSIGCTYGGSSGTIEIRRESRV
          W++DSGC+FHMTP + + + +++ DGG V +G+N TC V G GSV +   DG V +L NVR VP LKRNLISLG LD  GCT    +G +++ + S V
Subjt:  FDWVIDSGCSFHMTPSKGWFSTYKEWDGGIVYMGNNHTCRVAGIGSVSLRLKDGSVILLRNVRLVPNLKRNLISLGMLDSIGCTYGGSSGTIEIRRESRV

Query:  VLTGNKINGLYVVKDVQMMESALVVSSDGPTESDLWHKRLSHISSRGLQELAKQGILPTGTGNDLSFCEQCVIGKAKRQNFSKSQQTTKGILEYVHSDLW
         L G   +GLYV++   +  SA + S      S LWHKRL+H+S RGLQ L++QG+L      +L FCE C++GK+ R  F K + TTKGIL+YVHSDLW
Subjt:  VLTGNKINGLYVVKDVQMMESALVVSSDGPTESDLWHKRLSHISSRGLQELAKQGILPTGTGNDLSFCEQCVIGKAKRQNFSKSQQTTKGILEYVHSDLW

Query:  GPASTNSLSNSRYFLTFIDDYSRKSWVYFLKTKDQVFEKFKEWKALVEKQTNKVIKCLRTDNGLEFCNEEFNAFCKSQGILRHRTVRYTPQQNGVAERLY
        GP    S+  SRYF++ IDD+SRK W+Y LK KD+ F KF EWK  VE QT + +K LRTDNGLEF N +FN FCKS+GI RH TV YTPQQNG+AER  
Subjt:  GPASTNSLSNSRYFLTFIDDYSRKSWVYFLKTKDQVFEKFKEWKALVEKQTNKVIKCLRTDNGLEFCNEEFNAFCKSQGILRHRTVRYTPQQNGVAERLY

Query:  RTIMERVRCQLSDAMLPERYWAEAASYTVYTLNRCPHTSLNFLTPEEKWSGKPPKLQHLKVFGCVSFVHQNQGKLKARAIKCMFLGFTEGVKGYRLWHPV
        RTIMER RC L++A LP ++W EAA    Y +NR P T+LN  TP+E W+GK P L+HL+VFGC ++ H   GKL  RA+KCMF+G+ +GVKGY+LW  +
Subjt:  RTIMERVRCQLSDAMLPERYWAEAASYTVYTLNRCPHTSLNFLTPEEKWSGKPPKLQHLKVFGCVSFVHQNQGKLKARAIKCMFLGFTEGVKGYRLWHPV

Query:  EK---KCINSRDVFFREQEM-YMLQKQSVEKELIEKETQIEVEHELLPS---------DSALEPETSLERGGGSEEGEE------DTVEVSQEVQDLQNY
        EK   KCI SRDV F E EM Y +++Q  ++      T++ +  E+ PS          S +E     E  G   + E         +E S    DLQNY
Subjt:  EK---KCINSRDVFFREQEM-YMLQKQSVEKELIEKETQIEVEHELLPS---------DSALEPETSLERGGGSEEGEE------DTVEVSQEVQDLQNY

Query:  SLARDRQRRQIVPPARFSEADCISMALNVADSISSEEPKNFEEAVNGPNGRQWIEAMNEEMRSLEENDTWTLKSLPNGYKPIASKWIYKIKEGITGILKP
         L RDR +R+   P R+  AD ++ AL  A      EP  FEEA+   + +QW +AM EE+ SL +N TW+L   P   K I SKWIYKIK G  G  KP
Subjt:  SLARDRQRRQIVPPARFSEADCISMALNVADSISSEEPKNFEEAVNGPNGRQWIEAMNEEMRSLEENDTWTLKSLPNGYKPIASKWIYKIKEGITGILKP

Query:  RFKARLVAKGYTQREGIDYSEIFSPVVKMTSIRMLLSIVVQEDLELDQLDVKTAFLHGFLSENIYMTQPQGYVRKGQENLVCYLNKSIYGL---------
        R+KARLVAKGYTQ+EG+D+ EIFSPVV+ +SIR++LSI V  D+ ++Q+DV TAFLHG L E IYM QP+GY  KG+E++VC L+KS+YGL         
Subjt:  RFKARLVAKGYTQREGIDYSEIFSPVVKMTSIRMLLSIVVQEDLELDQLDVKTAFLHGFLSENIYMTQPQGYVRKGQENLVCYLNKSIYGL---------

Query:  -----------------------NNHQGAGI----------------------------------KGSMK--LSLRLVSRKVPSTVVCAKEVQTIR----
                                + +G  I                                   G +K  L + +   K    +  ++E   I+    
Subjt:  -----------------------NNHQGAGI----------------------------------KGSMK--LSLRLVSRKVPSTVVCAKEVQTIR----

Query:  -----SKTSFNPLAHHFKLSAMNSP--KENELEHQAFMSKIPYSQAVGSLMYLMVSTRPDLSYATSLVSRYMSNPGKRHWEAVKWILR---------LRF
             SK    PLA HF+LS+   P  K+  +E    MS IPY  AVGS+MYLM+ TRPDL YA S++SR+MSNPGK HW+AVKW+LR         L +
Subjt:  -----SKTSFNPLAHHFKLSAMNSP--KENELEHQAFMSKIPYSQAVGSLMYLMVSTRPDLSYATSLVSRYMSNPGKRHWEAVKWILR---------LRF

Query:  RR---------GFR---------PKKVPIGLFFLLGQNLLSWKATLQSVVALSTTEAEFIALSEAIKEGIWLKGLLNDFGIKQTTVRIYCDNQSTIFLTK
         R         GF           ++   G  F L  N++SWK  LQ VVALSTTE+E+I+L EA+KE +WLK ++ +   ++    I+CD+QS I L K
Subjt:  RR---------GFR---------PKKVPIGLFFLLGQNLLSWKATLQSVVALSTTEAEFIALSEAIKEGIWLKGLLNDFGIKQTTVRIYCDNQSTIFLTK

Query:  HPQFNNRTKHIDIKYHFVREEIEKGEVEVLKVHTSDNAADLLTKPLPQLKFQHCLE
        +P  + R+KHID+K+H++R  I + +VE++KVHT +N +D+LTK L   +F++ L+
Subjt:  HPQFNNRTKHIDIKYHFVREEIEKGEVEVLKVHTSDNAADLLTKPLPQLKFQHCLE

A5BPC1 Uncharacterized protein1.9e-27943.35Show/hide
Query:  MALARFDIEKFDGKGDFDLWKAKIKAILGQQKALYALADPDKLPATISAEDKESMNMTAYGTIILNLSNNVLRQVIDEETPLKIWAKLNKLYETKDAHNK
        M  A+FD+EKF GK DF L + K++A+L QQ    AL     LP+T+  + K  +   A+  IIL+L + VLR+    ++  ++W KL  LY TK   N+
Subjt:  MALARFDIEKFDGKGDFDLWKAKIKAILGQQKALYALADPDKLPATISAEDKESMNMTAYGTIILNLSNNVLRQVIDEETPLKIWAKLNKLYETKDAHNK

Query:  MYLREKFFTFKMDAGKPLSENLDEFKKMTSEFKNLGEKIGDENEAFVLLNSLPEAYKDVKNALKYGRENITTDGIISAIRTKELELL-SVKKESSEGLFV
        ++ + K +TFKM  G  +  +LD F K+  + +N+   I DE++A +LL SL  +Y ++K+A+ YGR+++T D + S +  +EL+     K+ES EGL +
Subjt:  MYLREKFFTFKMDAGKPLSENLDEFKKMTSEFKNLGEKIGDENEAFVLLNSLPEAYKDVKNALKYGRENITTDGIISAIRTKELELL-SVKKESSEGLFV

Query:  KGKGKNKETKG---QSEEKSK-PKVRCNYCHKEGHIKRECYSLKRKNQYHKSKKNRQSEAAVGENTITYSDALATSDKSCTQNETADKFDWVIDSGCSFH
        +G+ + +E KG   +S  KSK  K +C  CHKEGH K++C    R+    K   NR +    G                             +  G  F 
Subjt:  KGKGKNKETKG---QSEEKSK-PKVRCNYCHKEGHIKRECYSLKRKNQYHKSKKNRQSEAAVGENTITYSDALATSDKSCTQNETADKFDWVIDSGCSFH

Query:  MTPSKGWFSTYKEWDGGIVYMGNNHTCRVAGIGSVSLRLKDGSVILLRNVRLVPNLKRNLISLGMLDSIGCTYGGSSGTIEIRRESRVVLTGNKINGLYV
           SK    T+KE DGG V +GNN  C++ G G+V ++  DG   +L +VR +P LKRNLISLGMLD  G T+     ++ + R S  V+ G   NGLY 
Subjt:  MTPSKGWFSTYKEWDGGIVYMGNNHTCRVAGIGSVSLRLKDGSVILLRNVRLVPNLKRNLISLGMLDSIGCTYGGSSGTIEIRRESRVVLTGNKINGLYV

Query:  VKDVQMMESALVVSSDGPTESDLWHKRLSHISSRGLQELAKQGILPTGTGNDLSFCEQCVIGKAKRQNFSKSQQTTKGILEYVHSDLWGPASTNSLSNSR
        +    +      V  +    + LWH+RL HIS RGLQEL KQG+L      DL FCE CV GKA R  F+K+   T+  L+Y+HSDLWGP+   S+  +R
Subjt:  VKDVQMMESALVVSSDGPTESDLWHKRLSHISSRGLQELAKQGILPTGTGNDLSFCEQCVIGKAKRQNFSKSQQTTKGILEYVHSDLWGPASTNSLSNSR

Query:  YFLTFIDDYSRKSWVYFLKTKDQVFEKFKEWKALVEKQTNKVIKCLRTDNGLEFCNEEFNAFCKSQGILRHRTVRYTPQQNGVAERLYRTIMERVRCQLS
        YFLT IDDYSRK W+YFLK K + F KFKEWK LVE QT++ +K LRTDNGLEF + +FN+ C+ +GI RHRTVRYTPQQNG+AER+ RTI+ERVRC LS
Subjt:  YFLTFIDDYSRKSWVYFLKTKDQVFEKFKEWKALVEKQTNKVIKCLRTDNGLEFCNEEFNAFCKSQGILRHRTVRYTPQQNGVAERLYRTIMERVRCQLS

Query:  DAMLPERYWAEAASYTVYTLNRCPHTSLNFLTPEEKWSGKPPKLQHLKVFGCVSFVHQNQGKLKARAIKCMFLGFTEGVKGYRLWHPVE--KKCINSRDV
         + L + +WAEAA   V+ +NR P ++L F TP+EKW+GK    QHLKVFGC ++VH    KL+ RA+KC+FLG+ +GVKGY+LW   +   KCI SRDV
Subjt:  DAMLPERYWAEAASYTVYTLNRCPHTSLNFLTPEEKWSGKPPKLQHLKVFGCVSFVHQNQGKLKARAIKCMFLGFTEGVKGYRLWHPVE--KKCINSRDV

Query:  FFREQEMYMLQKQSVEKELIE-KETQIEVEHELLPSDSALEPETSLERGGGSEEGEEDTVEVSQEVQDLQNYSLARDRQRRQIVPPARFSEADCISMALN
         F EQ+M    KQ+  K++    + Q EVEHE       L+PE S E    + + E      ++  Q L++Y+L RDRQ+RQ+ PP R+ +A+  + AL+
Subjt:  FFREQEMYMLQKQSVEKELIE-KETQIEVEHELLPSDSALEPETSLERGGGSEEGEEDTVEVSQEVQDLQNYSLARDRQRRQIVPPARFSEADCISMALN

Query:  VADSISSEEPKNFEEAVNGPNGRQWIEAMNEEMRSLEENDTWTLKSLPNGYKPIASKWIYKIKEGITGILKPRFKARLVAKGYTQREGIDYSEIFSPVVK
        VA+ I   EPK ++EA+N     QW++A+ EEM SL +N+TW L + P   K + SKW++K K+G  G   PR+KARLVAKG++Q+EG+DY+EIFSPVVK
Subjt:  VADSISSEEPKNFEEAVNGPNGRQWIEAMNEEMRSLEENDTWTLKSLPNGYKPIASKWIYKIKEGITGILKPRFKARLVAKGYTQREGIDYSEIFSPVVK

Query:  MTSIRMLLSIVVQEDLELDQLDVKTAFLHGFLSENIYMTQPQGYVRKGQ----ENLVCYLNKSIYGLNNHQGAGIKGSMKLSLRLVSRKVPSTVVCAKEV
         +SIR+LL+ V  EDLELDQLDVKTAFLHG L E IYM  P+    + Q    + LV    K            +K  +K    +        ++   E+
Subjt:  MTSIRMLLSIVVQEDLELDQLDVKTAFLHGFLSENIYMTQPQGYVRKGQ----ENLVCYLNKSIYGLNNHQGAGIKGSMKLSLRLVSRKVPSTVVCAKEV

Query:  QTIRSKTSFNPLAHHFKLSAMNSPKENELEHQAFMSKIPYSQAVGSLMYLMVSTRPDLSYATSLVSRYMSNPGKRHWEAVKWILR---------------
        +  RSK     L    K   + +P+ +  E + FM +IPY+  VGS+MY MV +RPDL+YA S++SRYMS PGK HW+AVKW+ +               
Subjt:  QTIRSKTSFNPLAHHFKLSAMNSPKENELEHQAFMSKIPYSQAVGSLMYLMVSTRPDLSYATSLVSRYMSNPGKRHWEAVKWILR---------------

Query:  ------------LRFRRGFRPKKVPIGLFFLLGQNLLSWKATLQSVVALSTTEAEFIALSEAIKEGIWLKGLLNDFGIKQTTVRIYCDNQSTIFLTKHPQ
                      +      +K   G  F      +SWKA LQSVVALSTTEAE++A++EA+KE IWLKG+  +  + +  V +YCDNQS I L K+  
Subjt:  ------------LRFRRGFRPKKVPIGLFFLLGQNLLSWKATLQSVVALSTTEAEFIALSEAIKEGIWLKGLLNDFGIKQTTVRIYCDNQSTIFLTKHPQ

Query:  FNNRTKHIDIKYHFVREEIEKGEVEVLKVHTSDNAADLLTKPLPQLKFQHCLEMV
        F+ R+KHID++ HFVR+ I  GE+ V KVHT DN +D+LTK L   KF+HCL ++
Subjt:  FNNRTKHIDIKYHFVREEIEKGEVEVLKVHTSDNAADLLTKPLPQLKFQHCLEMV

SwissProt top hitse value%identityAlignment
P04146 Copia protein6.4e-13127.77Show/hide
Query:  MALARFDIEKFDGKGDFDLWKAKIKAILGQQKALYALADPDKLPATISAEDKESMNMTAYGTIILNLSNNVLRQVIDEETPLKIWAKLNKLYETKDAHNK
        M  A+ +I+ FDG+  + +WK +I+A+L +Q  L  +   D L      +  +     A  TII  LS++ L     + T  +I   L+ +YE K   ++
Subjt:  MALARFDIEKFDGKGDFDLWKAKIKAILGQQKALYALADPDKLPATISAEDKESMNMTAYGTIILNLSNNVLRQVIDEETPLKIWAKLNKLYETKDAHNK

Query:  MYLREKFFTFKMDAGKPLSENLDEFKKMTSEFKNLGEKIGDENEAFVLLNSLPEAYKDVKNALK-YGRENITTDGIISAIRTKELELLSVKKESSE----
        + LR++  + K+ +   L  +   F ++ SE    G KI + ++   LL +LP  Y  +  A++    EN+T   + + +  +E+++ +   ++S+    
Subjt:  MYLREKFFTFKMDAGKPLSENLDEFKKMTSEFKNLGEKIGDENEAFVLLNSLPEAYKDVKNALK-YGRENITTDGIISAIRTKELELLSVKKESSE----

Query:  -------GLFVKGKGKNKETKGQS--EEKSKPKVRCNYCHKEGHIKRECYSLKRKNQYHKSKKNRQSEAAVGENTITYSDALATSDKSCTQNETADKFDW
                 +     KN+ TK +   +  SK KV+C++C +EGHIK++C+  KR      + KN+++E  V   T   S  +A   K        D   +
Subjt:  -------GLFVKGKGKNKETKGQS--EEKSKPKVRCNYCHKEGHIKRECYSLKRKNQYHKSKKNRQSEAAVGENTITYSDALATSDKSCTQNETADKFDW

Query:  VIDSGCSFHMTPSKGWFSTYKE----WDGGIVYMGNNHTCRVAGIGSVSLRLKDGSVILLRNVRLVPNLKRNLISLGMLDSIGCTYGGSSGTIEIRRESR
        V+DSG S H+   +  ++   E        +   G        GI    +RL++   I L +V        NL+S+  L   G +       + I +   
Subjt:  VIDSGCSFHMTPSKGWFSTYKE----WDGGIVYMGNNHTCRVAGIGSVSLRLKDGSVILLRNVRLVPNLKRNLISLGMLDSIGCTYGGSSGTIEIRRESR

Query:  VVLTGNKINGLYVVKDVQMMES-------ALVVSSDGPTESDLWHKRLSHISSRGLQELAKQGI-----LPTGTGNDLSFCEQCVIGKAKRQNFSKSQQT
                NGL VVK+  M+ +       A  +++       LWH+R  HIS   L E+ ++ +     L          CE C+ GK  R  F + +  
Subjt:  VVLTGNKINGLYVVKDVQMMES-------ALVVSSDGPTESDLWHKRLSHISSRGLQELAKQGI-----LPTGTGNDLSFCEQCVIGKAKRQNFSKSQQT

Query:  T--KGILEYVHSDLWGPASTNSLSNSRYFLTFIDDYSRKSWVYFLKTKDQVFEKFKEWKALVEKQTNKVIKCLRTDNGLEFCNEEFNAFCKSQGILRHRT
        T  K  L  VHSD+ GP +  +L +  YF+ F+D ++     Y +K K  VF  F+++ A  E   N  +  L  DNG E+ + E   FC  +GI  H T
Subjt:  T--KGILEYVHSDLWGPASTNSLSNSRYFLTFIDDYSRKSWVYFLKTKDQVFEKFKEWKALVEKQTNKVIKCLRTDNGLEFCNEEFNAFCKSQGILRHRT

Query:  VRYTPQQNGVAERLYRTIMERVRCQLSDAMLPERYWAEAASYTVYTLNRCPHTSL--NFLTPEEKWSGKPPKLQHLKVFGCVSFVH--QNQGKLKARAIK
        V +TPQ NGV+ER+ RTI E+ R  +S A L + +W EA     Y +NR P  +L  +  TP E W  K P L+HL+VFG   +VH    QGK   ++ K
Subjt:  VRYTPQQNGVAERLYRTIMERVRCQLSDAMLPERYWAEAASYTVYTLNRCPHTSL--NFLTPEEKWSGKPPKLQHLKVFGCVSFVH--QNQGKLKARAIK

Query:  CMFLGFTEGVKGYRLWHPVEKKCINSRDVFFREQEMY---------MLQKQSVEKE----------LIEKE---------------TQIEVEHELLPSDS
         +F+G+     G++LW  V +K I +RDV   E  M          +  K S E E          +I+ E                  E E++  P+DS
Subjt:  CMFLGFTEGVKGYRLWHPVEKKCINSRDVFFREQEMY---------MLQKQSVEKE----------LIEKE---------------TQIEVEHELLPSDS

Query:  ALEPET----------------------------------------SLERGGGSEEGEEDTVEVSQEVQDLQNYS-------LARDRQRRQIVPPARFSE
            +T                                        S   G  +E  E +T E  +E+  + N +       + R  +R +  P   ++E
Subjt:  ALEPET----------------------------------------SLERGGGSEEGEEDTVEVSQEVQDLQNYS-------LARDRQRRQIVPPARFSE

Query:  AD-CISMALNVADSISSEEPKNFEEAVNGPNGRQWIEAMNEEMRSLEENDTWTLKSLPNGYKPIASKWIYKIKEGITGILKP-RFKARLVAKGYTQREGI
         D  ++  +  A +I ++ P +F+E     +   W EA+N E+ + + N+TWT+   P     + S+W++ +K    G   P R+KARLVA+G+TQ+  I
Subjt:  AD-CISMALNVADSISSEEPKNFEEAVNGPNGRQWIEAMNEEMRSLEENDTWTLKSLPNGYKPIASKWIYKIKEGITGILKP-RFKARLVAKGYTQREGI

Query:  DYSEIFSPVVKMTSIRMLLSIVVQEDLELDQLDVKTAFLHGFLSENIYMTQPQGYVRKGQENLVCYLNKSIYGLNNHQGAGI------------------
        DY E F+PV +++S R +LS+V+Q +L++ Q+DVKTAFL+G L E IYM  PQG +    +N VC LNK+IYGL                          
Subjt:  DYSEIFSPVVKMTSIRMLLSIVVQEDLELDQLDVKTAFLHGFLSENIYMTQPQGYVRKGQENLVCYLNKSIYGLNNHQGAGI------------------

Query:  -------KGSMKLSLRLVSRKVPSTVVCAKEVQTIRS----------KTSFNPLAHHFKL-----------------------------SAMNSPKENEL
               KG++  ++  V   V   V+   ++  + +           T  N + H   +                             +A+++P  +++
Subjt:  -------KGSMKLSLRLVSRKVPSTVVCAKEVQTIRS----------KTSFNPLAHHFKL-----------------------------SAMNSPKENEL

Query:  EHQAFMS----KIPYSQAVGSLMYLMVSTRPDLSYATSLVSRYMSNPGKRHWEAVKWILR---------LRFRRGFRPKKVPIG----------------
         ++   S      P    +G LMY+M+ TRPDL+ A +++SRY S      W+ +K +LR         L F++    +   IG                
Subjt:  EHQAFMS----KIPYSQAVGSLMYLMVSTRPDLSYATSLVSRYMSNPGKRHWEAVKWILR---------LRFRRGFRPKKVPIG----------------

Query:  ---LFFLLGQNLLSWKATLQSVVALSTTEAEFIALSEAIKEGIWLKGLLNDFGIK-QTTVRIYCDNQSTIFLTKHPQFNNRTKHIDIKYHFVREEIEKGE
           LF +   NL+ W    Q+ VA S+TEAE++AL EA++E +WLK LL    IK +  ++IY DNQ  I +  +P  + R KHIDIKYHF RE+++   
Subjt:  ---LFFLLGQNLLSWKATLQSVVALSTTEAEFIALSEAIKEGIWLKGLLNDFGIK-QTTVRIYCDNQSTIFLTKHPQFNNRTKHIDIKYHFVREEIEKGE

Query:  VEVLKVHTSDNAADLLTKPLPQLKFQHCLEMVG
        + +  + T +  AD+ TKPLP  +F    + +G
Subjt:  VEVLKVHTSDNAADLLTKPLPQLKFQHCLEMVG

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.7e-22436.11Show/hide
Query:  MALARFDIEKFDGKGDFDLWKAKIKAILGQQKALYALADPD-KLPATISAEDKESMNMTAYGTIILNLSNNVLRQVIDEETPLKIWAKLNKLYETKDAHN
        M+  ++++ KF+G   F  W+ +++ +L QQ  L+ + D D K P T+ AED   ++  A   I L+LS++V+  +IDE+T   IW +L  LY +K   N
Subjt:  MALARFDIEKFDGKGDFDLWKAKIKAILGQQKALYALADPD-KLPATISAEDKESMNMTAYGTIILNLSNNVLRQVIDEETPLKIWAKLNKLYETKDAHN

Query:  KMYLREKFFTFKMDAGKPLSENLDEFKKMTSEFKNLGEKIGDENEAFVLLNSLPEAYKDVKNALKYGRENITTDGIISAIRTKELELLSVKKESSEGLFV
        K+YL+++ +   M  G     +L+ F  + ++  NLG KI +E++A +LLNSLP +Y ++   + +G+  I    + SA+   E ++    +   + L  
Subjt:  KMYLREKFFTFKMDAGKPLSENLDEFKKMTSEFKNLGEKIGDENEAFVLLNSLPEAYKDVKNALKYGRENITTDGIISAIRTKELELLSVKKESSEGLFV

Query:  KGKGKNKE----------TKGQSEEKSKPKVR-CNYCHKEGHIKRECYSLKRKNQYHKSKKNRQSEAAVGENTITYSDALATSDKSCTQNETADKFDWVI
        +G+G++ +           +G+S+ +SK +VR C  C++ GH KR+C + ++       +KN  + AA+ +N       L  +++    + +  + +WV+
Subjt:  KGKGKNKE----------TKGQSEEKSKPKVR-CNYCHKEGHIKRECYSLKRKNQYHKSKKNRQSEAAVGENTITYSDALATSDKSCTQNETADKFDWVI

Query:  DSGCSFHMTPSKGWFSTYKEWDGGIVYMGNNHTCRVAGIGSVSLRLKDGSVILLRNVRLVPNLKRNLISLGMLDSIGCTYGGSSGTIEIRRESRVVLTGN
        D+  S H TP +  F  Y   D G V MGN    ++AGIG + ++   G  ++L++VR VP+L+ NLIS   LD  G     ++    + + S V+  G 
Subjt:  DSGCSFHMTPSKGWFSTYKEWDGGIVYMGNNHTCRVAGIGSVSLRLKDGSVILLRNVRLVPNLKRNLISLGMLDSIGCTYGGSSGTIEIRRESRVVLTGN

Query:  KINGLYVVKDVQMMESALVVSSDGPTESDLWHKRLSHISSRGLQELAKQGILPTGTGNDLSFCEQCVIGKAKRQNFSKSQQTTKGILEYVHSDLWGPAST
            LY   + ++ +  L  + D     DLWHKR+ H+S +GLQ LAK+ ++    G  +  C+ C+ GK  R +F  S +    IL+ V+SD+ GP   
Subjt:  KINGLYVVKDVQMMESALVVSSDGPTESDLWHKRLSHISSRGLQELAKQGILPTGTGNDLSFCEQCVIGKAKRQNFSKSQQTTKGILEYVHSDLWGPAST

Query:  NSLSNSRYFLTFIDDYSRKSWVYFLKTKDQVFEKFKEWKALVEKQTNKVIKCLRTDNGLEFCNEEFNAFCKSQGILRHRTVRYTPQQNGVAERLYRTIME
         S+  ++YF+TFIDD SRK WVY LKTKDQVF+ F+++ ALVE++T + +K LR+DNG E+ + EF  +C S GI   +TV  TPQ NGVAER+ RTI+E
Subjt:  NSLSNSRYFLTFIDDYSRKSWVYFLKTKDQVFEKFKEWKALVEKQTNKVIKCLRTDNGLEFCNEEFNAFCKSQGILRHRTVRYTPQQNGVAERLYRTIME

Query:  RVRCQLSDAMLPERYWAEAASYTVYTLNRCPHTSLNFLTPEEKWSGKPPKLQHLKVFGCVSFVH---QNQGKLKARAIKCMFLGFTEGVKGYRLWHPVEK
        +VR  L  A LP+ +W EA     Y +NR P   L F  PE  W+ K     HLKVFGC +F H   + + KL  ++I C+F+G+ +   GYRLW PV+K
Subjt:  RVRCQLSDAMLPERYWAEAASYTVYTLNRCPHTSLNFLTPEEKWSGKPPKLQHLKVFGCVSFVH---QNQGKLKARAIKCMFLGFTEGVKGYRLWHPVEK

Query:  KCINSRDVFFREQEMYML--QKQSVEKELI----------EKETQIEVEHELLPSDSALEPETSLERGGGSEEGEEDTVEVSQEVQDLQNYSLARDRQRR
        K I SRDV FRE E+       + V+  +I             T  E   + + S+   +P   +E+G   +EG E   EV    Q  + +   R R  R
Subjt:  KCINSRDVFFREQEMYML--QKQSVEKELI----------EKETQIEVEHELLPSDSALEPETSLERGGGSEEGEEDTVEVSQEVQDLQNYSLARDRQRR

Query:  QIVPPARFSEADCISMALNVADSISSEEPKNFEEAVNGPNGRQWIEAMNEEMRSLEENDTWTLKSLPNGYKPIASKWIYKIKEGITGILK-PRFKARLVA
          V   R+   + + ++          EP++ +E ++ P   Q ++AM EEM SL++N T+ L  LP G +P+  KW++K+K+   G  K  R+KARLV 
Subjt:  QIVPPARFSEADCISMALNVADSISSEEPKNFEEAVNGPNGRQWIEAMNEEMRSLEENDTWTLKSLPNGYKPIASKWIYKIKEGITGILK-PRFKARLVA

Query:  KGYTQREGIDYSEIFSPVVKMTSIRMLLSIVVQEDLELDQLDVKTAFLHGFLSENIYMTQPQGYVRKGQENLVCYLNKSIYGL-----------------
        KG+ Q++GID+ EIFSPVVKMTSIR +LS+    DLE++QLDVKTAFLHG L E IYM QP+G+   G++++VC LNKS+YGL                 
Subjt:  KGYTQREGIDYSEIFSPVVKMTSIRMLLSIVVQEDLELDQLDVKTAFLHGFLSENIYMTQPQGYVRKGQENLVCYLNKSIYGL-----------------

Query:  -------------------NN----------------HQG--AGIKGSMK--------------LSLRLVSRKVPSTVVCAKEVQTIRSKTSFN------
                           NN                 +G  A +KG +               L +++V  +    +  ++E    R    FN      
Subjt:  -------------------NN----------------HQG--AGIKGSMK--------------LSLRLVSRKVPSTVVCAKEVQTIRSKTSFN------

Query:  ---PLAHHFKLSAMNSPKENELEHQAFMSKIPYSQAVGSLMYLMVSTRPDLSYATSLVSRYMSNPGKRHWEAVKWILR-LRFRRG---------------
           PLA H KLS    P    +E +  M+K+PYS AVGSLMY MV TRPD+++A  +VSR++ NPGK HWEAVKWILR LR   G               
Subjt:  ---PLAHHFKLSAMNSPKENELEHQAFMSKIPYSQAVGSLMYLMVSTRPDLSYATSLVSRYMSNPGKRHWEAVKWILR-LRFRRG---------------

Query:  --------FRPKKVPIGLFFLLGQNLLSWKATLQSVVALSTTEAEFIALSEAIKEGIWLKGLLNDFGIKQTTVRIYCDNQSTIFLTKHPQFNNRTKHIDI
                   +K   G  F      +SW++ LQ  VALSTTEAE+IA +E  KE IWLK  L + G+ Q    +YCD+QS I L+K+  ++ RTKHID+
Subjt:  --------FRPKKVPIGLFFLLGQNLLSWKATLQSVVALSTTEAEFIALSEAIKEGIWLKGLLNDFGIKQTTVRIYCDNQSTIFLTKHPQFNNRTKHIDI

Query:  KYHFVREEIEKGEVEVLKVHTSDNAADLLTKPLPQLKFQHCLEMVG
        +YH++RE ++   ++VLK+ T++N AD+LTK +P+ KF+ C E+VG
Subjt:  KYHFVREEIEKGEVEVLKVHTSDNAADLLTKPLPQLKFQHCLEMVG

P93293 Uncharacterized mitochondrial protein AtMg003007.0e-2143.22Show/hide
Query:  SSGTIEIRRESRVVLTGNKINGLYVVK-DVQMMESALVVSSDGPTESDLWHKRLSHISSRGLQELAKQGILPTGTGNDLSFCEQCVIGKAKRQNFSKSQQ
        S G +++ +  R +L GN+ + LY+++  V+  ES L  +     E+ LWH RL+H+S RG++ L K+G L +   + L FCE C+ GK  R NFS  Q 
Subjt:  SSGTIEIRRESRVVLTGNKINGLYVVK-DVQMMESALVVSSDGPTESDLWHKRLSHISSRGLQELAKQGILPTGTGNDLSFCEQCVIGKAKRQNFSKSQQ

Query:  TTKGILEYVHSDLWGPAS
        TTK  L+YVHSDLWG  S
Subjt:  TTKGILEYVHSDLWGPAS

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE14.0e-9325.1Show/hide
Query:  LWKAKIKAIL-GQQKALYALADPDKLPATISAEDKESMN----------MTAYGTIILNLSNNVLRQVIDEETPLKIWAKLNKLYETKDAHNKMYLREKF
        +W  ++ A+  G + A +        PATI  +    +N             Y  ++  +S +V   V    T  +IW  L K+Y      +   LR + 
Subjt:  LWKAKIKAIL-GQQKALYALADPDKLPATISAEDKESMN----------MTAYGTIILNLSNNVLRQVIDEETPLKIWAKLNKLYETKDAHNKMYLREKF

Query:  FTFKMDAGKPLSENLDEFKKMTSEFKNLGEKIGDENEAFVLLNSLPEAYKDVKNALKYGRENITTDGIISAIRTKELELLSV---------------KKE
          +     K + + +        +   LG+ +  + +   +L +LPE YK V + +       T   I   +   E ++L+V               +  
Subjt:  FTFKMDAGKPLSENLDEFKKMTSEFKNLGEKIGDENEAFVLLNSLPEAYKDVKNALKYGRENITTDGIISAIRTKELELLSV---------------KKE

Query:  SSEGLFVKGKGKNKETKGQSEEKSKP------------------KVRCNYCHKEGHIKRECYSLKRKNQYHKSKKNRQSEAAVGENTITYSDALATSDKS
        ++      G   N+     +   SKP                    +C  C  +GH  + C  L    Q+  S  N Q   +        ++    S  S
Subjt:  SSEGLFVKGKGKNKETKGQSEEKSKP------------------KVRCNYCHKEGHIKRECYSLKRKNQYHKSKKNRQSEAAVGENTITYSDALATSDKS

Query:  CTQNETADKFDWVIDSGCSFHMTPSKGWFSTYKEWDGG-IVYMGNNHTCRVAGIGSVSLRLKDGSVILLRNVRLVPNLKRNLISLGML---DSIGCTYGG
                  +W++DSG + H+T      S ++ + GG  V + +  T  ++  GS SL  K    + L N+  VPN+ +NLIS+  L   + +   +  
Subjt:  CTQNETADKFDWVIDSGCSFHMTPSKGWFSTYKEWDGG-IVYMGNNHTCRVAGIGSVSLRLKDGSVILLRNVRLVPNLKRNLISLGML---DSIGCTYGG

Query:  SSGTIEIRRESRVVLTGNKINGLYVVKDVQMMESALVVSSDGPTESDLWHKRLSHISSRGLQE-LAKQGILPTGTGNDLSFCEQCVIGKAKRQNFSKSQQ
        +S  ++       +L G   + LY          +L  S         WH RL H +   L   ++   +      +    C  C+I K+ +  FS+S  
Subjt:  SSGTIEIRRESRVVLTGNKINGLYVVKDVQMMESALVVSSDGPTESDLWHKRLSHISSRGLQE-LAKQGILPTGTGNDLSFCEQCVIGKAKRQNFSKSQQ

Query:  TTKGILEYVHSDLWGPASTNSLSNSRYFLTFIDDYSRKSWVYFLKTKDQVFEKFKEWKALVEKQTNKVIKCLRTDNGLEFCNEEFNAFCKSQGILRHRTV
         +   LEY++SD+W  +   S  N RY++ F+D ++R +W+Y LK K QV E F  +K L+E +    I    +DNG EF       +    GI    + 
Subjt:  TTKGILEYVHSDLWGPASTNSLSNSRYFLTFIDDYSRKSWVYFLKTKDQVFEKFKEWKALVEKQTNKVIKCLRTDNGLEFCNEEFNAFCKSQGILRHRTV

Query:  RYTPQQNGVAERLYRTIMERVRCQLSDAMLPERYWAEAASYTVYTLNRCPHTSLNFLTPEEKWSGKPPKLQHLKVFGCVSFV---HQNQGKLKARAIKCM
         +TP+ NG++ER +R I+E     LS A +P+ YW  A +  VY +NR P   L   +P +K  G  P    L+VFGC  +      NQ KL  ++ +C+
Subjt:  RYTPQQNGVAERLYRTIMERVRCQLSDAMLPERYWAEAASYTVYTLNRCPHTSLNFLTPEEKWSGKPPKLQHLKVFGCVSFV---HQNQGKLKARAIKCM

Query:  FLGFTEGVKGYRLWHPVEKKCINSRDVFFREQ---------EMYMLQKQSVEKELI-EKETQIEVEHELLPSDSALEPE---------------------
        FLG++     Y   H    +   SR V F E           +  +Q+Q  E   +    T +     +LP+ S  +P                      
Subjt:  FLGFTEGVKGYRLWHPVEKKCINSRDVFFREQ---------EMYMLQKQSVEKELI-EKETQIEVEHELLPSDSALEPE---------------------

Query:  -----------TSLERGGGSEEGEEDTVEVSQ---EVQDLQNYSL------ARDRQRRQIVPPARFSEAD-------CISMALNVADSISSEEPKNFEEA
                   +S E     + G + T + +Q   +    QN S       +  +  + +  PA+ S +          S       SI    P    + 
Subjt:  -----------TSLERGGGSEEGEEDTVEVSQ---EVQDLQNYSL------ARDRQRRQIVPPARFSEAD-------CISMALNVADSISSEEPKNFEEA

Query:  VNGPN--------------------------------------------GRQWIEAMNEEMRSLEENDTWTLKSLPNGYKPIAS-KWIYKIKEGITGILK
        VN  N                                              +W  AM  E+ +   N TW L   P  +  I   +WI+  K    G L 
Subjt:  VNGPN--------------------------------------------GRQWIEAMNEEMRSLEENDTWTLKSLPNGYKPIAS-KWIYKIKEGITGILK

Query:  PRFKARLVAKGYTQREGIDYSEIFSPVVKMTSIRMLLSIVVQEDLELDQLDVKTAFLHGFLSENIYMTQPQGYVRKGQENLVCYLNKSIYGLNNHQGA--
         R+KARLVAKGY QR G+DY+E FSPV+K TSIR++L + V     + QLDV  AFL G L++++YM+QP G++ K + N VC L K++YGL     A  
Subjt:  PRFKARLVAKGYTQREGIDYSEIFSPVVKMTSIRMLLSIVVQEDLELDQLDVKTAFLHGFLSENIYMTQPQGYVRKGQENLVCYLNKSIYGLNNHQGA--

Query:  ----------GIKGSMKLSLRLVSRKVPSTVVCAKEVQTIRSKTSFNPLAHHF--KLSAMNSPKENELEHQAF---MSKIP-------------------
                  G   S+  +   V ++  S V     V  I    +   L H+    LS   S K++E  H        ++P                   
Subjt:  ----------GIKGSMKLSLRLVSRKVPSTVVCAKEVQTIRSKTSFNPLAHHF--KLSAMNSPKENELEHQAF---MSKIP-------------------

Query:  ------------------------------YSQAVGSLMYLMVSTRPDLSYATSLVSRYMSNPGKRHWEAVKWILRLRF---RRGFRPKK----------
                                      Y   VGSL YL   TRPD+SYA + +S++M  P + H +A+K ILR        G   KK          
Subjt:  ------------------------------YSQAVGSLMYLMVSTRPDLSYATSLVSRYMSNPGKRHWEAVKWILRLRF---RRGFRPKK----------

Query:  ------------VPIGLFFLLGQNLLSWKATLQSVVALSTTEAEFIALSEAIKEGIWLKGLLNDFGIKQTTVR-IYCDNQSTIFLTKHPQFNNRTKHIDI
                       G    LG + +SW +  Q  V  S+TEAE+ +++    E  W+  LL + GI+ T    IYCDN    +L  +P F++R KHI I
Subjt:  ------------VPIGLFFLLGQNLLSWKATLQSVVALSTTEAEFIALSEAIKEGIWLKGLLNDFGIKQTTVR-IYCDNQSTIFLTKHPQFNNRTKHIDI

Query:  KYHFVREEIEKGEVEVLKVHTSDNAADLLTKPLPQLKFQHCLEMVG
         YHF+R +++ G + V+ V T D  AD LTKPL +  FQ+    +G
Subjt:  KYHFVREEIEKGEVEVLKVHTSDNAADLLTKPLPQLKFQHCLEMVG

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.3e-9425.53Show/hide
Query:  LWKAKIKAIL-GQQKALYALADPDKLPATISAEDKESMN----------MTAYGTIILNLSNNVLRQVIDEETPLKIWAKLNKLYETKDAHNKMYLREKF
        +W  ++ A+  G + A +        PATI  +    +N             Y  I+  +S +V   V    T  +IW  L K+Y      +   LR   
Subjt:  LWKAKIKAIL-GQQKALYALADPDKLPATISAEDKESMN----------MTAYGTIILNLSNNVLRQVIDEETPLKIWAKLNKLYETKDAHNKMYLREKF

Query:  FTFKMDAGKPLSENLDEFKKMTSEFKNLGEKIGDENEAFVLLNSLPEAYKDVKNALKYGRENITTDGIISAIRTKELELLSVK--------------KES
        F  + D                 +   LG+ +  + +   +L +LP+ YK V + +       +   I   +  +E +LL++               + +
Subjt:  FTFKMDAGKPLSENLDEFKKMTSEFKNLGEKIGDENEAFVLLNSLPEAYKDVKNALKYGRENITTDGIISAIRTKELELLSVK--------------KES

Query:  SEGLFVKGKGKNK--------------ETKGQSEEKSKPKV---RCNYCHKEGHIKRECYSLKRKNQYHKSKKNRQSEAAVGENTITYSDALATSDKSCT
        +       +G N+               + G   +  +PK    RC  C  +GH  + C  L   +Q+  +   +QS       T  ++     ++ +  
Subjt:  SEGLFVKGKGKNK--------------ETKGQSEEKSKPKV---RCNYCHKEGHIKRECYSLKRKNQYHKSKKNRQSEAAVGENTITYSDALATSDKSCT

Query:  QNETADKFDWVIDSGCSFHMTPSKGWFSTYKEWDGG-IVYMGNNHTCRVAGIGSVSLRLKDGSVILLRNVRLVPNLKRNLISLGML---DSIGCTYGGSS
            A+  +W++DSG + H+T      S ++ + GG  V + +  T  +   GS SL     S + L  V  VPN+ +NLIS+  L   + +   +  +S
Subjt:  QNETADKFDWVIDSGCSFHMTPSKGWFSTYKEWDGG-IVYMGNNHTCRVAGIGSVSLRLKDGSVILLRNVRLVPNLKRNLISLGML---DSIGCTYGGSS

Query:  GTIEIRRESRVVLTGNKINGLYVVKDVQMMESALVVSSDGPTESDLWHKRLSHISSRGLQELAKQGILPT-GTGNDLSFCEQCVIGKAKRQNFSKSQQTT
          ++       +L G   + LY          ++  S         WH RL H S   L  +     LP     + L  C  C I K+ +  FS S  T+
Subjt:  GTIEIRRESRVVLTGNKINGLYVVKDVQMMESALVVSSDGPTESDLWHKRLSHISSRGLQELAKQGILPT-GTGNDLSFCEQCVIGKAKRQNFSKSQQTT

Query:  KGILEYVHSDLWGPASTNSLSNSRYFLTFIDDYSRKSWVYFLKTKDQVFEKFKEWKALVEKQTNKVIKCLRTDNGLEFCNEEFNAFCKSQGILRHRTVRY
           LEY++SD+W  +   S+ N RY++ F+D ++R +W+Y LK K QV + F  +K+LVE +    I  L +DNG EF       +    GI    +  +
Subjt:  KGILEYVHSDLWGPASTNSLSNSRYFLTFIDDYSRKSWVYFLKTKDQVFEKFKEWKALVEKQTNKVIKCLRTDNGLEFCNEEFNAFCKSQGILRHRTVRY

Query:  TPQQNGVAERLYRTIMERVRCQLSDAMLPERYWAEAASYTVYTLNRCPHTSLNFLTPEEKWSGKPPKLQHLKVFGCVSFV---HQNQGKLKARAIKCMFL
        TP+ NG++ER +R I+E     LS A +P+ YW  A S  VY +NR P   L   +P +K  G+PP  + LKVFGC  +      N+ KL+ ++ +C F+
Subjt:  TPQQNGVAERLYRTIMERVRCQLSDAMLPERYWAEAASYTVYTLNRCPHTSLNFLTPEEKWSGKPPKLQHLKVFGCVSFV---HQNQGKLKARAIKCMFL

Query:  GFTEGVKGYRLWHPVEKKCINSRDVFFRE---------------QEMYMLQKQSVEKELIEKETQI----------------------------EVEHEL
        G++     Y   H    +   SR V F E               QE       +         T +                            +V    
Subjt:  GFTEGVKGYRLWHPVEKKCINSRDVFFRE---------------QEMYMLQKQSVEKELIEKETQI----------------------------EVEHEL

Query:  LPSDSALEPETSLERGGGSEEGEEDTVEVSQEVQDLQNYSLARD-----------RQRRQI-----------VPPARFSE--------------------
        LPS S   P +S E    S  G + T +  Q      N  +  +            Q   +            P    SE                    
Subjt:  LPSDSALEPETSLERGGGSEEGEEDTVEVSQEVQDLQNYSLARD-----------RQRRQI-----------VPPARFSE--------------------

Query:  -----------ADCISMALNVADSISSEEPK---------NFE--EAVNGPNGRQWIEAMNEEMRSLEENDTWTL-KSLPNGYKPIASKWIYKIKEGITG
                    +  SMA    D I     K         N E   A+      +W +AM  E+ +   N TW L    P     +  +WI+  K    G
Subjt:  -----------ADCISMALNVADSISSEEPK---------NFE--EAVNGPNGRQWIEAMNEEMRSLEENDTWTL-KSLPNGYKPIASKWIYKIKEGITG

Query:  ILKPRFKARLVAKGYTQREGIDYSEIFSPVVKMTSIRMLLSIVVQEDLELDQLDVKTAFLHGFLSENIYMTQPQGYVRKGQENLVCYLNKSIYGLNNHQG
         L  R+KARLVAKGY QR G+DY+E FSPV+K TSIR++L + V     + QLDV  AFL G L++ +YM+QP G+V K + + VC L K+IYGL     
Subjt:  ILKPRFKARLVAKGYTQREGIDYSEIFSPVVKMTSIRMLLSIVVQEDLELDQLDVKTAFLHGFLSENIYMTQPQGYVRKGQENLVCYLNKSIYGLNNHQG

Query:  A------------GIKGSMKLSLRLVSRKVPSTVVCAKEV--------QTIRSKTSFNPLAHHFK------------LSAMNSPKENELEHQAFM-----
        A            G   S+  +   V ++  S +     V         T+  K + + L+  F             + A   P+   L  + +      
Subjt:  A------------GIKGSMKLSLRLVSRKVPSTVVCAKEV--------QTIRSKTSFNPLAHHFK------------LSAMNSPKENELEHQAFM-----

Query:  -------------------------SKIP----YSQAVGSLMYLMVSTRPDLSYATSLVSRYMSNPGKRHWEAVKWILRLRF---RRGFRPKK-------
                                 +K+P    Y   VGSL YL   TRPDLSYA + +S+YM  P   HW A+K +LR        G   KK       
Subjt:  -------------------------SKIP----YSQAVGSLMYLMVSTRPDLSYATSLVSRYMSNPGKRHWEAVKWILRLRF---RRGFRPKK-------

Query:  ---------------VPIGLFFLLGQNLLSWKATLQSVVALSTTEAEFIALSEAIKEGIWLKGLLNDFGIKQT-TVRIYCDNQSTIFLTKHPQFNNRTKH
                          G    LG + +SW +  Q  V  S+TEAE+ +++    E  W+  LL + GI+ +    IYCDN    +L  +P F++R KH
Subjt:  ---------------VPIGLFFLLGQNLLSWKATLQSVVALSTTEAEFIALSEAIKEGIWLKGLLNDFGIKQT-TVRIYCDNQSTIFLTKHPQFNNRTKH

Query:  IDIKYHFVREEIEKGEVEVLKVHTSDNAADLLTKPLPQLKFQHCLEMVG
        I + YHF+R +++ G + V+ V T D  AD LTKPL ++ FQ+    +G
Subjt:  IDIKYHFVREEIEKGEVEVLKVHTSDNAADLLTKPLPQLKFQHCLEMVG

Arabidopsis top hitse value%identityAlignment
AT3G21000.1 Gag-Pol-related retrotransposon family protein6.6e-0623.08Show/hide
Query:  DFDLWKAKIKAILGQQKALYALA-----DPDKLP---ATISAEDKES------MNMTAYGTIILNLSNNVLRQVIDEETPLKIWAKLNKLYETKDAHNKM
        D+++W    K+ L +Q     +      DP K P   ATI  E+          +  A   +  +L+++V R+ +   +   +W  L K       + + 
Subjt:  DFDLWKAKIKAILGQQKALYALA-----DPDKLP---ATISAEDKES------MNMTAYGTIILNLSNNVLRQVIDEETPLKIWAKLNKLYETKDAHNKM

Query:  YLREKFFTFKMDAGKPLSENLDEFKKMTSEFKNLGEKIGDENEAFVLLNSLPEA------YKDVKNALKYGRENITTDGIISAIRTKELELLSVKKESSE
         +R      +    + L + L++ K +    K  G    D  +A  +L  L  A      Y+  KN         T  G    + +   EL+ V K +S+
Subjt:  YLREKFFTFKMDAGKPLSENLDEFKKMTSEFKNLGEKIGDENEAFVLLNSLPEA------YKDVKNALKYGRENITTDGIISAIRTKELELLSVKKESSE

Query:  GLF------VKGKGKNKETKGQSEE---KSKPKVRCNYCHKEGHIKRECYSLKRKNQYHKSKKNRQSEAAVGENTITYSDALATSDKSCTQNETADKFDW
         L       V      +   G  ++   KSK +  C  C+K  H + +C     K + H  K+ ++ E  V     T  +  A         +T D   W
Subjt:  GLF------VKGKGKNKETKGQSEE---KSKPKVRCNYCHKEGHIKRECYSLKRKNQYHKSKKNRQSEAAVGENTITYSDALATSDKSCTQNETADKFDW

Query:  VIDSGCSFHMTPSKGWFSTYKEWDGGIVYMGNNHTCRVAGIGSVSLRLKDGSVILLRNVRLVPNLKRNLISLGMLDS
        +I      +MTP   +F+T        V   +     V G G V +R+K+G    +RNV  VP L RN++S G + S
Subjt:  VIDSGCSFHMTPSKGWFSTYKEWDGGIVYMGNNHTCRVAGIGSVSLRLKDGSVILLRNVRLVPNLKRNLISLGMLDS

AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 87.7e-4730.48Show/hide
Query:  SEEPKNFEEAVNGPNGRQWIEAMNEEMRSLEENDTWTLKSLPNGYKPIASKWIYKIKEGITGILKPRFKARLVAKGYTQREGIDYSEIFSPVVKMTSIRM
        ++EP  + EA        W  AM++E+ ++E   TW + +LP   KPI  KW+YKIK    G ++ R+KARLVAKGYTQ+EGID+ E FSPV K+TS+++
Subjt:  SEEPKNFEEAVNGPNGRQWIEAMNEEMRSLEENDTWTLKSLPNGYKPIASKWIYKIKEGITGILKPRFKARLVAKGYTQREGIDYSEIFSPVVKMTSIRM

Query:  LLSIVVQEDLELDQLDVKTAFLHGFLSENIYMTQPQGY-VRKGQE---NLVCYLNKSIYGLNNHQGAGIKGSMKLSLRLVS-------------RKVPST
        +L+I    +  L QLD+  AFL+G L E IYM  P GY  R+G     N VCYL KSIYGL     A  +  +K S+ L+               K+ +T
Subjt:  LLSIVVQEDLELDQLDVKTAFLHGFLSENIYMTQPQGY-VRKGQE---NLVCYLNKSIYGLNNHQGAGIKGSMKLSLRLVS-------------RKVPST

Query:  ------------VVCAKE----------------------------VQTIRSKTSFNPLAHHFKLSAMNS-----------PKENELEHQA-----FMSK
                    ++C+                              ++  RS    N     + L  ++            P +  +   A     F+  
Subjt:  ------------VVCAKE----------------------------VQTIRSKTSFNPLAHHFKLSAMNS-----------PKENELEHQA-----FMSK

Query:  IPYSQAVGSLMYLMVSTRPDLSYATSLVSRYMSNPGKRHWEAVKWILR-------------------------LRFRRGFRPKKVPIGLFFLLGQNLLSW
          Y + +G LMYL + TR D+S+A + +S++   P   H +AV  IL                            F+     ++   G    LG +L+SW
Subjt:  IPYSQAVGSLMYLMVSTRPDLSYATSLVSRYMSNPGKRHWEAVKWILR-------------------------LRFRRGFRPKKVPIGLFFLLGQNLLSW

Query:  KATLQSVVALSTTEAEFIALSEAIKEGIWLKGLLNDFGIK-QTTVRIYCDNQSTIFLTKHPQFNNRTKHIDIKYHFVRE
        K+  Q VV+ S+ EAE+ ALS A  E +WL     +  +       ++CDN + I +  +  F+ RTKHI+   H VRE
Subjt:  KATLQSVVALSTTEAEFIALSEAIKEGIWLKGLLNDFGIK-QTTVRIYCDNQSTIFLTKHPQFNNRTKHIDIKYHFVRE

ATMG00300.1 Gag-Pol-related retrotransposon family protein5.0e-2243.22Show/hide
Query:  SSGTIEIRRESRVVLTGNKINGLYVVK-DVQMMESALVVSSDGPTESDLWHKRLSHISSRGLQELAKQGILPTGTGNDLSFCEQCVIGKAKRQNFSKSQQ
        S G +++ +  R +L GN+ + LY+++  V+  ES L  +     E+ LWH RL+H+S RG++ L K+G L +   + L FCE C+ GK  R NFS  Q 
Subjt:  SSGTIEIRRESRVVLTGNKINGLYVVK-DVQMMESALVVSSDGPTESDLWHKRLSHISSRGLQELAKQGILPTGTGNDLSFCEQCVIGKAKRQNFSKSQQ

Query:  TTKGILEYVHSDLWGPAS
        TTK  L+YVHSDLWG  S
Subjt:  TTKGILEYVHSDLWGPAS

ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein2.3e-1443.21Show/hide
Query:  RTIMERVRCQLSDAMLPERYWAEAASYTVYTLNRCPHTSLNFLTPEEKWSGKPPKLQHLKVFGCVSFVHQNQGKLKARAIK
        RTI+E+VR  L +  LP+ + A+AA+  V+ +N+ P T++NF  P+E W    P   +L+ FGCV+++H ++GKLK RA K
Subjt:  RTIMERVRCQLSDAMLPERYWAEAASYTVYTLNRCPHTSLNFLTPEEKWSGKPPKLQHLKVFGCVSFVHQNQGKLKARAIK

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)3.8e-1441.59Show/hide
Query:  EEPKNFEEAVNGPNGRQWIEAMNEEMRSLEENDTWTLKSLPNGYKPIASKWIYKIKEGITGILKPRFKARLVAKGYTQREGIDYSEIFSPVVKMTSIRML
        +EPK+   A+  P    W +AM EE+ +L  N TW L   P     +  KW++K K    G L  R KARLVAKG+ Q EGI + E +SPVV+  +IR +
Subjt:  EEPKNFEEAVNGPNGRQWIEAMNEEMRSLEENDTWTLKSLPNGYKPIASKWIYKIKEGITGILKPRFKARLVAKGYTQREGIDYSEIFSPVVKMTSIRML

Query:  LSIVVQEDLELDQ
        L++  Q  LE+ Q
Subjt:  LSIVVQEDLELDQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTCTTCGACTGTCCATGTAGATACAGTGTAATCTTGTCGAGAATCTTGGTTATGAGACGAAGCTTGAAGGCGTCTGTAAAGAAGATGATTGATGGTAGAACGATGGA
AGAGTTTGTCTCAGAGGTTCGAATGATGGAGGAAGGGGAAGGGGTTCTCTCGCGAACGAGCAAGACGTACCCAACAAAAAACAACAAGAAGATCAAAACCTCAATACAAA
CAAGTCATGGGTGTCGGGTTTCCGGCGAGGGACGAAAGAGAGATCTGACGAAATCCTTAATACAATGGTACATCATCGTCTTCACTAACTTCCTCCTCTTCGGAATCACA
GATTTCATTTTCCCGCCGCCGGCTCTGCAGTGTGGAATGAGCAGCACTCGGAATTCCGTCGTCTTCTCCAAGTTTCTTCTCGCCGGAGTTACAGTGGCAACCGTCGGCGG
ACTCAGAATTGCGACGACGGTGGCCTGCAAGTGGATCGGATTGTTATGTGGGACTGGAACACGAACGGACGAGGACTCTGTTAGCCATTTTTCTTGTCGACACGAGGATT
ACAAACGGACTGGAACCAACGTGAATCGACTGTCGACGACTGTGTCCGAGGAAGAGAACGACGTTTCGTCTCTGAACAATACGATTTCTAGCATCAGTGGAGTTCAGCCT
CGAAATTCCGATGAGATTCCGATGAGATTTGGGCGTGTGTGCGTGAGAGGGACAGTGAGAGAGAGAGGAATGGTGAGGGAGAGAAAGAGAGGGACGGTGAGGGAGAGAAA
GAGAGAAGAAAAAATTGTCGCCGGCGACCGGATCGACGGTCGATGGTGGTCCTCAGCAGCCGACGGCAGGAGGGTTAGGGTTTTGTGGTATCAGAGCAAGGAAGATCAAC
CAAGAACTCAGATCTTGAAGCATTCAATGGCGTTGGCAAGATTTGATATAGAGAAATTTGATGGAAAGGGAGATTTCGACTTGTGGAAGGCCAAGATCAAGGCAATTCTT
GGCCAACAGAAGGCACTTTACGCACTGGCAGACCCTGATAAACTGCCAGCAACCATCTCTGCCGAAGACAAAGAAAGCATGAACATGACTGCATATGGGACCATCATTCT
GAACTTGAGTAACAATGTGTTGCGGCAAGTCATAGATGAGGAGACTCCATTAAAGATTTGGGCAAAGTTGAATAAACTCTATGAGACTAAGGATGCACATAATAAGATGT
ATCTTAGAGAGAAGTTCTTTACATTCAAGATGGATGCTGGAAAACCCCTATCTGAAAATCTGGATGAATTTAAGAAGATGACCTCAGAATTTAAGAATCTTGGAGAAAAG
ATTGGGGATGAGAATGAAGCCTTTGTCCTATTAAATTCACTTCCAGAAGCCTATAAAGATGTTAAAAATGCTTTGAAATATGGCAGGGAGAATATAACCACTGATGGGAT
AATCTCTGCTATTCGAACAAAGGAGCTTGAGCTTCTGTCAGTAAAGAAAGAGTCTTCAGAAGGACTCTTTGTGAAAGGGAAAGGAAAGAACAAGGAAACCAAAGGCCAAT
CCGAAGAAAAATCAAAGCCCAAGGTGAGATGTAATTACTGCCACAAGGAAGGGCACATCAAGCGAGAATGTTACTCATTGAAGCGGAAAAATCAGTACCATAAATCAAAG
AAAAACAGACAATCTGAGGCTGCTGTGGGAGAAAACACTATCACTTACTCAGATGCATTAGCAACCTCAGACAAGAGTTGTACACAAAATGAAACAGCAGATAAATTTGA
CTGGGTGATAGATTCAGGATGCTCGTTTCACATGACTCCATCCAAGGGTTGGTTCAGCACTTATAAAGAATGGGATGGTGGAATTGTGTACATGGGAAACAACCATACCT
GTCGGGTGGCAGGAATTGGATCAGTTTCTTTAAGGCTGAAGGATGGCTCAGTTATCTTGTTGAGGAATGTTCGACTTGTGCCTAACCTTAAAAGGAACTTGATTTCTTTA
GGCATGTTGGACTCAATTGGATGCACATATGGAGGCAGCAGTGGTACTATTGAAATCAGAAGAGAATCTCGAGTTGTACTGACAGGAAACAAGATAAATGGCTTGTATGT
TGTCAAGGATGTGCAGATGATGGAATCAGCTCTTGTAGTTTCATCGGATGGGCCTACTGAAAGTGACTTATGGCACAAAAGGCTTTCCCATATTAGCTCAAGAGGACTAC
AAGAATTGGCCAAACAAGGAATTCTTCCAACTGGAACAGGAAATGATCTGTCTTTTTGTGAGCAGTGTGTTATTGGAAAGGCAAAAAGACAGAACTTTTCTAAATCTCAA
CAAACAACAAAGGGTATTCTGGAATATGTGCATTCAGACCTATGGGGGCCAGCCTCTACAAACTCCCTGAGTAACTCAAGGTATTTCTTAACCTTCATAGACGATTACTC
TAGAAAGAGTTGGGTTTATTTCCTAAAAACTAAAGACCAAGTATTTGAAAAATTCAAAGAATGGAAAGCACTAGTTGAGAAGCAAACCAACAAAGTTATTAAATGCCTAA
GAACTGATAACGGTTTAGAATTTTGTAATGAGGAGTTTAATGCTTTCTGTAAGTCACAAGGGATACTTAGACACAGAACCGTGAGGTACACTCCTCAGCAAAATGGAGTG
GCAGAAAGATTATACAGAACCATCATGGAAAGAGTTCGCTGCCAGTTATCAGATGCTATGTTGCCTGAACGATATTGGGCAGAAGCTGCTTCCTACACGGTATACACTCT
AAACAGGTGTCCACATACTTCCTTGAATTTCCTAACTCCTGAAGAGAAATGGTCAGGGAAACCTCCAAAACTACAACACCTAAAGGTATTTGGATGTGTGAGTTTTGTCC
ATCAGAACCAAGGGAAATTGAAGGCCAGAGCCATAAAATGTATGTTCTTAGGCTTTACCGAGGGAGTTAAGGGATACAGATTATGGCATCCTGTGGAAAAGAAGTGTATA
AACAGTCGTGATGTGTTTTTTAGGGAACAAGAGATGTATATGCTCCAAAAACAGTCAGTGGAGAAAGAATTGATAGAAAAAGAAACTCAGATTGAGGTGGAGCATGAATT
ACTTCCTTCAGATTCTGCTCTAGAACCAGAAACTTCACTAGAAAGAGGTGGAGGTTCAGAAGAAGGAGAAGAAGACACAGTTGAAGTATCTCAGGAAGTACAAGACCTTC
AAAATTACTCTTTGGCGAGAGATAGACAAAGGAGACAAATAGTTCCACCAGCCAGGTTTAGTGAAGCCGACTGTATATCCATGGCTTTAAATGTGGCTGACTCCATAAGC
AGTGAAGAGCCAAAGAATTTCGAAGAAGCAGTGAATGGTCCGAATGGGAGGCAATGGATCGAAGCCATGAATGAAGAGATGAGATCTTTAGAGGAAAATGATACTTGGAC
TCTAAAATCTCTACCTAATGGCTACAAACCGATTGCCTCCAAATGGATATACAAAATTAAGGAAGGAATCACAGGAATCCTTAAACCAAGGTTCAAAGCTCGGTTAGTAG
CAAAGGGCTACACGCAAAGGGAAGGCATTGACTACTCAGAAATATTCTCACCTGTAGTCAAGATGACCTCCATCAGGATGCTGTTATCTATTGTGGTTCAAGAAGACCTA
GAATTGGACCAATTAGATGTAAAAACAGCATTTCTTCATGGGTTCTTGTCAGAAAACATTTACATGACACAACCTCAGGGGTATGTGAGAAAGGGGCAAGAGAACCTGGT
TTGTTACTTAAATAAGTCTATTTATGGGTTAAACAATCACCAAGGTGCTGGTATCAAAGGTTCAATGAAACTATCTCTAAGATTGGTTTCCAGAAAAGTTCCTTCGACAG
TTGTGTGTGCTAAAGAGGTTCAAACTATCCGAAGCAAAACAAGTTTCAACCCCTTGGCACACCACTTCAAATTGTCAGCAATGAACTCTCCAAAGGAAAATGAACTGGAA
CACCAAGCATTTATGTCCAAGATACCCTACTCCCAAGCCGTGGGAAGTCTAATGTATCTTATGGTATCCACGAGACCGGATCTTTCCTATGCTACAAGTCTAGTGAGCAG
ATATATGTCCAATCCGGGCAAAAGGCATTGGGAGGCCGTGAAATGGATACTAAGACTCAGATTTCGCAGGGGATTTAGACCGAAGAAGGTCCCTATCGGGCTATTTTTCC
TACTTGGTCAGAACCTATTGAGCTGGAAAGCTACCTTGCAATCAGTAGTAGCTCTATCGACTACTGAAGCGGAATTCATAGCGTTGTCAGAGGCAATTAAGGAAGGAATA
TGGCTTAAGGGATTGTTGAATGACTTTGGGATAAAACAAACAACAGTGCGCATATACTGTGATAATCAAAGTACCATATTCCTAACTAAACATCCTCAATTCAACAACCG
AACAAAGCACATAGATATTAAATATCATTTTGTGCGAGAGGAAATAGAAAAGGGAGAAGTTGAAGTCCTAAAGGTTCATACCTCTGACAACGCTGCCGATTTGTTAACCA
AACCCCTTCCTCAACTGAAATTTCAGCATTGTTTGGAAATGGTAGGCTTTTGCTTACCGGAAAAGGGTGATAAAAGACAAATCAGAAAAGGAGGGAAGAATATTGCTCAG
ATCTTGATAGATTGCAAAGGTGGAGAATGTTAG
mRNA sequenceShow/hide mRNA sequence
ATGTTCTTCGACTGTCCATGTAGATACAGTGTAATCTTGTCGAGAATCTTGGTTATGAGACGAAGCTTGAAGGCGTCTGTAAAGAAGATGATTGATGGTAGAACGATGGA
AGAGTTTGTCTCAGAGGTTCGAATGATGGAGGAAGGGGAAGGGGTTCTCTCGCGAACGAGCAAGACGTACCCAACAAAAAACAACAAGAAGATCAAAACCTCAATACAAA
CAAGTCATGGGTGTCGGGTTTCCGGCGAGGGACGAAAGAGAGATCTGACGAAATCCTTAATACAATGGTACATCATCGTCTTCACTAACTTCCTCCTCTTCGGAATCACA
GATTTCATTTTCCCGCCGCCGGCTCTGCAGTGTGGAATGAGCAGCACTCGGAATTCCGTCGTCTTCTCCAAGTTTCTTCTCGCCGGAGTTACAGTGGCAACCGTCGGCGG
ACTCAGAATTGCGACGACGGTGGCCTGCAAGTGGATCGGATTGTTATGTGGGACTGGAACACGAACGGACGAGGACTCTGTTAGCCATTTTTCTTGTCGACACGAGGATT
ACAAACGGACTGGAACCAACGTGAATCGACTGTCGACGACTGTGTCCGAGGAAGAGAACGACGTTTCGTCTCTGAACAATACGATTTCTAGCATCAGTGGAGTTCAGCCT
CGAAATTCCGATGAGATTCCGATGAGATTTGGGCGTGTGTGCGTGAGAGGGACAGTGAGAGAGAGAGGAATGGTGAGGGAGAGAAAGAGAGGGACGGTGAGGGAGAGAAA
GAGAGAAGAAAAAATTGTCGCCGGCGACCGGATCGACGGTCGATGGTGGTCCTCAGCAGCCGACGGCAGGAGGGTTAGGGTTTTGTGGTATCAGAGCAAGGAAGATCAAC
CAAGAACTCAGATCTTGAAGCATTCAATGGCGTTGGCAAGATTTGATATAGAGAAATTTGATGGAAAGGGAGATTTCGACTTGTGGAAGGCCAAGATCAAGGCAATTCTT
GGCCAACAGAAGGCACTTTACGCACTGGCAGACCCTGATAAACTGCCAGCAACCATCTCTGCCGAAGACAAAGAAAGCATGAACATGACTGCATATGGGACCATCATTCT
GAACTTGAGTAACAATGTGTTGCGGCAAGTCATAGATGAGGAGACTCCATTAAAGATTTGGGCAAAGTTGAATAAACTCTATGAGACTAAGGATGCACATAATAAGATGT
ATCTTAGAGAGAAGTTCTTTACATTCAAGATGGATGCTGGAAAACCCCTATCTGAAAATCTGGATGAATTTAAGAAGATGACCTCAGAATTTAAGAATCTTGGAGAAAAG
ATTGGGGATGAGAATGAAGCCTTTGTCCTATTAAATTCACTTCCAGAAGCCTATAAAGATGTTAAAAATGCTTTGAAATATGGCAGGGAGAATATAACCACTGATGGGAT
AATCTCTGCTATTCGAACAAAGGAGCTTGAGCTTCTGTCAGTAAAGAAAGAGTCTTCAGAAGGACTCTTTGTGAAAGGGAAAGGAAAGAACAAGGAAACCAAAGGCCAAT
CCGAAGAAAAATCAAAGCCCAAGGTGAGATGTAATTACTGCCACAAGGAAGGGCACATCAAGCGAGAATGTTACTCATTGAAGCGGAAAAATCAGTACCATAAATCAAAG
AAAAACAGACAATCTGAGGCTGCTGTGGGAGAAAACACTATCACTTACTCAGATGCATTAGCAACCTCAGACAAGAGTTGTACACAAAATGAAACAGCAGATAAATTTGA
CTGGGTGATAGATTCAGGATGCTCGTTTCACATGACTCCATCCAAGGGTTGGTTCAGCACTTATAAAGAATGGGATGGTGGAATTGTGTACATGGGAAACAACCATACCT
GTCGGGTGGCAGGAATTGGATCAGTTTCTTTAAGGCTGAAGGATGGCTCAGTTATCTTGTTGAGGAATGTTCGACTTGTGCCTAACCTTAAAAGGAACTTGATTTCTTTA
GGCATGTTGGACTCAATTGGATGCACATATGGAGGCAGCAGTGGTACTATTGAAATCAGAAGAGAATCTCGAGTTGTACTGACAGGAAACAAGATAAATGGCTTGTATGT
TGTCAAGGATGTGCAGATGATGGAATCAGCTCTTGTAGTTTCATCGGATGGGCCTACTGAAAGTGACTTATGGCACAAAAGGCTTTCCCATATTAGCTCAAGAGGACTAC
AAGAATTGGCCAAACAAGGAATTCTTCCAACTGGAACAGGAAATGATCTGTCTTTTTGTGAGCAGTGTGTTATTGGAAAGGCAAAAAGACAGAACTTTTCTAAATCTCAA
CAAACAACAAAGGGTATTCTGGAATATGTGCATTCAGACCTATGGGGGCCAGCCTCTACAAACTCCCTGAGTAACTCAAGGTATTTCTTAACCTTCATAGACGATTACTC
TAGAAAGAGTTGGGTTTATTTCCTAAAAACTAAAGACCAAGTATTTGAAAAATTCAAAGAATGGAAAGCACTAGTTGAGAAGCAAACCAACAAAGTTATTAAATGCCTAA
GAACTGATAACGGTTTAGAATTTTGTAATGAGGAGTTTAATGCTTTCTGTAAGTCACAAGGGATACTTAGACACAGAACCGTGAGGTACACTCCTCAGCAAAATGGAGTG
GCAGAAAGATTATACAGAACCATCATGGAAAGAGTTCGCTGCCAGTTATCAGATGCTATGTTGCCTGAACGATATTGGGCAGAAGCTGCTTCCTACACGGTATACACTCT
AAACAGGTGTCCACATACTTCCTTGAATTTCCTAACTCCTGAAGAGAAATGGTCAGGGAAACCTCCAAAACTACAACACCTAAAGGTATTTGGATGTGTGAGTTTTGTCC
ATCAGAACCAAGGGAAATTGAAGGCCAGAGCCATAAAATGTATGTTCTTAGGCTTTACCGAGGGAGTTAAGGGATACAGATTATGGCATCCTGTGGAAAAGAAGTGTATA
AACAGTCGTGATGTGTTTTTTAGGGAACAAGAGATGTATATGCTCCAAAAACAGTCAGTGGAGAAAGAATTGATAGAAAAAGAAACTCAGATTGAGGTGGAGCATGAATT
ACTTCCTTCAGATTCTGCTCTAGAACCAGAAACTTCACTAGAAAGAGGTGGAGGTTCAGAAGAAGGAGAAGAAGACACAGTTGAAGTATCTCAGGAAGTACAAGACCTTC
AAAATTACTCTTTGGCGAGAGATAGACAAAGGAGACAAATAGTTCCACCAGCCAGGTTTAGTGAAGCCGACTGTATATCCATGGCTTTAAATGTGGCTGACTCCATAAGC
AGTGAAGAGCCAAAGAATTTCGAAGAAGCAGTGAATGGTCCGAATGGGAGGCAATGGATCGAAGCCATGAATGAAGAGATGAGATCTTTAGAGGAAAATGATACTTGGAC
TCTAAAATCTCTACCTAATGGCTACAAACCGATTGCCTCCAAATGGATATACAAAATTAAGGAAGGAATCACAGGAATCCTTAAACCAAGGTTCAAAGCTCGGTTAGTAG
CAAAGGGCTACACGCAAAGGGAAGGCATTGACTACTCAGAAATATTCTCACCTGTAGTCAAGATGACCTCCATCAGGATGCTGTTATCTATTGTGGTTCAAGAAGACCTA
GAATTGGACCAATTAGATGTAAAAACAGCATTTCTTCATGGGTTCTTGTCAGAAAACATTTACATGACACAACCTCAGGGGTATGTGAGAAAGGGGCAAGAGAACCTGGT
TTGTTACTTAAATAAGTCTATTTATGGGTTAAACAATCACCAAGGTGCTGGTATCAAAGGTTCAATGAAACTATCTCTAAGATTGGTTTCCAGAAAAGTTCCTTCGACAG
TTGTGTGTGCTAAAGAGGTTCAAACTATCCGAAGCAAAACAAGTTTCAACCCCTTGGCACACCACTTCAAATTGTCAGCAATGAACTCTCCAAAGGAAAATGAACTGGAA
CACCAAGCATTTATGTCCAAGATACCCTACTCCCAAGCCGTGGGAAGTCTAATGTATCTTATGGTATCCACGAGACCGGATCTTTCCTATGCTACAAGTCTAGTGAGCAG
ATATATGTCCAATCCGGGCAAAAGGCATTGGGAGGCCGTGAAATGGATACTAAGACTCAGATTTCGCAGGGGATTTAGACCGAAGAAGGTCCCTATCGGGCTATTTTTCC
TACTTGGTCAGAACCTATTGAGCTGGAAAGCTACCTTGCAATCAGTAGTAGCTCTATCGACTACTGAAGCGGAATTCATAGCGTTGTCAGAGGCAATTAAGGAAGGAATA
TGGCTTAAGGGATTGTTGAATGACTTTGGGATAAAACAAACAACAGTGCGCATATACTGTGATAATCAAAGTACCATATTCCTAACTAAACATCCTCAATTCAACAACCG
AACAAAGCACATAGATATTAAATATCATTTTGTGCGAGAGGAAATAGAAAAGGGAGAAGTTGAAGTCCTAAAGGTTCATACCTCTGACAACGCTGCCGATTTGTTAACCA
AACCCCTTCCTCAACTGAAATTTCAGCATTGTTTGGAAATGGTAGGCTTTTGCTTACCGGAAAAGGGTGATAAAAGACAAATCAGAAAAGGAGGGAAGAATATTGCTCAG
ATCTTGATAGATTGCAAAGGTGGAGAATGTTAG
Protein sequenceShow/hide protein sequence
MFFDCPCRYSVILSRILVMRRSLKASVKKMIDGRTMEEFVSEVRMMEEGEGVLSRTSKTYPTKNNKKIKTSIQTSHGCRVSGEGRKRDLTKSLIQWYIIVFTNFLLFGIT
DFIFPPPALQCGMSSTRNSVVFSKFLLAGVTVATVGGLRIATTVACKWIGLLCGTGTRTDEDSVSHFSCRHEDYKRTGTNVNRLSTTVSEEENDVSSLNNTISSISGVQP
RNSDEIPMRFGRVCVRGTVRERGMVRERKRGTVRERKREEKIVAGDRIDGRWWSSAADGRRVRVLWYQSKEDQPRTQILKHSMALARFDIEKFDGKGDFDLWKAKIKAIL
GQQKALYALADPDKLPATISAEDKESMNMTAYGTIILNLSNNVLRQVIDEETPLKIWAKLNKLYETKDAHNKMYLREKFFTFKMDAGKPLSENLDEFKKMTSEFKNLGEK
IGDENEAFVLLNSLPEAYKDVKNALKYGRENITTDGIISAIRTKELELLSVKKESSEGLFVKGKGKNKETKGQSEEKSKPKVRCNYCHKEGHIKRECYSLKRKNQYHKSK
KNRQSEAAVGENTITYSDALATSDKSCTQNETADKFDWVIDSGCSFHMTPSKGWFSTYKEWDGGIVYMGNNHTCRVAGIGSVSLRLKDGSVILLRNVRLVPNLKRNLISL
GMLDSIGCTYGGSSGTIEIRRESRVVLTGNKINGLYVVKDVQMMESALVVSSDGPTESDLWHKRLSHISSRGLQELAKQGILPTGTGNDLSFCEQCVIGKAKRQNFSKSQ
QTTKGILEYVHSDLWGPASTNSLSNSRYFLTFIDDYSRKSWVYFLKTKDQVFEKFKEWKALVEKQTNKVIKCLRTDNGLEFCNEEFNAFCKSQGILRHRTVRYTPQQNGV
AERLYRTIMERVRCQLSDAMLPERYWAEAASYTVYTLNRCPHTSLNFLTPEEKWSGKPPKLQHLKVFGCVSFVHQNQGKLKARAIKCMFLGFTEGVKGYRLWHPVEKKCI
NSRDVFFREQEMYMLQKQSVEKELIEKETQIEVEHELLPSDSALEPETSLERGGGSEEGEEDTVEVSQEVQDLQNYSLARDRQRRQIVPPARFSEADCISMALNVADSIS
SEEPKNFEEAVNGPNGRQWIEAMNEEMRSLEENDTWTLKSLPNGYKPIASKWIYKIKEGITGILKPRFKARLVAKGYTQREGIDYSEIFSPVVKMTSIRMLLSIVVQEDL
ELDQLDVKTAFLHGFLSENIYMTQPQGYVRKGQENLVCYLNKSIYGLNNHQGAGIKGSMKLSLRLVSRKVPSTVVCAKEVQTIRSKTSFNPLAHHFKLSAMNSPKENELE
HQAFMSKIPYSQAVGSLMYLMVSTRPDLSYATSLVSRYMSNPGKRHWEAVKWILRLRFRRGFRPKKVPIGLFFLLGQNLLSWKATLQSVVALSTTEAEFIALSEAIKEGI
WLKGLLNDFGIKQTTVRIYCDNQSTIFLTKHPQFNNRTKHIDIKYHFVREEIEKGEVEVLKVHTSDNAADLLTKPLPQLKFQHCLEMVGFCLPEKGDKRQIRKGGKNIAQ
ILIDCKGGEC