; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc03g0063751 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc03g0063751
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionReverse transcriptase
Genome locationCMiso1.1chr03:4260078..4265233
RNA-Seq ExpressionCmc03g0063751
SyntenyCmc03g0063751
Gene Ontology termsGO:0006807 - nitrogen compound metabolic process (biological process)
GO:0043170 - macromolecule metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0016740 - transferase activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0054966.1 transposon Ty3-I Gag-Pol polyprotein isoform X1 [Cucumis melo var. makuwa]0.0e+0095.11Show/hide
Query:  MKIDLPSYDGKRNIEIFLDWLKNTENFFAYMRTAKNKKVHLVALKLKGGVSAWWDQITVNHQKQGKHPIRSWEKMKKLMKQRFVPSNYEQTLYTQYQNCR
        MKIDLPSYDGKRNIE FLDWLKNTENFFAYM T KNKKVHLVALKLKGG SAWWDQITVN QKQGKHPIRSWEKMKKLMKQRFVP NYEQTLYTQYQNCR
Subjt:  MKIDLPSYDGKRNIEIFLDWLKNTENFFAYMRTAKNKKVHLVALKLKGGVSAWWDQITVNHQKQGKHPIRSWEKMKKLMKQRFVPSNYEQTLYTQYQNCR

Query:  QGMRKTAEYIEEFHRLGARTNLMEGEQHLISWFVGGLRFDLKEKVKLQPFQHLSEAITYAETVEEMIENRAKSTRKRPWEPSASKKTTAGNSKLKNSTSE
        QGMRKTAEYIEEFHRLG RTNLMEGE+HLISWFVGGLRFDLKEKVKLQPFQHLSEAITYAETVEEMIENRAKSTRKRPWEPSASKKTTAGNSKLKN+TSE
Subjt:  QGMRKTAEYIEEFHRLGARTNLMEGEQHLISWFVGGLRFDLKEKVKLQPFQHLSEAITYAETVEEMIENRAKSTRKRPWEPSASKKTTAGNSKLKNSTSE

Query:  KPVEQEESSGKKEVPEGEKKGKNPYQRLFSGNCYRRGQIGHPSNQYPQRKTIAVAEDNDDGSNRSLGEFDEKTEVIEADEGDSLSCILLRVLISPKEENQ
        KPVEQEESSGKKEVPEGEKKGKNPYQR FSGNCYR GQ+GHPSNQ PQRKTIAVA+DNDDGSNRSLGEFDE+TEVIEADEGDSLSCIL RVLISPKEENQ
Subjt:  KPVEQEESSGKKEVPEGEKKGKNPYQRLFSGNCYRRGQIGHPSNQYPQRKTIAVAEDNDDGSNRSLGEFDEKTEVIEADEGDSLSCILLRVLISPKEENQ

Query:  LQRHSLFKTRCTIQGKVCNVIIDSGSSENFVSKKLVTALNLKTQPHEKPYKIGWIKKGGETLISEICYVPISIGNSYKDQMVCDVIKIDVCHILLGRPWQ
        LQRHSLFKTRCTIQGKVCNVIIDSGSSENFVSKKLVTALNLKTQPHEKPYKIGWIKKGGETLISEICYVP+SIGNSYKDQMVCDVI++DVCHILLGRPWQ
Subjt:  LQRHSLFKTRCTIQGKVCNVIIDSGSSENFVSKKLVTALNLKTQPHEKPYKIGWIKKGGETLISEICYVPISIGNSYKDQMVCDVIKIDVCHILLGRPWQ

Query:  FDVQSMHRGRENTYEFIWINKKVILLPLQKRKDDNIEKNQKKGSLFVTISGKKFLRERENEILGIVMSGTEDTTRDEQIPEAIKELFKKYPKISKEPTCL
        FDVQSMHRGRENTYEF+W+NKKVILLPLQKRKDDNIEKNQKKGSLFVTISGKKFLRERENEILGIVMSGTEDTTRDEQIPEAIKELFKKYPKISKEPTCL
Subjt:  FDVQSMHRGRENTYEFIWINKKVILLPLQKRKDDNIEKNQKKGSLFVTISGKKFLRERENEILGIVMSGTEDTTRDEQIPEAIKELFKKYPKISKEPTCL

Query:  PPLRDIQHDIELLLGASFPHLPHYRMSPNEYKILHDTIEELLKKGHIKPSFSLCAVPALLTPKKDGTWRMCVDSRAINKITVKYRFSIPRVSDLLDQLGG
        PPLRDI H+IELL GASFPHLPHY MSPNEYKILHD IEELLKKGHIKPSFSLC VPALLTPKKDGTWRMCVDSRAINKITVKYRF IPRVSDLLDQLGG
Subjt:  PPLRDIQHDIELLLGASFPHLPHYRMSPNEYKILHDTIEELLKKGHIKPSFSLCAVPALLTPKKDGTWRMCVDSRAINKITVKYRFSIPRVSDLLDQLGG

Query:  ACIFLKIDLRSYYHQIRIRPGDEWKTTFKTNEGLFEWLVMPFGLSNAPSTFMRLMNKVLHPFLNKFIIVYFDDILVFSKTYDQHLQHIDQLFQVLNHNEL
        ACIF KIDLRS YHQIRIRPGDEWKT FKTNEGLFEWLVMPF LSNAPSTFMRLMNKVLHPFLNKFIIVYFDDILVFSKTYDQHLQHIDQLFQVLNHNEL
Subjt:  ACIFLKIDLRSYYHQIRIRPGDEWKTTFKTNEGLFEWLVMPFGLSNAPSTFMRLMNKVLHPFLNKFIIVYFDDILVFSKTYDQHLQHIDQLFQVLNHNEL

Query:  YVNLKRCIFCSNEIAFLGFIIRKDHVLMDEKKVEAIKNWSTPTTVKQVQEFLGLASFYRKFIQNFSSIAAPITDCLKKGAFEWGPKQQDSFNLLKQSLSN
        YVNLK+CIFCSNEIAFLGFIIRKDHVLMDEKKVEAIKNWSTPTTV QVQ FLGLASFYRKFIQN SSIAAPITDCLKKGAFEWGPKQQDSFNLLK+SL N
Subjt:  YVNLKRCIFCSNEIAFLGFIIRKDHVLMDEKKVEAIKNWSTPTTVKQVQEFLGLASFYRKFIQNFSSIAAPITDCLKKGAFEWGPKQQDSFNLLKQSLSN

Query:  ISVLKLPDFFQDFEVAVDTCGTGIGAVLSQQGHPIEYFSEKLSKSRQSWSTYEQELYALVRALKQWEHYLLSREFILLTDHFSLKYLHSQKSISMMHARW
        I VLKLPDFFQ FEVAVDTCGTGIGAVLSQQ HPIEYFSE+LSKSRQSWSTYEQELYALVRALKQWEHYLLSREFILLTDHFSLKYLHSQKSISMMHARW
Subjt:  ISVLKLPDFFQDFEVAVDTCGTGIGAVLSQQGHPIEYFSEKLSKSRQSWSTYEQELYALVRALKQWEHYLLSREFILLTDHFSLKYLHSQKSISMMHARW

Query:  ISYIQRFDFVIKHQAGKENKVVDALNRKSNLLFILSSEITAFNHITKLYKKDRDFESIWEK
        ISY QRFDFVIKHQAGKENKV DAL+RKSNLLFILSSEIT FNHIT+LYKKDRDFESIWEK
Subjt:  ISYIQRFDFVIKHQAGKENKVVDALNRKSNLLFILSSEITAFNHITKLYKKDRDFESIWEK

XP_023878077.1 uncharacterized protein LOC111990526 [Quercus suber]2.6e-27348.78Show/hide
Query:  MKIDLPSYDGKRNIEIFLDWLKNTENFFAYMRTAKNKKVHLVALKLKGGVSAWWDQITVNHQKQGKHPIRSWEKMKKLMKQRFVPSNYEQTLYTQYQNCR
        MKI+LPS++G  +IE +LDW+   E FF YM TA++K+V LVA KLKGG SAWWD++ +N  ++ K PIRSW +MK+LM  RF+P +Y+Q L+ QYQ+CR
Subjt:  MKIDLPSYDGKRNIEIFLDWLKNTENFFAYMRTAKNKKVHLVALKLKGGVSAWWDQITVNHQKQGKHPIRSWEKMKKLMKQRFVPSNYEQTLYTQYQNCR

Query:  QGMRKTAEYIEEFHRLGARTNLMEGEQHLISWFVGGLRFDLKEKVKLQPFQHLSEAITYAETVEEMIENRAKSTRKRPWEPSASKKTTAGNSKLKNSTSE
        QG R   EY+EEF RL  R +L E E   IS FV GLR  ++++V LQ    L+EA+T ++ +E            R  E S+S     G   +    S+
Subjt:  QGMRKTAEYIEEFHRLGARTNLMEGEQHLISWFVGGLRFDLKEKVKLQPFQHLSEAITYAETVEEMIENRAKSTRKRPWEPSASKKTTAGNSKLKNSTSE

Query:  KPVEQEESSGKKEVPEGEKKGK-----NPYQRLFSGNCYRRGQIGHPSNQYPQRKTI--AVAEDND-DGSNRSLGEFDE-------KTEVIEADEGDSL-
          V +     +  V       +     NPY R     CYR G+ GH S+  P+R  +   VAE+ + +G       ++E         EV E +EG  L 
Subjt:  KPVEQEESSGKKEVPEGEKKGK-----NPYQRLFSGNCYRRGQIGHPSNQYPQRKTI--AVAEDND-DGSNRSLGEFDE-------KTEVIEADEGDSL-

Query:  -SCILLRVLISPKEENQLQRHSLFKTRCTIQGKVCNVIIDSGSSENFVSKKLVTALNLKTQPHEKPYKIGWIKKGGETLISEICYVPISIGNSYKDQMVC
         S ++ R+L++P+ E   QR+ +F+ RCTI  +VC++IIDSGS EN  SK LVT L LKT+ H  PYKIGWIKKG ETL+++ C V  S+G  Y D++VC
Subjt:  -SCILLRVLISPKEENQLQRHSLFKTRCTIQGKVCNVIIDSGSSENFVSKKLVTALNLKTQPHEKPYKIGWIKKGGETLISEICYVPISIGNSYKDQMVC

Query:  DVIKIDVCHILLGRPWQFDVQSMHRGRENTYEFIWINKKVILLPLQKRKDDNIEKNQKKGSLFVTISGKKFLRERE--NEILGIVMSGTEDTTRDEQIPE
        DV+++D CH++LGRPWQ+DV + H+ ++N Y F    +K++L P+++       K + K +L +  +   F RE +   ++  +V++  E   +  +IPE
Subjt:  DVIKIDVCHILLGRPWQFDVQSMHRGRENTYEFIWINKKVILLPLQKRKDDNIEKNQKKGSLFVTISGKKFLRERE--NEILGIVMSGTEDTTRDEQIPE

Query:  AIKELFKKYPKISKE--PTCLPPLRDIQHDIELLLGASFPHLPHYRMSPNEYKILHDTIEELLKKGHIKPSFSLCAVPALLTPKKDGTWRMCVDSRAINK
        AI+ L K++ ++  E  P  LPP+RDIQH I+L+ GAS P+LPHYRM+P E +IL   ++ELL KG I+ S S CAVPALLTPKKDG+WRMCVDSRAINK
Subjt:  AIKELFKKYPKISKE--PTCLPPLRDIQHDIELLLGASFPHLPHYRMSPNEYKILHDTIEELLKKGHIKPSFSLCAVPALLTPKKDGTWRMCVDSRAINK

Query:  ITVKYRFSIPRVSDLLDQLGGACIFLKIDLRSYYHQIRIRPGDEWKTTFKTNEGLFEWLVMPFGLSNAPSTFMRLMNKVLHPFLNKFIIVYFDDILVFSK
        ITV+YRF IPR+ D+ D L G+  + K+DL+S YHQIRIRPGDEWKT FKT EGL+EW+VMPFGLSNAPSTFMRLMN+VL PF+ KF++VYFDDIL++SK
Subjt:  ITVKYRFSIPRVSDLLDQLGGACIFLKIDLRSYYHQIRIRPGDEWKTTFKTNEGLFEWLVMPFGLSNAPSTFMRLMNKVLHPFLNKFIIVYFDDILVFSK

Query:  TYDQHLQHIDQLFQVLNHNELYVNLKRCIFCSNEIAFLGFIIRKDHVLMDEKKVEAIKNWSTPTTVKQVQEFLGLASFYRKFIQNFSSIAAPITDCLKKG
        T   H  H+ ++  VL  NELY+NLK+C F ++++ FLG+++  D + +DE KV A++ W TP TV  V+ F GLA+FYR+F+++FSSI APIT+CLKKG
Subjt:  TYDQHLQHIDQLFQVLNHNELYVNLKRCIFCSNEIAFLGFIIRKDHVLMDEKKVEAIKNWSTPTTVKQVQEFLGLASFYRKFIQNFSSIAAPITDCLKKG

Query:  AFEWGPKQQDSFNLLKQSLSNISVLKLPDFFQDFEVAVDTCGTGIGAVLSQQGHPIEYFSEKLSKSRQSWSTYEQELYALVRALKQWEHYLLSREFILLT
         F WG + + SF L+K+ LS   VL LP+F + F+V  D    GIGAVLSQ   P+ +FSEK+ ++R  WS YE E +A+VR LK WEHYL+ REF+L T
Subjt:  AFEWGPKQQDSFNLLKQSLSNISVLKLPDFFQDFEVAVDTCGTGIGAVLSQQGHPIEYFSEKLSKSRQSWSTYEQELYALVRALKQWEHYLLSREFILLT

Query:  DHFSLKYLHSQKSISMMHARWISYIQRFDFVIKHQAGKENKVVDALNRKSNLLFILSSEITAFNHITKLYKKDRDFESIWEK
        DH +LK+++SQ SI+ MHARW++YIQRF F +KH++G  NKV DAL+R+++LL  L +E+  F+ + +LY+ D DF  IW K
Subjt:  DHFSLKYLHSQKSISMMHARWISYIQRFDFVIKHQAGKENKVVDALNRKSNLLFILSSEITAFNHITKLYKKDRDFESIWEK

XP_024932991.1 uncharacterized protein LOC112492819 [Ziziphus jujuba]1.2e-28150.1Show/hide
Query:  MKIDLPSYDGKRNIEIFLDWLKNTENFFAYMRTAKNKKVHLVALKLKGGVSAWWDQITVNHQKQGKHPIRSWEKMKKLMKQRFVPSNYEQTLYTQYQNCR
        +K+D+P++DG  NIE FLDW++  E+FF YM   ++K+V LVA K +GG SAWW+Q+  N +KQGK PI+SW +++++++ RF+P ++EQ LY QY +C 
Subjt:  MKIDLPSYDGKRNIEIFLDWLKNTENFFAYMRTAKNKKVHLVALKLKGGVSAWWDQITVNHQKQGKHPIRSWEKMKKLMKQRFVPSNYEQTLYTQYQNCR

Query:  QGMRKTAEYIEEFHRLGARTNLMEGEQHLISWFVGGLRFDLKEKVKLQPFQHLSEAITYAETVEEMIENRAKSTRKRPWEPSASKKTTAGNSKLKNSTSE
        QG R  +E  EEF+RL AR NL E E  L++ +V GL   ++E+++L P  +LSEA+  A  +E+ IE     T  + W+P +        S    +  +
Subjt:  QGMRKTAEYIEEFHRLGARTNLMEGEQHLISWFVGGLRFDLKEKVKLQPFQHLSEAITYAETVEEMIENRAKSTRKRPWEPSASKKTTAGNSKLKNSTSE

Query:  KPVEQEESSGKKEVPEGE-KKGKNPYQRLFSGNCYRRGQIGHPSNQYPQRKTIAVAEDNDDGSNRSLGEFDEKTEVIEADEGDSLSCILLRVLISPKEEN
        K    + S      P+ +  +  NPY R F   C++ GQ GH SN+ P RK + + E  DD S        ++TE+++ D+G+ + CI+ ++L SPK   
Subjt:  KPVEQEESSGKKEVPEGE-KKGKNPYQRLFSGNCYRRGQIGHPSNQYPQRKTIAVAEDNDDGSNRSLGEFDEKTEVIEADEGDSLSCILLRVLISPKEEN

Query:  QLQRHSLFKTRCTIQGKVCNVIIDSGSSENFVSKKLVTALNLKTQPHEKPYKIGWIKKGGETLISEICYVPISIGNSYKDQMVCDVIKIDVCHILLGRPW
        +LQRHS+FKT+CTI  KVC VIIDSGSSEN VSK LV AL L T  H  PYK+GWIKKG ET ++E+C V  SIG  Y D++VCDV+++D CHILLGRPW
Subjt:  QLQRHSLFKTRCTIQGKVCNVIIDSGSSENFVSKKLVTALNLKTQPHEKPYKIGWIKKGGETLISEICYVPISIGNSYKDQMVCDVIKIDVCHILLGRPW

Query:  QFDVQSMHRGRENTYEFIWINKKVILLPLQKRKDDNI--------EKNQKKGSLFVTISGKKF-LRERENEI-LGIVMSGTEDTTRDEQIPEAIKELFKK
        QFD      GR+NT+ F W  KK++LLP + + D  +        E+   KG + +T SGK F L+ + ++I  GIV S       D Q P+ I +L ++
Subjt:  QFDVQSMHRGRENTYEFIWINKKVILLPLQKRKDDNI--------EKNQKKGSLFVTISGKKF-LRERENEI-LGIVMSGTEDTTRDEQIPEAIKELFKK

Query:  YPKI--SKEPTCLPPLRDIQHDIELLLGASFPHLPHYRMSPNEYKILHDTIEELLKKGHIKPSFSLCAVPALLTPKKDGTWRMCVDSRAINKITVKYRFS
        + +I  S+ P  LPP+RDIQH I+LL GA  P+LPHYRM P E +IL   +E+LLKK  I+ S S CAVPALL PKK+G WRMC+DSRAINKIT KYRF 
Subjt:  YPKI--SKEPTCLPPLRDIQHDIELLLGASFPHLPHYRMSPNEYKILHDTIEELLKKGHIKPSFSLCAVPALLTPKKDGTWRMCVDSRAINKITVKYRFS

Query:  IPRVSDLLDQLGGACIFLKIDLRSYYHQIRIRPGDEWKTTFKTNEGLFEWLVMPFGLSNAPSTFMRLMNKVLHPFLNKFIIVYFDDILVFSKTYDQHLQH
        IPR+ D+LD+L GA +F K+DLRS YHQIRIRP DEWKT FKT  GL+EW VMPFGL NAPSTFMRLMN+VL P +   ++VYFDDILV+S + + H++H
Subjt:  IPRVSDLLDQLGGACIFLKIDLRSYYHQIRIRPGDEWKTTFKTNEGLFEWLVMPFGLSNAPSTFMRLMNKVLHPFLNKFIIVYFDDILVFSKTYDQHLQH

Query:  IDQLFQVLNHNELYVNLKRCIFCSNEIAFLGFIIRKDHVLMDEKKVEAIKNWSTPTTVKQVQEFLGLASFYRKFIQNFSSIAAPITDCLKKGAFEWGPKQ
        + ++F +L  N LY+NLK+C+F + E+ FLGFI+ K  +  D  KV AI++W TP+TV +V+ F GLA+FYR+F++NFSSIA P+TDCLKKG F+WGP Q
Subjt:  IDQLFQVLNHNELYVNLKRCIFCSNEIAFLGFIIRKDHVLMDEKKVEAIKNWSTPTTVKQVQEFLGLASFYRKFIQNFSSIAAPITDCLKKGAFEWGPKQ

Query:  QDSFNLLKQSLSNISVLKLPDFFQDFEVAVDTCGTGIGAVLSQQGHPIEYFSEKLSKSRQSWSTYEQELYALVRALKQWEHYLLSREFILLTDHFSLKYL
        Q SF  LK+ L+   VL LPDF + FEV  D   TGIGAVL Q+   IE+FSEKL  + Q WS YEQELYA+VRA K WEHYL+ REF+L  DH +L++ 
Subjt:  QDSFNLLKQSLSNISVLKLPDFFQDFEVAVDTCGTGIGAVLSQQGHPIEYFSEKLSKSRQSWSTYEQELYALVRALKQWEHYLLSREFILLTDHFSLKYL

Query:  HSQKSISMMHARWISYIQRFDFVIKHQAGKENKVVDALNRKSNLLFILSSEITAFNHITKLYKKDRDFESIWEK
         SQK I+ MH +W  ++Q+F +   H+AGK N+V DAL+RK  LL  L  E+  F+ +   Y  D D   IW+K
Subjt:  HSQKSISMMHARWISYIQRFDFVIKHQAGKENKVVDALNRKSNLLFILSSEITAFNHITKLYKKDRDFESIWEK

XP_034219000.1 uncharacterized protein LOC117630375 [Prunus dulcis]4.9e-27248.42Show/hide
Query:  MKIDLPSYDGKRNIEIFLDWLKNTENFFAYMRTAKNKKVHLVALKLKGGVSAWWDQITVNHQKQGKHPIRSWEKMKKLMKQRFVPSNYEQTLYTQYQNCR
        +K ++P++ G   IE FLDWL   E FF  M   ++K V +VA +LK   + WWDQ+  + Q+QGK  +R+W KMK LM +RF+P++YEQ LY  Y  C 
Subjt:  MKIDLPSYDGKRNIEIFLDWLKNTENFFAYMRTAKNKKVHLVALKLKGGVSAWWDQITVNHQKQGKHPIRSWEKMKKLMKQRFVPSNYEQTLYTQYQNCR

Query:  QGMRKTAEYIEEFHRLGARTNLMEGEQHLISWFVGGLRFDLKEKVKLQPFQHLSEAITYAETVEEM-IENRAKSTRKRPWEPSASKKTTAGNSKLKNSTS
        QG R  +EY EEF RL  R +L E +   ++ +  GL+  ++EK+ +Q    L EAI  A   E +  E R  + R+   E S      +  S  K    
Subjt:  QGMRKTAEYIEEFHRLGARTNLMEGEQHLISWFVGGLRFDLKEKVKLQPFQHLSEAITYAETVEEM-IENRAKSTRKRPWEPSASKKTTAGNSKLKNSTS

Query:  EKP---------VE----QEESSGKKEVPEGEKKGKNPYQRLFSGNCYRRGQIGHPSNQYPQRKTIAVAEDNDDGSNR-SLGEFDEKTEVIEADEG-DSL
        ++P         V+     E SS      +   + +NPY +  +  CYR  + GH SN  P+RK     E+ D+   +  +GE D        +EG + +
Subjt:  EKP---------VE----QEESSGKKEVPEGEKKGKNPYQRLFSGNCYRRGQIGHPSNQYPQRKTIAVAEDNDDGSNR-SLGEFDEKTEVIEADEG-DSL

Query:  SCILLRVLISPKEENQLQRHSLFKTRCTIQGKVCNVIIDSGSSENFVSKKLVTALNLKTQPHEKPYKIGWIKKGGETLISEICYVPISIGNSYKDQMVCD
        + +L RVL++PKEE   QRHS+F++ C+I+ KVC+VI+D+GS ENFVSKKLV  L L T+PH  PY +GW+KKG    ++E C VP+SIG  Y D ++CD
Subjt:  SCILLRVLISPKEENQLQRHSLFKTRCTIQGKVCNVIIDSGSSENFVSKKLVTALNLKTQPHEKPYKIGWIKKGGETLISEICYVPISIGNSYKDQMVCD

Query:  VIKIDVCHILLGRPWQFDVQSMHRGRENTYEFIWINKKVILLPLQKRKDDNIEKNQKKGSLFVTISGKKFLRERENE---ILGIVMSGTEDTTRDE-QIP
        VI +D CHILLGRPWQFDV + ++GR+N   F W N+K+ +   +  K  ++E   +  S    IS ++ L E   E      +V+ G     + E  IP
Subjt:  VIKIDVCHILLGRPWQFDVQSMHRGRENTYEFIWINKKVILLPLQKRKDDNIEKNQKKGSLFVTISGKKFLRERENE---ILGIVMSGTEDTTRDE-QIP

Query:  EAIKELFKKYPKISKE--PTCLPPLRDIQHDIELLLGASFPHLPHYRMSPNEYKILHDTIEELLKKGHIKPSFSLCAVPALLTPKKDGTWRMCVDSRAIN
        + ++++  ++ ++  E  P  LPP+RDIQH I+L+ GAS P+LPHYR SP E  IL + IEELL+KG I+ S S CAVP LL   KD TWRMCVDSRAIN
Subjt:  EAIKELFKKYPKISKE--PTCLPPLRDIQHDIELLLGASFPHLPHYRMSPNEYKILHDTIEELLKKGHIKPSFSLCAVPALLTPKKDGTWRMCVDSRAIN

Query:  KITVKYRFSIPRVSDLLDQLGGACIFLKIDLRSYYHQIRIRPGDEWKTTFKTNEGLFEWLVMPFGLSNAPSTFMRLMNKVLHPFLNKFIIVYFDDILVFS
        KITVKYRF IPR+ D+LD L G+ +F KIDLRS YHQIRIRPGDEWKT FK+ +GLFEWLVMPFGLSNAPSTFMRLMN+VL PF+  F++VYFDDIL++S
Subjt:  KITVKYRFSIPRVSDLLDQLGGACIFLKIDLRSYYHQIRIRPGDEWKTTFKTNEGLFEWLVMPFGLSNAPSTFMRLMNKVLHPFLNKFIIVYFDDILVFS

Query:  KTYDQHLQHIDQLFQVLNHNELYVNLKRCIFCSNEIAFLGFIIRKDHVLMDEKKVEAIKNWSTPTTVKQVQEFLGLASFYRKFIQNFSSIAAPITDCLKK
         T ++HL H+ Q+  VL  N+LYVNLK+C FC+N++ FLGF++ ++ + +D+KK++AI +W  P TV +V+ F GLA+FYR+F+++FSSIAAPIT+CLKK
Subjt:  KTYDQHLQHIDQLFQVLNHNELYVNLKRCIFCSNEIAFLGFIIRKDHVLMDEKKVEAIKNWSTPTTVKQVQEFLGLASFYRKFIQNFSSIAAPITDCLKK

Query:  GAFEWGPKQQDSFNLLKQSLSNISVLKLPDFFQDFEVAVDTCGTGIGAVLSQQGHPIEYFSEKLSKSRQSWSTYEQELYALVRALKQWEHYLLSREFILL
        G F WG +Q+ SF  +K+ L    VL LP+F + FEV  D  G G+GAVLSQ   P+ +FS+KLS +RQ WSTY+QE YA+VRALKQWEHYL+ +EF+L 
Subjt:  GAFEWGPKQQDSFNLLKQSLSNISVLKLPDFFQDFEVAVDTCGTGIGAVLSQQGHPIEYFSEKLSKSRQSWSTYEQELYALVRALKQWEHYLLSREFILL

Query:  TDHFSLKYLHSQKSISMMHARWISYIQRFDFVIKHQAGKENKVVDALNRKSNLLFILSSEITAFNHITKLYKKDRDFESIWEK
        TDH +LKY++SQK+I  MHARW++++Q+F FVIKH +GK N+V DAL+R+++LL  L+ E+  F  + +LY+ D DF  IW K
Subjt:  TDHFSLKYLHSQKSISMMHARWISYIQRFDFVIKHQAGKENKVVDALNRKSNLLFILSSEITAFNHITKLYKKDRDFESIWEK

XP_040994264.1 uncharacterized protein LOC121240799 [Juglans microcarpa x Juglans regia]1.6e-29150.36Show/hide
Query:  MKIDLPSYDGKRNIEIFLDWLKNTENFFAYMRTAKNKKVHLVALKLKGGVSAWWDQITVNHQKQGKHPIRSWEKMKKLMKQRFVPSNYEQTLYTQYQNCR
        +KIDLP ++G  ++E FLDWL   ENFF YM+  + ++V LVA KL+GG SAWW+Q   N ++QGK P+R W KMK+LM+ RF+P +YEQ LY QYQNCR
Subjt:  MKIDLPSYDGKRNIEIFLDWLKNTENFFAYMRTAKNKKVHLVALKLKGGVSAWWDQITVNHQKQGKHPIRSWEKMKKLMKQRFVPSNYEQTLYTQYQNCR

Query:  QGMRKTAEYIEEFHRLGARTNLMEGEQHLISWFVGGLRFDLKEKVKLQPFQHLSEAITYAETVEEMIEN-RAKSTRKRPWEPSASKKTTAGNSKLKNSTS
        QG R   EY EEF+RL +R NL E E   ++ ++GGLR  +++KV L     LSEA+  A  +E  +     ++    P    A   T +      +S+ 
Subjt:  QGMRKTAEYIEEFHRLGARTNLMEGEQHLISWFVGGLRFDLKEKVKLQPFQHLSEAITYAETVEEMIEN-RAKSTRKRPWEPSASKKTTAGNSKLKNSTS

Query:  EKPVEQEESSGK-KEVPEGEKKG--KNPYQRLFSGNCYRRGQIGHPSNQYPQRKTIAVAEDNDDGSNRSLGEFDEKTEVIEADEGDSLSCILLRVLISPK
        ++  +      K   +P G + G  +NPY +  +G C+R  Q GH SN+ P RK++ +  D DD +  S  E +E  E +E DEGD ++C++ R+L++PK
Subjt:  EKPVEQEESSGK-KEVPEGEKKG--KNPYQRLFSGNCYRRGQIGHPSNQYPQRKTIAVAEDNDDGSNRSLGEFDEKTEVIEADEGDSLSCILLRVLISPK

Query:  EENQLQRHSLFKTRCTIQGKVCNVIIDSGSSENFVSKKLVTALNLKTQPHEKPYKIGWIKKGGETLISEICYVPISIGNSYKDQMVCDVIKIDVCHILLG
        +E+  QRH +FKTRCT+  KVCN+IIDSGS EN VS+ LV+ L L T+ H KPYKI WIKKG ET ++  C +P SIG  Y D + CDV+++D CH+LLG
Subjt:  EENQLQRHSLFKTRCTIQGKVCNVIIDSGSSENFVSKKLVTALNLKTQPHEKPYKIGWIKKGGETLISEICYVPISIGNSYKDQMVCDVIKIDVCHILLG

Query:  RPWQFDVQSMHRGRENTYEFIWINKKVILLPLQKRKDDNIEKNQKKGSLFVTISGKKFL--RERENEILGIVMSGTEDTTRDEQIPEAIKELFKKYPKIS
        RPWQ+DV + ++GR+NTY F W  +KV+LLP  +R  D    ++KK S F+T+S  +FL   +   +IL +V+ G  +    E  P  +++L +++  I+
Subjt:  RPWQFDVQSMHRGRENTYEFIWINKKVILLPLQKRKDDNIEKNQKKGSLFVTISGKKFL--RERENEILGIVMSGTEDTTRDEQIPEAIKELFKKYPKIS

Query:  KE--PTCLPPLRDIQHDIELLLGASFPHLPHYRMSPNEYKILHDTIEELLKKGHIKPSFSLCAVPALLTPKKDGTWRMCVDSRAINKITVKYRFSIPRVS
         +  P  LPPLRDIQH I+L+ G S P+LPHYRMSPNE+KIL D +E+L++KG I+ S S CAVPALL PKKDG+WRMCVDSRAINKITVKYRF IPR++
Subjt:  KE--PTCLPPLRDIQHDIELLLGASFPHLPHYRMSPNEYKILHDTIEELLKKGHIKPSFSLCAVPALLTPKKDGTWRMCVDSRAINKITVKYRFSIPRVS

Query:  DLLDQLGGACIFLKIDLRSYYHQIRIRPGDEWKTTFKTNEGLFEWLVMPFGLSNAPSTFMRLMNKVLHPFLNKFIIVYFDDILVFSKTYDQHLQHIDQLF
        D+LD L G+ +F K+DLRS YHQIR+RPGDEWKT FKT EGL+EWLVMPFGLSNAPSTFMR+M++ L PF+ KF++VYFDDIL++S+   QH++H+ ++ 
Subjt:  DLLDQLGGACIFLKIDLRSYYHQIRIRPGDEWKTTFKTNEGLFEWLVMPFGLSNAPSTFMRLMNKVLHPFLNKFIIVYFDDILVFSKTYDQHLQHIDQLF

Query:  QVLNHNELYVNLKRCIFCSNEIAFLGFIIRKDHVLMDEKKVEAIKNWSTPTTVKQVQEFLGLASFYRKFIQNFSSIAAPITDCLKKGAFEWGPKQQDSFN
         VL  N+LY+NLK+C F ++ + FLGF++ ++ V +DE+KV  I+ W  P+T+ QV+ F GLA+FYR+FI+NFS++AAPIT+C+KKG   WG  Q  SF 
Subjt:  QVLNHNELYVNLKRCIFCSNEIAFLGFIIRKDHVLMDEKKVEAIKNWSTPTTVKQVQEFLGLASFYRKFIQNFSSIAAPITDCLKKGAFEWGPKQQDSFN

Query:  LLKQSLSNISVLKLPDFFQDFEVAVDTCGTGIGAVLSQQGHPIEYFSEKLSKSRQSWSTYEQELYALVRALKQWEHYLLSREFILLTDHFSLKYLHSQKS
         +K+ LS   VL LP F + FEV  D    GIGAVLSQ+G P+E+FSEKL+++R+ W+ YE E YA++RALK WEHYL+ REF+L +DH +LK++++Q  
Subjt:  LLKQSLSNISVLKLPDFFQDFEVAVDTCGTGIGAVLSQQGHPIEYFSEKLSKSRQSWSTYEQELYALVRALKQWEHYLLSREFILLTDHFSLKYLHSQKS

Query:  ISMMHARWISYIQRFDFVIKHQAGKENKVVDALNRKSNLLFILSSEITAFNHITKLYKKDRDFESIWEK
        ++ MHARW+++IQ+F  V+K+++G+ NKV DAL+R+  LL  L +EIT F  +  LY +D DF   W +
Subjt:  ISMMHARWISYIQRFDFVIKHQAGKENKVVDALNRKSNLLFILSSEITAFNHITKLYKKDRDFESIWEK

TrEMBL top hitse value%identityAlignment
A0A5B7BER3 Uncharacterized protein8.7e-29951.79Show/hide
Query:  MKIDLPSYDGKRNIEIFLDWLKNTENFFAYMRTAKNKKVHLVALKLKGGVSAWWDQITVNHQKQGKHPIRSWEKMKKLMKQRFVPSNYEQTLYTQYQNCR
        MKIDLPS++G  +IE FLDW+   E FF  M  + +K+V LVA KLKGG SAWWDQ+  N ++QGK P+R+W+KM++L+++RF+P +YEQ LY QYQNCR
Subjt:  MKIDLPSYDGKRNIEIFLDWLKNTENFFAYMRTAKNKKVHLVALKLKGGVSAWWDQITVNHQKQGKHPIRSWEKMKKLMKQRFVPSNYEQTLYTQYQNCR

Query:  QGMRKTAEYIEEFHRLGARTNLMEGEQHLISWFVGGLRFDLKEKVKLQPFQHLSEAITYAETVE-EMIENRAKSTRKRPWEPSASKKTTAGNSKLKNSTS
        QG R  +EY +EF+ L +R NL E E   ++ +VGGLR  +++++ L+   +L+EA + A  VE +      +S       P +S+     + +++    
Subjt:  QGMRKTAEYIEEFHRLGARTNLMEGEQHLISWFVGGLRFDLKEKVKLQPFQHLSEAITYAETVE-EMIENRAKSTRKRPWEPSASKKTTAGNSKLKNSTS

Query:  E----KPVEQEESSGKKEVP-EGEKKGKNPYQRLFSGNCYRRGQIGHPSNQYPQRK---TIAVAEDND-DGSNRSLGEFDEK---TEVIEADEGDSLSCI
        +     P +Q  SS  +  P    +K  NPY R   G C+R  Q GH SN+ P R+    + V EDN  D  N    E+ ++    E+ E DEG+ +SC+
Subjt:  E----KPVEQEESSGKKEVP-EGEKKGKNPYQRLFSGNCYRRGQIGHPSNQYPQRK---TIAVAEDND-DGSNRSLGEFDEK---TEVIEADEGDSLSCI

Query:  LLRVLISPKEENQLQRHSLFKTRCTIQGKVCNVIIDSGSSENFVSKKLVTALNLKTQPHEKPYKIGWIKKGGETLISEICYVPISIGNSYKDQMVCDVIK
        + R+L+ PK+E   QRH++F+TRCTI  KVC+VIIDSGSSEN VSK LV AL LKT+ H  PYKIGWIKKG ET ++EIC VP SIG  YKD++ CD++ 
Subjt:  LLRVLISPKEENQLQRHSLFKTRCTIQGKVCNVIIDSGSSENFVSKKLVTALNLKTQPHEKPYKIGWIKKGGETLISEICYVPISIGNSYKDQMVCDVIK

Query:  IDVCHILLGRPWQFDVQSMHRGRENTYEFIWINKKVILLPLQKRKDDNIEKNQK-KGSLFVTISGKKFLRERE--NEILGIVMSGTEDTTRDEQIPEAIK
        +D CH+LLGRPWQFDV + H+G++NTY F W +KKV+L+P    K  N+ K  K +G   +T++G +F+ + +   +I+ +++ G +       +PE ++
Subjt:  IDVCHILLGRPWQFDVQSMHRGRENTYEFIWINKKVILLPLQKRKDDNIEKNQK-KGSLFVTISGKKFLRERE--NEILGIVMSGTEDTTRDEQIPEAIK

Query:  ELFKKYPKI--SKEPTCLPPLRDIQHDIELLLGASFPHLPHYRMSPNEYKILHDTIEELLKKGHIKPSFSLCAVPALLTPKKDGTWRMCVDSRAINKITV
         L  ++  I  S+ P  LPP+RDIQH I+L+ GAS P+LPHYRMSP E +IL   +E+L+ KG I+ S S CAVPALLTPKKDG+WRMCVDSRAINKITV
Subjt:  ELFKKYPKI--SKEPTCLPPLRDIQHDIELLLGASFPHLPHYRMSPNEYKILHDTIEELLKKGHIKPSFSLCAVPALLTPKKDGTWRMCVDSRAINKITV

Query:  KYRFSIPRVSDLLDQLGGACIFLKIDLRSYYHQIRIRPGDEWKTTFKTNEGLFEWLVMPFGLSNAPSTFMRLMNKVLHPFLNKFIIVYFDDILVFSKTYD
        KYRF IPR++D+LD L G+ IF KIDLRS YHQIRIRPGDEWKT FKT EGL+EWLVMPFGLSNAPSTFMR+MN+VL PF+ KF++VYFDDIL++SK+  
Subjt:  KYRFSIPRVSDLLDQLGGACIFLKIDLRSYYHQIRIRPGDEWKTTFKTNEGLFEWLVMPFGLSNAPSTFMRLMNKVLHPFLNKFIIVYFDDILVFSKTYD

Query:  QHLQHIDQLFQVLNHNELYVNLKRCIFCSNEIAFLGFIIRKDHVLMDEKKVEAIKNWSTPTTVKQVQEFLGLASFYRKFIQNFSSIAAPITDCLKKGAFE
        +HL+H+ ++   L  ++LY+N+K+C F +  + FLGFII  + + +DE+KV AI++W TP TV  ++ F GLA+FYR+FI+NFSSI APITDC+KKG F 
Subjt:  QHLQHIDQLFQVLNHNELYVNLKRCIFCSNEIAFLGFIIRKDHVLMDEKKVEAIKNWSTPTTVKQVQEFLGLASFYRKFIQNFSSIAAPITDCLKKGAFE

Query:  WGPKQQDSFNLLKQSLSNISVLKLPDFFQDFEVAVDTCGTGIGAVLSQQGHPIEYFSEKLSKSRQSWSTYEQELYALVRALKQWEHYLLSREFILLTDHF
        W   Q+ SF L+K+ LS   VL LP F + F+V  D   TGIGAVLSQ+G P+E+FSEKL+++RQ W+TYE EL+A+VRALK WEHYL+ +EF++ +DH 
Subjt:  WGPKQQDSFNLLKQSLSNISVLKLPDFFQDFEVAVDTCGTGIGAVLSQQGHPIEYFSEKLSKSRQSWSTYEQELYALVRALKQWEHYLLSREFILLTDHF

Query:  SLKYLHSQKSISMMHARWISYIQRFDFVIKHQAGKENKVVDALNRKSNLLFILSSEITAFNHITKLYKKDRDFESIWEK
        +LK++++Q S+S MH RWI+++QRF FV+KH+AG++NKV DAL+R++ LL ++SSEIT+F  + +LY++D DF+  W K
Subjt:  SLKYLHSQKSISMMHARWISYIQRFDFVIKHQAGKENKVVDALNRKSNLLFILSSEITAFNHITKLYKKDRDFESIWEK

A0A5D3DGR0 Reverse transcriptase0.0e+0095.11Show/hide
Query:  MKIDLPSYDGKRNIEIFLDWLKNTENFFAYMRTAKNKKVHLVALKLKGGVSAWWDQITVNHQKQGKHPIRSWEKMKKLMKQRFVPSNYEQTLYTQYQNCR
        MKIDLPSYDGKRNIE FLDWLKNTENFFAYM T KNKKVHLVALKLKGG SAWWDQITVN QKQGKHPIRSWEKMKKLMKQRFVP NYEQTLYTQYQNCR
Subjt:  MKIDLPSYDGKRNIEIFLDWLKNTENFFAYMRTAKNKKVHLVALKLKGGVSAWWDQITVNHQKQGKHPIRSWEKMKKLMKQRFVPSNYEQTLYTQYQNCR

Query:  QGMRKTAEYIEEFHRLGARTNLMEGEQHLISWFVGGLRFDLKEKVKLQPFQHLSEAITYAETVEEMIENRAKSTRKRPWEPSASKKTTAGNSKLKNSTSE
        QGMRKTAEYIEEFHRLG RTNLMEGE+HLISWFVGGLRFDLKEKVKLQPFQHLSEAITYAETVEEMIENRAKSTRKRPWEPSASKKTTAGNSKLKN+TSE
Subjt:  QGMRKTAEYIEEFHRLGARTNLMEGEQHLISWFVGGLRFDLKEKVKLQPFQHLSEAITYAETVEEMIENRAKSTRKRPWEPSASKKTTAGNSKLKNSTSE

Query:  KPVEQEESSGKKEVPEGEKKGKNPYQRLFSGNCYRRGQIGHPSNQYPQRKTIAVAEDNDDGSNRSLGEFDEKTEVIEADEGDSLSCILLRVLISPKEENQ
        KPVEQEESSGKKEVPEGEKKGKNPYQR FSGNCYR GQ+GHPSNQ PQRKTIAVA+DNDDGSNRSLGEFDE+TEVIEADEGDSLSCIL RVLISPKEENQ
Subjt:  KPVEQEESSGKKEVPEGEKKGKNPYQRLFSGNCYRRGQIGHPSNQYPQRKTIAVAEDNDDGSNRSLGEFDEKTEVIEADEGDSLSCILLRVLISPKEENQ

Query:  LQRHSLFKTRCTIQGKVCNVIIDSGSSENFVSKKLVTALNLKTQPHEKPYKIGWIKKGGETLISEICYVPISIGNSYKDQMVCDVIKIDVCHILLGRPWQ
        LQRHSLFKTRCTIQGKVCNVIIDSGSSENFVSKKLVTALNLKTQPHEKPYKIGWIKKGGETLISEICYVP+SIGNSYKDQMVCDVI++DVCHILLGRPWQ
Subjt:  LQRHSLFKTRCTIQGKVCNVIIDSGSSENFVSKKLVTALNLKTQPHEKPYKIGWIKKGGETLISEICYVPISIGNSYKDQMVCDVIKIDVCHILLGRPWQ

Query:  FDVQSMHRGRENTYEFIWINKKVILLPLQKRKDDNIEKNQKKGSLFVTISGKKFLRERENEILGIVMSGTEDTTRDEQIPEAIKELFKKYPKISKEPTCL
        FDVQSMHRGRENTYEF+W+NKKVILLPLQKRKDDNIEKNQKKGSLFVTISGKKFLRERENEILGIVMSGTEDTTRDEQIPEAIKELFKKYPKISKEPTCL
Subjt:  FDVQSMHRGRENTYEFIWINKKVILLPLQKRKDDNIEKNQKKGSLFVTISGKKFLRERENEILGIVMSGTEDTTRDEQIPEAIKELFKKYPKISKEPTCL

Query:  PPLRDIQHDIELLLGASFPHLPHYRMSPNEYKILHDTIEELLKKGHIKPSFSLCAVPALLTPKKDGTWRMCVDSRAINKITVKYRFSIPRVSDLLDQLGG
        PPLRDI H+IELL GASFPHLPHY MSPNEYKILHD IEELLKKGHIKPSFSLC VPALLTPKKDGTWRMCVDSRAINKITVKYRF IPRVSDLLDQLGG
Subjt:  PPLRDIQHDIELLLGASFPHLPHYRMSPNEYKILHDTIEELLKKGHIKPSFSLCAVPALLTPKKDGTWRMCVDSRAINKITVKYRFSIPRVSDLLDQLGG

Query:  ACIFLKIDLRSYYHQIRIRPGDEWKTTFKTNEGLFEWLVMPFGLSNAPSTFMRLMNKVLHPFLNKFIIVYFDDILVFSKTYDQHLQHIDQLFQVLNHNEL
        ACIF KIDLRS YHQIRIRPGDEWKT FKTNEGLFEWLVMPF LSNAPSTFMRLMNKVLHPFLNKFIIVYFDDILVFSKTYDQHLQHIDQLFQVLNHNEL
Subjt:  ACIFLKIDLRSYYHQIRIRPGDEWKTTFKTNEGLFEWLVMPFGLSNAPSTFMRLMNKVLHPFLNKFIIVYFDDILVFSKTYDQHLQHIDQLFQVLNHNEL

Query:  YVNLKRCIFCSNEIAFLGFIIRKDHVLMDEKKVEAIKNWSTPTTVKQVQEFLGLASFYRKFIQNFSSIAAPITDCLKKGAFEWGPKQQDSFNLLKQSLSN
        YVNLK+CIFCSNEIAFLGFIIRKDHVLMDEKKVEAIKNWSTPTTV QVQ FLGLASFYRKFIQN SSIAAPITDCLKKGAFEWGPKQQDSFNLLK+SL N
Subjt:  YVNLKRCIFCSNEIAFLGFIIRKDHVLMDEKKVEAIKNWSTPTTVKQVQEFLGLASFYRKFIQNFSSIAAPITDCLKKGAFEWGPKQQDSFNLLKQSLSN

Query:  ISVLKLPDFFQDFEVAVDTCGTGIGAVLSQQGHPIEYFSEKLSKSRQSWSTYEQELYALVRALKQWEHYLLSREFILLTDHFSLKYLHSQKSISMMHARW
        I VLKLPDFFQ FEVAVDTCGTGIGAVLSQQ HPIEYFSE+LSKSRQSWSTYEQELYALVRALKQWEHYLLSREFILLTDHFSLKYLHSQKSISMMHARW
Subjt:  ISVLKLPDFFQDFEVAVDTCGTGIGAVLSQQGHPIEYFSEKLSKSRQSWSTYEQELYALVRALKQWEHYLLSREFILLTDHFSLKYLHSQKSISMMHARW

Query:  ISYIQRFDFVIKHQAGKENKVVDALNRKSNLLFILSSEITAFNHITKLYKKDRDFESIWEK
        ISY QRFDFVIKHQAGKENKV DAL+RKSNLLFILSSEIT FNHIT+LYKKDRDFESIWEK
Subjt:  ISYIQRFDFVIKHQAGKENKVVDALNRKSNLLFILSSEITAFNHITKLYKKDRDFESIWEK

A0A6N2LVR1 Uncharacterized protein2.3e-27548.67Show/hide
Query:  MKIDLPSYDGKRNIEIFLDWLKNTENFFAYMRTAKNKKVHLVALKLKGGVSAWWDQITVNHQKQGKHPIRSWEKMKKLMKQRFVPSNYEQTLYTQYQNCR
        MK+DLPS++G+  IE FLDWL   E FF YM   ++KKV LVA +L GG SAWW+Q+ +   +Q K  ++SW KM++L++ R++P +YEQ L+ QYQNC+
Subjt:  MKIDLPSYDGKRNIEIFLDWLKNTENFFAYMRTAKNKKVHLVALKLKGGVSAWWDQITVNHQKQGKHPIRSWEKMKKLMKQRFVPSNYEQTLYTQYQNCR

Query:  QGMRKTAEYIEEFHRLGARTNLMEGEQHLISWFVGGLRFDLKEKVKLQPFQHLSEAITYAETVEEMIENRAKSTRKRPWEPSASKKTTAGNSKLKNSTSE
        QG R    Y+EEFHRL +R NL+E +   ++ FVGGLR++++++V +     L+EAI  A   E  +  R  +T + P++P+ +           ++ S 
Subjt:  QGMRKTAEYIEEFHRLGARTNLMEGEQHLISWFVGGLRFDLKEKVKLQPFQHLSEAITYAETVEEMIENRAKSTRKRPWEPSASKKTTAGNSKLKNSTSE

Query:  KPVEQEESSGKKEVPE-------GEKKGKNPYQRLFSGNCYRRGQIGHPSNQYPQRKTIAVAEDNDDGSNRSLGEFDE------KTEVIEADEGDSLS--
            +  SS + +V +         +  +N Y R  +  CYR GQ GH SN  P+R  + + E  ++      G   E      + EV   DEG+ LS  
Subjt:  KPVEQEESSGKKEVPE-------GEKKGKNPYQRLFSGNCYRRGQIGHPSNQYPQRKTIAVAEDNDDGSNRSLGEFDE------KTEVIEADEGDSLS--

Query:  CILLRVLISPKEENQLQRHSLFKTRCTIQGKVCNVIIDSGSSENFVSKKLVTALNLKTQPHEKPYKIGWIKKGGETLISEICYVPISIGNSYKDQMVCDV
         ++ R++++PK EN  QR+++F+TRCT+  KVC+VIIDSGSSEN +SK +V  L LKT+ H  PYKIGWIKKG ET ++E C    SIG +Y D++VCDV
Subjt:  CILLRVLISPKEENQLQRHSLFKTRCTIQGKVCNVIIDSGSSENFVSKKLVTALNLKTQPHEKPYKIGWIKKGGETLISEICYVPISIGNSYKDQMVCDV

Query:  IKIDVCHILLGRPWQFDVQSMHRGRENTYEFIWINKKVILLPLQKRKDDNIEKNQKKGSLFVTISGKKFLR--ERENEILGIVMSGTEDTTRDEQIPEAI
        +++D CH++LGRPWQ+DV   ++G++N Y F+   +KVIL PL++       + Q++  L V   G+ FL   ++++E+  ++M G E  T    IPE +
Subjt:  IKIDVCHILLGRPWQFDVQSMHRGRENTYEFIWINKKVILLPLQKRKDDNIEKNQKKGSLFVTISGKKFLR--ERENEILGIVMSGTEDTTRDEQIPEAI

Query:  KELFKKYPKISKE--PTCLPPLRDIQHDIELLLGASFPHLPHYRMSPNEYKILHDTIEELLKKGHIKPSFSLCAVPALLTPKKDGTWRMCVDSRAINKIT
        + L  ++  I  E  P  LPP+RDIQH I+L+ GAS P+ PHYRMSP E  IL   +EEL+KKG ++ S S CAVPALL PKKDG+WRMC+DSRAINKIT
Subjt:  KELFKKYPKISKE--PTCLPPLRDIQHDIELLLGASFPHLPHYRMSPNEYKILHDTIEELLKKGHIKPSFSLCAVPALLTPKKDGTWRMCVDSRAINKIT

Query:  VKYRFSIPRVSDLLDQLGGACIFLKIDLRSYYHQIRIRPGDEWKTTFKTNEGLFEWLVMPFGLSNAPSTFMRLMNKVLHPFLNKFIIVYFDDILVFSKTY
        +KYRF IPR+ D+LD L G+ IF KIDLRS YHQIRIRPGDEWKT FKT EGL+EWLVMPFGLSNAPSTFMRLMN+VL PF   F++VYFDDIL++S++ 
Subjt:  VKYRFSIPRVSDLLDQLGGACIFLKIDLRSYYHQIRIRPGDEWKTTFKTNEGLFEWLVMPFGLSNAPSTFMRLMNKVLHPFLNKFIIVYFDDILVFSKTY

Query:  DQHLQHIDQLFQVLNHNELYVNLKRCIFCSNEIAFLGFIIRKDHVLMDEKKVEAIKNWSTPTTVKQVQEFLGLASFYRKFIQNFSSIAAPITDCLKKGAF
          H+ H+ ++F VL HN+L+VNL +C F ++ + FLGF++  D + +DE+KV AI++W TP  + +V+ F GLA+FYR+F+++FS I APIT+C+KKG F
Subjt:  DQHLQHIDQLFQVLNHNELYVNLKRCIFCSNEIAFLGFIIRKDHVLMDEKKVEAIKNWSTPTTVKQVQEFLGLASFYRKFIQNFSSIAAPITDCLKKGAF

Query:  EWGPKQQDSFNLLKQSLSNISVLKLPDFFQDFEVAVDTCGTGIGAVLSQQGHPIEYFSEKLSKSRQSWSTYEQELYALVRALKQWEHYLLSREFILLTDH
         WG + + SF L+K+ L++  VL LPDF + FEV  D    GIGAVLSQ+  P+ ++SEKLS++R+ WSTYE ELYA+ RA+K WEHYL+ REFIL +DH
Subjt:  EWGPKQQDSFNLLKQSLSNISVLKLPDFFQDFEVAVDTCGTGIGAVLSQQGHPIEYFSEKLSKSRQSWSTYEQELYALVRALKQWEHYLLSREFILLTDH

Query:  FSLKYLHSQKSISMMHARWISYIQRFDFVIKHQAGKENKVVDALNRKSNLLFILSSEITAFNHITKLYKKDRDFESIWEK
         +LK++++Q +++ MHARW+++IQRF+F +KH++G+ NKV DAL+RK +LL  L +E+  F  I  LY  D DF + W+K
Subjt:  FSLKYLHSQKSISMMHARWISYIQRFDFVIKHQAGKENKVVDALNRKSNLLFILSSEITAFNHITKLYKKDRDFESIWEK

A0A6P6GFU0 uncharacterized protein LOC1124928195.7e-28250.1Show/hide
Query:  MKIDLPSYDGKRNIEIFLDWLKNTENFFAYMRTAKNKKVHLVALKLKGGVSAWWDQITVNHQKQGKHPIRSWEKMKKLMKQRFVPSNYEQTLYTQYQNCR
        +K+D+P++DG  NIE FLDW++  E+FF YM   ++K+V LVA K +GG SAWW+Q+  N +KQGK PI+SW +++++++ RF+P ++EQ LY QY +C 
Subjt:  MKIDLPSYDGKRNIEIFLDWLKNTENFFAYMRTAKNKKVHLVALKLKGGVSAWWDQITVNHQKQGKHPIRSWEKMKKLMKQRFVPSNYEQTLYTQYQNCR

Query:  QGMRKTAEYIEEFHRLGARTNLMEGEQHLISWFVGGLRFDLKEKVKLQPFQHLSEAITYAETVEEMIENRAKSTRKRPWEPSASKKTTAGNSKLKNSTSE
        QG R  +E  EEF+RL AR NL E E  L++ +V GL   ++E+++L P  +LSEA+  A  +E+ IE     T  + W+P +        S    +  +
Subjt:  QGMRKTAEYIEEFHRLGARTNLMEGEQHLISWFVGGLRFDLKEKVKLQPFQHLSEAITYAETVEEMIENRAKSTRKRPWEPSASKKTTAGNSKLKNSTSE

Query:  KPVEQEESSGKKEVPEGE-KKGKNPYQRLFSGNCYRRGQIGHPSNQYPQRKTIAVAEDNDDGSNRSLGEFDEKTEVIEADEGDSLSCILLRVLISPKEEN
        K    + S      P+ +  +  NPY R F   C++ GQ GH SN+ P RK + + E  DD S        ++TE+++ D+G+ + CI+ ++L SPK   
Subjt:  KPVEQEESSGKKEVPEGE-KKGKNPYQRLFSGNCYRRGQIGHPSNQYPQRKTIAVAEDNDDGSNRSLGEFDEKTEVIEADEGDSLSCILLRVLISPKEEN

Query:  QLQRHSLFKTRCTIQGKVCNVIIDSGSSENFVSKKLVTALNLKTQPHEKPYKIGWIKKGGETLISEICYVPISIGNSYKDQMVCDVIKIDVCHILLGRPW
        +LQRHS+FKT+CTI  KVC VIIDSGSSEN VSK LV AL L T  H  PYK+GWIKKG ET ++E+C V  SIG  Y D++VCDV+++D CHILLGRPW
Subjt:  QLQRHSLFKTRCTIQGKVCNVIIDSGSSENFVSKKLVTALNLKTQPHEKPYKIGWIKKGGETLISEICYVPISIGNSYKDQMVCDVIKIDVCHILLGRPW

Query:  QFDVQSMHRGRENTYEFIWINKKVILLPLQKRKDDNI--------EKNQKKGSLFVTISGKKF-LRERENEI-LGIVMSGTEDTTRDEQIPEAIKELFKK
        QFD      GR+NT+ F W  KK++LLP + + D  +        E+   KG + +T SGK F L+ + ++I  GIV S       D Q P+ I +L ++
Subjt:  QFDVQSMHRGRENTYEFIWINKKVILLPLQKRKDDNI--------EKNQKKGSLFVTISGKKF-LRERENEI-LGIVMSGTEDTTRDEQIPEAIKELFKK

Query:  YPKI--SKEPTCLPPLRDIQHDIELLLGASFPHLPHYRMSPNEYKILHDTIEELLKKGHIKPSFSLCAVPALLTPKKDGTWRMCVDSRAINKITVKYRFS
        + +I  S+ P  LPP+RDIQH I+LL GA  P+LPHYRM P E +IL   +E+LLKK  I+ S S CAVPALL PKK+G WRMC+DSRAINKIT KYRF 
Subjt:  YPKI--SKEPTCLPPLRDIQHDIELLLGASFPHLPHYRMSPNEYKILHDTIEELLKKGHIKPSFSLCAVPALLTPKKDGTWRMCVDSRAINKITVKYRFS

Query:  IPRVSDLLDQLGGACIFLKIDLRSYYHQIRIRPGDEWKTTFKTNEGLFEWLVMPFGLSNAPSTFMRLMNKVLHPFLNKFIIVYFDDILVFSKTYDQHLQH
        IPR+ D+LD+L GA +F K+DLRS YHQIRIRP DEWKT FKT  GL+EW VMPFGL NAPSTFMRLMN+VL P +   ++VYFDDILV+S + + H++H
Subjt:  IPRVSDLLDQLGGACIFLKIDLRSYYHQIRIRPGDEWKTTFKTNEGLFEWLVMPFGLSNAPSTFMRLMNKVLHPFLNKFIIVYFDDILVFSKTYDQHLQH

Query:  IDQLFQVLNHNELYVNLKRCIFCSNEIAFLGFIIRKDHVLMDEKKVEAIKNWSTPTTVKQVQEFLGLASFYRKFIQNFSSIAAPITDCLKKGAFEWGPKQ
        + ++F +L  N LY+NLK+C+F + E+ FLGFI+ K  +  D  KV AI++W TP+TV +V+ F GLA+FYR+F++NFSSIA P+TDCLKKG F+WGP Q
Subjt:  IDQLFQVLNHNELYVNLKRCIFCSNEIAFLGFIIRKDHVLMDEKKVEAIKNWSTPTTVKQVQEFLGLASFYRKFIQNFSSIAAPITDCLKKGAFEWGPKQ

Query:  QDSFNLLKQSLSNISVLKLPDFFQDFEVAVDTCGTGIGAVLSQQGHPIEYFSEKLSKSRQSWSTYEQELYALVRALKQWEHYLLSREFILLTDHFSLKYL
        Q SF  LK+ L+   VL LPDF + FEV  D   TGIGAVL Q+   IE+FSEKL  + Q WS YEQELYA+VRA K WEHYL+ REF+L  DH +L++ 
Subjt:  QDSFNLLKQSLSNISVLKLPDFFQDFEVAVDTCGTGIGAVLSQQGHPIEYFSEKLSKSRQSWSTYEQELYALVRALKQWEHYLLSREFILLTDHFSLKYL

Query:  HSQKSISMMHARWISYIQRFDFVIKHQAGKENKVVDALNRKSNLLFILSSEITAFNHITKLYKKDRDFESIWEK
         SQK I+ MH +W  ++Q+F +   H+AGK N+V DAL+RK  LL  L  E+  F+ +   Y  D D   IW+K
Subjt:  HSQKSISMMHARWISYIQRFDFVIKHQAGKENKVVDALNRKSNLLFILSSEITAFNHITKLYKKDRDFESIWEK

M5W531 Reverse transcriptase7.0e-27248.28Show/hide
Query:  MKIDLPSYDGKRNIEIFLDWLKNTENFFAYMRTAKNKKVHLVALKLKGGVSAWWDQITVNHQKQGKHPIRSWEKMKKLMKQRFVPSNYEQTLYTQYQNCR
        +K ++P++ G   IE FLDWL   E FF  M   ++K V +VA +LK   + WWDQ+    Q+QGK  +R+W KMK LM +RF+P++YEQ LY  Y  C 
Subjt:  MKIDLPSYDGKRNIEIFLDWLKNTENFFAYMRTAKNKKVHLVALKLKGGVSAWWDQITVNHQKQGKHPIRSWEKMKKLMKQRFVPSNYEQTLYTQYQNCR

Query:  QGMRKTAEYIEEFHRLGARTNLMEGEQHLISWFVGGLRFDLKEKVKLQPFQHLSEAITYAETVEEMIENRAKSTRKRPWEPSASKKTTAGNSKLKNSTSE
        QG R  +EY EEF RL  R +L E +   ++ +  GL+  ++EK+ +Q    L EAI  A    E++E   +    R  +  AS  T   +S       +
Subjt:  QGMRKTAEYIEEFHRLGARTNLMEGEQHLISWFVGGLRFDLKEKVKLQPFQHLSEAITYAETVEEMIENRAKSTRKRPWEPSASKKTTAGNSKLKNSTSE

Query:  KPVEQEESSG----------KKEVPEGEKK----------GKNPYQRLFSGNCYRRGQIGHPSNQYPQRKTIAVAEDNDDGSNR-SLGEFDEKTEVIEAD
        K   Q+++SG           K   EG  +           +NPY +  +  CYR  + GH SN  P+RK     E+ D+   +  +GE D        +
Subjt:  KPVEQEESSG----------KKEVPEGEKK----------GKNPYQRLFSGNCYRRGQIGHPSNQYPQRKTIAVAEDNDDGSNR-SLGEFDEKTEVIEAD

Query:  EG-DSLSCILLRVLISPKEENQLQRHSLFKTRCTIQGKVCNVIIDSGSSENFVSKKLVTALNLKTQPHEKPYKIGWIKKGGETLISEICYVPISIGNSYK
        EG + ++ +L RVL++PKEE   QRH++F++ C+I+ KVC+VI+D+GS ENFVSKKLV  L L T+PH  PY +GW+KKG    ++E C VP+SIG  Y+
Subjt:  EG-DSLSCILLRVLISPKEENQLQRHSLFKTRCTIQGKVCNVIIDSGSSENFVSKKLVTALNLKTQPHEKPYKIGWIKKGGETLISEICYVPISIGNSYK

Query:  DQMVCDVIKIDVCHILLGRPWQFDVQSMHRGRENTYEFIWINKKVILLPLQKRKDDNIEKNQKKGSLFVT-ISGKKFLRERENEILGIVMSGTEDTTRDE
        D ++CDVI +D CHILLGRPWQFDV +  +GR+N   F W N+K+ +   Q  +     K + + S F+T IS ++ L E   E  G           + 
Subjt:  DQMVCDVIKIDVCHILLGRPWQFDVQSMHRGRENTYEFIWINKKVILLPLQKRKDDNIEKNQKKGSLFVT-ISGKKFLRERENEILGIVMSGTEDTTRDE

Query:  QIPEAIKELFKKYPKISKE--PTCLPPLRDIQHDIELLLGASFPHLPHYRMSPNEYKILHDTIEELLKKGHIKPSFSLCAVPALLTPKKDGTWRMCVDSR
         IP+ ++++  ++ ++  E  P  LPP+RDIQH I+L+ GAS P+LPHYRMSP E  IL + IEELL+KG I+ S S CAVP LL PKKD TWRMCVDSR
Subjt:  QIPEAIKELFKKYPKISKE--PTCLPPLRDIQHDIELLLGASFPHLPHYRMSPNEYKILHDTIEELLKKGHIKPSFSLCAVPALLTPKKDGTWRMCVDSR

Query:  AINKITVKYRFSIPRVSDLLDQLGGACIFLKIDLRSYYHQIRIRPGDEWKTTFKTNEGLFEWLVMPFGLSNAPSTFMRLMNKVLHPFLNKFIIVYFDDIL
        A+NKI VKYRFSIPR+ D+LD L G+ +F KIDLRS YHQIRIRPGDEWKT FK+ +GLFEWLVMPFGLSNAPSTFMRLMN+VL PF+  F++VYFDDIL
Subjt:  AINKITVKYRFSIPRVSDLLDQLGGACIFLKIDLRSYYHQIRIRPGDEWKTTFKTNEGLFEWLVMPFGLSNAPSTFMRLMNKVLHPFLNKFIIVYFDDIL

Query:  VFSKTYDQHLQHIDQLFQVLNHNELYVNLKRCIFCSNEIAFLGFIIRKDHVLMDEKKVEAIKNWSTPTTVKQVQEFLGLASFYRKFIQNFSSIAAPITDC
        ++S T ++HL H+ Q+  VL  N+LYVNLK+C FC+N++ FLGF++ ++ + +D++K++AI +W  P TV +V+ F GLA+FY +F+++FSSIAAPIT+C
Subjt:  VFSKTYDQHLQHIDQLFQVLNHNELYVNLKRCIFCSNEIAFLGFIIRKDHVLMDEKKVEAIKNWSTPTTVKQVQEFLGLASFYRKFIQNFSSIAAPITDC

Query:  LKKGAFEWGPKQQDSFNLLKQSLSNISVLKLPDFFQDFEVAVDTCGTGIGAVLSQQGHPIEYFSEKLSKSRQSWSTYEQELYALVRALKQWEHYLLSREF
        LKKG F WG +Q+ SF  +K+ L    VL LP+F + FEV  D  G G+GAVL Q   P+ +FSEKLS +RQ WSTY+QE YA+VRALKQWEHYL+ +EF
Subjt:  LKKGAFEWGPKQQDSFNLLKQSLSNISVLKLPDFFQDFEVAVDTCGTGIGAVLSQQGHPIEYFSEKLSKSRQSWSTYEQELYALVRALKQWEHYLLSREF

Query:  ILLTDHFSLKYLHSQKSISMMHARWISYIQRFDFVIKHQAGKENKVVDALNRKSNLLFILSSEITAFNHITKLYKKDRDFESIWEK
        +L TDH +LKY++SQK+I  MHARW++++Q+F FVIKH +GK N+V DAL+R+++LL  L+ E+  F  + +LY+ D DF  IW K
Subjt:  ILLTDHFSLKYLHSQKSISMMHARWISYIQRFDFVIKHQAGKENKVVDALNRKSNLLFILSSEITAFNHITKLYKKDRDFESIWEK

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.65.2e-8338.72Show/hide
Query:  IEELLKKGHIKPSFSLCAVPALLTPKKDGT-----WRMCVDSRAINKITVKYRFSIPRVSDLLDQLGGACIFLKIDLRSYYHQIRIRPGDEWKTTFKTNE
        I+++L +G I+ S S    P  + PKK        +R+ +D R +N+ITV  R  IP + ++L +LG    F  IDL   +HQI + P    KT F T  
Subjt:  IEELLKKGHIKPSFSLCAVPALLTPKKDGT-----WRMCVDSRAINKITVKYRFSIPRVSDLLDQLGGACIFLKIDLRSYYHQIRIRPGDEWKTTFKTNE

Query:  GLFEWLVMPFGLSNAPSTFMRLMNKVLHPFLNKFIIVYFDDILVFSKTYDQHLQHIDQLFQVLNHNELYVNLKRCIFCSNEIAFLGFIIRKDHVLMDEKK
        G +E+L MPFGL NAP+TF R MN +L P LNK  +VY DDI+VFS + D+HLQ +  +F+ L    L + L +C F   E  FLG ++  D +  + +K
Subjt:  GLFEWLVMPFGLSNAPSTFMRLMNKVLHPFLNKFIIVYFDDILVFSKTYDQHLQHIDQLFQVLNHNELYVNLKRCIFCSNEIAFLGFIIRKDHVLMDEKK

Query:  VEAIKNWSTPTTVKQVQEFLGLASFYRKFIQNFSSIAAPITDCLKKG--AFEWGPKQQDSFNLLKQSLSNISVLKLPDFFQDFEVAVDTCGTGIGAVLSQ
        +EAI+ +  PT  K+++ FLGL  +YRKFI NF+ IA P+T CLKK        P+   +F  LK  +S   +LK+PDF + F +  D     +GAVLSQ
Subjt:  VEAIKNWSTPTTVKQVQEFLGLASFYRKFIQNFSSIAAPITDCLKKG--AFEWGPKQQDSFNLLKQSLSNISVLKLPDFFQDFEVAVDTCGTGIGAVLSQ

Query:  QGHPIEYFSEKLSKSRQSWSTYEQELYALVRALKQWEHYLLSREFILLTDHFSLKYLHSQKSISMMHARWISYIQRFDFVIKHQAGKENKVVDALNR---
         GHP+ Y S  L++   ++ST E+EL A+V A K + HYLL R F + +DH  L +L+  K  +    RW   +  FDF IK+  GKEN V DAL+R   
Subjt:  QGHPIEYFSEKLSKSRQSWSTYEQELYALVRALKQWEHYLLSREFILLTDHFSLKYLHSQKSISMMHARWISYIQRFDFVIKHQAGKENKVVDALNR---

Query:  ----------------KSNLLFILSSEITAFNH------------ITKLYKK
                         S+L+FI    +  FN             +TK +KK
Subjt:  ----------------KSNLLFILSSEITAFNH------------ITKLYKK

P0CT41 Transposon Tf2-12 polyprotein5.6e-7728.13Show/hide
Query:  NQLQRHSLFKTRCTIQGKV----CNV--IIDSGSSENFVSKKLVTALNLKTQPHEKPYKIGWIKKGGETLISEICYVPISIGNSYKDQMVCDVIKIDVCH
        N+   H+  +    +Q ++    C++  +ID+G+  N ++++ V A  L T+P        W K              +  G  Y +++    IK+++  
Subjt:  NQLQRHSLFKTRCTIQGKV----CNV--IIDSGSSENFVSKKLVTALNLKTQPHEKPYKIGWIKKGGETLISEICYVPISIGNSYKDQMVCDVIKIDVCH

Query:  ILLGRPWQFDVQSMHRGRENTYEFIWINK--KVILLPLQKRKDDNIEKNQKKGSLFVTISGKKFLRERENEILGIVMSGTEDTTRDEQIPEAIKELFKKY
                        G     EF+ + K      +      D+NIE           IS  K    + N++  IV        ++ ++P+  KE FK  
Subjt:  ILLGRPWQFDVQSMHRGRENTYEFIWINK--KVILLPLQKRKDDNIEKNQKKGSLFVTISGKKFLRERENEILGIVMSGTEDTTRDEQIPEAIKELFKKY

Query:  PKISKEPTCLPPLRDIQHDIELLLGASFPHLPHYRMSPNEYKILHDTIEELLKKGHIKPSFSLCAVPALLTPKKDGTWRMCVDSRAINKITVKYRFSIPR
           +       P++ ++ ++EL        + +Y + P + + ++D I + LK G I+ S ++ A P +  PKK+GT RM VD + +NK      + +P 
Subjt:  PKISKEPTCLPPLRDIQHDIELLLGASFPHLPHYRMSPNEYKILHDTIEELLKKGHIKPSFSLCAVPALLTPKKDGTWRMCVDSRAINKITVKYRFSIPR

Query:  VSDLLDQLGGACIFLKIDLRSYYHQIRIRPGDEWKTTFKTNEGLFEWLVMPFGLSNAPSTFMRLMNKVLHPFLNKFIIVYFDDILVFSKTYDQHLQHIDQ
        +  LL ++ G+ IF K+DL+S YH IR+R GDE K  F+   G+FE+LVMP+G+S AP+ F   +N +L       ++ Y DDIL+ SK+  +H++H+  
Subjt:  VSDLLDQLGGACIFLKIDLRSYYHQIRIRPGDEWKTTFKTNEGLFEWLVMPFGLSNAPSTFMRLMNKVLHPFLNKFIIVYFDDILVFSKTYDQHLQHIDQ

Query:  LFQVLNHNELYVNLKRCIFCSNEIAFLGFIIRKDHVLMDEKKVEAIKNWSTPTTVKQVQEFLGLASFYRKFIQNFSSIAAPITDCLKKGA-FEWGPKQQD
        + Q L +  L +N  +C F  +++ F+G+ I +      ++ ++ +  W  P   K++++FLG  ++ RKFI   S +  P+ + LKK   ++W P Q  
Subjt:  LFQVLNHNELYVNLKRCIFCSNEIAFLGFIIRKDHVLMDEKKVEAIKNWSTPTTVKQVQEFLGLASFYRKFIQNFSSIAAPITDCLKKGA-FEWGPKQQD

Query:  SFNLLKQSLSNISVLKLPDFFQDFEVAVDTCGTGIGAVLSQQG-----HPIEYFSEKLSKSRQSWSTYEQELYALVRALKQWEHYLLS--REFILLTDHF
        +   +KQ L +  VL+  DF +   +  D     +GAVLSQ+      +P+ Y+S K+SK++ ++S  ++E+ A++++LK W HYL S    F +LTDH 
Subjt:  SFNLLKQSLSNISVLKLPDFFQDFEVAVDTCGTGIGAVLSQQG-----HPIEYFSEKLSKSRQSWSTYEQELYALVRALKQWEHYLLS--REFILLTDHF

Query:  SL--KYLHSQKSISMMHARWISYIQRFDFVIKHQAGKENKVVDALNR
        +L  +  +  +  +   ARW  ++Q F+F I ++ G  N + DAL+R
Subjt:  SL--KYLHSQKSISMMHARWISYIQRFDFVIKHQAGKENKVVDALNR

P20825 Retrovirus-related Pol polyprotein from transposon 2974.3e-7739.15Show/hide
Query:  LHDTIEELLKKGHIKPSFSLCAVPALLTPKKD-----GTWRMCVDSRAINKITVKYRFSIPRVSDLLDQLGGACIFLKIDLRSYYHQIRIRPGDEWKTTF
        + + ++E+L +G I+ S S    P  + PKK        +R+ +D R +N+IT+  R+ IP + ++L +LG    F  IDL   +HQI +      KT F
Subjt:  LHDTIEELLKKGHIKPSFSLCAVPALLTPKKD-----GTWRMCVDSRAINKITVKYRFSIPRVSDLLDQLGGACIFLKIDLRSYYHQIRIRPGDEWKTTF

Query:  KTNEGLFEWLVMPFGLSNAPSTFMRLMNKVLHPFLNKFIIVYFDDILVFSKTYDQHLQHIDQLFQVLNHNELYVNLKRCIFCSNEIAFLGFIIRKDHVLM
         T  G +E+L MPFGL NAP+TF R MN +L P LNK  +VY DDI++FS +  +HL  I  +F  L    L + L +C F   E  FLG I+  D +  
Subjt:  KTNEGLFEWLVMPFGLSNAPSTFMRLMNKVLHPFLNKFIIVYFDDILVFSKTYDQHLQHIDQLFQVLNHNELYVNLKRCIFCSNEIAFLGFIIRKDHVLM

Query:  DEKKVEAIKNWSTPTTVKQVQEFLGLASFYRKFIQNFSSIAAPITDCLKKGAFEWGPKQQ--DSFNLLKQSLSNISVLKLPDFFQDFEVAVDTCGTGIGA
        +  KV+AI ++  PT  K+++ FLGL  +YRKFI N++ IA P+T CLKK       K +  ++F  LK  +    +L+LPDF + F +  D     +GA
Subjt:  DEKKVEAIKNWSTPTTVKQVQEFLGLASFYRKFIQNFSSIAAPITDCLKKGAFEWGPKQQ--DSFNLLKQSLSNISVLKLPDFFQDFEVAVDTCGTGIGA

Query:  VLSQQGHPIEYFSEKLSKSRQSWSTYEQELYALVRALKQWEHYLLSREFILLTDHFSLKYLHSQKSISMMHARWISYIQRFDFVIKHQAGKENKVVDALN
        VLSQ GHPI + S  L+    ++S  E+EL A+V A K + HYLL R+F++ +DH  L++LH+ K       RW   +  + F I +  GKEN V DAL+
Subjt:  VLSQQGHPIEYFSEKLSKSRQSWSTYEQELYALVRALKQWEHYLLSREFILLTDHFSLKYLHSQKSISMMHARWISYIQRFDFVIKHQAGKENKVVDALN

Query:  R
        R
Subjt:  R

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein1.1e-8038.14Show/hide
Query:  IQHDIELLLGASFPHLPHYRMSPNEYKILHDTIEELLKKGHIKPSFSLCAVPALLTPKKDGTWRMCVDSRAINKITVKYRFSIPRVSDLLDQLGGACIFL
        ++HDIE+  GA  P L  Y ++    + ++  +++LL    I PS S C+ P +L PKKDGT+R+CVD R +NK T+   F +PR+ +LL ++G A IF 
Subjt:  IQHDIELLLGASFPHLPHYRMSPNEYKILHDTIEELLKKGHIKPSFSLCAVPALLTPKKDGTWRMCVDSRAINKITVKYRFSIPRVSDLLDQLGGACIFL

Query:  KIDLRSYYHQIRIRPGDEWKTTFKTNEGLFEWLVMPFGLSNAPSTFMRLMNKVLHPFLNKFIIVYFDDILVFSKTYDQHLQHIDQLFQVLNHNELYVNLK
         +DL S YHQI + P D +KT F T  G +E+ VMPFGL NAPSTF R M         +F+ VY DDIL+FS++ ++H +H+D + + L +  L V  K
Subjt:  KIDLRSYYHQIRIRPGDEWKTTFKTNEGLFEWLVMPFGLSNAPSTFMRLMNKVLHPFLNKFIIVYFDDILVFSKTYDQHLQHIDQLFQVLNHNELYVNLK

Query:  RCIFCSNEIAFLGFIIRKDHVLMDEKKVEAIKNWSTPTTVKQVQEFLGLASFYRKFIQNFSSIAAPIT--DCLKKGAFEWGPKQQDSFNLLKQSLSNISV
        +C F S E  FLG+ I    +   + K  AI+++ TP TVKQ Q FLG+ ++YR+FI N S IA PI    C K    +W  KQ  +   LK +L N  V
Subjt:  RCIFCSNEIAFLGFIIRKDHVLMDEKKVEAIKNWSTPTTVKQVQEFLGLASFYRKFIQNFSSIAAPIT--DCLKKGAFEWGPKQQDSFNLLKQSLSNISV

Query:  LKLPDFFQDFEVAVDTCGTGIGAVLSQQGHP------IEYFSEKLSKSRQSWSTYEQELYALVRALKQWEHYLLSREFILLTDHFSLKYLHSQKSISMMH
        L   +   ++ +  D    GIGAVL +  +       + YFS+ L  +++++   E EL  +++AL  + + L  + F L TDH SL  L ++   +   
Subjt:  LKLPDFFQDFEVAVDTCGTGIGAVLSQQGHP------IEYFSEKLSKSRQSWSTYEQELYALVRALKQWEHYLLSREFILLTDHFSLKYLHSQKSISMMH

Query:  ARWISYIQRFDFVIKHQAGKENKVVDALNR
         RW+  +  +DF +++ AG +N V DA++R
Subjt:  ARWISYIQRFDFVIKHQAGKENKVVDALNR

Q99315 Transposon Ty3-G Gag-Pol polyprotein6.4e-8138.14Show/hide
Query:  IQHDIELLLGASFPHLPHYRMSPNEYKILHDTIEELLKKGHIKPSFSLCAVPALLTPKKDGTWRMCVDSRAINKITVKYRFSIPRVSDLLDQLGGACIFL
        ++HDIE+  GA  P L  Y ++    + ++  +++LL    I PS S C+ P +L PKKDGT+R+CVD R +NK T+   F +PR+ +LL ++G A IF 
Subjt:  IQHDIELLLGASFPHLPHYRMSPNEYKILHDTIEELLKKGHIKPSFSLCAVPALLTPKKDGTWRMCVDSRAINKITVKYRFSIPRVSDLLDQLGGACIFL

Query:  KIDLRSYYHQIRIRPGDEWKTTFKTNEGLFEWLVMPFGLSNAPSTFMRLMNKVLHPFLNKFIIVYFDDILVFSKTYDQHLQHIDQLFQVLNHNELYVNLK
         +DL S YHQI + P D +KT F T  G +E+ VMPFGL NAPSTF R M         +F+ VY DDIL+FS++ ++H +H+D + + L +  L V  K
Subjt:  KIDLRSYYHQIRIRPGDEWKTTFKTNEGLFEWLVMPFGLSNAPSTFMRLMNKVLHPFLNKFIIVYFDDILVFSKTYDQHLQHIDQLFQVLNHNELYVNLK

Query:  RCIFCSNEIAFLGFIIRKDHVLMDEKKVEAIKNWSTPTTVKQVQEFLGLASFYRKFIQNFSSIAAPIT--DCLKKGAFEWGPKQQDSFNLLKQSLSNISV
        +C F S E  FLG+ I    +   + K  AI+++ TP TVKQ Q FLG+ ++YR+FI N S IA PI    C K    +W  KQ  + + LK +L N  V
Subjt:  RCIFCSNEIAFLGFIIRKDHVLMDEKKVEAIKNWSTPTTVKQVQEFLGLASFYRKFIQNFSSIAAPIT--DCLKKGAFEWGPKQQDSFNLLKQSLSNISV

Query:  LKLPDFFQDFEVAVDTCGTGIGAVLSQQGHP------IEYFSEKLSKSRQSWSTYEQELYALVRALKQWEHYLLSREFILLTDHFSLKYLHSQKSISMMH
        L   +   ++ +  D    GIGAVL +  +       + YFS+ L  +++++   E EL  +++AL  + + L  + F L TDH SL  L ++   +   
Subjt:  LKLPDFFQDFEVAVDTCGTGIGAVLSQQGHP------IEYFSEKLSKSRQSWSTYEQELYALVRALKQWEHYLLSREFILLTDHFSLKYLHSQKSISMMH

Query:  ARWISYIQRFDFVIKHQAGKENKVVDALNR
         RW+  +  +DF +++ AG +N V DA++R
Subjt:  ARWISYIQRFDFVIKHQAGKENKVVDALNR

Arabidopsis top hitse value%identityAlignment
AT2G15180.1 Zinc knuckle (CCHC-type) family protein2.4e-0632.39Show/hide
Query:  FLDWLKNTENFFAYMRTAKNKKVHLVALKLKGGVSAWWDQITVNHQKQGKHPIRSWEKMKKLMKQRFVPSN
        +L W  N   +F +  TA+  K+ +   +LKG    WWDQ   N   + + PIR+WE++K  M  ++ P +
Subjt:  FLDWLKNTENFFAYMRTAKNKKVHLVALKLKGGVSAWWDQITVNHQKQGKHPIRSWEKMKKLMKQRFVPSN

AT4G13320.1 unknown protein1.1e-1431.97Show/hide
Query:  LFKTRCTIQGKVCNVIIDSGSSENFVSKKLVTALNLKTQPHEKPYKIGWIKKGGETLISEICYVPISIGNSYKDQMVCDVIKI--DVCHILLGRPWQFDV
        +F+T+C I  + C +++  G+  N +SK LV  L LKT       ++    +  + +  E C VP+SIG+ YKD++ C V+ +  +   +L G PW + V
Subjt:  LFKTRCTIQGKVCNVIIDSGSSENFVSKKLVTALNLKTQPHEKPYKIGWIKKGGETLISEICYVPISIGNSYKDQMVCDVIKI--DVCHILLGRPWQFDV

Query:  QSMHRGRENTYEFIWINKKVIL
        Q+ H GR+++   IW +  ++L
Subjt:  QSMHRGRENTYEFIWINKKVIL

ATMG00860.1 DNA/RNA polymerases superfamily protein4.2e-1933.33Show/hide
Query:  LQHIDQLFQVLNHNELYVNLKRCIFCSNEIAFLG--FIIRKDHVLMDEKKVEAIKNWSTPTTVKQVQEFLGLASFYRKFIQNFSSIAAPITDCLKKGAFE
        + H+  + Q+   ++ Y N K+C F   +IA+LG   II  + V  D  K+EA+  W  P    +++ FLGL  +YR+F++N+  I  P+T+ LKK + +
Subjt:  LQHIDQLFQVLNHNELYVNLKRCIFCSNEIAFLG--FIIRKDHVLMDEKKVEAIKNWSTPTTVKQVQEFLGLASFYRKFIQNFSSIAAPITDCLKKGAFE

Query:  WGPKQQDSFNLLKQSLSNISVLKLPDFFQDFEVAV
        W      +F  LK +++ + VL LPD    F   V
Subjt:  WGPKQQDSFNLLKQSLSNISVLKLPDFFQDFEVAV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGATTGATCTCCCGAGCTACGACGGTAAACGCAACATTGAAATTTTTTTAGATTGGTTAAAGAACACAGAGAATTTTTTTGCTTATATGAGGACAGCCAAAAACAA
AAAAGTTCACTTAGTAGCGTTGAAGTTAAAAGGTGGTGTTTCAGCGTGGTGGGATCAAATCACAGTGAATCATCAAAAACAAGGAAAGCACCCTATTCGATCGTGGGAAA
AGATGAAGAAACTGATGAAACAGAGGTTTGTACCTTCTAATTATGAGCAGACACTTTACACACAATACCAAAATTGTAGACAAGGAATGCGAAAAACGGCTGAATACATC
GAAGAATTTCACCGGTTGGGGGCTAGAACAAATCTAATGGAAGGAGAACAACACTTAATCTCATGGTTTGTGGGAGGCCTACGATTTGATTTAAAAGAGAAGGTTAAGCT
CCAACCATTTCAACATTTATCAGAAGCCATTACTTATGCTGAAACTGTAGAAGAGATGATTGAAAACAGAGCAAAATCAACAAGAAAAAGACCCTGGGAACCTTCTGCCT
CAAAGAAAACTACTGCAGGAAATTCCAAATTAAAGAACTCAACATCTGAAAAACCAGTAGAACAAGAAGAGAGTAGTGGAAAGAAAGAAGTTCCAGAAGGAGAAAAGAAA
GGAAAAAATCCTTATCAACGTCTCTTCTCAGGTAACTGCTATAGACGTGGACAGATAGGACATCCTTCAAACCAATATCCACAAAGGAAGACTATAGCAGTAGCAGAAGA
TAATGATGATGGAAGTAATAGAAGTCTGGGAGAATTTGATGAAAAGACTGAAGTGATTGAAGCTGATGAAGGGGATAGCCTATCTTGCATCTTACTGAGAGTTCTTATTT
CTCCAAAAGAAGAAAATCAATTGCAGCGACATAGTTTATTCAAGACAAGATGCACAATTCAAGGCAAAGTTTGCAATGTAATCATCGATAGTGGAAGCAGCGAAAACTTC
GTGTCCAAAAAACTAGTTACAGCCCTCAACCTCAAAACTCAACCGCATGAAAAACCTTACAAAATCGGCTGGATAAAGAAAGGAGGAGAAACGTTGATCTCTGAAATTTG
TTATGTTCCTATCTCTATTGGGAATAGCTACAAAGATCAAATGGTATGTGATGTAATTAAAATAGACGTTTGCCACATCCTATTGGGGAGACCTTGGCAGTTTGATGTCC
AAAGTATGCACAGAGGCCGAGAGAATACCTATGAGTTCATTTGGATAAATAAAAAAGTAATTCTCCTTCCTTTACAAAAAAGAAAAGATGATAATATTGAGAAAAATCAG
AAGAAAGGCAGCCTATTCGTAACTATCAGTGGAAAGAAATTTCTTAGAGAAAGAGAGAATGAAATTTTGGGCATTGTCATGTCCGGGACTGAAGATACAACAAGAGATGA
ACAAATCCCTGAAGCCATTAAAGAATTGTTTAAGAAGTACCCTAAAATCAGCAAAGAACCTACATGCCTTCCTCCTTTGCGTGATATACAGCACGATATTGAATTACTTC
TAGGTGCCTCTTTTCCTCACTTACCTCACTATCGCATGAGCCCAAATGAATATAAAATTTTACATGATACCATTGAAGAACTACTAAAAAAAGGGCACATTAAGCCAAGC
TTCAGTCTTTGTGCAGTACCAGCACTACTAACTCCCAAAAAAGATGGAACTTGGAGGATGTGTGTGGACAGTAGAGCTATCAACAAAATCACAGTAAAATATAGATTTTC
AATCCCGAGGGTCAGTGATTTGTTAGATCAATTGGGAGGTGCTTGTATTTTTTTGAAGATTGATCTAAGAAGTTATTATCATCAAATACGTATCAGACCTGGAGATGAAT
GGAAAACAACCTTCAAGACCAATGAAGGACTTTTTGAGTGGCTTGTAATGCCATTTGGCCTTTCTAATGCTCCTAGCACTTTCATGAGGCTGATGAACAAGGTACTGCAC
CCTTTCTTAAACAAGTTTATTATAGTCTACTTTGATGACATTCTTGTCTTTAGCAAAACCTATGATCAACACCTCCAACACATTGACCAGCTGTTCCAAGTACTTAACCA
CAATGAACTTTATGTAAATCTCAAGAGGTGTATTTTCTGCTCCAATGAAATAGCCTTCTTGGGGTTCATAATCAGAAAAGACCATGTTTTAATGGATGAGAAGAAGGTAG
AGGCAATTAAAAACTGGTCAACTCCAACTACTGTCAAACAAGTGCAAGAATTCTTGGGATTGGCTTCATTCTATAGGAAGTTTATTCAAAACTTTAGTTCCATTGCTGCA
CCTATAACAGATTGTTTAAAGAAAGGAGCATTCGAATGGGGCCCTAAACAACAAGACAGTTTCAACCTATTAAAACAAAGTCTAAGCAACATCTCAGTTCTTAAACTACC
AGATTTTTTCCAAGATTTTGAAGTAGCAGTAGACACCTGTGGCACTGGTATTGGAGCTGTCCTTTCTCAACAAGGACACCCAATCGAATATTTTAGTGAGAAACTAAGCA
AGTCTAGACAGTCATGGAGTACTTATGAACAAGAACTGTACGCATTGGTGAGGGCTCTAAAACAATGGGAGCACTATCTTCTCTCTAGAGAGTTTATACTCTTGACAGAT
CACTTCTCATTGAAATACCTACACTCTCAAAAATCAATCAGCATGATGCATGCTAGATGGATTTCCTATATCCAAAGATTCGACTTTGTCATTAAACACCAAGCTGGAAA
GGAGAATAAAGTTGTTGATGCACTGAATAGAAAAAGCAACTTATTATTTATTCTATCCTCAGAAATCACTGCCTTCAATCACATAACCAAACTATATAAGAAAGACCGTG
ATTTTGAAAGTATATGGGAGAAGTGA
mRNA sequenceShow/hide mRNA sequence
TTTAGTTGTTACCAAACTTGGTATCAGAGCATCTTCGGACTTGGGACCGATGATCAATCAGCGTGGTAGAGCTTCAGCGGCTGAGGAACGACAAGAGGTGGGAGAAACCC
CCATCCTTTCTCCAAGGACTTCCTCAAGATGCTTACGGTCAGTGGAAGCATCCATCGAAGAAATCCGGCAACTGTTGAACGGCGTTGTACATCGATTAGATGAAGAAAAT
GCACGGCGAAATGATAGAGATGTAGAACCGCCGACATTACAAAATTGGGGAAGGAGAGGTAGAAGAGGTCTTGAATATTTCAGACCTCAACGAAATTTTCAAGAAAGAAT
AATACCAGAAGATCAGTGGCTACTTCCGCAGGGCAGACGCGATCGAAGGATCTAGTGGCGAGCGCGCAAAGAAGAAATTCAAAACTCCTCTTCTTCAGAAGAAAGCGACA
ACGATGATATTAATGAATTCAGACGAAACCGATATGTCCAAAACGAAAGACAGCAACGAGAAAATTCTAAATATAAAATGAAGATTGATCTCCCGAGCTACGACGGTAAA
CGCAACATTGAAATTTTTTTAGATTGGTTAAAGAACACAGAGAATTTTTTTGCTTATATGAGGACAGCCAAAAACAAAAAAGTTCACTTAGTAGCGTTGAAGTTAAAAGG
TGGTGTTTCAGCGTGGTGGGATCAAATCACAGTGAATCATCAAAAACAAGGAAAGCACCCTATTCGATCGTGGGAAAAGATGAAGAAACTGATGAAACAGAGGTTTGTAC
CTTCTAATTATGAGCAGACACTTTACACACAATACCAAAATTGTAGACAAGGAATGCGAAAAACGGCTGAATACATCGAAGAATTTCACCGGTTGGGGGCTAGAACAAAT
CTAATGGAAGGAGAACAACACTTAATCTCATGGTTTGTGGGAGGCCTACGATTTGATTTAAAAGAGAAGGTTAAGCTCCAACCATTTCAACATTTATCAGAAGCCATTAC
TTATGCTGAAACTGTAGAAGAGATGATTGAAAACAGAGCAAAATCAACAAGAAAAAGACCCTGGGAACCTTCTGCCTCAAAGAAAACTACTGCAGGAAATTCCAAATTAA
AGAACTCAACATCTGAAAAACCAGTAGAACAAGAAGAGAGTAGTGGAAAGAAAGAAGTTCCAGAAGGAGAAAAGAAAGGAAAAAATCCTTATCAACGTCTCTTCTCAGGT
AACTGCTATAGACGTGGACAGATAGGACATCCTTCAAACCAATATCCACAAAGGAAGACTATAGCAGTAGCAGAAGATAATGATGATGGAAGTAATAGAAGTCTGGGAGA
ATTTGATGAAAAGACTGAAGTGATTGAAGCTGATGAAGGGGATAGCCTATCTTGCATCTTACTGAGAGTTCTTATTTCTCCAAAAGAAGAAAATCAATTGCAGCGACATA
GTTTATTCAAGACAAGATGCACAATTCAAGGCAAAGTTTGCAATGTAATCATCGATAGTGGAAGCAGCGAAAACTTCGTGTCCAAAAAACTAGTTACAGCCCTCAACCTC
AAAACTCAACCGCATGAAAAACCTTACAAAATCGGCTGGATAAAGAAAGGAGGAGAAACGTTGATCTCTGAAATTTGTTATGTTCCTATCTCTATTGGGAATAGCTACAA
AGATCAAATGGTATGTGATGTAATTAAAATAGACGTTTGCCACATCCTATTGGGGAGACCTTGGCAGTTTGATGTCCAAAGTATGCACAGAGGCCGAGAGAATACCTATG
AGTTCATTTGGATAAATAAAAAAGTAATTCTCCTTCCTTTACAAAAAAGAAAAGATGATAATATTGAGAAAAATCAGAAGAAAGGCAGCCTATTCGTAACTATCAGTGGA
AAGAAATTTCTTAGAGAAAGAGAGAATGAAATTTTGGGCATTGTCATGTCCGGGACTGAAGATACAACAAGAGATGAACAAATCCCTGAAGCCATTAAAGAATTGTTTAA
GAAGTACCCTAAAATCAGCAAAGAACCTACATGCCTTCCTCCTTTGCGTGATATACAGCACGATATTGAATTACTTCTAGGTGCCTCTTTTCCTCACTTACCTCACTATC
GCATGAGCCCAAATGAATATAAAATTTTACATGATACCATTGAAGAACTACTAAAAAAAGGGCACATTAAGCCAAGCTTCAGTCTTTGTGCAGTACCAGCACTACTAACT
CCCAAAAAAGATGGAACTTGGAGGATGTGTGTGGACAGTAGAGCTATCAACAAAATCACAGTAAAATATAGATTTTCAATCCCGAGGGTCAGTGATTTGTTAGATCAATT
GGGAGGTGCTTGTATTTTTTTGAAGATTGATCTAAGAAGTTATTATCATCAAATACGTATCAGACCTGGAGATGAATGGAAAACAACCTTCAAGACCAATGAAGGACTTT
TTGAGTGGCTTGTAATGCCATTTGGCCTTTCTAATGCTCCTAGCACTTTCATGAGGCTGATGAACAAGGTACTGCACCCTTTCTTAAACAAGTTTATTATAGTCTACTTT
GATGACATTCTTGTCTTTAGCAAAACCTATGATCAACACCTCCAACACATTGACCAGCTGTTCCAAGTACTTAACCACAATGAACTTTATGTAAATCTCAAGAGGTGTAT
TTTCTGCTCCAATGAAATAGCCTTCTTGGGGTTCATAATCAGAAAAGACCATGTTTTAATGGATGAGAAGAAGGTAGAGGCAATTAAAAACTGGTCAACTCCAACTACTG
TCAAACAAGTGCAAGAATTCTTGGGATTGGCTTCATTCTATAGGAAGTTTATTCAAAACTTTAGTTCCATTGCTGCACCTATAACAGATTGTTTAAAGAAAGGAGCATTC
GAATGGGGCCCTAAACAACAAGACAGTTTCAACCTATTAAAACAAAGTCTAAGCAACATCTCAGTTCTTAAACTACCAGATTTTTTCCAAGATTTTGAAGTAGCAGTAGA
CACCTGTGGCACTGGTATTGGAGCTGTCCTTTCTCAACAAGGACACCCAATCGAATATTTTAGTGAGAAACTAAGCAAGTCTAGACAGTCATGGAGTACTTATGAACAAG
AACTGTACGCATTGGTGAGGGCTCTAAAACAATGGGAGCACTATCTTCTCTCTAGAGAGTTTATACTCTTGACAGATCACTTCTCATTGAAATACCTACACTCTCAAAAA
TCAATCAGCATGATGCATGCTAGATGGATTTCCTATATCCAAAGATTCGACTTTGTCATTAAACACCAAGCTGGAAAGGAGAATAAAGTTGTTGATGCACTGAATAGAAA
AAGCAACTTATTATTTATTCTATCCTCAGAAATCACTGCCTTCAATCACATAACCAAACTATATAAGAAAGACCGTGATTTTGAAAGTATATGGGAGAAGTGATGTAACC
ATACATATGCTGGAGATTTCCACATATTTGATGGTTTCCTATTAAAAAATGATCTCCTTTGCATACCACACATGTTCCCTAGACCTTGTTGCAGACTCCCTATTCTTGAT
CATCCTCCTTTGTCTTTGCTCTGCAGCTTTATCTAAAGGTTCCAAAGTAGCTCGACCTCTTTTACCTCTCCCCCCACTTCCACTACTACCAATCAAATCAACCCCATTAG
CAAATCCAATAATGGATCCTTCAACCTTATCCAATGCCTGGAATGTGGTGGAAGGAATTGGATCAAACAAAAAAATCCCACCACTGAGCCTCTCAGTCTGAGGAAATTTA
ATATCCTTGACAGGCACAACTCCATATTTCATAAGAAAATCCTCAAGGGTTATGATCTTATTAGCAACCTCCTCCTTCAATTCCTTCCTCTCACTAGAAACAATCTCCCT
CTACACATCATCCACAGTTTTAGTGGGTGGTGCAGTAGAGGAAGAATTATGATTACTAACTATGACGACAGCGGAATTAGTGATGGTGGGGGTGGTTGTATTGATGGGAA
GAGAATAGGGGTTAGAAAAATCAACTAATCTGTGCATCTAAGAGAATAGACGACTCCGTGGGATTAGAATCGTAGCCATTAGGAAGTAAGCCATTGACGGTCATTGTATG
GGGTGGTGGATTACGAGATGGGAAGTCATAATTCCTCGAATGATTGGAGAGAAATTGGGGTTTTAAAAGGGAAGCAGAAGAGGAAGAAGAAGAAGAACCCTGAGATAGAT
TTGAATTTCGCGAATTTGAAGAAGGGAAGAGCTTGGATGAAGCCATATTGGACGTCAAAATAGGCAGGGAAATAGGAGACATTCTCTCGAAACTCGAGTAGGAATGTTGT
GGAAGAATGAGTATGGGGAACTGAGAGGGGCGAGGTTTTGAGGTTTTTTGTTGTGATGATGGGTGTGAAGAGGGAGAGAAGAAGAAGAGAGGGCATTGGGTTTGGGACAG
AGAGAAAGAAAGAAAGATGGCGAGTGAGAGCGAGTGAGAAGTTGCAGAAAGCGAAGGAGGAAGATGGAACCGGGAGTAGGCGGTTACCGGTGAAGGGTTAATATTTATTT
TTATATATTTTGAAGTTTCTAGATGCTTTAGGAAAACTCTAAGCCTATGGCCGGCCTCCACGGCCTCCACTGTCCCATCACCCAAATGCCGTCTTTTTCCTTTTCTTTTT
CTTTTCGTTTTCTTTTCTTTTCCTTTTAAAATAATAATCAATTAAAATATCTTTTCATAATTACCATTCCTTTTGATTTTTATTTCGTTTTTAAACACATGGAAATGATC
TTCTTTTTGTTTTAAATTCAAAAACTTAAATTCCAATGTTTTTTAGTATTTTAAATCAATCTACCATTTATCATCTTCGCATGTTGGTATGGTACAAATCAGTTAGAAAT
GAGGGTAACATGATGACTATTGTGTTTTGTGGTTTGTATGGTTGCATCAATAGAGTAGAGTAGTGATTCAACTTAGGCTTAGGCATAGTTCTAACATATATAGGTCTAGT
ATTGATTTGGGTGAGCTTGAAGTAGCCTAACTTCTTAAGCATCGGTCCATAGCCCAATCTTTTTTGTAAGTCAAAATGTGGATTGGACTTTGCCTTGCATCCGCTTTGGA
TCAACCCATCAAATAAAAAAGATATTTAAAGGGGTTTTGATTATTAAATTATGGGGTTTCAGCTAATATAGCCTAAATTACTAAATGATATTATAC
Protein sequenceShow/hide protein sequence
MKIDLPSYDGKRNIEIFLDWLKNTENFFAYMRTAKNKKVHLVALKLKGGVSAWWDQITVNHQKQGKHPIRSWEKMKKLMKQRFVPSNYEQTLYTQYQNCRQGMRKTAEYI
EEFHRLGARTNLMEGEQHLISWFVGGLRFDLKEKVKLQPFQHLSEAITYAETVEEMIENRAKSTRKRPWEPSASKKTTAGNSKLKNSTSEKPVEQEESSGKKEVPEGEKK
GKNPYQRLFSGNCYRRGQIGHPSNQYPQRKTIAVAEDNDDGSNRSLGEFDEKTEVIEADEGDSLSCILLRVLISPKEENQLQRHSLFKTRCTIQGKVCNVIIDSGSSENF
VSKKLVTALNLKTQPHEKPYKIGWIKKGGETLISEICYVPISIGNSYKDQMVCDVIKIDVCHILLGRPWQFDVQSMHRGRENTYEFIWINKKVILLPLQKRKDDNIEKNQ
KKGSLFVTISGKKFLRERENEILGIVMSGTEDTTRDEQIPEAIKELFKKYPKISKEPTCLPPLRDIQHDIELLLGASFPHLPHYRMSPNEYKILHDTIEELLKKGHIKPS
FSLCAVPALLTPKKDGTWRMCVDSRAINKITVKYRFSIPRVSDLLDQLGGACIFLKIDLRSYYHQIRIRPGDEWKTTFKTNEGLFEWLVMPFGLSNAPSTFMRLMNKVLH
PFLNKFIIVYFDDILVFSKTYDQHLQHIDQLFQVLNHNELYVNLKRCIFCSNEIAFLGFIIRKDHVLMDEKKVEAIKNWSTPTTVKQVQEFLGLASFYRKFIQNFSSIAA
PITDCLKKGAFEWGPKQQDSFNLLKQSLSNISVLKLPDFFQDFEVAVDTCGTGIGAVLSQQGHPIEYFSEKLSKSRQSWSTYEQELYALVRALKQWEHYLLSREFILLTD
HFSLKYLHSQKSISMMHARWISYIQRFDFVIKHQAGKENKVVDALNRKSNLLFILSSEITAFNHITKLYKKDRDFESIWEK