; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0021057 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0021057
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr7:4285873..4290752
RNA-Seq ExpressionLag0021057
SyntenyLag0021057
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR001878 - Zinc finger, CCHC-type
IPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_018816058.1 uncharacterized protein LOC108987582 [Juglans regia]1.4e-21434.2Show/hide
Query:  FQRSMPKIWGLQGRVQLKKSGKNLYVCKFKNKRDKIRIVDGGPWIYDNAILVFDDPKGNKDIKSLEFRYAQFWFHFHNLPVVCLNRKYATALGNSVGTFI
        F+ +M KIW  +G +Q K  G N  + +F+  RDK +++ G PW +D A++   +  G   +K ++F    FW   HN+P+ C++++    L + +   +
Subjt:  FQRSMPKIWGLQGRVQLKKSGKNLYVCKFKNKRDKIRIVDGGPWIYDNAILVFDDPKGNKDIKSLEFRYAQFWFHFHNLPVVCLNRKYATALGNSVGTFI

Query:  KLEEGTEERVWGETLRVKVEVDVNKPIRRGTNIKVGSMAEEKWIPITIEKLPDFCYQCGRLGHVMRDCEEDENNELEAEECQYGPWMR-------ESFIF
         +E       WG  LR KVEVD+ KP+ RG  + V     +KWIP   E+LP FC++CG + H  +       N+LEA   QYGPW+R       ++   
Subjt:  KLEEGTEERVWGETLRVKVEVDVNKPIRRGTNIKVGSMAEEKWIPITIEKLPDFCYQCGRLGHVMRDCEEDENNELEAEECQYGPWMR-------ESFIF

Query:  RNESKKKKEKQTDEQERNRGDNTTVE--DSKNHSSDGELPESHRRRKSDEKKATESTTEAEIIKKLTLTENSEQRAVETEHSRKGPLE--------EEIK
        R    K  + Q  ++  N     T+E  ++++++ D        + + +E+K             +T+ +N  +          GP+E        +E  
Subjt:  RNESKKKKEKQTDEQERNRGDNTTVE--DSKNHSSDGELPESHRRRKSDEKKATESTTEAEIIKKLTLTENSEQRAVETEHSRKGPLE--------EEIK

Query:  TRQPEETGRASGEQLDENIVEEGPTLNSMVQLEKKEEAKGPQTEGGRAHESQGIVKKNQDAQK--GKDRI---SPNITQTKKWKRLARIDPMQQDEDNRI
        + + E  G     Q  ++I +    +N+ + L     +         +  +Q I + ++ A+K   K R    SP +T              +Q E   +
Subjt:  TRQPEETGRASGEQLDENIVEEGPTLNSMVQLEKKEEAKGPQTEGGRAHESQGIVKKNQDAQK--GKDRI---SPNITQTKKWKRLARIDPMQQDEDNRI

Query:  METQG--TKRKTEGEESEEG-----------TPPDAMKTLCWNVRGLGNPRTLRALRHVVESEKPQLVFIIETKCDGYKGEQIKKSLSFDFCWCVPSKGS
          + G  +K+    +   EG            PP  M  + WN RGLGNPRT+  L  +++ + P ++F +ETKC   + E++ K L FD C  V S+GS
Subjt:  METQG--TKRKTEGEESEEG-----------TPPDAMKTLCWNVRGLGNPRTLRALRHVVESEKPQLVFIIETKCDGYKGEQIKKSLSFDFCWCVPSKGS

Query:  SGGLLLLWNSGVKLSIKSWSEGHIDSMF----TFNGWTGRFTGFYGNPEVVKRRDSWSLLERLVENDDVPWLVGGDFNEILSADEKKGGAVRNQSQMNGF
        SGGL L+W   + +SI ++S  HI +        N W    TGFYG+PE  KR  SW LL+ +   + V WL  GDFNEI   +EK GG  R   QM  F
Subjt:  SGGLLLLWNSGVKLSIKSWSEGHIDSMF----TFNGWTGRFTGFYGNPEVVKRRDSWSLLERLVENDDVPWLVGGDFNEILSADEKKGGAVRNQSQMNGF

Query:  QEVINICRLMDAGFRGDMFTWRRGRQKKKQIKERLDRFFVNHALSQKSQSIRVQHLSFFSSDHRPIIATIDTSPVLLANRVKGRLLR-------------
        +EVI  C L     RG  FTW   R+ +   KE+LDR   N +   +      Q L   +SDH P++  + T  V+  N+ K  L R             
Subjt:  QEVINICRLMDAGFRGDMFTWRRGRQKKKQIKERLDRFFVNHALSQKSQSIRVQHLSFFSSDHRPIIATIDTSPVLLANRVKGRLLR-------------

Query:  --------------GAHQINNRLRVCIKNLHRWS-KDRLKGSISSAIRKKEEEIQKLEAENHSLDGNEIEAAELELNALLEEEEYYWRSRSREVWLKSGD
                      GA  +N RL+ C   L +W+  D ++     AI  K + +++++  N       I+  + EL   L E++  W+ R+++ WL++GD
Subjt:  --------------GAHQINNRLRVCIKNLHRWS-KDRLKGSISSAIRKKEEEIQKLEAENHSLDGNEIEAAELELNALLEEEEYYWRSRSREVWLKSGD

Query:  KNTKWFHAKASQRRKRNRIEGIFDSQGVWREDEEGIARVATGYFRDIFQSSNPGRGQISEVSDCVNMKLSDQQRGVLDQPYSKQEVEVAIKNLHPGKAPG
        KNTK+FH  A+QRRK N+I  + D       ++E I  V T +F D+F SSNP    I    + V+ K+S     +L Q ++++EV+VA+  +    +PG
Subjt:  KNTKWFHAKASQRRKRNRIEGIFDSQGVWREDEEGIARVATGYFRDIFQSSNPGRGQISEVSDCVNMKLSDQQRGVLDQPYSKQEVEVAIKNLHPGKAPG

Query:  IDGTHASFFQAYWDIVGEDTVRECLQILNHDADITPFNKTLITLIPKVRDPKRMQEYRPISLCNVTYKIVAKTIANRFKKILDAVISPTQAAFVPGRLIS
         DG  A F+Q++WDI+G++  R  L ILN    +   N+T ITLIPKV++PKR+ +YRPISLCNV YKIVAK ++NR K +L  +IS  Q+AFVPGR IS
Subjt:  IDGTHASFFQAYWDIVGEDTVRECLQILNHDADITPFNKTLITLIPKVRDPKRMQEYRPISLCNVTYKIVAKTIANRFKKILDAVISPTQAAFVPGRLIS

Query:  DNVLVGFECIHAINNRKRGKEGHVAIKLDMSKAYDRVEWEYIRNVFTKMGFSEKWIQNIMKCVETVTFSVLMNGVPQEEFKPGRGIRQGDPLSPYIFLMC
        +N++V +E +H +  R + K G +A+KLD+SKAYDRVEW+++ +V +++GF + WI  ++ C+ + ++S+L+NG  Q  FKP RG+RQGDPLSPY+F++C
Subjt:  DNVLVGFECIHAINNRKRGKEGHVAIKLDMSKAYDRVEWEYIRNVFTKMGFSEKWIQNIMKCVETVTFSVLMNGVPQEEFKPGRGIRQGDPLSPYIFLMC

Query:  AEGFSGLLNREVESQNLKGFRINNFCPPLSHLFFADDSLIFCRSSMEECHTIKKVFKAYELASGQAINLDKSRFMVSKNVLQGEAQAIGDILGIKQAMNL
         E  S ++ +     ++    +      +SHLFFADDSLIFC+S+  E   + ++   YE ASGQ +N +KS    SKN      + I  I G++ + + 
Subjt:  AEGFSGLLNREVESQNLKGFRINNFCPPLSHLFFADDSLIFCRSSMEECHTIKKVFKAYELASGQAINLDKSRFMVSKNVLQGEAQAIGDILGIKQAMNL

Query:  GVYLGMPSQAIRSKTQMFSKLKDKIRNILQGWSEKLFSAGGKEVLIKAVIQAIPTYSMSCFKLPKSLCKEINNMCARFWWGATGDKKKLHWAKWNDLCKS
          YLG+PS   R+KT  F  L DK    +  W     SA GKE+L+K+V+QAIPTY+M  F+LP+S+ K ++ +C +FWWG+T  + K+ W KW+ L + 
Subjt:  GVYLGMPSQAIRSKTQMFSKLKDKIRNILQGWSEKLFSAGGKEVLIKAVIQAIPTYSMSCFKLPKSLCKEINNMCARFWWGATGDKKKLHWAKWNDLCKS

Query:  KTLG
        K  G
Subjt:  KTLG

XP_023878301.1 uncharacterized protein LOC111990748 [Quercus suber]1.3e-19944.55Show/hide
Query:  EQIKKSLSFDFCWCVPSKGSSGGLLLLWNSGVKLSIKSWSEGHIDSMF--TFNGWTGRFTGFYGNPEVVKRRDSWSLLERLVENDDVPWLVGGDFNEILS
        E+IK  + F     VP  G SGG+ LLW   + L +KS++  HID++   T   +  R TGFYG+PE  KR DSW LL  L     +PWL  GDFNEILS
Subjt:  EQIKKSLSFDFCWCVPSKGSSGGLLLLWNSGVKLSIKSWSEGHIDSMF--TFNGWTGRFTGFYGNPEVVKRRDSWSLLERLVENDDVPWLVGGDFNEILS

Query:  ADEKKGGAVRNQSQMNGFQEVINICRLMDAGFRGDMFTWRRGRQKKKQIKERLDRFFVNHALSQKSQSIRVQHLSFFSSDHRPIIAT-------------
         +EK GGA R+Q QM+GF++++N C   D G+ G  +TW   ++ + +I  RLDR       S K   ++V HL   + DH  ++ T             
Subjt:  ADEKKGGAVRNQSQMNGFQEVINICRLMDAGFRGDMFTWRRGRQKKKQIKERLDRFFVNHALSQKSQSIRVQHLSFFSSDHRPIIAT-------------

Query:  ---------IDTSPVLLANRVKGRLLRGAHQINNRLRVCIKNLHRWSKDRLKGSISSAIRKKEEEIQKLEAENHSLD-GNEIEAAELELNALLEEEEYYW
                  D   ++ A+   G  L     I+  LR+C   L +WS   + G I   I+ K   +  L       D   EI     E+NALL++EE YW
Subjt:  ---------IDTSPVLLANRVKGRLLRGAHQINNRLRVCIKNLHRWSKDRLKGSISSAIRKKEEEIQKLEAENHSLD-GNEIEAAELELNALLEEEEYYW

Query:  RSRSREVWLKSGDKNTKWFHAKASQRRKRNRIEGIFDSQGVWREDEEGIARVATGYFRDIFQSSNPGRGQISEVSDCVNMKLSDQQRGVLDQPYSKQEVE
          R++  WLK GD+NTK+FHA+AS+RRK+N I GI+D QG W ++EE IA+ A  YF +I+ SS+P   QI EV++ +  K++++    L + ++K+EV 
Subjt:  RSRSREVWLKSGDKNTKWFHAKASQRRKRNRIEGIFDSQGVWREDEEGIARVATGYFRDIFQSSNPGRGQISEVSDCVNMKLSDQQRGVLDQPYSKQEVE

Query:  VAIKNLHPGKAPGIDGTHASFFQAYWDIVGEDTVRECLQILNHDADITPFNKTLITLIPKVRDPKRMQEYRPISLCNVTYKIVAKTIANRFKKILDAVIS
        VA+K +HP KAPG DG  A FFQ YW IVG +     L +LNH+  I   NKT I+LIPK  +PKRM ++RPISLCNV YK+++K +ANR K +L  +IS
Subjt:  VAIKNLHPGKAPGIDGTHASFFQAYWDIVGEDTVRECLQILNHDADITPFNKTLITLIPKVRDPKRMQEYRPISLCNVTYKIVAKTIANRFKKILDAVIS

Query:  PTQAAFVPGRLISDNVLVGFECIHAINNRKRGKEGHVAIKLDMSKAYDRVEWEYIRNVFTKMGFSEKWIQNIMKCVETVTFSVLMNGVPQEEFKPGRGIR
          Q+AF   RLI+DNVLV FE +H ++++  GKEG +AIKLDMSKA+DRVEW +I  V  +MGF  +W   +M+C+ +V++S+L+NGV      P RG+R
Subjt:  PTQAAFVPGRLISDNVLVGFECIHAINNRKRGKEGHVAIKLDMSKAYDRVEWEYIRNVFTKMGFSEKWIQNIMKCVETVTFSVLMNGVPQEEFKPGRGIR

Query:  QGDPLSPYIFLMCAEGFSGLLNREVESQNLKGFRINNFCPPLSHLFFADDSLIFCRSSMEECHTIKKVFKAYELASGQAINLDKSRFMVSKNVLQGEAQA
        QGDPLSP +FL+CAEG S L+N+   ++ + G  IN  CP ++HLFFADDS++FC+++ EECH ++ +   YE ASGQ IN DKS    S N  Q     
Subjt:  QGDPLSPYIFLMCAEGFSGLLNREVESQNLKGFRINNFCPPLSHLFFADDSLIFCRSSMEECHTIKKVFKAYELASGQAINLDKSRFMVSKNVLQGEAQA

Query:  IGDILGIKQAMNLGVYLGMPSQAIRSKTQMFSKLKDKIRNILQGWSEKLFSAGGKEVLIKAVIQAIPTYSMSCFKLPKSLCKEINNMCARFWWGATGDKK
        I +ILG  Q      YLG+PS   RSK+Q+F+ LK+K+ + L GW  KL S GGKE+LIKAV QAIPTY+MSCF LP+ LC ++  M   FWWG    + 
Subjt:  IGDILGIKQAMNLGVYLGMPSQAIRSKTQMFSKLKDKIRNILQGWSEKLFSAGGKEVLIKAVIQAIPTYSMSCFKLPKSLCKEINNMCARFWWGATGDKK

Query:  KLHWAKWNDLCKSKTLG
        K+ W  W  +C SK  G
Subjt:  KLHWAKWNDLCKSKTLG

XP_023886153.1 uncharacterized protein LOC111998282 [Quercus suber]4.8e-20735.21Show/hide
Query:  MEEGKIVQELRKMKITEEEKSGLVEVVDEDIESANKDIENTVVC-KILTSKNIMVDVFQRSMPKIWGLQGRVQLKKSGKNLYVCKFKNKRDKIRIVDGGP
        MEE  ++ + ++  + EEE S +   +DED    N D +  ++  K +T + + V+   R++  +W  +   +++  G ++ +  F ++ +  R++   P
Subjt:  MEEGKIVQELRKMKITEEEKSGLVEVVDEDIESANKDIENTVVC-KILTSKNIMVDVFQRSMPKIWGLQGRVQLKKSGKNLYVCKFKNKRDKIRIVDGGP

Query:  WIYDNAILVFDDPKGNKDIKSLEFRYAQFWFHFHNLPVVCLNRKYATALGNSVGTFIKLEEGTEERVWGETLRVKVEVDVNKPIRRGTNIKVGSMAEEKW
        WIYD  +++     G+  I+SL F    FW   H LPV  LN + A  +G S+G   K  + T E + G  LRV+V ++V +P+ +G  + +G   +E W
Subjt:  WIYDNAILVFDDPKGNKDIKSLEFRYAQFWFHFHNLPVVCLNRKYATALGNSVGTFIKLEEGTEERVWGETLRVKVEVDVNKPIRRGTNIKVGSMAEEKW

Query:  IPITIEKLPDFCYQCGRLGHVMRDCEEDENNELEAEECQYGPWMRESFIFRNES------KKKKEKQTDEQERNRGDNTTVEDSKNHSSDGELPESHRRR
        +    EKLP+FCY CG + H       D    L A+    G   ++SF+F   +      +    +   E E  R       ++  +  D E  E ++  
Subjt:  IPITIEKLPDFCYQCGRLGHVMRDCEEDENNELEAEECQYGPWMRESFIFRNES------KKKKEKQTDEQERNRGDNTTVEDSKNHSSDGELPESHRRR

Query:  KSDEKKATESTTEAEIIKKLTLTENSEQRAVETE-HSRKGPLEEEIKTRQPEETGRASGEQLDEN--IVEE-GPTLNSMVQLEKKEEAKGPQTEGGRAHE
        +   K A  S T    + K   + ++    + ++ H+     EE+I+    E     S      N  I++E  P+L+S VQ+  +++   P T    AH+
Subjt:  KSDEKKATESTTEAEIIKKLTLTENSEQRAVETE-HSRKGPLEEEIKTRQPEETGRASGEQLDEN--IVEE-GPTLNSMVQLEKKEEAKGPQTEGGRAHE

Query:  SQGIVKKNQDAQKGKDRISPNITQTKKWKRLARIDPMQQDEDNRIMETQGTKRKTEGEESEEGTPPDAMKTLCWNVRGLGNPRTLRALRHVVESEKPQLV
        S    +  ++ Q G           + WKRLAR+   Q +E N  + T G K  +  +  +E T P   K L      + +   +R L  ++ ++ P +V
Subjt:  SQGIVKKNQDAQKGKDRISPNITQTKKWKRLARIDPMQQDEDNRIMETQGTKRKTEGEESEEGTPPDAMKTLCWNVRGLGNPRTLRALRHVVESEKPQLV

Query:  FIIETKCDGYKGEQIKKSLSFDFCWCVPSKGSSGGLLLLWNSGVKLSIKSWSEGHIDSMFTFNGWTG--RFTGFYGNPEVVKRRDSWSLLERLVENDDVP
        F+ ET  D  + ++IK+ LSF+    V     +GGL L W + + L +++ S+ HID++    G  G  RFTGFYG P   KR +SW+LL  L     +P
Subjt:  FIIETKCDGYKGEQIKKSLSFDFCWCVPSKGSSGGLLLLWNSGVKLSIKSWSEGHIDSMFTFNGWTG--RFTGFYGNPEVVKRRDSWSLLERLVENDDVP

Query:  WLVGGDFNEILSADEKKGGAVRNQSQMNGFQEVINICRLMDAGFRGDMFTWRRGRQKKKQIKERLDRFFVNHALSQKSQSIRVQHLSFFSSDHRPI----
        W+  GDFNEI    EK GG+VR+ SQM  F++ I+ C  MD GF G  FTW++       + ERLDR   N     +     V HL  FSSDH PI    
Subjt:  WLVGGDFNEILSADEKKGGAVRNQSQMNGFQEVINICRLMDAGFRGDMFTWRRGRQKKKQIKERLDRFFVNHALSQKSQSIRVQHLSFFSSDHRPI----

Query:  ---IATIDTSP-----VLLANRVKGRLLRGAHQINN----------RLRVCIKNLHRWSKDRLKGSISSAIRKKEEEIQKLEAEN-HSLDGNEIEAAELE
           I    + P     + L  R     ++    +N+          ++  C   L +WS  R  GS+   + +K++E+ + E E   S     +    LE
Subjt:  ---IATIDTSP-----VLLANRVKGRLLRGAHQINN----------RLRVCIKNLHRWSKDRLKGSISSAIRKKEEEIQKLEAEN-HSLDGNEIEAAELE

Query:  LNALLEEEEYYWRSRSREVWLKSGDKNTKWFHAKASQRRKRNRIEGIFDSQGVWREDEEGIARVATGYFRDIFQSSNPGRGQISEVSDCVNMKLSDQQRG
        LN L ++E   W  RS+  W K GDKN+K+FHA+A+QR ++N I  +    G W  D+  +A     YF+D+F S+NP    +   +  +   +SD+   
Subjt:  LNALLEEEEYYWRSRSREVWLKSGDKNTKWFHAKASQRRKRNRIEGIFDSQGVWREDEEGIARVATGYFRDIFQSSNPGRGQISEVSDCVNMKLSDQQRG

Query:  VLDQPYSKQEVEVAIKNLHPGKAPGIDGTHASFFQAYWDIVGEDTVRECLQILNHDADITPFNKTLITLIPKVRDPKRMQEYRPISLCNVTYKIVAKTIA
         L   +   EV+ AIK + P KAPG DG    F+Q YWD++GE+     L  LN  +     N T ITLIPKV++P+ + E+RPISLCNV YKI +K +A
Subjt:  VLDQPYSKQEVEVAIKNLHPGKAPGIDGTHASFFQAYWDIVGEDTVRECLQILNHDADITPFNKTLITLIPKVRDPKRMQEYRPISLCNVTYKIVAKTIA

Query:  NRFKKILDAVISPTQAAFVPGRLISDNVLVGFECIHAINNRKRGKEGHVAIKLDMSKAYDRVEWEYIRNVFTKMGFSEKWIQNIMKCVETVTFSVLMNGV
        NR KKIL  +I+  Q+AF   RLISDN+LV FE ++++  R  GKE  +AIKLDMSK+YDRVEW Y+  V  +MGF EKWI+ +M  + TV++S+L+N  
Subjt:  NRFKKILDAVISPTQAAFVPGRLISDNVLVGFECIHAINNRKRGKEGHVAIKLDMSKAYDRVEWEYIRNVFTKMGFSEKWIQNIMKCVETVTFSVLMNGV

Query:  PQEEFKPGRGIRQGDPLSPYIFLMCAEGFSGLLNREVESQNLKGFRINNFCPPLSHLFFADDSLIFCRSSMEECHTIKKVFKAYELASGQAINLDKSRFM
        P     P RGIRQGDPLSP++FL+C EG +GL+++     ++KGF  +   P L+HL FADDSL+FCR++++EC+ I ++ + Y   SGQ IN +K+   
Subjt:  PQEEFKPGRGIRQGDPLSPYIFLMCAEGFSGLLNREVESQNLKGFRINNFCPPLSHLFFADDSLIFCRSSMEECHTIKKVFKAYELASGQAINLDKSRFM

Query:  VSKNVLQGEAQAIGDILGIKQAMNLGVYLGMPSQAIRSKTQMFSKLKDKIRNILQGWSEKLFSAGGKEVLIKAVIQAIPTYSMSCFKLPKSLCKEINNMC
         SK   +   Q I   LG+ +      YLG+PS   R K   F+ +K+K+   LQGW EKL S  G+E+LIKAV+QAIPTY+MSCFKLP  LC E+ ++ 
Subjt:  VSKNVLQGEAQAIGDILGIKQAMNLGVYLGMPSQAIRSKTQMFSKLKDKIRNILQGWSEKLFSAGGKEVLIKAVIQAIPTYSMSCFKLPKSLCKEINNMC

Query:  ARFWWGATGDKKKLHWAKWNDLCKSKTLG
         +FW G  GD++K+HW KW  L +SK +G
Subjt:  ARFWWGATGDKKKLHWAKWNDLCKSKTLG

XP_030923330.1 uncharacterized protein LOC115950239 [Quercus lobata]3.7e-19934.03Show/hide
Query:  EGKIVQELRKMKITEEEKSGLVEVVDEDIESANKDIENTVVCKILTSKNIMVDVFQRSMPKIWGLQGRVQLKKSGKNLYVCKFKNKRDKIRIVDGGPWIY
        E  +   L KMK+T EE+   + + D+    A +    +++ K LT K+      + ++ + WGL   +Q+ + G NL+  KF+++ D  RI+ GGPW +
Subjt:  EGKIVQELRKMKITEEEKSGLVEVVDEDIESANKDIENTVVCKILTSKNIMVDVFQRSMPKIWGLQGRVQLKKSGKNLYVCKFKNKRDKIRIVDGGPWIY

Query:  DNAILVFDDPKGNKDIKSLEFRYAQFWFHFHNLPVVCLNRKYATALGNSVGTFIKLEEGTEERVWGETLRVKVEVDVNKPIRRGTNIKVGSMAEEKWIPI
        DN +L+    K    + ++ F  A  W    + P   ++   A  +G+ +G   ++E    +      +RV+V + + KPIRRG  I  GS   + W+  
Subjt:  DNAILVFDDPKGNKDIKSLEFRYAQFWFHFHNLPVVCLNRKYATALGNSVGTFIKLEEGTEERVWGETLRVKVEVDVNKPIRRGTNIKVGSMAEEKWIPI

Query:  TIEKLPDFCYQCGRLGHVMRDCEEDENNELEAE--ECQYGPWMRESFIFRNESKKKKEKQTDEQERNRGDNTTVEDSKNHSSDGELPESHRRRKSDEKKA
          E+LP FC+ CG LGH +R C      E + E  E QYG ++R        SK    K +       G +   E+  + +++        R K   +K 
Subjt:  TIEKLPDFCYQCGRLGHVMRDCEEDENNELEAE--ECQYGPWMRESFIFRNESKKKKEKQTDEQERNRGDNTTVEDSKNHSSDGELPESHRRRKSDEKKA

Query:  TESTTEAEIIKKLTLTENSEQRAVETEHSRKGPLEEEIKTRQPEETGRASGEQLDENIVEEGPTLNSMVQLEKKE-------EAKGPQTEGGRAHESQ--
          STT A +     L ++    AV  E+ R+   +E +      E    + EQ+  +    G  + +    +K +       E  G     G + +S   
Subjt:  TESTTEAEIIKKLTLTENSEQRAVETEHSRKGPLEEEIKTRQPEETGRASGEQLDENIVEEGPTLNSMVQLEKKE-------EAKGPQTEGGRAHESQ--

Query:  ---GIVKKNQDAQKGKDRISP------NITQTKKWKRLARIDPMQQDEDNRIMETQGTKRKTEGEESEEGTPPDAMKTLCWNVRGLGNPRTLRALRHVVE
            +V + Q        +SP      N+     W R+ R+D    D  N +M   G  ++   E  EE                   P   +  +H   
Subjt:  ---GIVKKNQDAQKGKDRISP------NITQTKKWKRLARIDPMQQDEDNRIMETQGTKRKTEGEESEEGTPPDAMKTLCWNVRGLGNPRTLRALRHVVE

Query:  SEKPQLVFIIETKCDGYKGEQIKKSLSFDFCWCVPSKGSSGGLLLLWNSGVKLSIKSWSEGHIDSMFT-FNGWTGRFTGFYGNPEVVKRRDSWSLLERLV
                                          P  G  GGL  LW + V L + +++  H+ +  T  +G+    TGFYG P   ++ +SW LL+ L 
Subjt:  SEKPQLVFIIETKCDGYKGEQIKKSLSFDFCWCVPSKGSSGGLLLLWNSGVKLSIKSWSEGHIDSMFT-FNGWTGRFTGFYGNPEVVKRRDSWSLLERLV

Query:  ENDDVPWLVGGDFNEILSADEKKGGAVRNQSQMNGFQEVINICRLMDAGFRGDMFTWRRGRQKKKQIKERLDRFFVNHALSQKSQSIRVQHLSFFSSDHR
             PW+V GDFN  L A EK        SQ+  F+E ++ C+L D GF+G  +TW   R  +   K RLDR   N   + + Q  RV HLS  +SDH 
Subjt:  ENDDVPWLVGGDFNEILSADEKKGGAVRNQSQMNGFQEVINICRLMDAGFRGDMFTWRRGRQKKKQIKERLDRFFVNHALSQKSQSIRVQHLSFFSSDHR

Query:  PIIATIDT---------------SPVLLANRVKGRLLR----------GAHQINNRLRVCIKNLHRWSKDRLKGSISSAIRKKEEEIQKLEAENHSLDGN
        P++  + +                  LL +     +            G   +  +++ C   L  W    +    + AI++ ++++ +L  E    + +
Subjt:  PIIATIDT---------------SPVLLANRVKGRLLR----------GAHQINNRLRVCIKNLHRWSKDRLKGSISSAIRKKEEEIQKLEAENHSLDGN

Query:  EIEAAEL--ELNALLEEEEYYWRSRSREVWLKSGDKNTKWFHAKASQRRKRNRIEGIFDSQGVWREDEEGIARVATGYFRDIFQSSNPGRG-QISEVSDC
        + E   L  +++ LL+++E YW  RSR  WL+ GD+NTK+FHAKASQRR++N I GI +SQG W E+ E + +VA  YF ++FQ+   G G Q+ E  D 
Subjt:  EIEAAEL--ELNALLEEEEYYWRSRSREVWLKSGDKNTKWFHAKASQRRKRNRIEGIFDSQGVWREDEEGIARVATGYFRDIFQSSNPGRG-QISEVSDC

Query:  VNMKLSDQQRGVLDQPYSKQEVEVAIKNLHPGKAPGIDGTHASFFQAYWDIVGEDTVRECLQILNHDADITPFNKTLITLIPKVRDPKRMQEYRPISLCN
        V+ K+++  R  L   ++ +EV+ A+  + P KAPG DG +A F+Q +W IVG+  V   L  LN+   +   N T I LIPKV++P+RM E+RPISLCN
Subjt:  VNMKLSDQQRGVLDQPYSKQEVEVAIKNLHPGKAPGIDGTHASFFQAYWDIVGEDTVRECLQILNHDADITPFNKTLITLIPKVRDPKRMQEYRPISLCN

Query:  VTYKIVAKTIANRFKKILDAVISPTQAAFVPGRLISDNVLVGFECIHAINNRKRGKEGHVAIKLDMSKAYDRVEWEYIRNVFTKMGFSEKWIQNIMKCVE
        V YKI++K +ANR K++L  +IS TQ+AFVPGRLI+DNVLV +E +H ++ RK+GK+G VA+KLD+SKAYDRVEW +++++  KMGF   WI+ +M CV 
Subjt:  VTYKIVAKTIANRFKKILDAVISPTQAAFVPGRLISDNVLVGFECIHAINNRKRGKEGHVAIKLDMSKAYDRVEWEYIRNVFTKMGFSEKWIQNIMKCVE

Query:  TVTFSVLMNGVPQEEFKPGRGIRQGDPLSPYIFLMCAEGFSGLLNREVESQNLKGFRINNFCPPLSHLFFADDSLIFCRSSMEECHTIKKVFKAYELASG
        T +FS+L+NG P E  +P RGIRQGDP+SPY+FL+CAEG + LLN+   +  + G  I    P +++L FADDSL+FC+++  E  TI ++ + YE ASG
Subjt:  TVTFSVLMNGVPQEEFKPGRGIRQGDPLSPYIFLMCAEGFSGLLNREVESQNLKGFRINNFCPPLSHLFFADDSLIFCRSSMEECHTIKKVFKAYELASG

Query:  QAINLDKSRFMVSKNVLQGEAQAIGDILGIKQAMNLGVYLGMPSQAIRSKTQMFSKLKDKIRNILQGWSEKLFSAGGKEVLIKAVIQAIPTYSMSCFKLP
        Q+INL+KS    S N  +G+   I +ILG+K+      YLG+P+   R+K   FS+LKD++   LQGW   L S  GKE+LIKAV QAIPTY+MS F++P
Subjt:  QAINLDKSRFMVSKNVLQGEAQAIGDILGIKQAMNLGVYLGMPSQAIRSKTQMFSKLKDKIRNILQGWSEKLFSAGGKEVLIKAVIQAIPTYSMSCFKLP

Query:  KSLCKEINNMCARFWWGATGDKKKLHWAKWNDLCKSKTLG
          LC E+  +CARFWWG  G+++K+HW  W+ L   K  G
Subjt:  KSLCKEINNMCARFWWGATGDKKKLHWAKWNDLCKSKTLG

XP_042990668.1 uncharacterized protein LOC122317666 [Carya illinoinensis]1.2e-21034.06Show/hide
Query:  IVQELRKMKITEEEKSGLVEVVDEDIESANKDIENTVVCKILTSKNIMVDVFQRSMPKIWGLQGRVQLKKSGKNLYVCKFKNKRDKIRIVDGGPWIYDNA
        I    + +K+TEEE+  +V + +E + S+N   E  ++ KI   +    + F+ +M KIW  +G +  K    N+Y+ +F+   DK +++ G PW +D  
Subjt:  IVQELRKMKITEEEKSGLVEVVDEDIESANKDIENTVVCKILTSKNIMVDVFQRSMPKIWGLQGRVQLKKSGKNLYVCKFKNKRDKIRIVDGGPWIYDNA

Query:  ILVFDDPKGNKDIKSLEFRYAQFWFHFHNLPVVCLNRKYATALGNSVGTFIKLEEGTEERVWGETLRVKVEVDVNKPIRRGTNIKVGSMAEEKWIPITIE
        ++   + +G   +  + F    FW   HNLP   ++++    +G+ +G  +++E  TE   WG  LR+K EV+V K + RG  +K GS  ++ W+    E
Subjt:  ILVFDDPKGNKDIKSLEFRYAQFWFHFHNLPVVCLNRKYATALGNSVGTFIKLEEGTEERVWGETLRVKVEVDVNKPIRRGTNIKVGSMAEEKWIPITIE

Query:  KLPDFCYQCGRLGHVMRDCEE--DENNELEAEECQYGPWMRESFIFRNESKKKKEKQTDEQERNRGDNTTVEDSKNHSSDGELPESHRRRKSDEKKATES
        +LP FC++CGR  H    C+E   +N+  +    QYG W+R + +       ++     E+E + G   +    +     G    S  +  S+     ES
Subjt:  KLPDFCYQCGRLGHVMRDCEE--DENNELEAEECQYGPWMRESFIFRNESKKKKEKQTDEQERNRGDNTTVEDSKNHSSDGELPESHRRRKSDEKKATES

Query:  TTEAEIIKKLTLTENSEQRAVETEHSRKGPLEEEIKTRQPEETGRASGEQLDENIVEEGPTLNSMVQLEKK-----------EEAKGPQTEGGRAHESQG
        T   E+   L  TE +  +  +       P  ++     P     AS +   E + ++  TL+  V                 EA GP +      +  G
Subjt:  TTEAEIIKKLTLTENSEQRAVETEHSRKGPLEEEIKTRQPEETGRASGEQLDENIVEEGPTLNSMVQLEKK-----------EEAKGPQTEGGRAHESQG

Query:  IVKKNQDAQKGKDRISPNITQTKKWKRLARIDPMQ-QDEDNRIMETQGTKRKTEGEESEEGTPPDAMKTLCWNVRGLGNPRTLRALRHVVESEKPQLVFI
         V   +   +  +  S +  + K WKR AR  P    D  N I +                T  D     C   +      +++ L  +V++++P LVF+
Subjt:  IVKKNQDAQKGKDRISPNITQTKKWKRLARIDPMQ-QDEDNRIMETQGTKRKTEGEESEEGTPPDAMKTLCWNVRGLGNPRTLRALRHVVESEKPQLVFI

Query:  IETKCDGYKGEQIKKSLSFDFCWCVPSKGSSGGLLLLWNSGVKLSIKSWSEGHIDSMFT--FNGWTGRFTGFYGNPEVVKRRDSWSLLERLVENDDVPWL
         ETKC+  + ++IK +L ++ C+ V SKG SG L LLW   VK+ + +++  HI ++ T   +    + TGFYG+P   KR +SW LL  L    ++PWL
Subjt:  IETKCDGYKGEQIKKSLSFDFCWCVPSKGSSGGLLLLWNSGVKLSIKSWSEGHIDSMFT--FNGWTGRFTGFYGNPEVVKRRDSWSLLERLVENDDVPWL

Query:  VGGDFNEILSADEKKGGAVRNQSQMNGFQEVINICRLMDAGFRGDMFTWRRGRQKKKQIKERLDRFFVNHALSQKSQSIRVQHLSFFSSDHRP-IIATID
          GDFNEI    EK G A R   QM  F+  ++ C+L D GF GD FTW   R+  +  KERLDR   N+   +   +  V HL    SDH+  ++ T D
Subjt:  VGGDFNEILSADEKKGGAVRNQSQMNGFQEVINICRLMDAGFRGDMFTWRRGRQKKKQIKERLDRFFVNHALSQKSQSIRVQHLSFFSSDHRP-IIATID

Query:  TSPVLLANRV-------------------KGRLLRG---AHQINNRLRVCIKNLHRWSKDRLKGSISSAIRKKEEEIQKLEAENHSLDGNEIEAAELELN
         + +    RV                     RL  G    H+    L  C   L  WS+++ +     A++ K E ++ L+  N      EI+     +N
Subjt:  TSPVLLANRV-------------------KGRLLRG---AHQINNRLRVCIKNLHRWSKDRLKGSISSAIRKKEEEIQKLEAENHSLDGNEIEAAELELN

Query:  ALLEEEEYYWRSRSREVWLKSGDKNTKWFHAKASQRRKRNRIEGIFDSQGVWREDEEGIARVATGYFRDIFQSSNPGRGQISEVSDCVN---MKLSDQQR
         +++ E    + R+++ WLK+GD+NTK+FH  +SQR++ N I  I    G+  +D + I +    +F D+F SS+P     S + +C++     ++D   
Subjt:  ALLEEEEYYWRSRSREVWLKSGDKNTKWFHAKASQRRKRNRIEGIFDSQGVWREDEEGIARVATGYFRDIFQSSNPGRGQISEVSDCVN---MKLSDQQR

Query:  GVLDQPYSKQEVEVAIKNLHPGKAPGIDGTHASFFQAYWDIVGEDTVRECLQILNHDADITPFNKTLITLIPKVRDPKRMQEYRPISLCNVTYKIVAKTI
          L   +++ EV+ A+ +++P  +PG DG  A FFQ YWD VG +  +  L++LN        N+TLI LIPK  +P  + ++RPISLCNV YKI+AKT+
Subjt:  GVLDQPYSKQEVEVAIKNLHPGKAPGIDGTHASFFQAYWDIVGEDTVRECLQILNHDADITPFNKTLITLIPKVRDPKRMQEYRPISLCNVTYKIVAKTI

Query:  ANRFKKILDAVISPTQAAFVPGRLISDNVLVGFECIHAINNRKRGKEGHVAIKLDMSKAYDRVEWEYIRNVFTKMGFSEKWIQNIMKCVETVTFSVLMNG
        ANR KKIL A+ISPTQAAFVPGRLI+DN++V FE +H +  R +G EG++A+KLDMSKAYDR+EW ++R+V  KMGF  KWI+ +MKCV  V++S+++NG
Subjt:  ANRFKKILDAVISPTQAAFVPGRLISDNVLVGFECIHAINNRKRGKEGHVAIKLDMSKAYDRVEWEYIRNVFTKMGFSEKWIQNIMKCVETVTFSVLMNG

Query:  VPQEEFKPGRGIRQGDPLSPYIFLMCAEGFSGLLNREVESQNLKGFRINNFCPPLSHLFFADDSLIFCRSSMEECHTIKKVFKAYELASGQAINLDKSRF
         PQ  F P RG+RQGDP+SPY+F++C+E  S LLN   ++  + G  I      ++HLFFADDSL+FC+++  E   +  +  +YE ASGQ +N DK+  
Subjt:  VPQEEFKPGRGIRQGDPLSPYIFLMCAEGFSGLLNREVESQNLKGFRINNFCPPLSHLFFADDSLIFCRSSMEECHTIKKVFKAYELASGQAINLDKSRF

Query:  MVSKNVLQGEAQAIGDILGIKQAMNLGVYLGMPSQAIRSKTQMFSKLKDKIRNILQGWSEKLFSAGGKEVLIKAVIQAIPTYSMSCFKLPKSLCKEINNM
          S N  +   + +  I GI+   +   YLG+P+   R + Q F  + D+++  L  W  KL S  GKE LIKAVIQAIPTYSM  FKLP+ L  E+N  
Subjt:  MVSKNVLQGEAQAIGDILGIKQAMNLGVYLGMPSQAIRSKTQMFSKLKDKIRNILQGWSEKLFSAGGKEVLIKAVIQAIPTYSMSCFKLPKSLCKEINNM

Query:  CARFWWGATGDKKKLHWAKWNDLCKSKTLG
           FWWG    + K+HW  W  + KSK  G
Subjt:  CARFWWGATGDKKKLHWAKWNDLCKSKTLG

TrEMBL top hitse value%identityAlignment
A0A2N9EWI8 Uncharacterized protein2.4e-22835.96Show/hide
Query:  ANKDIENTVVCKILTSKNIMVDVFQRSMPKIWGLQGRVQLKKSGKNLYVCKFKNKRDKIRIVDGGPWIYDNAILVFDDPKGNKDIKSLEFRYAQFWFHFH
        + ++ + +V+ K+LT K    D  + ++ ++W   G + +++   NL++  F+N+ D  RI    PW +D  ++      G+    +++F +  FW    
Subjt:  ANKDIENTVVCKILTSKNIMVDVFQRSMPKIWGLQGRVQLKKSGKNLYVCKFKNKRDKIRIVDGGPWIYDNAILVFDDPKGNKDIKSLEFRYAQFWFHFH

Query:  NLPVVCLNRKYATALGNSVGTFIKLEEGTEERVWGETLRVKVEVDVNKPIRRGTNI-----KVGSMAEEKWIPITIEKLPDFCYQCGRLGHVMRDCEEDE
        NLP+  + R+    +G ++G  ++++   +   WG  LR++VEV++ KP+ RG  +     + G++ +  W+    E LP FCY+CGRLGH   DC E  
Subjt:  NLPVVCLNRKYATALGNSVGTFIKLEEGTEERVWGETLRVKVEVDVNKPIRRGTNI-----KVGSMAEEKWIPITIEKLPDFCYQCGRLGHVMRDCEEDE

Query:  --NNELEAEECQYGPWMRESFIFRNESKKKKEKQTDEQERNRGDNTTVEDSKNHSSDGELPESHRRRKSDEKKATESTTEAEIIKKLTLTENSEQRAVET
            +LE  E +YG W+R        S+  ++ +        GD    + S++H   GE+                +   A  ++K    E S     E 
Subjt:  --NNELEAEECQYGPWMRESFIFRNESKKKKEKQTDEQERNRGDNTTVEDSKNHSSDGELPESHRRRKSDEKKATESTTEAEIIKKLTLTENSEQRAVET

Query:  EHSRKGPLEEEIKTRQPEETGRASGE----QLDENIVEEGPTLNSMVQLEKKEEAKGPQTEGGRAHESQGIVKKNQDAQKGKDRISPNITQTKKWKRLAR
        E S    +E  +         RA  E      D N+  E    N +     ++E       G  A   + I     ++  G      ++T+ K    +  
Subjt:  EHSRKGPLEEEIKTRQPEETGRASGE----QLDENIVEEGPTLNSMVQLEKKEEAKGPQTEGGRAHESQGIVKKNQDAQKGKDRISPNITQTKKWKRLAR

Query:  IDPMQQDEDNRIMET--QGTKRKTEG-----EESEEGTPPDAMKTLCWNVRGLGNPRTLRALRHVVESEKPQLVFIIETKCDGYKGEQIKKSLSFDFCWC
        ++ +  +       +  Q  +   EG        +   PP  M+ +  N RGLGNP T+  L H V+ E PQ+VF++ET+   +K E I+  L    C+ 
Subjt:  IDPMQQDEDNRIMET--QGTKRKTEG-----EESEEGTPPDAMKTLCWNVRGLGNPRTLRALRHVVESEKPQLVFIIETKCDGYKGEQIKKSLSFDFCWC

Query:  VPSKGSSGGLLLLWNSGVKLSIKSWSEGHIDS-MFTFNGWTGRFTGFYGNPEVVKRRDSWSLLERLVENDDVPWLVGGDFNEILSADEKKGGAVRNQSQM
        V   G  GGL LLW+  V + ++S+S+ HIDS + + +G   RFTGFYG+P    RR SW LL RL    D PWLV GDFNEI+S+DEK+G   R  +QM
Subjt:  VPSKGSSGGLLLLWNSGVKLSIKSWSEGHIDS-MFTFNGWTGRFTGFYGNPEVVKRRDSWSLLERLVENDDVPWLVGGDFNEILSADEKKGGAVRNQSQM

Query:  NGFQEVINICRLMDAGFRGDMFTWRRGRQKKKQIKERLDRFFVNHALSQKSQSIRVQHLSFFSSDHRPIIATIDTSPVLLANRVKGRLLR----------
          F+E +N C L+D GF+G  FTW   R   + + ERLDR              R+QH+ F  SDH  ++  +++  V+   +   R  R          
Subjt:  NGFQEVINICRLMDAGFRGDMFTWRRGRQKKKQIKERLDRFFVNHALSQKSQSIRVQHLSFFSSDHRPIIATIDTSPVLLANRVKGRLLR----------

Query:  -----------GAHQ-------INNRLRVCIKNLHRWSKDRLKGSISSAIRKKEEEIQKLEAENHSLDGNEIEAAELELNALLEEEEYYWRSRSREVWLK
                     HQ       ++ +++ C   L  WSK  L+      + K++   +    +   ++ +E  A   EL +LL++EE YWR RSR  WL+
Subjt:  -----------GAHQ-------INNRLRVCIKNLHRWSKDRLKGSISSAIRKKEEEIQKLEAENHSLDGNEIEAAELELNALLEEEEYYWRSRSREVWLK

Query:  SGDKNTKWFHAKASQRRKRNRIEGIFDSQGVWREDEEGIARVATGYFRDIFQSSNPGRGQISEVSDCVNMKLSDQQRGVLDQPYSKQEVEVAIKNLHPGK
         GD+NT +FH+ ASQR+K N I GI D+Q VW+ +E  IA V   YF +I+ +++P    I  V   V+  +S      L +P++++EV +A+  + P K
Subjt:  SGDKNTKWFHAKASQRRKRNRIEGIFDSQGVWREDEEGIARVATGYFRDIFQSSNPGRGQISEVSDCVNMKLSDQQRGVLDQPYSKQEVEVAIKNLHPGK

Query:  APGIDGTHASFFQAYWDIVGEDTVRECLQILNHDADITPFNKTLITLIPKVRDPKRMQEYRPISLCNVTYKIVAKTIANRFKKILDAVISPTQAAFVPGR
        APG DG  A FFQ +W IVG D     L  LN    +   N T I LIPKV+ P+ M  +RPISLCNV YKI++K + NR K IL  V+S +Q+AFVPGR
Subjt:  APGIDGTHASFFQAYWDIVGEDTVRECLQILNHDADITPFNKTLITLIPKVRDPKRMQEYRPISLCNVTYKIVAKTIANRFKKILDAVISPTQAAFVPGR

Query:  LISDNVLVGFECIHAINNRKRGKEGHVAIKLDMSKAYDRVEWEYIRNVFTKMGFSEKWIQNIMKCVETVTFSVLMNGVPQEEFKPGRGIRQGDPLSPYIF
        +ISDN+++ FE IH + N++ GK   +A KLDMSKAY+RVEW+Y++ +  K+GF EKW+  IM+CV +V++S+L+NG P+   +P RG+RQGDPLSPY+F
Subjt:  LISDNVLVGFECIHAINNRKRGKEGHVAIKLDMSKAYDRVEWEYIRNVFTKMGFSEKWIQNIMKCVETVTFSVLMNGVPQEEFKPGRGIRQGDPLSPYIF

Query:  LMCAEGFSGLLNREVESQNLKGFRINNFCPPLSHLFFADDSLIFCRSSMEECHTIKKVFKAYELASGQAINLDKSRFMVSKNVLQGEAQAIGDILGIKQA
        L+CAEG S LL +    + ++G  ++   P +SHLFFADDSLIFCR++  +C  ++++   YE ASGQ IN DK+    S+N        I  + G    
Subjt:  LMCAEGFSGLLNREVESQNLKGFRINNFCPPLSHLFFADDSLIFCRSSMEECHTIKKVFKAYELASGQAINLDKSRFMVSKNVLQGEAQAIGDILGIKQA

Query:  MNLGVYLGMPSQAIRSKTQMFSKLKDKIRNILQGWSEKLFSAGGKEVLIKAVIQAIPTYSMSCFKLPKSLCKEINNMCARFWWGATGDKKKLHWAKWNDL
             YLG+P    RSK Q FS++KD+I   LQGW EK  S  GKE+LIKAVIQAIPTY+MSCFKLP  LC EI+NM  RFWWG  G+++K+HW     L
Subjt:  MNLGVYLGMPSQAIRSKTQMFSKLKDKIRNILQGWSEKLFSAGGKEVLIKAVIQAIPTYSMSCFKLPKSLCKEINNMCARFWWGATGDKKKLHWAKWNDL

Query:  CKSKTLG
        C++K  G
Subjt:  CKSKTLG

A0A2N9HDH5 Uncharacterized protein5.0e-23436.93Show/hide
Query:  NKDIENTVVCKILTSKNIMVDVFQRSMPKIWGLQGRVQLKKSGKNLYVCKFKNKRDKIRIVDGGPWIYDNAILVFDDPKGNKDIKSLEFRYAQFWFHFHN
        ++D +  +  + +T + + ++   R+   +W        +  G NL V  F++  D  R++ G PW YD  ++ F     + D+K +E  +  FW   HN
Subjt:  NKDIENTVVCKILTSKNIMVDVFQRSMPKIWGLQGRVQLKKSGKNLYVCKFKNKRDKIRIVDGGPWIYDNAILVFDDPKGNKDIKSLEFRYAQFWFHFHN

Query:  LPVVCLNRKYATALGNSVGTFIKLEEGTEERVWGETLRVKVEVDVNKPIRRGTNIKVGSMAEEKWIPITIEKLPDFCYQCGRLGHVMRDCE---EDENNE
        LP+  + R+ A AL  +VG    + E  EE+     +RV+V++D+ KP+ RG   ++ S  +E WI    E+LP FCY CG L H  +DC+    ++   
Subjt:  LPVVCLNRKYATALGNSVGTFIKLEEGTEERVWGETLRVKVEVDVNKPIRRGTNIKVGSMAEEKWIPITIEKLPDFCYQCGRLGHVMRDCE---EDENNE

Query:  LEAEECQYGPWMR---ESFIFRNESKKKKEKQTDEQERNRGDNTTVEDSKNHSSD-------------GELPESHR---RRKSDEKKATESTTEAEIIKK
        + AE+ QYG W+R   E  + R E K           + +        + +H SD               LP S +     +SD ++  E+   A    +
Subjt:  LEAEECQYGPWMR---ESFIFRNESKKKKEKQTDEQERNRGDNTTVEDSKNHSSD-------------GELPESHR---RRKSDEKKATESTTEAEIIKK

Query:  LTLTENSEQRAVETEHSRKGPLEEEIKTRQPEETGRASGEQLDENIVEEGPTLNSMVQLEKKEEAKGPQTEGGRAHESQGIVKKNQDAQKGKDRISPNIT
        +   E SE            P+  E++    +E      + +D+ I +    L + V L+  +       +G ++H        N+    GK+ +  N  
Subjt:  LTLTENSEQRAVETEHSRKGPLEEEIKTRQPEETGRASGEQLDENIVEEGPTLNSMVQLEKKEEAKGPQTEGGRAHESQGIVKKNQDAQKGKDRISPNIT

Query:  QTKK------------WKRLARIDPMQ----QDEDNRIMETQGTKRKTEGEESEEGT-PPDAMKTLCWNVRGLGNPRTLRALRHVVESEKPQLVFIIETK
        Q               WKRL R    +      E  +   + G  R   G      T PP  M  + WN RGLGN R + AL ++V+S+ P+++F++ETK
Subjt:  QTKK------------WKRLARIDPMQ----QDEDNRIMETQGTKRKTEGEESEEGT-PPDAMKTLCWNVRGLGNPRTLRALRHVVESEKPQLVFIIETK

Query:  CDGYKGEQIKKSLSFDFCWCVPSKGSSGGLLLLWNSGVKLSIKSWSEGHIDSMFTFNG---WTGRFTGFYGNPEVVKRRDSWSLLERLVENDDVPWLVGG
         D  K E I+  L F FC+ VPS G SGGL LLWN  V+L+I+++S  HID+     G   W  RFTGFYGNP   +RR SW+LL++L     +PWL+ G
Subjt:  CDGYKGEQIKKSLSFDFCWCVPSKGSSGGLLLLWNSGVKLSIKSWSEGHIDSMFTFNG---WTGRFTGFYGNPEVVKRRDSWSLLERLVENDDVPWLVGG

Query:  DFNEILSADEKKGGAVRNQSQMNGFQEVINICRLMDAGFRGDMFTWRRGRQKKKQIKERLDRFFVNHALSQKSQSIRVQHLSFFSSDHRPIIATIDTSPV
        DFNEILS+DE+ G A  +Q  M  F EVIN C L+D G+RG  FTW  GR+ +  +++RLDR   +++         + HL    SDH PI+   D    
Subjt:  DFNEILSADEKKGGAVRNQSQMNGFQEVINICRLMDAGFRGDMFTWRRGRQKKKQIKERLDRFFVNHALSQKSQSIRVQHLSFFSSDHRPIIATIDTSPV

Query:  LLANRVKGR---------------LLRGAHQIN----------NRLRVCIKNLHRWSKDRLKGSISSAIRKKEEEIQKLEAENHSLDGN-EIEAAELELN
            + + R               + +   Q+N           +++ C + L RW K  +       IR K   +  L A N   + N  I A + E+N
Subjt:  LLANRVKGR---------------LLRGAHQIN----------NRLRVCIKNLHRWSKDRLKGSISSAIRKKEEEIQKLEAENHSLDGN-EIEAAELELN

Query:  ALLEEEEYYWRSRSREVWLKSGDKNTKWFHAKASQRRKRNRIEGIFDSQGVWREDEEGIARVATGYFRDIFQSSNPGRGQISEVSDCVNMKLSDQQRGVL
         LL  EE +WR RSR VWL++GD+NTK+FH  A+QR+++N I G+ +   VW   E  I  +A  YF++IF +S P R  I E    V   +S++    L
Subjt:  ALLEEEEYYWRSRSREVWLKSGDKNTKWFHAKASQRRKRNRIEGIFDSQGVWREDEEGIARVATGYFRDIFQSSNPGRGQISEVSDCVNMKLSDQQRGVL

Query:  DQPYSKQEVEVAIKNLHPGKAPGIDGTHASFFQAYWDIVGEDTVRECLQILNHDADITPFNKTLITLIPKVRDPKRMQEYRPISLCNVTYKIVAKTIANR
         QPY+ +EV  A+  +HP KAPG DG  + FFQ YW IVG       L ILN    +   N T ++LIPK ++P+++ +YRPISLCNV YKI++K +ANR
Subjt:  DQPYSKQEVEVAIKNLHPGKAPGIDGTHASFFQAYWDIVGEDTVRECLQILNHDADITPFNKTLITLIPKVRDPKRMQEYRPISLCNVTYKIVAKTIANR

Query:  FKKILDAVISPTQAAFVPGRLISDNVLVGFECIHAINNRKRGKEGHVAIKLDMSKAYDRVEWEYIRNVFTKMGFSEKWIQNIMKCVETVTFSVLMNGVPQ
         K +L  +IS +Q+AFVPGRLI+DNV V FE +H +  ++RG++G +A+KLDMSKAYDRVEW ++  +  K+GF+EKWI  IM+C+ TV +S++++GVP+
Subjt:  FKKILDAVISPTQAAFVPGRLISDNVLVGFECIHAINNRKRGKEGHVAIKLDMSKAYDRVEWEYIRNVFTKMGFSEKWIQNIMKCVETVTFSVLMNGVPQ

Query:  EEFKPGRGIRQGDPLSPYIFLMCAEGFSGLLNREVESQNLKGFRINNFCPPLSHLFFADDSLIFCRSSMEECHTIKKVFKAYELASGQAINLDKSRFMVS
            P RGIRQGDP+SPY+FL+CAEG S LL +      LKG + +   P +SHLFFADDSL+F ++S+ E     ++   YE +SGQ +N +K+    S
Subjt:  EEFKPGRGIRQGDPLSPYIFLMCAEGFSGLLNREVESQNLKGFRINNFCPPLSHLFFADDSLIFCRSSMEECHTIKKVFKAYELASGQAINLDKSRFMVS

Query:  KNVLQGEAQAIGDILGIKQAMNLGVYLGMPSQAIRSKTQMFSKLKDKIRNILQGWSEKLFSAGGKEVLIKAVIQAIPTYSMSCFKLPKSLCKEINNMCAR
         N      QAI D  G     N   YLG+P+   RSK  +F+ LK++I   LQGW EK  S  G+E+LIKAV QAIPTY+M+CF+LPK+ C E+N++ A+
Subjt:  KNVLQGEAQAIGDILGIKQAMNLGVYLGMPSQAIRSKTQMFSKLKDKIRNILQGWSEKLFSAGGKEVLIKAVIQAIPTYSMSCFKLPKSLCKEINNMCAR

Query:  FWWGATGDKKKLHWAKWNDLCKSKTLG
        +WWG T DK+K+HW KW+ LC +K  G
Subjt:  FWWGATGDKKKLHWAKWNDLCKSKTLG

A0A2N9J809 Uncharacterized protein1.0e-22635.54Show/hide
Query:  EKSGLVEVVDE--DIESANKDIENTVVCKILTSKNIMVDVFQRSMPKIWGLQGRVQLKKSGKNLYVCKFKNKRDKIRIVDGGPWIYDNAILVFDDPKGNK
        EK  L EV     D+ S +   ++ +  K LT + + V+   R+   +W       ++    N  V  F++  D+ R++ G PW YD  ++V    + ++
Subjt:  EKSGLVEVVDE--DIESANKDIENTVVCKILTSKNIMVDVFQRSMPKIWGLQGRVQLKKSGKNLYVCKFKNKRDKIRIVDGGPWIYDNAILVFDDPKGNK

Query:  DIKSLEFRYAQFWFHFHNLPVVCLNRKYATALGNSVGTFIKLEEGTEERVWGETLRVKVEVDVNKPIRRGTNIKVGSMAEEKWIPITIEKLPDFCYQCGR
         I  + FR   FW   H LPV  +N + A  LG+S+GT + + EG      G  +R++V +D+ KP+ RG  ++      E WI    E+LP+FCY CG 
Subjt:  DIKSLEFRYAQFWFHFHNLPVVCLNRKYATALGNSVGTFIKLEEGTEERVWGETLRVKVEVDVNKPIRRGTNIKVGSMAEEKWIPITIEKLPDFCYQCGR

Query:  LGHVMRDCEEDENNE--LEAEECQYGPWMRESF-------------IFRNESKKKKEKQTDEQ----------ERNRGDNTTVEDSKNHSSDGELPESHR
        + H  +DC     N+  L+ E+ Q+GPW+R S              I R    K K  QT  Q          +    + TT   ++    D   P+++ 
Subjt:  LGHVMRDCEEDENNE--LEAEECQYGPWMRESF-------------IFRNESKKKKEKQTDEQ----------ERNRGDNTTVEDSKNHSSDGELPESHR

Query:  RRKSDEKKATESTTEA---------------EIIKKLTLTENSEQRAVETEHSRKGPLEEEIKTRQPE-ETGRASGEQLDENIVEEGPTLNSMVQLEKKE
           S +   T S T                 +    + + EN       +E   +G  E+  ++R  E +T      +  ENI E   T      L    
Subjt:  RRKSDEKKATESTTEA---------------EIIKKLTLTENSEQRAVETEHSRKGPLEEEIKTRQPE-ETGRASGEQLDENIVEEGPTLNSMVQLEKKE

Query:  EAKGPQTEGGRAHESQGIVKKNQDAQKGKDRISPNITQTKK-WKRLARIDPMQQDEDNRIME---TQGTKRKTEGEESEEGTPPDAMKTLCWNVRGLGNP
            PQ+E    H    ++    +       + P     KK WK+LAR    + + D+ I+E   T+ T    E  ES    PP  M  L WN RGLGNP
Subjt:  EAKGPQTEGGRAHESQGIVKKNQDAQKGKDRISPNITQTKK-WKRLARIDPMQQDEDNRIME---TQGTKRKTEGEESEEGTPPDAMKTLCWNVRGLGNP

Query:  RTLRALRHVVESEKPQLVFIIETKCDGYKGEQIKKSLSFDFCWCVPSKGSSGGLLLLWNSGVKLSIKSWSEGHIDSMFTFNGW-TGRFTGFYGNPEVVKR
        RT++ L  +V ++ P +VF+IET  D    E+++  L ++  +   S+   GGL L W   V L + S+S  HIDS+   +   T R TGFYG PE   R
Subjt:  RTLRALRHVVESEKPQLVFIIETKCDGYKGEQIKKSLSFDFCWCVPSKGSSGGLLLLWNSGVKLSIKSWSEGHIDSMFTFNGW-TGRFTGFYGNPEVVKR

Query:  RDSWSLLERLVENDDVPWLVGGDFNEILSADEKKGGAVRNQSQMNGFQEVINICRLMDAGFRGDMFTWRRGRQKKKQIKERLDRFFVNHALSQKSQSIRV
         DSW+LL RL     +PW   GDFNE++  +EK+G   R+  QM  F++V++ C  +D GF G  FTW   R       ERLDR            S RV
Subjt:  RDSWSLLERLVENDDVPWLVGGDFNEILSADEKKGGAVRNQSQMNGFQEVINICRLMDAGFRGDMFTWRRGRQKKKQIKERLDRFFVNHALSQKSQSIRV

Query:  QHLSFFSSDHRPI---------------------IATIDTSPVLLANRVKGRLLRGAHQINNRLRVCIKNLHRWSKDRLKGSISSAIRKKEEEIQKLEAE
         HL    SDH+PI                      + +  +  + A+  K +     + +  +++ C + L +WSK    GSI   I + E++++  EA+
Subjt:  QHLSFFSSDHRPI---------------------IATIDTSPVLLANRVKGRLLRGAHQINNRLRVCIKNLHRWSKDRLKGSISSAIRKKEEEIQKLEAE

Query:  N-HSLDGNEIEAAELELNALLEEEEYYWRSRSREVWLKSGDKNTKWFHAKASQRRKRNRIEGIFDSQGVWREDEEGIARVATGYFRDIFQSSNPGRGQIS
        +    D  ++   + +L+ LL ++E  WR RSR  WL+ GD+NT++FH+KA+ RR+RN +  + D+ G+W   ++ +  +   Y+  +F ++NP    I 
Subjt:  N-HSLDGNEIEAAELELNALLEEEEYYWRSRSREVWLKSGDKNTKWFHAKASQRRKRNRIEGIFDSQGVWREDEEGIARVATGYFRDIFQSSNPGRGQIS

Query:  EVSDCVNMKLSDQQRGVLDQPYSKQEVEVAIKNLHPGKAPGIDGTHASFFQAYWDIVGEDTVRECLQILNHDADITPFNKTLITLIPKVRDPKRMQEYRP
        +V+DC+   ++D+    L + ++  EV  A+K + P KAPG DG    F+Q YW ++G D  +  L  LN    +   N T ITLIPKV++P+ ++E+RP
Subjt:  EVSDCVNMKLSDQQRGVLDQPYSKQEVEVAIKNLHPGKAPGIDGTHASFFQAYWDIVGEDTVRECLQILNHDADITPFNKTLITLIPKVRDPKRMQEYRP

Query:  ISLCNVTYKIVAKTIANRFKKILDAVISPTQAAFVPGRLISDNVLVGFECIHAINNRKRGKEGHVAIKLDMSKAYDRVEWEYIRNVFTKMGFSEKWIQNI
        ISLCNV YK++AK + NR K IL  V+S +Q+AFVPGRLI+DN+LV FE +H ++++K GK G +A+KLDMSKAYDRVEW+Y++ V  +MGF  KW+  +
Subjt:  ISLCNVTYKIVAKTIANRFKKILDAVISPTQAAFVPGRLISDNVLVGFECIHAINNRKRGKEGHVAIKLDMSKAYDRVEWEYIRNVFTKMGFSEKWIQNI

Query:  MKCVETVTFSVLMNGVPQEEFKPGRGIRQGDPLSPYIFLMCAEGFSGLLNREVESQNLKGFRINNFCPPLSHLFFADDSLIFCRSSMEECHTIKKVFKAY
        M+C+ TV++S+L+NG P    KP RG+RQGDPLSPY+FL CAEGF  L+ R      LKG  I+   P L+HLFFADDSL+FC+++  +   ++ + + Y
Subjt:  MKCVETVTFSVLMNGVPQEEFKPGRGIRQGDPLSPYIFLMCAEGFSGLLNREVESQNLKGFRINNFCPPLSHLFFADDSLIFCRSSMEECHTIKKVFKAY

Query:  ELASGQAINLDKSRFMVSKNVLQGEAQAIGDILGIKQAMNLGVYLGMPSQAIRSKTQMFSKLKDKIRNILQGWSEKLFSAGGKEVLIKAVIQAIPTYSMS
        E ASGQ IN  K+    SK+  Q +   I  +LG+        YLG+PS   R+K   F+++K+++ + L+GW E+L S  G+E+LIKAV QAIPTY+MS
Subjt:  ELASGQAINLDKSRFMVSKNVLQGEAQAIGDILGIKQAMNLGVYLGMPSQAIRSKTQMFSKLKDKIRNILQGWSEKLFSAGGKEVLIKAVIQAIPTYSMS

Query:  CFKLPKSLCKEINNMCARFWWGATGDKKKLHWAKWNDLCKSKTLG
        CF+LP  L KEI  +  RFWWG +GD+ K+HW  W+ LCKSK LG
Subjt:  CFKLPKSLCKEINNMCARFWWGATGDKKKLHWAKWNDLCKSKTLG

A0A7N2LIH6 Uncharacterized protein9.0e-23636.16Show/hide
Query:  RKMKITEEEKSGLVEVVDEDIESANKDIENTVVCKILTSKNIMVDVFQRSMPKIWGLQGRVQLKKSGKNLYVCKFKNKRDKIRIVDGGPWIYDNAILVFD
        +K+ +TE E    +++      +A +  +N VV KILT + ++++  +++M  +W     +Q+ + G++L++ +F + RDK ++++  PW Y+  +++  
Subjt:  RKMKITEEEKSGLVEVVDEDIESANKDIENTVVCKILTSKNIMVDVFQRSMPKIWGLQGRVQLKKSGKNLYVCKFKNKRDKIRIVDGGPWIYDNAILVFD

Query:  DPKGNKDIKSLEFRYAQFWFHFHNLPVVCLNRKYATALGNSVGTFIKLEEGTEERVWGETLRVKVEVDVNKPIRRGTNIKVGSMAEEKWIPITIEKLPDF
        + +G    K ++ ++  FW    NLP+ C+ R+    +G  +G  ++++   +   WG+ LRV++  D    + RG  + +    E +W+    E+LP+F
Subjt:  DPKGNKDIKSLEFRYAQFWFHFHNLPVVCLNRKYATALGNSVGTFIKLEEGTEERVWGETLRVKVEVDVNKPIRRGTNIKVGSMAEEKWIPITIEKLPDF

Query:  CYQCGRLGHVMRDCEE--DENNELEAEECQYGPWMRES-------FIFRNESKKKKEKQTDEQERNRGDNTTVED--------SKNHSSDGELPESHRRR
        CYQCGRL H  +DC E  D  N  + E  QYG W+R            R   +   E++ D QE      T V+          + H S+  + +    R
Subjt:  CYQCGRLGHVMRDCEE--DENNELEAEECQYGPWMRES-------FIFRNESKKKKEKQTDEQERNRGDNTTVED--------SKNHSSDGELPESHRRR

Query:  KSDEKKATESTTEAEIIKKLTLTENSEQRAVETEHSRKGPLEEEIKTRQPEE-TGRASGEQLDENIVEEGPTLNSMVQLEKKEEAKGPQTEGGR------
        KS  +            K   + EN +        S K   E++I     +  TG  + + +++ +  E   L+ +  ++ +E +K  Q   G       
Subjt:  KSDEKKATESTTEAEIIKKLTLTENSEQRAVETEHSRKGPLEEEIKTRQPEE-TGRASGEQLDENIVEEGPTLNSMVQLEKKEEAKGPQTEGGR------

Query:  ----AHESQGIVKKNQDAQKG--KDRISPNITQTKKWKRLARIDPMQQDEDNRIMETQGTKRKTEGEESEEG---TPPDAMKTLCWNVRGLGNPRTLRAL
            A      +    D +KG   + + P   ++  WKRLAR     +D         G++++ +G     G    PP +M  L WN RGLG    +R L
Subjt:  ----AHESQGIVKKNQDAQKG--KDRISPNITQTKKWKRLARIDPMQQDEDNRIMETQGTKRKTEGEESEEG---TPPDAMKTLCWNVRGLGNPRTLRAL

Query:  RHVVESEKPQLVFIIETKCDGYKGEQIKKSLSFDFCWCVPSKGSSGGLLLLWNSGVKLSIKSWSEGHIDSMFTFNGWTG--RFTGFYGNPEVVKRRDSWS
           V+ + P LVF++ETK    K +  +  L F     VPS G SGGL LLW  G  +  KS S  HID +    G  G  R TGFYG+P+  KR  SW 
Subjt:  RHVVESEKPQLVFIIETKCDGYKGEQIKKSLSFDFCWCVPSKGSSGGLLLLWNSGVKLSIKSWSEGHIDSMFTFNGWTG--RFTGFYGNPEVVKRRDSWS

Query:  LLERLVENDDVPWLVGGDFNEILSADEKKGGAVRNQSQMNGFQEVINICRLMDAGFRGDMFTWRRGRQKKKQIKERLDRFFVNHALSQKSQSIRVQHLSF
        LLE L    ++PWLV GDFNEI+  DEK G   R+ +QM+ F+EV++ C L+D GF G  FTW  GR   ++   RLDR   N A S      +V H+S 
Subjt:  LLERLVENDDVPWLVGGDFNEILSADEKKGGAVRNQSQMNGFQEVINICRLMDAGFRGDMFTWRRGRQKKKQIKERLDRFFVNHALSQKSQSIRVQHLSF

Query:  FSSDHRPIIATIDTSPVLLANRVKGRLL------------------------RGAHQINNRLRVCIKNLHRWSKDRLKGSISSAIRKKEEEIQKLEAEN-
         +SDH   +  +  + V    R K R                            A  +  RL  C K L +W+++   G++   I++K+  +Q+LE+ N 
Subjt:  FSSDHRPIIATIDTSPVLLANRVKGRLL------------------------RGAHQINNRLRVCIKNLHRWSKDRLKGSISSAIRKKEEEIQKLEAEN-

Query:  -HSLDGNEIEAAELELNALLEEEEYYWRSRSREVWLKSGDKNTKWFHAKASQRRKRNRIEGIFDSQGVWREDEEGIARVATGYFRDIFQSSNPGRGQISE
         H     EI+  + E+N L   EE  W+ RSR  WL+ GDKN+K+FHA ASQRR++NRI G+ D  GVW ED+E   ++   YF+DI+ S+ P    +S 
Subjt:  -HSLDGNEIEAAELELNALLEEEEYYWRSRSREVWLKSGDKNTKWFHAKASQRRKRNRIEGIFDSQGVWREDEEGIARVATGYFRDIFQSSNPGRGQISE

Query:  VSDCVNMKLSDQQRGVLDQPYSKQEVEVAIKNLHPGKAPGIDGTHASFFQAYWDIVGEDTVRECLQILNHDADITPFNKTLITLIPKVRDPKRMQEYRPI
          + ++ +++ +    L + +   EV  A++ +HP KAPG DG    F+Q YWDIVG       LQ LN        NKT I LIPK ++P+++ E+RPI
Subjt:  VSDCVNMKLSDQQRGVLDQPYSKQEVEVAIKNLHPGKAPGIDGTHASFFQAYWDIVGEDTVRECLQILNHDADITPFNKTLITLIPKVRDPKRMQEYRPI

Query:  SLCNVTYKIVAKTIANRFKKILDAVISPTQAAFVPGRLISDNVLVGFECIHAINNRKRGKEGHVAIKLDMSKAYDRVEWEYIRNVFTKMGFSEKWIQNIM
        SLCNV YKI++K +ANR KK+L  VI   Q+AFVPGR+I+DNV+V FE +H+IN R++GKEG +AIKLDMSKAYDRVEW Y+ ++  KMGF ++WI  IM
Subjt:  SLCNVTYKIVAKTIANRFKKILDAVISPTQAAFVPGRLISDNVLVGFECIHAINNRKRGKEGHVAIKLDMSKAYDRVEWEYIRNVFTKMGFSEKWIQNIM

Query:  KCVETVTFSVLMNGVPQEEFKPGRGIRQGDPLSPYIFLMCAEGFSGLLNREVESQNLKGFRINNFCPPLSHLFFADDSLIFCRSSMEECHTIKKVFKAYE
         CV +V+FSVL+NG P+  F P RG+RQGDP+SPY+FL+C EG S ++ ++     ++G       P +SHLFFADDS+IFCR++++EC  + KV + YE
Subjt:  KCVETVTFSVLMNGVPQEEFKPGRGIRQGDPLSPYIFLMCAEGFSGLLNREVESQNLKGFRINNFCPPLSHLFFADDSLIFCRSSMEECHTIKKVFKAYE

Query:  LASGQAINLDKSRFMVSKNVLQGEAQAIGDILGIKQAMNLGVYLGMPSQAIRSKTQMFSKLKDKIRNILQGWSEKLFSAGGKEVLIKAVIQAIPTYSMSC
          SGQ +N DK+    S+N      +    I G +   +   YLG+P    R+K + F+++KD++   + GW  KL S  G+EVLIKAV QA PTY+M+ 
Subjt:  LASGQAINLDKSRFMVSKNVLQGEAQAIGDILGIKQAMNLGVYLGMPSQAIRSKTQMFSKLKDKIRNILQGWSEKLFSAGGKEVLIKAVIQAIPTYSMSC

Query:  FKLPKSLCKEINNMCARFWWGATGDKKKLHWAKWNDLCKSKTLG
        FKLP SLC E+N+M   FWWG  G +KK+ W  W +LCK K  G
Subjt:  FKLPKSLCKEINNMCARFWWGATGDKKKLHWAKWNDLCKSKTLG

A0A7N2R0C3 Reverse transcriptase domain-containing protein1.8e-22835.84Show/hide
Query:  RKMKITEEEKSGLVEVVDEDIESANKDIENTVVCKILTSKNIMVDVFQRSMPKIWGLQGRVQLKKSGKNLYVCKFKNKRDKIRIVDGGPWIYDNAILVFD
        +++K+TEEE+  ++ + DE + +A +  +  V  K+++ K +MV+  ++++  +W     +QL   G+ L++ +F+++RDK R++D  PW Y+  +++F 
Subjt:  RKMKITEEEKSGLVEVVDEDIESANKDIENTVVCKILTSKNIMVDVFQRSMPKIWGLQGRVQLKKSGKNLYVCKFKNKRDKIRIVDGGPWIYDNAILVFD

Query:  DPKGNKDIKSLEFRYAQFWFHFHNLPVVCLNRKYATALGNSVGTFIKLEEGTEERVWGETLRVKVEVDVNKPIRRGTNIKVGSMAEEKWIPITIEKLPDF
        + +G++  K +  +++ FW   +NLP+    ++   A+G S+G FI+++       WG  LRV+VE+DV + + RG  I +    E +W+    E+LP+F
Subjt:  DPKGNKDIKSLEFRYAQFWFHFHNLPVVCLNRKYATALGNSVGTFIKLEEGTEERVWGETLRVKVEVDVNKPIRRGTNIKVGSMAEEKWIPITIEKLPDF

Query:  CYQCGRLGHVMRDCEEDENNELEAEE--CQYGPWMRESFIFRNE-----SKKK-----KEKQTDEQERNRGDNTTVEDSKNHSSDGE---LPESHRRRKS
        CY+CG L H ++DC E+   +   EE   QYG W+R   I +       +KKK     K K+  +    +G +   E     + + E   L +S + R  
Subjt:  CYQCGRLGHVMRDCEEDENNELEAEE--CQYGPWMRESFIFRNE-----SKKK-----KEKQTDEQERNRGDNTTVEDSKNHSSDGE---LPESHRRRKS

Query:  DEKKATESTTEAEIIKKLTLTENSEQRAVETEHSRKGPLEEEIKTRQPEETGRASGEQLDENIVE--EGPTLNSMVQLEKKEEAKGPQTEG---GRAHES
        D     E TT    +KK    EN    ++   HS  G L E  +  +    G    E   +   E    P  N    L   E  K     G   G     
Subjt:  DEKKATESTTEAEIIKKLTLTENSEQRAVETEHSRKGPLEEEIKTRQPEETGRASGEQLDENIVE--EGPTLNSMVQLEKKEEAKGPQTEG---GRAHES

Query:  QGIVKKNQDAQKG--KDRISPNITQTKKWKRLARIDPMQQDEDNRIMETQGTKRKTEGE------------ESEEGTPPDAMKTLCWNVRGLGNPRTLRA
         G +    D ++G   +++ P+   +  WKR+ R  P  + +++        +RK +G+                 TPP  M  L WN RG+G+   +RA
Subjt:  QGIVKKNQDAQKG--KDRISPNITQTKKWKRLARIDPMQQDEDNRIMETQGTKRKTEGE------------ESEEGTPPDAMKTLCWNVRGLGNPRTLRA

Query:  LRHVVESEKPQLVFIIETKCDGYKGEQIKKSLSFDFCWCVPSKGSSGGLLLLWNSGVKLSIKSWSEGHIDSMFTFNGWTG----RFTGFYGNPEVVKRRD
        L   V++  P LVF+ ETK    + + +++ L       VPS G SGGL +LW  GV +S+KS S  HID      G  G    R TGFYG+P+   R  
Subjt:  LRHVVESEKPQLVFIIETKCDGYKGEQIKKSLSFDFCWCVPSKGSSGGLLLLWNSGVKLSIKSWSEGHIDSMFTFNGWTG----RFTGFYGNPEVVKRRD

Query:  SWSLLERLVENDDVPWLVGGDFNEILSADEKKGGAVRNQSQMNGFQEVINICRLMDAGFRGDMFTWRRGRQKKKQIKERLDRFFVNHALSQKSQSIRVQH
        SW LLE L    ++PW+V GDFNEIL++DEK G   R+  QM  F+E ++ C L+D GF G  FTW  GR  +++   RLDR   N          +V H
Subjt:  SWSLLERLVENDDVPWLVGGDFNEILSADEKKGGAVRNQSQMNGFQEVINICRLMDAGFRGDMFTWRRGRQKKKQIKERLDRFFVNHALSQKSQSIRVQH

Query:  LSFFSSDHRPIIATI---DTSPVL--------LANRVKG--RLLRGAHQ---------INNRLRVCIKNLHRWSKDRLKGSISSAIRKKEEEIQKLEAEN
         S  +SDH  +  +I   +T  V         +  R +G   ++  A           I NRL+ C   L  W++ R+ G+++  +++K+  +Q+LE  N
Subjt:  LSFFSSDHRPIIATI---DTSPVL--------LANRVKG--RLLRGAHQ---------INNRLRVCIKNLHRWSKDRLKGSISSAIRKKEEEIQKLEAEN

Query:  --HSLDGNEIEAAELELNALLEEEEYYWRSRSREVWLKSGDKNTKWFHAKASQRRKRNRIEGIFDSQGVWREDEEGIARVATGYFRDIFQSSNPGRGQIS
          H     E++  + E+N ++  EE  W  RSR +W+K GD+NT++FHA A+ RR++N+IEGI DS+G WRE+ E +  +   YF++I+ S+ P   +  
Subjt:  --HSLDGNEIEAAELELNALLEEEEYYWRSRSREVWLKSGDKNTKWFHAKASQRRKRNRIEGIFDSQGVWREDEEGIARVATGYFRDIFQSSNPGRGQIS

Query:  EVSDCVNMKLSDQQRGVLDQPYSKQEVEVAIKNLHPGKAPGIDGTHASFFQAYWDIVGEDTVRECLQILNHDADITPFNKTLITLIPKVRDPKRMQEYRP
             V  ++++     L + + ++EV  A+  +HP K+PG DG    FFQ YWD+VG   V+  +  L         N+T I LIPKV+ P+++ EYRP
Subjt:  EVSDCVNMKLSDQQRGVLDQPYSKQEVEVAIKNLHPGKAPGIDGTHASFFQAYWDIVGEDTVRECLQILNHDADITPFNKTLITLIPKVRDPKRMQEYRP

Query:  ISLCNVTYKIVAKTIANRFKKILDAVISPTQAAFVPGRLISDNVLVGFECIHAINNRKRGKEGHVAIKLDMSKAYDRVEWEYIRNVFTKMGFSEKWIQNI
        ISLCNV YK+V+K +ANR K +L  V+   Q+AFVPGR I+DNVLV FE +H IN R++GKEG +AIKLDMSKAYDRVEW Y+  +  +MGF E+WI  +
Subjt:  ISLCNVTYKIVAKTIANRFKKILDAVISPTQAAFVPGRLISDNVLVGFECIHAINNRKRGKEGHVAIKLDMSKAYDRVEWEYIRNVFTKMGFSEKWIQNI

Query:  MKCVETVTFSVLMNGVPQEEFKPGRGIRQGDPLSPYIFLMCAEGFSGLLNREVESQNLKGFRINNFCPPLSHLFFADDSLIFCRSSMEECHTIKKVFKAY
        M CV TV+FSVL+NG P+    P RG+RQGDP+SPY+FL+CAEG S +L R    + + G +I    P +SHL FADD ++F ++SMEE   + K+ + Y
Subjt:  MKCVETVTFSVLMNGVPQEEFKPGRGIRQGDPLSPYIFLMCAEGFSGLLNREVESQNLKGFRINNFCPPLSHLFFADDSLIFCRSSMEECHTIKKVFKAY

Query:  ELASGQAINLDKSRFMVSKNVLQGEAQAIGDILGIKQAMNLGVYLGMPSQAIRSKTQMFSKLKDKIRNILQGWSEKLFSAGGKEVLIKAVIQAIPTYSMS
        E  SGQ +N +K+    SKN      + + ++ G +       YLG+P    R K + F+++KD++   +  W  +L S  G+E+LIKAV QA PTY+M+
Subjt:  ELASGQAINLDKSRFMVSKNVLQGEAQAIGDILGIKQAMNLGVYLGMPSQAIRSKTQMFSKLKDKIRNILQGWSEKLFSAGGKEVLIKAVIQAIPTYSMS

Query:  CFKLPKSLCKEINNMCARFWWGATGDKKKLHWAKWNDLCKSKTLG
        CF LP SLC E+N++   FWWG    +KKL W  W  LCK K  G
Subjt:  CFKLPKSLCKEINNMCARFWWGATGDKKKLHWAKWNDLCKSKTLG

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein4.0e-3923.79Show/hide
Query:  NLHRWSKDRLK-GSISSAIRKKEEEIQKLEAENHSLDGNEIEAAELELNALLEEEEYYWRSRSREVWLKSGDKNTKWFHAKASQRRKRNRIEGIFDSQGV
        N ++  ++R K  +++S +++ E++ Q     + +    EI     EL  +  ++     + SR  + +  +K  +       ++R++N+I+ I + +G 
Subjt:  NLHRWSKDRLK-GSISSAIRKKEEEIQKLEAENHSLDGNEIEAAELELNALLEEEEYYWRSRSREVWLKSGDKNTKWFHAKASQRRKRNRIEGIFDSQGV

Query:  WREDEEGIARVATGYFRDIFQSSNPGRGQISEVSDCVNM-KLSDQQRGVLDQPYSKQEVEVAIKNLHPGKAPGIDGTHASFFQAYWDIVGEDTVRECLQI
           D   I      Y++ ++ +      ++    D   + +L+ ++   L++P +  E+   I +L   K+PG DG  A F+Q Y + +    ++    I
Subjt:  WREDEEGIARVATGYFRDIFQSSNPGRGQISEVSDCVNM-KLSDQQRGVLDQPYSKQEVEVAIKNLHPGKAPGIDGTHASFFQAYWDIVGEDTVRECLQI

Query:  LNHDADITPFNKTLITLIPKV-RDPKRMQEYRPISLCNVTYKIVAKTIANRFKKILDAVISPTQAAFVPGRLISDNVLVGFECIHAINNRKRGKEGHVAI
                 F +  I LIPK  RD  + + +RPISL N+  KI+ K +ANR ++ +  +I   Q  F+PG     N+      I  IN  K   + HV I
Subjt:  LNHDADITPFNKTLITLIPKV-RDPKRMQEYRPISLCNVTYKIVAKTIANRFKKILDAVISPTQAAFVPGRLISDNVLVGFECIHAINNRKRGKEGHVAI

Query:  KLDMSKAYDRVEWEYIRNVFTKMGFSEKWIQNIMKCVETVTFSVLMNGVPQEEFKPGRGIRQGDPLSPYIFLMCAEGFSGLLNREVESQNLKGFRINNFC
         +D  KA+D+++  ++     K+G    +++ I    +  T ++++NG   E F    G RQG PLSP +F +  E  +  + +E E   +KG ++    
Subjt:  KLDMSKAYDRVEWEYIRNVFTKMGFSEKWIQNIMKCVETVTFSVLMNGVPQEEFKPGRGIRQGDPLSPYIFLMCAEGFSGLLNREVESQNLKGFRINNFC

Query:  PPLSHLFFADDSLIFCRSSMEECHTIKKVFKAYELASGQAINLDKSRFMVSKNVLQGEAQAIGDI---LGIKQAMNLGVYLGMPSQAIRSKTQMFSKLKD
          LS   FADD +++  + +     + K+   +   SG  IN+ KS+  +  N  Q E+Q +G++   +  K+   LG+ L    + +  +   +  L  
Subjt:  PPLSHLFFADDSLIFCRSSMEECHTIKKVFKAYELASGQAINLDKSRFMVSKNVLQGEAQAIGDI---LGIKQAMNLGVYLGMPSQAIRSKTQMFSKLKD

Query:  KIRNILQGWSEKLFSAGGKEVLIKAVI--QAIPTYSMSCFKLPKSLCKEINNMCARFWW
        +I+     W     S  G+  ++K  I  + I  ++    KLP +   E+     +F W
Subjt:  KIRNILQGWSEKLFSAGGKEVLIKAVI--QAIPTYSMSCFKLPKSLCKEINNMCARFWW

P08548 LINE-1 reverse transcriptase homolog3.6e-4025.6Show/hide
Query:  NNRLRVCIKNLHRWSKDRLKG---SISSAIRKKEEE--------IQKLEAENHS----LDGNEIEAAELELNALLEEEEYYWRSRSREVWLKSGDKNTKW
        NN      +NL   +K  L+G   ++ + ++K E E        +++LE E HS        EI     ELN +  +      ++S+  + +  +K  K 
Subjt:  NNRLRVCIKNLHRWSKDRLKG---SISSAIRKKEEE--------IQKLEAENHS----LDGNEIEAAELELNALLEEEEYYWRSRSREVWLKSGDKNTKW

Query:  FHAKASQRRKRNRIEGIFDSQGVWREDEEGIARVATGYFRDIFQSSNPGRGQISEVSDCVNM-KLSDQQRGVLDQPYSKQEVEVAIKNLHPGKAPGIDGT
              ++R ++ I  I +       D   I ++   Y++ ++        +I +  +  ++ +LS ++  +L++P S  E+   I+NL   K+PG DG 
Subjt:  FHAKASQRRKRNRIEGIFDSQGVWREDEEGIARVATGYFRDIFQSSNPGRGQISEVSDCVNM-KLSDQQRGVLDQPYSKQEVEVAIKNLHPGKAPGIDGT

Query:  HASFFQAYWDIVGEDTVRECLQILNH--DADITP--FNKTLITLIPKV-RDPKRMQEYRPISLCNVTYKIVAKTIANRFKKILDAVISPTQAAFVPGRLI
         + F+Q +     E+ V   L +  +     I P  F +  ITLIPK  +DP R + YRPISL N+  KI+ K + NR ++ +  +I   Q  F+PG   
Subjt:  HASFFQAYWDIVGEDTVRECLQILNH--DADITP--FNKTLITLIPKV-RDPKRMQEYRPISLCNVTYKIVAKTIANRFKKILDAVISPTQAAFVPGRLI

Query:  SDNVLVGFECIHAINNRKRGKEGHVAIKLDMSKAYDRVEWEYIRNVFTKMGFSEKWIQNIMKCVETVTFSVLMNGVPQEEFKPGRGIRQGDPLSPYIFLM
          N+      I  IN  K   + H+ + +D  KA+D ++  ++     K+G    +++ I       T ++++NGV  + F    G RQG PLSP +F +
Subjt:  SDNVLVGFECIHAINNRKRGKEGHVAIKLDMSKAYDRVEWEYIRNVFTKMGFSEKWIQNIMKCVETVTFSVLMNGVPQEEFKPGRGIRQGDPLSPYIFLM

Query:  CAEGFSGLLNREVESQNLKGFRINNFCPPLSHLFFADDSLIFCRSSMEECHTIKKVFKAYELASGQAINLDKSRFMVSKNVLQGE---AQAIGDILGIKQ
          E  +  +    E + +KG  I +    LS   FADD +++  ++ +    + +V K Y   SG  IN  KS   +  N  Q E     +I   +  K+
Subjt:  CAEGFSGLLNREVESQNLKGFRINNFCPPLSHLFFADDSLIFCRSSMEECHTIKKVFKAYELASGQAINLDKSRFMVSKNVLQGE---AQAIGDILGIKQ

Query:  AMNLGVYLGMPSQAIRSKTQMFSKLKDKIRNILQGWSEKLFSAGGKEVLIKAVI--QAIPTYSMSCFKLPKSLCKEINNMCARFWW
           LGVYL    + +  +   +  L+ +I   +  W     S  G+  ++K  I  +AI  ++    K P S  K++  +   F W
Subjt:  AMNLGVYLGMPSQAIRSKTQMFSKLKDKIRNILQGWSEKLFSAGGKEVLIKAVI--QAIPTYSMSCFKLPKSLCKEINNMCARFWW

P11369 LINE-1 retrotransposable element ORF2 protein7.5e-3825.22Show/hide
Query:  SKDRLKGSISSAIRKKEEEIQKLEAENHSLD-GNEIEAAELELNALLEEEEYYWRSRSREVWLKSGDKNTKWFHAKASQRRKRNRIEGIFDSQGVWREDE
        SK + + + +S++    + ++K EA +       EI     E+N +         +++R  + +  +K  K         R +  I  I + +G    D 
Subjt:  SKDRLKGSISSAIRKKEEEIQKLEAENHSLD-GNEIEAAELELNALLEEEEYYWRSRSREVWLKSGDKNTKWFHAKASQRRKRNRIEGIFDSQGVWREDE

Query:  EGIARVATGYFRDIFQSSNPGRGQISEVSDCVNM-KLSDQQRGVLDQPYSKQEVEVAIKNLHPGKAPGIDGTHASFFQAYWDIVGEDTVRECLQILNHDA
        E I      +++ ++ +      ++ +  D   + KL+  Q   L+ P S +E+E  I +L   K+PG DG  A F+Q +     ++ +   L  L H  
Subjt:  EGIARVATGYFRDIFQSSNPGRGQISEVSDCVNM-KLSDQQRGVLDQPYSKQEVEVAIKNLHPGKAPGIDGTHASFFQAYWDIVGEDTVRECLQILNHDA

Query:  DI-----TPFNKTLITLIPK-VRDPKRMQEYRPISLCNVTYKIVAKTIANRFKKILDAVISPTQAAFVPGRLISDNVLVGFECIHAINNRKRGKEGHVAI
        ++       F +  ITLIPK  +DP +++ +RPISL N+  KI+ K +ANR ++ + A+I P Q  F+PG     N+      IH IN  K   + H+ I
Subjt:  DI-----TPFNKTLITLIPK-VRDPKRMQEYRPISLCNVTYKIVAKTIANRFKKILDAVISPTQAAFVPGRLISDNVLVGFECIHAINNRKRGKEGHVAI

Query:  KLDMSKAYDRVEWEYIRNVFTKMGFSEKWIQNIMKCVETVTFSVLMNGVPQEEFKPGRGIRQGDPLSPYIFLMCAEGFSGLLNREVESQNLKGFRINNFC
         LD  KA+D+++  ++  V  + G    ++  I         ++ +NG   E      G RQG PLSPY+F +  E  +  + ++ E   +KG +I    
Subjt:  KLDMSKAYDRVEWEYIRNVFTKMGFSEKWIQNIMKCVETVTFSVLMNGVPQEEFKPGRGIRQGDPLSPYIFLMCAEGFSGLLNREVESQNLKGFRINNFC

Query:  PPLSHLFFADDSLIFCRSSMEECHTIKKVFKAYELASGQAINLDKS-RFMVSKNVLQGEAQAIGDILGIKQAMNLGVYLG--MPSQAIRSKTQMFSKLKD
          +S L  ADD +++          +  +  ++    G  IN +KS  F+ +KN  Q E + I +        N   YLG  +  +      + F  LK 
Subjt:  PPLSHLFFADDSLIFCRSSMEECHTIKKVFKAYELASGQAINLDKS-RFMVSKNVLQGEAQAIGDILGIKQAMNLGVYLG--MPSQAIRSKTQMFSKLKD

Query:  KIRNILQGWSEKLFSAGGKEVLIKAVI--QAIPTYSMSCFKLPKSLCKEINNMCARFWW
        +I+  L+ W +   S  G+  ++K  I  +AI  ++    K+P     E+     +F W
Subjt:  KIRNILQGWSEKLFSAGGKEVLIKAVI--QAIPTYSMSCFKLPKSLCKEINNMCARFWW

P14381 Transposon TX1 uncharacterized 149 kDa protein6.4e-4524.6Show/hide
Query:  PEVVKRRDSWSLLERLVENDDVPWLVGGDFNEILSADEKKGGAVRNQSQMNGFQEVINICRLMDAGFRGD----MFTWRRGRQKKKQIKERLDRFFVNHA
        PE  +  +S S     +++D+   ++GGDFN  L A ++     R+ S+ +  +E+I    L+D     +     FT+ R R      + R+DR +++  
Subjt:  PEVVKRRDSWSLLERLVENDDVPWLVGGDFNEILSADEKKGGAVRNQSQMNGFQEVINICRLMDAGFRGD----MFTWRRGRQKKKQIKERLDRFFVNHA

Query:  LSQKSQSIRVQHLSFFSSDHRPIIATIDTSP------------VLLANRVKGRLLRGAHQINNRLRVCIKNLHRW---SKDRLK----GSISSAIRKKEE
        L  ++QS  ++   F  SDH  +   +  +P             LL +    + +R   +     +     L++W    K  LK        S   ++  
Subjt:  LSQKSQSIRVQHLSFFSSDHRPIIATIDTSP------------VLLANRVKGRLLRGAHQINNRLRVCIKNLHRW---SKDRLK----GSISSAIRKKEE

Query:  EIQKLEAE----NHSLDGNEIEAAELEL----NALLEEEEYYWRS---RSREVWLKSGDKNTKWFHAKASQRRKRNRIEGIFDSQGVWREDEEGIARVAT
        EI+ L  E       L G+E +A + E      AL   E+   R    RSR   L   D+ +++F+A   ++  R +I  +F   G   ED E I   A 
Subjt:  EIQKLEAE----NHSLDGNEIEAAELEL----NALLEEEEYYWRS---RSREVWLKSGDKNTKWFHAKASQRRKRNRIEGIFDSQGVWREDEEGIARVAT

Query:  GYFRDIFQSSNPGRGQISEVSDCVNMKLSDQQRGVLDQPYSKQEVEVAIKNLHPGKAPGIDGTHASFFQAYWDIVGEDTVRECLQILNHDADITPFNKTL
         +++++F           E+ D + + +S++++  L+ P +  E+  A++ +   K+PG+DG    FFQ +WD +G D  R   +            + +
Subjt:  GYFRDIFQSSNPGRGQISEVSDCVNMKLSDQQRGVLDQPYSKQEVEVAIKNLHPGKAPGIDGTHASFFQAYWDIVGEDTVRECLQILNHDADITPFNKTL

Query:  ITLIPKVRDPKRMQEYRPISLCNVTYKIVAKTIANRFKKILDAVISPTQAAFVPGRLISDNVLVGFECIHAINNRKRGKEGHVAIKLDMSKAYDRVEWEY
        ++L+PK  D + ++ +RP+SL +  YKIVAK I+ R K +L  VI P Q+  VPGR I DNV +  + +H     +R       + LD  KA+DRV+ +Y
Subjt:  ITLIPKVRDPKRMQEYRPISLCNVTYKIVAKTIANRFKKILDAVISPTQAAFVPGRLISDNVLVGFECIHAINNRKRGKEGHVAIKLDMSKAYDRVEWEY

Query:  IRNVFTKMGFSEKWIQNIMKCVETVTFSVLMNGVPQEEFKPGRGIRQGDPLSPYIFLMCAEGFSGLLNREVESQNLKGFRINNFCPPLSHLFFA-DDSLI
        +        F  +++  +     +    V +N         GRG+RQG PLS  ++ +  E F  LL + +    LK        P +  +  A  D +I
Subjt:  IRNVFTKMGFSEKWIQNIMKCVETVTFSVLMNGVPQEEFKPGRGIRQGDPLSPYIFLMCAEGFSGLLNREVESQNLKGFRINNFCPPLSHLFFA-DDSLI

Query:  FCRSSMEECHTIKKVFKAYELASGQAINLDKSRFMVSKNV-LQGEAQAIGDILGIKQAMN-LGVYLGMPSQAIRSKTQMFSKLKDKIRNILQGWS--EKL
             + +    ++  + Y  AS   IN  KS  ++  ++ +     A  DI    + +  LGVYL      +   +Q F +L++ +   L  W    K+
Subjt:  FCRSSMEECHTIKKVFKAYELASGQAINLDKSRFMVSKNV-LQGEAQAIGDILGIKQAMN-LGVYLGMPSQAIRSKTQMFSKLKDKIRNILQGWS--EKL

Query:  FSAGGKEVLIKAVIQAIPTYSMSCFKLPKSLCKEINNMCARFWW
         S  G+ ++I  ++ +   Y + C    +    +I      F W
Subjt:  FSAGGKEVLIKAVIQAIPTYSMSCFKLPKSLCKEINNMCARFWW

P92555 Uncharacterized mitochondrial protein AtMg012501.2e-1451.47Show/hide
Query:  LMNGVPQEEFKPGRGIRQGDPLSPYIFLMCAEGFSGLLNREVESQNLKGFRINNFCPPLSHLFFADDS
        ++NG PQ    P RG+RQGDPLSPY+F++C E  SGL  R  E   L G R++N  P ++HL FADD+
Subjt:  LMNGVPQEEFKPGRGIRQGDPLSPYIFLMCAEGFSGLLNREVESQNLKGFRINNFCPPLSHLFFADDS

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein2.5e-2025.75Show/hide
Query:  GDFNEILSADEKKGGAVRNQSQMNGFQEVINICR---LMDAGFRGDMFTWRRGRQKKKQIKERLDRFFVN-HALSQKSQSIRVQHLSFFSSDHRPIIATI
        GDF++I +A       ++    M G +E  N  R   L+D   RG  +TW    Q    I  +LDR   N    S    +I V  LS   SDH P I  +
Subjt:  GDFNEILSADEKKGGAVRNQSQMNGFQEVINICR---LMDAGFRGDMFTWRRGRQKKKQIKERLDRFFVN-HALSQKSQSIRVQHLSFFSSDHRPIIATI

Query:  DTSP--------------------VLLANRVKGRLLRGAHQIN-----NRLRVCIKNLHRWSKDRLKGSISSAIRKKEE-EIQKLEAENHSLDGNEIEAA
        +  P                    V L    + ++  G+H  +        + C K L+R     ++     A+   E  + Q L   + SL   E   A
Subjt:  DTSP--------------------VLLANRVKGRLLRGAHQIN-----NRLRVCIKNLHRWSKDRLKGSISSAIRKKEE-EIQKLEAENHSLDGNEIEAA

Query:  ELELNALLEEEEYYWRSRSREVWLKSGDKNTKWFHAKASQRRKRNRIEGIFDSQGVWREDEEGIARVATGYFRDIFQSSNP--GRGQISEVSDCVNMKLS
          + N      E ++R +SR  WL+ GD NT++FH      + +N I+ +     V  E+   +  +   Y+  +  S +       +  + D    + +
Subjt:  ELELNALLEEEEYYWRSRSREVWLKSGDKNTKWFHAKASQRRKRNRIEGIFDSQGVWREDEEGIARVATGYFRDIFQSSNP--GRGQISEVSDCVNMKLS

Query:  DQQRGVLDQPYSKQEVEVAIKNLHPGKAPGIDGTHASFFQAYWDIVGEDTVRECLQILNHDADITPFNKTLITLIPKVRDPKRMQEYRPISLCNVTYKIV
        D     L    S +E+  A+  +   KAPG D   A FF   W +V + T+    +       +  FN T ITLIPKV    ++  +RP+S C V YKI+
Subjt:  DQQRGVLDQPYSKQEVEVAIKNLHPGKAPGIDGTHASFFQAYWDIVGEDTVRECLQILNHDADITPFNKTLITLIPKVRDPKRMQEYRPISLCNVTYKIV

AT4G20520.1 RNA binding;RNA-directed DNA polymerases5.4e-1537.5Show/hide
Query:  IANRFKKILDAVISPTQAAFVPGRLISDNVLVGFECIHAINNRKRGKEGHVAIKLDMSKAYDRVEWEYIRNVFTKMGFSEKWIQNIMK
        +  R K ++  +I P QA+F+PGR+ +DN++   E +H++  RK+G +G + +KLD+ KAYDR+ W+Y+ +     GF E W+  I +
Subjt:  IANRFKKILDAVISPTQAAFVPGRLISDNVLVGFECIHAINNRKRGKEGHVAIKLDMSKAYDRVEWEYIRNVFTKMGFSEKWIQNIMK

AT4G29090.1 Ribonuclease H-like superfamily protein9.8e-0945.28Show/hide
Query:  AIPTYSMSCFKLPKSLCKEINNMCARFWWGATGDKKKLHWAKWNDLCKSKTLG
        A+PTY+M+CF LPK++CK+I ++ A FWW    + K +HW  W+ L   K  G
Subjt:  AIPTYSMSCFKLPKSLCKEINNMCARFWWGATGDKKKLHWAKWNDLCKSKTLG

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein1.8e-1048Show/hide
Query:  AIPTYSMSCFKLPKSLCKEINNMCARFWWGATGDKKKLHWAKWNDLCKSK
        A+P Y+MSCF+L K LCK++ +    FWW +  +K+K+ W  W  LCKSK
Subjt:  AIPTYSMSCFKLPKSLCKEINNMCARFWWGATGDKKKLHWAKWNDLCKSK

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)8.3e-1651.47Show/hide
Query:  LMNGVPQEEFKPGRGIRQGDPLSPYIFLMCAEGFSGLLNREVESQNLKGFRINNFCPPLSHLFFADDS
        ++NG PQ    P RG+RQGDPLSPY+F++C E  SGL  R  E   L G R++N  P ++HL FADD+
Subjt:  LMNGVPQEEFKPGRGIRQGDPLSPYIFLMCAEGFSGLLNREVESQNLKGFRINNFCPPLSHLFFADDS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACAAAAAAAACCGAAACAGAGGAAATCAGAATATTACGAGAGAAGGGTCGTCGTCAAGTGGGGTTAAGGAACGAGAAGCTGAATCTGGGGAATCGGCAAAAAGACG
GGAAAGGCTCTCCGTAACTTTATCTTTGAAGGCGAACATGGGTAACAGTGAAGAAAGAAGAGGGAGAGATTTGTGTTCCGTTGAGGGCTCTGCAACTTTTTTCTTCAAAT
TTGGAATCCACTTCTGCGAAGATGGATTAGAAGAGAAGGAGCGGAGGTGTCGGCGGATAGAGACAGCCGGAGAAATTGGAAGGAAGAGGAACGGTGAGAAAGTGAGAAAT
CTGAGAGAGTGGGATACTATGGAAGAAGGTAAAATCGTTCAAGAACTTAGGAAAATGAAGATCACTGAAGAAGAAAAGAGTGGGCTTGTGGAAGTTGTGGATGAAGACAT
AGAATCAGCAAACAAAGATATAGAGAATACCGTGGTCTGTAAAATTCTCACTTCGAAGAACATTATGGTGGACGTGTTTCAAAGATCGATGCCAAAAATTTGGGGGCTCC
AAGGAAGGGTTCAGTTGAAGAAATCTGGAAAGAACTTATATGTTTGTAAATTTAAAAACAAAAGGGACAAGATCAGAATAGTTGATGGAGGGCCATGGATATACGACAAT
GCCATATTAGTTTTCGATGACCCAAAAGGAAACAAGGATATAAAATCGCTGGAATTCAGGTATGCTCAGTTTTGGTTTCACTTTCACAACTTACCTGTAGTTTGTCTCAA
CAGGAAATATGCTACAGCTCTGGGTAACTCAGTTGGCACTTTTATCAAACTGGAAGAGGGAACAGAAGAACGCGTATGGGGAGAAACTCTAAGGGTAAAAGTAGAGGTTG
ATGTCAACAAACCGATTAGAAGAGGAACAAACATCAAGGTAGGTTCAATGGCGGAGGAAAAATGGATTCCGATTACCATAGAAAAGCTACCAGATTTTTGTTACCAATGT
GGAAGATTGGGTCATGTGATGAGGGACTGCGAAGAGGATGAAAACAACGAATTAGAGGCAGAAGAATGTCAATATGGCCCGTGGATGAGGGAAAGCTTCATTTTCAGAAA
TGAATCCAAAAAGAAAAAGGAAAAACAAACAGATGAGCAAGAGAGGAACAGGGGGGATAACACAACAGTTGAAGACAGCAAGAACCATTCCTCCGATGGGGAGTTGCCGG
AAAGTCATCGGAGAAGAAAGTCGGATGAGAAGAAAGCAACCGAAAGTACCACAGAAGCAGAAATTATTAAAAAATTAACTCTGACAGAAAATTCTGAGCAAAGAGCCGTT
GAAACTGAACACTCGAGAAAGGGACCGCTAGAAGAGGAAATCAAGACAAGACAACCCGAGGAGACAGGAAGGGCTAGCGGTGAACAACTTGATGAAAATATAGTAGAGGA
GGGGCCCACTTTGAACAGTATGGTGCAGTTGGAAAAAAAAGAAGAAGCTAAGGGGCCTCAAACAGAAGGTGGTAGAGCCCATGAATCGCAAGGAATAGTAAAAAAAAATC
AGGACGCACAGAAAGGAAAAGACAGGATAAGCCCAAATATCACCCAAACGAAAAAATGGAAACGTCTTGCACGAATTGATCCTATGCAACAGGACGAGGACAACAGAATC
ATGGAGACACAGGGAACAAAACGCAAAACGGAGGGAGAGGAGTCTGAGGAAGGAACCCCGCCGGACGCCATGAAAACCTTATGCTGGAACGTTCGAGGTCTGGGGAATCC
TCGAACGCTCCGAGCTCTTCGCCATGTGGTGGAAAGCGAGAAACCTCAGTTGGTGTTTATTATAGAAACAAAGTGTGATGGTTATAAAGGCGAGCAAATTAAGAAAAGTC
TTAGCTTTGATTTTTGCTGGTGTGTGCCGAGCAAAGGGAGCAGTGGCGGCTTGCTGCTTCTCTGGAATTCCGGAGTAAAGCTTAGCATTAAGTCTTGGTCGGAGGGTCAC
ATAGATTCAATGTTCACTTTCAATGGGTGGACTGGGAGATTCACCGGCTTTTATGGGAATCCGGAGGTGGTTAAAAGAAGGGATTCGTGGAGCTTGCTAGAGAGGCTAGT
CGAGAATGATGATGTTCCTTGGCTTGTTGGAGGGGATTTTAACGAAATTCTCTCAGCGGATGAAAAAAAAGGAGGGGCAGTGAGGAACCAATCCCAAATGAATGGGTTCC
AAGAGGTGATCAACATTTGTCGGCTCATGGATGCAGGCTTTAGGGGGGATATGTTCACGTGGAGAAGGGGCAGGCAGAAAAAGAAGCAGATTAAAGAGAGGCTTGATAGG
TTCTTTGTTAACCATGCTTTATCCCAAAAGTCTCAGAGCATTCGAGTTCAACACCTTAGTTTTTTTTCTTCGGATCACAGGCCCATTATAGCCACTATTGATACTTCCCC
TGTTTTGCTTGCCAATCGAGTCAAAGGCAGATTGTTGCGAGGAGCGCATCAGATCAATAATAGACTAAGAGTGTGTATTAAGAACCTGCATCGTTGGAGCAAGGACAGGC
TCAAAGGTAGCATCAGTTCGGCCATCAGGAAAAAAGAGGAGGAAATTCAGAAGCTAGAAGCCGAGAATCACTCTCTGGATGGGAATGAAATAGAAGCAGCAGAGTTGGAG
TTGAATGCACTTCTCGAAGAAGAAGAGTATTACTGGAGAAGCAGATCAAGGGAAGTGTGGCTTAAAAGTGGGGACAAAAACACTAAATGGTTCCACGCCAAAGCCTCCCA
AAGGAGGAAGAGGAATAGGATTGAAGGCATTTTTGATTCGCAGGGGGTGTGGAGAGAAGATGAAGAGGGAATTGCTAGAGTGGCAACAGGGTACTTCAGAGATATTTTCC
AATCTTCCAACCCTGGAAGGGGCCAGATCTCTGAGGTCTCTGACTGTGTTAACATGAAGCTTTCGGATCAGCAAAGGGGAGTCCTAGATCAGCCATATAGCAAGCAAGAA
GTTGAGGTTGCAATAAAAAATCTCCACCCTGGCAAGGCTCCGGGGATAGATGGAACCCATGCCTCCTTTTTCCAAGCTTATTGGGATATAGTGGGCGAAGATACGGTGAG
GGAATGCCTCCAAATCCTTAACCATGACGCTGATATAACTCCTTTTAACAAGACTTTGATAACGCTCATCCCAAAAGTCAGAGATCCGAAGAGAATGCAAGAATACAGGC
CTATTAGCTTATGCAACGTGACTTACAAGATTGTAGCCAAGACCATTGCTAACAGGTTTAAAAAGATTCTCGATGCTGTTATTTCCCCAACCCAAGCGGCTTTTGTTCCA
GGGCGGCTCATTTCTGATAATGTCCTAGTAGGGTTTGAGTGTATCCACGCTATAAATAATAGAAAAAGAGGGAAAGAAGGGCATGTAGCTATCAAGCTTGATATGAGCAA
AGCCTACGACCGCGTCGAATGGGAGTACATCAGAAATGTGTTTACAAAAATGGGGTTCAGTGAGAAGTGGATCCAGAACATCATGAAGTGCGTGGAGACGGTGACATTCT
CGGTCCTTATGAATGGAGTTCCACAAGAAGAATTCAAGCCGGGTCGAGGCATCCGGCAAGGGGATCCGCTATCCCCTTACATCTTCCTGATGTGTGCAGAAGGATTTTCA
GGCCTATTAAACAGGGAAGTAGAGTCTCAAAATTTAAAAGGCTTTAGAATTAACAATTTTTGTCCCCCCTTATCCCACCTTTTCTTCGCTGATGATAGCCTTATTTTTTG
CAGGTCCTCTATGGAGGAATGTCACACAATCAAGAAAGTTTTTAAGGCATATGAGCTGGCATCAGGTCAAGCTATAAATCTGGATAAATCTAGATTCATGGTTAGCAAGA
ATGTCCTCCAAGGAGAGGCCCAAGCCATTGGGGATATCCTCGGGATAAAGCAAGCCATGAACCTAGGGGTTTATCTTGGAATGCCCTCGCAAGCTATTAGGAGCAAGACT
CAAATGTTCAGCAAGCTTAAGGATAAGATAAGGAACATTCTTCAAGGATGGAGTGAGAAATTGTTTTCTGCAGGGGGCAAAGAAGTTCTTATAAAAGCAGTAATCCAGGC
GATTCCCACTTATTCCATGAGCTGTTTTAAGCTCCCAAAAAGCTTATGTAAGGAGATCAACAATATGTGTGCTAGATTTTGGTGGGGAGCGACCGGGGATAAGAAAAAAC
TTCATTGGGCTAAATGGAATGACCTTTGTAAAAGCAAGACTTTGGGGGATTAG
mRNA sequenceShow/hide mRNA sequence
ATGAACAAAAAAAACCGAAACAGAGGAAATCAGAATATTACGAGAGAAGGGTCGTCGTCAAGTGGGGTTAAGGAACGAGAAGCTGAATCTGGGGAATCGGCAAAAAGACG
GGAAAGGCTCTCCGTAACTTTATCTTTGAAGGCGAACATGGGTAACAGTGAAGAAAGAAGAGGGAGAGATTTGTGTTCCGTTGAGGGCTCTGCAACTTTTTTCTTCAAAT
TTGGAATCCACTTCTGCGAAGATGGATTAGAAGAGAAGGAGCGGAGGTGTCGGCGGATAGAGACAGCCGGAGAAATTGGAAGGAAGAGGAACGGTGAGAAAGTGAGAAAT
CTGAGAGAGTGGGATACTATGGAAGAAGGTAAAATCGTTCAAGAACTTAGGAAAATGAAGATCACTGAAGAAGAAAAGAGTGGGCTTGTGGAAGTTGTGGATGAAGACAT
AGAATCAGCAAACAAAGATATAGAGAATACCGTGGTCTGTAAAATTCTCACTTCGAAGAACATTATGGTGGACGTGTTTCAAAGATCGATGCCAAAAATTTGGGGGCTCC
AAGGAAGGGTTCAGTTGAAGAAATCTGGAAAGAACTTATATGTTTGTAAATTTAAAAACAAAAGGGACAAGATCAGAATAGTTGATGGAGGGCCATGGATATACGACAAT
GCCATATTAGTTTTCGATGACCCAAAAGGAAACAAGGATATAAAATCGCTGGAATTCAGGTATGCTCAGTTTTGGTTTCACTTTCACAACTTACCTGTAGTTTGTCTCAA
CAGGAAATATGCTACAGCTCTGGGTAACTCAGTTGGCACTTTTATCAAACTGGAAGAGGGAACAGAAGAACGCGTATGGGGAGAAACTCTAAGGGTAAAAGTAGAGGTTG
ATGTCAACAAACCGATTAGAAGAGGAACAAACATCAAGGTAGGTTCAATGGCGGAGGAAAAATGGATTCCGATTACCATAGAAAAGCTACCAGATTTTTGTTACCAATGT
GGAAGATTGGGTCATGTGATGAGGGACTGCGAAGAGGATGAAAACAACGAATTAGAGGCAGAAGAATGTCAATATGGCCCGTGGATGAGGGAAAGCTTCATTTTCAGAAA
TGAATCCAAAAAGAAAAAGGAAAAACAAACAGATGAGCAAGAGAGGAACAGGGGGGATAACACAACAGTTGAAGACAGCAAGAACCATTCCTCCGATGGGGAGTTGCCGG
AAAGTCATCGGAGAAGAAAGTCGGATGAGAAGAAAGCAACCGAAAGTACCACAGAAGCAGAAATTATTAAAAAATTAACTCTGACAGAAAATTCTGAGCAAAGAGCCGTT
GAAACTGAACACTCGAGAAAGGGACCGCTAGAAGAGGAAATCAAGACAAGACAACCCGAGGAGACAGGAAGGGCTAGCGGTGAACAACTTGATGAAAATATAGTAGAGGA
GGGGCCCACTTTGAACAGTATGGTGCAGTTGGAAAAAAAAGAAGAAGCTAAGGGGCCTCAAACAGAAGGTGGTAGAGCCCATGAATCGCAAGGAATAGTAAAAAAAAATC
AGGACGCACAGAAAGGAAAAGACAGGATAAGCCCAAATATCACCCAAACGAAAAAATGGAAACGTCTTGCACGAATTGATCCTATGCAACAGGACGAGGACAACAGAATC
ATGGAGACACAGGGAACAAAACGCAAAACGGAGGGAGAGGAGTCTGAGGAAGGAACCCCGCCGGACGCCATGAAAACCTTATGCTGGAACGTTCGAGGTCTGGGGAATCC
TCGAACGCTCCGAGCTCTTCGCCATGTGGTGGAAAGCGAGAAACCTCAGTTGGTGTTTATTATAGAAACAAAGTGTGATGGTTATAAAGGCGAGCAAATTAAGAAAAGTC
TTAGCTTTGATTTTTGCTGGTGTGTGCCGAGCAAAGGGAGCAGTGGCGGCTTGCTGCTTCTCTGGAATTCCGGAGTAAAGCTTAGCATTAAGTCTTGGTCGGAGGGTCAC
ATAGATTCAATGTTCACTTTCAATGGGTGGACTGGGAGATTCACCGGCTTTTATGGGAATCCGGAGGTGGTTAAAAGAAGGGATTCGTGGAGCTTGCTAGAGAGGCTAGT
CGAGAATGATGATGTTCCTTGGCTTGTTGGAGGGGATTTTAACGAAATTCTCTCAGCGGATGAAAAAAAAGGAGGGGCAGTGAGGAACCAATCCCAAATGAATGGGTTCC
AAGAGGTGATCAACATTTGTCGGCTCATGGATGCAGGCTTTAGGGGGGATATGTTCACGTGGAGAAGGGGCAGGCAGAAAAAGAAGCAGATTAAAGAGAGGCTTGATAGG
TTCTTTGTTAACCATGCTTTATCCCAAAAGTCTCAGAGCATTCGAGTTCAACACCTTAGTTTTTTTTCTTCGGATCACAGGCCCATTATAGCCACTATTGATACTTCCCC
TGTTTTGCTTGCCAATCGAGTCAAAGGCAGATTGTTGCGAGGAGCGCATCAGATCAATAATAGACTAAGAGTGTGTATTAAGAACCTGCATCGTTGGAGCAAGGACAGGC
TCAAAGGTAGCATCAGTTCGGCCATCAGGAAAAAAGAGGAGGAAATTCAGAAGCTAGAAGCCGAGAATCACTCTCTGGATGGGAATGAAATAGAAGCAGCAGAGTTGGAG
TTGAATGCACTTCTCGAAGAAGAAGAGTATTACTGGAGAAGCAGATCAAGGGAAGTGTGGCTTAAAAGTGGGGACAAAAACACTAAATGGTTCCACGCCAAAGCCTCCCA
AAGGAGGAAGAGGAATAGGATTGAAGGCATTTTTGATTCGCAGGGGGTGTGGAGAGAAGATGAAGAGGGAATTGCTAGAGTGGCAACAGGGTACTTCAGAGATATTTTCC
AATCTTCCAACCCTGGAAGGGGCCAGATCTCTGAGGTCTCTGACTGTGTTAACATGAAGCTTTCGGATCAGCAAAGGGGAGTCCTAGATCAGCCATATAGCAAGCAAGAA
GTTGAGGTTGCAATAAAAAATCTCCACCCTGGCAAGGCTCCGGGGATAGATGGAACCCATGCCTCCTTTTTCCAAGCTTATTGGGATATAGTGGGCGAAGATACGGTGAG
GGAATGCCTCCAAATCCTTAACCATGACGCTGATATAACTCCTTTTAACAAGACTTTGATAACGCTCATCCCAAAAGTCAGAGATCCGAAGAGAATGCAAGAATACAGGC
CTATTAGCTTATGCAACGTGACTTACAAGATTGTAGCCAAGACCATTGCTAACAGGTTTAAAAAGATTCTCGATGCTGTTATTTCCCCAACCCAAGCGGCTTTTGTTCCA
GGGCGGCTCATTTCTGATAATGTCCTAGTAGGGTTTGAGTGTATCCACGCTATAAATAATAGAAAAAGAGGGAAAGAAGGGCATGTAGCTATCAAGCTTGATATGAGCAA
AGCCTACGACCGCGTCGAATGGGAGTACATCAGAAATGTGTTTACAAAAATGGGGTTCAGTGAGAAGTGGATCCAGAACATCATGAAGTGCGTGGAGACGGTGACATTCT
CGGTCCTTATGAATGGAGTTCCACAAGAAGAATTCAAGCCGGGTCGAGGCATCCGGCAAGGGGATCCGCTATCCCCTTACATCTTCCTGATGTGTGCAGAAGGATTTTCA
GGCCTATTAAACAGGGAAGTAGAGTCTCAAAATTTAAAAGGCTTTAGAATTAACAATTTTTGTCCCCCCTTATCCCACCTTTTCTTCGCTGATGATAGCCTTATTTTTTG
CAGGTCCTCTATGGAGGAATGTCACACAATCAAGAAAGTTTTTAAGGCATATGAGCTGGCATCAGGTCAAGCTATAAATCTGGATAAATCTAGATTCATGGTTAGCAAGA
ATGTCCTCCAAGGAGAGGCCCAAGCCATTGGGGATATCCTCGGGATAAAGCAAGCCATGAACCTAGGGGTTTATCTTGGAATGCCCTCGCAAGCTATTAGGAGCAAGACT
CAAATGTTCAGCAAGCTTAAGGATAAGATAAGGAACATTCTTCAAGGATGGAGTGAGAAATTGTTTTCTGCAGGGGGCAAAGAAGTTCTTATAAAAGCAGTAATCCAGGC
GATTCCCACTTATTCCATGAGCTGTTTTAAGCTCCCAAAAAGCTTATGTAAGGAGATCAACAATATGTGTGCTAGATTTTGGTGGGGAGCGACCGGGGATAAGAAAAAAC
TTCATTGGGCTAAATGGAATGACCTTTGTAAAAGCAAGACTTTGGGGGATTAG
Protein sequenceShow/hide protein sequence
MNKKNRNRGNQNITREGSSSSGVKEREAESGESAKRRERLSVTLSLKANMGNSEERRGRDLCSVEGSATFFFKFGIHFCEDGLEEKERRCRRIETAGEIGRKRNGEKVRN
LREWDTMEEGKIVQELRKMKITEEEKSGLVEVVDEDIESANKDIENTVVCKILTSKNIMVDVFQRSMPKIWGLQGRVQLKKSGKNLYVCKFKNKRDKIRIVDGGPWIYDN
AILVFDDPKGNKDIKSLEFRYAQFWFHFHNLPVVCLNRKYATALGNSVGTFIKLEEGTEERVWGETLRVKVEVDVNKPIRRGTNIKVGSMAEEKWIPITIEKLPDFCYQC
GRLGHVMRDCEEDENNELEAEECQYGPWMRESFIFRNESKKKKEKQTDEQERNRGDNTTVEDSKNHSSDGELPESHRRRKSDEKKATESTTEAEIIKKLTLTENSEQRAV
ETEHSRKGPLEEEIKTRQPEETGRASGEQLDENIVEEGPTLNSMVQLEKKEEAKGPQTEGGRAHESQGIVKKNQDAQKGKDRISPNITQTKKWKRLARIDPMQQDEDNRI
METQGTKRKTEGEESEEGTPPDAMKTLCWNVRGLGNPRTLRALRHVVESEKPQLVFIIETKCDGYKGEQIKKSLSFDFCWCVPSKGSSGGLLLLWNSGVKLSIKSWSEGH
IDSMFTFNGWTGRFTGFYGNPEVVKRRDSWSLLERLVENDDVPWLVGGDFNEILSADEKKGGAVRNQSQMNGFQEVINICRLMDAGFRGDMFTWRRGRQKKKQIKERLDR
FFVNHALSQKSQSIRVQHLSFFSSDHRPIIATIDTSPVLLANRVKGRLLRGAHQINNRLRVCIKNLHRWSKDRLKGSISSAIRKKEEEIQKLEAENHSLDGNEIEAAELE
LNALLEEEEYYWRSRSREVWLKSGDKNTKWFHAKASQRRKRNRIEGIFDSQGVWREDEEGIARVATGYFRDIFQSSNPGRGQISEVSDCVNMKLSDQQRGVLDQPYSKQE
VEVAIKNLHPGKAPGIDGTHASFFQAYWDIVGEDTVRECLQILNHDADITPFNKTLITLIPKVRDPKRMQEYRPISLCNVTYKIVAKTIANRFKKILDAVISPTQAAFVP
GRLISDNVLVGFECIHAINNRKRGKEGHVAIKLDMSKAYDRVEWEYIRNVFTKMGFSEKWIQNIMKCVETVTFSVLMNGVPQEEFKPGRGIRQGDPLSPYIFLMCAEGFS
GLLNREVESQNLKGFRINNFCPPLSHLFFADDSLIFCRSSMEECHTIKKVFKAYELASGQAINLDKSRFMVSKNVLQGEAQAIGDILGIKQAMNLGVYLGMPSQAIRSKT
QMFSKLKDKIRNILQGWSEKLFSAGGKEVLIKAVIQAIPTYSMSCFKLPKSLCKEINNMCARFWWGATGDKKKLHWAKWNDLCKSKTLGD