; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0008677 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0008677
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationchr9:27725128..27730187
RNA-Seq ExpressionLag0008677
SyntenyLag0008677
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR025558 - Domain of unknown function DUF4283
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN65484.1 hypothetical protein VITISV_029474 [Vitis vinifera]5.5e-16938.84Show/hide
Query:  ESRLAAIDSRLVKSVWSSRHIAWVVLDPVASGGGIIILWKEDRVEVVDTVLGAFSISIKVTLLDKSEGWISGVYGPCGYQEKLHFLQELYDIHGLCQGVW
        E++    D R V SVW++R+  W  L    + GGI+I+W   ++   + VLG+FS+SIK  L      W+S VYGP     +  F  EL DI GL    W
Subjt:  ESRLAAIDSRLVKSVWSSRHIAWVVLDPVASGGGIIILWKEDRVEVVDTVLGAFSISIKVTLLDKSEGWISGVYGPCGYQEKLHFLQELYDIHGLCQGVW

Query:  CLVGDFNIIRWVEERLSRRNPLRSMKKFNRFISSADLIDLPMLNGLFTWSRLGVRVAASRLDRFLLSKQWIDKFSDFRVLRLKRPTSDHFPIMLSIGAVK
        C+ GDFN+IR   E+L       SMK F+ FIS  +LIDLP+ +  FTWS + V     RLDRFL S +W   F       L R TSDH+PI+L     K
Subjt:  CLVGDFNIIRWVEERLSRRNPLRSMKKFNRFISSADLIDLPMLNGLFTWSRLGVRVAASRLDRFLLSKQWIDKFSDFRVLRLKRPTSDHFPIMLSIGAVK

Query:  WGPTPFRFENSWLDIRDFKEQLESWWTELDPTGWAGFRFMEKLKGLKCKIKEWNEGRRVLARAKNDEILEKIDYIDRLEECNEAHEQQVEDRKKLKVELL
        WGPTPFRFEN WL    FKE    WW E    GW G +FM KL+ +K K+K WN+        + ++IL  +   D LE+      + +  R   K EL 
Subjt:  WGPTPFRFENSWLDIRDFKEQLESWWTELDPTGWAGFRFMEKLKGLKCKIKEWNEGRRVLARAKNDEILEKIDYIDRLEECNEAHEQQVEDRKKLKVELL

Query:  EITIEEQRSLNKKCKLRWLNEGDENSSFFHKWATAKKNKAFISVLEKESGEIIPREEDIEGEILSFFRKLYSKEQGPRFVLDGVNWDPIDAQCCLDLEKE
        E+ + E+    +K +++W+ EGD NS FFHK A  ++N+ FI  LE E+G ++   E I+ EIL +F KLY+   G  + ++G++W PI  +    LE  
Subjt:  EITIEEQRSLNKKCKLRWLNEGDENSSFFHKWATAKKNKAFISVLEKESGEIIPREEDIEGEILSFFRKLYSKEQGPRFVLDGVNWDPIDAQCCLDLEKE

Query:  FKEEEIYKAICDLGNLKSLGPDGMTREFYKNSWNILKKDL------------------------------------------------------------
        F EEEI+KAI  +   K+ GPDG T   +++ W ++K+DL                                                            
Subjt:  FKEEEIYKAICDLGNLKSLGPDGMTREFYKNSWNILKKDL------------------------------------------------------------

Query:  -------------VEGRQILDAILIATESVEELKSRKNSGVLLKLDLEKAYDMVDWCFLDAILSRKGFGHRWRLWIKGCLRNSNFSIIVNGRPRGKIVAS
                     V+GRQILDA+LIA E V+E +     GV+ K+D EKAYD V W FLD +L  KGFG RWR W++GCL + +F+++VNG  +G + AS
Subjt:  -------------VEGRQILDAILIATESVEELKSRKNSGVLLKLDLEKAYDMVDWCFLDAILSRKGFGHRWRLWIKGCLRNSNFSIIVNGRPRGKIVAS

Query:  RGLRQGDPLSPFLFIVVGDAVSRTVQLCIEKKILKGWKIGQNRMEVAMLQYADDTLVFCPNNEVDIAKWWDILRVILQGSGLSLNSSKTAIIGVNTDMSM
        RGLRQGDPLSPFLF +V D +SR +    E+ +L+G+K+G+NR  V+ LQ+ADDT+ F  + E D+    ++L V    SGL +N  K+ I G+N + + 
Subjt:  RGLRQGDPLSPFLFIVVGDAVSRTVQLCIEKKILKGWKIGQNRMEVAMLQYADDTLVFCPNNEVDIAKWWDILRVILQGSGLSLNSSKTAIIGVNTDMSM

Query:  LANWANHLDCKAESLPVNYLGFPLGGNYHRKMFWDPLLDKFRRKLDNWRNFHISKRGRTTLAQSVLNSLLIYYFSLLKAPKATVRAMEKTVRDFVWGGGT
        L+  A  LDCKA   P+ YLG PLGGN     FWDP++++  R+LD W+  ++S  GR TL QS L  +  Y+ SL K P +    +E+  RDF+W G  
Subjt:  LANWANHLDCKAESLPVNYLGFPLGGNYHRKMFWDPLLDKFRRKLDNWRNFHISKRGRTTLAQSVLNSLLIYYFSLLKAPKATVRAMEKTVRDFVWGGGT

Query:  YKPGVNLVKWE
             +LV W+
Subjt:  YKPGVNLVKWE

KAA0057507.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]1.4e-18530.09Show/hide
Query:  KKFSLSAEEIVIVWISDSIEDLLLAPATHKLFRKVDCNNGFIWIQKISNKRGSFL--EITKVNNSGGKHNLVVPAGVDYKGWKSFSDLLNKFLNGEDDYL
        K FS+      + WI +  +DLL    T   F +    +  +W++K  NK  + +  EI +++N G K +++VP G D  GWKSF  L+    +     +
Subjt:  KKFSLSAEEIVIVWISDSIEDLLLAPATHKLFRKVDCNNGFIWIQKISNKRGSFL--EITKVNNSGGKHNLVVPAGVDYKGWKSFSDLLNKFLNGEDDYL

Query:  EDSKQAETVVRRKGKGKSFADIVKRSSSNSIPNDPSTQITPQGSRGKERKNAPKVRRSHTGTEWKDIPVCRGCSEEVRKINWEEVIVITKRDFHDDWGRI
            + E V        S +D  ++S +  + +           R K   +    RRS +   +K   +           ++E+ ++IT+R FHDDW RI
Subjt:  EDSKQAETVVRRKGKGKSFADIVKRSSSNSIPNDPSTQITPQGSRGKERKNAPKVRRSHTGTEWKDIPVCRGCSEEVRKINWEEVIVITKRDFHDDWGRI

Query:  LEVMQQQLMEPLIINPFHPDKALIKCHSRELVDLLTKNQ---GWVSFGPIILKVEKWNKNAHSKISCIPSYGGWVKIRNLPLHLWNLQTFKAIGDNLGGF
        +  +++Q        PF  DKA++  +  +   LL  N+   GW + G   +K E W+ N HS  S IPSYGGW++ R +PLHLWN  TF+ IG   GGF
Subjt:  LEVMQQQLMEPLIINPFHPDKALIKCHSRELVDLLTKNQ---GWVSFGPIILKVEKWNKNAHSKISCIPSYGGWVKIRNLPLHLWNLQTFKAIGDNLGGF

Query:  IEYEEANSLLIECVEVKIKIKENYCGFIPAEVRIEDGE-DQYIVQIVTFQDGSLLIDRVAGIHGTFSSTVVHIFHRGPMDPLFCTAYIWRIDNGLHYPVV
        ++  +    + + ++ KIK++ NY GF+PA + I D + + +IV  V   +   L++R   +HG+F +     F +          Y +   NG      
Subjt:  IEYEEANSLLIECVEVKIKIKENYCGFIPAEVRIEDGE-DQYIVQIVTFQDGSLLIDRVAGIHGTFSSTVVHIFHRGPMDPLFCTAYIWRIDNGLHYPVV

Query:  IVPQSVDEPRICGGQLDDDKVDFEISRLRTKGK----NKAEAGPSKCAVILVRVE---AQPILVQPDGPSTMKRSKKEKRKGVSFAKEAQITLFKKGEVH
        I P    EP    G       D       T+ K    +++E  P    +   R E   A  I+   D     KRSK+   + VSF           G + 
Subjt:  IVPQSVDEPRICGGQLDDDKVDFEISRLRTKGK----NKAEAGPSKCAVILVRVE---AQPILVQPDGPSTMKRSKKEKRKGVSFAKEAQITLFKKGEVH

Query:  HTQKSPSPHLKGVTTDTWNGEQVLESDFS--FSSPTSLDGEGGKDSGLSSRKAFDCDIPEGYQLCFSNDTDQDISSIPLSVEESVELAEEAPSTSKEETI
            +   + KG + +        E  +S    + T L     KD   S+         E ++L    +T +    + LSV       +  P +  E  I
Subjt:  HTQKSPSPHLKGVTTDTWNGEQVLESDFS--FSSPTSLDGEGGKDSGLSSRKAFDCDIPEGYQLCFSNDTDQDISSIPLSVEESVELAEEAPSTSKEETI

Query:  QSPKAPPECRTIGEALHEENHFLHPKALAIIPAGKEDRADLETEKGELLLSKELILTLRRNNLCIRSIVGSSAKKGNSTKKKRSR-EVTNLFR----TWE
        QS               E NH      L        D     T+  E   +K L ++++      +S   S+A+  +   K  S  E+   F+     W 
Subjt:  QSPKAPPECRTIGEALHEENHFLHPKALAIIPAGKEDRADLETEKGELLLSKELILTLRRNNLCIRSIVGSSAKKGNSTKKKRSR-EVTNLFR----TWE

Query:  KEVEPIIEEENGNENGNVSFSEVKSHVERAETESRLAAIDSRLVKSVWSSRHIAWVVLDPVASGGGIIILWKEDRVEVVDTVLGAFSISIKVTLLDKSEG
        KE E  +  +  N+  + S+  V                       + S +++      P+   GGI++LW +   +V D  +G +SIS+ + L      
Subjt:  KEVEPIIEEENGNENGNVSFSEVKSHVERAETESRLAAIDSRLVKSVWSSRHIAWVVLDPVASGGGIIILWKEDRVEVVDTVLGAFSISIKVTLLDKSEG

Query:  WISGVYGPCGYQEKLHFLQELYDIHGLCQGVWCLVGDFNIIRWVEERLSRRNPLRSMKKFNRFISSADLIDLPMLNGLFTWSRLGVRVAASRLDRFLLSK
        W++ VYGP  Y ++     EL  +  LC   W + GDFNI+RW  E  ++    R+M  FN FIS  +LID P LN  FTWS L V    SRLDRFLLSK
Subjt:  WISGVYGPCGYQEKLHFLQELYDIHGLCQGVWCLVGDFNIIRWVEERLSRRNPLRSMKKFNRFISSADLIDLPMLNGLFTWSRLGVRVAASRLDRFLLSK

Query:  QWIDKFSDFRVLRLKRPTSDHFPIMLSIGAVKWGPTPFRFENSWLDIRDFKEQLESWWTELDPTGWAGFRFMEKLKGLKCKIKEWNEGRRVLARAKNDEI
         W + F       L+R  SDHFPI+L    +KWGP PFR  NS L  ++F++   +WW      G+ G+ F++ L  L   IKEW   +  L  A    +
Subjt:  QWIDKFSDFRVLRLKRPTSDHFPIMLSIGAVKWGPTPFRFENSWLDIRDFKEQLESWWTELDPTGWAGFRFMEKLKGLKCKIKEWNEGRRVLARAKNDEI

Query:  LEKIDYIDRLEECNEAHEQQVEDRKKLKVELLEITIEEQRSLNKKCKLRWLNEGDENSSFFHKWATAKKNKAFISVLEKESGEIIPREEDIEGEILSFFR
        L++ID ID+LE   E      + R  LK +LL I   + +  +++ + RW   GDEN+S+FH+  T  + K  I  +   +G  +   +DI    +S F+
Subjt:  LEKIDYIDRLEECNEAHEQQVEDRKKLKVELLEITIEEQRSLNKKCKLRWLNEGDENSSFFHKWATAKKNKAFISVLEKESGEIIPREEDIEGEILSFFR

Query:  KLYSKEQGPRFVLDGVNWDPIDAQCCLDLEKEFKEEEIYKAICDLGNLKSLGPDGMTREFYKNSWNILKKDL----------------------------
         +Y+KE     ++D ++W+PI   C  +L K F E EI   I    N K+ GPDG T  FYK  W  LK DL                            
Subjt:  KLYSKEQGPRFVLDGVNWDPIDAQCCLDLEKEFKEEEIYKAICDLGNLKSLGPDGMTREFYKNSWNILKKDL----------------------------

Query:  ---------------------------------------------VEGRQILDAILIATESVEELKSRKNSGVLLKLDLEKAYDMVDWCFLDAILSRKGF
                                                     ++GRQI DAILIA E+++  K RK  G +LKLD+EKA+D + W F+D +L++K F
Subjt:  ---------------------------------------------VEGRQILDAILIATESVEELKSRKNSGVLLKLDLEKAYDMVDWCFLDAILSRKGF

Query:  GHRWRLWIKGCLRNSNFSIIVNGRPRGKIVASRGLRQGDPLSPFLFIVVGDAVSRTVQLCIEKKILKGWKIGQNRMEVAMLQYADDTLVFCPNNEVDIAK
         H+WR WIK C+ N  +SI++NG P+G+I A RG+RQGDPLSPF+F++  D +SR +     K  +KG     N   ++ L +ADD L+F  +NE  +  
Subjt:  GHRWRLWIKGCLRNSNFSIIVNGRPRGKIVASRGLRQGDPLSPFLFIVVGDAVSRTVQLCIEKKILKGWKIGQNRMEVAMLQYADDTLVFCPNNEVDIAK

Query:  WWDILRVILQGSGLSLNSSKTAIIGVNTDMSMLANWANHLDCKAESLPVNYLGFPLGGNYHRKMFWDPLLDKFRRKLDNWRNFHISKRGRTTLAQSVLNS
            L +  + SGL+ N+SK+ I  +N         A+    + + LPVNYLG PLGGN   + FWD  ++   +KL+ W+   ISK GR TL ++ L+S
Subjt:  WWDILRVILQGSGLSLNSSKTAIIGVNTDMSMLANWANHLDCKAESLPVNYLGFPLGGNYHRKMFWDPLLDKFRRKLDNWRNFHISKRGRTTLAQSVLNS

Query:  LLIYYFSLLKAPKATVRAMEKTVRDFVWGGGTYKPGVNLVKW
        L  Y  S  KAP +  + +EK  RDF+WGG   K   +L+ W
Subjt:  LLIYYFSLLKAPKATVRAMEKTVRDFVWGGGTYKPGVNLVKW

RVW20303.1 Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera]6.5e-17040.39Show/hide
Query:  ESRLAAIDSRLVKSVWSSRHIAWVVLDPVASGGGIIILWKEDRVEVVDTVLGAFSISIKVTLLDKSEGWISGVYGPCGYQEKLHFLQELYDIHGLCQGVW
        E++ A  D R V SVW++R+  W VL    + GGI+++W   ++   + VLG+FS+S+K  +    + W+S +YGP     +  F +EL DI GL    W
Subjt:  ESRLAAIDSRLVKSVWSSRHIAWVVLDPVASGGGIIILWKEDRVEVVDTVLGAFSISIKVTLLDKSEGWISGVYGPCGYQEKLHFLQELYDIHGLCQGVW

Query:  CLVGDFNIIRWVEERLSRRNPLRSMKKFNRFISSADLIDLPMLNGLFTWSRLGVRVAASRLDRFLLSKQWIDKFSDFRVLRLKRPTSDHFPIMLSIGAVK
        C+ GDFN+IR   E+L       SMK  + FI   +LID P+ +  FTWS +       RLDRFL S +W   F       L R TSDH+PI+L     K
Subjt:  CLVGDFNIIRWVEERLSRRNPLRSMKKFNRFISSADLIDLPMLNGLFTWSRLGVRVAASRLDRFLLSKQWIDKFSDFRVLRLKRPTSDHFPIMLSIGAVK

Query:  WGPTPFRFENSWLDIRDFKEQLESWWTELDPTGWAGFRFMEKLKGLKCKIKEWNEGRRVLARAKNDEILEKIDYIDRLEECNEAHEQQVEDRKKLKVELL
        WGPTPFRFEN WL    FKE   SWW E    GW G +FM KL+ LK K+KEWN+        +   IL  I   D +E+      + +  R   K EL 
Subjt:  WGPTPFRFENSWLDIRDFKEQLESWWTELDPTGWAGFRFMEKLKGLKCKIKEWNEGRRVLARAKNDEILEKIDYIDRLEECNEAHEQQVEDRKKLKVELL

Query:  EITIEEQRSLNKKCKLRWLNEGDENSSFFHKWATAKKNKAFISVLEKESGEIIPREEDIEGEILSFFRKLYSKEQGPRFVLDGVNWDPIDAQCCLDLEKE
        E+ + E+    +K +++W+ EGD NS FFHK A  ++N+ FI VLE E G ++   + I+ EIL +F KLY+   G  + ++G++W PI ++    LE  
Subjt:  EITIEEQRSLNKKCKLRWLNEGDENSSFFHKWATAKKNKAFISVLEKESGEIIPREEDIEGEILSFFRKLYSKEQGPRFVLDGVNWDPIDAQCCLDLEKE

Query:  FKEEEIYKAICDLGNLKSLGPDGMTREFYKNSWNILKKDL---------------------------VEGRQILDAILIATESVEELKSRKNSGVLLKLD
        F EEEI KAI  +   K+ GPDG T   +++ W+++K+DL                           V+GRQILDA+LIA E V+E K     GV+ K+D
Subjt:  FKEEEIYKAICDLGNLKSLGPDGMTREFYKNSWNILKKDL---------------------------VEGRQILDAILIATESVEELKSRKNSGVLLKLD

Query:  LEKAYDMVDWCFLDAILSRKGFGHRWRLWIKGCLRNSNFSIIVNGRPRGKIVASRGLRQGDPLSPFLFIVVGDAVSRTVQLCIEKKILKGWKIGQNRMEV
         EKAYD V W FLD ++ +KGF  RWR WI+GCL + +F+I+VNG  +G + ASRGLRQGDPLSPFLF +V D +SR +    E+ + +G+++G+NR  V
Subjt:  LEKAYDMVDWCFLDAILSRKGFGHRWRLWIKGCLRNSNFSIIVNGRPRGKIVASRGLRQGDPLSPFLFIVVGDAVSRTVQLCIEKKILKGWKIGQNRMEV

Query:  AMLQYADDTLVFCPNNEVDIAKWWDILRVILQGSGLSLNSSKTAIIGVNTDMSMLANWANHLDCKAESLPVNYLGFPLGGNYHRKMFWDPLLDKFRRKLD
        + LQ+ADDT+ F    E D+     +L V    SGL +N  K+ I G+N     L   A  LDCKA   P+ YLG PLGGN     FWDP++++   +LD
Subjt:  AMLQYADDTLVFCPNNEVDIAKWWDILRVILQGSGLSLNSSKTAIIGVNTDMSMLANWANHLDCKAESLPVNYLGFPLGGNYHRKMFWDPLLDKFRRKLD

Query:  NWRNFHISKRGRTTLAQSVLNSLLIYYFSLLKAPKATVRAMEKTVRDFVWGGGTYKPGVNLVKWE
         W+  ++S  GR TL +S L  +  Y+ SL + P +    +E+  RDF+W G       +LV WE
Subjt:  NWRNFHISKRGRTTLAQSVLNSLLIYYFSLLKAPKATVRAMEKTVRDFVWGGGTYKPGVNLVKWE

TYJ99315.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]1.4e-18830.4Show/hide
Query:  LWFDRNYKGRVIRIQEDHLRKKFSLSAEEIVIVWISDSIEDLLLAPATHKLFRKVDCNNGFIWIQKISNKRGSFLEITKVNNSGGKHNLVVPAGVDYKGW
        L  D+  K     + E    K FS+      + WI  +++ L+  P T++ F +   +   IWI+K  N +G   EI +V+    K  ++VP G D  GW
Subjt:  LWFDRNYKGRVIRIQEDHLRKKFSLSAEEIVIVWISDSIEDLLLAPATHKLFRKVDCNNGFIWIQKISNKRGSFLEITKVNNSGGKHNLVVPAGVDYKGW

Query:  KSFSDLLNKFLNGEDDYLEDSKQAETVVRRKGKGKSFA---DIVKRSSSNSIPNDPSTQITPQGSRGKERKNAPKVRRSHTGTEWKDIPVCRGCSEEVRK
         SF  ++   +  +      +K   T + R       +   D  KRS + ++  +     T   S   +  ++     SH+ +       C   S ++  
Subjt:  KSFSDLLNKFLNGEDDYLEDSKQAETVVRRKGKGKSFA---DIVKRSSSNSIPNDPSTQITPQGSRGKERKNAPKVRRSHTGTEWKDIPVCRGCSEEVRK

Query:  INWEEVIVITKRDFHDDWGRILEVMQQQLMEPLIINPFHPDKALIKCHSRELVDLLTKNQGWVSFGPIILKVEKWNKNAHSKISCIPSYGGWVKIRNLPL
           E  +VI +R FHDDW +IL+ +++Q  E    N FH +KAL+   S    +LL +N+GW + G   ++ EKW+   H+    IPSYGGW   R +PL
Subjt:  INWEEVIVITKRDFHDDWGRILEVMQQQLMEPLIINPFHPDKALIKCHSRELVDLLTKNQGWVSFGPIILKVEKWNKNAHSKISCIPSYGGWVKIRNLPL

Query:  HLWNLQTFKAIGDNLGGFIEYEEANSLLIECVEVKIKIKENYCGFIPAEVRIEDGE-DQYIVQIVTFQDGSLLIDRVAGIHGTFSSTVV-----------
        HLWN+ TF+ IG    G I+  E        +E +IK++ NY GF+PA VRI D E +++ VQ+VT  +G  LI+R   +HGTF                
Subjt:  HLWNLQTFKAIGDNLGGFIEYEEANSLLIECVEVKIKIKENYCGFIPAEVRIEDGE-DQYIVQIVTFQDGSLLIDRVAGIHGTFSSTVV-----------

Query:  HIFHRG--PMDPLF-CTAYIWRIDNGLHYP-----VVIVPQSVDEPRICGGQLDDDKV-DFEISRLRTKGKNKAEAGPSKCAVILVRVEAQPILVQPDGP
          F  G   + P F  T+   R  +    P     V+I P   D        L+++ V D  +     K K +  +G S   V+    +   I +QP+  
Subjt:  HIFHRG--PMDPLF-CTAYIWRIDNGLHYP-----VVIVPQSVDEPRICGGQLDDDKV-DFEISRLRTKGKNKAEAGPSKCAVILVRVEAQPILVQPDGP

Query:  STMKRSKKEKRKGVSFAKEAQITLFKKGEVHHTQKSPSPHLKGVTTDTWNGEQVLESDF-SFSSPTSLDGEGGKDSGLSSRKAFDCDIPEGYQLCFSNDT
          + +SK++    VSF   +      K  + +   +P+ H   + +     +   E      SS T  + +  ++ G+         I +  Q+  ++D 
Subjt:  STMKRSKKEKRKGVSFAKEAQITLFKKGEVHHTQKSPSPHLKGVTTDTWNGEQVLESDF-SFSSPTSLDGEGGKDSGLSSRKAFDCDIPEGYQLCFSNDT

Query:  DQDISSIPLSVEESVELAEEAPSTSKEETIQSPKAPPECRTIGEALHEENHFLHPKALAIIPAGKEDRADLETEKGELLLSKELILTLRRNNLCIRSIVG
        D     + L+V+   +L    P+ S E+   S  A     T  E + E                                + E+ + +  N+        
Subjt:  DQDISSIPLSVEESVELAEEAPSTSKEETIQSPKAPPECRTIGEALHEENHFLHPKALAIIPAGKEDRADLETEKGELLLSKELILTLRRNNLCIRSIVG

Query:  SSAKKGNSTKKKRSREVTNLFR-TWEKEVEPIIE----------EENG---NENGNVSFSEVKSHVERAETESRLAAIDSRLVKSVWSSRHIAWVVLDPV
        +S+ + N  K K   +    +R   EKE +P  E          ++NG   + + + S +   ++V   +  S L   + R++KS+W S  I W+  +  
Subjt:  SSAKKGNSTKKKRSREVTNLFR-TWEKEVEPIIE----------EENG---NENGNVSFSEVKSHVERAETESRLAAIDSRLVKSVWSSRHIAWVVLDPV

Query:  ASGGGIIILWKEDRVEVVDTVLGAFSISIKVTLLDKSEGWISGVYGPCGYQEKLHFLQELYDIHGLCQGVWCLVGDFNIIRWVEERLSRRNPLRSMKKFN
         S GGI+ILW      ++    G FS+S    L + S  W++G+YGP   +E++HF  EL+++  L    W L GD N+IR  EE  S  +   + +  N
Subjt:  ASGGGIIILWKEDRVEVVDTVLGAFSISIKVTLLDKSEGWISGVYGPCGYQEKLHFLQELYDIHGLCQGVWCLVGDFNIIRWVEERLSRRNPLRSMKKFN

Query:  RFISSADLIDLPMLNGLFTWSRLGVRVAASRLDRFLLSKQWIDKFSDFRVLRLKRPTSDHFPIML--SIGAVKWGPTPFRFENSWLDIRDFKEQLESWWT
         FIS+  LID P+ N  FTWS L      SR+DRFL +  W + FS      L R TSDHFP++   S   + WGP PFR  +  L   +FK  +  WW 
Subjt:  RFISSADLIDLPMLNGLFTWSRLGVRVAASRLDRFLLSKQWIDKFSDFRVLRLKRPTSDHFPIML--SIGAVKWGPTPFRFENSWLDIRDFKEQLESWWT

Query:  ELDPTGWAGFRFMEKLKGLKCKIKEWNEGRRVLARAKNDEILEKIDYIDRLEECNEAHEQQVEDRKKLKVELLEITIEEQRSLNKKCKLRWLNEGDENSS
             G+ GF F+++LK L   IK W + +        + I+ ++D ID+ E      +++   R  LK +L E++++E +   ++ K  WL EGDENSS
Subjt:  ELDPTGWAGFRFMEKLKGLKCKIKEWNEGRRVLARAKNDEILEKIDYIDRLEECNEAHEQQVEDRKKLKVELLEITIEEQRSLNKKCKLRWLNEGDENSS

Query:  FFHKWATAKKNKAFISVLEKESGEIIPREEDIEGEILSFFRKLY--SKEQGPRFVLDGVNWDPIDAQCCLDLEKEFKEEEIYKAICDLGNLKSLGPDGMT
        FFH+  ++++ ++FI  ++ E G I      I    + FF ++Y  S +  P F+ + ++W+PI +     L   F E EI   I      K+ GPDG  
Subjt:  FFHKWATAKKNKAFISVLEKESGEIIPREEDIEGEILSFFRKLY--SKEQGPRFVLDGVNWDPIDAQCCLDLEKEFKEEEIYKAICDLGNLKSLGPDGMT

Query:  REFYKNSWNILKKDL-----------VEGRQILDAILIATESVEELKSRKNSGVLLKLDLEKAYDMVDWCFLDAILSRKGFGHRWRLWIKGCLRNSNFSI
          F+K+ W  LK  L           V+ RQI DAIL+A E+V+  K +K  G +LKLD+EKA+D ++  F+D +L +K F + WR WI+GC+ N  +S+
Subjt:  REFYKNSWNILKKDL-----------VEGRQILDAILIATESVEELKSRKNSGVLLKLDLEKAYDMVDWCFLDAILSRKGFGHRWRLWIKGCLRNSNFSI

Query:  IVNGRPRGKIVASRGLRQGDPLSPFLFIVVGDAVSRTVQLCIEKKILKGWKIGQNRMEVAMLQYADDTLVFCPNNEVDIAKWWDILRVILQGSGLSLNSS
        I+NGRP+G+I A+RGLRQGDPLSPFLF++  D +SR +        +KG  +  N   ++ + +ADD L+F  +N+  +      L +  + SGL +N  
Subjt:  IVNGRPRGKIVASRGLRQGDPLSPFLFIVVGDAVSRTVQLCIEKKILKGWKIGQNRMEVAMLQYADDTLVFCPNNEVDIAKWWDILRVILQGSGLSLNSS

Query:  KTAIIGVNTDMSMLANWANHLDCKAESLPVNYLGFPLGGNYHRKMFWDPLLDKFRRKLDNWRNFHISKRGRTTLAQSVLNSLLIYYFSLLKAPKATVRAM
        K+A++ VN  +      A+       SLP++YLG PLGGN    +FW  + DK ++KL+NW+   ISK GR TL +S L+SL IY  S+ +AP  T + +
Subjt:  KTAIIGVNTDMSMLANWANHLDCKAESLPVNYLGFPLGGNYHRKMFWDPLLDKFRRKLDNWRNFHISKRGRTTLAQSVLNSLLIYYFSLLKAPKATVRAM

Query:  EKTVRDFVWGGGTYKPGVNLVKW
        EK  R F+W G     G +L+ W
Subjt:  EKTVRDFVWGGGTYKPGVNLVKW

TYK08190.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]2.5e-18530.09Show/hide
Query:  KKFSLSAEEIVIVWISDSIEDLLLAPATHKLFRKVDCNNGFIWIQKISNKRGSFL--EITKVNNSGGKHNLVVPAGVDYKGWKSFSDLLNKFLNGEDDYL
        K FS+      + WI +  +DLL    T   F +    +  +W++K  NK  + +  EI +++N G K +++VP G D  GWKSF  L+    +     +
Subjt:  KKFSLSAEEIVIVWISDSIEDLLLAPATHKLFRKVDCNNGFIWIQKISNKRGSFL--EITKVNNSGGKHNLVVPAGVDYKGWKSFSDLLNKFLNGEDDYL

Query:  EDSKQAETVVRRKGKGKSFADIVKRSSSNSIPNDPSTQITPQGSRGKERKNAPKVRRSHTGTEWKDIPVCRGCSEEVRKINWEEVIVITKRDFHDDWGRI
            + E V        S +D  ++S +  + +           R K   +    RRS +   +K   +           ++E+ ++IT+R FHDDW RI
Subjt:  EDSKQAETVVRRKGKGKSFADIVKRSSSNSIPNDPSTQITPQGSRGKERKNAPKVRRSHTGTEWKDIPVCRGCSEEVRKINWEEVIVITKRDFHDDWGRI

Query:  LEVMQQQLMEPLIINPFHPDKALIKCHSRELVDLLTKNQ---GWVSFGPIILKVEKWNKNAHSKISCIPSYGGWVKIRNLPLHLWNLQTFKAIGDNLGGF
        +  +++Q        PF  DKA++  +  +   LL  N+   GW + G   +K E W+ N HS  S IPSYGGW++ R +PLHLWN  TF+ IG   GGF
Subjt:  LEVMQQQLMEPLIINPFHPDKALIKCHSRELVDLLTKNQ---GWVSFGPIILKVEKWNKNAHSKISCIPSYGGWVKIRNLPLHLWNLQTFKAIGDNLGGF

Query:  IEYEEANSLLIECVEVKIKIKENYCGFIPAEVRIEDGE-DQYIVQIVTFQDGSLLIDRVAGIHGTFSSTVVHIFHRGPMDPLFCTAYIWRIDNGLHYPVV
        ++  +    + + ++ KIK++ NY GF+PA + I D + + +IV  V   +   L++R   +HG+F +     F +          Y +   NG      
Subjt:  IEYEEANSLLIECVEVKIKIKENYCGFIPAEVRIEDGE-DQYIVQIVTFQDGSLLIDRVAGIHGTFSSTVVHIFHRGPMDPLFCTAYIWRIDNGLHYPVV

Query:  IVPQSVDEPRICGGQLDDDKVDFEISRLRTKGK----NKAEAGPSKCAVILVRVE---AQPILVQPDGPSTMKRSKKEKRKGVSFAKEAQITLFKKGEVH
        I P    EP    G       D       T+ K    +++E  P    +   R E   A  I+   D     KRSK+   + VSF           G + 
Subjt:  IVPQSVDEPRICGGQLDDDKVDFEISRLRTKGK----NKAEAGPSKCAVILVRVE---AQPILVQPDGPSTMKRSKKEKRKGVSFAKEAQITLFKKGEVH

Query:  HTQKSPSPHLKGVTTDTWNGEQVLESDFS--FSSPTSLDGEGGKDSGLSSRKAFDCDIPEGYQLCFSNDTDQDISSIPLSVEESVELAEEAPSTSKEETI
            +   + KG + +        E  +S    + T L     KD   S+         E ++L    +T +    + LSV       +  P +  E  I
Subjt:  HTQKSPSPHLKGVTTDTWNGEQVLESDFS--FSSPTSLDGEGGKDSGLSSRKAFDCDIPEGYQLCFSNDTDQDISSIPLSVEESVELAEEAPSTSKEETI

Query:  QSPKAPPECRTIGEALHEENHFLHPKALAIIPAGKEDRADLETEKGELLLSKELILTLRRNNLCIRSIVGSSAKKGNSTKKKRSR-EVTNLFR----TWE
        QS               E NH      L        D     T+  E   +K L ++++      +S   S+A+  +   K  S  E+   F+     W 
Subjt:  QSPKAPPECRTIGEALHEENHFLHPKALAIIPAGKEDRADLETEKGELLLSKELILTLRRNNLCIRSIVGSSAKKGNSTKKKRSR-EVTNLFR----TWE

Query:  KEVEPIIEEENGNENGNVSFSEVKSHVERAETESRLAAIDSRLVKSVWSSRHIAWVVLDPVASGGGIIILWKEDRVEVVDTVLGAFSISIKVTLLDKSEG
        KE E  +  +  N+  + S+  V                       + S +++      P+   GGI++LW +   +V D  +G +SIS+ + L      
Subjt:  KEVEPIIEEENGNENGNVSFSEVKSHVERAETESRLAAIDSRLVKSVWSSRHIAWVVLDPVASGGGIIILWKEDRVEVVDTVLGAFSISIKVTLLDKSEG

Query:  WISGVYGPCGYQEKLHFLQELYDIHGLCQGVWCLVGDFNIIRWVEERLSRRNPLRSMKKFNRFISSADLIDLPMLNGLFTWSRLGVRVAASRLDRFLLSK
        W++ VYGP  Y ++     EL  +  LC   W + GDFNI+RW  E  ++    R+M  FN FIS  +LID P LN  FTWS L V    SRLDRFLLSK
Subjt:  WISGVYGPCGYQEKLHFLQELYDIHGLCQGVWCLVGDFNIIRWVEERLSRRNPLRSMKKFNRFISSADLIDLPMLNGLFTWSRLGVRVAASRLDRFLLSK

Query:  QWIDKFSDFRVLRLKRPTSDHFPIMLSIGAVKWGPTPFRFENSWLDIRDFKEQLESWWTELDPTGWAGFRFMEKLKGLKCKIKEWNEGRRVLARAKNDEI
         W + F       L+R  SDHFPI+L    +KWGP PFR  NS L  ++F++   +WW      G+ G+ F++ L  L   IKEW   +  L  A    +
Subjt:  QWIDKFSDFRVLRLKRPTSDHFPIMLSIGAVKWGPTPFRFENSWLDIRDFKEQLESWWTELDPTGWAGFRFMEKLKGLKCKIKEWNEGRRVLARAKNDEI

Query:  LEKIDYIDRLEECNEAHEQQVEDRKKLKVELLEITIEEQRSLNKKCKLRWLNEGDENSSFFHKWATAKKNKAFISVLEKESGEIIPREEDIEGEILSFFR
        L++ID ID+LE   E      + R  LK +LL I   + +  +++ + RW   GDEN+S+FH+  T  + K  I  +   +G  +   +DI    +S F+
Subjt:  LEKIDYIDRLEECNEAHEQQVEDRKKLKVELLEITIEEQRSLNKKCKLRWLNEGDENSSFFHKWATAKKNKAFISVLEKESGEIIPREEDIEGEILSFFR

Query:  KLYSKEQGPRFVLDGVNWDPIDAQCCLDLEKEFKEEEIYKAICDLGNLKSLGPDGMTREFYKNSWNILKKDL----------------------------
         +Y+KE     ++D ++W+PI   C  +L K F E EI   I    N K+ GPDG T  FYK  W  LK DL                            
Subjt:  KLYSKEQGPRFVLDGVNWDPIDAQCCLDLEKEFKEEEIYKAICDLGNLKSLGPDGMTREFYKNSWNILKKDL----------------------------

Query:  ---------------------------------------------VEGRQILDAILIATESVEELKSRKNSGVLLKLDLEKAYDMVDWCFLDAILSRKGF
                                                     ++GRQI DAILIA E ++  K RK  G +LKLD+EKA+D + W F+D +L++K F
Subjt:  ---------------------------------------------VEGRQILDAILIATESVEELKSRKNSGVLLKLDLEKAYDMVDWCFLDAILSRKGF

Query:  GHRWRLWIKGCLRNSNFSIIVNGRPRGKIVASRGLRQGDPLSPFLFIVVGDAVSRTVQLCIEKKILKGWKIGQNRMEVAMLQYADDTLVFCPNNEVDIAK
         H+WR WIK C+ N  +SI++NG P+G+I A RG+RQGDPLSPF+F++  D +SR +     K  +KG     N   ++ L +ADD L+F  +NE  +  
Subjt:  GHRWRLWIKGCLRNSNFSIIVNGRPRGKIVASRGLRQGDPLSPFLFIVVGDAVSRTVQLCIEKKILKGWKIGQNRMEVAMLQYADDTLVFCPNNEVDIAK

Query:  WWDILRVILQGSGLSLNSSKTAIIGVNTDMSMLANWANHLDCKAESLPVNYLGFPLGGNYHRKMFWDPLLDKFRRKLDNWRNFHISKRGRTTLAQSVLNS
            L +  + SGL+ N+SK+ I  +N         A+    + + LPVNYLG PLGGN   + FWD  ++   +KL+ W+   ISK GR TL ++ L+S
Subjt:  WWDILRVILQGSGLSLNSSKTAIIGVNTDMSMLANWANHLDCKAESLPVNYLGFPLGGNYHRKMFWDPLLDKFRRKLDNWRNFHISKRGRTTLAQSVLNS

Query:  LLIYYFSLLKAPKATVRAMEKTVRDFVWGGGTYKPGVNLVKW
        L  Y  S  KAP +  + +EK  RDF+WGG   K   +L+ W
Subjt:  LLIYYFSLLKAPKATVRAMEKTVRDFVWGGGTYKPGVNLVKW

TrEMBL top hitse value%identityAlignment
A0A438CAL5 Transposon TX1 uncharacterized 149 kDa protein3.1e-17040.39Show/hide
Query:  ESRLAAIDSRLVKSVWSSRHIAWVVLDPVASGGGIIILWKEDRVEVVDTVLGAFSISIKVTLLDKSEGWISGVYGPCGYQEKLHFLQELYDIHGLCQGVW
        E++ A  D R V SVW++R+  W VL    + GGI+++W   ++   + VLG+FS+S+K  +    + W+S +YGP     +  F +EL DI GL    W
Subjt:  ESRLAAIDSRLVKSVWSSRHIAWVVLDPVASGGGIIILWKEDRVEVVDTVLGAFSISIKVTLLDKSEGWISGVYGPCGYQEKLHFLQELYDIHGLCQGVW

Query:  CLVGDFNIIRWVEERLSRRNPLRSMKKFNRFISSADLIDLPMLNGLFTWSRLGVRVAASRLDRFLLSKQWIDKFSDFRVLRLKRPTSDHFPIMLSIGAVK
        C+ GDFN+IR   E+L       SMK  + FI   +LID P+ +  FTWS +       RLDRFL S +W   F       L R TSDH+PI+L     K
Subjt:  CLVGDFNIIRWVEERLSRRNPLRSMKKFNRFISSADLIDLPMLNGLFTWSRLGVRVAASRLDRFLLSKQWIDKFSDFRVLRLKRPTSDHFPIMLSIGAVK

Query:  WGPTPFRFENSWLDIRDFKEQLESWWTELDPTGWAGFRFMEKLKGLKCKIKEWNEGRRVLARAKNDEILEKIDYIDRLEECNEAHEQQVEDRKKLKVELL
        WGPTPFRFEN WL    FKE   SWW E    GW G +FM KL+ LK K+KEWN+        +   IL  I   D +E+      + +  R   K EL 
Subjt:  WGPTPFRFENSWLDIRDFKEQLESWWTELDPTGWAGFRFMEKLKGLKCKIKEWNEGRRVLARAKNDEILEKIDYIDRLEECNEAHEQQVEDRKKLKVELL

Query:  EITIEEQRSLNKKCKLRWLNEGDENSSFFHKWATAKKNKAFISVLEKESGEIIPREEDIEGEILSFFRKLYSKEQGPRFVLDGVNWDPIDAQCCLDLEKE
        E+ + E+    +K +++W+ EGD NS FFHK A  ++N+ FI VLE E G ++   + I+ EIL +F KLY+   G  + ++G++W PI ++    LE  
Subjt:  EITIEEQRSLNKKCKLRWLNEGDENSSFFHKWATAKKNKAFISVLEKESGEIIPREEDIEGEILSFFRKLYSKEQGPRFVLDGVNWDPIDAQCCLDLEKE

Query:  FKEEEIYKAICDLGNLKSLGPDGMTREFYKNSWNILKKDL---------------------------VEGRQILDAILIATESVEELKSRKNSGVLLKLD
        F EEEI KAI  +   K+ GPDG T   +++ W+++K+DL                           V+GRQILDA+LIA E V+E K     GV+ K+D
Subjt:  FKEEEIYKAICDLGNLKSLGPDGMTREFYKNSWNILKKDL---------------------------VEGRQILDAILIATESVEELKSRKNSGVLLKLD

Query:  LEKAYDMVDWCFLDAILSRKGFGHRWRLWIKGCLRNSNFSIIVNGRPRGKIVASRGLRQGDPLSPFLFIVVGDAVSRTVQLCIEKKILKGWKIGQNRMEV
         EKAYD V W FLD ++ +KGF  RWR WI+GCL + +F+I+VNG  +G + ASRGLRQGDPLSPFLF +V D +SR +    E+ + +G+++G+NR  V
Subjt:  LEKAYDMVDWCFLDAILSRKGFGHRWRLWIKGCLRNSNFSIIVNGRPRGKIVASRGLRQGDPLSPFLFIVVGDAVSRTVQLCIEKKILKGWKIGQNRMEV

Query:  AMLQYADDTLVFCPNNEVDIAKWWDILRVILQGSGLSLNSSKTAIIGVNTDMSMLANWANHLDCKAESLPVNYLGFPLGGNYHRKMFWDPLLDKFRRKLD
        + LQ+ADDT+ F    E D+     +L V    SGL +N  K+ I G+N     L   A  LDCKA   P+ YLG PLGGN     FWDP++++   +LD
Subjt:  AMLQYADDTLVFCPNNEVDIAKWWDILRVILQGSGLSLNSSKTAIIGVNTDMSMLANWANHLDCKAESLPVNYLGFPLGGNYHRKMFWDPLLDKFRRKLD

Query:  NWRNFHISKRGRTTLAQSVLNSLLIYYFSLLKAPKATVRAMEKTVRDFVWGGGTYKPGVNLVKWE
         W+  ++S  GR TL +S L  +  Y+ SL + P +    +E+  RDF+W G       +LV WE
Subjt:  NWRNFHISKRGRTTLAQSVLNSLLIYYFSLLKAPKATVRAMEKTVRDFVWGGGTYKPGVNLVKWE

A0A5D3BLV7 LINE-1 retrotransposable element ORF2 protein6.7e-18930.4Show/hide
Query:  LWFDRNYKGRVIRIQEDHLRKKFSLSAEEIVIVWISDSIEDLLLAPATHKLFRKVDCNNGFIWIQKISNKRGSFLEITKVNNSGGKHNLVVPAGVDYKGW
        L  D+  K     + E    K FS+      + WI  +++ L+  P T++ F +   +   IWI+K  N +G   EI +V+    K  ++VP G D  GW
Subjt:  LWFDRNYKGRVIRIQEDHLRKKFSLSAEEIVIVWISDSIEDLLLAPATHKLFRKVDCNNGFIWIQKISNKRGSFLEITKVNNSGGKHNLVVPAGVDYKGW

Query:  KSFSDLLNKFLNGEDDYLEDSKQAETVVRRKGKGKSFA---DIVKRSSSNSIPNDPSTQITPQGSRGKERKNAPKVRRSHTGTEWKDIPVCRGCSEEVRK
         SF  ++   +  +      +K   T + R       +   D  KRS + ++  +     T   S   +  ++     SH+ +       C   S ++  
Subjt:  KSFSDLLNKFLNGEDDYLEDSKQAETVVRRKGKGKSFA---DIVKRSSSNSIPNDPSTQITPQGSRGKERKNAPKVRRSHTGTEWKDIPVCRGCSEEVRK

Query:  INWEEVIVITKRDFHDDWGRILEVMQQQLMEPLIINPFHPDKALIKCHSRELVDLLTKNQGWVSFGPIILKVEKWNKNAHSKISCIPSYGGWVKIRNLPL
           E  +VI +R FHDDW +IL+ +++Q  E    N FH +KAL+   S    +LL +N+GW + G   ++ EKW+   H+    IPSYGGW   R +PL
Subjt:  INWEEVIVITKRDFHDDWGRILEVMQQQLMEPLIINPFHPDKALIKCHSRELVDLLTKNQGWVSFGPIILKVEKWNKNAHSKISCIPSYGGWVKIRNLPL

Query:  HLWNLQTFKAIGDNLGGFIEYEEANSLLIECVEVKIKIKENYCGFIPAEVRIEDGE-DQYIVQIVTFQDGSLLIDRVAGIHGTFSSTVV-----------
        HLWN+ TF+ IG    G I+  E        +E +IK++ NY GF+PA VRI D E +++ VQ+VT  +G  LI+R   +HGTF                
Subjt:  HLWNLQTFKAIGDNLGGFIEYEEANSLLIECVEVKIKIKENYCGFIPAEVRIEDGE-DQYIVQIVTFQDGSLLIDRVAGIHGTFSSTVV-----------

Query:  HIFHRG--PMDPLF-CTAYIWRIDNGLHYP-----VVIVPQSVDEPRICGGQLDDDKV-DFEISRLRTKGKNKAEAGPSKCAVILVRVEAQPILVQPDGP
          F  G   + P F  T+   R  +    P     V+I P   D        L+++ V D  +     K K +  +G S   V+    +   I +QP+  
Subjt:  HIFHRG--PMDPLF-CTAYIWRIDNGLHYP-----VVIVPQSVDEPRICGGQLDDDKV-DFEISRLRTKGKNKAEAGPSKCAVILVRVEAQPILVQPDGP

Query:  STMKRSKKEKRKGVSFAKEAQITLFKKGEVHHTQKSPSPHLKGVTTDTWNGEQVLESDF-SFSSPTSLDGEGGKDSGLSSRKAFDCDIPEGYQLCFSNDT
          + +SK++    VSF   +      K  + +   +P+ H   + +     +   E      SS T  + +  ++ G+         I +  Q+  ++D 
Subjt:  STMKRSKKEKRKGVSFAKEAQITLFKKGEVHHTQKSPSPHLKGVTTDTWNGEQVLESDF-SFSSPTSLDGEGGKDSGLSSRKAFDCDIPEGYQLCFSNDT

Query:  DQDISSIPLSVEESVELAEEAPSTSKEETIQSPKAPPECRTIGEALHEENHFLHPKALAIIPAGKEDRADLETEKGELLLSKELILTLRRNNLCIRSIVG
        D     + L+V+   +L    P+ S E+   S  A     T  E + E                                + E+ + +  N+        
Subjt:  DQDISSIPLSVEESVELAEEAPSTSKEETIQSPKAPPECRTIGEALHEENHFLHPKALAIIPAGKEDRADLETEKGELLLSKELILTLRRNNLCIRSIVG

Query:  SSAKKGNSTKKKRSREVTNLFR-TWEKEVEPIIE----------EENG---NENGNVSFSEVKSHVERAETESRLAAIDSRLVKSVWSSRHIAWVVLDPV
        +S+ + N  K K   +    +R   EKE +P  E          ++NG   + + + S +   ++V   +  S L   + R++KS+W S  I W+  +  
Subjt:  SSAKKGNSTKKKRSREVTNLFR-TWEKEVEPIIE----------EENG---NENGNVSFSEVKSHVERAETESRLAAIDSRLVKSVWSSRHIAWVVLDPV

Query:  ASGGGIIILWKEDRVEVVDTVLGAFSISIKVTLLDKSEGWISGVYGPCGYQEKLHFLQELYDIHGLCQGVWCLVGDFNIIRWVEERLSRRNPLRSMKKFN
         S GGI+ILW      ++    G FS+S    L + S  W++G+YGP   +E++HF  EL+++  L    W L GD N+IR  EE  S  +   + +  N
Subjt:  ASGGGIIILWKEDRVEVVDTVLGAFSISIKVTLLDKSEGWISGVYGPCGYQEKLHFLQELYDIHGLCQGVWCLVGDFNIIRWVEERLSRRNPLRSMKKFN

Query:  RFISSADLIDLPMLNGLFTWSRLGVRVAASRLDRFLLSKQWIDKFSDFRVLRLKRPTSDHFPIML--SIGAVKWGPTPFRFENSWLDIRDFKEQLESWWT
         FIS+  LID P+ N  FTWS L      SR+DRFL +  W + FS      L R TSDHFP++   S   + WGP PFR  +  L   +FK  +  WW 
Subjt:  RFISSADLIDLPMLNGLFTWSRLGVRVAASRLDRFLLSKQWIDKFSDFRVLRLKRPTSDHFPIML--SIGAVKWGPTPFRFENSWLDIRDFKEQLESWWT

Query:  ELDPTGWAGFRFMEKLKGLKCKIKEWNEGRRVLARAKNDEILEKIDYIDRLEECNEAHEQQVEDRKKLKVELLEITIEEQRSLNKKCKLRWLNEGDENSS
             G+ GF F+++LK L   IK W + +        + I+ ++D ID+ E      +++   R  LK +L E++++E +   ++ K  WL EGDENSS
Subjt:  ELDPTGWAGFRFMEKLKGLKCKIKEWNEGRRVLARAKNDEILEKIDYIDRLEECNEAHEQQVEDRKKLKVELLEITIEEQRSLNKKCKLRWLNEGDENSS

Query:  FFHKWATAKKNKAFISVLEKESGEIIPREEDIEGEILSFFRKLY--SKEQGPRFVLDGVNWDPIDAQCCLDLEKEFKEEEIYKAICDLGNLKSLGPDGMT
        FFH+  ++++ ++FI  ++ E G I      I    + FF ++Y  S +  P F+ + ++W+PI +     L   F E EI   I      K+ GPDG  
Subjt:  FFHKWATAKKNKAFISVLEKESGEIIPREEDIEGEILSFFRKLY--SKEQGPRFVLDGVNWDPIDAQCCLDLEKEFKEEEIYKAICDLGNLKSLGPDGMT

Query:  REFYKNSWNILKKDL-----------VEGRQILDAILIATESVEELKSRKNSGVLLKLDLEKAYDMVDWCFLDAILSRKGFGHRWRLWIKGCLRNSNFSI
          F+K+ W  LK  L           V+ RQI DAIL+A E+V+  K +K  G +LKLD+EKA+D ++  F+D +L +K F + WR WI+GC+ N  +S+
Subjt:  REFYKNSWNILKKDL-----------VEGRQILDAILIATESVEELKSRKNSGVLLKLDLEKAYDMVDWCFLDAILSRKGFGHRWRLWIKGCLRNSNFSI

Query:  IVNGRPRGKIVASRGLRQGDPLSPFLFIVVGDAVSRTVQLCIEKKILKGWKIGQNRMEVAMLQYADDTLVFCPNNEVDIAKWWDILRVILQGSGLSLNSS
        I+NGRP+G+I A+RGLRQGDPLSPFLF++  D +SR +        +KG  +  N   ++ + +ADD L+F  +N+  +      L +  + SGL +N  
Subjt:  IVNGRPRGKIVASRGLRQGDPLSPFLFIVVGDAVSRTVQLCIEKKILKGWKIGQNRMEVAMLQYADDTLVFCPNNEVDIAKWWDILRVILQGSGLSLNSS

Query:  KTAIIGVNTDMSMLANWANHLDCKAESLPVNYLGFPLGGNYHRKMFWDPLLDKFRRKLDNWRNFHISKRGRTTLAQSVLNSLLIYYFSLLKAPKATVRAM
        K+A++ VN  +      A+       SLP++YLG PLGGN    +FW  + DK ++KL+NW+   ISK GR TL +S L+SL IY  S+ +AP  T + +
Subjt:  KTAIIGVNTDMSMLANWANHLDCKAESLPVNYLGFPLGGNYHRKMFWDPLLDKFRRKLDNWRNFHISKRGRTTLAQSVLNSLLIYYFSLLKAPKATVRAM

Query:  EKTVRDFVWGGGTYKPGVNLVKW
        EK  R F+W G     G +L+ W
Subjt:  EKTVRDFVWGGGTYKPGVNLVKW

A0A803P8A0 Uncharacterized protein1.7e-16838.1Show/hide
Query:  ESRLAAIDSRLVKSVWSSRHIAWVVLDPVASGGGIIILWKEDRVEVVDTVLGAFSISIKVTLLDKSEGWISGVYGPCGYQEKLHFLQELYDIHGLCQGVW
        E + A +D R + S+W SR  AW++L  +   GG +++W    + V+D+++G FSIS+ +    K   W SGVYGPC Y+ +  F  EL  +  +C   W
Subjt:  ESRLAAIDSRLVKSVWSSRHIAWVVLDPVASGGGIIILWKEDRVEVVDTVLGAFSISIKVTLLDKSEGWISGVYGPCGYQEKLHFLQELYDIHGLCQGVW

Query:  CLVGDFNIIRWVEERLSRRNPLRSMKKFNRFISSADLIDLPMLNGLFTWSRLGVRVAASRLDRFLLSKQWIDKFSDFRVLRLKRPTSDHFPIMLSIGAVK
        C+ GDFN+ R V E+L+  +  RSMK F+  I    LID  + NG FTWS        SRLDRFL    W   F   R   L R  SDH P+++     K
Subjt:  CLVGDFNIIRWVEERLSRRNPLRSMKKFNRFISSADLIDLPMLNGLFTWSRLGVRVAASRLDRFLLSKQWIDKFSDFRVLRLKRPTSDHFPIMLSIGAVK

Query:  WGPTPFRFENSWLDIRDFKEQLESWWTELDPTGWAGFRFMEKLKGLKCKIKEWNEGRRVLARAKNDEILEKIDYIDRLEECNEAHEQQVEDRKKLKVELL
        WGP PFRF+N WL+ + F +  ESWW E    GW G +FM+KLK L+ K KEW+       +A  + +  ++  +DR E     ++   ++R+KLK E  
Subjt:  WGPTPFRFENSWLDIRDFKEQLESWWTELDPTGWAGFRFMEKLKGLKCKIKEWNEGRRVLARAKNDEILEKIDYIDRLEECNEAHEQQVEDRKKLKVELL

Query:  EITIEEQRSLNKKCKLRWLNEGDENSSFFHKWATAKKNKAFISVLEKESGEIIPREEDIEGEILSFFRKLYSKEQGPRFVLDGVNWDPIDAQCCLDLEKE
         +T EE+RS+  K K +W  EGD NS FFH    A+K +  IS +E+++G+II  E++I  E+++FF KLY+ E      ++G+ W  I       LE  
Subjt:  EITIEEQRSLNKKCKLRWLNEGDENSSFFHKWATAKKNKAFISVLEKESGEIIPREEDIEGEILSFFRKLYSKEQGPRFVLDGVNWDPIDAQCCLDLEKE

Query:  FKEEEIYKAICDLGNLKSLGPDGMTREFYKNSWNILKKDL------------------------------------------------------------
        F+E+E+   +      K+ GPDG +   ++N+W ++K +L                                                            
Subjt:  FKEEEIYKAICDLGNLKSLGPDGMTREFYKNSWNILKKDL------------------------------------------------------------

Query:  -------------VEGRQILDAILIATESVEELKSRKNSGVLLKLDLEKAYDMVDWCFLDAILSRKGFGHRWRLWIKGCLRNSNFSIIVNGRPRGKIVAS
                     VEGRQILD++L+A E+VE+ +SR   G +LK+D EKAYD VDW FLD +L +KGFG RWR WI+GC+ +++FSI VNGR RGK   S
Subjt:  -------------VEGRQILDAILIATESVEELKSRKNSGVLLKLDLEKAYDMVDWCFLDAILSRKGFGHRWRLWIKGCLRNSNFSIIVNGRPRGKIVAS

Query:  RGLRQGDPLSPFLFIVVGDAVSRTVQLCIEKKILKGWKIGQNRMEVAMLQYADDTLVFCPNNEVDIAKWWDILRVILQGSGLSLNSSKTAIIGVNTDMSM
        RGLRQGDPLSPFLF +V D + R V   +E +   G++IG++ + ++ LQ+ADDTL F   +E  + K   I+      SGL +N +K+ ++G+      
Subjt:  RGLRQGDPLSPFLFIVVGDAVSRTVQLCIEKKILKGWKIGQNRMEVAMLQYADDTLVFCPNNEVDIAKWWDILRVILQGSGLSLNSSKTAIIGVNTDMSM

Query:  LANWANHLDCKAESLPVNYLGFPLGGNYHRKMFWDPLLDKFRRKLDNWRNFHISKRGRTTLAQSVLNSLLIYYFSLLKAPKATVRAMEKTVRDFVWGGGT
        +A  AN + C+    P+ YLG PLGG+  +K FW+P+LDK  +++D W+   +S+ GR TL QSVL+SL IYY SL K PK  ++ +EK +RDF W GG 
Subjt:  LANWANHLDCKAESLPVNYLGFPLGGNYHRKMFWDPLLDKFRRKLDNWRNFHISKRGRTTLAQSVLNSLLIYYFSLLKAPKATVRAMEKTVRDFVWGGGT

Query:  YKPGVNLVKWE
           G +LV W+
Subjt:  YKPGVNLVKWE

A0A803QEA6 Uncharacterized protein8.6e-16837.85Show/hide
Query:  ESRLAAIDSRLVKSVWSSRHIAWVVLDPVASGGGIIILWKEDRVEVVDTVLGAFSISIKVTLLDKSEGWISGVYGPCGYQEKLHFLQELYDIHGLCQGVW
        E + A +D R + S+W SR  AW+++  +   GG +++W    + V+D+++G FSIS+ +    K   W SGVYGPC Y+ +  F  EL  +  +C   W
Subjt:  ESRLAAIDSRLVKSVWSSRHIAWVVLDPVASGGGIIILWKEDRVEVVDTVLGAFSISIKVTLLDKSEGWISGVYGPCGYQEKLHFLQELYDIHGLCQGVW

Query:  CLVGDFNIIRWVEERLSRRNPLRSMKKFNRFISSADLIDLPMLNGLFTWSRLGVRVAASRLDRFLLSKQWIDKFSDFRVLRLKRPTSDHFPIMLSIGAVK
        C+ GDFN+ R V E+L+  +  RSMK F+  I    LID  + NG FTWS        SRLDRFL +  W   F   R   L R  SDH P+++     K
Subjt:  CLVGDFNIIRWVEERLSRRNPLRSMKKFNRFISSADLIDLPMLNGLFTWSRLGVRVAASRLDRFLLSKQWIDKFSDFRVLRLKRPTSDHFPIMLSIGAVK

Query:  WGPTPFRFENSWLDIRDFKEQLESWWTELDPTGWAGFRFMEKLKGLKCKIKEWNEGRRVLARAKNDEILEKIDYIDRLEECNEAHEQQVEDRKKLKVELL
        WGP PFRF+N WLD + F +  E WW E    GW G +FM+KLK L+ K+KEW++      RAK   +  ++  +D+LE  +  ++  +++R+KLK E  
Subjt:  WGPTPFRFENSWLDIRDFKEQLESWWTELDPTGWAGFRFMEKLKGLKCKIKEWNEGRRVLARAKNDEILEKIDYIDRLEECNEAHEQQVEDRKKLKVELL

Query:  EITIEEQRSLNKKCKLRWLNEGDENSSFFHKWATAKKNKAFISVLEKESGEIIPREEDIEGEILSFFRKLYSKEQGPRFVLDGVNWDPIDAQCCLDLEKE
         +  EE+R    K K +W  EGD NS FFH    A+K +  IS +E+E+G+II  E++I  E+++FF KLY+ E      ++G+ W  I       LE  
Subjt:  EITIEEQRSLNKKCKLRWLNEGDENSSFFHKWATAKKNKAFISVLEKESGEIIPREEDIEGEILSFFRKLYSKEQGPRFVLDGVNWDPIDAQCCLDLEKE

Query:  FKEEEIYKAICDLGNLKSLGPDGMTREFYKNSWNILKKDL------------------------------------------------------------
        F+EEE+   +      K+ GPDG +    +++W  +K DL                                                            
Subjt:  FKEEEIYKAICDLGNLKSLGPDGMTREFYKNSWNILKKDL------------------------------------------------------------

Query:  -------------VEGRQILDAILIATESVEELKSRKNSGVLLKLDLEKAYDMVDWCFLDAILSRKGFGHRWRLWIKGCLRNSNFSIIVNGRPRGKIVAS
                     VEGRQILD++L+A E+VE+ +SR   G +LK+D EKAYD VDW FLD +L +KGFG RWR WI+GC+ +++FSI +NGR RGK   S
Subjt:  -------------VEGRQILDAILIATESVEELKSRKNSGVLLKLDLEKAYDMVDWCFLDAILSRKGFGHRWRLWIKGCLRNSNFSIIVNGRPRGKIVAS

Query:  RGLRQGDPLSPFLFIVVGDAVSRTVQLCIEKKILKGWKIGQNRMEVAMLQYADDTLVFCPNNEVDIAKWWDILRVILQGSGLSLNSSKTAIIGVNTDMSM
        RGLRQGDPLSPFLF ++ D + R V   IE + L G++IG++ + ++ LQ+ADDTL F   +EV + K   +++     SGL +N +K+ ++G+  +   
Subjt:  RGLRQGDPLSPFLFIVVGDAVSRTVQLCIEKKILKGWKIGQNRMEVAMLQYADDTLVFCPNNEVDIAKWWDILRVILQGSGLSLNSSKTAIIGVNTDMSM

Query:  LANWANHLDCKAESLPVNYLGFPLGGNYHRKMFWDPLLDKFRRKLDNWRNFHISKRGRTTLAQSVLNSLLIYYFSLLKAPKATVRAMEKTVRDFVWGGGT
        +A  A  + C+    P+ YLG  LGG+  ++ FW+P+LDK  +++D W+   +S+ GR TL QSVL+SL IYY SL KAPK  ++ +EK +R+F W GG 
Subjt:  LANWANHLDCKAESLPVNYLGFPLGGNYHRKMFWDPLLDKFRRKLDNWRNFHISKRGRTTLAQSVLNSLLIYYFSLLKAPKATVRAMEKTVRDFVWGGGT

Query:  YKPGVNLVKWE
           G +LV W+
Subjt:  YKPGVNLVKWE

A0A803QI00 Uncharacterized protein7.0e-17828.38Show/hide
Query:  IVWISDSIEDLLLAPATHKLFRKVDCNNGFI--WIQKISNKRGSFLEITKVNNSGGKHNLVVPAGVDYKGWKSFSDLLNKFLNGED-DYLEDSKQAETVV
        + WI  ++E+L       ++  K    N  I  +++  +N +GSFL+I+ + N+  K  ++VP   + KGW    + LN  +  +    +E S Q +   
Subjt:  IVWISDSIEDLLLAPATHKLFRKVDCNNGFI--WIQKISNKRGSFLEITKVNNSGGKHNLVVPAGVDYKGWKSFSDLLNKFLNGED-DYLEDSKQAETVV

Query:  RRK--GKGKSFADIVKRSSSNSIPND--PSTQITPQGSRGKER----KNAPKVRRSHTGTEWKDI-PVCRGCSEEVRKI--------NWEEVIVITKRDF
        + +     +S+A+IVK++   ++       T++T  G R + R    +  PK+   + G + K++ PV R    +  +         +W+  I++T+ + 
Subjt:  RRK--GKGKSFADIVKRSSSNSIPND--PSTQITPQGSRGKER----KNAPKVRRSHTGTEWKDI-PVCRGCSEEVRKI--------NWEEVIVITKRDF

Query:  HDDWGRILEVMQQQLMEPLIINPFHPDKALIKCHSR-ELVDLLTKNQGWVSFGPIILKVEKW---NKNAHSKISCIPSYGGWVKIRNLPLHLWNLQTFKA
        + DW  I   + ++L   L+++    D+ +I C    E  +L+  +   V     ++    W   N+  + K+ C    G W+ ++ LPL+LWN++ F+ 
Subjt:  HDDWGRILEVMQQQLMEPLIINPFHPDKALIKCHSR-ELVDLLTKNQGWVSFGPIILKVEKW---NKNAHSKISCIPSYGGWVKIRNLPLHLWNLQTFKA

Query:  IGDNLGGFIEYEEANSLLIECVEVKIKIKENYCGFIPAEVRIEDGEDQYIVQIVTFQDGSLLIDRVAGIHGTFSSTVVHIFHRGPMDPLFCTAYIWRIDN
        IG+  GG ++ ++  +       +++++  +  GFIP  V +    D    ++  F+    L D    +HG F++     F  G +      +    I+ 
Subjt:  IGDNLGGFIEYEEANSLLIECVEVKIKIKENYCGFIPAEVRIEDGEDQYIVQIVTFQDGSLLIDRVAGIHGTFSSTVVHIFHRGPMDPLFCTAYIWRIDN

Query:  GLHYPVV-IVPQ---------SVDEPRICGGQLDDDKVDFE------ISRLRTKGKNKAEAGPSKCAVILVRVEAQPILVQPDGP--------STMKRSK
        G H   V   P+         S  E    GG  +  K   +         LRT G   A A  + C + L   EA  + ++  G         + + RS 
Subjt:  GLHYPVV-IVPQ---------SVDEPRICGGQLDDDKVDFE------ISRLRTKGKNKAEAGPSKCAVILVRVEAQPILVQPDGP--------STMKRSK

Query:  KEKRKGVSFAKEAQITL------------FKKG---------------EVHHTQKSPS--------------PHLKGVTT-----DTW------------
        +  R   +    A I L            FK G               E+  + + PS               +L G T      D W            
Subjt:  KEKRKGVSFAKEAQITL------------FKKG---------------EVHHTQKSPS--------------PHLKGVTT-----DTW------------

Query:  -----------NGEQVLESDFSFS-----------SPTSLDGEGGKDSGLS----SRKAFDCDIPEGYQLCFSNDTDQDISSIPLSVEESVELAEEAPST
                     E ++ SDF+ +           +P  +  E  K    S    +R+ F   I + ++  F+ +  +   S+    +E V   E  P+ 
Subjt:  -----------NGEQVLESDFSFS-----------SPTSLDGEGGKDSGLS----SRKAFDCDIPEGYQLCFSNDTDQDISSIPLSVEESVELAEEAPST

Query:  S-----KEETIQSPKAPPECRTIGEALHEENHFLHPKALAIIPAGKEDR------ADLETEKGELLLSKELILTLRRNNLCIRSIVGSSAKKGNSTKKKR
        +     K ++I   +      +   +L  E +   PK +  +    ED        D E + GE   S++++  +  N L    +     ++    K   
Subjt:  S-----KEETIQSPKAPPECRTIGEALHEENHFLHPKALAIIPAGKEDR------ADLETEKGELLLSKELILTLRRNNLCIRSIVGSSAKKGNSTKKKR

Query:  SREVTNLFRTWEK------EVEPII--EEENGNENGNV---SFSEVKSHVERAE-----------TESRLAAIDSRLVKSVWSSRHIAWVVLDPVASGGG
        + EV     +WEK      E++  I  E E+G++ G     S  + K H  +A             E +  ++D R + S+W SR  AW+++  +   GG
Subjt:  SREVTNLFRTWEK------EVEPII--EEENGNENGNV---SFSEVKSHVERAE-----------TESRLAAIDSRLVKSVWSSRHIAWVVLDPVASGGG

Query:  IIILWKEDRVEVVDTVLGAFSISIKVTLLDKSEGWISGVYGPCGYQEKLHFLQELYDIHGLCQGVWCLVGDFNIIRWVEERLSRRNPLRSMKKFNRFISS
         +++W    + V+D+++G FSIS+ +    K   W SGVYGPC Y+ +  F  EL  +  +C   WC+ GDFN+ R   E+L+  +  RSMK F+  I  
Subjt:  IIILWKEDRVEVVDTVLGAFSISIKVTLLDKSEGWISGVYGPCGYQEKLHFLQELYDIHGLCQGVWCLVGDFNIIRWVEERLSRRNPLRSMKKFNRFISS

Query:  ADLIDLPMLNGLFTWSRLGVRVAASRLDRFLLSKQWIDKFSDFRVLRLKRPTSDHFPIMLSIGAVKWGPTPFRFENSWLDIRDFKEQLESWWTELDPTGW
          LID  + NG FTWS        SRLDRFL +  W   +   R   L R  SDH P+++     +WGP PFRF+N WL+   F +    WW E    GW
Subjt:  ADLIDLPMLNGLFTWSRLGVRVAASRLDRFLLSKQWIDKFSDFRVLRLKRPTSDHFPIMLSIGAVKWGPTPFRFENSWLDIRDFKEQLESWWTELDPTGW

Query:  AGFRFMEKLKGLKCKIKEWNEGRRVLARAKNDEILEKIDYIDRLEECNEAHEQQVEDRKKLKVELLEITIEEQRSLNKKCKLRWLNEGDENSSFFHKWAT
         G +FM KLK  + K+KEW+       +A    +  ++  +DRLE  N   +  VE+R+KLK E  ++  EE+RS+  K K +W  EGD NS FFH    
Subjt:  AGFRFMEKLKGLKCKIKEWNEGRRVLARAKNDEILEKIDYIDRLEECNEAHEQQVEDRKKLKVELLEITIEEQRSLNKKCKLRWLNEGDENSSFFHKWAT

Query:  AKKNKAFISVLEKESGEIIPREEDIEGEILSFFRKLYSKEQGPRFVLDGVNWDPIDAQCCLDLEKEFKEEEIYKAICDLGNLKSLGPDGMTREFYKNSWN
        A+K +  IS +E+E G II +EE+I  E++ FF KLY+ E      ++ + W  I       LE  F+EEE+ +++      K+ GPDG +   ++N+W 
Subjt:  AKKNKAFISVLEKESGEIIPREEDIEGEILSFFRKLYSKEQGPRFVLDGVNWDPIDAQCCLDLEKEFKEEEIYKAICDLGNLKSLGPDGMTREFYKNSWN

Query:  ILKKDL-------------------------------------------------------------------------VEGRQILDAILIATESVEELK
         +K DL                                                                         VEGRQILD++LIA E+VE+ +
Subjt:  ILKKDL-------------------------------------------------------------------------VEGRQILDAILIATESVEELK

Query:  SRKNSGVLLKLDLEKAYDMVDWCFLDAILSRKGFGHRWRLWIKGCLRNSNFSIIVNGRPRGKIVASRGLRQGDPLSPFLFIVVGDAVSRTVQLCIEKKIL
        SR   G + K+DLEKAYD VDW FLD +L  KGFG  WR WI+GC+ +++FS+++NGR RGK   SRGLRQGDPLSPFLF +V D + R V    +    
Subjt:  SRKNSGVLLKLDLEKAYDMVDWCFLDAILSRKGFGHRWRLWIKGCLRNSNFSIIVNGRPRGKIVASRGLRQGDPLSPFLFIVVGDAVSRTVQLCIEKKIL

Query:  KGWKIGQNRMEVAMLQYADDTLVFCPNNEVDIAKWWDILRVILQGSGLSLNSSKTAIIGVNTDMSMLANWANHLDCKAESLPVNYLGFPLGGNYHRKMFW
         G+++G++ ++++ LQ+ADDTL F   +E  + K  +I+      SGL +N +K+ ++G++ +  ++A  A  + C+  + P+ YLG PLGG+  +  FW
Subjt:  KGWKIGQNRMEVAMLQYADDTLVFCPNNEVDIAKWWDILRVILQGSGLSLNSSKTAIIGVNTDMSMLANWANHLDCKAESLPVNYLGFPLGGNYHRKMFW

Query:  DPLLDKFRRKLDNWRNFHISKRGRTTLAQSVLNSLLIYYFSLLKAPKATVRAMEKTVRDFVWGGGTYKPGVNLVKWE
        +P+LDK  ++LD W+   +S+ GR  L QSVL+SL IYY SL KAPK  ++A+EK +RDF W GG    G +LV W+
Subjt:  DPLLDKFRRKLDNWRNFHISKRGRTTLAQSVLNSLLIYYFSLLKAPKATVRAMEKTVRDFVWGGGTYKPGVNLVKWE

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein1.5e-2025.47Show/hide
Query:  KSRKNSGVLLKLDLEKAYDMVDWCFLDAILSRKGFGHRWRLWIKGCLRNSNFSIIVNGRPRGKIVASRGLRQGDPLSPFLFIVVGDAVSRTVQLCIEKKI
        +++  + V++ +D EKA+D +   F+   L++ G    +   I+        +II+NG+         G RQG PLSP LF +V + ++R ++   ++K 
Subjt:  KSRKNSGVLLKLDLEKAYDMVDWCFLDAILSRKGFGHRWRLWIKGCLRNSNFSIIVNGRPRGKIVASRGLRQGDPLSPFLFIVVGDAVSRTVQLCIEKKI

Query:  LKGWKIGQNRMEVAMLQYADDTLVFCPNNEVDIAKWWDILRVILQGSGLSLNSSKTAIIGVNTDMSMLANWANHLDCKAESLPVNYLGFPLGGNYHR--K
        +KG ++G+   EV +  +ADD +V+  N  V       ++    + SG  +N  K+     N +    +     L     S  + YLG  L  +     K
Subjt:  LKGWKIGQNRMEVAMLQYADDTLVFCPNNEVDIAKWWDILRVILQGSGLSLNSSKTAIIGVNTDMSMLANWANHLDCKAESLPVNYLGFPLGGNYHR--K

Query:  MFWDPLLDKFRRKLDNWRNFHISKRGRTTLAQSVLNSLLIYYFSL--LKAPKATVRAMEKTVRDFVW
          + PLL + +   + W+N   S  GR  + +  +   +IY F+   +K P      +EKT   F+W
Subjt:  MFWDPLLDKFRRKLDNWRNFHISKRGRTTLAQSVLNSLLIYYFSL--LKAPKATVRAMEKTVRDFVW

P08548 LINE-1 reverse transcriptase homolog6.0e-1723.44Show/hide
Query:  ESVEELKSRKNSGVLLKLDLEKAYDMVDWCFLDAILSRKGFGHRWRLWIKGCLRNSNFSIIVNGRPRGKIVASRGLRQGDPLSPFLFIVVGDAVSRTVQL
        + + +LK++ +  ++L +D EKA+D +   F+   L + G    +   I+        +II+NG          G RQG PLSP LF +V + ++  ++ 
Subjt:  ESVEELKSRKNSGVLLKLDLEKAYDMVDWCFLDAILSRKGFGHRWRLWIKGCLRNSNFSIIVNGRPRGKIVASRGLRQGDPLSPFLFIVVGDAVSRTVQL

Query:  CIEKKILKGWKIGQNRMEVAMLQYADDTLVFCPNNEVDIAKWWDILRVILQGSGLSLNSSKTAIIGVNTDMSMLANWANHLDCKAESLPVNYLGFPLGGN
          E+K +KG  IG   +++++  +ADD +V+  N      K  ++++     SG  +N+ K+       +        + +        + YLG  L  +
Subjt:  CIEKKILKGWKIGQNRMEVAMLQYADDTLVFCPNNEVDIAKWWDILRVILQGSGLSLNSSKTAIIGVNTDMSMLANWANHLDCKAESLPVNYLGFPLGGN

Query:  YH--RKMFWDPLLDKFRRKLDNWRNFHISKRGRTTLAQSVLNSLLIYYFSL--LKAPKATVRAMEKTVRDFVW
             K  ++ L  +    ++ W+N   S  GR  + +  +    IY F+   +KAP +  + +EK +  F+W
Subjt:  YH--RKMFWDPLLDKFRRKLDNWRNFHISKRGRTTLAQSVLNSLLIYYFSL--LKAPKATVRAMEKTVRDFVW

P11369 LINE-1 retrotransposable element ORF2 protein8.1e-2221.2Show/hide
Query:  LVGDFNIIRWVEERLSRRNPLRSMKKFNRFISSADLIDL-----PMLNGLFTWSRLGVRVAASRLDRFLLSKQWIDKFSDFRVLRLKRPTSDHFPIMLSI
        +VGDFN     ++R  ++   R   K    +   DL D+     P   G   +S        S++D  +  K  ++++ +  ++      SDH  + L  
Subjt:  LVGDFNIIRWVEERLSRRNPLRSMKKFNRFISSADLIDL-----PMLNGLFTWSRLGVRVAASRLDRFLLSKQWIDKFSDFRVLRLKRPTSDHFPIMLSI

Query:  -GAVKWGPTPF--RFENSWL-----------DIRDFKEQLESWWTELDPTGWAGFRFMEKLKGLKCKIKEWNEGRRVLARAKNDEILEKIDYIDRLEECN
           +  G   F  +  N+ L           +I+DF E  E+  T   P  W   +       L+ K+   +  ++    A    +   +  +++ +E N
Subjt:  -GAVKWGPTPF--RFENSWL-----------DIRDFKEQLESWWTELDPTGWAGFRFMEKLKGLKCKIKEWNEGRRVLARAKNDEILEKIDYIDRLEECN

Query:  EAHEQQVEDRKKLKVELLEITIEEQRSLNKKCKLR-WLNEG-DENSSFFHKWATAKKNKAFISVLEKESGEIIPREEDIEGEILSFFRKLYSKEQG----
             + ++  KL+ E+ +  +E +R++ +  + R W  E  ++      +     ++K  I+ +  E G+I    E+I+  I SF+++LYS +      
Subjt:  EAHEQQVEDRKKLKVELLEITIEEQRSLNKKCKLR-WLNEG-DENSSFFHKWATAKKNKAFISVLEKESGEIIPREEDIEGEILSFFRKLYSKEQG----

Query:  -----PRFVLDGVNWDPIDAQCCLDLEKEFKEEEIYKAICDLGNLKSLGPDGMTREFYKNSWNILKKDL------------VEG----------------
              R+ +  +N D +D      L      +EI   I  L   KS GPDG + EFY+      K+DL            VEG                
Subjt:  -----PRFVLDGVNWDPIDAQCCLDLEKEFKEEEIYKAICDLGNLKSLGPDGMTREFYKNSWNILKKDL------------VEG----------------

Query:  --------------------RQILDAILIATESVEELKS-------------------RKNSGV-------------LLKLDLEKAYDMVDWCFLDAILS
                             +IL+ IL A    E +K+                   RK+  V             ++ LD EKA+D +   F+  +L 
Subjt:  --------------------RQILDAILIATESVEELKS-------------------RKNSGV-------------LLKLDLEKAYDMVDWCFLDAILS

Query:  RKGFGHRWRLWIKGCLRNSNFSIIVNGRPRGKIVASRGLRQGDPLSPFLFIVVGDAVSRTVQLCIEKKILKGWKIGQNRMEVAMLQYADDTLVFCPNNEV
        R G    +   IK        +I VNG     I    G RQG PLSP+LF +V + ++R ++   ++K +KG +IG+  +++++L  ADD +V+  + + 
Subjt:  RKGFGHRWRLWIKGCLRNSNFSIIVNGRPRGKIVASRGLRQGDPLSPFLFIVVGDAVSRTVQLCIEKKILKGWKIGQNRMEVAMLQYADDTLVFCPNNEV

Query:  DIAKWWDILRVILQGSGLSLNSSKTAIIGVNTDMSMLANWANHLDCKAESLPVNYLGFPLG---GNYHRKMFWDPLLDKFRRKLDNWRNFHISKRGRTTL
           +  +++    +  G  +NS+K+       +                +  + YLG  L     + + K F   L  + +  L  W++   S  GR  +
Subjt:  DIAKWWDILRVILQGSGLSLNSSKTAIIGVNTDMSMLANWANHLDCKAESLPVNYLGFPLG---GNYHRKMFWDPLLDKFRRKLDNWRNFHISKRGRTTL

Query:  AQSVLNSLLIYYFSL--LKAPKATVRAMEKTVRDFVWGGGTYKPGVNLVK
         +  +    IY F+   +K P      +E  +  FVW     +   +L+K
Subjt:  AQSVLNSLLIYYFSL--LKAPKATVRAMEKTVRDFVWGGGTYKPGVNLVK

P14381 Transposon TX1 uncharacterized 149 kDa protein9.5e-1524.91Show/hide
Query:  VEGRQILDAILIATESVEELKSRKNSGVLLKLDLEKAYDMVDWCFLDAILSRKGFGHRWRLWIKGCLRNSNFSIIVNGRPRGKIVASRGLRQGDPLSPFL
        V GR I D + +  + +   +    S   L LD EKA+D VD  +L   L    FG ++  ++K    ++   + +N      +   RG+RQG PLS  L
Subjt:  VEGRQILDAILIATESVEELKSRKNSGVLLKLDLEKAYDMVDWCFLDAILSRKGFGHRWRLWIKGCLRNSNFSIIVNGRPRGKIVASRGLRQGDPLSPFL

Query:  FIVVGDAVSRTVQLCIEKKILKGWKIGQNRMEVAMLQYADDTLVFCPNNEVDIAKWWDILRVILQGSGLSLNSSKTA-IIGVNTDMSMLANWANHLDCKA
        + +  +       LC+ +K L G  + +  M V +  YADD ++    + VD+ +  +   V    S   +N SK++ ++  +  +  L       D   
Subjt:  FIVVGDAVSRTVQLCIEKKILKGWKIGQNRMEVAMLQYADDTLVFCPNNEVDIAKWWDILRVILQGSGLSLNSSKTA-IIGVNTDMSMLANWANHLDCKA

Query:  ESLPVNYLGFPLGG-NYHRKMFWDPLLDKFRRKLDNWRNFH--ISKRGRTTLAQSVLNSLLIYYFSLLKAPKATVRAMEKTVRDFVWGG
        ES  + YLG  L    Y     +  L +    +L  W+ F   +S RGR  +   ++ S + Y    L   +  +  +++ + DF+W G
Subjt:  ESLPVNYLGFPLGG-NYHRKMFWDPLLDKFRRKLDNWRNFH--ISKRGRTTLAQSVLNSLLIYYFSLLKAPKATVRAMEKTVRDFVWGG

Q03274 Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 (Fragment)5.4e-1032.02Show/hide
Query:  ILDAILIATESVEELKSRKNSGVLLKLDLEKAYDMVDWCFLDAILSRKGFGHRWRLWIKGCLRNSNFSIIVN-GRPRGKIVASRGLRQGDPLSPFLFIVV
        +++++L+ T      + RK   V + LD+ KA+D V    +   L R G       +I G L +S  +I V  G    KI   RG++QGDPLSPFLF  V
Subjt:  ILDAILIATESVEELKSRKNSGVLLKLDLEKAYDMVDWCFLDAILSRKGFGHRWRLWIKGCLRNSNFSIIVN-GRPRGKIVASRGLRQGDPLSPFLFIVV

Query:  GDAVSRTVQLC-IEKKILKGWKIGQNRMEVAMLQYADDTLVFCPNNEVDIAKWWDILRVILQGSGLSLNSSKTAIIGV
         D +     LC ++     G  IG+ ++ V  L +ADD L+   +N+V +      +    +  G+SLN+ K+  I V
Subjt:  GDAVSRTVQLC-IEKKILKGWKIGQNRMEVAMLQYADDTLVFCPNNEVDIAKWWDILRVILQGSGLSLNSSKTAIIGV

Arabidopsis top hitse value%identityAlignment
AT1G40390.1 DNAse I-like superfamily protein5.4e-0523.4Show/hide
Query:  LCQGVWCLVGDFNIIRWVEERLS---RRNPLRSMKKFNRFISSADLIDLPMLNGLFTWS-RLGVRVAASRLDRFLLSKQWIDKFSDFRVLRLKRPTSDHF
        LC   W +VGDFN I  V E  S       L+ ++     +  +DL+DLP    L+TWS          +LDR +++  W+  F     +      SDH 
Subjt:  LCQGVWCLVGDFNIIRWVEERLS---RRNPLRSMKKFNRFISSADLIDLPMLNGLFTWS-RLGVRVAASRLDRFLLSKQWIDKFSDFRVLRLKRPTSDHF

Query:  P--IMLSIGAVKWGPTPFRFENSWLDIRDFKEQLESWWTELDPTGWAGFRFMEKLKGLKCKIKEWNEGRRVLARAKNDEILEKIDYIDRLEECNEAHEQQ
           ++L+          F++ +      DF   + + W +    G   F   E LK  K   +  N  RR  +  +   +    D++ R E         
Subjt:  P--IMLSIGAVKWGPTPFRFENSWLDIRDFKEQLESWWTELDPTGWAGFRFMEKLKGLKCKIKEWNEGRRVLARAKNDEILEKIDYIDRLEECNEAHEQQ

Query:  VEDRKKLKVELLEITIEEQRSLNKKCKLRWLNEGD
           RK        +    +    +K +++WL EGD
Subjt:  VEDRKKLKVELLEITIEEQRSLNKKCKLRWLNEGD

AT1G43760.1 DNAse I-like superfamily protein2.1e-1724.5Show/hide
Query:  LVGDFNIIRWVEER---LSRRNPLRSMKKFNRFISSADLIDLPMLNGLFTWS-RLGVRVAASRLDRFLLSKQWIDKFSDFRVLRLKRPTSDHFP-IMLSI
        LVGDF+ I    +    L    P+R +++F   +  +DL+D+P     +TWS          +LDR + +  W   F     +      SDH P I++  
Subjt:  LVGDFNIIRWVEER---LSRRNPLRSMKKFNRFISSADLIDLPMLNGLFTWS-RLGVRVAASRLDRFLLSKQWIDKFSDFRVLRLKRPTSDHFP-IMLSI

Query:  GAVKWGPTPFRFENSWLDIRDFKEQLESWWTELDPTGWAGFRFMEKLKGLKCKIKEWNEGRRVLARAKNDEILEKIDYIDRLEECNEAHEQ-QVEDRKKL
           K     FR+ +       F   L   W E  P G   F   E LK  K   K  N       + K  E L+ ++ I      N +    +VE   + 
Subjt:  GAVKWGPTPFRFENSWLDIRDFKEQLESWWTELDPTGWAGFRFMEKLKGLKCKIKEWNEGRRVLARAKNDEILEKIDYIDRLEECNEAHEQ-QVEDRKKL

Query:  KVELLEITIEEQRSLNKKCKLRWLNEGDENSSFFHKWATAKKNKAFISVLEKESGEIIPREEDIEGEILSFFRKLYSKEQG---PRFVLDGVNWDPIDAQ
        K       +E      +K +++WL +GD N+ FFHK   A + K  I  L  +    +     ++  I++++  L   +     P  V    +  P    
Subjt:  KVELLEITIEEQRSLNKKCKLRWLNEGDENSSFFHKWATAKKNKAFISVLEKESGEIIPREEDIEGEILSFFRKLYSKEQG---PRFVLDGVNWDPIDAQ

Query:  CCL--DLEKEFKEEEIYKAICDLGNLKSLGPDGMTREFYKNSWNILKKDLV
          L   L     ++EI  A+  +   K+ GPD  T EF+  SW ++K   +
Subjt:  CCL--DLEKEFKEEEIYKAICDLGNLKSLGPDGMTREFYKNSWNILKKDLV

AT4G20520.1 RNA binding;RNA-directed DNA polymerases1.6e-0439.34Show/hide
Query:  VEGRQILDAILIATESVEELKSRK--NSGVLLKLDLEKAYDMVDWCFLDAILSRKGFGHRW
        + GR   D I+   E+V  ++ +K     +LLKLDLEKAYD + W +L+  L   GF   W
Subjt:  VEGRQILDAILIATESVEELKSRK--NSGVLLKLDLEKAYDMVDWCFLDAILSRKGFGHRW

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)7.3e-1044.12Show/hide
Query:  IVNGRPRGKIVASRGLRQGDPLSPFLFIVVGDAVSRTVQLCIEKKILKGWKIGQNRMEVAMLQYADDT
        I+NG P+G +  SRGLRQGDPLSP+LFI+  + +S   +   E+  L G ++  N   +  L +ADDT
Subjt:  IVNGRPRGKIVASRGLRQGDPLSPFLFIVVGDAVSRTVQLCIEKKILKGWKIGQNRMEVAMLQYADDT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTAGCTGAACTGAGAAGTGCCATTAGATTAGACGATCTTCATGACGCATCAGAGAGGTTGCAAAACCAGGATAGAGAACAAACTCTTTTCTTGTGGTTTGATAGAAA
CTACAAGGGGAGAGTGATTAGAATACAAGAAGACCACCTTAGGAAAAAGTTTTCCTTATCTGCGGAAGAGATAGTTATTGTTTGGATCTCAGACTCGATCGAGGACCTGC
TCCTTGCTCCAGCCACCCACAAGTTATTCAGAAAAGTTGATTGCAATAATGGTTTCATATGGATTCAGAAAATCTCGAACAAAAGAGGAAGCTTCCTGGAAATCACGAAA
GTTAACAACTCAGGAGGTAAACATAATTTGGTTGTGCCTGCAGGAGTAGATTACAAGGGATGGAAAAGCTTCTCGGACCTTCTAAATAAGTTCCTTAACGGAGAGGACGA
TTATCTGGAGGATAGCAAGCAAGCTGAAACAGTGGTCAGGAGAAAGGGGAAAGGGAAGTCGTTTGCCGACATTGTTAAAAGATCCTCGAGTAATAGCATACCTAACGATC
CCTCGACTCAGATAACTCCCCAAGGATCAAGAGGAAAAGAGAGGAAAAACGCTCCAAAGGTCAGGAGGAGCCACACTGGAACAGAATGGAAAGATATACCTGTATGCAGA
GGATGTTCCGAAGAAGTCAGAAAGATTAATTGGGAAGAAGTCATAGTTATTACTAAAAGAGACTTTCATGATGATTGGGGAAGAATTCTTGAAGTGATGCAACAACAGTT
GATGGAGCCCCTTATTATTAACCCCTTTCACCCGGACAAGGCCCTGATTAAATGCCATTCCAGAGAGCTCGTTGACCTGTTGACTAAAAATCAGGGGTGGGTAAGTTTTG
GTCCGATTATCTTAAAGGTGGAGAAATGGAACAAAAATGCTCATAGCAAAATTAGCTGTATTCCGAGCTACGGGGGGTGGGTCAAGATCCGAAACCTTCCTTTGCATTTA
TGGAATCTGCAGACATTTAAAGCCATAGGGGACAACCTTGGAGGGTTTATTGAATATGAAGAAGCTAACTCCCTGCTCATTGAATGCGTGGAAGTCAAAATAAAAATCAA
AGAAAACTACTGTGGATTTATTCCGGCGGAAGTCCGAATCGAAGATGGAGAAGACCAGTACATCGTGCAAATCGTCACCTTCCAAGATGGTTCTTTGCTGATAGATCGAG
TGGCCGGAATACATGGCACCTTCTCATCGACGGTAGTTCATATCTTCCACAGAGGTCCTATGGATCCTCTCTTTTGCACAGCATATATTTGGAGAATCGATAATGGCTTG
CATTACCCAGTGGTTATTGTCCCTCAATCGGTTGATGAGCCCAGGATATGTGGAGGACAGCTGGATGACGATAAAGTAGATTTTGAAATCTCCCGCCTAAGAACTAAAGG
AAAAAATAAAGCGGAAGCTGGACCCTCAAAGTGTGCGGTTATTCTTGTCCGAGTCGAAGCCCAGCCCATTTTGGTCCAGCCTGATGGCCCTTCAACGATGAAAAGATCTA
AAAAGGAAAAGAGAAAAGGAGTCTCTTTTGCCAAGGAGGCCCAGATCACCTTGTTTAAAAAGGGGGAGGTCCATCACACCCAAAAAAGTCCCAGCCCTCATTTGAAAGGA
GTTACGACCGACACTTGGAATGGAGAGCAAGTCCTCGAATCTGATTTTTCTTTCTCAAGCCCAACAAGCTTAGACGGTGAAGGCGGGAAAGATAGTGGTCTTAGTAGCAG
GAAGGCCTTCGATTGTGATATCCCGGAAGGCTATCAGTTATGCTTTTCCAATGATACCGATCAAGATATTAGCTCGATTCCTCTTAGTGTGGAGGAATCTGTAGAGTTGG
CGGAAGAAGCGCCGAGTACCAGTAAGGAGGAAACAATCCAGAGTCCGAAAGCTCCCCCTGAGTGCAGAACCATCGGAGAAGCTTTGCACGAAGAAAACCATTTTCTCCAT
CCAAAGGCTCTAGCTATAATCCCGGCAGGCAAAGAGGACAGAGCTGATTTAGAAACAGAAAAGGGGGAGCTCCTGTTAAGTAAAGAGCTAATTCTCACCCTTAGAAGGAA
TAACCTATGCATTAGGTCGATAGTTGGCTCCAGTGCTAAGAAAGGTAACTCTACCAAGAAAAAGCGCAGCAGGGAAGTTACCAACCTTTTTAGAACATGGGAGAAAGAAG
TAGAGCCTATCATAGAAGAGGAAAATGGCAATGAAAATGGGAATGTTAGCTTTAGTGAGGTTAAGAGTCATGTAGAGAGGGCCGAGACGGAGTCCAGGTTAGCCGCCATT
GATAGCCGCCTTGTGAAGTCTGTTTGGAGCTCTAGGCACATTGCCTGGGTGGTGCTGGACCCGGTGGCCTCGGGCGGGGGCATTATTATTCTCTGGAAGGAAGATAGAGT
AGAAGTTGTAGACACTGTTTTGGGGGCTTTTTCGATCTCCATCAAAGTAACTCTCCTCGACAAGTCAGAAGGTTGGATATCGGGCGTCTATGGCCCGTGCGGCTATCAGG
AAAAACTCCACTTCTTGCAAGAGCTCTATGATATTCATGGTCTCTGTCAAGGTGTATGGTGTTTGGTGGGGGATTTTAATATCATTAGATGGGTTGAGGAAAGGTTGAGT
AGGCGTAATCCTTTGAGGAGTATGAAGAAGTTTAATCGGTTTATCTCCTCGGCTGATTTGATTGACTTGCCTATGCTCAATGGGTTATTCACCTGGTCCAGATTGGGGGT
GAGAGTTGCTGCTTCAAGATTAGATAGATTCCTTCTCTCCAAACAATGGATTGACAAGTTCTCTGATTTTAGAGTTTTGAGACTTAAGAGACCCACCTCTGATCACTTCC
CCATCATGCTGTCTATTGGTGCGGTTAAATGGGGCCCAACTCCATTCCGGTTCGAAAATTCCTGGCTGGATATTCGAGATTTTAAGGAGCAGCTTGAGTCGTGGTGGACT
GAGTTGGATCCCACAGGGTGGGCGGGATTTAGATTCATGGAGAAGTTGAAAGGGTTGAAATGTAAAATTAAAGAGTGGAATGAAGGCCGTCGAGTGTTGGCAAGAGCTAA
GAATGATGAGATCTTAGAAAAAATTGACTACATTGATAGATTGGAGGAATGCAATGAGGCTCATGAGCAACAAGTGGAAGACAGAAAGAAGCTTAAAGTTGAGTTGTTGG
AAATCACTATAGAAGAGCAAAGAAGCCTCAATAAGAAGTGCAAGCTTAGATGGTTAAATGAGGGGGATGAAAACTCCTCCTTCTTCCATAAATGGGCCACTGCCAAGAAA
AACAAAGCTTTTATTTCAGTTTTGGAGAAGGAAAGTGGGGAAATCATCCCCAGAGAAGAAGATATTGAAGGTGAGATTTTGAGCTTTTTCAGAAAACTGTACTCGAAAGA
GCAGGGACCAAGATTTGTGCTTGATGGTGTGAATTGGGATCCCATTGATGCTCAATGTTGTTTGGATCTGGAAAAGGAGTTTAAGGAGGAAGAGATTTACAAGGCTATTT
GTGATTTGGGGAATCTAAAATCCCTGGGTCCGGATGGGATGACAAGGGAGTTTTATAAAAATTCTTGGAACATCTTGAAGAAAGATTTAGTAGAGGGTAGGCAAATCTTG
GATGCTATTTTAATCGCTACCGAATCTGTGGAAGAGTTAAAAAGTCGTAAAAATTCGGGTGTTCTTCTAAAGCTGGATCTTGAGAAGGCTTATGATATGGTGGATTGGTG
CTTCCTTGATGCTATTCTCTCCAGGAAGGGATTCGGCCATAGATGGAGGCTATGGATAAAAGGCTGCCTTCGGAACTCTAATTTCTCTATAATTGTGAATGGCAGGCCAA
GAGGGAAGATTGTGGCTTCTAGAGGGCTTAGGCAAGGAGATCCGTTATCGCCCTTTCTTTTTATAGTGGTTGGGGATGCCGTGAGTAGAACAGTTCAACTCTGTATTGAG
AAGAAGATTTTAAAGGGCTGGAAAATTGGGCAAAACAGAATGGAGGTGGCCATGCTTCAGTACGCTGATGACACTCTCGTTTTTTGCCCAAACAATGAGGTAGACATTGC
TAAATGGTGGGATATTTTGAGAGTTATTTTGCAGGGCTCTGGTTTATCTTTGAATTCCTCGAAGACAGCTATAATAGGGGTGAATACGGATATGAGCATGCTGGCTAATT
GGGCAAATCACCTTGACTGCAAAGCCGAATCTCTTCCGGTTAACTACTTGGGTTTTCCTTTGGGAGGGAATTATCACAGGAAGATGTTCTGGGATCCTTTATTGGATAAG
TTTAGAAGGAAACTCGACAATTGGAGGAATTTCCATATATCAAAAAGGGGTCGAACTACGTTAGCCCAATCTGTTTTGAATAGCCTCCTCATCTATTATTTTTCTTTGTT
AAAAGCCCCCAAAGCAACCGTTAGAGCGATGGAAAAAACTGTTCGGGATTTTGTGTGGGGCGGTGGTACCTACAAACCGGGGGTGAATTTGGTGAAATGGGAATGA
mRNA sequenceShow/hide mRNA sequence
ATGCTAGCTGAACTGAGAAGTGCCATTAGATTAGACGATCTTCATGACGCATCAGAGAGGTTGCAAAACCAGGATAGAGAACAAACTCTTTTCTTGTGGTTTGATAGAAA
CTACAAGGGGAGAGTGATTAGAATACAAGAAGACCACCTTAGGAAAAAGTTTTCCTTATCTGCGGAAGAGATAGTTATTGTTTGGATCTCAGACTCGATCGAGGACCTGC
TCCTTGCTCCAGCCACCCACAAGTTATTCAGAAAAGTTGATTGCAATAATGGTTTCATATGGATTCAGAAAATCTCGAACAAAAGAGGAAGCTTCCTGGAAATCACGAAA
GTTAACAACTCAGGAGGTAAACATAATTTGGTTGTGCCTGCAGGAGTAGATTACAAGGGATGGAAAAGCTTCTCGGACCTTCTAAATAAGTTCCTTAACGGAGAGGACGA
TTATCTGGAGGATAGCAAGCAAGCTGAAACAGTGGTCAGGAGAAAGGGGAAAGGGAAGTCGTTTGCCGACATTGTTAAAAGATCCTCGAGTAATAGCATACCTAACGATC
CCTCGACTCAGATAACTCCCCAAGGATCAAGAGGAAAAGAGAGGAAAAACGCTCCAAAGGTCAGGAGGAGCCACACTGGAACAGAATGGAAAGATATACCTGTATGCAGA
GGATGTTCCGAAGAAGTCAGAAAGATTAATTGGGAAGAAGTCATAGTTATTACTAAAAGAGACTTTCATGATGATTGGGGAAGAATTCTTGAAGTGATGCAACAACAGTT
GATGGAGCCCCTTATTATTAACCCCTTTCACCCGGACAAGGCCCTGATTAAATGCCATTCCAGAGAGCTCGTTGACCTGTTGACTAAAAATCAGGGGTGGGTAAGTTTTG
GTCCGATTATCTTAAAGGTGGAGAAATGGAACAAAAATGCTCATAGCAAAATTAGCTGTATTCCGAGCTACGGGGGGTGGGTCAAGATCCGAAACCTTCCTTTGCATTTA
TGGAATCTGCAGACATTTAAAGCCATAGGGGACAACCTTGGAGGGTTTATTGAATATGAAGAAGCTAACTCCCTGCTCATTGAATGCGTGGAAGTCAAAATAAAAATCAA
AGAAAACTACTGTGGATTTATTCCGGCGGAAGTCCGAATCGAAGATGGAGAAGACCAGTACATCGTGCAAATCGTCACCTTCCAAGATGGTTCTTTGCTGATAGATCGAG
TGGCCGGAATACATGGCACCTTCTCATCGACGGTAGTTCATATCTTCCACAGAGGTCCTATGGATCCTCTCTTTTGCACAGCATATATTTGGAGAATCGATAATGGCTTG
CATTACCCAGTGGTTATTGTCCCTCAATCGGTTGATGAGCCCAGGATATGTGGAGGACAGCTGGATGACGATAAAGTAGATTTTGAAATCTCCCGCCTAAGAACTAAAGG
AAAAAATAAAGCGGAAGCTGGACCCTCAAAGTGTGCGGTTATTCTTGTCCGAGTCGAAGCCCAGCCCATTTTGGTCCAGCCTGATGGCCCTTCAACGATGAAAAGATCTA
AAAAGGAAAAGAGAAAAGGAGTCTCTTTTGCCAAGGAGGCCCAGATCACCTTGTTTAAAAAGGGGGAGGTCCATCACACCCAAAAAAGTCCCAGCCCTCATTTGAAAGGA
GTTACGACCGACACTTGGAATGGAGAGCAAGTCCTCGAATCTGATTTTTCTTTCTCAAGCCCAACAAGCTTAGACGGTGAAGGCGGGAAAGATAGTGGTCTTAGTAGCAG
GAAGGCCTTCGATTGTGATATCCCGGAAGGCTATCAGTTATGCTTTTCCAATGATACCGATCAAGATATTAGCTCGATTCCTCTTAGTGTGGAGGAATCTGTAGAGTTGG
CGGAAGAAGCGCCGAGTACCAGTAAGGAGGAAACAATCCAGAGTCCGAAAGCTCCCCCTGAGTGCAGAACCATCGGAGAAGCTTTGCACGAAGAAAACCATTTTCTCCAT
CCAAAGGCTCTAGCTATAATCCCGGCAGGCAAAGAGGACAGAGCTGATTTAGAAACAGAAAAGGGGGAGCTCCTGTTAAGTAAAGAGCTAATTCTCACCCTTAGAAGGAA
TAACCTATGCATTAGGTCGATAGTTGGCTCCAGTGCTAAGAAAGGTAACTCTACCAAGAAAAAGCGCAGCAGGGAAGTTACCAACCTTTTTAGAACATGGGAGAAAGAAG
TAGAGCCTATCATAGAAGAGGAAAATGGCAATGAAAATGGGAATGTTAGCTTTAGTGAGGTTAAGAGTCATGTAGAGAGGGCCGAGACGGAGTCCAGGTTAGCCGCCATT
GATAGCCGCCTTGTGAAGTCTGTTTGGAGCTCTAGGCACATTGCCTGGGTGGTGCTGGACCCGGTGGCCTCGGGCGGGGGCATTATTATTCTCTGGAAGGAAGATAGAGT
AGAAGTTGTAGACACTGTTTTGGGGGCTTTTTCGATCTCCATCAAAGTAACTCTCCTCGACAAGTCAGAAGGTTGGATATCGGGCGTCTATGGCCCGTGCGGCTATCAGG
AAAAACTCCACTTCTTGCAAGAGCTCTATGATATTCATGGTCTCTGTCAAGGTGTATGGTGTTTGGTGGGGGATTTTAATATCATTAGATGGGTTGAGGAAAGGTTGAGT
AGGCGTAATCCTTTGAGGAGTATGAAGAAGTTTAATCGGTTTATCTCCTCGGCTGATTTGATTGACTTGCCTATGCTCAATGGGTTATTCACCTGGTCCAGATTGGGGGT
GAGAGTTGCTGCTTCAAGATTAGATAGATTCCTTCTCTCCAAACAATGGATTGACAAGTTCTCTGATTTTAGAGTTTTGAGACTTAAGAGACCCACCTCTGATCACTTCC
CCATCATGCTGTCTATTGGTGCGGTTAAATGGGGCCCAACTCCATTCCGGTTCGAAAATTCCTGGCTGGATATTCGAGATTTTAAGGAGCAGCTTGAGTCGTGGTGGACT
GAGTTGGATCCCACAGGGTGGGCGGGATTTAGATTCATGGAGAAGTTGAAAGGGTTGAAATGTAAAATTAAAGAGTGGAATGAAGGCCGTCGAGTGTTGGCAAGAGCTAA
GAATGATGAGATCTTAGAAAAAATTGACTACATTGATAGATTGGAGGAATGCAATGAGGCTCATGAGCAACAAGTGGAAGACAGAAAGAAGCTTAAAGTTGAGTTGTTGG
AAATCACTATAGAAGAGCAAAGAAGCCTCAATAAGAAGTGCAAGCTTAGATGGTTAAATGAGGGGGATGAAAACTCCTCCTTCTTCCATAAATGGGCCACTGCCAAGAAA
AACAAAGCTTTTATTTCAGTTTTGGAGAAGGAAAGTGGGGAAATCATCCCCAGAGAAGAAGATATTGAAGGTGAGATTTTGAGCTTTTTCAGAAAACTGTACTCGAAAGA
GCAGGGACCAAGATTTGTGCTTGATGGTGTGAATTGGGATCCCATTGATGCTCAATGTTGTTTGGATCTGGAAAAGGAGTTTAAGGAGGAAGAGATTTACAAGGCTATTT
GTGATTTGGGGAATCTAAAATCCCTGGGTCCGGATGGGATGACAAGGGAGTTTTATAAAAATTCTTGGAACATCTTGAAGAAAGATTTAGTAGAGGGTAGGCAAATCTTG
GATGCTATTTTAATCGCTACCGAATCTGTGGAAGAGTTAAAAAGTCGTAAAAATTCGGGTGTTCTTCTAAAGCTGGATCTTGAGAAGGCTTATGATATGGTGGATTGGTG
CTTCCTTGATGCTATTCTCTCCAGGAAGGGATTCGGCCATAGATGGAGGCTATGGATAAAAGGCTGCCTTCGGAACTCTAATTTCTCTATAATTGTGAATGGCAGGCCAA
GAGGGAAGATTGTGGCTTCTAGAGGGCTTAGGCAAGGAGATCCGTTATCGCCCTTTCTTTTTATAGTGGTTGGGGATGCCGTGAGTAGAACAGTTCAACTCTGTATTGAG
AAGAAGATTTTAAAGGGCTGGAAAATTGGGCAAAACAGAATGGAGGTGGCCATGCTTCAGTACGCTGATGACACTCTCGTTTTTTGCCCAAACAATGAGGTAGACATTGC
TAAATGGTGGGATATTTTGAGAGTTATTTTGCAGGGCTCTGGTTTATCTTTGAATTCCTCGAAGACAGCTATAATAGGGGTGAATACGGATATGAGCATGCTGGCTAATT
GGGCAAATCACCTTGACTGCAAAGCCGAATCTCTTCCGGTTAACTACTTGGGTTTTCCTTTGGGAGGGAATTATCACAGGAAGATGTTCTGGGATCCTTTATTGGATAAG
TTTAGAAGGAAACTCGACAATTGGAGGAATTTCCATATATCAAAAAGGGGTCGAACTACGTTAGCCCAATCTGTTTTGAATAGCCTCCTCATCTATTATTTTTCTTTGTT
AAAAGCCCCCAAAGCAACCGTTAGAGCGATGGAAAAAACTGTTCGGGATTTTGTGTGGGGCGGTGGTACCTACAAACCGGGGGTGAATTTGGTGAAATGGGAATGA
Protein sequenceShow/hide protein sequence
MLAELRSAIRLDDLHDASERLQNQDREQTLFLWFDRNYKGRVIRIQEDHLRKKFSLSAEEIVIVWISDSIEDLLLAPATHKLFRKVDCNNGFIWIQKISNKRGSFLEITK
VNNSGGKHNLVVPAGVDYKGWKSFSDLLNKFLNGEDDYLEDSKQAETVVRRKGKGKSFADIVKRSSSNSIPNDPSTQITPQGSRGKERKNAPKVRRSHTGTEWKDIPVCR
GCSEEVRKINWEEVIVITKRDFHDDWGRILEVMQQQLMEPLIINPFHPDKALIKCHSRELVDLLTKNQGWVSFGPIILKVEKWNKNAHSKISCIPSYGGWVKIRNLPLHL
WNLQTFKAIGDNLGGFIEYEEANSLLIECVEVKIKIKENYCGFIPAEVRIEDGEDQYIVQIVTFQDGSLLIDRVAGIHGTFSSTVVHIFHRGPMDPLFCTAYIWRIDNGL
HYPVVIVPQSVDEPRICGGQLDDDKVDFEISRLRTKGKNKAEAGPSKCAVILVRVEAQPILVQPDGPSTMKRSKKEKRKGVSFAKEAQITLFKKGEVHHTQKSPSPHLKG
VTTDTWNGEQVLESDFSFSSPTSLDGEGGKDSGLSSRKAFDCDIPEGYQLCFSNDTDQDISSIPLSVEESVELAEEAPSTSKEETIQSPKAPPECRTIGEALHEENHFLH
PKALAIIPAGKEDRADLETEKGELLLSKELILTLRRNNLCIRSIVGSSAKKGNSTKKKRSREVTNLFRTWEKEVEPIIEEENGNENGNVSFSEVKSHVERAETESRLAAI
DSRLVKSVWSSRHIAWVVLDPVASGGGIIILWKEDRVEVVDTVLGAFSISIKVTLLDKSEGWISGVYGPCGYQEKLHFLQELYDIHGLCQGVWCLVGDFNIIRWVEERLS
RRNPLRSMKKFNRFISSADLIDLPMLNGLFTWSRLGVRVAASRLDRFLLSKQWIDKFSDFRVLRLKRPTSDHFPIMLSIGAVKWGPTPFRFENSWLDIRDFKEQLESWWT
ELDPTGWAGFRFMEKLKGLKCKIKEWNEGRRVLARAKNDEILEKIDYIDRLEECNEAHEQQVEDRKKLKVELLEITIEEQRSLNKKCKLRWLNEGDENSSFFHKWATAKK
NKAFISVLEKESGEIIPREEDIEGEILSFFRKLYSKEQGPRFVLDGVNWDPIDAQCCLDLEKEFKEEEIYKAICDLGNLKSLGPDGMTREFYKNSWNILKKDLVEGRQIL
DAILIATESVEELKSRKNSGVLLKLDLEKAYDMVDWCFLDAILSRKGFGHRWRLWIKGCLRNSNFSIIVNGRPRGKIVASRGLRQGDPLSPFLFIVVGDAVSRTVQLCIE
KKILKGWKIGQNRMEVAMLQYADDTLVFCPNNEVDIAKWWDILRVILQGSGLSLNSSKTAIIGVNTDMSMLANWANHLDCKAESLPVNYLGFPLGGNYHRKMFWDPLLDK
FRRKLDNWRNFHISKRGRTTLAQSVLNSLLIYYFSLLKAPKATVRAMEKTVRDFVWGGGTYKPGVNLVKWE