; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0017939 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0017939
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr5:11943947..11951184
RNA-Seq ExpressionLag0017939
SyntenyLag0017939
Gene Ontology termsGO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR005135 - Endonuclease/exonuclease/phosphatase
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]1.6e-13432.89Show/hide
Query:  RFSGLYGNPNASHRIHKWNLLRRLHNHDDSTWVVGGDFNATLLFEENEGGNVVRDSQFQSFRDAMDDCGLQDLEFMGDMFTWSNRQERENQINERLDRFI
        RF+G YG+P A  R   W LLRR+ N D S W++GGD NA L   E    +    SQ ++FR+ MD C L D+ F G +FTW N +   +Q+ +RLDRF+
Subjt:  RFSGLYGNPNASHRIHKWNLLRRLHNHDDSTWVVGGDFNATLLFEENEGGNVVRDSQFQSFRDAMDDCGLQDLEFMGDMFTWSNRQERENQINERLDRFI

Query:  ANEDYLQLFPN------------TCVDHLQ--------WAQSD----------HRPILMNGYR--------------------IDQDKLF------QDWF
         N+ +  +FP+            +  D +Q        W +S+           +  +++ Y                     ++ +++F      +DW 
Subjt:  ANEDYLQLFPN------------TCVDHLQ--------WAQSD----------HRPILMNGYR--------------------IDQDKLF------QDWF

Query:  VGIIVEVIHLDLKDVYNLDIKEMVSPSMNEKLMAPFTKCEIERAVNQMSPSKAPGPDDFPALFYQKYWDEVGNITALNCLDILNLKRSVREWNDTHIALI
           I  +  LD++ + NL I   ++  +NE+L+AP+TK EIE A+ QM P+KA GPD FPALFYQ YW  VG  T   CL+ LN    +++WN T+IALI
Subjt:  VGIIVEVIHLDLKDVYNLDIKEMVSPSMNEKLMAPFTKCEIERAVNQMSPSKAPGPDDFPALFYQKYWDEVGNITALNCLDILNLKRSVREWNDTHIALI

Query:  PK-------ADLFLIML----FWVMSACILLKIK-------------------------------------KRGRKGWLALKLDMSKAYDRVEWCFLERL
        PK       +D   I L    + ++S  I  ++K                                     K G  G  ALKLD+SKA+DRVEW +LE +
Subjt:  PK-------ADLFLIML----FWVMSACILLKIK-------------------------------------KRGRKGWLALKLDMSKAYDRVEWCFLERL

Query:  MLKLGFNANWVKLIMECVQTTCFSILLNGIPTDRIFPTRGLRQGDPLSPYLFLLVSEVLSSLI---------TGAVW-----------------------
        M K+GFN  W++ I++C+ T  FSI LNG P     P+RG+RQGDPLSPYLFLL +E LS+LI         TG  +                       
Subjt:  MLKLGFNANWVKLIMECVQTTCFSILLNGIPTDRIFPTRGLRQGDPLSPYLFLLVSEVLSSLI---------TGAVW-----------------------

Query:  -----VLRNILLQYEHASGQKVNVGKSALYFSPNVQMEFRSVI---------------------------------------------------------
              LR +L  Y  ASGQ +N  KSAL FSPNV  E +  +                                                         
Subjt:  -----VLRNILLQYEHASGQKVNVGKSALYFSPNVQMEFRSVI---------------------------------------------------------

Query:  -STLLAKQVWHLSTNPSLLAAKVLKGRYAQHDSLLSAPSKSNCSVFWRGFVWARELVVSGMRKRIGNGRSTHFFRDPWIPKEHSFKPRPIMGRTNQDGAL
           L+AK VW    +P+LL +KVLK +Y +  SLL A + S  S FW+GF+W R+L+V G+R R+GNG +   F DPW+P+  +FKP         +GAL
Subjt:  -STLLAKQVWHLSTNPSLLAAKVLKGRYAQHDSLLSAPSKSNCSVFWRGFVWARELVVSGMRKRIGNGRSTHFFRDPWIPKEHSFKPRPIMGRTNQDGAL

Query:  ---VSEFISPSMEWDLNKLKEVVNQEDMDIIAAIPISLANQEDQWIWHYCSHGNYTVRSGYKLARSISVDQESASSNNQRLWWKTLWNSKMPQKIKLFIW
           V+ FI+    WD+  +      ED D+I ++PIS  N +D W+WHY   GNY+VRSGYKL   +  +  SAS+N +   W ++W   +P KIK+FIW
Subjt:  ---VSEFISPSMEWDLNKLKEVVNQEDMDIIAAIPISLANQEDQWIWHYCSHGNYTVRSGYKLARSISVDQESASSNNQRLWWKTLWNSKMPQKIKLFIW

Query:  KAYHGCLPTFYRLWERGIDVSPMCFLCNSKWETIDHALCGCKRAKRICNVLFHRVDAAIPILNNFPDRVVW--LARNLDDESFERACIAFWSLWNDRNSS
        ++ H  +PT   L  RGI   P C +C  + E+I HA   CKRA++I   LF  +   +   +N     +W  L   L+ +    A I  W +WNDRNS 
Subjt:  KAYHGCLPTFYRLWERGIDVSPMCFLCNSKWETIDHALCGCKRAKRICNVLFHRVDAAIPILNNFPDRVVW--LARNLDDESFERACIAFWSLWNDRNSS

Query:  NNGMPI------MDWVKRYAACSLAKDGSGYGAVITEANDRLCGAMEFFDPTRLTSFAAEVNALMHG
         +G  +       +W+  +         S Y +  T++N R    ++++ P+   S     +A   G
Subjt:  NNGMPI------MDWVKRYAACSLAKDGSGYGAVITEANDRLCGAMEFFDPTRLTSFAAEVNALMHG

XP_024035599.1 uncharacterized protein LOC112096407 [Citrus clementina]7.5e-11332.74Show/hide
Query:  FTVDCVGRSGGLCLFWKDDVDVTIRSYSQFHIDASI-TWDSKMWRFSGLYGNPNASHRIHKWNLLRRLHNHDDSTWVVGGDFNATLLFEENEGGNVVRDS
        F VD  G+SGGL + W  D +V I SY++ HID  + +   K WR  G+YG+P  S + H W LLRRL       W+  GDFN  L   E  GG      
Subjt:  FTVDCVGRSGGLCLFWKDDVDVTIRSYSQFHIDASI-TWDSKMWRFSGLYGNPNASHRIHKWNLLRRLHNHDDSTWVVGGDFNATLLFEENEGGNVVRDS

Query:  QFQSFRDAMDDCGLQDLEFMGDMFTWSNRQERENQINERLDRFIANEDYLQLFPNTCVDHLQWAQSDHRPILMNGYRIDQDKLF----------------
            FR+A+ DC L DL   G  FTWSNRQ     I E+LDRF+ N+++   F +    +L   +SDH P++M      +  LF                
Subjt:  QFQSFRDAMDDCGLQDLEFMGDMFTWSNRQERENQINERLDRFIANEDYLQLFPNTCVDHLQWAQSDHRPILMNGYRIDQDKLF----------------

Query:  ---------QDWFVGII-----------------------------------VEVIHLDLK-------------DVYNLD--------------------
                 ++W    I                                   ++V+   LK             ++ NL+                    
Subjt:  ---------QDWFVGII-----------------------------------VEVIHLDLK-------------DVYNLD--------------------

Query:  ----------IKEMVSPSMNEKLMAPFTKCEIERAVNQMSPSKAPGPDDFPALFYQKYWDEVGNITALNCLDILNLKRSVREWNDTHIALIPK-------
                  +   V  +MN  L  PFT  EIE A++QM P+KAPGPD   A+F+QK+W  V N     CLD+LN + ++   N T+I LIPK       
Subjt:  ----------IKEMVSPSMNEKLMAPFTKCEIERAVNQMSPSKAPGPDDFPALFYQKYWDEVGNITALNCLDILNLKRSVREWNDTHIALIPK-------

Query:  ---ADLFLIMLFWVMSA-------------------------------------CILLKIKKRGRK-GWLALKLDMSKAYDRVEWCFLERLMLKLGFNAN
             + L  + +++ A                                     C+      +G+K   +ALKLD+ KAYDRVEW FL+ ++ +LGF++ 
Subjt:  ---ADLFLIMLFWVMSA-------------------------------------CILLKIKKRGRK-GWLALKLDMSKAYDRVEWCFLERLMLKLGFNAN

Query:  WVKLIMECVQTTCFSILLNGIPTDRIFPTRGLRQGDPLSPYLFLLVSEVLSSLITGAVWVLRNILLQYEHASGQKVNVGKSALYFSPNVQMEFRSVI---
        W+ LIM C+ T  FS+++NG     I P RGLRQG PLSPYLFLL +E     I    W         E  S  K+  G           + FR V    
Subjt:  WVKLIMECVQTTCFSILLNGIPTDRIFPTRGLRQGDPLSPYLFLLVSEVLSSLITGAVWVLRNILLQYEHASGQKVNVGKSALYFSPNVQMEFRSVI---

Query:  STLLAKQVWHLSTNPSLLAAKVLKGRYAQHDSLLSAPSKSNCSVFWRGFVWARELVVSGMRKRIGNGRSTHFFRDPWIPKEHSFKPRPIMGRTNQDGALV
          LLAKQ W L  NP  L AKV+K RY ++   L+A   S+ S  WR  +W R+++  G R RIGNG         WIP+  +FK   I+  +   GA V
Subjt:  STLLAKQVWHLSTNPSLLAAKVLKGRYAQHDSLLSAPSKSNCSVFWRGFVWARELVVSGMRKRIGNGRSTHFFRDPWIPKEHSFKPRPIMGRTNQDGALV

Query:  SEFISPSMEWDLNKLKEVVNQEDMDIIAAIPISLANQEDQWIWHYCSHGNYTVRSGYKLARSISVDQESASSNNQRLWWKTLWNSKMPQKIKLFIWKAYH
        SE I P+ +W+ + + +   + D DII +I +    QED+ IWHY   G Y+V+SGY+LA  I   +   SS      W+ LW   +P KIK+F+WKA  
Subjt:  SEFISPSMEWDLNKLKEVVNQEDMDIIAAIPISLANQEDQWIWHYCSHGNYTVRSGYKLARSISVDQESASSNNQRLWWKTLWNSKMPQKIKLFIWKAYH

Query:  GCLPTFYRLWERGIDVSPMCFLCNSKWETIDHALCGCKRAKRI
          LPT   LW R I   P+C +C    E + HAL  CK A++I
Subjt:  GCLPTFYRLWERGIDVSPMCFLCNSKWETIDHALCGCKRAKRI

XP_030946032.1 uncharacterized protein LOC115970553 [Quercus lobata]4.1e-11131.63Show/hide
Query:  LNYSGCFTVDCVGRSGGLCLFWKDDVDVTIRSYSQFHIDASITWDSK-MWRFSGLYGNPNASHRIHKWNLLRRLHNHDDSTWVVGGDFNATLLFEENEGG
        LN  G   V    + GG+ +FWK  +D ++ ++S  HID  +   ++  WRF+G YG    ++    W+ LRRL N +   W+  GDFN      E  GG
Subjt:  LNYSGCFTVDCVGRSGGLCLFWKDDVDVTIRSYSQFHIDASITWDSK-MWRFSGLYGNPNASHRIHKWNLLRRLHNHDDSTWVVGGDFNATLLFEENEGG

Query:  NVVRDSQFQSFRDAMDDCGLQDLEFMGDMFTWSNRQERENQINERLDRFIANEDYLQLFPNTCVDHLQWAQSDHRPIL--MNG--------YRID-----
         +    Q + FR  +D+CG +DL   G  FTW N       + ERLDR +A  D+L  FP T V HL++  SDH+PIL  +NG        +R +     
Subjt:  NVVRDSQFQSFRDAMDDCGLQDLEFMGDMFTWSNRQERENQINERLDRFIANEDYLQLFPNTCVDHLQWAQSDHRPIL--MNG--------YRID-----

Query:  ----QDKLFQDWFVGI----------------------------------------------IVEVIHLDLKDVYNLDIKE-----------MVSPSMNE
            +D +   W   +                                              I  ++    +++++  + +           +VS  MN+
Subjt:  ----QDKLFQDWFVGI----------------------------------------------IVEVIHLDLKDVYNLDIKE-----------MVSPSMNE

Query:  KLMAPFTKCEIERAVNQMSPSKAPGPDDFPALFYQKYWDEVGNITALNCLDILNLKRSVREWNDTHIALIPKA----------DLFLIMLFWVMSACIL-
         L+APF + E+E A+ QM P KA GPD  P LF+Q +W  +G+      L  LN        N T I LIPK            + L    + + + +L 
Subjt:  KLMAPFTKCEIERAVNQMSPSKAPGPDDFPALFYQKYWDEVGNITALNCLDILNLKRSVREWNDTHIALIPKA----------DLFLIMLFWVMSACIL-

Query:  -------------------------------------LKIKKRGRKGWLALKLDMSKAYDRVEWCFLERLMLKLGFNANWVKLIMECVQTTCFSILLNGI
                                             +K +K  + G++ALKLDMSK YD V W +L ++M KLGF   WV L+ EC+ +  +SIL+NG 
Subjt:  -------------------------------------LKIKKRGRKGWLALKLDMSKAYDRVEWCFLERLMLKLGFNANWVKLIMECVQTTCFSILLNGI

Query:  PTDRIFPTRGLRQGDPLSPYLFLLVSEVLSSLITGA-----VWVLRNILLQYEHASGQKVNVGKSALYFSPNVQMEFRSVISTLLAKQVWHLSTNPSLLA
        P   I P+RGLRQGDPLSPYLFLL SE L+ ++  A     + V+++IL  YE ASGQK+N  K+ ++FS  VQ + ++ +S  L  QVW L  + + L 
Subjt:  PTDRIFPTRGLRQGDPLSPYLFLLVSEVLSSLITGA-----VWVLRNILLQYEHASGQKVNVGKSALYFSPNVQMEFRSVISTLLAKQVWHLSTNPSLLA

Query:  AKVLKGRYAQHDSLLSAPSKSNCSVFWRGFVWARELVVSGMRKRIGNGRSTHFFRDPWIPKEHSFK---PRPIMGRTNQDGALVSEFI-SPSMEWDLNKL
         +V K +Y  + S+  A +    S  W+  + AR ++  GMR RIGNG S + + D W+P + S K   PR        DGA V+  I S +   D N L
Subjt:  AKVLKGRYAQHDSLLSAPSKSNCSVFWRGFVWARELVVSGMRKRIGNGRSTHFFRDPWIPKEHSFK---PRPIMGRTNQDGALVSEFI-SPSMEWDLNKL

Query:  KEVVNQEDMDIIAAIPISLANQEDQWIWHYCSHGNYTVRSGYK-LARSISVDQESAS-SNNQRLWWKTLWNSKMPQKIKLFIWKAYHGCLPTFYRLWERG
        ++     ++  I AIP+   +QED  IW  C  GNY+V++GY+ L  S +VD  S+S S+ Q L+WK +W  ++P KIK+F+W+     LPT   L  R 
Subjt:  KEVVNQEDMDIIAAIPISLANQEDQWIWHYCSHGNYTVRSGYK-LARSISVDQESAS-SNNQRLWWKTLWNSKMPQKIKLFIWKAYHGCLPTFYRLWERG

Query:  IDVSPMCFLCNSKWETIDHALCGCKRAKRICNVLFHRVDAAIPILNNFPDRVVWLARNLDDESFERACIAFWSLWNDRN
        I     C  C +  E   +A+ GC++ + I    F  V    P + +  + +  + + +D    E   +  W +WN RN
Subjt:  IDVSPMCFLCNSKWETIDHALCGCKRAKRICNVLFHRVDAAIPILNNFPDRVVWLARNLDDESFERACIAFWSLWNDRN

XP_042942906.1 uncharacterized protein LOC122277092 [Carya illinoinensis]3.1e-11130.51Show/hide
Query:  LNYSGCFTVDCVGRSGGLCLFWKDDVDVTIRSYSQFHIDASITWDS-KMWRFSGLYGNPNASHRIHKWNLLRRLHNHDDSTWVVGGDFNATLLFEENEGG
        +   GCF VD +G+ GGL LFWK   +V I +YSQ+H+ A +  D+   W F+G YG+P+A+ R + W+LL RL   ++  W V GDFN  L  EE  GG
Subjt:  LNYSGCFTVDCVGRSGGLCLFWKDDVDVTIRSYSQFHIDASITWDS-KMWRFSGLYGNPNASHRIHKWNLLRRLHNHDDSTWVVGGDFNATLLFEENEGG

Query:  NVVRDSQFQSFRDAMDDCGLQDLEFMGDMFTWSNRQERENQINERLDRFIANEDYLQLFPNTCVDHLQWAQSDHRPILM-----NGYR------------
        ++  +SQ   FR+A++  GL D+ ++G+ FTWSN         ERLDRF+ N  +   F    V+ L    SDH+P+L+      G R            
Subjt:  NVVRDSQFQSFRDAMDDCGLQDLEFMGDMFTWSNRQERENQINERLDRFIANEDYLQLFPNTCVDHLQWAQSDHRPILM-----NGYR------------

Query:  -IDQD------KLFQDWFVGIIVEVIHL-----------DLK----------------------------------------------------------
         I +D      K+ +D     +VE+  L           D+K                                                          
Subjt:  -IDQD------KLFQDWFVGIIVEVIHL-----------DLK----------------------------------------------------------

Query:  -DVYNLD-------------IKEMVSPSMNEKLMAPFTKCEIERAVNQMSPSKAPGPDDFPALFYQKYWDEVGNITALNCLDILNLKRSVREWNDTHIAL
         +VY  +             I+E V+ +M E+L   FT+ E++ A+ QM P K+PGPD + A FYQ +W  VG+      L  LN  +  R  N T++AL
Subjt:  -DVYNLD-------------IKEMVSPSMNEKLMAPFTKCEIERAVNQMSPSKAPGPDDFPALFYQKYWDEVGNITALNCLDILNLKRSVREWNDTHIAL

Query:  IPK-------ADLFLIMLFWVMSACIL-----------------------------------------LKIKKRGRKGWLALKLDMSKAYDRVEWCFLER
        IPK        D   I L  V+   I                                          +K + +GR G +ALKLD+SKAYD+VEW FL+ 
Subjt:  IPK-------ADLFLIMLFWVMSACIL-----------------------------------------LKIKKRGRKGWLALKLDMSKAYDRVEWCFLER

Query:  LMLKLGFNANWVKLIMECVQTTCFSILLNGIPTDRIFPTRGLRQGDPLSPYLFLLVSEVLSSLITG---------------------------------A
        +M KLGF A WV LIMECV +  +++L+NG P   I PTRGLRQGDPLSPYLFLL +E LSSLI G                                 A
Subjt:  LMLKLGFNANWVKLIMECVQTTCFSILLNGIPTDRIFPTRGLRQGDPLSPYLFLLVSEVLSSLITG---------------------------------A

Query:  VW----VLRNILLQYEHASGQKVNVGKSALYFSPN-----------------------------------------------------------------
         W     + +IL  YE ASGQ +N  K+++ FS N                                                                 
Subjt:  VW----VLRNILLQYEHASGQKVNVGKSALYFSPN-----------------------------------------------------------------

Query:  --------------------------VQMEFRSVISTLLAKQVWHLSTNPSLLAAKVLKGRYAQHDSLLSAPSKSNCSVFWRGFVWARELVVSGMRKRIG
                                  +  E  S    LLAKQVW + TNPS +AA++LK +Y ++ S+L+    +  S+ WR    + EL+  G+  R+G
Subjt:  --------------------------VQMEFRSVISTLLAKQVWHLSTNPSLLAAKVLKGRYAQHDSLLSAPSKSNCSVFWRGFVWARELVVSGMRKRIG

Query:  NGRSTHFFRDPWIPKEHSFK---PRPIMGRTNQDGALVSEFISPSMEWDLNKLKEVVNQEDMDIIAAIPISLANQEDQWIWHYCSHGNYTVRSGY--KLA
        NG+S   + D WIP+  SFK   P  I+ R  +   L+ E       W  + +KE+  +E+ D+I   PIS  N +D+ IW     G YTV+S Y  +L 
Subjt:  NGRSTHFFRDPWIPKEHSFK---PRPIMGRTNQDGALVSEFISPSMEWDLNKLKEVVNQEDMDIIAAIPISLANQEDQWIWHYCSHGNYTVRSGY--KLA

Query:  RSISVDQESASSNNQRLWWKTLWNSKMPQKIKLFIWKAYHGCLPTFYRLWERGIDVSPMCFLCNSKWETIDHALCGCKRA
        R   +  ES+  N +   WK++W    P  IK+F+WKA + CLPT + L++R +   P+C +C +K ET+ H L  C  A
Subjt:  RSISVDQESASSNNQRLWWKTLWNSKMPQKIKLFIWKAYHGCLPTFYRLWERGIDVSPMCFLCNSKWETIDHALCGCKRA

XP_042980185.1 uncharacterized protein LOC122310356 [Carya illinoinensis]1.4e-10627.81Show/hide
Query:  LNYSGCFTVDCVGRSGGLCLFWKDDVDVTIRSYSQFHIDASITWDS--KMWRFSGLYGNPNASHRIHKWNLLRRLHNHDDSTWVVGGDFNATLLFEENEG
        LN+   F VDCVGRSGG+   WK +V+  + SYS  HI  ++  ++  +    +G YG+P A+ R   WNL+R +H++    W+  GDFN  LL EE  G
Subjt:  LNYSGCFTVDCVGRSGGLCLFWKDDVDVTIRSYSQFHIDASITWDS--KMWRFSGLYGNPNASHRIHKWNLLRRLHNHDDSTWVVGGDFNATLLFEENEG

Query:  GNVVRDSQFQSFRDAMDDCGLQDLEFMGDMFTWSNRQERENQINERLDRFIANEDYLQL--------FPNTCVDH-------------------------
         +     Q ++FR  ++DCG+QDL + GD FTWSNR+E  +    RLDR   N+ +  L         P  C+DH                         
Subjt:  GNVVRDSQFQSFRDAMDDCGLQDLEFMGDMFTWSNRQERENQINERLDRFIANEDYLQL--------FPNTCVDH-------------------------

Query:  -------------LQWAQS------------------------------DHRPIL----------------MNGYRIDQ-----DKLFQD----W-----
                       W QS                              DH+ +L                 N  +I Q     DKL +     W     
Subjt:  -------------LQWAQS------------------------------DHRPIL----------------MNGYRIDQ-----DKLFQD----W-----

Query:  ----------------------FVGIIVEV-------------IHLDLKDVY-NL--------------DIKEMVSPSMNEKLMAPFTKCEIERAVNQMS
                               V  I +V             I L  +D Y NL                +  V+  MN  L +P++  EI+RA+  M+
Subjt:  ----------------------FVGIIVEV-------------IHLDLKDVY-NL--------------DIKEMVSPSMNEKLMAPFTKCEIERAVNQMS

Query:  PSKAPGPDDFPALFYQKYWDEVGNITALNCLDILNLKRSVREWNDTHIALIPK----------ADLFLIMLFWVMSACIL---LKI--------------
        P  +PGPD FPALFY ++W  VG   +   L++LN  +  ++ N T I+LIPK            + L  +F+ + A +L   LK+              
Subjt:  PSKAPGPDDFPALFYQKYWDEVGNITALNCLDILNLKRSVREWNDTHIALIPK----------ADLFLIMLFWVMSACIL---LKI--------------

Query:  ---------------------KKRGRK-GWLALKLDMSKAYDRVEWCFLERLMLKLGFNANWVKLIMECVQTTCFSILLNGIPTDRIFPTRGLRQGDPLS
                             + +GR+ G++ALKLDMSKAYDRVEW FL+  +LK+GF+++WV+L+MECV T  +SIL+NGIP     PTRG+RQGDPLS
Subjt:  ---------------------KKRGRK-GWLALKLDMSKAYDRVEWCFLERLMLKLGFNANWVKLIMECVQTTCFSILLNGIPTDRIFPTRGLRQGDPLS

Query:  PYLFLLVSEVLSSLIT-----GAV-------------------------------WV-LRNILLQYEHASGQKVNVGKSALYFSPNV-------------
        PYLF++ SE+L+  +      GA+                               W  L++ L  +E ASGQ++N  KS++YFS N              
Subjt:  PYLFLLVSEVLSSLIT-----GAV-------------------------------WV-LRNILLQYEHASGQKVNVGKSALYFSPNV-------------

Query:  ---------------------QMEFRSVISTL--------------------------------LAKQVWHLSTNPSLLAAKVLKGRYAQHDSLLSAPSK
                             Q  F S++  +                                +AKQ W L    + LAA+VLK +Y    S LS   +
Subjt:  ---------------------QMEFRSVISTL--------------------------------LAKQVWHLSTNPSLLAAKVLKGRYAQHDSLLSAPSK

Query:  SNCSVFWRGFVWARELVVSGMRKRIGNGRSTHFFRDPWIPKEHSFKPRPIMGRTNQDGALVSEFISP-SMEWDLNKLKEVVNQEDMDIIAAIPISLANQE
        ++ S  W+  + AR ++  G+  R+GNG+    + D W+P+  S+  +  +   N++ A V + I+P +M+W+L  +  + ++E+ ++I  +PIS    +
Subjt:  SNCSVFWRGFVWARELVVSGMRKRIGNGRSTHFFRDPWIPKEHSFKPRPIMGRTNQDGALVSEFISP-SMEWDLNKLKEVVNQEDMDIIAAIPISLANQE

Query:  DQWIWHYCSHGNYTVRSGYKLARSIS--VDQESASSNNQRLWWKTLWNSKMPQKIKLFIWKAYHGCLPTFYRLWERGIDVSPMCFLCNSKWETIDHALCG
        D   W   S+G ++V+S Y L  S+   V  + ++SN     W  LW+  +P   K  +W+A    LPT   L +R +  SP+C +C ++ ET+ HAL  
Subjt:  DQWIWHYCSHGNYTVRSGYKLARSIS--VDQESASSNNQRLWWKTLWNSKMPQKIKLFIWKAYHGCLPTFYRLWERGIDVSPMCFLCNSKWETIDHALCG

Query:  CKRAKRICNVLFHRVDAAIPILNNFPDRVVWLARNLDDESFERACIAFWSLWNDRNS
        CK A+ +  +   ++         F + +  +   L  E      +    LWN RNS
Subjt:  CKRAKRICNVLFHRVDAAIPILNNFPDRVVWLARNLDDESFERACIAFWSLWNDRNS

TrEMBL top hitse value%identityAlignment
A0A2N9G299 Uncharacterized protein2.7e-10829.88Show/hide
Query:  GCFTVDCVGRSGGLCLFWKDDVDVTIRSYSQFHIDAS-ITWDSKMWRFSGLYGNPNASHRIHKWNLLRRLHNHDDSTWVVGGDFNATLLFEENEGGNVVR
        GCF V+     GGL LFW D V + I+SYS FHIDA  I  D  +WRF+G YG+P A  RI  W LLR+L+   D  WV+ GDFN     EE  G     
Subjt:  GCFTVDCVGRSGGLCLFWKDDVDVTIRSYSQFHIDAS-ITWDSKMWRFSGLYGNPNASHRIHKWNLLRRLHNHDDSTWVVGGDFNATLLFEENEGGNVVR

Query:  DSQFQSFRDAMDDCGLQDLEFMGDMFTWSNRQERENQINERLDRFIANEDYLQLFPNTCVDHLQWAQSDHRPILMNGYRIDQD------KLFQ-------
          Q  +FR+A+ DC L DL F+G  FTWSN +E E  +  RLDR +A++++  LFPN  V+H+ +  SDH  +LM+     Q       K+F+       
Subjt:  DSQFQSFRDAMDDCGLQDLEFMGDMFTWSNRQERENQINERLDRFIANEDYLQLFPNTCVDHLQWAQSDHRPILMNGYRIDQD------KLFQ-------

Query:  -----------------------------------------------------DWFVGIIVEVIHL--------------DLKDVYNLDIKEMVSPSMNE
                                                              W  G + E + L               +  V  L + E+V+  MN 
Subjt:  -----------------------------------------------------DWFVGIIVEVIHL--------------DLKDVYNLDIKEMVSPSMNE

Query:  KLMAPFTKCEIERAVNQMSPSKAPGPDDFPALFYQKYWDEVGNITALNCLDILNLKRSV-REWNDTHI------ALIPKADLF--LIMLFWVMSACILLK
        KL+ PFT  E++ A+ QM PSKAPGPD    LF+QKYW  VG       LD LN  R +  ++  T++      A +P   +   +I+ F ++     LK
Subjt:  KLMAPFTKCEIERAVNQMSPSKAPGPDDFPALFYQKYWDEVGNITALNCLDILNLKRSV-REWNDTHI------ALIPKADLF--LIMLFWVMSACILLK

Query:  IKKRGRKGWLALKLDMSKAYDRVEWCFLERLMLKLGFNANWVKLIMECVQTTCFSILLNGIPTDRIFPTRGLRQGDPLSPYLFLLVSE----VLSSLITG
         K+ G+ G +  KLDMSKAYDRVEW +L  ++LKLGF+  WV L+M CV +  +SI+LNG     I P RGLRQGDPLSPYLFL+  E    +  +  TG
Subjt:  IKKRGRKGWLALKLDMSKAYDRVEWCFLERLMLKLGFNANWVKLIMECVQTTCFSILLNGIPTDRIFPTRGLRQGDPLSPYLFLLVSE----VLSSLITG

Query:  AVWVLRNILLQYEHASGQKVNVGKSALYFSPNVQMEFRSVI-----------------------------------------------------------
            L +IL  YE+ASGQK+N GK+ L+FS N Q + R +I                                                           
Subjt:  AVWVLRNILLQYEHASGQKVNVGKSALYFSPNVQMEFRSVI-----------------------------------------------------------

Query:  --------------------------------------------------------------------STLLAKQVWHLSTNPSLLAAKVLKGRYAQHDS
                                                                              LLA+Q W L   P+ L ++VLK +Y  + S
Subjt:  --------------------------------------------------------------------STLLAKQVWHLSTNPSLLAAKVLKGRYAQHDS

Query:  LLSAPSKSNCSVFWRGFVWARELVVSGMRKRIGNGRSTHFFRDPWI---PKEHSFKPRPIMGRTNQDGALVSEFISPSMEWDLNKLKEVVNQEDMDIIAA
         + A  K   S  +R    ARE++ +GM  R+G G +   ++D W+   P      P  I+      G+L+   +   M+WD+  + ++    + +I+  
Subjt:  LLSAPSKSNCSVFWRGFVWARELVVSGMRKRIGNGRSTHFFRDPWI---PKEHSFKPRPIMGRTNQDGALVSEFISPSMEWDLNKLKEVVNQEDMDIIAA

Query:  IPISLANQEDQWIWHYCSHGNYTVRSGYKLA-RSISVDQESASSNNQRLWWKTLWNSKMPQKIKLFIWKAYHGCLPTFYRLWERGIDVSPMCFLCNSKWE
        IP+S     D   W     G ++V+S Y L  +  +    SAS+   +++W  +W+S++  K++ FIW+A    LPT  +L+ER I  S  C  C  + E
Subjt:  IPISLANQEDQWIWHYCSHGNYTVRSGYKLA-RSISVDQESASSNNQRLWWKTLWNSKMPQKIKLFIWKAYHGCLPTFYRLWERGIDVSPMCFLCNSKWE

Query:  TIDHALCGCKRAKRICNVLFHRVDAAIPILNNFPDRVVWLARNLDDESFERACIAFWSLWNDRN
        T DH L  C+ A+++ +     +   + I  +F + +    + L     E      W LW  RN
Subjt:  TIDHALCGCKRAKRICNVLFHRVDAAIPILNNFPDRVVWLARNLDDESFERACIAFWSLWNDRN

A0A6J1DX30 uncharacterized protein LOC1110248747.5e-13532.89Show/hide
Query:  RFSGLYGNPNASHRIHKWNLLRRLHNHDDSTWVVGGDFNATLLFEENEGGNVVRDSQFQSFRDAMDDCGLQDLEFMGDMFTWSNRQERENQINERLDRFI
        RF+G YG+P A  R   W LLRR+ N D S W++GGD NA L   E    +    SQ ++FR+ MD C L D+ F G +FTW N +   +Q+ +RLDRF+
Subjt:  RFSGLYGNPNASHRIHKWNLLRRLHNHDDSTWVVGGDFNATLLFEENEGGNVVRDSQFQSFRDAMDDCGLQDLEFMGDMFTWSNRQERENQINERLDRFI

Query:  ANEDYLQLFPN------------TCVDHLQ--------WAQSD----------HRPILMNGYR--------------------IDQDKLF------QDWF
         N+ +  +FP+            +  D +Q        W +S+           +  +++ Y                     ++ +++F      +DW 
Subjt:  ANEDYLQLFPN------------TCVDHLQ--------WAQSD----------HRPILMNGYR--------------------IDQDKLF------QDWF

Query:  VGIIVEVIHLDLKDVYNLDIKEMVSPSMNEKLMAPFTKCEIERAVNQMSPSKAPGPDDFPALFYQKYWDEVGNITALNCLDILNLKRSVREWNDTHIALI
           I  +  LD++ + NL I   ++  +NE+L+AP+TK EIE A+ QM P+KA GPD FPALFYQ YW  VG  T   CL+ LN    +++WN T+IALI
Subjt:  VGIIVEVIHLDLKDVYNLDIKEMVSPSMNEKLMAPFTKCEIERAVNQMSPSKAPGPDDFPALFYQKYWDEVGNITALNCLDILNLKRSVREWNDTHIALI

Query:  PK-------ADLFLIML----FWVMSACILLKIK-------------------------------------KRGRKGWLALKLDMSKAYDRVEWCFLERL
        PK       +D   I L    + ++S  I  ++K                                     K G  G  ALKLD+SKA+DRVEW +LE +
Subjt:  PK-------ADLFLIML----FWVMSACILLKIK-------------------------------------KRGRKGWLALKLDMSKAYDRVEWCFLERL

Query:  MLKLGFNANWVKLIMECVQTTCFSILLNGIPTDRIFPTRGLRQGDPLSPYLFLLVSEVLSSLI---------TGAVW-----------------------
        M K+GFN  W++ I++C+ T  FSI LNG P     P+RG+RQGDPLSPYLFLL +E LS+LI         TG  +                       
Subjt:  MLKLGFNANWVKLIMECVQTTCFSILLNGIPTDRIFPTRGLRQGDPLSPYLFLLVSEVLSSLI---------TGAVW-----------------------

Query:  -----VLRNILLQYEHASGQKVNVGKSALYFSPNVQMEFRSVI---------------------------------------------------------
              LR +L  Y  ASGQ +N  KSAL FSPNV  E +  +                                                         
Subjt:  -----VLRNILLQYEHASGQKVNVGKSALYFSPNVQMEFRSVI---------------------------------------------------------

Query:  -STLLAKQVWHLSTNPSLLAAKVLKGRYAQHDSLLSAPSKSNCSVFWRGFVWARELVVSGMRKRIGNGRSTHFFRDPWIPKEHSFKPRPIMGRTNQDGAL
           L+AK VW    +P+LL +KVLK +Y +  SLL A + S  S FW+GF+W R+L+V G+R R+GNG +   F DPW+P+  +FKP         +GAL
Subjt:  -STLLAKQVWHLSTNPSLLAAKVLKGRYAQHDSLLSAPSKSNCSVFWRGFVWARELVVSGMRKRIGNGRSTHFFRDPWIPKEHSFKPRPIMGRTNQDGAL

Query:  ---VSEFISPSMEWDLNKLKEVVNQEDMDIIAAIPISLANQEDQWIWHYCSHGNYTVRSGYKLARSISVDQESASSNNQRLWWKTLWNSKMPQKIKLFIW
           V+ FI+    WD+  +      ED D+I ++PIS  N +D W+WHY   GNY+VRSGYKL   +  +  SAS+N +   W ++W   +P KIK+FIW
Subjt:  ---VSEFISPSMEWDLNKLKEVVNQEDMDIIAAIPISLANQEDQWIWHYCSHGNYTVRSGYKLARSISVDQESASSNNQRLWWKTLWNSKMPQKIKLFIW

Query:  KAYHGCLPTFYRLWERGIDVSPMCFLCNSKWETIDHALCGCKRAKRICNVLFHRVDAAIPILNNFPDRVVW--LARNLDDESFERACIAFWSLWNDRNSS
        ++ H  +PT   L  RGI   P C +C  + E+I HA   CKRA++I   LF  +   +   +N     +W  L   L+ +    A I  W +WNDRNS 
Subjt:  KAYHGCLPTFYRLWERGIDVSPMCFLCNSKWETIDHALCGCKRAKRICNVLFHRVDAAIPILNNFPDRVVW--LARNLDDESFERACIAFWSLWNDRNSS

Query:  NNGMPI------MDWVKRYAACSLAKDGSGYGAVITEANDRLCGAMEFFDPTRLTSFAAEVNALMHG
         +G  +       +W+  +         S Y +  T++N R    ++++ P+   S     +A   G
Subjt:  NNGMPI------MDWVKRYAACSLAKDGSGYGAVITEANDRLCGAMEFFDPTRLTSFAAEVNALMHG

A0A803PHH5 Uncharacterized protein8.3e-11830.26Show/hide
Query:  MSLNYSGCFTVDCVGRSGGLCLFWKDDVDVTIRSYSQFHIDASITWDSKM-WRFSGLYGNPNASHRIHKWNLLRRLHNHDDSTWVVGGDFNATLLFEENE
        +SL + GCF V+  G+SGGL L W + ++ +I S+S FHID+ I  +    WRF+G YG+P+ S R   W LL R+       W++GGDFN  L  +E +
Subjt:  MSLNYSGCFTVDCVGRSGGLCLFWKDDVDVTIRSYSQFHIDASITWDSKM-WRFSGLYGNPNASHRIHKWNLLRRLHNHDDSTWVVGGDFNATLLFEENE

Query:  GGNVVRDSQFQSFRDAMDDCGLQDLEFMGDMFTWSNRQERENQINERLDRFIANEDYLQLFPNTCVDHLQWAQSDHRPILMNGYRIDQD-----------
        GGN        +FR A+DDC L+++EF G+MFTW N ++ EN I ERLDR   N D+  LFP   V HL+   SDH P+L+      QD           
Subjt:  GGNVVRDSQFQSFRDAMDDCGLQDLEFMGDMFTWSNRQERENQINERLDRFIANEDYLQLFPNTCVDHLQWAQSDHRPILMNGYRIDQD-----------

Query:  ----------------------------------------------------------KLFQDWFVGIIVEVIHLD---LKDV---YNL-----------
                                                                  K ++D    +  +  H     LK++   YN+           
Subjt:  ----------------------------------------------------------KLFQDWFVGIIVEVIHLD---LKDV---YNL-----------

Query:  -------------------------------------------------------------------DIKEM-------VSPSMNEKLMAPFTKCEIERA
                                                                           ++KE        +S  +NE L++PFTK E+ +A
Subjt:  -------------------------------------------------------------------DIKEM-------VSPSMNEKLMAPFTKCEIERA

Query:  VNQMSPSKAPGPDDFPALFYQKYW----DEVGNITALNCLDILNLKRSVREWNDTHIALIPKADLFLIMLFWVMSACI------LLKIKKRGRKGWLALK
        +  +SP KAPG D  P LFY+K+W    DEV  I A  CL       + R  +  H A+      F+       +A I       +K+K+ G    +ALK
Subjt:  VNQMSPSKAPGPDDFPALFYQKYW----DEVGNITALNCLDILNLKRSVREWNDTHIALIPKADLFLIMLFWVMSACI------LLKIKKRGRKGWLALK

Query:  LDMSKAYDRVEWCFLERLMLKLGFNANWVKLIMECVQTTCFSILLNGIPTDRIFPTRGLRQGDPLSPYLFLLVSEVLSSLITGA----------------
        LDMSKAYDRVEW FL  +M  LG++  W++ IM CV +  FS+L+NG    +  P+RGLRQGD LSPYLFL+ SE L  LI  A                
Subjt:  LDMSKAYDRVEWCFLERLMLKLGFNANWVKLIMECVQTTCFSILLNGIPTDRIFPTRGLRQGDPLSPYLFLLVSEVLSSLITGA----------------

Query:  ---------------------VWVLRNILLQYEHASGQKVNVGKS-----------------------ALYFS-------PNVQ--MEFRSVI---STLL
                                ++ I  QYE  SGQK+N+ KS                        L++S       P  +  + FRS+      LL
Subjt:  ---------------------VWVLRNILLQYEHASGQKVNVGKS-----------------------ALYFS-------PNVQ--MEFRSVI---STLL

Query:  AKQVWHLSTNPSLLAAKVLKGRYAQHDSLLSAPSKSNCSVFWRGFVWARELVVSGMRKRIGNGRSTHFFRDPWIPKEH-SFKPRPIMGRTNQDGALVSEF
        AKQ WHL   P  L A+VLK  Y  + S L+A    N S  W+G VW RE++  G R R+GNGR+   ++D W+P+ + +   RPI    N     VS  
Subjt:  AKQVWHLSTNPSLLAAKVLKGRYAQHDSLLSAPSKSNCSVFWRGFVWARELVVSGMRKRIGNGRSTHFFRDPWIPKEH-SFKPRPIMGRTNQDGALVSEF

Query:  ISPSMEWDLNKLKEVVNQEDMDIIAAIPISLANQEDQWIWHYCSHGNYTVRSGYKLARSISVDQESASSNNQ-RLWWKTLWNSKMPQKIKLFIWKAYHGC
        ++   EW+++ L    ++ED+  I  IPI   + ED  IW +   GNY V+SGY++AR I++    +S+ +Q   WWK  W+  +P ++KLF WK     
Subjt:  ISPSMEWDLNKLKEVVNQEDMDIIAAIPISLANQEDQWIWHYCSHGNYTVRSGYKLARSISVDQESASSNNQ-RLWWKTLWNSKMPQKIKLFIWKAYHGC

Query:  LPTFYRLWERGIDVSPMCFLCNSKWETIDHALCGCKRAKRICNVLFHRVDAAIPILNNFPDRVVWLARNLDDESFERACIAFWSLWNDRN------SSNN
        LP    L  RG+ ++P+C  C+   ET+ HAL  C++ K++  ++           N+  D +V     L  E FE      W++W +RN       + +
Subjt:  LPTFYRLWERGIDVSPMCFLCNSKWETIDHALCGCKRAKRICNVLFHRVDAAIPILNNFPDRVVWLARNLDDESFERACIAFWSLWNDRN------SSNN

Query:  GMPIMDWV
        G  +++WV
Subjt:  GMPIMDWV

A0A803QAN3 Uncharacterized protein7.6e-11129.54Show/hide
Query:  LFWKDDVDVTIRSYSQFHIDASITW-DSKMWRFSGLYGNPNASHRIHKWNLLRRLHNHDD-STWVVGGDFNATLLFEENEGGNVVRDSQFQSFRDAMDDC
        L WKDDVDVT+ S++    D  + + +     F+  YG PN +HR+H W LL+RL +      WVV GDFN  L     +GGN   +SQ   FR  +D C
Subjt:  LFWKDDVDVTIRSYSQFHIDASITW-DSKMWRFSGLYGNPNASHRIHKWNLLRRLHNHDD-STWVVGGDFNATLLFEENEGGNVVRDSQFQSFRDAMDDC

Query:  GLQDLEFMGDMFTWSNRQERENQINERLDRFIANEDYLQLFPNTCVDHLQWAQSDHRPI--------------LMNGYRIDQDKLFQ-----------DW
         L +L F GD FTW   + + + I+ERLD    N+ +   F      HL +  SDHR I              L N   I  D L Q           DW
Subjt:  GLQDLEFMGDMFTWSNRQERENQINERLDRFIANEDYLQLFPNTCVDHLQWAQSDHRPI--------------LMNGYRIDQDKLFQ-----------DW

Query:  F-----------------------------VGIIV-------EVIHLDLKDVYNLD-------------IKEMVSPSMNEKLMAPFTKCEIERAVNQMSP
                                       GI V       +VI      ++  D             I E ++  MN  L APFT  E+  A+  MSP
Subjt:  F-----------------------------VGIIV-------EVIHLDLKDVYNLD-------------IKEMVSPSMNEKLMAPFTKCEIERAVNQMSP

Query:  SKAPGPDDFPALFYQKYWDEVGNITALNCLDILNLKRSVREWNDTHIALIPKAD-----------LFLIMLFWVMSACILLKIKK---------------
         K+PG D   A+FYQ YWD VG       L +LN    + + N + I LIPK                 +++ ++S  I+L+ +K               
Subjt:  SKAPGPDDFPALFYQKYWDEVGNITALNCLDILNLKRSVREWNDTHIALIPKAD-----------LFLIMLFWVMSACILLKIKK---------------

Query:  ----------------------RGRKGWLALKLDMSKAYDRVEWCFLERLMLKLGFNANWVKLIMECVQTTCFSILLNGIPTDRIFPTRGLRQGDPLSPY
                              +GR+G+ ALKLDMSKA+DRVEW +LE +MLK+GF + WV LIM C+ T+ FS  LNG     + P+RGLRQGDPLSPY
Subjt:  ----------------------RGRKGWLALKLDMSKAYDRVEWCFLERLMLKLGFNANWVKLIMECVQTTCFSILLNGIPTDRIFPTRGLRQGDPLSPY

Query:  LFLLVSEVLSSLI-------------------------------------TGAVWVLRNILLQYEHASGQKVNVGKSALYFSPNVQ--------------
        LFL+ SE LS L+                                       +   ++  L  Y  ASGQ +N  KS + FSPN                
Subjt:  LFLLVSEVLSSLI-------------------------------------TGAVWVLRNILLQYEHASGQKVNVGKSALYFSPNVQ--------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------MEFRSVI---STLLAKQVWHLSTNPSLLAAKVLKGRYAQHDSLLSAPSKSNCSVFWRGFVWARELVVSGMRKRIGNGRSTHFFRDPWIPK
                  M FRS +     LLAKQ W +   P+ L +++LK RY    +   A    + S  W+   W R+L+V GMR +IG G +     DPWIP 
Subjt:  ----------MEFRSVI---STLLAKQVWHLSTNPSLLAAKVLKGRYAQHDSLLSAPSKSNCSVFWRGFVWARELVVSGMRKRIGNGRSTHFFRDPWIPK

Query:  EHSFKPRPIMGRTNQDGALVSEFISPSMEWDLNKLKEVVNQEDMDIIAAIPISLANQEDQWIWHYCSHGNYTVRSGYKLARSISVDQESASSNNQRLWWK
          +FKP   +  T      VS FI+  MEW+++ L E     D+D I +IP+S    +D+ IWH+ S G Y V+SG+ LA S+     S++S+  R WW+
Subjt:  EHSFKPRPIMGRTNQDGALVSEFISPSMEWDLNKLKEVVNQEDMDIIAAIPISLANQEDQWIWHYCSHGNYTVRSGYKLARSISVDQESASSNNQRLWWK

Query:  TLWNSKMPQKIKLFIWKAYHGCLPTFYRLWERGIDVSPMCFLCNSKWETIDHALCGCKRAKRICNVLFHRVD--AAIPILNNFPDRVVWLARNLDDESFE
          WN  +P K+++F WK +H  LP    L+++ I  S  C LC S WE+I HAL GCK AK I       +D   A  + N   D +++L+   + E FE
Subjt:  TLWNSKMPQKIKLFIWKAYHGCLPTFYRLWERGIDVSPMCFLCNSKWETIDHALCGCKRAKRICNVLFHRVD--AAIPILNNFPDRVVWLARNLDDESFE

Query:  RACIAFWSLWNDRNSSNNG
              W +W+DRN   +G
Subjt:  RACIAFWSLWNDRNSSNNG

A0A803QLY3 Uncharacterized protein9.9e-11933.06Show/hide
Query:  MSLNYSGCFTVDCVGRSGGLCLFWKDDVDVTIRSYSQFHIDASITW-DSKMWRFSGLYGNPNASHRIHKWNLLRRLHNHDDSTWVVGGDFNATLLFEENE
        + L + GCF V+  G+SG L L W + V+  + S+S FHID+ I   + + WRF+  YG+P+ S R   W LL R+       W VGGDFN  L  +E  
Subjt:  MSLNYSGCFTVDCVGRSGGLCLFWKDDVDVTIRSYSQFHIDASITW-DSKMWRFSGLYGNPNASHRIHKWNLLRRLHNHDDSTWVVGGDFNATLLFEENE

Query:  GGNVVRDSQFQSFRDAMDDCGLQDLEFMGDMFTWSNRQERENQINERLDRFIANEDYLQLFPNTCVDHLQWAQSDHRPILMNGYRIDQDKLFQDWFVGII
        GG        ++FR A+D C L+++ F G  +TW N + + N I ERLDR   N+   Q+    C                            D+F+ + 
Subjt:  GGNVVRDSQFQSFRDAMDDCGLQDLEFMGDMFTWSNRQERENQINERLDRFIANEDYLQLFPNTCVDHLQWAQSDHRPILMNGYRIDQDKLFQDWFVGII

Query:  V--EVIHLDLKDVYNLDIKEMVSPSMNEKLMAPFTKCEIERAVNQMSPSKAPGPDDFPALFYQKYWDEVGNITALNCLDILNLKRSVREWNDTHIALIPK
            V   +L++   L +   +S + N+ L+ PFT  EI  A+  + P KAPG D  P LFY+KYW  +G+  +  CL ILN    V E NDT I LIPK
Subjt:  V--EVIHLDLKDVYNLDIKEMVSPSMNEKLMAPFTKCEIERAVNQMSPSKAPGPDDFPALFYQKYWDEVGNITALNCLDILNLKRSVREWNDTHIALIPK

Query:  ADLFLIM---LFWVMSACILLKIKKRGRKGWLALKLDMSKAYDRVEWCFLERLMLKLGFNANWVKLIMECVQTTCFSILLNGIPTDRIFPTRGLRQGDPL
             ++         +   L+ K+ G    +ALKLDMSKAY+RVEW FL  +M  LG+   WV+ IM+CV +  FS+L+NG    R  PTRGLRQGD L
Subjt:  ADLFLIM---LFWVMSACILLKIKKRGRKGWLALKLDMSKAYDRVEWCFLERLMLKLGFNANWVKLIMECVQTTCFSILLNGIPTDRIFPTRGLRQGDPL

Query:  SPYLFLLVSEVLSSLI---------------TGAVWV----------------------LRNILLQYEHASGQKVNVGKSALYFSPNVQ-----------
        SPYLFL+  E LS LI               +  V V                      ++ IL +Y   SGQ++N+ KS +     V            
Subjt:  SPYLFLLVSEVLSSLI---------------TGAVWV----------------------LRNILLQYEHASGQKVNVGKSALYFSPNVQ-----------

Query:  ----------------------------MEFRSVI---STLLAKQVWHLSTNPSLLAAKVLKGRYAQHDSLLSAPSKSNCSVFWRGFVWARELVVSGMRK
                                    + FRS+      LLAKQ W L  NP  L A VLK  Y    S + A      S  W+G +W R+++  G R 
Subjt:  ----------------------------MEFRSVI---STLLAKQVWHLSTNPSLLAAKVLKGRYAQHDSLLSAPSKSNCSVFWRGFVWARELVVSGMRK

Query:  RIGNGRSTHFFRDPWIPKEH----SFKPRPIMGRTNQDGALVSEFISPSMEWDLNKLKEVVNQEDMDIIAAIPISLANQEDQWIWHYCSHGNYTVRSGYK
        R+GNGR+   + D W+P+      + KPR       Q    +  FI+    W L  +    ++ED+  I  IPI L   ED   W Y S+GNY V+SGY+
Subjt:  RIGNGRSTHFFRDPWIPKEH----SFKPRPIMGRTNQDGALVSEFISPSMEWDLNKLKEVVNQEDMDIIAAIPISLANQEDQWIWHYCSHGNYTVRSGYK

Query:  LARSISVDQESASSNNQ-RLWWKTLWNSKMPQKIKLFIWKAYHGCLPTFYRLWERGIDVSPMCFLCNSKWETIDHALCGCKRAKRICNVLFHRVDAAIPI
        + R +++     S+  +   WWK LW+ ++P  +KLF W+  H  LPT   L  RG+++SP+C LC S  ET+ HAL  C + K +  +L          
Subjt:  LARSISVDQESASSNNQ-RLWWKTLWNSKMPQKIKLFIWKAYHGCLPTFYRLWERGIDVSPMCFLCNSKWETIDHALCGCKRAKRICNVLFHRVDAAIPI

Query:  LNNFPDRVVWLARNLDDESFERACIAFWSLWNDRNSSNNGMPIMDWVKRY
          +  D +  L   L    FE +    W++W +RN   N +P+M+  + Y
Subjt:  LNNFPDRVVWLARNLDDESFERACIAFWSLWNDRNSSNNGMPIMDWVKRY

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein1.3e-1127.92Show/hide
Query:  EKLMAPFTKCEIERAVNQMSPSKAPGPDDFPALFYQKYWDEVGNITALNCLDILNLKRSVRE-------WNDTHIALIP-------KADLFLIMLFWVMS
        E L  P T  EI   +N +   K+PGPD F A FYQ+Y +E+          +L L +S+ +       + +  I LIP       K + F  +    + 
Subjt:  EKLMAPFTKCEIERAVNQMSPSKAPGPDDFPALFYQKYWDEVGNITALNCLDILNLKRSVRE-------WNDTHIALIP-------KADLFLIMLFWVMS

Query:  ACILLK---------IKK----------RGRKGW---------------------LALKLDMSKAYDRVEWCFLERLMLKLGFNANWVKLIMECVQTTCF
        A IL K         IKK           G +GW                     + + +D  KA+D+++  F+ + + KLG +  ++K+I         
Subjt:  ACILLK---------IKK----------RGRKGW---------------------LALKLDMSKAYDRVEWCFLERLMLKLGFNANWVKLIMECVQTTCF

Query:  SILLNGIPTDRIFPTRGLRQGDPLSPYLFLLVSEVLSSLI
        +I+LNG   +      G RQG PLSP LF +V EVL+  I
Subjt:  SILLNGIPTDRIFPTRGLRQGDPLSPYLFLLVSEVLSSLI

P08548 LINE-1 reverse transcriptase homolog2.7e-1224.43Show/hide
Query:  EKLMAPFTKCEIERAVNQMSPSKAPGPDDFPALFYQKYWDEVGNITALNCLDILNLKRSVRE-------WNDTHIALIPK----------------ADLF
        E L  P +  EI   +  +   K+PGPD F + FYQ + +E+  I       +LNL +++ +       + + +I LIPK                 ++ 
Subjt:  EKLMAPFTKCEIERAVNQMSPSKAPGPDDFPALFYQKYWDEVGNITALNCLDILNLKRSVRE-------WNDTHIALIPK----------------ADLF

Query:  LIMLFWVMSACILLKIKK----------RGRKGW---------------------LALKLDMSKAYDRVEWCFLERLMLKLGFNANWVKLIMECVQTTCF
          +L  +++  I   IKK           G +GW                     + L +D  KA+D ++  F+ R + K+G    ++KLI         
Subjt:  LIMLFWVMSACILLKIKK----------RGRKGW---------------------LALKLDMSKAYDRVEWCFLERLMLKLGFNANWVKLIMECVQTTCF

Query:  SILLNGIPTDRIFPTR-GLRQGDPLSPYLFLLVSEVLSSLI--------------------------------TGAVWVLRNILLQYEHASGQKVNVGKS
        +I+LNG+   + FP R G RQG PLSP LF +V EVL+  I                                  +   L  ++ +Y + SG K+N  KS
Subjt:  SILLNGIPTDRIFPTR-GLRQGDPLSPYLFLLVSEVLSSLI--------------------------------TGAVWVLRNILLQYEHASGQKVNVGKS

Query:  ALYFSPN
          +   N
Subjt:  ALYFSPN

P11369 LINE-1 retrotransposable element ORF2 protein7.3e-1025.75Show/hide
Query:  EKLMAPFTKCEIERAVNQMSPSKAPGPDDFPALFYQKYWDEVGNITALNCLDILNLKRSVREWNDTHIALIP-------KADLFLIMLFWVMSACILLKI
        + L +P +  EIE  +N +   K+PGPD F A FYQ + +++  I       I         + +  I LIP       K + F  +    + A IL KI
Subjt:  EKLMAPFTKCEIERAVNQMSPSKAPGPDDFPALFYQKYWDEVGNITALNCLDILNLKRSVREWNDTHIALIP-------KADLFLIMLFWVMSACILLKI

Query:  KKR-------------------GRKGW---------------------LALKLDMSKAYDRVEWCFLERLMLKLGFNANWVKLIMECVQTTCFSILLNGI
                              G +GW                     + + LD  KA+D+++  F+ +++ + G    ++ +I         +I +NG 
Subjt:  KKR-------------------GRKGW---------------------LALKLDMSKAYDRVEWCFLERLMLKLGFNANWVKLIMECVQTTCFSILLNGI

Query:  PTDRIFPTRGLRQGDPLSPYLFLLVSEVLSSLI
          + I    G RQG PLSPYLF +V EVL+  I
Subjt:  PTDRIFPTRGLRQGDPLSPYLFLLVSEVLSSLI

P92555 Uncharacterized mitochondrial protein AtMg012503.7e-0664.86Show/hide
Query:  LLNGIPTDRIFPTRGLRQGDPLSPYLFLLVSEVLSSL
        ++NG P   + P+RGLRQGDPLSPYLF+L +EVLS L
Subjt:  LLNGIPTDRIFPTRGLRQGDPLSPYLFLLVSEVLSSL

P93295 Uncharacterized mitochondrial protein AtMg003109.8e-0733.33Show/hide
Query:  LLAKQVWHLSTNPSLLAAKVLKGRYAQHDSLLSAPSKSNCSVFWRGFVWARELVVSGMRKRIGNGRSTHFFRDPWIPKEHSFKP
        LLAKQ + +   P  L +++L+ RY  H S++     +  S  WR  +  REL+  G+ + IG+G  T  + D WI  E    P
Subjt:  LLAKQVWHLSTNPSLLAAKVLKGRYAQHDSLLSAPSKSNCSVFWRGFVWARELVVSGMRKRIGNGRSTHFFRDPWIPKEHSFKP

Arabidopsis top hitse value%identityAlignment
AT2G22440.1 BEST Arabidopsis thaliana protein match is: Ribonuclease H-like superfamily protein (TAIR:AT4G29090.1)2.0e-0733.67Show/hide
Query:  THFFRDPWIPKEHSFKPRPIMGRTN-QDGAL-VSEFISPSME-WDLNKLKEVVNQEDMDIIAAIPISLANQEDQWIWHYCSHGNYTVRSGYKLARSIS
        T  ++DPWIP   +   RP     N +D  L V++ I  +   W L++L+ +++  D+ +I  I  S     D + W +   GNYTV+SGY +AR +S
Subjt:  THFFRDPWIPKEHSFKPRPIMGRTN-QDGAL-VSEFISPSME-WDLNKLKEVVNQEDMDIIAAIPISLANQEDQWIWHYCSHGNYTVRSGYKLARSIS

AT3G09510.1 Ribonuclease H-like superfamily protein1.3e-2225.98Show/hide
Query:  LKGRYAQHDSLLSAPSKSNCSVFWRGFVWARELVVSGMRKRIGNGRSTHFFRDPWIPKEHSFKPRPIMGRTNQDGALVSEFISPSME---WDLNKLKEVV
        +K RY +  S+L A  +   S  W   +    L+  G R  IG+G++     D  +    S  PRP+          ++           WD +K+ + V
Subjt:  LKGRYAQHDSLLSAPSKSNCSVFWRGFVWARELVVSGMRKRIGNGRSTHFFRDPWIPKEHSFKPRPIMGRTNQDGALVSEFISPSME---WDLNKLKEVV

Query:  NQEDMDIIAAIPISLANQEDQWIWHYCSHGNYTVRSGY-KLARSISVDQESASSNNQRLWWKT-LWNSKMPQKIKLFIWKAYHGCLPTFYRLWERGIDVS
        +Q D   I  I ++ + + D+ IW+Y + G YTVRSGY  L    S +  + +  +  +  KT +WN  +  K+K F+W+A    L T  RL  RG+ + 
Subjt:  NQEDMDIIAAIPISLANQEDQWIWHYCSHGNYTVRSGY-KLARSISVDQESASSNNQRLWWKT-LWNSKMPQKIKLFIWKAYHGCLPTFYRLWERGIDVS

Query:  PMCFLCNSKWETIDHALCGCKRAKRICNVLFHRVDAAIPILNNFPDRVVWLARNLDDESFE--RACIAFWSLWNDRNSSNN
        P C  C+ + E+I+HAL  C  A     +    +     + N+F + +  +   + D +       +  W +W    + NN
Subjt:  PMCFLCNSKWETIDHALCGCKRAKRICNVLFHRVDAAIPILNNFPDRVVWLARNLDDESFE--RACIAFWSLWNDRNSSNN

AT3G26855.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein9.1e-0841.54Show/hide
Query:  WWKTLWNSKMPQKIKLFIWKAYHGCLPTFYRLWERGIDVSPMCFLCNSKWETIDHALCGCKRAKR
        W   +W+ K+  KIKL IWKA +  LP   +L  R I + P C  C   +ETI H L  C  A+R
Subjt:  WWKTLWNSKMPQKIKLFIWKAYHGCLPTFYRLWERGIDVSPMCFLCNSKWETIDHALCGCKRAKR

AT4G29090.1 Ribonuclease H-like superfamily protein3.7e-2526.77Show/hide
Query:  LLAKQVWHLSTNPSLLAAKVLKGRYAQHDSLLSAPSKSNCSVFWRGFVWARELVVSGMRKRIGNGRSTHFFRDPWIPKEHSFKPRPIMGRTNQDGALVSE
        LL KQ+W + + P  L AKV K RY      L+AP  S  S  W+    ++E++  G R  +GNG     +R  W+  + +     +     Q+ A VS 
Subjt:  LLAKQVWHLSTNPSLLAAKVLKGRYAQHDSLLSAPSKSNCSVFWRGFVWARELVVSGMRKRIGNGRSTHFFRDPWIPKEHSFKPRPIMGRTNQDGALVSE

Query:  FISPS-------MEWDLNKLKEVVNQEDMDIIAAIPISLANQEDQWIWHYCSHGNYTVRSGYKLARSI----SVDQESASSNNQRLWWKTLWNSKMPQKI
         +  S        EW  + ++ +  + +  +I  +        D + W Y S G+YTV+SGY +   I    S  QE +  +   ++ K +W S+   KI
Subjt:  FISPS-------MEWDLNKLKEVVNQEDMDIIAAIPISLANQEDQWIWHYCSHGNYTVRSGYKLARSI----SVDQESASSNNQRLWWKTLWNSKMPQKI

Query:  KLFIWKAYHGCLPTFYRLWERGIDVSPMCFLCNSKWETIDHALCGCKRAKRICNVLFHRVDAAIPILNNFPDRV---VWLARNL--DDESFERAC----I
        + F+WK     LP    L  R +     C  C S  ET++H L  C  A+    + +      IP+   + D +   ++   NL   +  +E+A      
Subjt:  KLFIWKAYHGCLPTFYRLWERGIDVSPMCFLCNSKWETIDHALCGCKRAKRICNVLFHRVDAAIPILNNFPDRV---VWLARNL--DDESFERAC----I

Query:  AFWSLWNDRN
          W LW +RN
Subjt:  AFWSLWNDRN

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein7.0e-0833.33Show/hide
Query:  LLAKQVWHLSTNPSLLAAKVLKGRYAQHDSLLSAPSKSNCSVFWRGFVWARELVVSGMRKRIGNGRSTHFFRDPWIPKEHSFKP
        LLAKQ + +   P  L +++L+ RY  H S++     +  S  WR  +  REL+  G+ + IG+G  T  + D WI  E    P
Subjt:  LLAKQVWHLSTNPSLLAAKVLKGRYAQHDSLLSAPSKSNCSVFWRGFVWARELVVSGMRKRIGNGRSTHFFRDPWIPKEHSFKP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGTTAAATTACTCCGGTTGTTTTACTGTGGATTGTGTGGGGCGCAGTGGGGGTCTGTGTTTGTTTTGGAAAGATGACGTCGATGTTACAATTAGATCGTACTCTCA
ATTTCATATTGATGCTTCGATAACTTGGGATTCCAAGATGTGGCGGTTTTCGGGCCTATACGGTAATCCCAACGCGAGTCACAGGATCCATAAATGGAATTTACTTCGAA
GATTACATAATCATGATGATTCTACTTGGGTGGTGGGGGGAGATTTTAATGCAACTCTTTTGTTCGAAGAAAATGAGGGTGGGAATGTAGTTCGTGATTCCCAATTTCAG
TCTTTTCGAGATGCTATGGACGATTGTGGGTTGCAGGACTTGGAGTTCATGGGAGATATGTTTACATGGTCTAATCGACAAGAACGGGAGAATCAGATCAACGAACGCCT
TGATAGATTCATTGCAAATGAGGATTATCTTCAATTATTTCCTAATACTTGTGTTGATCATTTGCAATGGGCTCAATCTGATCACCGACCGATTCTAATGAATGGATACA
GAATCGATCAAGACAAATTATTTCAGGATTGGTTTGTTGGAATAATCGTGGAAGTGATCCACCTCGACTTGAAAGATGTCTACAATCTTGATATTAAGGAGATGGTATCT
CCAAGCATGAATGAAAAGCTCATGGCGCCATTTACAAAATGCGAGATCGAGCGGGCGGTTAATCAAATGTCTCCATCTAAAGCTCCTGGTCCGGATGATTTTCCAGCATT
GTTCTACCAGAAATACTGGGATGAGGTTGGTAATATTACTGCTTTGAATTGTCTGGATATTCTCAATCTAAAACGATCGGTTAGGGAATGGAATGACACTCATATTGCAT
TGATCCCTAAAGCCGATCTATTTTTGATAATGTTATTTTGGGTCATGAGTGCTTGCATTCTATTAAAAATAAAAAAACGTGGTCGTAAGGGATGGTTGGCTTTAAAGTTA
GATATGAGTAAAGCCTACGACCGTGTTGAGTGGTGTTTTCTGGAGCGTCTAATGCTGAAACTTGGGTTCAATGCTAATTGGGTGAAGTTGATAATGGAATGTGTCCAAAC
TACTTGTTTTTCCATTTTGCTAAATGGCATACCCACCGATAGGATCTTTCCTACCCGAGGATTACGTCAGGGAGACCCTTTATCTCCTTATTTGTTTTTGCTTGTATCAG
AAGTTTTATCTTCACTGATTACAGGAGCGGTGTGGGTTTTGAGAAACATTTTGCTACAGTATGAACATGCGTCAGGACAGAAGGTTAATGTTGGGAAATCAGCCTTATAT
TTTTCCCCAAATGTACAAATGGAGTTCAGATCAGTTATATCCACTTTACTGGCAAAACAAGTGTGGCACTTGTCTACAAATCCATCTTTACTAGCTGCCAAGGTGCTAAA
GGGACGGTATGCACAGCATGATTCCCTATTATCAGCCCCATCTAAAAGTAATTGCTCTGTTTTCTGGCGAGGTTTTGTGTGGGCTCGGGAGCTGGTGGTAAGCGGAATGA
GGAAAAGAATTGGGAATGGCCGATCAACTCACTTTTTTCGAGATCCTTGGATTCCTAAGGAGCATTCTTTTAAGCCCAGGCCAATCATGGGAAGAACTAATCAGGATGGT
GCTTTAGTTTCTGAATTTATAAGCCCTTCGATGGAGTGGGATTTAAATAAACTAAAAGAGGTGGTGAATCAGGAGGATATGGATATTATAGCGGCTATTCCTATAAGCTT
AGCGAATCAAGAAGATCAGTGGATTTGGCATTATTGTTCTCATGGGAATTACACTGTTCGGAGTGGTTATAAACTAGCTAGATCGATTTCGGTCGATCAGGAATCTGCTA
GTTCTAACAACCAACGATTATGGTGGAAGACACTTTGGAATTCAAAAATGCCACAGAAGATAAAACTTTTTATTTGGAAGGCATACCATGGTTGTTTACCAACATTTTAT
CGGCTTTGGGAGCGAGGTATAGATGTGTCCCCTATGTGTTTTCTTTGTAATTCAAAGTGGGAAACAATCGATCACGCTCTATGCGGATGCAAGAGGGCTAAGAGAATATG
TAATGTGTTGTTTCATCGCGTGGATGCTGCAATTCCGATCCTAAATAATTTTCCGGACCGTGTGGTATGGCTAGCTAGAAACCTTGATGATGAATCATTTGAAAGAGCAT
GTATTGCTTTTTGGTCTTTGTGGAATGACAGAAATAGCTCTAATAACGGAATGCCTATTATGGATTGGGTGAAACGATATGCAGCGTGCTCTCTTGCAAAGGATGGCTCT
GGATATGGAGCTGTCATTACAGAGGCTAATGACAGGTTATGTGGTGCAATGGAATTTTTCGACCCCACCCGTCTTACGTCGTTTGCGGCAGAGGTGAATGCTTTGATGCA
TGGAGTTCGACTTTTGCAACGTTTGCAAATAAACGTGCGCGTGTTTGTTCGGATTCATCTAATGCCATCAAGATGCGTATTGGTGAAACTCCTATTACATCCGAGGCCAA
GGATGCTGACTTCTCCAAGAAATTGGCTTGCTTGCCCCAACCTTCTTTTGCCGGCTTGCAAATCAGTCACACGTCAGAATGACCCTCCATCAGCTGTTGTCGTACTTTTT
CACGCTCTGGGATTGGCGCCTGCTCTGAAAGATAACTTGGGCAAGGAAATAAGTACAGAAAGCATGCGTTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCGTTAAATTACTCCGGTTGTTTTACTGTGGATTGTGTGGGGCGCAGTGGGGGTCTGTGTTTGTTTTGGAAAGATGACGTCGATGTTACAATTAGATCGTACTCTCA
ATTTCATATTGATGCTTCGATAACTTGGGATTCCAAGATGTGGCGGTTTTCGGGCCTATACGGTAATCCCAACGCGAGTCACAGGATCCATAAATGGAATTTACTTCGAA
GATTACATAATCATGATGATTCTACTTGGGTGGTGGGGGGAGATTTTAATGCAACTCTTTTGTTCGAAGAAAATGAGGGTGGGAATGTAGTTCGTGATTCCCAATTTCAG
TCTTTTCGAGATGCTATGGACGATTGTGGGTTGCAGGACTTGGAGTTCATGGGAGATATGTTTACATGGTCTAATCGACAAGAACGGGAGAATCAGATCAACGAACGCCT
TGATAGATTCATTGCAAATGAGGATTATCTTCAATTATTTCCTAATACTTGTGTTGATCATTTGCAATGGGCTCAATCTGATCACCGACCGATTCTAATGAATGGATACA
GAATCGATCAAGACAAATTATTTCAGGATTGGTTTGTTGGAATAATCGTGGAAGTGATCCACCTCGACTTGAAAGATGTCTACAATCTTGATATTAAGGAGATGGTATCT
CCAAGCATGAATGAAAAGCTCATGGCGCCATTTACAAAATGCGAGATCGAGCGGGCGGTTAATCAAATGTCTCCATCTAAAGCTCCTGGTCCGGATGATTTTCCAGCATT
GTTCTACCAGAAATACTGGGATGAGGTTGGTAATATTACTGCTTTGAATTGTCTGGATATTCTCAATCTAAAACGATCGGTTAGGGAATGGAATGACACTCATATTGCAT
TGATCCCTAAAGCCGATCTATTTTTGATAATGTTATTTTGGGTCATGAGTGCTTGCATTCTATTAAAAATAAAAAAACGTGGTCGTAAGGGATGGTTGGCTTTAAAGTTA
GATATGAGTAAAGCCTACGACCGTGTTGAGTGGTGTTTTCTGGAGCGTCTAATGCTGAAACTTGGGTTCAATGCTAATTGGGTGAAGTTGATAATGGAATGTGTCCAAAC
TACTTGTTTTTCCATTTTGCTAAATGGCATACCCACCGATAGGATCTTTCCTACCCGAGGATTACGTCAGGGAGACCCTTTATCTCCTTATTTGTTTTTGCTTGTATCAG
AAGTTTTATCTTCACTGATTACAGGAGCGGTGTGGGTTTTGAGAAACATTTTGCTACAGTATGAACATGCGTCAGGACAGAAGGTTAATGTTGGGAAATCAGCCTTATAT
TTTTCCCCAAATGTACAAATGGAGTTCAGATCAGTTATATCCACTTTACTGGCAAAACAAGTGTGGCACTTGTCTACAAATCCATCTTTACTAGCTGCCAAGGTGCTAAA
GGGACGGTATGCACAGCATGATTCCCTATTATCAGCCCCATCTAAAAGTAATTGCTCTGTTTTCTGGCGAGGTTTTGTGTGGGCTCGGGAGCTGGTGGTAAGCGGAATGA
GGAAAAGAATTGGGAATGGCCGATCAACTCACTTTTTTCGAGATCCTTGGATTCCTAAGGAGCATTCTTTTAAGCCCAGGCCAATCATGGGAAGAACTAATCAGGATGGT
GCTTTAGTTTCTGAATTTATAAGCCCTTCGATGGAGTGGGATTTAAATAAACTAAAAGAGGTGGTGAATCAGGAGGATATGGATATTATAGCGGCTATTCCTATAAGCTT
AGCGAATCAAGAAGATCAGTGGATTTGGCATTATTGTTCTCATGGGAATTACACTGTTCGGAGTGGTTATAAACTAGCTAGATCGATTTCGGTCGATCAGGAATCTGCTA
GTTCTAACAACCAACGATTATGGTGGAAGACACTTTGGAATTCAAAAATGCCACAGAAGATAAAACTTTTTATTTGGAAGGCATACCATGGTTGTTTACCAACATTTTAT
CGGCTTTGGGAGCGAGGTATAGATGTGTCCCCTATGTGTTTTCTTTGTAATTCAAAGTGGGAAACAATCGATCACGCTCTATGCGGATGCAAGAGGGCTAAGAGAATATG
TAATGTGTTGTTTCATCGCGTGGATGCTGCAATTCCGATCCTAAATAATTTTCCGGACCGTGTGGTATGGCTAGCTAGAAACCTTGATGATGAATCATTTGAAAGAGCAT
GTATTGCTTTTTGGTCTTTGTGGAATGACAGAAATAGCTCTAATAACGGAATGCCTATTATGGATTGGGTGAAACGATATGCAGCGTGCTCTCTTGCAAAGGATGGCTCT
GGATATGGAGCTGTCATTACAGAGGCTAATGACAGGTTATGTGGTGCAATGGAATTTTTCGACCCCACCCGTCTTACGTCGTTTGCGGCAGAGGTGAATGCTTTGATGCA
TGGAGTTCGACTTTTGCAACGTTTGCAAATAAACGTGCGCGTGTTTGTTCGGATTCATCTAATGCCATCAAGATGCGTATTGGTGAAACTCCTATTACATCCGAGGCCAA
GGATGCTGACTTCTCCAAGAAATTGGCTTGCTTGCCCCAACCTTCTTTTGCCGGCTTGCAAATCAGTCACACGTCAGAATGACCCTCCATCAGCTGTTGTCGTACTTTTT
CACGCTCTGGGATTGGCGCCTGCTCTGAAAGATAACTTGGGCAAGGAAATAAGTACAGAAAGCATGCGTTAG
Protein sequenceShow/hide protein sequence
MSLNYSGCFTVDCVGRSGGLCLFWKDDVDVTIRSYSQFHIDASITWDSKMWRFSGLYGNPNASHRIHKWNLLRRLHNHDDSTWVVGGDFNATLLFEENEGGNVVRDSQFQ
SFRDAMDDCGLQDLEFMGDMFTWSNRQERENQINERLDRFIANEDYLQLFPNTCVDHLQWAQSDHRPILMNGYRIDQDKLFQDWFVGIIVEVIHLDLKDVYNLDIKEMVS
PSMNEKLMAPFTKCEIERAVNQMSPSKAPGPDDFPALFYQKYWDEVGNITALNCLDILNLKRSVREWNDTHIALIPKADLFLIMLFWVMSACILLKIKKRGRKGWLALKL
DMSKAYDRVEWCFLERLMLKLGFNANWVKLIMECVQTTCFSILLNGIPTDRIFPTRGLRQGDPLSPYLFLLVSEVLSSLITGAVWVLRNILLQYEHASGQKVNVGKSALY
FSPNVQMEFRSVISTLLAKQVWHLSTNPSLLAAKVLKGRYAQHDSLLSAPSKSNCSVFWRGFVWARELVVSGMRKRIGNGRSTHFFRDPWIPKEHSFKPRPIMGRTNQDG
ALVSEFISPSMEWDLNKLKEVVNQEDMDIIAAIPISLANQEDQWIWHYCSHGNYTVRSGYKLARSISVDQESASSNNQRLWWKTLWNSKMPQKIKLFIWKAYHGCLPTFY
RLWERGIDVSPMCFLCNSKWETIDHALCGCKRAKRICNVLFHRVDAAIPILNNFPDRVVWLARNLDDESFERACIAFWSLWNDRNSSNNGMPIMDWVKRYAACSLAKDGS
GYGAVITEANDRLCGAMEFFDPTRLTSFAAEVNALMHGVRLLQRLQINVRVFVRIHLMPSRCVLVKLLLHPRPRMLTSPRNWLACPNLLLPACKSVTRQNDPPSAVVVLF
HALGLAPALKDNLGKEISTESMR