; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0026268 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0026268
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionIntegrase catalytic domain-containing protein
Genome locationchr10:33625084..33639190
RNA-Seq ExpressionLag0026268
SyntenyLag0026268
Gene Ontology termsGO:0097159 - organic cyclic compound binding (molecular function)
GO:1901363 - heterocyclic compound binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0067173.1 retrotransposon protein [Cucumis melo var. makuwa]5.9e-9259.88Show/hide
Query:  KEIVDKLAVASIKIEDEEILVHTLNDLPTEFNAFRTSIRTRSGSLSLEELHNLLEAEEKTMKSNVVVDPIPTVMAAVKGSFSNPNNRGRGRRQSAKNRGF
        K ++DKL  ASI +EDEEILVHTLN LP  FNAFRTSIRTRSG++SLEELH LL +EE TM     ++ IPT MAA   S  + ++RGRGRR  +     
Subjt:  KEIVDKLAVASIKIEDEEILVHTLNDLPTEFNAFRTSIRTRSGSLSLEELHNLLEAEEKTMKSNVVVDPIPTVMAAVKGSFSNPNNRGRGRRQSAKNRGF

Query:  SNRGFEESQRGRGFQNRAAFDGLSPFNRGSQSWDSFGSRGSF--GNGSHAGQQFGPNFSSPTANGSGGNFGSASSN-------FDSGFNGGRIFCQICSK
         N     S RG  F +          +RG +S ++FG++ +    N ++ G    P+  +P      G  GS+ S+        +SG+N GRIFCQIC K
Subjt:  SNRGFEESQRGRGFQNRAAFDGLSPFNRGSQSWDSFGSRGSF--GNGSHAGQQFGPNFSSPTANGSGGNFGSASSN-------FDSGFNGGRIFCQICSK

Query:  SGHGALDCYNRMNFSYQGRHPPAQLAAMAVNSMNSQSSNDSANNFWLSDSGCNVHMTNDYANLNLSNNYNGEESITVGNGQPLNIQNTGSGTLSTSSHTF
          HGALDCYN MNFSYQ RH P+QLAAMAVNSMNSQ S+++ NNFWLSDSG NVHMTN+ ANLNLSNNYNGEE++TVGNGQPLNI+NTGSG LST SHTF
Subjt:  SGHGALDCYNRMNFSYQGRHPPAQLAAMAVNSMNSQSSNDSANNFWLSDSGCNVHMTNDYANLNLSNNYNGEESITVGNGQPLNIQNTGSGTLSTSSHTF

Query:  NLSKILHAPELAANLLSVHKFCLDNNCIFIFDTDWFLIQDKVYG
        NLSKILHAP+LA NLLSVHKFCLDNNC+F+F TD FLIQDKV G
Subjt:  NLSKILHAPELAANLLSVHKFCLDNNCIFIFDTDWFLIQDKVYG

KAA8524269.1 hypothetical protein F0562_010692 [Nyssa sinensis]9.9e-7137.42Show/hide
Query:  ETIEQYTCRIKEIVDKLAVASIKIEDEEILVHTLNDLPTEFNAFRTSIRTRSGSLSLEELHNLLEAEEKTMKSNVVVDPIPTVMAAVKGSFSNPNNRGRG
        ++I+ Y  +IK+  D LA  S+ IEDE+IL++ LN LP E+NAF+TSIRT+S +++LEE++ +L+ EE+T++S    +  P    A+  +   PN     
Subjt:  ETIEQYTCRIKEIVDKLAVASIKIEDEEILVHTLNDLPTEFNAFRTSIRTRSGSLSLEELHNLLEAEEKTMKSNVVVDPIPTVMAAVKGSFSNPNNRGRG

Query:  RRQSAKNRGFSNRGFEESQRGRG-FQNRAAFDGLSPFNRGSQSWDSFGSRGSFGNGSHAGQQFG-PNFSSPTANGSGGNFGSASSNFDSGFNGGRIFCQI
            + NRG+S   F    RGRG F NR                   G   SFG        FG  N   PT      N  S +S+         + CQI
Subjt:  RRQSAKNRGFSNRGFEESQRGRG-FQNRAAFDGLSPFNRGSQSWDSFGSRGSFGNGSHAGQQFG-PNFSSPTANGSGGNFGSASSNFDSGFNGGRIFCQI

Query:  CSKSGHGALDCYNRMNFSYQGRHPPAQLAAMAVNSMNSQSSNDSANNFWLSDSGCNVHMTNDYANLNLSNNYNGEESITVGNGQPLNIQNTGSGTLSTSS
        C+K+GH ALDCY+RM+FSYQG+ P  QL AM   S    + +D + N+W +D+G   H+T D ANLN    Y G+++IT+ NGQ L+I ++G  ++  + 
Subjt:  CSKSGHGALDCYNRMNFSYQGRHPPAQLAAMAVNSMNSQSSNDSANNFWLSDSGCNVHMTNDYANLNLSNNYNGEESITVGNGQPLNIQNTGSGTLSTSS

Query:  HTFNLSKILHAPELAANLLSVHKFCLDNNCIFIFDTDWFLIQDKVYGKILYTGESINGLYPIPSPSM-------LSSDLHPKNFN---------------
        HTF L+ +L  P +A NLLSVH+FC DN+C FIFD++ F IQDK   ++L+ G S +GLYP+P+ S+       L   LH +++N               
Subjt:  HTFNLSKILHAPELAANLLSVHKFCLDNNCIFIFDTDWFLIQDKVYGKILYTGESINGLYPIPSPSM-------LSSDLHPKNFN---------------

Query:  ------FMAKQECS-LWHHRFGHPAPKILRSSISRLGLSLSSSISTCECISCFKAKMSKLSFPMSVSLSISPLDLIHNDVWGPFPITIIQPTTVTIS
              ++ KQ  + LWH R GHP+   L+S +S   ++     S   C  C   KM+KL FP+S + S +PL L+H+D+WGP P T     T  +S
Subjt:  ------FMAKQECS-LWHHRFGHPAPKILRSSISRLGLSLSSSISTCECISCFKAKMSKLSFPMSVSLSISPLDLIHNDVWGPFPITIIQPTTVTIS

TYK06402.1 putative mitochondrial protein [Cucumis melo var. makuwa]3.3e-6654.9Show/hide
Query:  KLPSETIEQYTCRIKEIVDKLAVASIKIEDEEILVHTLNDLPTEFNAFRTSIRTRSGSLSLEELHNLLEAEEKTMKSNVVVDPIPTVMAAVKGSFSNPNN
        K PSE+I++YT RIK +VDKLA  S+ +EDEEILVHTLN L   FNAFRTSIRTR+ ++SLEELH L  +EE TM     ++ IPT MAA      + ++
Subjt:  KLPSETIEQYTCRIKEIVDKLAVASIKIEDEEILVHTLNDLPTEFNAFRTSIRTRSGSLSLEELHNLLEAEEKTMKSNVVVDPIPTVMAAVKGSFSNPNN

Query:  RGRGRRQSAKNRGFSNRGFEESQRGRGFQNRAAFDGLSPFNRGSQSWDSFGSR-GSFGNGSHAGQQFGP--NFSSPTAN-----GSGGNFGSA---SSNF
        RGRGRR  +      N     S RG  F +          +RG  S ++FG++   F    +   +F P  +F+S   N     GS  ++G +    +NF
Subjt:  RGRGRRQSAKNRGFSNRGFEESQRGRGFQNRAAFDGLSPFNRGSQSWDSFGSR-GSFGNGSHAGQQFGP--NFSSPTAN-----GSGGNFGSA---SSNF

Query:  DSGFNGGRIFCQICSKSGHGALDCYNRMNFSYQGRHPPAQLAAMAVNSMNSQSSNDSANNFWLSDSGCNVHMTNDYANLNLSNNYNGEESITVGNGQPLN
          G+N GRIFCQIC K GHGALDCYNRMNFSYQGRHPP+QLAAM VNSMNSQ S ++ NNFWL DSGCNVHMTN+ ANLNLSNNYNGEE++TV N QPLN
Subjt:  DSGFNGGRIFCQICSKSGHGALDCYNRMNFSYQGRHPPAQLAAMAVNSMNSQSSNDSANNFWLSDSGCNVHMTNDYANLNLSNNYNGEESITVGNGQPLN

Query:  IQNTGS
        I+NTGS
Subjt:  IQNTGS

XP_016902697.1 PREDICTED: uncharacterized protein LOC107991825 [Cucumis melo]4.1e-9357.64Show/hide
Query:  GYVFLNLELVLLVLVKNKVALKLPSETIEQYTCRIKEIVDKLAVASIKIEDEEILVHTLNDLPTEFNAFRTSIRTRSGSLSLEELHNLLEAEEKTMKSNV
        GY++      + V V +    K PSE+I+QYT RIK +VDKL  AS+ +EDEEILVHTLN LP  FNAFRTSIRTR+G++SLEELH LL +EE  M    
Subjt:  GYVFLNLELVLLVLVKNKVALKLPSETIEQYTCRIKEIVDKLAVASIKIEDEEILVHTLNDLPTEFNAFRTSIRTRSGSLSLEELHNLLEAEEKTMKSNV

Query:  VVDPIPTVMAAVKGSFSNPNNRGRGRRQSAKNRGFSNRGFEESQRGRGFQNRAAFDGLSPFNRGSQSWDSFGSRGSF--GNGSHAGQQFGP-NFSSPTAN
         ++ IPT MAA      +  +RGRGRR  +      N     S RG  F +          +RG  S ++FG++ +    N ++ G    P NF+S   N
Subjt:  VVDPIPTVMAAVKGSFSNPNNRGRGRRQSAKNRGFSNRGFEESQRGRGFQNRAAFDGLSPFNRGSQSWDSFGSRGSF--GNGSHAGQQFGP-NFSSPTAN

Query:  GSG--GNFGSASSNF----DSGFNGGRIFCQICSKSGHGALDCYNRMNFSYQGRHPPAQLAAMAVNSMNSQSSNDSANNFWLSDSGCNVHMTNDYANLNL
          G  G+  S   +F    +SG+N GRIFCQIC K GH ALDCYNRMNFSYQG+HPP+QL AMA+NSMNSQ   ++ NNF LSDSGCNVHMTN+ ANLNL
Subjt:  GSG--GNFGSASSNF----DSGFNGGRIFCQICSKSGHGALDCYNRMNFSYQGRHPPAQLAAMAVNSMNSQSSNDSANNFWLSDSGCNVHMTNDYANLNL

Query:  SNNYNGEESITVGNGQPLNIQNTGSGTLSTSSHTFNLSKILHAPELAANLLSVHKFCLDNNCIFIFDTDWFLI
        SNNYNGEE++TVGNGQP+NI+NTGSG L T SHTFNLSKILHAP+LA NLLSVHKFCLDNNC+FIFDTDWFLI
Subjt:  SNNYNGEESITVGNGQPLNIQNTGSGTLSTSSHTFNLSKILHAPELAANLLSVHKFCLDNNCIFIFDTDWFLI

XP_022158189.1 uncharacterized protein LOC111024722 [Momordica charantia]1.2e-7965.22Show/hide
Query:  MAVNSMNSQSSNDSANNFWLSDSGCNVHMTNDYANLNLSNNYNGEESITVGNGQPLNIQNTGSGTLSTSSHTFNLSKILHAPELAANLLSVHKFCLDNNC
        MAVN+M   SS+++ NNFWLSDSGCN H+TND  NLNL ++YNGEE +TVGNGQ LNI +TGSG LS SSH F +S +LHAP+LA NLLSVHKFCLDN+C
Subjt:  MAVNSMNSQSSNDSANNFWLSDSGCNVHMTNDYANLNLSNNYNGEESITVGNGQPLNIQNTGSGTLSTSSHTFNLSKILHAPELAANLLSVHKFCLDNNC

Query:  IFIFDTDWFLIQDKVYGKILYTGESINGLYPIPSPSMLSS---DLHPKNFNFMAKQECSLWHHRFGHPAPKILRSSISRLGLSLSSSISTCECISCFKAK
        IF++D+DWFLIQDKV    LY G+S+NGLYPIPS S LSS   +LHPKN   +AK    LWHHR GH +PKILR ++S  GLS+S S +TC+C SC KAK
Subjt:  IFIFDTDWFLIQDKVYGKILYTGESINGLYPIPSPSMLSS---DLHPKNFNFMAKQECSLWHHRFGHPAPKILRSSISRLGLSLSSSISTCECISCFKAK

Query:  MSKLSFPMSVSLSISPLDLIHNDVWGPFPI
        MSKLSFPMS S S +PL+ +H+DVWG  P+
Subjt:  MSKLSFPMSVSLSISPLDLIHNDVWGPFPI

TrEMBL top hitse value%identityAlignment
A0A1S4E394 uncharacterized protein LOC1079918252.0e-9357.64Show/hide
Query:  GYVFLNLELVLLVLVKNKVALKLPSETIEQYTCRIKEIVDKLAVASIKIEDEEILVHTLNDLPTEFNAFRTSIRTRSGSLSLEELHNLLEAEEKTMKSNV
        GY++      + V V +    K PSE+I+QYT RIK +VDKL  AS+ +EDEEILVHTLN LP  FNAFRTSIRTR+G++SLEELH LL +EE  M    
Subjt:  GYVFLNLELVLLVLVKNKVALKLPSETIEQYTCRIKEIVDKLAVASIKIEDEEILVHTLNDLPTEFNAFRTSIRTRSGSLSLEELHNLLEAEEKTMKSNV

Query:  VVDPIPTVMAAVKGSFSNPNNRGRGRRQSAKNRGFSNRGFEESQRGRGFQNRAAFDGLSPFNRGSQSWDSFGSRGSF--GNGSHAGQQFGP-NFSSPTAN
         ++ IPT MAA      +  +RGRGRR  +      N     S RG  F +          +RG  S ++FG++ +    N ++ G    P NF+S   N
Subjt:  VVDPIPTVMAAVKGSFSNPNNRGRGRRQSAKNRGFSNRGFEESQRGRGFQNRAAFDGLSPFNRGSQSWDSFGSRGSF--GNGSHAGQQFGP-NFSSPTAN

Query:  GSG--GNFGSASSNF----DSGFNGGRIFCQICSKSGHGALDCYNRMNFSYQGRHPPAQLAAMAVNSMNSQSSNDSANNFWLSDSGCNVHMTNDYANLNL
          G  G+  S   +F    +SG+N GRIFCQIC K GH ALDCYNRMNFSYQG+HPP+QL AMA+NSMNSQ   ++ NNF LSDSGCNVHMTN+ ANLNL
Subjt:  GSG--GNFGSASSNF----DSGFNGGRIFCQICSKSGHGALDCYNRMNFSYQGRHPPAQLAAMAVNSMNSQSSNDSANNFWLSDSGCNVHMTNDYANLNL

Query:  SNNYNGEESITVGNGQPLNIQNTGSGTLSTSSHTFNLSKILHAPELAANLLSVHKFCLDNNCIFIFDTDWFLI
        SNNYNGEE++TVGNGQP+NI+NTGSG L T SHTFNLSKILHAP+LA NLLSVHKFCLDNNC+FIFDTDWFLI
Subjt:  SNNYNGEESITVGNGQPLNIQNTGSGTLSTSSHTFNLSKILHAPELAANLLSVHKFCLDNNCIFIFDTDWFLI

A0A2N9EZ90 Uncharacterized protein1.1e-7241.15Show/hide
Query:  LLVLVKNKVALKLPSETIEQYTCRIKEIVDKLAVASIKIEDEEILVHTLNDLPTEFNAFRTSIRTRSGSLSLEELHNLLEAEEKTMKSNVVVDPIPTVMA
        ++ L K    +K  ++TI QY  RIKE +DKLA     ++DE++L   L  LP+E+ +F +++ T++ S+S EELH L+ ++E+ +KS+       ++MA
Subjt:  LLVLVKNKVALKLPSETIEQYTCRIKEIVDKLAVASIKIEDEEILVHTLNDLPTEFNAFRTSIRTRSGSLSLEELHNLLEAEEKTMKSNVVVDPIPTVMA

Query:  AVKGSFSNPNNRGRGRRQSAKNRGFSNRGFEE-SQRGRGFQNRAAFDGLSPFNRGSQSWDSFGSRGSFGNGSHAGQQFGPNF--SSPTANGSGGNFGSAS
            + S  N+  RG        GF NRG      RGRG  NR  F     F  G        S G +   +    Q  PNF  SSP  + S  NF  +S
Subjt:  AVKGSFSNPNNRGRGRRQSAKNRGFSNRGFEE-SQRGRGFQNRAAFDGLSPFNRGSQSWDSFGSRGSFGNGSHAGQQFGPNF--SSPTANGSGGNFGSAS

Query:  SNFDSGFNGGRIFCQICSKSGHGALDCYNRMNFSYQGRHPPAQLAAMAVNSMNSQSSNDSANNFWLSDSGCNVHMTNDYANLNLSNNYNGEESITVGNGQ
         NF S  N  R  CQIC K GH ALDCYNRMN+SYQGRHPPA+LAAMA     S  S   + N W+SD+G   H T D ANL  S+ YN  + ++VGNGQ
Subjt:  SNFDSGFNGGRIFCQICSKSGHGALDCYNRMNFSYQGRHPPAQLAAMAVNSMNSQSSNDSANNFWLSDSGCNVHMTNDYANLNLSNNYNGEESITVGNGQ

Query:  PLNIQNTGSGTLSTSSHTFNLSKILHAPELAANLLSVHKFCLDNNCIFIFDTDWFLIQDKVYGKILYTGESINGLYPI-------PSPSMLSSDLHPKNF
         L I + G+  L TSS+ F L  IL  P +A+NLLSV+KFC DN+C F FD+D F IQD++ GK LY G S +GLYP+        S S  SS       
Subjt:  PLNIQNTGSGTLSTSSHTFNLSKILHAPELAANLLSVHKFCLDNNCIFIFDTDWFLIQDKVYGKILYTGESINGLYPI-------PSPSMLSSDLHPKNF

Query:  NFMAKQECSLWHHRFGHPAPKILRSSISRLGLSLSSSISTCECISCFKAKMSKLSFPMSVSLSISPLDLIHNDVWGPFPIT--------IIQPTTVTISS
            +   +LWH RFGHP  ++LR  +     S S S  +  C  C + KM++L F  S + +  PL ++++DVWGP PIT         I P T T  S
Subjt:  NFMAKQECSLWHHRFGHPAPKILRSSISRLGLSLSSSISTCECISCFKAKMSKLSFPMSVSLSISPLDLIHNDVWGPFPIT--------IIQPTTVTISS

Query:  LLP
        LLP
Subjt:  LLP

A0A2N9HPA0 Uncharacterized protein8.7e-7336.97Show/hide
Query:  VRNQRPSRCESRCQPQHPFKEQCCVRLFQQGSTPAWFSIDIFVHSV-AVRDSLENGYVFLNLELVLLVLVKNKVALKLPSETIEQYTCRIKEIVDKLAVA
        ++N   S  E+    Q   ++Q  + L     +P   S+ +   +   V   LE  Y   +   +L + ++     K  +++I  +  +IK+  D+L   
Subjt:  VRNQRPSRCESRCQPQHPFKEQCCVRLFQQGSTPAWFSIDIFVHSV-AVRDSLENGYVFLNLELVLLVLVKNKVALKLPSETIEQYTCRIKEIVDKLAVA

Query:  SIKIEDEEILVHTLNDLPTEFNAFRTSIRTRSGSLSLEELHNLLEAEEKTMKSNVVVDPIPTVMAAVKGS----FSNPNNRGRGRRQSAKNRGFSNRGFE
         ++I++EEIL   L  LP E++AF T+IRTR+ + S E++H LL AEE++++S + +      MA +  +    FS+  NRGRGR  ++ NRG       
Subjt:  SIKIEDEEILVHTLNDLPTEFNAFRTSIRTRSGSLSLEELHNLLEAEEKTMKSNVVVDPIPTVMAAVKGS----FSNPNNRGRGRRQSAKNRGFSNRGFE

Query:  ESQRGRGFQNRAAFDGLSPFNRGSQSWDSFGSRGSFGNGSHAGQQFGPNFSSPTANGSGGNFGSASSNFDSGFNGGRIFCQICSKSGHGALDCYNRMNFS
           RGR F N +   G +  N G+ S  + GS G F N  +         SSPT +                    R  CQIC K+GH ALDCY+RM++S
Subjt:  ESQRGRGFQNRAAFDGLSPFNRGSQSWDSFGSRGSFGNGSHAGQQFGPNFSSPTANGSGGNFGSASSNFDSGFNGGRIFCQICSKSGHGALDCYNRMNFS

Query:  YQGRHPPAQLAAMAVNSMNSQSSNDSANNFWLSDSGCNVHMTNDYANLNLSNNYNGEESITVGNGQPLNIQNTGSGTLSTSSHTFNLSKILHAPELAANL
        YQG+ PP++LAAMA  S NSQ S+ S   +W+SD+G   H T D + +     Y G +  TVGNGQ + I + G+  L  SSH F+L K+L  P +A+NL
Subjt:  YQGRHPPAQLAAMAVNSMNSQSSNDSANNFWLSDSGCNVHMTNDYANLNLSNNYNGEESITVGNGQPLNIQNTGSGTLSTSSHTFNLSKILHAPELAANL

Query:  LSVHKFCLDNNCIFIFDTDWFLIQDKVYGKILYTGESINGLYPIPSPSMLSSDLHPKNFNFMAKQECSLWHHRFGHPAPKILRSSISRLGLSLSSSIST-
        LSV+KFC DNNC F+FD + F I+D   GK+LY G S N LYPI   S L    H  NF+ +      +WH R GHP  ++ +   S   +  SSS  T 
Subjt:  LSVHKFCLDNNCIFIFDTDWFLIQDKVYGKILYTGESINGLYPIPSPSMLSSDLHPKNFNFMAKQECSLWHHRFGHPAPKILRSSISRLGLSLSSSIST-

Query:  CECISCFKAKMSKLSFPMSVSLSISPLDLIHNDVWGPFPIT
          C  C + KM+ L F  SVS +  PL++IH+DVWGP PIT
Subjt:  CECISCFKAKMSKLSFPMSVSLSISPLDLIHNDVWGPFPIT

A0A5A7VGG0 Retrotransposon protein2.9e-9259.88Show/hide
Query:  KEIVDKLAVASIKIEDEEILVHTLNDLPTEFNAFRTSIRTRSGSLSLEELHNLLEAEEKTMKSNVVVDPIPTVMAAVKGSFSNPNNRGRGRRQSAKNRGF
        K ++DKL  ASI +EDEEILVHTLN LP  FNAFRTSIRTRSG++SLEELH LL +EE TM     ++ IPT MAA   S  + ++RGRGRR  +     
Subjt:  KEIVDKLAVASIKIEDEEILVHTLNDLPTEFNAFRTSIRTRSGSLSLEELHNLLEAEEKTMKSNVVVDPIPTVMAAVKGSFSNPNNRGRGRRQSAKNRGF

Query:  SNRGFEESQRGRGFQNRAAFDGLSPFNRGSQSWDSFGSRGSF--GNGSHAGQQFGPNFSSPTANGSGGNFGSASSN-------FDSGFNGGRIFCQICSK
         N     S RG  F +          +RG +S ++FG++ +    N ++ G    P+  +P      G  GS+ S+        +SG+N GRIFCQIC K
Subjt:  SNRGFEESQRGRGFQNRAAFDGLSPFNRGSQSWDSFGSRGSF--GNGSHAGQQFGPNFSSPTANGSGGNFGSASSN-------FDSGFNGGRIFCQICSK

Query:  SGHGALDCYNRMNFSYQGRHPPAQLAAMAVNSMNSQSSNDSANNFWLSDSGCNVHMTNDYANLNLSNNYNGEESITVGNGQPLNIQNTGSGTLSTSSHTF
          HGALDCYN MNFSYQ RH P+QLAAMAVNSMNSQ S+++ NNFWLSDSG NVHMTN+ ANLNLSNNYNGEE++TVGNGQPLNI+NTGSG LST SHTF
Subjt:  SGHGALDCYNRMNFSYQGRHPPAQLAAMAVNSMNSQSSNDSANNFWLSDSGCNVHMTNDYANLNLSNNYNGEESITVGNGQPLNIQNTGSGTLSTSSHTF

Query:  NLSKILHAPELAANLLSVHKFCLDNNCIFIFDTDWFLIQDKVYG
        NLSKILHAP+LA NLLSVHKFCLDNNC+F+F TD FLIQDKV G
Subjt:  NLSKILHAPELAANLLSVHKFCLDNNCIFIFDTDWFLIQDKVYG

A0A6J1DYN6 uncharacterized protein LOC1110247225.6e-8065.22Show/hide
Query:  MAVNSMNSQSSNDSANNFWLSDSGCNVHMTNDYANLNLSNNYNGEESITVGNGQPLNIQNTGSGTLSTSSHTFNLSKILHAPELAANLLSVHKFCLDNNC
        MAVN+M   SS+++ NNFWLSDSGCN H+TND  NLNL ++YNGEE +TVGNGQ LNI +TGSG LS SSH F +S +LHAP+LA NLLSVHKFCLDN+C
Subjt:  MAVNSMNSQSSNDSANNFWLSDSGCNVHMTNDYANLNLSNNYNGEESITVGNGQPLNIQNTGSGTLSTSSHTFNLSKILHAPELAANLLSVHKFCLDNNC

Query:  IFIFDTDWFLIQDKVYGKILYTGESINGLYPIPSPSMLSS---DLHPKNFNFMAKQECSLWHHRFGHPAPKILRSSISRLGLSLSSSISTCECISCFKAK
        IF++D+DWFLIQDKV    LY G+S+NGLYPIPS S LSS   +LHPKN   +AK    LWHHR GH +PKILR ++S  GLS+S S +TC+C SC KAK
Subjt:  IFIFDTDWFLIQDKVYGKILYTGESINGLYPIPSPSMLSS---DLHPKNFNFMAKQECSLWHHRFGHPAPKILRSSISRLGLSLSSSISTCECISCFKAK

Query:  MSKLSFPMSVSLSISPLDLIHNDVWGPFPI
        MSKLSFPMS S S +PL+ +H+DVWG  P+
Subjt:  MSKLSFPMSVSLSISPLDLIHNDVWGPFPI

SwissProt top hitse value%identityAlignment
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE16.7e-3030.13Show/hide
Query:  SETIEQYTCRIKEIVDKLAVASIKIEDEEILVHTLNDLPTEFNAFRTSIRTRSGSLSLEELH-NLLEAEEKTMK-SNVVVDPIPTVMAAVKGSFSNPNNR
        ++TI+ Y   +    D+LA+    ++ +E +   L +LP E+      I  +    +L E+H  LL  E K +  S+  V PI T  A    + +  NN 
Subjt:  SETIEQYTCRIKEIVDKLAVASIKIEDEEILVHTLNDLPTEFNAFRTSIRTRSGSLSLEELH-NLLEAEEKTMK-SNVVVDPIPTVMAAVKGSFSNPNNR

Query:  GRGRRQSAKNRGFSNRGFEESQRGRGFQNRAAFDGLSPFNRGSQSWDSFGSRGSFGNGSHAGQQFGPNFSSPTANGSGGNFGSASSNFDSGFNGGRIFCQ
          G R    N  + NR                       N  S+ W                QQ   NF  P  N S    G                CQ
Subjt:  GRGRRQSAKNRGFSNRGFEESQRGRGFQNRAAFDGLSPFNRGSQSWDSFGSRGSFGNGSHAGQQFGPNFSSPTANGSGGNFGSASSNFDSGFNGGRIFCQ

Query:  ICSKSGHGALDCYNRMNF--SYQGRHPPAQLAAMAVNSMNSQSSNDSANNFWLSDSGCNVHMTNDYANLNLSNNYNGEESITVGNGQPLNIQNTGSGTLS
        IC   GH A  C    +F  S   + PP+        +  +  S  S+NN WL DSG   H+T+D+ NL+L   Y G + + V +G  + I +TGS +LS
Subjt:  ICSKSGHGALDCYNRMNF--SYQGRHPPAQLAAMAVNSMNSQSSNDSANNFWLSDSGCNVHMTNDYANLNLSNNYNGEESITVGNGQPLNIQNTGSGTLS

Query:  TSSHTFNLSKILHAPELAANLLSVHKFCLDNNCIFIFDTDWFLIQDKVYGKILYTGESINGLY--PIPSPSMLSSDLHPKNFNFMAKQECSLWHHRFGHP
        T S   NL  IL+ P +  NL+SV++ C  N     F    F ++D   G  L  G++ + LY  PI S   +S    P      +K   S WH R GHP
Subjt:  TSSHTFNLSKILHAPELAANLLSVHKFCLDNNCIFIFDTDWFLIQDKVYGKILYTGESINGLY--PIPSPSMLSSDLHPKNFNFMAKQECSLWHHRFGHP

Query:  APKILRSSISRLGLS-LSSSISTCECISCFKAKMSKLSFPMSVSLSISPLDLIHNDVW
        AP IL S IS   LS L+ S     C  C   K +K+ F  S   S  PL+ I++DVW
Subjt:  APKILRSSISRLGLS-LSSSISTCECISCFKAKMSKLSFPMSVSLSISPLDLIHNDVW

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.3e-2528.92Show/hide
Query:  DKLAVASIKIEDEEILVHTLNDLPTEFNAFRTSIRTRSGSLSLEELH-NLLEAEEKTMKSN-VVVDPIPTVMAAVKGSFSNPNNRGRGRRQSAKNRGFSN
        D+LA+    ++ +E +   L +LP ++      I  +    SL E+H  L+  E K +  N   V PI   +   + + +N N   RG      NR ++N
Subjt:  DKLAVASIKIEDEEILVHTLNDLPTEFNAFRTSIRTRSGSLSLEELH-NLLEAEEKTMKSN-VVVDPIPTVMAAVKGSFSNPNNRGRGRRQSAKNRGFSN

Query:  RGFEESQRGRGFQNRAAFDGLSPFNRGSQSWDSFGSRGSFGNGSHAGQQFGPNFSSPTANGSGGNFGSASSNFDSGFNGGRIFCQICSKSGHGALDCYNR
                                N  S SW    S                              GS S N       GR  CQICS  GH A  C   
Subjt:  RGFEESQRGRGFQNRAAFDGLSPFNRGSQSWDSFGSRGSFGNGSHAGQQFGPNFSSPTANGSGGNFGSASSNFDSGFNGGRIFCQICSKSGHGALDCYNR

Query:  MNF-----SYQGRHPPAQLAAMAVNSMNSQSSNDSANNFWLSDSGCNVHMTNDYANLNLSNNYNGEESITVGNGQPLNIQNTGSGTLSTSSHTFNLSKIL
          F       Q   P       A  ++NS     +ANN WL DSG   H+T+D+ NL+    Y G + + + +G  + I +TGS +L TSS + +L+K+L
Subjt:  MNF-----SYQGRHPPAQLAAMAVNSMNSQSSNDSANNFWLSDSGCNVHMTNDYANLNLSNNYNGEESITVGNGQPLNIQNTGSGTLSTSSHTFNLSKIL

Query:  HAPELAANLLSVHKFCLDNNCIFIFDTDWFLIQDKVYGKILYTGESINGLY--PIPSPSMLSSDLHPKNFNFMAKQECSLWHHRFGHPAPKILRSSISRL
        + P +  NL+SV++ C  N     F    F ++D   G  L  G++ + LY  PI S   +S    P      +K   S WH R GHP+  IL S IS  
Subjt:  HAPELAANLLSVHKFCLDNNCIFIFDTDWFLIQDKVYGKILYTGESINGLY--PIPSPSMLSSDLHPKNFNFMAKQECSLWHHRFGHPAPKILRSSISRL

Query:  GLS-LSSSISTCECISCFKAKMSKLSFPMSVSLSISPLDLIHNDVW
         L  L+ S     C  CF  K  K+ F  S   S  PL+ I++DVW
Subjt:  GLS-LSSSISTCECISCFKAKMSKLSFPMSVSLSISPLDLIHNDVW

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGTAGTAGGCAACAATCGGGGATTGTCTATGCCCCGATCAATGCCAACAACTTTGAGCTGAAGACCGTTAAGATTAATAGAGTCTCTGAGGATGCTATTCGCTTACG
CTTATTTCCTTTTTCTTTGCAGGATAAAGCCGAGATTGGTTGCAGTATTACCCCTGGGAGCATCACCACCTGGGATGCTTTGTCCAGGCCTTTTGAAGAAATTTTCCCTC
CTACAAAGACGGTCAAGCTGAGGACCGAGATTGGGACATTCCAACAACAATTGATGAGCAGTTGTTCGAAGCTTGGGAGCGATTTAAAGACTGCAGGTGGGACTCTGTTG
TCCAAGACCGTGGAAAATGCTCGCATACTTCTAGAGGATATGGCCACCAACAGCTATCAGTGGCCATCTGAGCGGTCTACACCAAAAAGATTGTTGCTGGAGTGTACAGG
GAGTGCACAGTCAATTGAATCAGCTGCTGCTTTAGCATCTAGACCTCAGGAGGAGACCATTGAACAGTGCTGCCATAAGAACATTGAGACTCAGCTGGGACAGTTGGTGA
ATGTTGTGAGCACCATGAATAAAGGTAAGGCCCCAGCTGAGCAAGAGAAAACCCAGATGGAGTACTGTAAGGCAATCACTGTGCACCAGGAGGAAGTTGAAGAGGAGCCT
GAGTTTAGGATTATGACACGCCTACAGGGGAAGCTGAGGAGGACACATCATCAGATGAGGCTGAAAAGCCTGAACCTGAGCCTCCTATTCCTTCTCCCCACTGATGTTCC
CAAAGAAAAGAAAAAAAAAAAGAAAAAGAACAATCAGGTTCATGAAGGAATGGTTAGCAAAGAAGCAAAGGAAAAGAAGGTTGACACTGTATATCTTGCTTCCACATGCA
GCACCAGAGTACAAAGAAGGAAGACCATTCCTGCTACTGGGCGAGTGATTATTGATATTGAGCGCAGGGAGCTCACTATTAGAGTCAGGAACGAAAAAGAAATTTTAAAG
CAGTTGAAGACTCTAAAGATGAAGTACTTTTATGGGCTACAGGAAAGGTGCAAGAAAGAGCACCTCTGTTGGATTCACAGAAAAGAAGCCTCCTTGAAGCACGGTCAACA
CGTCGAGCTAATGACGTTAAACAAGCGCTTTGGGAGGCAACCCAAGTCACAGCATCTCGACGCTGCGACCTTAGCGTCTCGACGCTGTGTTATTTCGCTTACTGAAACGC
GCGTTTTGAAGGCAGCGTCTCGACGCAACGACGAAAATCAGAATAAAAGCTTCCTTTCTGCTAGGTTTTTAGGGGATTCAATTTTGGGACTTCTTGGAGCCGTAAACAGA
GCTAAAACAGAGGATTTGAAGGCTGAAGCAAAGGGAGAAAATCGGAAATCAACCCATTGTTCGTGGGATCGTGACGGGACGTCGGTCTTGGCCTACATTTATGGTCCAGT
AAGTTTGAGCTTGCCTTTCAGCAAACTCTTGCCTACTGGCCGACAATTGATGAAGCGGATGAATGGTCCTTCTTCTACTTCCTCTACAGAAATCAAAGACCCAGCCGCTG
TGAGAGTCGCTGCCAACCGTGAGTCACACGTGAAGGCCTTCGCGCTGCTCTATCTCCCTCGTTTCATTGTATTCGCGCCGTCTCTCTCTGTCCTTGCATTTTCGGCCAAA
GAAGCTCGATGTTGGTTTTGTGCGCGTCCAACAACTAGGGTTCCGTATTCCTCGCAGATTCGCCTCTGTCCAGCGCCATTCAGTCTTAAAGGTGGGCGTTTCCGCACCAT
TTGGCGATTCCAGAAGCGTTTAGCCTCGGATGTCTCTGGTTTGCTGAGGTTTCAGCCTTTCTTTGGTGTTTTTGGGCAACCCCTTTTCAGCGCCGCCTACCCTCCAGCCA
CCGTTCAAGCCCTCACGCCGTCGTTTTCTCTGTTCAGTCGCCATCGTCACCGCCGTGGGTGTGCTCTCTCTCTCCCTCGTCGCGATCTCTCTCCCCGTGGATCGTGGTCG
CAAATCTCTTCCTTTCGTGTCGTGTCGTCGTTGATCCTTGTCAGAAATCAAAGACCCAGCCGCTGTGAGAGTCGCTGCCAACCACAACACCCATTTAAGGAGCAGTGCTG
TGTTAGGTTGTTCCAGCAAGGTTCAACCCCTGCTTGGTTTAGTATCGATATATTTGTTCATTCTGTTGCGGTGAGGGATTCACTTGAGAATGGTTATGTTTTCTTGAACT
TGGAACTAGTGTTATTAGTTTTGGTAAAGAATAAAGTTGCCTTGAAGCTTCCATCTGAAACTATCGAACAATATACTTGCCGGATTAAAGAGATTGTCGACAAGCTTGCT
GTAGCTTCAATAAAAATTGAAGACGAAGAGATTTTGGTTCATACCTTGAACGACCTCCCAACTGAATTCAATGCTTTTCGCACTTCCATACGAACTCGAAGTGGATCTCT
CTCCTTGGAAGAACTTCACAATCTTCTCGAGGCTGAAGAAAAAACGATGAAATCAAATGTTGTCGTCGATCCCATTCCAACTGTTATGGCTGCCGTAAAAGGTTCCTTCT
CAAACCCTAACAATCGAGGCAGAGGACGGCGACAATCAGCCAAAAATCGTGGGTTTTCAAATCGAGGATTTGAGGAATCTCAGCGTGGCCGTGGGTTTCAAAATCGCGCT
GCATTTGATGGCCTAAGCCCATTTAACCGAGGTTCACAGTCGTGGGATTCCTTTGGGTCACGCGGTAGTTTTGGCAATGGGTCACATGCAGGTCAGCAATTCGGGCCAAA
TTTTTCGAGCCCAACAGCAAATGGGTCTGGAGGTAATTTCGGGTCTGCTTCTTCAAATTTTGATTCCGGATTTAATGGCGGTCGAATCTTCTGTCAAATCTGCTCCAAAT
CTGGGCATGGAGCGCTTGATTGTTATAATAGAATGAATTTCTCATATCAAGGACGACATCCACCAGCACAATTGGCTGCTATGGCGGTAAATTCCATGAATTCACAGTCT
TCAAATGATTCTGCTAACAATTTTTGGTTGTCTGACAGTGGTTGCAATGTTCATATGACCAACGATTATGCAAATCTCAATCTCTCCAACAATTATAATGGAGAAGAATC
TATCACAGTGGGTAATGGCCAACCTTTAAATATTCAAAACACAGGCAGTGGTACACTCTCTACCTCTTCTCATACTTTTAATCTTTCCAAAATCCTTCATGCTCCTGAAC
TTGCAGCAAATCTTTTATCAGTTCATAAATTTTGCCTTGATAATAACTGTATTTTTATCTTTGATACTGACTGGTTCCTAATTCAAGATAAAGTCTATGGCAAAATCTTA
TACACTGGTGAAAGCATCAATGGCCTTTATCCCATCCCAAGTCCATCCATGCTATCTTCTGATCTGCACCCCAAAAACTTTAATTTTATGGCAAAACAGGAGTGTTCTCT
ATGGCATCACCGGTTTGGTCATCCCGCTCCCAAAATATTACGCTCTAGTATATCTCGTCTTGGTCTTTCTTTATCTAGTTCTATTTCTACTTGCGAATGCATTAGTTGCT
TCAAAGCTAAAATGTCTAAACTTTCATTTCCTATGTCTGTTTCTCTTTCCATTTCTCCTTTAGACCTTATTCATAATGATGTATGGGGACCTTTCCCTATAACCATTATT
CAACCCACTACAGTAACTATCTCCTCACTTCTTCCATGCGACTCCTTGACTTTTCCGACGACCGCCGGCCGCCGTCAGCCGGCGAAGTTTCCGACGAATTTTCCGACGAC
CGCCGACGACTTTTCCGGCGATCGCCAATGA
mRNA sequenceShow/hide mRNA sequence
ATGCGTAGTAGGCAACAATCGGGGATTGTCTATGCCCCGATCAATGCCAACAACTTTGAGCTGAAGACCGTTAAGATTAATAGAGTCTCTGAGGATGCTATTCGCTTACG
CTTATTTCCTTTTTCTTTGCAGGATAAAGCCGAGATTGGTTGCAGTATTACCCCTGGGAGCATCACCACCTGGGATGCTTTGTCCAGGCCTTTTGAAGAAATTTTCCCTC
CTACAAAGACGGTCAAGCTGAGGACCGAGATTGGGACATTCCAACAACAATTGATGAGCAGTTGTTCGAAGCTTGGGAGCGATTTAAAGACTGCAGGTGGGACTCTGTTG
TCCAAGACCGTGGAAAATGCTCGCATACTTCTAGAGGATATGGCCACCAACAGCTATCAGTGGCCATCTGAGCGGTCTACACCAAAAAGATTGTTGCTGGAGTGTACAGG
GAGTGCACAGTCAATTGAATCAGCTGCTGCTTTAGCATCTAGACCTCAGGAGGAGACCATTGAACAGTGCTGCCATAAGAACATTGAGACTCAGCTGGGACAGTTGGTGA
ATGTTGTGAGCACCATGAATAAAGGTAAGGCCCCAGCTGAGCAAGAGAAAACCCAGATGGAGTACTGTAAGGCAATCACTGTGCACCAGGAGGAAGTTGAAGAGGAGCCT
GAGTTTAGGATTATGACACGCCTACAGGGGAAGCTGAGGAGGACACATCATCAGATGAGGCTGAAAAGCCTGAACCTGAGCCTCCTATTCCTTCTCCCCACTGATGTTCC
CAAAGAAAAGAAAAAAAAAAAGAAAAAGAACAATCAGGTTCATGAAGGAATGGTTAGCAAAGAAGCAAAGGAAAAGAAGGTTGACACTGTATATCTTGCTTCCACATGCA
GCACCAGAGTACAAAGAAGGAAGACCATTCCTGCTACTGGGCGAGTGATTATTGATATTGAGCGCAGGGAGCTCACTATTAGAGTCAGGAACGAAAAAGAAATTTTAAAG
CAGTTGAAGACTCTAAAGATGAAGTACTTTTATGGGCTACAGGAAAGGTGCAAGAAAGAGCACCTCTGTTGGATTCACAGAAAAGAAGCCTCCTTGAAGCACGGTCAACA
CGTCGAGCTAATGACGTTAAACAAGCGCTTTGGGAGGCAACCCAAGTCACAGCATCTCGACGCTGCGACCTTAGCGTCTCGACGCTGTGTTATTTCGCTTACTGAAACGC
GCGTTTTGAAGGCAGCGTCTCGACGCAACGACGAAAATCAGAATAAAAGCTTCCTTTCTGCTAGGTTTTTAGGGGATTCAATTTTGGGACTTCTTGGAGCCGTAAACAGA
GCTAAAACAGAGGATTTGAAGGCTGAAGCAAAGGGAGAAAATCGGAAATCAACCCATTGTTCGTGGGATCGTGACGGGACGTCGGTCTTGGCCTACATTTATGGTCCAGT
AAGTTTGAGCTTGCCTTTCAGCAAACTCTTGCCTACTGGCCGACAATTGATGAAGCGGATGAATGGTCCTTCTTCTACTTCCTCTACAGAAATCAAAGACCCAGCCGCTG
TGAGAGTCGCTGCCAACCGTGAGTCACACGTGAAGGCCTTCGCGCTGCTCTATCTCCCTCGTTTCATTGTATTCGCGCCGTCTCTCTCTGTCCTTGCATTTTCGGCCAAA
GAAGCTCGATGTTGGTTTTGTGCGCGTCCAACAACTAGGGTTCCGTATTCCTCGCAGATTCGCCTCTGTCCAGCGCCATTCAGTCTTAAAGGTGGGCGTTTCCGCACCAT
TTGGCGATTCCAGAAGCGTTTAGCCTCGGATGTCTCTGGTTTGCTGAGGTTTCAGCCTTTCTTTGGTGTTTTTGGGCAACCCCTTTTCAGCGCCGCCTACCCTCCAGCCA
CCGTTCAAGCCCTCACGCCGTCGTTTTCTCTGTTCAGTCGCCATCGTCACCGCCGTGGGTGTGCTCTCTCTCTCCCTCGTCGCGATCTCTCTCCCCGTGGATCGTGGTCG
CAAATCTCTTCCTTTCGTGTCGTGTCGTCGTTGATCCTTGTCAGAAATCAAAGACCCAGCCGCTGTGAGAGTCGCTGCCAACCACAACACCCATTTAAGGAGCAGTGCTG
TGTTAGGTTGTTCCAGCAAGGTTCAACCCCTGCTTGGTTTAGTATCGATATATTTGTTCATTCTGTTGCGGTGAGGGATTCACTTGAGAATGGTTATGTTTTCTTGAACT
TGGAACTAGTGTTATTAGTTTTGGTAAAGAATAAAGTTGCCTTGAAGCTTCCATCTGAAACTATCGAACAATATACTTGCCGGATTAAAGAGATTGTCGACAAGCTTGCT
GTAGCTTCAATAAAAATTGAAGACGAAGAGATTTTGGTTCATACCTTGAACGACCTCCCAACTGAATTCAATGCTTTTCGCACTTCCATACGAACTCGAAGTGGATCTCT
CTCCTTGGAAGAACTTCACAATCTTCTCGAGGCTGAAGAAAAAACGATGAAATCAAATGTTGTCGTCGATCCCATTCCAACTGTTATGGCTGCCGTAAAAGGTTCCTTCT
CAAACCCTAACAATCGAGGCAGAGGACGGCGACAATCAGCCAAAAATCGTGGGTTTTCAAATCGAGGATTTGAGGAATCTCAGCGTGGCCGTGGGTTTCAAAATCGCGCT
GCATTTGATGGCCTAAGCCCATTTAACCGAGGTTCACAGTCGTGGGATTCCTTTGGGTCACGCGGTAGTTTTGGCAATGGGTCACATGCAGGTCAGCAATTCGGGCCAAA
TTTTTCGAGCCCAACAGCAAATGGGTCTGGAGGTAATTTCGGGTCTGCTTCTTCAAATTTTGATTCCGGATTTAATGGCGGTCGAATCTTCTGTCAAATCTGCTCCAAAT
CTGGGCATGGAGCGCTTGATTGTTATAATAGAATGAATTTCTCATATCAAGGACGACATCCACCAGCACAATTGGCTGCTATGGCGGTAAATTCCATGAATTCACAGTCT
TCAAATGATTCTGCTAACAATTTTTGGTTGTCTGACAGTGGTTGCAATGTTCATATGACCAACGATTATGCAAATCTCAATCTCTCCAACAATTATAATGGAGAAGAATC
TATCACAGTGGGTAATGGCCAACCTTTAAATATTCAAAACACAGGCAGTGGTACACTCTCTACCTCTTCTCATACTTTTAATCTTTCCAAAATCCTTCATGCTCCTGAAC
TTGCAGCAAATCTTTTATCAGTTCATAAATTTTGCCTTGATAATAACTGTATTTTTATCTTTGATACTGACTGGTTCCTAATTCAAGATAAAGTCTATGGCAAAATCTTA
TACACTGGTGAAAGCATCAATGGCCTTTATCCCATCCCAAGTCCATCCATGCTATCTTCTGATCTGCACCCCAAAAACTTTAATTTTATGGCAAAACAGGAGTGTTCTCT
ATGGCATCACCGGTTTGGTCATCCCGCTCCCAAAATATTACGCTCTAGTATATCTCGTCTTGGTCTTTCTTTATCTAGTTCTATTTCTACTTGCGAATGCATTAGTTGCT
TCAAAGCTAAAATGTCTAAACTTTCATTTCCTATGTCTGTTTCTCTTTCCATTTCTCCTTTAGACCTTATTCATAATGATGTATGGGGACCTTTCCCTATAACCATTATT
CAACCCACTACAGTAACTATCTCCTCACTTCTTCCATGCGACTCCTTGACTTTTCCGACGACCGCCGGCCGCCGTCAGCCGGCGAAGTTTCCGACGAATTTTCCGACGAC
CGCCGACGACTTTTCCGGCGATCGCCAATGA
Protein sequenceShow/hide protein sequence
MRSRQQSGIVYAPINANNFELKTVKINRVSEDAIRLRLFPFSLQDKAEIGCSITPGSITTWDALSRPFEEIFPPTKTVKLRTEIGTFQQQLMSSCSKLGSDLKTAGGTLL
SKTVENARILLEDMATNSYQWPSERSTPKRLLLECTGSAQSIESAAALASRPQEETIEQCCHKNIETQLGQLVNVVSTMNKGKAPAEQEKTQMEYCKAITVHQEEVEEEP
EFRIMTRLQGKLRRTHHQMRLKSLNLSLLFLLPTDVPKEKKKKKKKNNQVHEGMVSKEAKEKKVDTVYLASTCSTRVQRRKTIPATGRVIIDIERRELTIRVRNEKEILK
QLKTLKMKYFYGLQERCKKEHLCWIHRKEASLKHGQHVELMTLNKRFGRQPKSQHLDAATLASRRCVISLTETRVLKAASRRNDENQNKSFLSARFLGDSILGLLGAVNR
AKTEDLKAEAKGENRKSTHCSWDRDGTSVLAYIYGPVSLSLPFSKLLPTGRQLMKRMNGPSSTSSTEIKDPAAVRVAANRESHVKAFALLYLPRFIVFAPSLSVLAFSAK
EARCWFCARPTTRVPYSSQIRLCPAPFSLKGGRFRTIWRFQKRLASDVSGLLRFQPFFGVFGQPLFSAAYPPATVQALTPSFSLFSRHRHRRGCALSLPRRDLSPRGSWS
QISSFRVVSSLILVRNQRPSRCESRCQPQHPFKEQCCVRLFQQGSTPAWFSIDIFVHSVAVRDSLENGYVFLNLELVLLVLVKNKVALKLPSETIEQYTCRIKEIVDKLA
VASIKIEDEEILVHTLNDLPTEFNAFRTSIRTRSGSLSLEELHNLLEAEEKTMKSNVVVDPIPTVMAAVKGSFSNPNNRGRGRRQSAKNRGFSNRGFEESQRGRGFQNRA
AFDGLSPFNRGSQSWDSFGSRGSFGNGSHAGQQFGPNFSSPTANGSGGNFGSASSNFDSGFNGGRIFCQICSKSGHGALDCYNRMNFSYQGRHPPAQLAAMAVNSMNSQS
SNDSANNFWLSDSGCNVHMTNDYANLNLSNNYNGEESITVGNGQPLNIQNTGSGTLSTSSHTFNLSKILHAPELAANLLSVHKFCLDNNCIFIFDTDWFLIQDKVYGKIL
YTGESINGLYPIPSPSMLSSDLHPKNFNFMAKQECSLWHHRFGHPAPKILRSSISRLGLSLSSSISTCECISCFKAKMSKLSFPMSVSLSISPLDLIHNDVWGPFPITII
QPTTVTISSLLPCDSLTFPTTAGRRQPAKFPTNFPTTADDFSGDRQ