; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0015837 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0015837
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr12:26749139..26752555
RNA-Seq ExpressionLag0015837
SyntenyLag0015837
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG8501049.1 hypothetical protein CXB51_003148 [Gossypium anomalum]2.6e-11846.93Show/hide
Query:  LEDETFSEAWERFKELLQKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKTFDEAYEILERISINSCQWSDVRGTN-KKVKSVLEVDGVSTIR
        ++DE+  EAWERFKELL+KCPHHG+PHCIQ+ETFYNGLN  T+ +VDASA GA+L+K+++EAYEI+ERI+ NS QW   R T+ ++V  + EVD  +++ 
Subjt:  LEDETFSEAWERFKELLQKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKTFDEAYEILERISINSCQWSDVRGTN-KKVKSVLEVDGVSTIR

Query:  VDLAMIANALKNVTTISHQQPPAMEPAAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGNQRNNPYSNFYNPGFAKAQVMSQQNKQALPQQNSGNSLE
          ++ I++ LKN+TT       A  P    NQ  + ACVYCGE H +E CPSNP S++++G   +N   N+  P   +    SQQ ++ + Q  S NSLE
Subjt:  VDLAMIANALKNVTTISHQQPPAMEPAAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGNQRNNPYSNFYNPGFAKAQVMSQQNKQALPQQNSGNSLE

Query:  AMMKEFMARTDAAIQSNQASMRALELQVSQLANELKARPQGKLPSDTEHPRREGKEQVKAVTLRS-ELESGQGAGGSKENVGASGSVPDVEPPYVPPPP-
         ++K +MA+       N A++R LE QV QLA EL+ RPQG LPSDT++ R  GKE  KA+TLRS E          KE+V A  S     P  V  PP 
Subjt:  AMMKEFMARTDAAIQSNQASMRALELQVSQLANELKARPQGKLPSDTEHPRREGKEQVKAVTLRS-ELESGQGAGGSKENVGASGSVPDVEPPYVPPPP-

Query:  ---------YVPPLPFPQRQKPKNQDGPFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRWKEL----------------------------
                 Y PP P+PQR + + Q+  FKKFL++LKQLHINIPLVEA+EQMPNY KF+KDIL+KK++  E                             
Subjt:  ---------YVPPLPFPQRQKPKNQDGPFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRWKEL----------------------------

Query:  ----------GRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLAT----
                  G+ALCDLGASINLMP+S+++KLGIGE RPTTVTLQLADRS+ + EGKI+DVLV+VDKFIFP DF+ILD+EADK+VPIILGRPFLAT    
Subjt:  ----------GRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLAT----

Query:  ----------------------------DEMEDCSFIRILESTVIETSIQDSAD
                                    D ++DCS +  LE  ++E  +    D
Subjt:  ----------------------------DEMEDCSFIRILESTVIETSIQDSAD

XP_022929949.1 uncharacterized protein LOC111436411 [Cucurbita moschata]1.9e-14042.74Show/hide
Query:  MSDPPGVRFELDPEIERTFRNRRKEQRRNQMENVPQL---PQRNNQAENPILIAN-------------DRTGAIRAYAVPMFNELNPGIARPQIQAANFE
        M+ P G+ F LDPEIERTFR R K++++   +N+ Q+    Q N + ENP ++AN             DR  AIRAYA P   ELNP I RP+IQ   FE
Subjt:  MSDPPGVRFELDPEIERTFRNRRKEQRRNQMENVPQL---PQRNNQAENPILIAN-------------DRTGAIRAYAVPMFNELNPGIARPQIQAANFE

Query:  MKPVMFQMLQTVGQFHG-----------------------------------------------------------------------------------
        +KPVMFQMLQT+GQFHG                                                                                   
Subjt:  MKPVMFQMLQTVGQFHG-----------------------------------------------------------------------------------

Query:  --------FRQLEDETFSEAWERFKELLQKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKTFDEAYEILERISINSCQWSDVRGT-NKKVKS
                F+Q EDET SEA ERFKE+L+KCPHHGLPHCIQMETFYNGLN VT+ +VDASA GA+L+KT++EAYEILERI+ N+CQW+DVR    +K + 
Subjt:  --------FRQLEDETFSEAWERFKELLQKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKTFDEAYEILERISINSCQWSDVRGT-NKKVKS

Query:  VLEVDGVSTIRVDLAMIANALKNVTTISHQQPPA-MEPAAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGNQ------RNNPYSNFYNPG-------
        VLEVD +S+I   LA + N L+N+         A +  AA +NQ   E+CVYCGE+H ++ CPSNPAS+F+VGNQ      +NNP+SN YNPG       
Subjt:  VLEVDGVSTIRVDLAMIANALKNVTTISHQQPPA-MEPAAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGNQ------RNNPYSNFYNPG-------

Query:  -----------------------------FAKAQVMSQQNKQALPQQNSGNSLEAMMKEFMARTDAAIQSNQASMRALELQVSQLANELKARPQGKLPSD
                                     ++  QV +Q       Q  S  S+E+++KE+MA+ DA IQS QAS+R LE+Q+    N  +     +  +D
Subjt:  -----------------------------FAKAQVMSQQNKQALPQQNSGNSLEAMMKEFMARTDAAIQSNQASMRALELQVSQLANELKARPQGKLPSD

Query:  TEHPRREGKEQVKAVTLRSELESGQGAGGSKENVGASGSVPDVEPPYVPPPPYVPPLPFPQRQKPKNQDGPFKKFLEILKQLHINIPLVEAIEQMPNYAK
        T+    E   Q +     +E+E        +  +  + S       Y P P      PFPQR K K ++  F+KF++ILK++HINIPLVEA++QMPNY K
Subjt:  TEHPRREGKEQVKAVTLRSELESGQGAGGSKENVGASGSVPDVEPPYVPPPPYVPPLPFPQRQKPKNQDGPFKKFLEILKQLHINIPLVEAIEQMPNYAK

Query:  FLKDILTKKKRW--------------------------------------KELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGK
        FLKD+L  ++++                                      KELGRALCDLGA+INLMPLS+Y+KLGIGEARPTTVTLQLADRSITYPEGK
Subjt:  FLKDILTKKKRW--------------------------------------KELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGK

Query:  IEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFL
        IED+L++VDKFIF  DFIILDYE D DVPIILGRPFL
Subjt:  IEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFL

XP_030505184.1 uncharacterized protein LOC115720166 [Cannabis sativa]1.8e-13040.94Show/hide
Query:  QAENPILIANDRTGAIRAYAVPMFNELNPGIARPQIQAANFEMKPVMFQMLQTVGQFH------------------------------------------
        Q  +PI++ +DR  AIR YA PMFNELNPGI RP+IQA  FE+KPVMFQMLQTVGQF                                           
Subjt:  QAENPILIANDRTGAIRAYAVPMFNELNPGIARPQIQAANFEMKPVMFQMLQTVGQFH------------------------------------------

Query:  ------------------------------------------GFRQLEDETFSEAWERFKELLQKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAGGAL
                                                   F QLEDE+ S+AWERFKELL+KCPHHG+PHCIQMETFYNGLN  +Q ++DASA GA+
Subjt:  ------------------------------------------GFRQLEDETFSEAWERFKELLQKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAGGAL

Query:  LAKTFDEAYEILERISINSCQWSDVRGT-NKKVKSVLEVDGVSTIRVDLAMIANALKNVTTISHQQPPAMEPAAVVNQVTDEACVYCGEDHNYEFCPSNP
        L+K+++EA+EILE I+ N+ QWS+ R   ++KV  VLEVD ++ +   +A + N LKN++  + +    ++PAA + Q  D +CV+C E H +E CPSNP
Subjt:  LAKTFDEAYEILERISINSCQWSDVRGT-NKKVKSVLEVDGVSTIRVDLAMIANALKNVTTISHQQPPAMEPAAVVNQVTDEACVYCGEDHNYEFCPSNP

Query:  ASVFFVGNQ----RNNPYSNFYNPGFAKAQVMS----------QQNKQALP-------------QQNSGNSLEAMMKEFMARTDAAIQSNQASMRALELQ
         SV ++GNQ     N  +SN YN  +     +S           Q +QA P             Q +  +SLE++M+++MA+ DA IQS  A +R LELQ
Subjt:  ASVFFVGNQ----RNNPYSNFYNPGFAKAQVMS----------QQNKQALP-------------QQNSGNSLEAMMKEFMARTDAAIQSNQASMRALELQ

Query:  VSQLANELKARPQGKLPSDTEHPRREGKEQVKAVTLRSELESGQGAGGSKENVGASGSVPDVE--------------------------------PPYVP
        +  LANELKARPQG LPSDTE+PRR+GKEQ K++ LR    SG+    S+E +  SG    ++                                 P   
Subjt:  VSQLANELKARPQGKLPSDTEHPRREGKEQVKAVTLRSELESGQGAGGSKENVGASGSVPDVE--------------------------------PPYVP

Query:  PPPYVPPLPFPQRQKPKNQDGPFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRWKELGRALCDLGASINL---------------MPLSV-
             PPLPFPQR + + QDG FKKFL++LKQLHINIPLVEA+EQMPNY KFLKDILTKK+R  E    L + G    L               +P+S+ 
Subjt:  PPPYVPPLPFPQRQKPKNQDGPFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRWKELGRALCDLGASINL---------------MPLSV-

Query:  YRKLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLATDEMEDCSFIRILESTVIETSIQDSAD------
         R LGIGEARPTTVTLQLADRS+ +P+GKIEDVLV+VDKFIFP DFIILDYE D++VPIIL RPFLAT      + I + +  +   +  + A       
Subjt:  YRKLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLATDEMEDCSFIRILESTVIETSIQDSAD------

Query:  -KHSEKHGEGISPSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEHMEAQEDIYEKSPREMKKKAQKSAQKEAEFLENLDSV
         +  +  GE ++    M    +EE  F+  ++ R+  P +KE+ K    +  +D     P++  KK +K  +   +F   +  V
Subjt:  -KHSEKHGEGISPSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEHMEAQEDIYEKSPREMKKKAQKSAQKEAEFLENLDSV

XP_030509259.1 uncharacterized protein LOC115723937 [Cannabis sativa]5.3e-11944.39Show/hide
Query:  LDPEIERTFRNRRKEQRRNQMENVP---QLPQRNNQAENPILIANDRTGAIRAYAVPMFNELNPGIARPQIQAANFEMKPVMFQMLQTV-----------
        +DPEIERTFR RRKEQ+  +  N+    ++   +N+A NPI +A+DR  AIR YA PMFNELNPGI RP+IQA +FE+KPVMFQMLQT+           
Subjt:  LDPEIERTFRNRRKEQRRNQMENVP---QLPQRNNQAENPILIANDRTGAIRAYAVPMFNELNPGIARPQIQAANFEMKPVMFQMLQTV-----------

Query:  ----------------------------------------------GQFHGFRQLEDETFSEAWERFKELLQKCPHHGLPHCIQMETFYNGLNGVTQGMV
                                                       +   F+QLEDET S+AWERFKELL+KCPHHG+PHCIQ+ETFYNGLN  ++ ++
Subjt:  ----------------------------------------------GQFHGFRQLEDETFSEAWERFKELLQKCPHHGLPHCIQMETFYNGLNGVTQGMV

Query:  DASAGGALLAKTFDEAYEILERISINSCQWSDVRG-TNKKVKSVLEVDGVSTIRVDLAMIANALKNVTTISHQQPPAMEPAAVVNQVTDEACVYCGEDHN
        DASA GA+L+K+++EA+EILERI+ N+ QWS  R  T++KV  VLEVD ++ +   +A + N LKN+         +++PAA + Q  + +CVYCG+ H 
Subjt:  DASAGGALLAKTFDEAYEILERISINSCQWSDVRG-TNKKVKSVLEVDGVSTIRVDLAMIANALKNVTTISHQQPPAMEPAAVVNQVTDEACVYCGEDHN

Query:  YEFCPSNPASVFFVGNQ----RNNPYSNFYNP-------------GFAKAQVMSQQNKQAL---------------PQQNSGNSLEAMMKEFMARTDAAI
        +E CPSNPASV +VGNQ     NNPYSN YNP             G + +     Q KQ+                PQ +  +SLE++M+++MA+ DA I
Subjt:  YEFCPSNPASVFFVGNQ----RNNPYSNFYNP-------------GFAKAQVMSQQNKQAL---------------PQQNSGNSLEAMMKEFMARTDAAI

Query:  QSNQASMRALELQVSQLANELKARPQGKLPSDTEHPRREGKEQVKAVTLRSE--LESGQGAGGSKE-----NVGASGSVPDVEPPYVPPPPYVPPLPFPQ
        QS  AS+R LE+Q+ QLAN+LK RPQG LPSDTE+PRR+GKE  KAVTLRS   +ES   A GSKE       G     P      +P         F  
Subjt:  QSNQASMRALELQVSQLANELKARPQGKLPSDTEHPRREGKEQVKAVTLRSE--LESGQGAGGSKE-----NVGASGSVPDVEPPYVPPPPYVPPLPFPQ

Query:  RQ-------------KPKNQDGPFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRWKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVT
        R+             +   +    ++FL++LKQLHINIPLVEA+EQMP Y K            +++GRALCDLGASINLMP+S+   LG  E       
Subjt:  RQ-------------KPKNQDGPFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRWKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVT

Query:  LQLADRSITYPEG--KIEDVLVKVDKFIFPVDFIILDYEADK
           A     +     KIEDVLV+VDKFIFP DFIILDYE D+
Subjt:  LQLADRSITYPEG--KIEDVLVKVDKFIFPVDFIILDYEADK

XP_030509265.1 uncharacterized protein LOC115723943 [Cannabis sativa]3.7e-12846.89Show/hide
Query:  QAENPILIANDRTGAIRAYAVPMFNELNPGIARPQIQAANFEMKPVMFQMLQTVGQFHGFRQLEDETFSEAWERFKELLQKCPHHGLPHCIQMETFYNGL
        Q  NPI++A+DR  AIR YA PMFNELNP +       A FE++  +            F++LEDE+ S+AWERFKELL+KCPHHG+PHCIQMETFYNGL
Subjt:  QAENPILIANDRTGAIRAYAVPMFNELNPGIARPQIQAANFEMKPVMFQMLQTVGQFHGFRQLEDETFSEAWERFKELLQKCPHHGLPHCIQMETFYNGL

Query:  NGVTQGMVDASAGGALLAKTFDEAYEILERISINSCQWSDVRG-TNKKVKSVLEVDGVSTIRVDLAMIANALKNVTTISHQQPPAMEPAAVVNQVTDEAC
        N  ++ ++DASA GA+L+K+++EA+EILE I+ N+ QWS+ R  T++KV  VLEVD ++ +   +A + N   N++                        
Subjt:  NGVTQGMVDASAGGALLAKTFDEAYEILERISINSCQWSDVRG-TNKKVKSVLEVDGVSTIRVDLAMIANALKNVTTISHQQPPAMEPAAVVNQVTDEAC

Query:  VYCGEDHNYEFCPSNPASVFFVGNQRNNPYSNFYNPGFAKAQVMSQQNKQALPQQNSGNSLEAMMKEFMARTDAAIQSNQASMRALELQVSQLANELKAR
         + G+  +    P+        G Q        Y PGF++    SQ  + A P     +SLE++M+++MA+ DA IQS  AS+R LELQ+  LANELKAR
Subjt:  VYCGEDHNYEFCPSNPASVFFVGNQRNNPYSNFYNPGFAKAQVMSQQNKQALPQQNSGNSLEAMMKEFMARTDAAIQSNQASMRALELQVSQLANELKAR

Query:  PQGKLPSDTEHPRREGKEQVKAVTLRSELESGQGAGGSKENVGASG-----------------SVPDVEP-----------PYVPPPPYVPPLPFPQRQK
        PQG LPSDTE+PRR+GKEQ K++ LR    SG+    S+E +  SG                  + D  P               P    PPLPFPQR +
Subjt:  PQGKLPSDTEHPRREGKEQVKAVTLRSELESGQGAGGSKENVGASG-----------------SVPDVEP-----------PYVPPPPYVPPLPFPQRQK

Query:  PKNQDGPFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKR--------------------------------------WKELGRALCDLGASI
         + QDG FKKFL++LKQLHINIPLVEA+EQMPNY KFLKDILTKK+R                                       +++GRALCDLGASI
Subjt:  PKNQDGPFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKR--------------------------------------WKELGRALCDLGASI

Query:  NLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLAT-----------------DEMEDCSF
        NLMP+S+++KLGIGEARPTTVTLQLADRS+ +PEGKIEDVLV+VDKFIFP DFIILDYEAD+DVPIILGRPFLAT                 DE+E+CS 
Subjt:  NLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLAT-----------------DEMEDCSF

Query:  IRILESTVIE
        I +++S V E
Subjt:  IRILESTVIE

TrEMBL top hitse value%identityAlignment
A0A6A2WLX1 Reverse transcriptase6.6e-9135.58Show/hide
Query:  VPQLPQRNNQAENPILIANDRTGA--IRAYAVPMFNELNPGIARPQIQAANFEMKPVMFQMLQTVGQFHG------------------------------
        +P+LP  +   E+ +   +   GA        PM ++LNPGI  P+IQAA+FEMKPVMF ML ++GQF G                              
Subjt:  VPQLPQRNNQAENPILIANDRTGA--IRAYAVPMFNELNPGIARPQIQAANFEMKPVMFQMLQTVGQFHG------------------------------

Query:  ------------------------------------------------------FRQLEDETFSEAWERFKELLQKCPHHGLPHCIQMETFYNGLNGVTQ
                                                              FRQ +DE+  E W+R+K LLQKC +HG     Q+  FYNG+N  T+
Subjt:  ------------------------------------------------------FRQLEDETFSEAWERFKELLQKCPHHGLPHCIQMETFYNGLNGVTQ

Query:  GMVDASAGGALLAKTFDEAYEILERISINSCQWSDVR-GTNKKVKSVLEVDGVSTIRVDLAMIANALKNVTTISHQQPPAMEPAAVVNQVTDEACVYCGE
         ++DASA G LL K+  EA+ IL+RI+ N  Q+   R G+ ++     E++   ++   L++I N LKN+   +  +           + T  AC+ C  
Subjt:  GMVDASAGGALLAKTFDEAYEILERISINSCQWSDVR-GTNKKVKSVLEVDGVSTIRVDLAMIANALKNVTTISHQQPPAMEPAAVVNQVTDEACVYCGE

Query:  DHNYEFCPSNPASVFFVGNQR---NNPYSNFYNPGFAK--------------AQVMSQQN-------KQALPQQNSG---------NSLEAMMKEFMART
        +H+   CP+N  S+ FVGN     NNPYSN YN G+ +               Q   QQN       + A+P  N+          +SLEA ++EF++ T
Subjt:  DHNYEFCPSNPASVFFVGNQR---NNPYSNFYNPGFAK--------------AQVMSQQN-------KQALPQQNSG---------NSLEAMMKEFMART

Query:  ---------------------DAAIQSNQASMRALELQVSQLANELKARPQGKLPSDTEHPRREGKEQVKAVTLRSELESGQGAGGSKENVGASGSVPD-
                              A IQS+ +S+RALE QV Q+A  L+ R QG+LPSDTE  +  GKE    +TLRS  +  +     ++       VPD 
Subjt:  ---------------------DAAIQSNQASMRALELQVSQLANELKARPQGKLPSDTEHPRREGKEQVKAVTLRSELESGQGAGGSKENVGASGSVPD-

Query:  ---VEPPYVPPPPYV-PPLPFPQRQKPKNQDGPFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRWKE------------------------
           V+  ++P      PP PFPQR K  N +  FKKF++IL QLHINIPL+EA+EQMP YAKF+KDI TKK++ +                         
Subjt:  ---VEPPYVPPPPYV-PPLPFPQRQKPKNQDGPFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRWKE------------------------

Query:  ----------LGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLAT
                  +G+ALCDLG+S+NL+P S++ KLGIG+ARPT+V LQLAD+S    EG++EDV+V+VDKF+F VDF+ILD E D   PIILGRPFLAT
Subjt:  ----------LGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLAT

A0A6A2XMQ5 Retrotrans_gag domain-containing protein5.6e-9035.81Show/hide
Query:  LDPEIERTFRNRRKEQRRNQMENVPQLPQRNNQAENPILIAND--RTGAIRAYAVPMFNELNPGIARPQIQAANFEMKPVMFQMLQTVG-----------
        ++PEIE +   +   ++ +Q+ N        NQ    +  A    R   IR +  PM ++LNPGI  P+IQAA+FE+KPVMF ML ++G           
Subjt:  LDPEIERTFRNRRKEQRRNQMENVPQLPQRNNQAENPILIAND--RTGAIRAYAVPMFNELNPGIARPQIQAANFEMKPVMFQMLQTVG-----------

Query:  ---QFHG---------------------------------FRQLEDETFSEAWERFKELLQKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAK
           QF G                                 FRQ +DE+  E W+R+K LL+KC +HG     Q+  FYNG+N  T+ ++DASA G LL K
Subjt:  ---QFHG---------------------------------FRQLEDETFSEAWERFKELLQKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAK

Query:  TFDEAYEILERISINSCQWSDVR-GTNKKVKSVLEVDGVSTIRVDLAMIANALKNVTTISHQQPPAMEPAAVVNQVTDEACVYCGEDHNYEFCPSNPASV
        +  EA+ IL+RI+ N  Q+   R G+ ++     E++   ++   L+ I N LKN+   +  +           + T  AC+ C  +H+   CP+N  S+
Subjt:  TFDEAYEILERISINSCQWSDVR-GTNKKVKSVLEVDGVSTIRVDLAMIANALKNVTTISHQQPPAMEPAAVVNQVTDEACVYCGEDHNYEFCPSNPASV

Query:  FFVGNQR---NNPYSNFYNPGFAKAQVMSQQN---------------------KQALPQQNSG---------NSLEAMMKEFMART--------------
         FVGN     NNPYSN YN G+ +    S +N                     + A+P  N+          +SLEA ++EF++ T              
Subjt:  FFVGNQR---NNPYSNFYNPGFAKAQVMSQQN---------------------KQALPQQNSG---------NSLEAMMKEFMART--------------

Query:  -------DAAIQSNQASMRALELQVSQLANELKARPQGKLPSDTEHPRREGKEQVKAVTLRS----------------------ELESGQGAGGSKENVG
                A +QS+ +S+RALE QV Q+A  L+ R QG+LPSDTE  +  GKE    +TLRS                      E E        K+   
Subjt:  -------DAAIQSNQASMRALELQVSQLANELKARPQGKLPSDTEHPRREGKEQVKAVTLRS----------------------ELESGQGAGGSKENVG

Query:  ASGSVPD----VEPPYVPPPPYV-PPLPFPQRQKPKNQDGPFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRWK-----------------
         S  VP+     +   +P      PP PFPQR K  N +  FKKF++IL QLHIN  L+EA+EQMP YAKFLKDI TKK++ +                 
Subjt:  ASGSVPD----VEPPYVPPPPYV-PPLPFPQRQKPKNQDGPFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRWK-----------------

Query:  -----------------ELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGR
                          +G+ALCDLG+S+N MP S++ KLGIG+ARPT+V LQLAD+S   PEGK+EDV+V+VDKF+FPVDF+ILD E D   PIILGR
Subjt:  -----------------ELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGR

Query:  PFLATDE-MEDC
        PFLAT   + DC
Subjt:  PFLATDE-MEDC

A0A6J0ZX64 LOW QUALITY PROTEIN: uncharacterized protein LOC1104129454.7e-10537.64Show/hide
Query:  DPEIERTFRNRRKEQRRNQMENVPQLPQRNNQAENPILIANDRTGAIRAYAVPMFNELNPGIARPQIQAANFEMKPVMFQMLQTVGQFHG----------
        DP+IERTFR  R+E  +    N       NN   N I +  +   A+R Y VP+   L+  I RP I A NFE+KP   QM+Q+  QF G          
Subjt:  DPEIERTFRNRRKEQRRNQMENVPQLPQRNNQAENPILIANDRTGAIRAYAVPMFNELNPGIARPQIQAANFEMKPVMFQMLQTVGQFHG----------

Query:  --------------------------------------------------------------------------FRQLEDETFSEAWERFKELLQKCPHH
                                                                                  F Q + E+  EAWERFKELL++CPHH
Subjt:  --------------------------------------------------------------------------FRQLEDETFSEAWERFKELLQKCPHH

Query:  GLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKTFDEAYEILERISINSCQWSDVRGTNKKVKSVLEVDGVSTIRVDLAMIANALKNVTTISHQQPPAM
        G+P  +Q++TFYNGL G  + ++DA+AGGAL++K   +AY +LE ++ N+ QW   R  ++K     E+D + T+   +A ++  L  +           
Subjt:  GLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKTFDEAYEILERISINSCQWSDVRGTNKKVKSVLEVDGVSTIRVDLAMIANALKNVTTISHQQPPAM

Query:  EPAAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGN---QRNNPYSNFYNPGFAKAQVMSQQN----------------KQALPQ-QNSGNSLEAMMK
           AV N +    C  CG+ H+Y+ CP N  SV FVGN   Q+NNPYSN YNPG+      S  N                +QA PQ     + LE ++ 
Subjt:  EPAAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGN---QRNNPYSNFYNPGFAKAQVMSQQN----------------KQALPQ-QNSGNSLEAMMK

Query:  EFMARTDAAIQSNQASMRALELQVSQLANELKARPQGKLPSDTE-HPRREGKEQVKAVTLR--------------SELESGQGAGGSKENVGASGSVPDV
        +++++TDA IQS  AS+R LE QV QLAN +  RPQG LPSDT+ +P+  GKEQ +A+TLR              SE+E     G  +  +       D 
Subjt:  EFMARTDAAIQSNQASMRALELQVSQLANELKARPQGKLPSDTE-HPRREGKEQVKAVTLR--------------SELESGQGAGGSKENVGASGSVPDV

Query:  EPPYVPPPPYVPPLPFPQRQKPKNQDGPFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRWKE-----------------------------
                   PP PFPQR + +  +  F+KFL + K+LHINIP  EA+EQMP+Y KFLKDIL+KK++  E                             
Subjt:  EPPYVPPPPYVPPLPFPQRQKPKNQDGPFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRWKE-----------------------------

Query:  ---------LGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLAT
                   +AL DLGASINLMP S++ KLG+GE +PT+VTLQLADRS  YP G IEDVLVKVDKFIFPVDF+ILD E D+ +PIILGRPFLAT
Subjt:  ---------LGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLAT

A0A6J1DY39 uncharacterized protein LOC1110256533.4e-10337.6Show/hide
Query:  NQAENPILIANDRTGAIRAYAVPMFNELNPGIARPQIQAANFEMKPVMFQMLQTVGQFHG----------------------------------------
        N   NPI +A+ R  A+R YA  +  +LN  +       A FE KP+M QML  +GQF G                                        
Subjt:  NQAENPILIANDRTGAIRAYAVPMFNELNPGIARPQIQAANFEMKPVMFQMLQTVGQFHG----------------------------------------

Query:  --------------------------------------------FRQLEDETFSEAWERFKELLQKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAGGA
                                                    FRQ E+E  + AWERFK+L+  CP+ G+P C+Q+E F+ G + +T+ M++ +A G 
Subjt:  --------------------------------------------FRQLEDETFSEAWERFKELLQKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAGGA

Query:  LLAKTFDEAYEILERISINSCQWSDVRGTNKKVKS----VLEVDGVSTIRVDLAMIANALKNV--TTISHQQPPAMEPAAVVNQVTDEACVYCGEDHNYE
          +K+F+E  EIL+++S ++ QW   +   +  ++    VL +D +++++  +  I   LKN+           A    + V Q+ +  C YCG+ H  E
Subjt:  LLAKTFDEAYEILERISINSCQWSDVRGTNKKVKS----VLEVDGVSTIRVDLAMIANALKNV--TTISHQQPPAMEPAAVVNQVTDEACVYCGEDHNYE

Query:  FCPSNPASVFFVG---NQRNNPYSNFYNPGFAKAQVMS-------------QQNKQAL---------------------------PQQNSGN--------
         CPSNP+S+++VG    Q+ NPYSN YNPG+ +    S             QQ K+A                             QQN  N        
Subjt:  FCPSNPASVFFVG---NQRNNPYSNFYNPGFAKAQVMS-------------QQNKQAL---------------------------PQQNSGN--------

Query:  --SLEAMMKEFMARTDAAIQS----------NQASMRALELQVSQLANELKARPQGKLPSDTEHPRREGKEQVKAVTLRSEL---------ESGQGAGGS
            +A MKE M RTD  ++           N  ++R LE+Q+ QL NE++ RPQG LPS TE PRR GKE   ++  RS L         ES       
Subjt:  --SLEAMMKEFMARTDAAIQS----------NQASMRALELQVSQLANELKARPQGKLPSDTEHPRREGKEQVKAVTLRSEL---------ESGQGAGGS

Query:  KENVGASGSVPD--VEPPY-VPPPPYV----PPLPFPQRQKPKNQDGPFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKR------------
        K+    + +VPD  VEP   VP  P V    PP PFPQR   KNQD  F+KFL+ILKQLHINIP VEA+EQMP YAKF+KDI+T+KK+            
Subjt:  KENVGASGSVPD--VEPPY-VPPPPYV----PPLPFPQRQKPKNQDGPFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKR------------

Query:  --------------------------WKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYE
                                   K++GRALCDLGASINLMPLS+++K  IG+A PTTVTLQLADRSIT PEGKIEDVLVKVDKFIFP DFIILD E
Subjt:  --------------------------WKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYE

Query:  ADKDVPIILGRPFLATDE
        ADKDVPIILGRPFLAT E
Subjt:  ADKDVPIILGRPFLATDE

A0A6J1EQ90 uncharacterized protein LOC1114364119.1e-14142.74Show/hide
Query:  MSDPPGVRFELDPEIERTFRNRRKEQRRNQMENVPQL---PQRNNQAENPILIAN-------------DRTGAIRAYAVPMFNELNPGIARPQIQAANFE
        M+ P G+ F LDPEIERTFR R K++++   +N+ Q+    Q N + ENP ++AN             DR  AIRAYA P   ELNP I RP+IQ   FE
Subjt:  MSDPPGVRFELDPEIERTFRNRRKEQRRNQMENVPQL---PQRNNQAENPILIAN-------------DRTGAIRAYAVPMFNELNPGIARPQIQAANFE

Query:  MKPVMFQMLQTVGQFHG-----------------------------------------------------------------------------------
        +KPVMFQMLQT+GQFHG                                                                                   
Subjt:  MKPVMFQMLQTVGQFHG-----------------------------------------------------------------------------------

Query:  --------FRQLEDETFSEAWERFKELLQKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKTFDEAYEILERISINSCQWSDVRGT-NKKVKS
                F+Q EDET SEA ERFKE+L+KCPHHGLPHCIQMETFYNGLN VT+ +VDASA GA+L+KT++EAYEILERI+ N+CQW+DVR    +K + 
Subjt:  --------FRQLEDETFSEAWERFKELLQKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKTFDEAYEILERISINSCQWSDVRGT-NKKVKS

Query:  VLEVDGVSTIRVDLAMIANALKNVTTISHQQPPA-MEPAAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGNQ------RNNPYSNFYNPG-------
        VLEVD +S+I   LA + N L+N+         A +  AA +NQ   E+CVYCGE+H ++ CPSNPAS+F+VGNQ      +NNP+SN YNPG       
Subjt:  VLEVDGVSTIRVDLAMIANALKNVTTISHQQPPA-MEPAAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGNQ------RNNPYSNFYNPG-------

Query:  -----------------------------FAKAQVMSQQNKQALPQQNSGNSLEAMMKEFMARTDAAIQSNQASMRALELQVSQLANELKARPQGKLPSD
                                     ++  QV +Q       Q  S  S+E+++KE+MA+ DA IQS QAS+R LE+Q+    N  +     +  +D
Subjt:  -----------------------------FAKAQVMSQQNKQALPQQNSGNSLEAMMKEFMARTDAAIQSNQASMRALELQVSQLANELKARPQGKLPSD

Query:  TEHPRREGKEQVKAVTLRSELESGQGAGGSKENVGASGSVPDVEPPYVPPPPYVPPLPFPQRQKPKNQDGPFKKFLEILKQLHINIPLVEAIEQMPNYAK
        T+    E   Q +     +E+E        +  +  + S       Y P P      PFPQR K K ++  F+KF++ILK++HINIPLVEA++QMPNY K
Subjt:  TEHPRREGKEQVKAVTLRSELESGQGAGGSKENVGASGSVPDVEPPYVPPPPYVPPLPFPQRQKPKNQDGPFKKFLEILKQLHINIPLVEAIEQMPNYAK

Query:  FLKDILTKKKRW--------------------------------------KELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGK
        FLKD+L  ++++                                      KELGRALCDLGA+INLMPLS+Y+KLGIGEARPTTVTLQLADRSITYPEGK
Subjt:  FLKDILTKKKRW--------------------------------------KELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGK

Query:  IEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFL
        IED+L++VDKFIF  DFIILDYE D DVPIILGRPFL
Subjt:  IEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCGATCCGCCTGGGGTACGGTTTGAGCTTGACCCAGAAATTGAAAGGACATTTAGGAACAGAAGAAAGGAGCAGCGCAGAAACCAGATGGAGAACGTGCCGCAACT
CCCGCAGCGAAATAATCAGGCTGAGAATCCTATCTTGATAGCGAACGATAGGACCGGAGCCATTCGAGCGTATGCTGTCCCGATGTTTAATGAGTTGAATCCAGGGATTG
CACGTCCCCAAATCCAAGCGGCGAATTTTGAAATGAAACCGGTAATGTTTCAGATGTTGCAAACCGTGGGGCAATTCCATGGGTTTAGGCAACTTGAAGATGAGACTTTT
AGTGAGGCTTGGGAGAGGTTTAAGGAGCTTTTGCAAAAGTGTCCCCACCATGGGTTACCTCATTGTATTCAAATGGAAACATTTTACAATGGTTTGAATGGAGTAACCCA
AGGTATGGTCGATGCTTCGGCTGGAGGGGCCCTTTTGGCAAAAACTTTTGATGAAGCCTATGAAATTTTAGAAAGAATATCTATTAATAGTTGTCAGTGGTCGGATGTTA
GAGGCACAAATAAAAAGGTTAAGAGTGTGTTAGAGGTTGATGGTGTGTCCACCATTAGAGTCGATCTTGCTATGATTGCTAACGCTCTTAAGAATGTGACAACGATTAGT
CATCAGCAGCCACCAGCTATGGAGCCTGCAGCAGTGGTGAACCAAGTCACGGACGAAGCATGTGTCTATTGCGGTGAAGATCACAACTACGAGTTTTGCCCCAGCAATCC
AGCTTCTGTGTTTTTTGTAGGTAATCAAAGGAACAACCCTTATTCTAACTTCTATAATCCAGGATTTGCTAAAGCGCAAGTAATGTCCCAGCAAAATAAGCAGGCTTTGC
CCCAGCAAAATTCGGGAAATTCTCTCGAGGCGATGATGAAAGAATTTATGGCTCGCACAGACGCCGCAATTCAAAGTAATCAAGCTTCGATGAGGGCCCTGGAATTGCAA
GTGAGTCAGCTAGCTAATGAGCTAAAGGCGAGGCCTCAAGGGAAACTTCCTTCAGATACTGAACACCCTCGAAGGGAAGGTAAGGAGCAGGTAAAGGCAGTAACTCTTAG
GAGTGAGTTGGAGTCTGGTCAGGGTGCTGGAGGCAGCAAAGAGAATGTTGGAGCATCTGGTTCTGTGCCAGATGTAGAACCACCATATGTGCCGCCCCCACCTTATGTAC
CACCTCTACCGTTTCCACAAAGGCAAAAGCCTAAGAATCAGGATGGTCCATTTAAAAAGTTTTTAGAGATTCTTAAGCAGTTGCATATAAATATCCCTTTAGTAGAAGCT
ATAGAGCAAATGCCTAATTATGCTAAATTTCTTAAGGATATTTTGACTAAAAAGAAAAGGTGGAAAGAATTAGGTAGAGCACTCTGTGATTTAGGTGCAAGCATTAACCT
TATGCCTCTTTCGGTCTATCGGAAGTTAGGTATTGGTGAAGCTAGACCTACCACAGTTACACTCCAATTAGCTGATAGGTCTATCACATATCCAGAGGGTAAAATTGAGG
ATGTCTTAGTAAAAGTTGATAAATTCATATTTCCTGTTGATTTTATTATTTTAGACTATGAGGCTGATAAAGATGTCCCAATTATTCTAGGTCGTCCATTTTTGGCTACT
GACGAAATGGAGGATTGCTCCTTCATTAGGATTCTGGAGAGCACAGTTATTGAGACATCAATACAGGATTCGGCTGATAAGCATTCGGAAAAGCATGGAGAGGGAATTAG
CCCATCTTTTTGTATGCACAAAATCACTCTAGAGGAGGGATCCTTTAGGAGTATTGAGCAACAAAGAAGGCTTAACCCTGCAATGAAAGAGGTTGTTAAAAAGGAGCATA
TGGAAGCCCAAGAAGACATCTATGAGAAAAGTCCAAGAGAAATGAAGAAGAAGGCCCAAAAATCAGCCCAAAAGGAAGCTGAATTTCTGGAAAATCTGGACAGCGTCGAG
ACGCTTAGCAAGGCAGCGTCGCGACGCTGCCCTTTCTTGTTGCCCAAGCCGCGTGCGCAGCAGCGTCGCGACGCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGAGCGATCCGCCTGGGGTACGGTTTGAGCTTGACCCAGAAATTGAAAGGACATTTAGGAACAGAAGAAAGGAGCAGCGCAGAAACCAGATGGAGAACGTGCCGCAACT
CCCGCAGCGAAATAATCAGGCTGAGAATCCTATCTTGATAGCGAACGATAGGACCGGAGCCATTCGAGCGTATGCTGTCCCGATGTTTAATGAGTTGAATCCAGGGATTG
CACGTCCCCAAATCCAAGCGGCGAATTTTGAAATGAAACCGGTAATGTTTCAGATGTTGCAAACCGTGGGGCAATTCCATGGGTTTAGGCAACTTGAAGATGAGACTTTT
AGTGAGGCTTGGGAGAGGTTTAAGGAGCTTTTGCAAAAGTGTCCCCACCATGGGTTACCTCATTGTATTCAAATGGAAACATTTTACAATGGTTTGAATGGAGTAACCCA
AGGTATGGTCGATGCTTCGGCTGGAGGGGCCCTTTTGGCAAAAACTTTTGATGAAGCCTATGAAATTTTAGAAAGAATATCTATTAATAGTTGTCAGTGGTCGGATGTTA
GAGGCACAAATAAAAAGGTTAAGAGTGTGTTAGAGGTTGATGGTGTGTCCACCATTAGAGTCGATCTTGCTATGATTGCTAACGCTCTTAAGAATGTGACAACGATTAGT
CATCAGCAGCCACCAGCTATGGAGCCTGCAGCAGTGGTGAACCAAGTCACGGACGAAGCATGTGTCTATTGCGGTGAAGATCACAACTACGAGTTTTGCCCCAGCAATCC
AGCTTCTGTGTTTTTTGTAGGTAATCAAAGGAACAACCCTTATTCTAACTTCTATAATCCAGGATTTGCTAAAGCGCAAGTAATGTCCCAGCAAAATAAGCAGGCTTTGC
CCCAGCAAAATTCGGGAAATTCTCTCGAGGCGATGATGAAAGAATTTATGGCTCGCACAGACGCCGCAATTCAAAGTAATCAAGCTTCGATGAGGGCCCTGGAATTGCAA
GTGAGTCAGCTAGCTAATGAGCTAAAGGCGAGGCCTCAAGGGAAACTTCCTTCAGATACTGAACACCCTCGAAGGGAAGGTAAGGAGCAGGTAAAGGCAGTAACTCTTAG
GAGTGAGTTGGAGTCTGGTCAGGGTGCTGGAGGCAGCAAAGAGAATGTTGGAGCATCTGGTTCTGTGCCAGATGTAGAACCACCATATGTGCCGCCCCCACCTTATGTAC
CACCTCTACCGTTTCCACAAAGGCAAAAGCCTAAGAATCAGGATGGTCCATTTAAAAAGTTTTTAGAGATTCTTAAGCAGTTGCATATAAATATCCCTTTAGTAGAAGCT
ATAGAGCAAATGCCTAATTATGCTAAATTTCTTAAGGATATTTTGACTAAAAAGAAAAGGTGGAAAGAATTAGGTAGAGCACTCTGTGATTTAGGTGCAAGCATTAACCT
TATGCCTCTTTCGGTCTATCGGAAGTTAGGTATTGGTGAAGCTAGACCTACCACAGTTACACTCCAATTAGCTGATAGGTCTATCACATATCCAGAGGGTAAAATTGAGG
ATGTCTTAGTAAAAGTTGATAAATTCATATTTCCTGTTGATTTTATTATTTTAGACTATGAGGCTGATAAAGATGTCCCAATTATTCTAGGTCGTCCATTTTTGGCTACT
GACGAAATGGAGGATTGCTCCTTCATTAGGATTCTGGAGAGCACAGTTATTGAGACATCAATACAGGATTCGGCTGATAAGCATTCGGAAAAGCATGGAGAGGGAATTAG
CCCATCTTTTTGTATGCACAAAATCACTCTAGAGGAGGGATCCTTTAGGAGTATTGAGCAACAAAGAAGGCTTAACCCTGCAATGAAAGAGGTTGTTAAAAAGGAGCATA
TGGAAGCCCAAGAAGACATCTATGAGAAAAGTCCAAGAGAAATGAAGAAGAAGGCCCAAAAATCAGCCCAAAAGGAAGCTGAATTTCTGGAAAATCTGGACAGCGTCGAG
ACGCTTAGCAAGGCAGCGTCGCGACGCTGCCCTTTCTTGTTGCCCAAGCCGCGTGCGCAGCAGCGTCGCGACGCTTAG
Protein sequenceShow/hide protein sequence
MSDPPGVRFELDPEIERTFRNRRKEQRRNQMENVPQLPQRNNQAENPILIANDRTGAIRAYAVPMFNELNPGIARPQIQAANFEMKPVMFQMLQTVGQFHGFRQLEDETF
SEAWERFKELLQKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKTFDEAYEILERISINSCQWSDVRGTNKKVKSVLEVDGVSTIRVDLAMIANALKNVTTIS
HQQPPAMEPAAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGNQRNNPYSNFYNPGFAKAQVMSQQNKQALPQQNSGNSLEAMMKEFMARTDAAIQSNQASMRALELQ
VSQLANELKARPQGKLPSDTEHPRREGKEQVKAVTLRSELESGQGAGGSKENVGASGSVPDVEPPYVPPPPYVPPLPFPQRQKPKNQDGPFKKFLEILKQLHINIPLVEA
IEQMPNYAKFLKDILTKKKRWKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLAT
DEMEDCSFIRILESTVIETSIQDSADKHSEKHGEGISPSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEHMEAQEDIYEKSPREMKKKAQKSAQKEAEFLENLDSVE
TLSKAASRRCPFLLPKPRAQQRRDA