CuGenDBv2

Lag0008009 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0008009
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase
Genome location: chr9:9893075..9898187
RNA-Seq Expression: Lag0008009
Synteny: Lag0008009
Gene Ontology terms: NA
InterPro domains: IPR005162 - Retrotransposon gag domain
                  IPR021109 - Aspartic peptidase domain superfamily
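
The genome location in this record follows a `chromosome:start..end` pattern. As a minimal sketch, the string can be split into its parts and the locus span computed. The function name `parse_location` is hypothetical, and the span assumes 1-based inclusive coordinates, which the page itself does not state.

```python
import re

def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

chrom, start, end = parse_location("chr9:9893075..9898187")
# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the locus spans 5113 bp on chr9.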


Homology
GenBank top hits (e value, %identity, alignment)
XP_022142953.1 uncharacterized protein LOC111012947 [Momordica charantia] (e value: 9.7e-97; %identity: 37.07)
Query:  GDAEARLDSFSPNSITSWNDLADKFLEKFFPSNKNAKYRAKIIDFRQSYNEPLDAAWERFQRLVRKFPHHGLPACIILEHFYNGLDQASKALVNASANES
        G A A L++F  ++I + +D+ DKFL K+FP  +NA  R +II FRQ  NE ++ AWERF+ L+R  P+ G+PAC+ +EHF+   D  +  ++N +AN  
Subjt:  GDAEARLDSFSPNSITSWNDLADKFLEKFFPSNKNAKYRAKIIDFRQSYNEPLDAAWERFQRLVRKFPHHGLPACIILEHFYNGLDQASKALVNASANES

Query:  FLKKSANE---------------------------------------GTRTRIDGI--------------HIRPRTTQ-------VETTSKL--------
        F  KS NE                                         + +ID I               + P TT         E+T ++        
Subjt:  FLKKSANE---------------------------------------GTRTRIDGI--------------HIRPRTTQ-------VETTSKL--------

Query:  --------------LLGGQVGSSNSNTNQFGK--------------PSTQQQ--PHKNFQQ---------------------------------------
                         GQ  SS +  NQ  K              P T QQ    KN+ Q                                       
Subjt:  --------------LLGGQVGSSNSNTNQFGK--------------PSTQQQ--PHKNFQQ---------------------------------------

Query:  -----------------VQFGHIAQEIKNRPQGTLPSKIEIPHREGNEQCKAVTLRSGLEYDGPKYPVNQEAEIPKKLEKRFLKRNPEYKPSPPYPQRFK
                         +Q G +A E++ RPQG+LPS  E P R        V+     E D    P   +  +  ++      +    +  PP+PQR  
Subjt:  -----------------VQFGHIAQEIKNRPQGTLPSKIEIPHREGNEQCKAVTLRSGLEYDGPKYPVNQEAEIPKKLEKRFLKRNPEYKPSPPYPQRFK

Query:  HASQDVQFKKFLDVLKQLHINIPLVEALEQMPTYVKFLKDILSKKRRLGEYEIVVLTECSSALVKNEISPKLKDPGSFTIPCSIGGRDVGRVLCDLGASI
          +QD  F+KFLD+LKQLHINIP VEALEQMPTY KFLKDI+++K++LGEYE V LTECSS + K++  PKLKDPGSFTI C IGG++VGR LCDLGA I
Subjt:  HASQDVQFKKFLDVLKQLHINIPLVEALEQMPTYVKFLKDILSKKRRLGEYEIVVLTECSSALVKNEISPKLKDPGSFTIPCSIGGRDVGRVLCDLGASI

Query:  NLMPCSVFK------------------QSLTHPIGKIEDILVNVDKFVFPVDFIILDCEADKDVSIILGRLFLATGRTLIDVQKGELTMRVNDQQVTFNV
        NLMP S+FK                  +S+T P GKIED+LV VDKF+FP DFIILDCEADKDV IILGR FLATG TLIDV+KGELTMRV+DQ+VTFN+
Subjt:  NLMPCSVFK------------------QSLTHPIGKIEDILVNVDKFVFPVDFIILDCEADKDVSIILGRLFLATGRTLIDVQKGELTMRVNDQQVTFNV

Query:  MNAMKYPGDSEECSMID--------EIDELSRSTFQEMMKGETLEDILGDEEPEVKGGKKIEVEEINQP
        ++AMKYP D EEC +I         E+D+L  +  +  ++    E I+     + +  K I+  +I  P
Subjt:  MNAMKYPGDSEECSMID--------EIDELSRSTFQEMMKGETLEDILGDEEPEVKGGKKIEVEEINQP

XP_022159235.1 uncharacterized protein LOC111025653 [Momordica charantia] (e value: 5.2e-106; %identity: 36.26)
Query:  GDAEARLDSFSPNSITSWNDLADKFLEKFFPSNKNAKYRAKIIDFRQSYNEPLDAAWERFQRLVRKFPHHGLPACIILEHFYNGLDQASKALVNASANES
        G A A L++F  ++IT+W+D+ DKFL K+FP  +NA  R +II FRQ  NE ++ AWERF+ L+   P+ G+PAC+ +EHF+ G D  +K ++N +AN  
Subjt:  GDAEARLDSFSPNSITSWNDLADKFLEKFFPSNKNAKYRAKIIDFRQSYNEPLDAAWERFQRLVRKFPHHGLPACIILEHFYNGLDQASKALVNASANES

Query:  FLKKSANE----------------GTRTRIDGIHIRP-----------RTTQVETTSKLL----------------------------------------
        F  KS NE                  ++R       P              Q++T +++L                                        
Subjt:  FLKKSANE----------------GTRTRIDGIHIRP-----------RTTQVETTSKLL----------------------------------------

Query:  --------------------------------------LGGQVGSSNSNTNQFGK-----------PSTQQQPH-----KNFQQ----------------
                                                GQ  S+ +  NQ  K           P+    PH     KN+ Q                
Subjt:  --------------------------------------LGGQVGSSNSNTNQFGK-----------PSTQQQPH-----KNFQQ----------------

Query:  ----------------------------------------VQFGHIAQEIKNRPQGTLPSKIEIPHREGNEQCKAVTLRSGLEYDGPKYP---------V
                                                +Q G +  E++ RPQG+LPS  E P R G E C ++  RSGL+Y+GP+ P          
Subjt:  ----------------------------------------VQFGHIAQEIKNRPQGTLPSKIEIPHREGNEQCKAVTLRSGLEYDGPKYP---------V

Query:  NQEAEIPKKLEKRFLK-----RNPEYKPSPPYPQRFKHASQDVQFKKFLDVLKQLHINIPLVEALEQMPTYVKFLKDILSKKRRLGEYEIVVLTECSSAL
             +P K+ +  +      +    +P PP+PQR    +QD  F+KFLD+LKQLHINIP VEALEQMPTY KF+KDI+++K++LGEYE V LTECSS +
Subjt:  NQEAEIPKKLEKRFLK-----RNPEYKPSPPYPQRFKHASQDVQFKKFLDVLKQLHINIPLVEALEQMPTYVKFLKDILSKKRRLGEYEIVVLTECSSAL

Query:  VKNEISPKLKDPGSFTIPCSIGGRDVGRVLCDLGASINLMPCSVFK------------------QSLTHPIGKIEDILVNVDKFVFPVDFIILDCEADKD
         K+++ PKLKDPGSFTIPC IGG+DVGR LCDLGASINLMP S+FK                  +S+T P GKIED+LV VDKF+FP DFIILDCEADKD
Subjt:  VKNEISPKLKDPGSFTIPCSIGGRDVGRVLCDLGASINLMPCSVFK------------------QSLTHPIGKIEDILVNVDKFVFPVDFIILDCEADKD

Query:  VSIILGRLFLATGRTLIDVQKGELTMRVNDQQVTFNVMNAMKYPGDSEECSMID--------EIDELSRSTFQEMMKGETLEDILGDEEPEVKGGKKIEV
        V IILGR FLATG TLIDV+KGELTMRV+DQ+VTFN+++AMKY  D EEC++I         E+D+L  +  +  ++    E I+     + +  K I+ 
Subjt:  VSIILGRLFLATGRTLIDVQKGELTMRVNDQQVTFNVMNAMKYPGDSEECSMID--------EIDELSRSTFQEMMKGETLEDILGDEEPEVKGGKKIEV

Query:  EEINQP
         +I  P
Subjt:  EEINQP

XP_024965798.1 uncharacterized protein LOC112506000 [Cynara cardunculus var. scolymus] (e value: 4.8e-104; %identity: 43.36)
Query:  AEARLDSFSPNSITSWNDLADKFLEKFFPSNKNAKYRAKIIDFRQSYNEPLDAAWERFQRLVRKFPHHGLPACIILEHFYNGLDQASKALVNASANESFL
        A A L+S  PNSI +WNDLA+KFL+K+FP  +N K R +I+ FRQ  +E +  AWERF+ L+RK  HHG+P C+ LE FY+ L  A+K +++A+A  +F 
Subjt:  AEARLDSFSPNSITSWNDLADKFLEKFFPSNKNAKYRAKIIDFRQSYNEPLDAAWERFQRLVRKFPHHGLPACIILEHFYNGLDQASKALVNASANESFL

Query:  KKSANEG--------------------TRTRIDGIH----IRPRTTQVETTSKLL-----LGG---QVGSSNSNTNQF----------------GKP---
         K+ NEG                          GIH    I     Q+   + L+     L G   QV S  SN++                  G P   
Subjt:  KKSANEG--------------------TRTRIDGIH----IRPRTTQVETTSKLL-----LGG---QVGSSNSNTNQF----------------GKP---

Query:  -----STQQQPHKNFQQVQFGHIAQEIKNRPQGTLPSKIEIPHREGNEQCKAVTLRS-----------------------GLEYDGPKYPV----NQEAE
             +T+  P     + QFG  A ++KNRPQGTLP   EIP     E  KAVTLRS                          +D  K+PV     +EA+
Subjt:  -----STQQQPHKNFQQVQFGHIAQEIKNRPQGTLPSKIEIPHREGNEQCKAVTLRS-----------------------GLEYDGPKYPV----NQEAE

Query:  IPKKLEK--------------------RFLKRNPEYKPSP--------PYPQRFKHASQDVQFKKFLDVLKQLHINIPLVEALEQMPTYVKFLKDILSKK
        IP  + +                    + L    + K  P        P+P R K  + DVQFKKFLD+ KQL+INIPLVEALEQM +YVKFLKDIL+KK
Subjt:  IPKKLEK--------------------RFLKRNPEYKPSP--------PYPQRFKHASQDVQFKKFLDVLKQLHINIPLVEALEQMPTYVKFLKDILSKK

Query:  RRLGEYEIVVLTECSSALVKNEISPKLKDPGSFTIPCSIGGRDVGRVLCDLGASINLMPCSVFKQ------------------SLTHPIGKIEDILVNVD
        RRL E E V LT+  SAL+  +I PKLKDPGSFTI CSIGG++VG  LCDLGASINLMP S+F Q                  SL +P  KIEDILV VD
Subjt:  RRLGEYEIVVLTECSSALVKNEISPKLKDPGSFTIPCSIGGRDVGRVLCDLGASINLMPCSVFKQ------------------SLTHPIGKIEDILVNVD

Query:  KFVFPVDFIILDCEADKDVSIILGRLFLATGRTLIDVQKGELTMRVNDQQVTFNVMNAMKYPGDSEECSMIDEI-DELSRSTFQEMMKGETL---EDILG
        KF+FP DF++LD EA+K+V IILGR FLATGRTLIDVQKGELTMRVNDQQVTFNV   +K+ G+ E+CS I  + ++L  S    +  G+T+   EDI  
Subjt:  KFVFPVDFIILDCEADKDVSIILGRLFLATGRTLIDVQKGELTMRVNDQQVTFNVMNAMKYPGDSEECSMIDEI-DELSRSTFQEMMKGETL---EDILG

Query:  DE
        ++
Subjt:  DE

XP_030505184.1 uncharacterized protein LOC115720166 [Cannabis sativa] (e value: 2.3e-98; %identity: 37.19)
Query:  SFIRILENIIVETTMEDLANKHLEGHGEGD-AEARLDSFSPNSITSWNDLADKFLEKFFPSNKNAKYRAKIIDFRQSYNEPLDAAWERFQRLVRKFPHHG
        SF+ + ++  ++   E++    L      D A + L++ SP+S+T+WND A+KFL K+FP  +NAK+R++I+ F Q  +E    AWERF+ L+RK PHHG
Subjt:  SFIRILENIIVETTMEDLANKHLEGHGEGD-AEARLDSFSPNSITSWNDLADKFLEKFFPSNKNAKYRAKIIDFRQSYNEPLDAAWERFQRLVRKFPHHG

Query:  LPACIILEHFYNGLDQASKALVNASANESFLKKSANEG--------------TRTRIDGIH----------IRPRTTQVETTSKLLLGGQVGSS------
        +P CI +E FYNGL+  S+ +++ASAN + L KS NE               + TR  G            I   TTQ+ + + +L    +G+S      
Subjt:  LPACIILEHFYNGLDQASKALVNASANESFLKKSANEG--------------TRTRIDGIH----------IRPRTTQVETTSKLLLGGQVGSS------

Query:  ---------------------------------NSNTN----------------------------------------------------QFGKPSTQQQ
                                         N N N                                                    Q  +PS+ + 
Subjt:  ---------------------------------NSNTN----------------------------------------------------QFGKPSTQQQ

Query:  PHKNFQ-----------------QVQFGHIAQEIKNRPQGTLPSKIEIPHREGNEQCKAVTLRSGLEYDGPKYPVNQEAE-----IPKKLEKRFLKRNPE
          +++                  ++Q GH+A E+K RPQG+LPS  E P R+G EQCK++ LRSG      +  +    E     I +KL K+  +   +
Subjt:  PHKNFQ-----------------QVQFGHIAQEIKNRPQGTLPSKIEIPHREGNEQCKAVTLRSGLEYDGPKYPVNQEAE-----IPKKLEKRFLKRNPE

Query:  -----------------------YKPSPPYPQRFKHASQDVQFKKFLDVLKQLHINIPLVEALEQMPTYVKFLKDILSKKRRLGEYEIVVLTECSSALVK
                                KP  P+PQRF+   QD QFKKFLDVLKQLHINIPLVEALEQMP YVKFLKDIL+KKRRLGE+E   LTE   A++K
Subjt:  -----------------------YKPSPPYPQRFKHASQDVQFKKFLDVLKQLHINIPLVEALEQMPTYVKFLKDILSKKRRLGEYEIVVLTECSSALVK

Query:  NEISPKLKDPGSFTIPCSIGGRDVGRVLCDLG-ASINLMPCSVFKQSLTHPIGKIEDILVNVDKFVFPVDFIILDCEADKDVSIILGRLFLATGRTLIDV
        N+I PKLKDPGSFTIP SIGGRD+G     +G A    +   +  +S+ HP GKIED+LV VDKF+FP DFIILD E D++V IIL R FLATGRTLIDV
Subjt:  NEISPKLKDPGSFTIPCSIGGRDVGRVLCDLG-ASINLMPCSVFKQSLTHPIGKIEDILVNVDKFVFPVDFIILDCEADKDVSIILGRLFLATGRTLIDV

Query:  QKGELTMRVNDQQVTFNVMNAMKYPGDSEECSMIDEID-----ELSRSTFQEMMKGETLEDILGDEEPEVKGGKKIEVEEINQPMKRQRIK
        +KGELTMR  D+Q TF V   ++ P    EC  I ++D     E      +++ K   L++I  D EP+   GK   + +  QP K++R K
Subjt:  QKGELTMRVNDQQVTFNVMNAMKYPGDSEECSMIDEID-----ELSRSTFQEMMKGETLEDILGDEEPEVKGGKKIEVEEINQPMKRQRIK

XP_030509265.1 uncharacterized protein LOC115723943 [Cannabis sativa] (e value: 2.0e-102; %identity: 42.86)
Query:  NAKYRAKIIDFRQSYNEPLDAAWERFQRLVRKFPHHGLPACIILEHFYNGLDQASKALVNASANESFLKKSANEG--------------------TRTRI
        NAK+  +   F++  +E    AWERF+ L+RK PHHG+P CI +E FYNGL+ AS+ +++ASAN + L KS NE                     T  ++
Subjt:  NAKYRAKIIDFRQSYNEPLDAAWERFQRLVRKFPHHGLPACIILEHFYNGLDQASKALVNASANESFLKKSANEG--------------------TRTRI

Query:  DGI----HIRPRTTQVETTS----KLLLGGQVGSSNSNTNQFGK----PSTQQQPH-----KNFQ-----------------------------QVQFGH
         G+     I   T Q+ + +     L  GGQ G+S+S     G+    P   QQP      +N Q                             ++Q GH
Subjt:  DGI----HIRPRTTQVETTS----KLLLGGQVGSSNSNTNQFGK----PSTQQQPH-----KNFQ-----------------------------QVQFGH

Query:  IAQEIKNRPQGTLPSKIEIPHREGNEQCKAVTLRSG---------LEYDGPKYPVNQEAEIPKKLEKRFLKRNPE-----------------YKPSPPYP
        +A E+K RPQG+LPS  E P R+G EQCK++ LRSG         ++  G    +  + ++ KK  +      P                   KP  P+P
Subjt:  IAQEIKNRPQGTLPSKIEIPHREGNEQCKAVTLRSG---------LEYDGPKYPVNQEAEIPKKLEKRFLKRNPE-----------------YKPSPPYP

Query:  QRFKHASQDVQFKKFLDVLKQLHINIPLVEALEQMPTYVKFLKDILSKKRRLGEYEIVVLTECSSALVKNEISPKLKDPGSFTIPCSIGGRDVGRVLCDL
        QRF+   QD QFKKFLDVLKQLHINIPLVEALEQMP YVKFLKDIL+KKRRLGE+E V LTE  SA++K++I PKLKDPGSFTIPCSIGGRDVGR LCDL
Subjt:  QRFKHASQDVQFKKFLDVLKQLHINIPLVEALEQMPTYVKFLKDILSKKRRLGEYEIVVLTECSSALVKNEISPKLKDPGSFTIPCSIGGRDVGRVLCDL

Query:  GASINLMPCSVFK------------------QSLTHPIGKIEDILVNVDKFVFPVDFIILDCEADKDVSIILGRLFLATGRTLIDVQKGELTMRVNDQQV
        GASINLMP S+FK                  +S+ HP GKIED+LV VDKF+FP DFIILD EAD+DV IILGR FLATGRTLIDVQ GELTMR+     
Subjt:  GASINLMPCSVFK------------------QSLTHPIGKIEDILVNVDKFVFPVDFIILDCEADKDVSIILGRLFLATGRTLIDVQKGELTMRVNDQQV

Query:  TFNVMNAMKYPGDSEECSMIDEIDELSRSTFQE--------MMKGETLEDILGDEEPEVKGGKKIE-VEEINQPMKRQRIK
                  P + EECS I  ID +    F +        +   + LED+  DE+ +V   + ++ +    +P +   +K
Subjt:  TFNVMNAMKYPGDSEECSMIDEIDELSRSTFQE--------MMKGETLEDILGDEEPEVKGGKKIE-VEEINQPMKRQRIK

TrEMBL top hits (e value, %identity, alignment)
A0A2G9HWF8 Reverse transcriptase (e value: 2.4e-85; %identity: 38.78)
Query:  SFIRILENIIVETTMEDLANKHLEGHG-EGDAEARLDSFSPNSITSWNDLADKFLEKFFPSNKNAKYRAKIIDFRQSYNEPLDAAWERFQRLVRKFPHHG
        +F++I + +  E   +D     L      GDA    +S   +SIT+W  L ++F+ KFF   K A  RA+I+ FRQ  +E +  AW RF++++R  P+H 
Subjt:  SFIRILENIIVETTMEDLANKHLEGHG-EGDAEARLDSFSPNSITSWNDLADKFLEKFFPSNKNAKYRAKIIDFRQSYNEPLDAAWERFQRLVRKFPHHG

Query:  LPACIILEHFYNGLDQASKALVNASANESFL-------------------KKSANEGTRTRIDGIHIRPRTTQVETTSKLLLGGQVGSSNSNTNQF----
        +P  I +  FY+GL +  K  ++    +SFL                   +K +   T  +  G+        +E      L  ++     +   F    
Subjt:  LPACIILEHFYNGLDQASKALVNASANESFL-------------------KKSANEGTRTRIDGIHIRPRTTQVETTSKLLLGGQVGSSNSNTNQF----

Query:  GKPSTQQQPHKNFQQVQFGHIAQEIKNRP----------------------QGTLP--------SKIEIPHREGNEQCKAVTLRSGLE-YDGPKYPV-NQ
        G PS  Q PH + + +QF   A++ +N P                      QG+ P        +    P ++G  QC+AVTLR+G +  +  K P  ++
Subjt:  GKPSTQQQPHKNFQQVQFGHIAQEIKNRP----------------------QGTLP--------SKIEIPHREGNEQCKAVTLRSGLE-YDGPKYPV-NQ

Query:  EAEIPKKLEKRFLKRNPEY-KPS---PPYPQRFKHASQDVQFKKFLDVLKQLHINIPLVEALEQMPTYVKFLKDILSKKRRLGEYEIVVLTECSSALVKN
        E E+  + +++ ++   E  KP+   PP+PQR +      QF KFL+V K+LHINIP  EALEQMP+YVKF+KDILSKKRRLG+YE V LTE  SA+++N
Subjt:  EAEIPKKLEKRFLKRNPEY-KPS---PPYPQRFKHASQDVQFKKFLDVLKQLHINIPLVEALEQMPTYVKFLKDILSKKRRLGEYEIVVLTECSSALVKN

Query:  EISPKLKDPGSFTIPCSIGGRDVGRVLCDLGASINLMPCSVFK------------------QSLTHPIGKIEDILVNVDKFVFPVDFIILDCEADKDVSI
        ++ PKLKDPG              R LCDLGASINLMP S+++                  +SLT+P G IEDILV VDKF+FP DF++LD E D +V I
Subjt:  EISPKLKDPGSFTIPCSIGGRDVGRVLCDLGASINLMPCSVFK------------------QSLTHPIGKIEDILVNVDKFVFPVDFIILDCEADKDVSI

Query:  ILGRLFLATGRTLIDVQKGELTMRVNDQQVTFNVMNAMKYPGDSEECSMIDEIDELS
        ILGR FLATGRTLIDVQKGELTMRV DQQ+TFNV  AMK+P +S+EC  +   D L+
Subjt:  ILGRLFLATGRTLIDVQKGELTMRVNDQQVTFNVMNAMKYPGDSEECSMIDEIDELS

A0A6J0ZX64 LOW QUALITY PROTEIN: uncharacterized protein LOC110412945 (e value: 6.8e-88; %identity: 38.17)
Query:  AEARLDSFSPNSITSWNDLADKFLEKFFPSNKNAKYRAKIIDFRQSYNEPLDAAWERFQRLVRKFPHHGLPACIILEHFYNGLDQASKALVNASANESFL
        A++ L+S    SIT+W DLA KFL KFFP  K AK R  I  F Q   E L  AWERF+ L+R+ PHHG+P  + ++ FYNGL  + K +++A+A  + +
Subjt:  AEARLDSFSPNSITSWNDLADKFLEKFFPSNKNAKYRAKIIDFRQSYNEPLDAAWERFQRLVRKFPHHGLPACIILEHFYNGLDQASKALVNASANESFL

Query:  KKSA--------------------NEGTRTRIDGIHI---RPRTTQVETTSKLL--LGGQ------------------------------VGSSN-----
         K+A                      G+R  +    I      TTQV   SK L  LG                                VG+ N     
Subjt:  KKSA--------------------NEGTRTRIDGIHI---RPRTTQVETTSKLL--LGGQ------------------------------VGSSN-----

Query:  ------------------------SNTNQFGKPSTQQQ-----PHKNFQ--------------------------QVQFGHIAQEIKNRPQGTLPSKIEI
                                SN      P  QQQ     P K  Q                          + Q G +A  I NRPQG+LPS  +I
Subjt:  ------------------------SNTNQFGKPSTQQQ-----PHKNFQ--------------------------QVQFGHIAQEIKNRPQGTLPSKIEI

Query:  PHREGNEQCKAVTLRSGLEYDGPKYPVNQEA------------------EIPKKLEKRFLKRNPE--YKPSPPYPQRFKHASQDVQFKKFLDVLKQLHIN
         + +G EQC+A+TLRSG E +G    VNQ+A                  EI +K + +   +       P PP+PQR +    + QF+KFL+V K+LHIN
Subjt:  PHREGNEQCKAVTLRSGLEYDGPKYPVNQEA------------------EIPKKLEKRFLKRNPE--YKPSPPYPQRFKHASQDVQFKKFLDVLKQLHIN

Query:  IPLVEALEQMPTYVKFLKDILSKKRRLGEYEIVVLTECSSALVKNEISPKLKDPGSFTIPCSIGGRDVGRVLCDLGASINLMPCSVFK------------
        IP  EALEQMP+YVKFLKDILSKKR+LGE+E V LTE  SA+++N++ PKLKDPGSFTIPC+IG     + L DLGASINLMP S+F+            
Subjt:  IPLVEALEQMPTYVKFLKDILSKKRRLGEYEIVVLTECSSALVKNEISPKLKDPGSFTIPCSIGGRDVGRVLCDLGASINLMPCSVFK------------

Query:  ------QSLTHPIGKIEDILVNVDKFVFPVDFIILDCEADKDVSIILGRLFLATGRTLIDVQKGELTMRVNDQQVTFNVMNAMKYPGDSEECSMIDEIDE
              +S  +P G IED+LV VDKF+FPVDF+ILD E D+ + IILGR FLAT   +IDV++G+++ +V ++ V FN+ NA K+P  +  C  ++ IDE
Subjt:  ------QSLTHPIGKIEDILVNVDKFVFPVDFIILDCEADKDVSIILGRLFLATGRTLIDVQKGELTMRVNDQQVTFNVMNAMKYPGDSEECSMIDEIDE

A0A6J1CPJ3 uncharacterized protein LOC111012947 (e value: 6.2e-97; %identity: 37.22)
Query:  GDAEARLDSFSPNSITSWNDLADKFLEKFFPSNKNAKYRAKIIDFRQSYNEPLDAAWERFQRLVRKFPHHGLPACIILEHFYNGLDQASKALVNASANES
        G A A L++F  ++I + +D+ DKFL K+FP  +NA  R +II FRQ  NE ++ AWERF+ L+R  P+ G+PAC+ +EHF+   D  +  ++N +AN  
Subjt:  GDAEARLDSFSPNSITSWNDLADKFLEKFFPSNKNAKYRAKIIDFRQSYNEPLDAAWERFQRLVRKFPHHGLPACIILEHFYNGLDQASKALVNASANES

Query:  FLKKSANE---------------------------------------GTRTRIDGI--------------HIRPRTTQ-------VETTSKL--------
        F  KS NE                                         + +ID I               + P TT         E+T ++        
Subjt:  FLKKSANE---------------------------------------GTRTRIDGI--------------HIRPRTTQ-------VETTSKL--------

Query:  --------------LLGGQVGSSNSNTNQFGK--------------PSTQQQ--PHKNFQQ---------------------------------------
                         GQ  SS +  NQ  K              P T QQ    KN+ Q                                       
Subjt:  --------------LLGGQVGSSNSNTNQFGK--------------PSTQQQ--PHKNFQQ---------------------------------------

Query:  -----------------VQFGHIAQEIKNRPQGTLPSKIEIPHREGNEQCKAVTLRSGLEYDGPKYPVNQEAEIPKKLEKRFLKRNPEYKPSPPYPQRFK
                         +Q G +A E++ RPQG+LPS  E P R        V+     E D    P   +  +  ++      +    +  PP+PQR  
Subjt:  -----------------VQFGHIAQEIKNRPQGTLPSKIEIPHREGNEQCKAVTLRSGLEYDGPKYPVNQEAEIPKKLEKRFLKRNPEYKPSPPYPQRFK

Query:  HASQDVQFKKFLDVLKQLHINIPLVEALEQMPTYVKFLKDILSKKRRLGEYEIVVLTECSSALVKNEISPKLKDPGSFTIPCSIGGRDVGRVLCDLGASI
          +QD  F+KFLD+LKQLHINIP VEALEQMPTY KFLKDI+++K++LGEYE V LTECSS + K++  PKLKDPGSFTI C IGG+DVGR LCDLGA I
Subjt:  HASQDVQFKKFLDVLKQLHINIPLVEALEQMPTYVKFLKDILSKKRRLGEYEIVVLTECSSALVKNEISPKLKDPGSFTIPCSIGGRDVGRVLCDLGASI

Query:  NLMPCSVFK------------------QSLTHPIGKIEDILVNVDKFVFPVDFIILDCEADKDVSIILGRLFLATGRTLIDVQKGELTMRVNDQQVTFNV
        NLMP S+FK                  +S+T P GKIED+LV VDKF+FP DFIILDCEADKDV IILGR FLATG TLIDV+KGELTMRV+DQ+VTFN+
Subjt:  NLMPCSVFK------------------QSLTHPIGKIEDILVNVDKFVFPVDFIILDCEADKDVSIILGRLFLATGRTLIDVQKGELTMRVNDQQVTFNV

Query:  MNAMKYPGDSEECSMID--------EIDELSRSTFQEMMKGETLEDILGDEEPEVKGGKKIEVEEINQP
        ++AMKYP D EEC +I         E+D+L  +  +  ++    E I+     + +  K I+  +I  P
Subjt:  MNAMKYPGDSEECSMID--------EIDELSRSTFQEMMKGETLEDILGDEEPEVKGGKKIEVEEINQP

A0A6J1DY39 uncharacterized protein LOC111025653 (e value: 2.5e-106; %identity: 36.26)
Query:  GDAEARLDSFSPNSITSWNDLADKFLEKFFPSNKNAKYRAKIIDFRQSYNEPLDAAWERFQRLVRKFPHHGLPACIILEHFYNGLDQASKALVNASANES
        G A A L++F  ++IT+W+D+ DKFL K+FP  +NA  R +II FRQ  NE ++ AWERF+ L+   P+ G+PAC+ +EHF+ G D  +K ++N +AN  
Subjt:  GDAEARLDSFSPNSITSWNDLADKFLEKFFPSNKNAKYRAKIIDFRQSYNEPLDAAWERFQRLVRKFPHHGLPACIILEHFYNGLDQASKALVNASANES

Query:  FLKKSANE----------------GTRTRIDGIHIRP-----------RTTQVETTSKLL----------------------------------------
        F  KS NE                  ++R       P              Q++T +++L                                        
Subjt:  FLKKSANE----------------GTRTRIDGIHIRP-----------RTTQVETTSKLL----------------------------------------

Query:  --------------------------------------LGGQVGSSNSNTNQFGK-----------PSTQQQPH-----KNFQQ----------------
                                                GQ  S+ +  NQ  K           P+    PH     KN+ Q                
Subjt:  --------------------------------------LGGQVGSSNSNTNQFGK-----------PSTQQQPH-----KNFQQ----------------

Query:  ----------------------------------------VQFGHIAQEIKNRPQGTLPSKIEIPHREGNEQCKAVTLRSGLEYDGPKYP---------V
                                                +Q G +  E++ RPQG+LPS  E P R G E C ++  RSGL+Y+GP+ P          
Subjt:  ----------------------------------------VQFGHIAQEIKNRPQGTLPSKIEIPHREGNEQCKAVTLRSGLEYDGPKYP---------V

Query:  NQEAEIPKKLEKRFLK-----RNPEYKPSPPYPQRFKHASQDVQFKKFLDVLKQLHINIPLVEALEQMPTYVKFLKDILSKKRRLGEYEIVVLTECSSAL
             +P K+ +  +      +    +P PP+PQR    +QD  F+KFLD+LKQLHINIP VEALEQMPTY KF+KDI+++K++LGEYE V LTECSS +
Subjt:  NQEAEIPKKLEKRFLK-----RNPEYKPSPPYPQRFKHASQDVQFKKFLDVLKQLHINIPLVEALEQMPTYVKFLKDILSKKRRLGEYEIVVLTECSSAL

Query:  VKNEISPKLKDPGSFTIPCSIGGRDVGRVLCDLGASINLMPCSVFK------------------QSLTHPIGKIEDILVNVDKFVFPVDFIILDCEADKD
         K+++ PKLKDPGSFTIPC IGG+DVGR LCDLGASINLMP S+FK                  +S+T P GKIED+LV VDKF+FP DFIILDCEADKD
Subjt:  VKNEISPKLKDPGSFTIPCSIGGRDVGRVLCDLGASINLMPCSVFK------------------QSLTHPIGKIEDILVNVDKFVFPVDFIILDCEADKD

Query:  VSIILGRLFLATGRTLIDVQKGELTMRVNDQQVTFNVMNAMKYPGDSEECSMID--------EIDELSRSTFQEMMKGETLEDILGDEEPEVKGGKKIEV
        V IILGR FLATG TLIDV+KGELTMRV+DQ+VTFN+++AMKY  D EEC++I         E+D+L  +  +  ++    E I+     + +  K I+ 
Subjt:  VSIILGRLFLATGRTLIDVQKGELTMRVNDQQVTFNVMNAMKYPGDSEECSMID--------EIDELSRSTFQEMMKGETLEDILGDEEPEVKGGKKIEV

Query:  EEINQP
         +I  P
Subjt:  EEINQP

A0A6J1DZC3 uncharacterized protein LOC111024449 (e value: 1.4e-85; %identity: 35.96)
Query:  LADKFLEKFFPSNKNAKYRAKIIDFRQSYNEPLDAAWERFQRLVRKFPHHGLPACIILEHFYNGLDQASKALVNASANESFLKKSAN-------------
        + DKFL K+FP  KNA  R +II FRQ  NE ++  WERF+ L+R  P+ G+PAC+ +EHFY   D  +K ++N +AN  F  K+ N             
Subjt:  LADKFLEKFFPSNKNAKYRAKIIDFRQSYNEPLDAAWERFQRLVRKFPHHGLPACIILEHFYNGLDQASKALVNASANESFLKKSAN-------------

Query:  -----EGTRTR-------------------------------IDGIHIRPRTTQVETT----------------------------SKLLLGGQVGSSNS
             E +RT+                               + G ++ P  T   T                             + L   GQ  S+N+
Subjt:  -----EGTRTR-------------------------------IDGIHIRPRTTQVETT----------------------------SKLLLGGQVGSSNS

Query:  NTNQ--------------FGKPSTQQQPHKNFQQ----------------------------------------------------------VQFGHIAQ
           Q                 P  Q    KN  Q                                                           Q G +A 
Subjt:  NTNQ--------------FGKPSTQQQPHKNFQQ----------------------------------------------------------VQFGHIAQ

Query:  EIKNRPQGTLPSKIEIPHREGNEQCKAVTLRSGLEYDGPKYPVNQEAEIPKKLEKRFLKRNPEYKPSPPYPQRFKHASQDVQFKKFLDVLKQLHINIPLV
        E+KNRP+GTLPS  E P  EG E CK +T RSGL Y+ PK P  + +  P K      ++    +P  P                         I I   
Subjt:  EIKNRPQGTLPSKIEIPHREGNEQCKAVTLRSGLEYDGPKYPVNQEAEIPKKLEKRFLKRNPEYKPSPPYPQRFKHASQDVQFKKFLDVLKQLHINIPLV

Query:  EALEQMPTYVKFLKDILSKKRRLGEYEIVVLTECSSALVKNEISPKLKDPGSFTIPCSIGGRDVGRVLCDLGASINLMPCSVFK----------------
            +MPTY KFLKDI+++K++LGEYE V LTECSS + K+++SPKLKDPGSFTIPCSIGG+DVGR LCDL ASINLMP S+FK                
Subjt:  EALEQMPTYVKFLKDILSKKRRLGEYEIVVLTECSSALVKNEISPKLKDPGSFTIPCSIGGRDVGRVLCDLGASINLMPCSVFK----------------

Query:  --QSLTHPIGKIEDILVNVDKFVFPVDFIILDCEADKDVSIILGRLFLATGRTLIDVQKGELTMRVNDQQVTFNVMNAMKYPGDSEECSMID--------
          +S+T P GKIED+LV VDKF+FP DFIIL+CEADKDV IILGR FL+TG TLIDV+KGELTM V+DQ+VTFN+++AMKYP D EEC+ I         
Subjt:  --QSLTHPIGKIEDILVNVDKFVFPVDFIILDCEADKDVSIILGRLFLATGRTLIDVQKGELTMRVNDQQVTFNVMNAMKYPGDSEECSMID--------

Query:  EIDELSRSTFQEMMKGETLEDIL
        E+D+L  +  +  ++    E I+
Subjt:  EIDELSRSTFQEMMKGETLEDIL

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found

Sequences
CDS sequence
ATGAGCGATCCGCCTGGGGTAAGGTTCCAGAAGATCCACTTGATCCCCAGAATCGTGTGTTGTAGCCAAATCTGCTACTGGAGCAAAATGGACAGGATTGCACGCCCTCA
AATTGAGGCAGCAAATTTTGAAATGAAACCGGTAATGTTTCAGATGTTACAAACCGTGGGACAGTTCCATGGACCAATTAGGACATGGAATGAGTTAGCAGAAAAATTTC
GTAGTAAATATTTTCCACCGACTAGGAATGTCAAATTGAGGAGTGAGATAGTAGGGTTTAGGCAACTTGAAGATGAAACTTTTAGTGAGGCTTGGGAGAGGTTTAAGGAG
CTTTCGCGAAAGTGTCCCCACTATGGTTTACCTCATTGTATCCAAATGGAAGCATTTTACAATGGGTTAAATGGAGCAATCCAAGTGATTAGTCATCAGCAACCAGTTGT
GGAGCCTGCTGCACTGGTGAACCAAGTTGTAGAGAAATCATGTGTTTATTGTGGTGAAGAGCACAACTACGAGTTTTCCCCAGCAATCCAGCTTCTGTTGGGCCAGCTAG
CTAATGAGCTGAAGGTAAGGCCCAAGGGAAACTTCTTAGATACTGAGCACCCTAGAAGGGAAGGTAAGGACCAGTTGGAGTCTGGTAAAGGTGCTGGAGGCAGCAATAAT
GATGTTGGAGCATCTAGTTCTGTTCAAATGTGGAACCACCTTATGTGTCGCCCCCACCTTATGTACCACCTCTACCTTTTCCACAAAGACAAAGGCCTAAGAATCAGGAT
GGTCAATTTAAGAAGTGGAAAAGAGTTAGGTAGAGCACATTGTGATTTAGGTGCAAGCATTAACATTATGCCTCTTTCGGTCTATCGAAAGCTAGGTATTGGTGAAGCTA
GGCCTACCACAGTCACACTCCAACTTGCTGATAGGTCTATCACATATCCTGAGGGTAAAATTGATGATGTCTTAGTAAAGACGTTGATAGATGTTCAAAAAGGGGAATTA
ACAATGAGGGTTTATAATGAGGAAGTGAAGTTTAATGTTTTGAAGGCCATGAAGTATCCAGACGAAGTGGAAGATTGTTCTTTCATTAGGATTCTGGAGAACATAATTGT
TGAGACAACAATGGAGGATTTGGCGAATAAACATTTGGAAGGTCATGGAGAGGGAGATGCTGAAGCTAGGTTGGATTCATTCTCTCCAAACTCCATCACTTCTTGGAATG
ATTTGGCAGATAAATTCTTAGAGAAGTTTTTTCCTTCTAATAAAAATGCCAAATATAGAGCTAAAATTATTGATTTCAGACAATCTTACAATGAACCTCTAGATGCAGCT
TGGGAAAGATTTCAAAGGTTGGTTCGGAAGTTTCCACACCACGGATTGCCTGCTTGCATCATCTTAGAGCATTTTTATAATGGATTAGATCAAGCTTCGAAGGCACTAGT
CAATGCATCTGCAAACGAATCTTTCTTGAAGAAGTCTGCAAATGAGGGAACCAGAACCAGAATAGATGGAATCCATATTCGGCCACGTACAACCCAGGTGGAGACAACAT
CCAAACTTCTATTGGGAGGACAAGTCGGGTCAAGTAATTCCAACACAAATCAATTTGGGAAACCTTCTACACAACAACAGCCGCATAAGAACTTCCAACAAGTCCAGTTT
GGACATATTGCTCAGGAGATTAAGAATAGACCGCAAGGGACATTGCCTAGCAAGATCGAGATCCCTCATAGAGAAGGAAATGAGCAATGCAAGGCAGTGACCTTAAGAAG
TGGATTAGAATACGATGGCCCAAAGTATCCAGTAAATCAAGAAGCAGAAATCCCAAAGAAGCTCGAGAAAAGATTTTTAAAAAGAAATCCAGAATACAAACCATCTCCTC
CATACCCTCAGAGATTCAAACACGCTTCGCAAGATGTACAGTTCAAGAAGTTCTTAGATGTATTAAAGCAGCTACACATCAATATCCCATTGGTAGAAGCACTTGAACAA
ATGCCTACCTATGTGAAGTTCCTGAAAGATATTCTATCAAAGAAGAGAAGGTTGGGAGAATACGAAATTGTTGTACTCACTGAATGTTCCAGTGCACTGGTCAAGAATGA
GATCTCTCCCAAGCTCAAAGACCCAGGAAGTTTCACCATTCCATGCTCCATTGGAGGCAGGGATGTAGGCAGAGTTTTGTGCGACTTAGGAGCGAGCATCAACTTAATGC
CATGCTCAGTTTTTAAACAATCACTTACACATCCTATCGGAAAGATTGAAGACATATTGGTCAATGTTGATAAGTTTGTCTTCCCTGTAGATTTCATCATTTTAGATTGT
GAAGCTGACAAGGACGTGTCAATCATCTTAGGACGACTGTTCCTAGCCACTGGCAGAACCTTAATAGATGTGCAGAAAGGAGAATTGACCATGAGGGTCAACGATCAACA
AGTCACGTTCAATGTGATGAATGCAATGAAGTACCCTGGAGATTCCGAGGAATGTTCTATGATTGATGAAATTGATGAACTCTCCCGGTCTACTTTTCAGGAAATGATGA
AGGGAGAAACATTGGAGGATATACTTGGAGATGAAGAACCAGAAGTAAAAGGTGGGAAGAAGATTGAAGTTGAAGAGATCAATCAACCCATGAAAAGACAAAGGATTAAA
CCATATTGGGGAAGGGCTTCGAGGATGAGGAAGCCCTTGTTTCCGTGA
mRNA sequence
ATGAGCGATCCGCCTGGGGTAAGGTTCCAGAAGATCCACTTGATCCCCAGAATCGTGTGTTGTAGCCAAATCTGCTACTGGAGCAAAATGGACAGGATTGCACGCCCTCA
AATTGAGGCAGCAAATTTTGAAATGAAACCGGTAATGTTTCAGATGTTACAAACCGTGGGACAGTTCCATGGACCAATTAGGACATGGAATGAGTTAGCAGAAAAATTTC
GTAGTAAATATTTTCCACCGACTAGGAATGTCAAATTGAGGAGTGAGATAGTAGGGTTTAGGCAACTTGAAGATGAAACTTTTAGTGAGGCTTGGGAGAGGTTTAAGGAG
CTTTCGCGAAAGTGTCCCCACTATGGTTTACCTCATTGTATCCAAATGGAAGCATTTTACAATGGGTTAAATGGAGCAATCCAAGTGATTAGTCATCAGCAACCAGTTGT
GGAGCCTGCTGCACTGGTGAACCAAGTTGTAGAGAAATCATGTGTTTATTGTGGTGAAGAGCACAACTACGAGTTTTCCCCAGCAATCCAGCTTCTGTTGGGCCAGCTAG
CTAATGAGCTGAAGGTAAGGCCCAAGGGAAACTTCTTAGATACTGAGCACCCTAGAAGGGAAGGTAAGGACCAGTTGGAGTCTGGTAAAGGTGCTGGAGGCAGCAATAAT
GATGTTGGAGCATCTAGTTCTGTTCAAATGTGGAACCACCTTATGTGTCGCCCCCACCTTATGTACCACCTCTACCTTTTCCACAAAGACAAAGGCCTAAGAATCAGGAT
GGTCAATTTAAGAAGTGGAAAAGAGTTAGGTAGAGCACATTGTGATTTAGGTGCAAGCATTAACATTATGCCTCTTTCGGTCTATCGAAAGCTAGGTATTGGTGAAGCTA
GGCCTACCACAGTCACACTCCAACTTGCTGATAGGTCTATCACATATCCTGAGGGTAAAATTGATGATGTCTTAGTAAAGACGTTGATAGATGTTCAAAAAGGGGAATTA
ACAATGAGGGTTTATAATGAGGAAGTGAAGTTTAATGTTTTGAAGGCCATGAAGTATCCAGACGAAGTGGAAGATTGTTCTTTCATTAGGATTCTGGAGAACATAATTGT
TGAGACAACAATGGAGGATTTGGCGAATAAACATTTGGAAGGTCATGGAGAGGGAGATGCTGAAGCTAGGTTGGATTCATTCTCTCCAAACTCCATCACTTCTTGGAATG
ATTTGGCAGATAAATTCTTAGAGAAGTTTTTTCCTTCTAATAAAAATGCCAAATATAGAGCTAAAATTATTGATTTCAGACAATCTTACAATGAACCTCTAGATGCAGCT
TGGGAAAGATTTCAAAGGTTGGTTCGGAAGTTTCCACACCACGGATTGCCTGCTTGCATCATCTTAGAGCATTTTTATAATGGATTAGATCAAGCTTCGAAGGCACTAGT
CAATGCATCTGCAAACGAATCTTTCTTGAAGAAGTCTGCAAATGAGGGAACCAGAACCAGAATAGATGGAATCCATATTCGGCCACGTACAACCCAGGTGGAGACAACAT
CCAAACTTCTATTGGGAGGACAAGTCGGGTCAAGTAATTCCAACACAAATCAATTTGGGAAACCTTCTACACAACAACAGCCGCATAAGAACTTCCAACAAGTCCAGTTT
GGACATATTGCTCAGGAGATTAAGAATAGACCGCAAGGGACATTGCCTAGCAAGATCGAGATCCCTCATAGAGAAGGAAATGAGCAATGCAAGGCAGTGACCTTAAGAAG
TGGATTAGAATACGATGGCCCAAAGTATCCAGTAAATCAAGAAGCAGAAATCCCAAAGAAGCTCGAGAAAAGATTTTTAAAAAGAAATCCAGAATACAAACCATCTCCTC
CATACCCTCAGAGATTCAAACACGCTTCGCAAGATGTACAGTTCAAGAAGTTCTTAGATGTATTAAAGCAGCTACACATCAATATCCCATTGGTAGAAGCACTTGAACAA
ATGCCTACCTATGTGAAGTTCCTGAAAGATATTCTATCAAAGAAGAGAAGGTTGGGAGAATACGAAATTGTTGTACTCACTGAATGTTCCAGTGCACTGGTCAAGAATGA
GATCTCTCCCAAGCTCAAAGACCCAGGAAGTTTCACCATTCCATGCTCCATTGGAGGCAGGGATGTAGGCAGAGTTTTGTGCGACTTAGGAGCGAGCATCAACTTAATGC
CATGCTCAGTTTTTAAACAATCACTTACACATCCTATCGGAAAGATTGAAGACATATTGGTCAATGTTGATAAGTTTGTCTTCCCTGTAGATTTCATCATTTTAGATTGT
GAAGCTGACAAGGACGTGTCAATCATCTTAGGACGACTGTTCCTAGCCACTGGCAGAACCTTAATAGATGTGCAGAAAGGAGAATTGACCATGAGGGTCAACGATCAACA
AGTCACGTTCAATGTGATGAATGCAATGAAGTACCCTGGAGATTCCGAGGAATGTTCTATGATTGATGAAATTGATGAACTCTCCCGGTCTACTTTTCAGGAAATGATGA
AGGGAGAAACATTGGAGGATATACTTGGAGATGAAGAACCAGAAGTAAAAGGTGGGAAGAAGATTGAAGTTGAAGAGATCAATCAACCCATGAAAAGACAAAGGATTAAA
CCATATTGGGGAAGGGCTTCGAGGATGAGGAAGCCCTTGTTTCCGTGA
Protein sequence
MSDPPGVRFQKIHLIPRIVCCSQICYWSKMDRIARPQIEAANFEMKPVMFQMLQTVGQFHGPIRTWNELAEKFRSKYFPPTRNVKLRSEIVGFRQLEDETFSEAWERFKE
LSRKCPHYGLPHCIQMEAFYNGLNGAIQVISHQQPVVEPAALVNQVVEKSCVYCGEEHNYEFSPAIQLLLGQLANELKVRPKGNFLDTEHPRREGKDQLESGKGAGGSNN
DVGASSSVQMWNHLMCRPHLMYHLYLFHKDKGLRIRMVNLRSGKELGRAHCDLGASINIMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIDDVLVKTLIDVQKGEL
TMRVYNEEVKFNVLKAMKYPDEVEDCSFIRILENIIVETTMEDLANKHLEGHGEGDAEARLDSFSPNSITSWNDLADKFLEKFFPSNKNAKYRAKIIDFRQSYNEPLDAA
WERFQRLVRKFPHHGLPACIILEHFYNGLDQASKALVNASANESFLKKSANEGTRTRIDGIHIRPRTTQVETTSKLLLGGQVGSSNSNTNQFGKPSTQQQPHKNFQQVQF
GHIAQEIKNRPQGTLPSKIEIPHREGNEQCKAVTLRSGLEYDGPKYPVNQEAEIPKKLEKRFLKRNPEYKPSPPYPQRFKHASQDVQFKKFLDVLKQLHINIPLVEALEQ
MPTYVKFLKDILSKKRRLGEYEIVVLTECSSALVKNEISPKLKDPGSFTIPCSIGGRDVGRVLCDLGASINLMPCSVFKQSLTHPIGKIEDILVNVDKFVFPVDFIILDC
EADKDVSIILGRLFLATGRTLIDVQKGELTMRVNDQQVTFNVMNAMKYPGDSEECSMIDEIDELSRSTFQEMMKGETLEDILGDEEPEVKGGKKIEVEEINQPMKRQRIK
PYWGRASRMRKPLFP
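
The CDS and protein sequences in this record can be spot-checked against each other by translating a few codons. The sketch below assumes the standard genetic code (the page does not say which table the annotation used). The names `CODON_TABLE` and `translate` are illustrative only. The two input strings are the first 18 and the last 15 nucleotides of the CDS listed in this record.

```python
# Standard genetic code, built in TCAG order (assumed to apply to this gene).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((a, b, c) for a in BASES for b in BASES for c in BASES),
        AMINO_ACIDS,
    )
}

def translate(nucleotides):
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    if len(nucleotides) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(
        CODON_TABLE[nucleotides[i:i + 3]]
        for i in range(0, len(nucleotides), 3)
    )

cds_start = "ATGAGCGATCCGCCTGGG"  # first 18 nt of the CDS above
cds_end = "CCCTTGTTTCCGTGA"       # last 15 nt of the CDS above
print(translate(cds_start))  # compare with the start of the protein (MSDPPG...)
print(translate(cds_end))    # compare with the end of the protein (...PLFP) plus stop
```

Translating the full CDS the same way and comparing it with the listed protein would be a more complete check; the short fragments here only keep the example compact.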