; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc08G10560 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc08G10560
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionCOBW domain-containing protein
Genome locationClcChr08:21873424..21882022
RNA-Seq ExpressionClc08G10560
SyntenyClc08G10560
Gene Ontology termsGO:0005737 - cytoplasm (cellular component)
GO:0000166 - nucleotide binding (molecular function)
InterPro domainsIPR003495 - CobW/HypB/UreG, nucleotide-binding domain
IPR011629 - Cobalamin (vitamin B12) biosynthesis CobW-like, C-terminal
IPR013641 - Protein KTI12/L-seryl-tRNA(Sec) kinase
IPR027417 - P-loop containing nucleoside triphosphate hydrolase
IPR036627 - CobW-like, C-terminal domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN64163.1 hypothetical protein VITISV_040646 [Vitis vinifera]2.2e-20762.27Show/hide
Query:  EDEEDPPLAVEISSTITAAERLEDDGVSVGVTVITGYLGAGKSTLVNYILNSQHGKRIAVILNEFGEEIGVERAMINEGDGGALVEEWVELANGCICCTV
        E++EDPPLAVEI+   +   + ++D VSVGVTVITGYLGAGKST VN+ILNSQHGKRIAVILNEFGEEIGVERAMINEGD GALVEEWVELANGCICCTV
Subjt:  EDEEDPPLAVEISSTITAAERLEDDGVSVGVTVITGYLGAGKSTLVNYILNSQHGKRIAVILNEFGEEIGVERAMINEGDGGALVEEWVELANGCICCTV

Query:  KHNLVQALEQLVQRKERYSLAIISNFNGASVKDRLMIQSFLHVSRLDHILLETTGLANPAPLASVLWLDDQLESSIKLDSIITDTIILNKVDLVSSELGD
        KH+LVQALEQLVQ KE                            RLDHILLETTGLANPAPLASVLWLDDQLESS++LDSIITD +ILNKVDLVS E   
Subjt:  KHNLVQALEQLVQRKERYSLAIISNFNGASVKDRLMIQSFLHVSRLDHILLETTGLANPAPLASVLWLDDQLESSIKLDSIITDTIILNKVDLVSSELGD

Query:  GALEDLEYEIHNINSLAKIVHSVRCQVDLSLILNCNSYNAANTAHLEALLKESRSLSTRDLHDTGVRTLCISDHDKVDLDKVHSWLEEILWEKKGGMDVY
        G LE+LE EIHNINSLA I+HSVRCQVDLS IL+C +Y+A +  HLEALL+E++SLS++DLHD+ VRTLCIS+   VDLDKV  WLEEILW+KK  MDVY
Subjt:  GALEDLEYEIHNINSLAKIVHSVRCQVDLSLILNCNSYNAANTAHLEALLKESRSLSTRDLHDTGVRTLCISDHDKVDLDKVHSWLEEILWEKKGGMDVY

Query:  RCKGVLSIKNSDQLHTLQAVRELYEVVPSRQWNNGESQTN------------KIVFI------GRNLNEDVLSETFRACSSVPPLSLSSLVS--------
        RCKGVL + +SDQLHTLQAVRE+YE+VP+ +  + +   N             +V I      G++     L    +   S   + +    S        
Subjt:  RCKGVLSIKNSDQLHTLQAVRELYEVVPSRQWNNGESQTN------------KIVFI------GRNLNEDVLSETFRACSSVPPLSLSSLVS--------

Query:  ----PPPPFFRSCRRSIPPNGSALIFHSIVVTDSLNSIKGYRYELWCLARGTGIRYCVLYCDVEETHCRKWNEERREKGEANYDDNIFEDLVRRFERPDS
                  R   RS      ++   SI++ DSLNSIK                        EE+HCR WNE+R E GEA+YDD IFEDLVRRFE+PD 
Subjt:  ----PPPPFFRSCRRSIPPNGSALIFHSIVVTDSLNSIKGYRYELWCLARGTGIRYCVLYCDVEETHCRKWNEERREKGEANYDDNIFEDLVRRFERPDS

Query:  RNRWDSPLFELFPHRDGIEKSSQVILDAVSYLTKTVDSKSRDVKILQPTIATQNSRFSEANSLYELDKATQEVVNAIVEA--QALGGPLNSIPLGQELPT
        RNRWDSPLFEL+P RDGIEKSS  ILDAVSYLTK  DSK+RD+KILQPTIAT ++RFSEANSLYE+D+ATQE++NAIVEA  QA+GGP+N I LGQ L  
Subjt:  RNRWDSPLFELFPHRDGIEKSSQVILDAVSYLTKTVDSKSRDVKILQPTIATQNSRFSEANSLYELDKATQEVVNAIVEA--QALGGPLNSIPLGQELPT

Query:  IYISRSVGLPELRRLRRTFIKLTGQTSLSGPPPPSDAESAKRMFIDYLNREL
        I IS+SVGLPELRRLRRTFIKLTGQTSLSGPPPPSD++SAKRMF+DYLNREL
Subjt:  IYISRSVGLPELRRLRRTFIKLTGQTSLSGPPPPSDAESAKRMFIDYLNREL

EOY01736.1 Cobalamin biosynthesis CobW-like protein [Theobroma cacao]3.8e-23164.33Show/hide
Query:  DEEDPPLAVEISSTITA-----AERLEDDGVSVGVTVITGYLGAGKSTLVNYILNSQHGKRIAVILNEFGEEIGVERAMINEGDGGALVEEWVELANGCI
        DEE+PPLA++I   + A      E+ +DD VSVGVTVITGYLGAGKSTLVNYILN+QHGKRIAVILNEFGEEIGVERAMINEG+ GALVEEWVELANGC+
Subjt:  DEEDPPLAVEISSTITA-----AERLEDDGVSVGVTVITGYLGAGKSTLVNYILNSQHGKRIAVILNEFGEEIGVERAMINEGDGGALVEEWVELANGCI

Query:  CCTVKHNLVQALEQLVQRKERYSLAIISNFNGASVKDRLMIQSFLHVSRLDHILLETTGLANPAPLASVLWLDDQLESSIKLDSIIT-------------
        CCTVKH+LVQALEQLVQ K+                            RLDHILLETTGLANPAPLASVLWLDDQLESS+KLDSI+T             
Subjt:  CCTVKHNLVQALEQLVQRKERYSLAIISNFNGASVKDRLMIQSFLHVSRLDHILLETTGLANPAPLASVLWLDDQLESSIKLDSIIT-------------

Query:  -----------------DTIILNKVDLVSSELGDGALEDLEYEIHNINSLAKIVHSVRCQVDLSLILNCNSYNAANTAHLEALLKESRSLSTRDLHDTGV
                         D +ILNKVDLVS E  +G +E+LE EIH+INSLA ++HSVRCQVDLS ILN  +Y+A +  HLEALL+ES+S+ +RDLHD+GV
Subjt:  -----------------DTIILNKVDLVSSELGDGALEDLEYEIHNINSLAKIVHSVRCQVDLSLILNCNSYNAANTAHLEALLKESRSLSTRDLHDTGV

Query:  RTLCISDHDKVDLDKVHSWLEEILWEKKGGMDVYRCKGVLSIKNSDQLHTLQAVRELYEVVPSRQWNNGESQTNKIVFIGRNLNEDVLSETFRA------
        RTLC +  + VDL KV  W+EEILW+KK G+DVYRCKGVLSI+NSD LHTLQAVRE+YE+VP+RQW   + Q N+IVFIG NL+E++L+ +FR       
Subjt:  RTLCISDHDKVDLDKVHSWLEEILWEKKGGMDVYRCKGVLSIKNSDQLHTLQAVRELYEVVPSRQWNNGESQTNKIVFIGRNLNEDVLSETFRA------

Query:  ----CSSVPPLSLSSLVSPPPPFFRSCRRSIPPNGSA------LIFHSIVVTD----------SLNSIKGYRYELWCLARGTGIRYCVLYCDVEETHCRK
            C ++  L+L   V   P    S   S+ P+ +A      LIF    ++D              IKGYRYELWCLAR  GIRYCVLYCDVEE  CRK
Subjt:  ----CSSVPPLSLSSLVSPPPPFFRSCRRSIPPNGSA------LIFHSIVVTD----------SLNSIKGYRYELWCLARGTGIRYCVLYCDVEETHCRK

Query:  WNEERREKGEANYDDNIFEDLVRRFERPDSRNRWDSPLFELFPHRDGIEKSSQVILDAVSYLTKTVDSKSRDVKILQPTIATQNSRFSEANSLYELDKAT
        WNEERREKGEA Y+D IFEDL RRFE+PD RNRWDSPL+EL+PH+DG+EKSS  I D VSYLTK VDSKSRDVKILQPTIATQN RFSEANSLYELD+AT
Subjt:  WNEERREKGEANYDDNIFEDLVRRFERPDSRNRWDSPLFELFPHRDGIEKSSQVILDAVSYLTKTVDSKSRDVKILQPTIATQNSRFSEANSLYELDKAT

Query:  QEVVNAIVEA--QALGGPLNSIPLGQELPTIYISRSVGLPELRRLRRTFIKLTGQTSLSGPPPPSDAESAKRMFIDYLNRELET
        QEV+NAIVEA  QA GGPL  I + Q LP+I ISRSVGLPELRRLRRTFIKLTGQTSLSG PPPSDA+SAKRMF+DYLNREL T
Subjt:  QEVVNAIVEA--QALGGPLNSIPLGQELPTIYISRSVGLPELRRLRRTFIKLTGQTSLSGPPPPSDAESAKRMFIDYLNRELET

OMO71963.1 hypothetical protein COLO4_27903 [Corchorus olitorius]3.3e-22761.03Show/hide
Query:  EDEEDPPLAVEISSTI-----TAAERLEDDGVSVGVTVITGYLGAGKSTLVNYILNSQHGKRIAVILNEFGEEIGVERAMINEGDGGALVEEWVELANGC
        ++EE+PPLA++I  T+     +++E+   D VSVGVTVITGYLGAGKSTLVNYILN+QHGKRIAVILNEFGEEIGVERAMINEG+ GALVEEWVELANGC
Subjt:  EDEEDPPLAVEISSTI-----TAAERLEDDGVSVGVTVITGYLGAGKSTLVNYILNSQHGKRIAVILNEFGEEIGVERAMINEGDGGALVEEWVELANGC

Query:  ICCTVKHNLVQALEQLVQRKERYSLAIISNFNGASVKDRLMIQSFLHVSRLDHILLETTGLANPAPLASVLWLDDQLESSIKLDSIIT------------
        +CCTVKH+LVQALEQLVQ K+                            RLDHILLETTGLANPAPLASVLWLDDQLESS+KLDSI+T            
Subjt:  ICCTVKHNLVQALEQLVQRKERYSLAIISNFNGASVKDRLMIQSFLHVSRLDHILLETTGLANPAPLASVLWLDDQLESSIKLDSIIT------------

Query:  ------------------DTIILNKVDLVSSELGDGALEDLEYEIHNINSLAKIVHSVRCQVDLSLILNCNSYNAANTAHLEALLKESRSLSTRDLHDTG
                          D ++LNKVDLVSSE  +GA+EDLE EI +INSLA I+ SVRCQVDLSL+LN  +Y+A + +HLEALL+ES+S+ ++DLHD+G
Subjt:  ------------------DTIILNKVDLVSSELGDGALEDLEYEIHNINSLAKIVHSVRCQVDLSLILNCNSYNAANTAHLEALLKESRSLSTRDLHDTG

Query:  VRTLCISDHDKVDLDKVHSWLEEILWEKKGGMDVYRCKGVLSIKNSDQLHTLQAVRELYEVVPSRQWNNGESQTNKIVFI--------------------
        VRTLCI+  + VDL+KV  W+EEILWEKK  +DVYRCKGVLSI+NSD LHTLQAVRE+YE+VP+RQW   E Q N+IVFI                    
Subjt:  VRTLCISDHDKVDLDKVHSWLEEILWEKKGGMDVYRCKGVLSIKNSDQLHTLQAVRELYEVVPSRQWNNGESQTNKIVFI--------------------

Query:  ------------------------------------GRNLNEDVLSETFRACSSVP---PLSLSSLVSPPPPFFRSCRRSIPPNGSALIFHSIVVTDSLN
                                               LNE    +T R           + S    P     R   RS      ++   +I++ DSLN
Subjt:  ------------------------------------GRNLNEDVLSETFRACSSVP---PLSLSSLVSPPPPFFRSCRRSIPPNGSALIFHSIVVTDSLN

Query:  SIKGYRYELWCLARGTGIRYCVLYCDVEETHCRKWNEERREKGEANYDDNIFEDLVRRFERPDSRNRWDSPLFELFPHRDGIEKSSQVILDAVSYLTKTV
        SIKGYRYELWCLAR  G+RYCVLYCDVEE  CRKWNEERREKGEA Y+D IFEDL RRFE PD RNRWDSPL+EL+PH+DG+EKSS  I DAVSYLTK V
Subjt:  SIKGYRYELWCLARGTGIRYCVLYCDVEETHCRKWNEERREKGEANYDDNIFEDLVRRFERPDSRNRWDSPLFELFPHRDGIEKSSQVILDAVSYLTKTV

Query:  DSKSRDVKILQPTIATQNSRFSEANSLYELDKATQEVVNAIVEA--QALGGPLNSIPLGQELPTIYISRSVGLPELRRLRRTFIKLTGQTSLSGPPPPSD
        DSKSRDVKILQPTIATQN+RFSEANSLYE+D+ATQEV+NAIVEA  QA+GGPL  I +GQ LPTI ISRSVGLPELRRLRRTFIKLTGQTSLSG PPPSD
Subjt:  DSKSRDVKILQPTIATQNSRFSEANSLYELDKATQEVVNAIVEA--QALGGPLNSIPLGQELPTIYISRSVGLPELRRLRRTFIKLTGQTSLSGPPPPSD

Query:  AESAKRMFIDYLNREL
        AESAKRMF+DYLNREL
Subjt:  AESAKRMFIDYLNREL

OMO93409.1 hypothetical protein CCACVL1_06500 [Corchorus capsularis]1.3e-22861.73Show/hide
Query:  EDEEDPPLAVEISSTI-----TAAERLEDDGVSVGVTVITGYLGAGKSTLVNYILNSQHGKRIAVILNEFGEEIGVERAMINEGDGGALVEEWVELANGC
        ++EE+PPLA++I  T+     +++E+   D VSVGVTVITGYLGAGKSTLVNYILN+QHGKRIAVILNEFGEEIGVERAMINEG+ GALVEEWVELANGC
Subjt:  EDEEDPPLAVEISSTI-----TAAERLEDDGVSVGVTVITGYLGAGKSTLVNYILNSQHGKRIAVILNEFGEEIGVERAMINEGDGGALVEEWVELANGC

Query:  ICCTVKHNLVQALEQLVQRKERYSLAIISNFNGASVKDRLMIQSFLHVSRLDHILLETTGLANPAPLASVLWLDDQLESSIKLDSIIT------------
        +CCTVKH+LVQALEQLVQ K+                            RLDHILLETTGLANPAPLASVLWLDDQLESS+KLDSI+T            
Subjt:  ICCTVKHNLVQALEQLVQRKERYSLAIISNFNGASVKDRLMIQSFLHVSRLDHILLETTGLANPAPLASVLWLDDQLESSIKLDSIIT------------

Query:  ------------------DTIILNKVDLVSSELGDGALEDLEYEIHNINSLAKIVHSVRCQVDLSLILNCNSYNAANTAHLEALLKESRSLSTRDLHDTG
                          D +ILNKVDLVSSE  +GA+EDLE EI +INSLA IV SVRCQVDLSLILN  +Y+A + +HLEALL+ES+S+ ++DLHD+G
Subjt:  ------------------DTIILNKVDLVSSELGDGALEDLEYEIHNINSLAKIVHSVRCQVDLSLILNCNSYNAANTAHLEALLKESRSLSTRDLHDTG

Query:  VRTLCISDHDKVDLDKVHSWLEEILWEKKGGMDVYRCKGVLSIKNSDQLHTLQAVRELYEVVPSRQWNNGESQTNKIVFI--------------------
        VRTLCI+  + VDL+KV  W+EEILWEKK GMDVYRCKGVLSI+NSD LHTLQAVRE+YE+VP+RQW   E Q N+IVFI                    
Subjt:  VRTLCISDHDKVDLDKVHSWLEEILWEKKGGMDVYRCKGVLSIKNSDQLHTLQAVRELYEVVPSRQWNNGESQTNKIVFI--------------------

Query:  ------------------------------------GRNLNEDVLSETFRACSSVP---PLSLSSLVSPPPPFFRSCRRSIPPNGSALIFHSIVVTDSLN
                                               LNE    +T R           + S    P     R   RS      ++   +I++ DSLN
Subjt:  ------------------------------------GRNLNEDVLSETFRACSSVP---PLSLSSLVSPPPPFFRSCRRSIPPNGSALIFHSIVVTDSLN

Query:  SIKGYRYELWCLARGTGIRYCVLYCDVEETHCRKWNEERREKGEANYDDNIFEDLVRRFERPDSRNRWDSPLFELFPHRDGIEKSSQVILDAVSYLTKTV
        SIKGYRYELWCLAR  G+RYCVLYCDVEE  CRKWNEERREKGEA Y+D IFEDL RRFE PD RNRWDSPL+EL+PH+DG+EKSS  I DAVSYLTK V
Subjt:  SIKGYRYELWCLARGTGIRYCVLYCDVEETHCRKWNEERREKGEANYDDNIFEDLVRRFERPDSRNRWDSPLFELFPHRDGIEKSSQVILDAVSYLTKTV

Query:  DSKSRDVKILQPTIATQNSRFSEANSLYELDKATQEVVNAIVEA--QALGGPLNSIPLGQELPTIYISRSVGLPELRRLRRTFIKLTGQTSLSGPPPPSD
        DSKSRDVKILQPTIATQN+RFSEANSLYE+D+ATQEV+NAIVEA  QA+GGPL  I +GQ LPTI ISRSVGLPELRRLRRTFIKLTGQTSLSG PPPSD
Subjt:  DSKSRDVKILQPTIATQNSRFSEANSLYELDKATQEVVNAIVEA--QALGGPLNSIPLGQELPTIYISRSVGLPELRRLRRTFIKLTGQTSLSGPPPPSD

Query:  AESAKRMFIDYLNREL
        AES KRMFIDYLNREL
Subjt:  AESAKRMFIDYLNREL

XP_021299734.1 LOW QUALITY PROTEIN: COBW domain-containing protein 1 [Herrania umbratica]3.4e-23263.68Show/hide
Query:  EDEEDPPLAVEISSTI-----TAAERLEDDGVSVGVTVITGYLGAGKSTLVNYILNSQHGKRIAVILNEFGEEIGVERAMINEGDGGALVEEWVELANGC
        ++EE+PPLA++I   +     +  E+ +DD VSVGVTVITGYLGAGKSTLVNYILN+QHGKRIAVILNEFGEEIGVERAMINEG+ GALVEEWVELANGC
Subjt:  EDEEDPPLAVEISSTI-----TAAERLEDDGVSVGVTVITGYLGAGKSTLVNYILNSQHGKRIAVILNEFGEEIGVERAMINEGDGGALVEEWVELANGC

Query:  ICCTVKHNLVQALEQLVQRKERYSLAIISNFNGASVKDRLMIQSFLHVSRLDHILLETTGLANPAPLASVLWLDDQLESSIKLDSIIT------------
        +CCTVKH+LVQALEQLVQ K+                            RLDHILLETTGLANPAPLASVLWLDDQLESS+KLDSI+T            
Subjt:  ICCTVKHNLVQALEQLVQRKERYSLAIISNFNGASVKDRLMIQSFLHVSRLDHILLETTGLANPAPLASVLWLDDQLESSIKLDSIIT------------

Query:  ------------------DTIILNKVDLVSSELGDGALEDLEYEIHNINSLAKIVHSVRCQVDLSLILNCNSYNAANTAHLEALLKESRSLSTRDLHDTG
                          D +ILNKVDLVS E  +GA+E+LE EIH+INSLA ++HSVRCQVDLSL+LN  +Y+A +  HLEALL+ES+S+ +RDLHD+G
Subjt:  ------------------DTIILNKVDLVSSELGDGALEDLEYEIHNINSLAKIVHSVRCQVDLSLILNCNSYNAANTAHLEALLKESRSLSTRDLHDTG

Query:  VRTLCISDHDKVDLDKVHSWLEEILWEKKGGMDVYRCKGVLSIKNSDQLHTLQAVRELYEVVPSRQWNNGESQTNKIVFIGRNLNEDVLSETFRA-----
        VRTLCI+  + +DL KV  W+EEILW+KK GMDVYRCKGVL I+NSD LHTLQAVRE+YE+VP+RQW   + Q N+IVFIG NL+E++L+++F       
Subjt:  VRTLCISDHDKVDLDKVHSWLEEILWEKKGGMDVYRCKGVLSIKNSDQLHTLQAVRELYEVVPSRQWNNGESQTNKIVFIGRNLNEDVLSETFRA-----

Query:  -----CSSVPPLSL------SSLVSPPPPFFRSCRRSIPPNGSA------LIFHSIVVTD----------SLNSIKGYRYELWCLARGTGIRYCVLYCDV
             C ++  L+L      S  +SP     +S   S+ P+ +A      LIF    ++D              IKGYRYELWCLAR  GIRYCVLYCD 
Subjt:  -----CSSVPPLSL------SSLVSPPPPFFRSCRRSIPPNGSA------LIFHSIVVTD----------SLNSIKGYRYELWCLARGTGIRYCVLYCDV

Query:  EETHCRKWNEERREKGEANYDDNIFEDLVRRFERPDSRNRWDSPLFELFPHRDGIEKSSQVILDAVSYLTKTVDSKSRDVKILQPTIATQNSRFSEANSL
        EE  CRKWNEERREKGEA Y+D IFEDLVRRFE+PD RNRWDSPL+EL+PH DG+EKSS  I DAVSYLTK VDSKSRDVKILQPTIATQN RFSEANSL
Subjt:  EETHCRKWNEERREKGEANYDDNIFEDLVRRFERPDSRNRWDSPLFELFPHRDGIEKSSQVILDAVSYLTKTVDSKSRDVKILQPTIATQNSRFSEANSL

Query:  YELDKATQEVVNAIVEA--QALGGPLNSIPLGQELPTIYISRSVGLPELRRLRRTFIKLTGQTSLSGPPPPSDAESAKRMFIDYLNRELET
        YELD+ATQEV+NAIVEA  QA GGPL  I +GQ LP+I +SRSVGLPELRRLRRTFIKLTGQTSLSG PPPSDA+SAKRMF+DYLNREL T
Subjt:  YELDKATQEVVNAIVEA--QALGGPLNSIPLGQELPTIYISRSVGLPELRRLRRTFIKLTGQTSLSGPPPPSDAESAKRMFIDYLNRELET

TrEMBL top hitse value%identityAlignment
A0A061E9Y0 Cobalamin biosynthesis CobW-like protein1.8e-23164.33Show/hide
Query:  DEEDPPLAVEISSTITA-----AERLEDDGVSVGVTVITGYLGAGKSTLVNYILNSQHGKRIAVILNEFGEEIGVERAMINEGDGGALVEEWVELANGCI
        DEE+PPLA++I   + A      E+ +DD VSVGVTVITGYLGAGKSTLVNYILN+QHGKRIAVILNEFGEEIGVERAMINEG+ GALVEEWVELANGC+
Subjt:  DEEDPPLAVEISSTITA-----AERLEDDGVSVGVTVITGYLGAGKSTLVNYILNSQHGKRIAVILNEFGEEIGVERAMINEGDGGALVEEWVELANGCI

Query:  CCTVKHNLVQALEQLVQRKERYSLAIISNFNGASVKDRLMIQSFLHVSRLDHILLETTGLANPAPLASVLWLDDQLESSIKLDSIIT-------------
        CCTVKH+LVQALEQLVQ K+                            RLDHILLETTGLANPAPLASVLWLDDQLESS+KLDSI+T             
Subjt:  CCTVKHNLVQALEQLVQRKERYSLAIISNFNGASVKDRLMIQSFLHVSRLDHILLETTGLANPAPLASVLWLDDQLESSIKLDSIIT-------------

Query:  -----------------DTIILNKVDLVSSELGDGALEDLEYEIHNINSLAKIVHSVRCQVDLSLILNCNSYNAANTAHLEALLKESRSLSTRDLHDTGV
                         D +ILNKVDLVS E  +G +E+LE EIH+INSLA ++HSVRCQVDLS ILN  +Y+A +  HLEALL+ES+S+ +RDLHD+GV
Subjt:  -----------------DTIILNKVDLVSSELGDGALEDLEYEIHNINSLAKIVHSVRCQVDLSLILNCNSYNAANTAHLEALLKESRSLSTRDLHDTGV

Query:  RTLCISDHDKVDLDKVHSWLEEILWEKKGGMDVYRCKGVLSIKNSDQLHTLQAVRELYEVVPSRQWNNGESQTNKIVFIGRNLNEDVLSETFRA------
        RTLC +  + VDL KV  W+EEILW+KK G+DVYRCKGVLSI+NSD LHTLQAVRE+YE+VP+RQW   + Q N+IVFIG NL+E++L+ +FR       
Subjt:  RTLCISDHDKVDLDKVHSWLEEILWEKKGGMDVYRCKGVLSIKNSDQLHTLQAVRELYEVVPSRQWNNGESQTNKIVFIGRNLNEDVLSETFRA------

Query:  ----CSSVPPLSLSSLVSPPPPFFRSCRRSIPPNGSA------LIFHSIVVTD----------SLNSIKGYRYELWCLARGTGIRYCVLYCDVEETHCRK
            C ++  L+L   V   P    S   S+ P+ +A      LIF    ++D              IKGYRYELWCLAR  GIRYCVLYCDVEE  CRK
Subjt:  ----CSSVPPLSLSSLVSPPPPFFRSCRRSIPPNGSA------LIFHSIVVTD----------SLNSIKGYRYELWCLARGTGIRYCVLYCDVEETHCRK

Query:  WNEERREKGEANYDDNIFEDLVRRFERPDSRNRWDSPLFELFPHRDGIEKSSQVILDAVSYLTKTVDSKSRDVKILQPTIATQNSRFSEANSLYELDKAT
        WNEERREKGEA Y+D IFEDL RRFE+PD RNRWDSPL+EL+PH+DG+EKSS  I D VSYLTK VDSKSRDVKILQPTIATQN RFSEANSLYELD+AT
Subjt:  WNEERREKGEANYDDNIFEDLVRRFERPDSRNRWDSPLFELFPHRDGIEKSSQVILDAVSYLTKTVDSKSRDVKILQPTIATQNSRFSEANSLYELDKAT

Query:  QEVVNAIVEA--QALGGPLNSIPLGQELPTIYISRSVGLPELRRLRRTFIKLTGQTSLSGPPPPSDAESAKRMFIDYLNRELET
        QEV+NAIVEA  QA GGPL  I + Q LP+I ISRSVGLPELRRLRRTFIKLTGQTSLSG PPPSDA+SAKRMF+DYLNREL T
Subjt:  QEVVNAIVEA--QALGGPLNSIPLGQELPTIYISRSVGLPELRRLRRTFIKLTGQTSLSGPPPPSDAESAKRMFIDYLNRELET

A0A1R3HNR1 Uncharacterized protein1.6e-22761.03Show/hide
Query:  EDEEDPPLAVEISSTI-----TAAERLEDDGVSVGVTVITGYLGAGKSTLVNYILNSQHGKRIAVILNEFGEEIGVERAMINEGDGGALVEEWVELANGC
        ++EE+PPLA++I  T+     +++E+   D VSVGVTVITGYLGAGKSTLVNYILN+QHGKRIAVILNEFGEEIGVERAMINEG+ GALVEEWVELANGC
Subjt:  EDEEDPPLAVEISSTI-----TAAERLEDDGVSVGVTVITGYLGAGKSTLVNYILNSQHGKRIAVILNEFGEEIGVERAMINEGDGGALVEEWVELANGC

Query:  ICCTVKHNLVQALEQLVQRKERYSLAIISNFNGASVKDRLMIQSFLHVSRLDHILLETTGLANPAPLASVLWLDDQLESSIKLDSIIT------------
        +CCTVKH+LVQALEQLVQ K+                            RLDHILLETTGLANPAPLASVLWLDDQLESS+KLDSI+T            
Subjt:  ICCTVKHNLVQALEQLVQRKERYSLAIISNFNGASVKDRLMIQSFLHVSRLDHILLETTGLANPAPLASVLWLDDQLESSIKLDSIIT------------

Query:  ------------------DTIILNKVDLVSSELGDGALEDLEYEIHNINSLAKIVHSVRCQVDLSLILNCNSYNAANTAHLEALLKESRSLSTRDLHDTG
                          D ++LNKVDLVSSE  +GA+EDLE EI +INSLA I+ SVRCQVDLSL+LN  +Y+A + +HLEALL+ES+S+ ++DLHD+G
Subjt:  ------------------DTIILNKVDLVSSELGDGALEDLEYEIHNINSLAKIVHSVRCQVDLSLILNCNSYNAANTAHLEALLKESRSLSTRDLHDTG

Query:  VRTLCISDHDKVDLDKVHSWLEEILWEKKGGMDVYRCKGVLSIKNSDQLHTLQAVRELYEVVPSRQWNNGESQTNKIVFI--------------------
        VRTLCI+  + VDL+KV  W+EEILWEKK  +DVYRCKGVLSI+NSD LHTLQAVRE+YE+VP+RQW   E Q N+IVFI                    
Subjt:  VRTLCISDHDKVDLDKVHSWLEEILWEKKGGMDVYRCKGVLSIKNSDQLHTLQAVRELYEVVPSRQWNNGESQTNKIVFI--------------------

Query:  ------------------------------------GRNLNEDVLSETFRACSSVP---PLSLSSLVSPPPPFFRSCRRSIPPNGSALIFHSIVVTDSLN
                                               LNE    +T R           + S    P     R   RS      ++   +I++ DSLN
Subjt:  ------------------------------------GRNLNEDVLSETFRACSSVP---PLSLSSLVSPPPPFFRSCRRSIPPNGSALIFHSIVVTDSLN

Query:  SIKGYRYELWCLARGTGIRYCVLYCDVEETHCRKWNEERREKGEANYDDNIFEDLVRRFERPDSRNRWDSPLFELFPHRDGIEKSSQVILDAVSYLTKTV
        SIKGYRYELWCLAR  G+RYCVLYCDVEE  CRKWNEERREKGEA Y+D IFEDL RRFE PD RNRWDSPL+EL+PH+DG+EKSS  I DAVSYLTK V
Subjt:  SIKGYRYELWCLARGTGIRYCVLYCDVEETHCRKWNEERREKGEANYDDNIFEDLVRRFERPDSRNRWDSPLFELFPHRDGIEKSSQVILDAVSYLTKTV

Query:  DSKSRDVKILQPTIATQNSRFSEANSLYELDKATQEVVNAIVEA--QALGGPLNSIPLGQELPTIYISRSVGLPELRRLRRTFIKLTGQTSLSGPPPPSD
        DSKSRDVKILQPTIATQN+RFSEANSLYE+D+ATQEV+NAIVEA  QA+GGPL  I +GQ LPTI ISRSVGLPELRRLRRTFIKLTGQTSLSG PPPSD
Subjt:  DSKSRDVKILQPTIATQNSRFSEANSLYELDKATQEVVNAIVEA--QALGGPLNSIPLGQELPTIYISRSVGLPELRRLRRTFIKLTGQTSLSGPPPPSD

Query:  AESAKRMFIDYLNREL
        AESAKRMF+DYLNREL
Subjt:  AESAKRMFIDYLNREL

A0A1R3JEZ9 Uncharacterized protein6.5e-22961.73Show/hide
Query:  EDEEDPPLAVEISSTI-----TAAERLEDDGVSVGVTVITGYLGAGKSTLVNYILNSQHGKRIAVILNEFGEEIGVERAMINEGDGGALVEEWVELANGC
        ++EE+PPLA++I  T+     +++E+   D VSVGVTVITGYLGAGKSTLVNYILN+QHGKRIAVILNEFGEEIGVERAMINEG+ GALVEEWVELANGC
Subjt:  EDEEDPPLAVEISSTI-----TAAERLEDDGVSVGVTVITGYLGAGKSTLVNYILNSQHGKRIAVILNEFGEEIGVERAMINEGDGGALVEEWVELANGC

Query:  ICCTVKHNLVQALEQLVQRKERYSLAIISNFNGASVKDRLMIQSFLHVSRLDHILLETTGLANPAPLASVLWLDDQLESSIKLDSIIT------------
        +CCTVKH+LVQALEQLVQ K+                            RLDHILLETTGLANPAPLASVLWLDDQLESS+KLDSI+T            
Subjt:  ICCTVKHNLVQALEQLVQRKERYSLAIISNFNGASVKDRLMIQSFLHVSRLDHILLETTGLANPAPLASVLWLDDQLESSIKLDSIIT------------

Query:  ------------------DTIILNKVDLVSSELGDGALEDLEYEIHNINSLAKIVHSVRCQVDLSLILNCNSYNAANTAHLEALLKESRSLSTRDLHDTG
                          D +ILNKVDLVSSE  +GA+EDLE EI +INSLA IV SVRCQVDLSLILN  +Y+A + +HLEALL+ES+S+ ++DLHD+G
Subjt:  ------------------DTIILNKVDLVSSELGDGALEDLEYEIHNINSLAKIVHSVRCQVDLSLILNCNSYNAANTAHLEALLKESRSLSTRDLHDTG

Query:  VRTLCISDHDKVDLDKVHSWLEEILWEKKGGMDVYRCKGVLSIKNSDQLHTLQAVRELYEVVPSRQWNNGESQTNKIVFI--------------------
        VRTLCI+  + VDL+KV  W+EEILWEKK GMDVYRCKGVLSI+NSD LHTLQAVRE+YE+VP+RQW   E Q N+IVFI                    
Subjt:  VRTLCISDHDKVDLDKVHSWLEEILWEKKGGMDVYRCKGVLSIKNSDQLHTLQAVRELYEVVPSRQWNNGESQTNKIVFI--------------------

Query:  ------------------------------------GRNLNEDVLSETFRACSSVP---PLSLSSLVSPPPPFFRSCRRSIPPNGSALIFHSIVVTDSLN
                                               LNE    +T R           + S    P     R   RS      ++   +I++ DSLN
Subjt:  ------------------------------------GRNLNEDVLSETFRACSSVP---PLSLSSLVSPPPPFFRSCRRSIPPNGSALIFHSIVVTDSLN

Query:  SIKGYRYELWCLARGTGIRYCVLYCDVEETHCRKWNEERREKGEANYDDNIFEDLVRRFERPDSRNRWDSPLFELFPHRDGIEKSSQVILDAVSYLTKTV
        SIKGYRYELWCLAR  G+RYCVLYCDVEE  CRKWNEERREKGEA Y+D IFEDL RRFE PD RNRWDSPL+EL+PH+DG+EKSS  I DAVSYLTK V
Subjt:  SIKGYRYELWCLARGTGIRYCVLYCDVEETHCRKWNEERREKGEANYDDNIFEDLVRRFERPDSRNRWDSPLFELFPHRDGIEKSSQVILDAVSYLTKTV

Query:  DSKSRDVKILQPTIATQNSRFSEANSLYELDKATQEVVNAIVEA--QALGGPLNSIPLGQELPTIYISRSVGLPELRRLRRTFIKLTGQTSLSGPPPPSD
        DSKSRDVKILQPTIATQN+RFSEANSLYE+D+ATQEV+NAIVEA  QA+GGPL  I +GQ LPTI ISRSVGLPELRRLRRTFIKLTGQTSLSG PPPSD
Subjt:  DSKSRDVKILQPTIATQNSRFSEANSLYELDKATQEVVNAIVEA--QALGGPLNSIPLGQELPTIYISRSVGLPELRRLRRTFIKLTGQTSLSGPPPPSD

Query:  AESAKRMFIDYLNREL
        AES KRMFIDYLNREL
Subjt:  AESAKRMFIDYLNREL

A0A6J1BN98 LOW QUALITY PROTEIN: COBW domain-containing protein 11.7e-23263.68Show/hide
Query:  EDEEDPPLAVEISSTI-----TAAERLEDDGVSVGVTVITGYLGAGKSTLVNYILNSQHGKRIAVILNEFGEEIGVERAMINEGDGGALVEEWVELANGC
        ++EE+PPLA++I   +     +  E+ +DD VSVGVTVITGYLGAGKSTLVNYILN+QHGKRIAVILNEFGEEIGVERAMINEG+ GALVEEWVELANGC
Subjt:  EDEEDPPLAVEISSTI-----TAAERLEDDGVSVGVTVITGYLGAGKSTLVNYILNSQHGKRIAVILNEFGEEIGVERAMINEGDGGALVEEWVELANGC

Query:  ICCTVKHNLVQALEQLVQRKERYSLAIISNFNGASVKDRLMIQSFLHVSRLDHILLETTGLANPAPLASVLWLDDQLESSIKLDSIIT------------
        +CCTVKH+LVQALEQLVQ K+                            RLDHILLETTGLANPAPLASVLWLDDQLESS+KLDSI+T            
Subjt:  ICCTVKHNLVQALEQLVQRKERYSLAIISNFNGASVKDRLMIQSFLHVSRLDHILLETTGLANPAPLASVLWLDDQLESSIKLDSIIT------------

Query:  ------------------DTIILNKVDLVSSELGDGALEDLEYEIHNINSLAKIVHSVRCQVDLSLILNCNSYNAANTAHLEALLKESRSLSTRDLHDTG
                          D +ILNKVDLVS E  +GA+E+LE EIH+INSLA ++HSVRCQVDLSL+LN  +Y+A +  HLEALL+ES+S+ +RDLHD+G
Subjt:  ------------------DTIILNKVDLVSSELGDGALEDLEYEIHNINSLAKIVHSVRCQVDLSLILNCNSYNAANTAHLEALLKESRSLSTRDLHDTG

Query:  VRTLCISDHDKVDLDKVHSWLEEILWEKKGGMDVYRCKGVLSIKNSDQLHTLQAVRELYEVVPSRQWNNGESQTNKIVFIGRNLNEDVLSETFRA-----
        VRTLCI+  + +DL KV  W+EEILW+KK GMDVYRCKGVL I+NSD LHTLQAVRE+YE+VP+RQW   + Q N+IVFIG NL+E++L+++F       
Subjt:  VRTLCISDHDKVDLDKVHSWLEEILWEKKGGMDVYRCKGVLSIKNSDQLHTLQAVRELYEVVPSRQWNNGESQTNKIVFIGRNLNEDVLSETFRA-----

Query:  -----CSSVPPLSL------SSLVSPPPPFFRSCRRSIPPNGSA------LIFHSIVVTD----------SLNSIKGYRYELWCLARGTGIRYCVLYCDV
             C ++  L+L      S  +SP     +S   S+ P+ +A      LIF    ++D              IKGYRYELWCLAR  GIRYCVLYCD 
Subjt:  -----CSSVPPLSL------SSLVSPPPPFFRSCRRSIPPNGSA------LIFHSIVVTD----------SLNSIKGYRYELWCLARGTGIRYCVLYCDV

Query:  EETHCRKWNEERREKGEANYDDNIFEDLVRRFERPDSRNRWDSPLFELFPHRDGIEKSSQVILDAVSYLTKTVDSKSRDVKILQPTIATQNSRFSEANSL
        EE  CRKWNEERREKGEA Y+D IFEDLVRRFE+PD RNRWDSPL+EL+PH DG+EKSS  I DAVSYLTK VDSKSRDVKILQPTIATQN RFSEANSL
Subjt:  EETHCRKWNEERREKGEANYDDNIFEDLVRRFERPDSRNRWDSPLFELFPHRDGIEKSSQVILDAVSYLTKTVDSKSRDVKILQPTIATQNSRFSEANSL

Query:  YELDKATQEVVNAIVEA--QALGGPLNSIPLGQELPTIYISRSVGLPELRRLRRTFIKLTGQTSLSGPPPPSDAESAKRMFIDYLNRELET
        YELD+ATQEV+NAIVEA  QA GGPL  I +GQ LP+I +SRSVGLPELRRLRRTFIKLTGQTSLSG PPPSDA+SAKRMF+DYLNREL T
Subjt:  YELDKATQEVVNAIVEA--QALGGPLNSIPLGQELPTIYISRSVGLPELRRLRRTFIKLTGQTSLSGPPPPSDAESAKRMFIDYLNRELET

F6HF74 CobW C-terminal domain-containing protein3.2e-22856.55Show/hide
Query:  EDEEDPPLAVEISSTITAAERLEDDGVSVGVTVITGYLGAGKSTLVNYILNSQHGKRIAVILNEFGEEIGVERAMINEGDGGALVEEWVELANGCICCTV
        E++EDPPLAVEI+   +   + ++D VSVGVTVITGYLGAGKST VN+ILNSQHGKRIAVILNEFGEEIGVERAMINEGD GALVEEWVELANGCICCTV
Subjt:  EDEEDPPLAVEISSTITAAERLEDDGVSVGVTVITGYLGAGKSTLVNYILNSQHGKRIAVILNEFGEEIGVERAMINEGDGGALVEEWVELANGCICCTV

Query:  KHNLVQALEQLVQRKERYSLAIISNFNGASVKDRLMIQSFLHVSRLDHILLETTGLANPAPLASVLWLDDQLESSIKLDSIIT-----------------
        KH+LVQALEQLVQ KE                            RLDHILLETTGLANPAPLASVLWLDDQLESS++LDSIIT                 
Subjt:  KHNLVQALEQLVQRKERYSLAIISNFNGASVKDRLMIQSFLHVSRLDHILLETTGLANPAPLASVLWLDDQLESSIKLDSIIT-----------------

Query:  -------------DTIILNKVDLVSSELGDGALEDLEYEIHNINSLAKIVHSVRCQVDLSLILNCNSYNAANTAHLEALLKESRSLSTRDLHDTGVRTLC
                     D +ILNKVDLVS E   G LE+LE EIHNINSLA I+HSVRCQVDLS IL+C +Y+A +  HLEALL+E++SLS++DLHD+ VRTLC
Subjt:  -------------DTIILNKVDLVSSELGDGALEDLEYEIHNINSLAKIVHSVRCQVDLSLILNCNSYNAANTAHLEALLKESRSLSTRDLHDTGVRTLC

Query:  ISDHDKVDLDKVHSWLEEILWEKKGGMDVYRCKGVLSIKNSDQLHTLQAVRELYEVVPSRQWNNGESQTNKIVFIGRNLNEDVLSETFRACSSV------
        IS+   VDLDKV  WLEEILW+KK  MDVYRCKGVL + +SDQLHTLQAVRE+YE+VP+R+W N E+Q NKIVFIG NLNED L+ +FRAC S+      
Subjt:  ISDHDKVDLDKVHSWLEEILWEKKGGMDVYRCKGVLSIKNSDQLHTLQAVRELYEVVPSRQWNNGESQTNKIVFIGRNLNEDVLSETFRACSSV------

Query:  --------------------------------PPLSLSSLVS--PPPPFFRS------------------------------------------------
                                        P ++  S++S  P P +  S                                                
Subjt:  --------------------------------PPLSLSSLVS--PPPPFFRS------------------------------------------------

Query:  -CRRSIPPNGSALI----------------------FH-------------------------------SIVVTDSLNSIKGYRYELWCLARGTGIRYCV
         C +      +A +                      FH                               SI++ DSLNSIKGYRYELWCLAR  GIRYCV
Subjt:  -CRRSIPPNGSALI----------------------FH-------------------------------SIVVTDSLNSIKGYRYELWCLARGTGIRYCV

Query:  LYCDVEETHCRKWNEERREKGEANYDDNIFEDLVRRFERPDSRNRWDSPLFELFPHRDGIEKSSQVILDAVSYLTKTVDSKSRDVKILQPTIATQNSRFS
        L+CDVEE+HCR WNE+R E GEA+YDD IFEDLVRRFE+PD RNRWDSPLFEL+P RDGIEKSS  I DAVSYLTK  DSK+RD+KILQPTIAT+++RFS
Subjt:  LYCDVEETHCRKWNEERREKGEANYDDNIFEDLVRRFERPDSRNRWDSPLFELFPHRDGIEKSSQVILDAVSYLTKTVDSKSRDVKILQPTIATQNSRFS

Query:  EANSLYELDKATQEVVNAIVEA--QALGGPLNSIPLGQELPTIYISRSVGLPELRRLRRTFIKLTGQTSLSGPPPPSDAESAKRMFIDYLNREL
        EANSLYE+D+ATQE++NAIVEA  QA+GGP+N I LGQ L  I IS+SVGLPELRRLRRTFIKLTGQTSLSGPPPPSD++SAKRMF+DYLNREL
Subjt:  EANSLYELDKATQEVVNAIVEA--QALGGPLNSIPLGQELPTIYISRSVGLPELRRLRRTFIKLTGQTSLSGPPPPSDAESAKRMFIDYLNREL

SwissProt top hitse value%identityAlignment
B8BK80 Protein KTI12 homolog1.8e-9574.45Show/hide
Query:  SIVVTDSLNSIKGYRYELWCLARGTGIRYCVLYCDVEETHCRKWNEERREKGEANYDDNIFEDLVRRFERPDSRNRWDSPLFELFPHRDGIEKSSQVILD
        SI+V DSLN+IKGYRYELWCLAR +GIRYCVL+CD E  HCR+WN +R+EKGE  YD+NIF+DLV RFE+PD RNRWDSPLFELFP RDG+ +SS VI +
Subjt:  SIVVTDSLNSIKGYRYELWCLARGTGIRYCVLYCDVEETHCRKWNEERREKGEANYDDNIFEDLVRRFERPDSRNRWDSPLFELFPHRDGIEKSSQVILD

Query:  AVSYLTKTVDSKSRDVKILQPTIATQNSRFSEANSLYELDKATQEVVNAIVEAQA--LGGPLNSIPLGQELPTIYISRSVGLPELRRLRRTFIKLTGQTS
        AVSYLTK VDSK+RDVK+LQPTIATQ +R +EANSLYE+DKATQEV+NAIVEAQ+  LG P+N I LG +LPTI + RSVGLPELR LRRTFIKL GQ S
Subjt:  AVSYLTKTVDSKSRDVKILQPTIATQNSRFSEANSLYELDKATQEVVNAIVEAQA--LGGPLNSIPLGQELPTIYISRSVGLPELRRLRRTFIKLTGQTS

Query:  LSGPPPPSDAESAKRMFIDYLNRELET
        LSGPPPP+DA+SA RMF+DYLNRE+ +
Subjt:  LSGPPPPSDAESAKRMFIDYLNRELET

B9GAG9 Protein KTI12 homolog1.8e-9574.45Show/hide
Query:  SIVVTDSLNSIKGYRYELWCLARGTGIRYCVLYCDVEETHCRKWNEERREKGEANYDDNIFEDLVRRFERPDSRNRWDSPLFELFPHRDGIEKSSQVILD
        SI+V DSLN+IKGYRYELWCLAR +GIRYCVL+CD E  HCR+WN +R+EKGE  YD+NIF+DLV RFE+PD RNRWDSPLFELFP RDG+ +SS VI +
Subjt:  SIVVTDSLNSIKGYRYELWCLARGTGIRYCVLYCDVEETHCRKWNEERREKGEANYDDNIFEDLVRRFERPDSRNRWDSPLFELFPHRDGIEKSSQVILD

Query:  AVSYLTKTVDSKSRDVKILQPTIATQNSRFSEANSLYELDKATQEVVNAIVEAQA--LGGPLNSIPLGQELPTIYISRSVGLPELRRLRRTFIKLTGQTS
        AVSYLTK VDSK+RDVK+LQPTIATQ +R +EANSLYE+DKATQEV+NAIVEAQ+  LG P+N I LG +LPTI + RSVGLPELR LRRTFIKL GQ S
Subjt:  AVSYLTKTVDSKSRDVKILQPTIATQNSRFSEANSLYELDKATQEVVNAIVEAQA--LGGPLNSIPLGQELPTIYISRSVGLPELRRLRRTFIKLTGQTS

Query:  LSGPPPPSDAESAKRMFIDYLNRELET
        LSGPPPP+DA+SA RMF+DYLNRE+ +
Subjt:  LSGPPPPSDAESAKRMFIDYLNRELET

Q8IUF1 COBW domain-containing protein 29.2e-5535.16Show/hide
Query:  EMED--EEDPPLAVEISSTITAAERLEDDGVSVGVTVITGYLGAGKSTLVNYILNSQHGKRIAVILNEFGEEIGVERAMINEGDGGALVEEWVELANGCI
        E ED  EED P  V + +T +  E     G  + VT+ITGYLGAGK+TL+NYIL  QH KR+AVILNEFGE   +E+++     GG L EEW+EL NGC+
Subjt:  EMED--EEDPPLAVEISSTITAAERLEDDGVSVGVTVITGYLGAGKSTLVNYILNSQHGKRIAVILNEFGEEIGVERAMINEGDGGALVEEWVELANGCI

Query:  CCTVKHNLVQALEQLVQRKERYSLAIISNFNGASVKDRLMIQSFLHVSRLDHILLETTGLANPAPLASVLWLDDQLESSIKLDSIIT-------------
        CC+VK N ++A+E L+Q+K ++                            D+ILLETTGLA+P  +AS+ W+D +L S I LD IIT             
Subjt:  CCTVKHNLVQALEQLVQRKERYSLAIISNFNGASVKDRLMIQSFLHVSRLDHILLETTGLANPAPLASVLWLDDQLESSIKLDSIIT-------------

Query:  -----------------DTIILNKVDLVSSELGDGALEDLEYEIHNINSLAKIVHSVRCQVDLSLILNCNSYNAANTAHLEALLKESRSLSTRDLHDTGV
                         D I++NK DLV  E     ++ L   I +IN L +I+ + R +VDLS +L+ +++++ +   L+  L+      T+   D  +
Subjt:  -----------------DTIILNKVDLVSSELGDGALEDLEYEIHNINSLAKIVHSVRCQVDLSLILNCNSYNAANTAHLEALLKESRSLSTRDLHDTGV

Query:  RTLCISDHDKVDLDKVHSWLEEILWEK------KGGMDVYRCKGVLSIKNSDQLHTLQAVRELYEVVPSR-QWNNGESQTNKIVFIGRNLNEDVLSETFR
         T+          + ++ +++ +LWEK         M+V R KG++SIK+  Q   +Q V ELY++  +   W +   +TN++V +GRNL++D+L + F 
Subjt:  RTLCISDHDKVDLDKVHSWLEEILWEK------KGGMDVYRCKGVLSIKNSDQLHTLQAVRELYEVVPSR-QWNNGESQTNKIVFIGRNLNEDVLSETFR

Query:  A
        A
Subjt:  A

Q99MB4 COBW domain-containing protein 12.4e-5536.02Show/hide
Query:  EMEDEEDPPLAVEISSTITAAERLEDDGVSVGVTVITGYLGAGKSTLVNYILNSQHGKRIAVILNEFGEEIGVERAMINEGDGGALVEEWVELANGCICC
        E E  ED P  V I +     E   D  + + VT++TGYLGAGK+TL+NYIL  QH ++IAVILNEFGE   VE+++     GG L EEW+EL NGC+CC
Subjt:  EMEDEEDPPLAVEISSTITAAERLEDDGVSVGVTVITGYLGAGKSTLVNYILNSQHGKRIAVILNEFGEEIGVERAMINEGDGGALVEEWVELANGCICC

Query:  TVKHNLVQALEQLVQRKERYSLAIISNFNGASVKDRLMIQSFLHVSRLDHILLETTGLANPAPLASVLWLDDQLESSIKLDSIIT---------------
        +VK N ++A+E L+Q+K ++                            D+ILLETTGLA+P  +AS+ W+D +L S I LD IIT               
Subjt:  TVKHNLVQALEQLVQRKERYSLAIISNFNGASVKDRLMIQSFLHVSRLDHILLETTGLANPAPLASVLWLDDQLESSIKLDSIIT---------------

Query:  ---------------DTIILNKVDLVSSELGDGALEDLEYEIHNINSLAKIVHSVRCQVDLSLILNCNSYNAANTAHLEALLKESRSLSTRDLHDTGVRT
                       D I++NK DLVS E     L  L   I +IN L K++ + R +  LS IL+ ++Y+  +   L+   K+ + +ST    D  + T
Subjt:  ---------------DTIILNKVDLVSSELGDGALEDLEYEIHNINSLAKIVHSVRCQVDLSLILNCNSYNAANTAHLEALLKESRSLSTRDLHDTGVRT

Query:  LCISDHDKVDLDKVHSWLEEILWEK----KGG--MDVYRCKGVLSIKNSDQLHTLQAVRELYEVVPSR-QWNNGESQTNKIVFIGRNLNEDVLSETF
        +        + + ++ +++ +LWEK    K G  M+V R KG++SIK+  Q   +Q + ELYE+  SR  W +   +  ++VFIG+NL++D+L + F
Subjt:  LCISDHDKVDLDKVHSWLEEILWEK----KGG--MDVYRCKGVLSIKNSDQLHTLQAVRELYEVVPSR-QWNNGESQTNKIVFIGRNLNEDVLSETF

Q9LMH0 Protein KTI12 homolog2.7e-9473.3Show/hide
Query:  IVVTDSLNSIKGYRYELWCLARGTGIRYCVLYCDVEETHCRKWNEERREKGEANYDDNIFEDLVRRFERPDSRNRWDSPLFELFPHRDGIEKSSQVILDA
        IV+ DSLNSIKGYRYELWC+AR  GIRYCV+YCDV+E HCR+WN+ER ++GE  YDD IFEDLVRRFE+P+ RNRWDSPLFEL+P R+ I+KSS VIL+A
Subjt:  IVVTDSLNSIKGYRYELWCLARGTGIRYCVLYCDVEETHCRKWNEERREKGEANYDDNIFEDLVRRFERPDSRNRWDSPLFELFPHRDGIEKSSQVILDA

Query:  VSYLTKTVDSKSRDVKILQPTIATQNSRFSEANSLYELDKATQEVVNAIVEAQALGGPLNSIPLGQELPTIYISRSVGLPELRRLRRTFIKLTGQTSLSG
        V+YLTKTVDSK++DV+ILQP+IATQ +RFSEANSLYELD+ATQE++NAIVE Q+LG  ++ + LG ELP I I R +GLPELRRLRRTF+KL GQ+SLSG
Subjt:  VSYLTKTVDSKSRDVKILQPTIATQNSRFSEANSLYELDKATQEVVNAIVEAQALGGPLNSIPLGQELPTIYISRSVGLPELRRLRRTFIKLTGQTSLSG

Query:  PPPPSDAESAKRMFIDYLNRE
        PP P+DA+SAKR F+DYLNRE
Subjt:  PPPPSDAESAKRMFIDYLNRE

Arabidopsis top hitse value%identityAlignment
AT1G13870.1 calmodulin binding;purine nucleotide binding1.9e-9573.3Show/hide
Query:  IVVTDSLNSIKGYRYELWCLARGTGIRYCVLYCDVEETHCRKWNEERREKGEANYDDNIFEDLVRRFERPDSRNRWDSPLFELFPHRDGIEKSSQVILDA
        IV+ DSLNSIKGYRYELWC+AR  GIRYCV+YCDV+E HCR+WN+ER ++GE  YDD IFEDLVRRFE+P+ RNRWDSPLFEL+P R+ I+KSS VIL+A
Subjt:  IVVTDSLNSIKGYRYELWCLARGTGIRYCVLYCDVEETHCRKWNEERREKGEANYDDNIFEDLVRRFERPDSRNRWDSPLFELFPHRDGIEKSSQVILDA

Query:  VSYLTKTVDSKSRDVKILQPTIATQNSRFSEANSLYELDKATQEVVNAIVEAQALGGPLNSIPLGQELPTIYISRSVGLPELRRLRRTFIKLTGQTSLSG
        V+YLTKTVDSK++DV+ILQP+IATQ +RFSEANSLYELD+ATQE++NAIVE Q+LG  ++ + LG ELP I I R +GLPELRRLRRTF+KL GQ+SLSG
Subjt:  VSYLTKTVDSKSRDVKILQPTIATQNSRFSEANSLYELDKATQEVVNAIVEAQALGGPLNSIPLGQELPTIYISRSVGLPELRRLRRTFIKLTGQTSLSG

Query:  PPPPSDAESAKRMFIDYLNRE
        PP P+DA+SAKR F+DYLNRE
Subjt:  PPPPSDAESAKRMFIDYLNRE

AT1G15730.1 Cobalamin biosynthesis CobW-like protein5.4e-5032.61Show/hide
Query:  DPPLAVEISSTI--TAAERLEDDGVSVGVTVITGYLGAGKSTLVNYILNSQHGKRIAVILNEFGEEIGVERAMINEGDGGALVEEWVELANGCICCTVKH
        D  LAV     I   A+E L D+   +  T+ITG+LG+GK+TL+N+IL   HGKRIAVI NEFG E+ ++ +++     GA  E+ + L NGC+CCTV+ 
Subjt:  DPPLAVEISSTI--TAAERLEDDGVSVGVTVITGYLGAGKSTLVNYILNSQHGKRIAVILNEFGEEIGVERAMINEGDGGALVEEWVELANGCICCTVKH

Query:  NLVQALEQLVQRKERYSLAIISNFNGASVKDRLMIQSFLHVSRLDHILLETTGLANPAPLASVLWLDDQLESSIKLDSIIT-------------------
        +LV+ + ++VQ K+                            R DHI++ETTGLANPAP+    + +D++ + +KLD ++T                   
Subjt:  NLVQALEQLVQRKERYSLAIISNFNGASVKDRLMIQSFLHVSRLDHILLETTGLANPAPLASVLWLDDQLESSIKLDSIIT-------------------

Query:  -----------DTIILNKVDLVSSELGDGALEDLEYEIHNINSLAKIVHSVRCQVDLSLILNCNSYNAAN-TAHLEALLKESRS----------------
                   D II+NK DLV    G+  L  +   I  INS+A +  +   +VDL  +L    ++     + +    KE R                 
Subjt:  -----------DTIILNKVDLVSSELGDGALEDLEYEIHNINSLAKIVHSVRCQVDLSLILNCNSYNAAN-TAHLEALLKESRS----------------

Query:  -----------LSTRDLHDTGVRTLCISDHDKVDLDKVHSWLEEILWEKKGGMDVYRCKGVLSIKNSDQLHTLQAVRELYEVVPSRQWNNGESQTNKIVF
                    S    HD GV ++ I     +DL+K + WL  +L+++    D+YR KG+LS+++ D+    Q V E++E  P R W   E++TNKIVF
Subjt:  -----------LSTRDLHDTGVRTLCISDHDKVDLDKVHSWLEEILWEKKGGMDVYRCKGVLSIKNSDQLHTLQAVRELYEVVPSRQWNNGESQTNKIVF

Query:  IGRNLNEDVLSETFRAC
        IG+NLN + L   FRAC
Subjt:  IGRNLNEDVLSETFRAC

AT1G26520.1 Cobalamin biosynthesis CobW-like protein1.9e-12459.44Show/hide
Query:  EDEEDPPLAVEISSTITAAE-RLEDDGVSVGVTVITGYLGAGKSTLVNYILNSQHGKRIAVILNEFGEEIGVERAMINEGDGGALVEEWVELANGCICCT
        +D+E+PP+AV+I   ++  +     D VSVGV+VITGYLGAGKSTLVNYILN +HGKRIAVILNEFGEEIGVERAMINEG+ GA+VEEWVELANGC+CCT
Subjt:  EDEEDPPLAVEISSTITAAE-RLEDDGVSVGVTVITGYLGAGKSTLVNYILNSQHGKRIAVILNEFGEEIGVERAMINEGDGGALVEEWVELANGCICCT

Query:  VKHNLVQALEQLVQRKERYSLAIISNFNGASVKDRLMIQSFLHVSRLDHILLETTGLANPAPLASVLWLDDQLESSIKLDSIIT----------------
        VKH+LVQALEQLVQRK+                            RLDHILLETTGLANPAPLAS+LWLDDQLES +KLD I+T                
Subjt:  VKHNLVQALEQLVQRKERYSLAIISNFNGASVKDRLMIQSFLHVSRLDHILLETTGLANPAPLASVLWLDDQLESSIKLDSIIT----------------

Query:  --------------DTIILNKVDLVSSELGDGALEDLEYEIHNINSLAKIVHSVRCQVDLSLILNCNSYNAANTAHLEALLKESRSLSTRDLHDTGVRTL
                      DTII+NKVDL+S E  D    +LE EIH+INSLA ++ SVRCQVDLS ILNC +Y++ + + LE+LL+ ++SL+T DLHD+GVRTL
Subjt:  --------------DTIILNKVDLVSSELGDGALEDLEYEIHNINSLAKIVHSVRCQVDLSLILNCNSYNAANTAHLEALLKESRSLSTRDLHDTGVRTL

Query:  CISDHDKVDLDKVHSWLEEILWEKKGGMDVYRCKGVLSIKNSDQLHTLQAVRELYEVVPSRQWNNGESQTNKIVFIGRNLNEDVLSETFRAC
        CIS+   ++LDKV  WLEEILW+KK  MDVYRCK VLSI+NSDQ+H LQAVR++YE+VP+R+W+  E++TNKIVFIG  L+E+VL    R C
Subjt:  CISDHDKVDLDKVHSWLEEILWEKKGGMDVYRCKGVLSIKNSDQLHTLQAVRELYEVVPSRQWNNGESQTNKIVFIGRNLNEDVLSETFRAC

AT1G80480.1 plastid transcriptionally active 172.5e-4730.46Show/hide
Query:  SSTITAAERLEDDGVS--------VGVTVITGYLGAGKSTLVNYILNSQHGKRIAVILNEFGEEIGVERAMINEGDGGALVEEWVELANGCICCTVKHNL
        SS   +A + ED  V+        +  T+ITG+LG+GK+TL+N+IL   HGKRIAVI NEFG E+ ++ +++     GA  E+ V L NGC+CCTV+ +L
Subjt:  SSTITAAERLEDDGVS--------VGVTVITGYLGAGKSTLVNYILNSQHGKRIAVILNEFGEEIGVERAMINEGDGGALVEEWVELANGCICCTVKHNL

Query:  VQALEQLVQRKERYSLAIISNFNGASVKDRLMIQSFLHVSRLDHILLETTGLANPAPLASVLWLDDQLESSIKLDSIIT---------------------
        V+ + +LV  K+                            + DHI++ETTGLANPAP+    + ++++ + +KLD ++T                     
Subjt:  VQALEQLVQRKERYSLAIISNFNGASVKDRLMIQSFLHVSRLDHILLETTGLANPAPLASVLWLDDQLESSIKLDSIIT---------------------

Query:  ---------DTIILNKVDLVSSELGDGALEDLEYEIHNINSLAKIVHSVRCQVDLSLILNCNSYNAANTAHLEALLKESRSLSTRD--------------
                 D II+NK DLV    G+  L  +   I  INS+A++  +    VDL  +L    +   +   +E+ + E       D              
Subjt:  ---------DTIILNKVDLVSSELGDGALEDLEYEIHNINSLAKIVHSVRCQVDLSLILNCNSYNAANTAHLEALLKESRSLSTRD--------------

Query:  ----------------LHDTGVRTLCISDHDKVDLDKVHSWLEEILWEKKGGMDVYRCKGVLSIKNSDQLHTLQAVRELYEVVPSRQWNNGESQTNKIVF
                         HD GV ++ I     +DL+K + WL  +L E+    D+YR KG+LS+   ++    Q V ++++  P R W   E + NKIVF
Subjt:  ----------------LHDTGVRTLCISDHDKVDLDKVHSWLEEILWEKKGGMDVYRCKGVLSIKNSDQLHTLQAVRELYEVVPSRQWNNGESQTNKIVF

Query:  IGRNLNEDVLSETFRAC
        IG+NLN + L + F+AC
Subjt:  IGRNLNEDVLSETFRAC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCTACCTCAACTAATTTCAGTCACAAAGCCGATGTATTAAAGCTCCATTGTCAAATTGACACTACCAAACCTAAATACACGGCTACGATTCTAGAAATGGAAGATGA
AGAAGATCCACCACTCGCTGTTGAGATCAGCTCAACGATTACGGCTGCCGAGCGGCTTGAAGACGATGGTGTTTCTGTTGGCGTTACAGTTATCACTGGATATCTTGGAG
CTGGAAAATCCACCCTGGTTAATTACATATTGAACTCGCAACACGGCAAGAGAATCGCTGTTATACTGAATGAGTTTGGGGAGGAAATTGGAGTTGAGAGAGCGATGATA
AATGAAGGTGATGGTGGGGCATTGGTTGAGGAATGGGTCGAATTAGCAAACGGATGTATTTGTTGCACTGTCAAGCATAACCTGGTTCAAGCCTTGGAGCAACTTGTGCA
GAGAAAAGAAAGGTATTCTCTTGCTATTATATCAAATTTCAATGGAGCTTCTGTGAAGGATAGGTTGATGATTCAGTCATTCTTGCATGTCTCTAGGCTTGATCATATTT
TGTTGGAAACTACGGGGTTAGCCAATCCTGCTCCACTTGCTTCGGTACTTTGGCTGGACGATCAATTGGAATCGTCGATCAAGCTTGATTCAATTATTACCGATACTATT
ATTCTTAACAAGGTCGACCTGGTCTCTTCAGAACTTGGCGATGGTGCACTGGAGGACCTGGAGTATGAAATACATAATATTAATTCCCTTGCCAAAATAGTTCACTCTGT
TCGGTGTCAAGTTGACCTGTCTCTCATACTAAACTGCAATTCCTATAATGCTGCTAATACAGCTCATTTGGAAGCATTGTTGAAAGAAAGTCGTTCTTTATCTACTAGAG
ATCTTCATGATACCGGTGTTCGAACGTTGTGCATTAGTGATCATGACAAAGTTGACCTTGATAAGGTTCATTCTTGGCTTGAGGAGATACTTTGGGAGAAAAAAGGTGGC
ATGGATGTCTATCGTTGCAAAGGAGTTTTAAGCATAAAAAATTCTGATCAACTACATACTTTGCAGGCAGTGAGGGAATTATACGAGGTGGTTCCTAGTCGCCAATGGAA
TAATGGAGAAAGTCAGACGAACAAGATAGTCTTCATAGGTCGTAATCTGAACGAGGATGTTCTCAGCGAAACCTTTAGAGCATGCAGCTCAGTTCCGCCGCTGTCTCTCT
CCAGTTTAGTGTCGCCGCCGCCGCCGTTCTTCCGCTCGTGCCGGCGCAGCATCCCCCCCAACGGATCTGCACTGATTTTCCACTCTATTGTTGTTACTGATTCTTTGAAC
AGTATAAAGGGTTATCGATACGAATTGTGGTGTTTGGCTCGTGGAACAGGAATCAGGTATTGTGTGCTGTACTGCGATGTTGAAGAAACCCATTGTAGGAAGTGGAATGA
AGAGCGGCGAGAGAAAGGAGAAGCGAACTATGATGATAATATATTTGAAGATCTGGTCAGAAGGTTTGAGAGACCAGATAGTAGAAATAGATGGGACTCGCCATTGTTTG
AGTTATTCCCACATAGAGATGGAATAGAGAAATCATCTCAGGTCATTCTAGATGCTGTATCATACCTGACTAAAACAGTGGACTCCAAGTCGCGAGATGTAAAAATTTTA
CAGCCAACAATTGCTACTCAGAACTCTCGATTTTCAGAGGCGAATTCTCTATATGAGTTGGACAAAGCAACACAAGAGGTGGTTAATGCTATTGTAGAAGCACAAGCTCT
GGGAGGGCCTCTGAATAGTATCCCTCTGGGTCAGGAATTACCAACCATCTACATTTCAAGGTCGGTCGGACTACCGGAGCTTCGGAGACTGCGGAGAACTTTCATTAAAC
TAACAGGGCAAACTAGTTTAAGCGGACCACCACCACCTTCTGATGCCGAGAGCGCCAAGAGGATGTTTATTGACTACCTGAATAGGGAACTGGAGACGAAACTAGATTAA
mRNA sequenceShow/hide mRNA sequence
ATGCCTACCTCAACTAATTTCAGTCACAAAGCCGATGTATTAAAGCTCCATTGTCAAATTGACACTACCAAACCTAAATACACGGCTACGATTCTAGAAATGGAAGATGA
AGAAGATCCACCACTCGCTGTTGAGATCAGCTCAACGATTACGGCTGCCGAGCGGCTTGAAGACGATGGTGTTTCTGTTGGCGTTACAGTTATCACTGGATATCTTGGAG
CTGGAAAATCCACCCTGGTTAATTACATATTGAACTCGCAACACGGCAAGAGAATCGCTGTTATACTGAATGAGTTTGGGGAGGAAATTGGAGTTGAGAGAGCGATGATA
AATGAAGGTGATGGTGGGGCATTGGTTGAGGAATGGGTCGAATTAGCAAACGGATGTATTTGTTGCACTGTCAAGCATAACCTGGTTCAAGCCTTGGAGCAACTTGTGCA
GAGAAAAGAAAGGTATTCTCTTGCTATTATATCAAATTTCAATGGAGCTTCTGTGAAGGATAGGTTGATGATTCAGTCATTCTTGCATGTCTCTAGGCTTGATCATATTT
TGTTGGAAACTACGGGGTTAGCCAATCCTGCTCCACTTGCTTCGGTACTTTGGCTGGACGATCAATTGGAATCGTCGATCAAGCTTGATTCAATTATTACCGATACTATT
ATTCTTAACAAGGTCGACCTGGTCTCTTCAGAACTTGGCGATGGTGCACTGGAGGACCTGGAGTATGAAATACATAATATTAATTCCCTTGCCAAAATAGTTCACTCTGT
TCGGTGTCAAGTTGACCTGTCTCTCATACTAAACTGCAATTCCTATAATGCTGCTAATACAGCTCATTTGGAAGCATTGTTGAAAGAAAGTCGTTCTTTATCTACTAGAG
ATCTTCATGATACCGGTGTTCGAACGTTGTGCATTAGTGATCATGACAAAGTTGACCTTGATAAGGTTCATTCTTGGCTTGAGGAGATACTTTGGGAGAAAAAAGGTGGC
ATGGATGTCTATCGTTGCAAAGGAGTTTTAAGCATAAAAAATTCTGATCAACTACATACTTTGCAGGCAGTGAGGGAATTATACGAGGTGGTTCCTAGTCGCCAATGGAA
TAATGGAGAAAGTCAGACGAACAAGATAGTCTTCATAGGTCGTAATCTGAACGAGGATGTTCTCAGCGAAACCTTTAGAGCATGCAGCTCAGTTCCGCCGCTGTCTCTCT
CCAGTTTAGTGTCGCCGCCGCCGCCGTTCTTCCGCTCGTGCCGGCGCAGCATCCCCCCCAACGGATCTGCACTGATTTTCCACTCTATTGTTGTTACTGATTCTTTGAAC
AGTATAAAGGGTTATCGATACGAATTGTGGTGTTTGGCTCGTGGAACAGGAATCAGGTATTGTGTGCTGTACTGCGATGTTGAAGAAACCCATTGTAGGAAGTGGAATGA
AGAGCGGCGAGAGAAAGGAGAAGCGAACTATGATGATAATATATTTGAAGATCTGGTCAGAAGGTTTGAGAGACCAGATAGTAGAAATAGATGGGACTCGCCATTGTTTG
AGTTATTCCCACATAGAGATGGAATAGAGAAATCATCTCAGGTCATTCTAGATGCTGTATCATACCTGACTAAAACAGTGGACTCCAAGTCGCGAGATGTAAAAATTTTA
CAGCCAACAATTGCTACTCAGAACTCTCGATTTTCAGAGGCGAATTCTCTATATGAGTTGGACAAAGCAACACAAGAGGTGGTTAATGCTATTGTAGAAGCACAAGCTCT
GGGAGGGCCTCTGAATAGTATCCCTCTGGGTCAGGAATTACCAACCATCTACATTTCAAGGTCGGTCGGACTACCGGAGCTTCGGAGACTGCGGAGAACTTTCATTAAAC
TAACAGGGCAAACTAGTTTAAGCGGACCACCACCACCTTCTGATGCCGAGAGCGCCAAGAGGATGTTTATTGACTACCTGAATAGGGAACTGGAGACGAAACTAGATTAA
Protein sequenceShow/hide protein sequence
MPTSTNFSHKADVLKLHCQIDTTKPKYTATILEMEDEEDPPLAVEISSTITAAERLEDDGVSVGVTVITGYLGAGKSTLVNYILNSQHGKRIAVILNEFGEEIGVERAMI
NEGDGGALVEEWVELANGCICCTVKHNLVQALEQLVQRKERYSLAIISNFNGASVKDRLMIQSFLHVSRLDHILLETTGLANPAPLASVLWLDDQLESSIKLDSIITDTI
ILNKVDLVSSELGDGALEDLEYEIHNINSLAKIVHSVRCQVDLSLILNCNSYNAANTAHLEALLKESRSLSTRDLHDTGVRTLCISDHDKVDLDKVHSWLEEILWEKKGG
MDVYRCKGVLSIKNSDQLHTLQAVRELYEVVPSRQWNNGESQTNKIVFIGRNLNEDVLSETFRACSSVPPLSLSSLVSPPPPFFRSCRRSIPPNGSALIFHSIVVTDSLN
SIKGYRYELWCLARGTGIRYCVLYCDVEETHCRKWNEERREKGEANYDDNIFEDLVRRFERPDSRNRWDSPLFELFPHRDGIEKSSQVILDAVSYLTKTVDSKSRDVKIL
QPTIATQNSRFSEANSLYELDKATQEVVNAIVEAQALGGPLNSIPLGQELPTIYISRSVGLPELRRLRRTFIKLTGQTSLSGPPPPSDAESAKRMFIDYLNRELETKLD