; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0019150 (gene) of Snake gourd v1 genome

Gene IDTan0019150
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionRibonuclease H-like domain containing protein
Genome locationLG07:69253486..69257380
RNA-Seq ExpressionTan0019150
SyntenyTan0019150
Gene Ontology termsNA
InterPro domainsIPR012337 - Ribonuclease H-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6606146.1 hypothetical protein SDJN03_03463, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0093.93Show/hide
Query:  MAATNASTPPAPPAVESGSSSADELTAKAVHKRYEGLLMVRTKAIKGKGAWYWAHLEPILVHNTDTGLPKAVKLRCSLCDAVFSASNPSRTASEHLKRGT
        MAATNASTPP P AVESGSSSADELTAKAVHKRYEGLL+VRTKAIKGKGAWYWAHLEPILVHNTDTGLPKAVKLRCSLCDAVFSASNPSRTASEHLKRGT
Subjt:  MAATNASTPPAPPAVESGSSSADELTAKAVHKRYEGLLMVRTKAIKGKGAWYWAHLEPILVHNTDTGLPKAVKLRCSLCDAVFSASNPSRTASEHLKRGT

Query:  CPNFNSLPKPISSVSPSSFL---PPSPPALLHNNRKRTSSAAAA-----GAGGSSYQVPPLAIVDPSRFCGELTYSQSVAAVAAAGGGGGGGGLLAQQPH
        CPNFNSLPKPISSVSPSSFL   P SPPAL H NRKRTSSAAAA     G+GGSSYQV PLAIVDPSRFCGEL YSQSV A AA+GGGG GGGLLAQQPH
Subjt:  CPNFNSLPKPISSVSPSSFL---PPSPPALLHNNRKRTSSAAAA-----GAGGSSYQVPPLAIVDPSRFCGELTYSQSVAAVAAAGGGGGGGGLLAQQPH

Query:  LMLSGGKEDLGALAMLEDSVKKLKSPKTSPGPTLSKTQIDCALDFLADWVYESGGSVSFSSLEHPKFRAFLNQVGLPAISRRDFTSSRLNLKFEEAKAES
        LMLSGGKEDLGALAMLEDSVKKLKSPKTSPGPTLSKTQIDCALDFLADWVYESGGSVSFSSLEHPKFRAFLNQVGLP+ISRR+FTSSRLNLKFEEAKAES
Subjt:  LMLSGGKEDLGALAMLEDSVKKLKSPKTSPGPTLSKTQIDCALDFLADWVYESGGSVSFSSLEHPKFRAFLNQVGLPAISRRDFTSSRLNLKFEEAKAES

Query:  EVKIRDAMFFQLASDGWKDKNYAVFGVDKLVNLTVNLPNGTSLYRRAVFVSGSVPSSYAQEILWETVADISGNVVQQCVGIVADKFKAKALKNLENQNTW
        EVKIRDAMFFQLASDGWKDKNYAVFGVDKLVNLTVNLPNGTSLYRRAVFVSGSVPSSYAQEILWETVADISGNVVQQCVGIVADKFKAKALKNLENQN W
Subjt:  EVKIRDAMFFQLASDGWKDKNYAVFGVDKLVNLTVNLPNGTSLYRRAVFVSGSVPSSYAQEILWETVADISGNVVQQCVGIVADKFKAKALKNLENQNTW

Query:  MVNLSCQFQGFSSLVKDFGKQLPLFKTVTEHCIKLANFVNYKSQIRNCFHKFQLQEYGHAALLRVPPRDHEKLNFGPVFTLMEDILNFSRALQLVVLDET
        MVNLSCQ QGFSSLVKDF KQLPLFKTVTEHCIKLANFVNYKSQ+RNCFHKFQLQEYGH ALLRVPPRDHEKLN GPVFTLM+DIL+FSRALQLVVLDET
Subjt:  MVNLSCQFQGFSSLVKDFGKQLPLFKTVTEHCIKLANFVNYKSQIRNCFHKFQLQEYGHAALLRVPPRDHEKLNFGPVFTLMEDILNFSRALQLVVLDET

Query:  CKIASMDDPVAREVAELIGDVGFWNELEAVHSLVKLVTEMALEIEKERPLVGQCLPLWDQLRAKVKDWCSKFQIAEGPVEKVIEKRFKKNYHPAWAASFI
        CKIASM+DP+AREVA+LIGDVGFWNELEAVHSLVKLVTEMA+EIEKERPLVGQCLPLWDQLRAKVKDWC KFQI EGPVEKVIEKRF KNYHPAWAA+FI
Subjt:  CKIASMDDPVAREVAELIGDVGFWNELEAVHSLVKLVTEMALEIEKERPLVGQCLPLWDQLRAKVKDWCSKFQIAEGPVEKVIEKRFKKNYHPAWAASFI

Query:  LDPLYLIRDTSGKYLPPFKCLTPDQEKDVDKLITRLVSREEAHIALMELMKWRTEGLDPVYARAVQMKERDPVTGKMRVANPQSSRLVWETYLTEFKSLG
        LDPLYLIRDTSGKYLPPFKCLTPDQEKDVDKLITRLVSREEAHIALMELMKWRTEGLDPVYARAVQMKERDPV+GKMRVANPQSSRLVWETYLTEFKSLG
Subjt:  LDPLYLIRDTSGKYLPPFKCLTPDQEKDVDKLITRLVSREEAHIALMELMKWRTEGLDPVYARAVQMKERDPVTGKMRVANPQSSRLVWETYLTEFKSLG

Query:  KVAVRLIFLHATSCGFKCNWSLLRWVSSHTHQKAGMERAQKLIFISAHSKLERRDFSSDEDKDAELFSLANAKE
        KVAVRLIFLHATSCGFKCNWS+LRWVSSHTHQ  GMERAQKLIFISAHSKLERRDFSSDEDKDAELFSL N ++
Subjt:  KVAVRLIFLHATSCGFKCNWSLLRWVSSHTHQKAGMERAQKLIFISAHSKLERRDFSSDEDKDAELFSLANAKE

XP_022958369.1 uncharacterized protein LOC111459613 isoform X1 [Cucurbita moschata]0.0e+0093.92Show/hide
Query:  MAATNASTPPAPPAVESGSSSADELTAKAVHKRYEGLLMVRTKAIKGKGAWYWAHLEPILVHNTDTGLPKAVKLRCSLCDAVFSASNPSRTASEHLKRGT
        MAATNASTPP P AVESGSSSADELTAKAVHKRYEGLL+VRTKAIKGKGAWYWAHLEPILVHNTDTGLPKAVKLRCSLCDAVFSASNPSRTASEHLKRGT
Subjt:  MAATNASTPPAPPAVESGSSSADELTAKAVHKRYEGLLMVRTKAIKGKGAWYWAHLEPILVHNTDTGLPKAVKLRCSLCDAVFSASNPSRTASEHLKRGT

Query:  CPNFNSLPKPISSVSPSSFL---PPSPPALLHNNRKRTSSAAAA----GAGGSSYQVPPLAIVDPSRFCGELTYSQSVAAVAAAGGGGGGGGLLAQQPHL
        CPNFNSLPKPISSVSPSSFL   P SPPAL H NRKRTSSAAAA    G+GGSSYQV PLAIVDPSRFCGEL YSQSV A AA+GGGG GGGLLAQQPHL
Subjt:  CPNFNSLPKPISSVSPSSFL---PPSPPALLHNNRKRTSSAAAA----GAGGSSYQVPPLAIVDPSRFCGELTYSQSVAAVAAAGGGGGGGGLLAQQPHL

Query:  MLSGGKEDLGALAMLEDSVKKLKSPKTSPGPTLSKTQIDCALDFLADWVYESGGSVSFSSLEHPKFRAFLNQVGLPAISRRDFTSSRLNLKFEEAKAESE
        MLSGGKEDLGALAMLEDSVKKLKSPKTSPGPTLSKTQIDCALDFLADWVYESGGSVSFSSLEHPKFRAFLNQVGLP++SRR+FTSSRLNLKFEEAKAESE
Subjt:  MLSGGKEDLGALAMLEDSVKKLKSPKTSPGPTLSKTQIDCALDFLADWVYESGGSVSFSSLEHPKFRAFLNQVGLPAISRRDFTSSRLNLKFEEAKAESE

Query:  VKIRDAMFFQLASDGWKDKNYAVFGVDKLVNLTVNLPNGTSLYRRAVFVSGSVPSSYAQEILWETVADISGNVVQQCVGIVADKFKAKALKNLENQNTWM
        VKIRDAMFFQLASDGWKDKNYAVFGVDKLVNLTVNLPNGTSLYRRAVFVSGSVPSSYAQEILWETVADISGNVVQQCVGIVADKFKAKALKNLENQN WM
Subjt:  VKIRDAMFFQLASDGWKDKNYAVFGVDKLVNLTVNLPNGTSLYRRAVFVSGSVPSSYAQEILWETVADISGNVVQQCVGIVADKFKAKALKNLENQNTWM

Query:  VNLSCQFQGFSSLVKDFGKQLPLFKTVTEHCIKLANFVNYKSQIRNCFHKFQLQEYGHAALLRVPPRDHEKLNFGPVFTLMEDILNFSRALQLVVLDETC
        VNLSCQ QGFSSLVKDF KQLPLFKTVTEHCIKLANFVNYKSQ+RNCFHKFQLQEYGHAALLRVPPRDHEKLN GPVFTLM+DIL+FSRALQLVVLDETC
Subjt:  VNLSCQFQGFSSLVKDFGKQLPLFKTVTEHCIKLANFVNYKSQIRNCFHKFQLQEYGHAALLRVPPRDHEKLNFGPVFTLMEDILNFSRALQLVVLDETC

Query:  KIASMDDPVAREVAELIGDVGFWNELEAVHSLVKLVTEMALEIEKERPLVGQCLPLWDQLRAKVKDWCSKFQIAEGPVEKVIEKRFKKNYHPAWAASFIL
        KIASM+DP+AREVA+LIGDVGFWNELEAVHSLVKLVTEMA+EIEK+RPLVGQCLPLWDQLRAKVKDWC KFQI EGPVEKVIEKRF KNYHPAWAA+FIL
Subjt:  KIASMDDPVAREVAELIGDVGFWNELEAVHSLVKLVTEMALEIEKERPLVGQCLPLWDQLRAKVKDWCSKFQIAEGPVEKVIEKRFKKNYHPAWAASFIL

Query:  DPLYLIRDTSGKYLPPFKCLTPDQEKDVDKLITRLVSREEAHIALMELMKWRTEGLDPVYARAVQMKERDPVTGKMRVANPQSSRLVWETYLTEFKSLGK
        DPLYLIRDTSGKYLPPFKCLTPDQEKDVDKLITRLVSREEAHIALMELMKWRTEGLDPVYARAVQMKERDPV+GKMRVANPQSSRLVWETYLTEFKSLGK
Subjt:  DPLYLIRDTSGKYLPPFKCLTPDQEKDVDKLITRLVSREEAHIALMELMKWRTEGLDPVYARAVQMKERDPVTGKMRVANPQSSRLVWETYLTEFKSLGK

Query:  VAVRLIFLHATSCGFKCNWSLLRWVSSHTHQKAGMERAQKLIFISAHSKLERRDFSSDEDKDAELFSLANAKE
        VAVRLIFLHATSCGFKCNWS+LRWVSSHTHQ  GMERAQKLIFISAHSKLERRDFSSDEDKDAELFSL N ++
Subjt:  VAVRLIFLHATSCGFKCNWSLLRWVSSHTHQKAGMERAQKLIFISAHSKLERRDFSSDEDKDAELFSLANAKE

XP_022958370.1 uncharacterized protein LOC111459613 isoform X2 [Cucurbita moschata]0.0e+0094.29Show/hide
Query:  MAATNASTPPAPPAVESGSSSADELTAKAVHKRYEGLLMVRTKAIKGKGAWYWAHLEPILVHNTDTGLPKAVKLRCSLCDAVFSASNPSRTASEHLKRGT
        MAATNASTPP P AVESGSSSADELTAKAVHKRYEGLL+VRTKAIKGKGAWYWAHLEPILVHNTDTGLPKAVKLRCSLCDAVFSASNPSRTASEHLKRGT
Subjt:  MAATNASTPPAPPAVESGSSSADELTAKAVHKRYEGLLMVRTKAIKGKGAWYWAHLEPILVHNTDTGLPKAVKLRCSLCDAVFSASNPSRTASEHLKRGT

Query:  CPNFNSLPKPISSVSPSSFL---PPSPPALLHNNRKRTSSAAAA----GAGGSSYQVPPLAIVDPSRFCGELTYSQSVAAVAAAGGGGGGGGLLAQQPHL
        CPNFNSLPKPISSVSPSSFL   P SPPAL H NRKRTSSAAAA    G+GGSSYQV PLAIVDPSRFCGEL YSQSV A AA+GGGG GGGLLAQQPHL
Subjt:  CPNFNSLPKPISSVSPSSFL---PPSPPALLHNNRKRTSSAAAA----GAGGSSYQVPPLAIVDPSRFCGELTYSQSVAAVAAAGGGGGGGGLLAQQPHL

Query:  MLSGGKEDLGALAMLEDSVKKLKSPKTSPGPTLSKTQIDCALDFLADWVYESGGSVSFSSLEHPKFRAFLNQVGLPAISRRDFTSSRLNLKFEEAKAESE
        MLSGGKEDLGALAMLEDSVKKLKSPKTSPGPTLSKTQIDCALDFLADWVYESGGSVSFSSLEHPKFRAFLNQVGLP++SRR+FTSSRLNLKFEEAKAESE
Subjt:  MLSGGKEDLGALAMLEDSVKKLKSPKTSPGPTLSKTQIDCALDFLADWVYESGGSVSFSSLEHPKFRAFLNQVGLPAISRRDFTSSRLNLKFEEAKAESE

Query:  VKIRDAMFFQLASDGWKDKNYAVFGVDKLVNLTVNLPNGTSLYRRAVFVSGSVPSSYAQEILWETVADISGNVVQQCVGIVADKFKAKALKNLENQNTWM
        VKIRDAMFFQLASDGWKDKNYAVFGVDKLVNLTVNLPNGTSLYRRAVFVSGSVPSSYAQEILWETVADISGNVVQQCVGIVADKFKAKALKNLENQN WM
Subjt:  VKIRDAMFFQLASDGWKDKNYAVFGVDKLVNLTVNLPNGTSLYRRAVFVSGSVPSSYAQEILWETVADISGNVVQQCVGIVADKFKAKALKNLENQNTWM

Query:  VNLSCQFQGFSSLVKDFGKQLPLFKTVTEHCIKLANFVNYKSQIRNCFHKFQLQEYGHAALLRVPPRDHEKLNFGPVFTLMEDILNFSRALQLVVLDETC
        VNLSCQ QGFSSLVKDF KQLPLFKTVTEHCIKLANFVNYKSQ+RNCFHKFQLQEYGHAALLRVPPRDHEKLN GPVFTLM+DIL+FSRALQLVVLDETC
Subjt:  VNLSCQFQGFSSLVKDFGKQLPLFKTVTEHCIKLANFVNYKSQIRNCFHKFQLQEYGHAALLRVPPRDHEKLNFGPVFTLMEDILNFSRALQLVVLDETC

Query:  KIASMDDPVAREVAELIGDVGFWNELEAVHSLVKLVTEMALEIEKERPLVGQCLPLWDQLRAKVKDWCSKFQIAEGPVEKVIEKRFKKNYHPAWAASFIL
        KIASM+DP+AREVA+LIGDVGFWNELEAVHSLVKLVTEMA+EIEK+RPLVGQCLPLWDQLRAKVKDWC KFQI EGPVEKVIEKRF KNYHPAWAA+FIL
Subjt:  KIASMDDPVAREVAELIGDVGFWNELEAVHSLVKLVTEMALEIEKERPLVGQCLPLWDQLRAKVKDWCSKFQIAEGPVEKVIEKRFKKNYHPAWAASFIL

Query:  DPLYLIRDTSGKYLPPFKCLTPDQEKDVDKLITRLVSREEAHIALMELMKWRTEGLDPVYARAVQMKERDPVTGKMRVANPQSSRLVWETYLTEFKSLGK
        DPLYLIRDTSGKYLPPFKCLTPDQEKDVDKLITRLVSREEAHIALMELMKWRTEGLDPVYARAVQMKERDPV+GKMRVANPQSSRLVWETYLTEFKSLGK
Subjt:  DPLYLIRDTSGKYLPPFKCLTPDQEKDVDKLITRLVSREEAHIALMELMKWRTEGLDPVYARAVQMKERDPVTGKMRVANPQSSRLVWETYLTEFKSLGK

Query:  VAVRLIFLHATSCGFKCNWSLLRWVSSHTHQKAGMERAQKLIFISAHSKLERRDFSSDEDKDAELFSLAN
        VAVRLIFLHATSCGFKCNWS+LRWVSSHTHQ  GMERAQKLIFISAHSKLERRDFSSDEDKDAELFSL N
Subjt:  VAVRLIFLHATSCGFKCNWSLLRWVSSHTHQKAGMERAQKLIFISAHSKLERRDFSSDEDKDAELFSLAN

XP_022958371.1 uncharacterized protein LOC111459613 isoform X3 [Cucurbita moschata]0.0e+0093.96Show/hide
Query:  MAATNASTPPAPPAVESGSSSADELTAKAVHKRYEGLLMVRTKAIKGKGAWYWAHLEPILVHNTDTGLPKAVKLRCSLCDAVFSASNPSRTASEHLKRGT
        MAATNASTPP P AVESGSSSADELTAKAVHKRYEGLL+VRTKAIKGKGAWYWAHLEPILVHNTDTGLPKAVKLRCSLCDAVFSASNPSRTASEHLKRGT
Subjt:  MAATNASTPPAPPAVESGSSSADELTAKAVHKRYEGLLMVRTKAIKGKGAWYWAHLEPILVHNTDTGLPKAVKLRCSLCDAVFSASNPSRTASEHLKRGT

Query:  CPNFNSLPKPISSVSPSSFL---PPSPPALLHNNRKRTSSAAAA----GAGGSSYQVPPLAIVDPSRFCGELTYSQSVAAVAAAGGGGGGGGLLAQQPHL
        CPNFNSLPKPISSVSPSSFL   P SPPAL H NRKRTSSAAAA    G+GGSSYQV PLAIVDPSRFCGEL YSQSV A AA+GGGG GGGLLAQQPHL
Subjt:  CPNFNSLPKPISSVSPSSFL---PPSPPALLHNNRKRTSSAAAA----GAGGSSYQVPPLAIVDPSRFCGELTYSQSVAAVAAAGGGGGGGGLLAQQPHL

Query:  MLSGGKEDLGALAMLEDSVKKLKSPKTSPGPTLSKTQIDCALDFLADWVYESGGSVSFSSLEHPKFRAFLNQVGLPAISRRDFTSSRLNLKFEEAKAESE
        MLSGGKEDLGALAMLEDSVKKLKSPKTSPGPTLSKTQIDCALDFLADWVYESGGSVSFSSLEHPKFRAFLNQVGLP++SRR+FTSSRLNLKFEEAKAESE
Subjt:  MLSGGKEDLGALAMLEDSVKKLKSPKTSPGPTLSKTQIDCALDFLADWVYESGGSVSFSSLEHPKFRAFLNQVGLPAISRRDFTSSRLNLKFEEAKAESE

Query:  VKIRDAMFFQLASDGWKDKNYAVFGVDKLVNLTVNLPNGTSLYRRAVFVSGSVPSSYAQEILWETVADISGNVVQQCVGIVADKFKAKALKNLENQNTWM
        VKIRDAMFFQLASDGWKDKNYAVFGVDKLVNLTVNLPNGTSLYRRAVFVSGSVPSSYAQEILWETVADISGNVVQQCVGIVADKFKAKALKNLENQN WM
Subjt:  VKIRDAMFFQLASDGWKDKNYAVFGVDKLVNLTVNLPNGTSLYRRAVFVSGSVPSSYAQEILWETVADISGNVVQQCVGIVADKFKAKALKNLENQNTWM

Query:  VNLSCQFQGFSSLVKDFGKQLPLFKTVTEHCIKLANFVNYKSQIRNCFHKFQLQEYGHAALLRVPPRDHEKLNFGPVFTLMEDILNFSRALQLVVLDETC
        VNLSCQ QGFSSLVKDF KQLPLFKTVTEHCIKLANFVNYKSQ+RNCFHKFQLQEYGHAALLRVPPRDHEKLN GPVFTLM+DIL+FSRALQLVVLDETC
Subjt:  VNLSCQFQGFSSLVKDFGKQLPLFKTVTEHCIKLANFVNYKSQIRNCFHKFQLQEYGHAALLRVPPRDHEKLNFGPVFTLMEDILNFSRALQLVVLDETC

Query:  KIASMDDPVAREVAELIGDVGFWNELEAVHSLVKLVTEMALEIEKERPLVGQCLPLWDQLRAKVKDWCSKFQIAEGPVEKVIEKRFKKNYHPAWAASFIL
        KIASM+DP+AREVA+LIGDVGFWNELEAVHSLVKLVTEMA+EIEK+RPLVGQCLPLWDQLRAKVKDWC KFQI EGPVEKVIEKRF KNYHPAWAA+FIL
Subjt:  KIASMDDPVAREVAELIGDVGFWNELEAVHSLVKLVTEMALEIEKERPLVGQCLPLWDQLRAKVKDWCSKFQIAEGPVEKVIEKRFKKNYHPAWAASFIL

Query:  DPLYLIRDTSGKYLPPFKCLTPDQEKDVDKLITRLVSREEAHIALMELMKWRTEGLDPVYARAVQMKERDPVTGKMRVANPQSSRLVWETYLTEFKSLGK
        DPLYLIRDTSGKYLPPFKCLTPDQEKDVDKLITRLVSREEAHIALMELMKWRTEGLDPVYARAVQMKERDPV+GKMRVANPQSSRLVWETYLTEFKSLGK
Subjt:  DPLYLIRDTSGKYLPPFKCLTPDQEKDVDKLITRLVSREEAHIALMELMKWRTEGLDPVYARAVQMKERDPVTGKMRVANPQSSRLVWETYLTEFKSLGK

Query:  VAVRLIFLHATSCGFKCNWSLLRWVSSHTHQKAGMERAQKLIFISAHSKLERRDFSSDEDKDAELFSLANAKEKQVLP
        VAVRLIFLHATSCGFKCNWS+LRWVSSHTHQ  GMERAQKLIFISAHSKLERRDFSSDEDKDAELFSL NAK+  VLP
Subjt:  VAVRLIFLHATSCGFKCNWSLLRWVSSHTHQKAGMERAQKLIFISAHSKLERRDFSSDEDKDAELFSLANAKEKQVLP

XP_023532891.1 uncharacterized protein LOC111794927 [Cucurbita pepo subsp. pepo]0.0e+0093.66Show/hide
Query:  MAATNASTPPAPPAVESGSSSADELTAKAVHKRYEGLLMVRTKAIKGKGAWYWAHLEPILVHNTDTGLPKAVKLRCSLCDAVFSASNPSRTASEHLKRGT
        MAATNASTPP P AVESGSSSADELTAKAVHKRYEGLL+VRTKAIKGKGAWYWAHLEPILVHNTDTGLPKAVKLRCSLCDAVFSASNPSRTASEHLKRGT
Subjt:  MAATNASTPPAPPAVESGSSSADELTAKAVHKRYEGLLMVRTKAIKGKGAWYWAHLEPILVHNTDTGLPKAVKLRCSLCDAVFSASNPSRTASEHLKRGT

Query:  CPNFNSLPKPISSVSPSSFL---PPSPPALLHNNRKRTSSAAAA----GAGGSSYQVPPLAIVDPSRFCGELTYSQSVAAVAAAGGGGGGGGLLAQQPHL
        CPNFNSLPKPISSVSPSSFL   P SPPAL H NRKRTSSAAAA    G+GGSSYQV PLAIVDPSRFCGEL YSQSVA  A+ GGGG GGGLLAQQPHL
Subjt:  CPNFNSLPKPISSVSPSSFL---PPSPPALLHNNRKRTSSAAAA----GAGGSSYQVPPLAIVDPSRFCGELTYSQSVAAVAAAGGGGGGGGLLAQQPHL

Query:  MLSGGKEDLGALAMLEDSVKKLKSPKTSPGPTLSKTQIDCALDFLADWVYESGGSVSFSSLEHPKFRAFLNQVGLPAISRRDFTSSRLNLKFEEAKAESE
        MLSGGKEDLGALAMLEDSVKKLKSPKTSPGPTLSKTQIDCALDFLADWVYESGGSVSFSSLEHPKFRAFLNQVGLP+ISRR+FTSSRLNLKFEEAKAESE
Subjt:  MLSGGKEDLGALAMLEDSVKKLKSPKTSPGPTLSKTQIDCALDFLADWVYESGGSVSFSSLEHPKFRAFLNQVGLPAISRRDFTSSRLNLKFEEAKAESE

Query:  VKIRDAMFFQLASDGWKDKNYAVFGVDKLVNLTVNLPNGTSLYRRAVFVSGSVPSSYAQEILWETVADISGNVVQQCVGIVADKFKAKALKNLENQNTWM
        VKIRDAMFFQLASDGWKDKNYAVFGVDKLVNLTVNLPNGTSLYRRAVFVSGSVPSSYAQEILWETVADISGNVVQQCVGIVADKFKAKALKNLENQN WM
Subjt:  VKIRDAMFFQLASDGWKDKNYAVFGVDKLVNLTVNLPNGTSLYRRAVFVSGSVPSSYAQEILWETVADISGNVVQQCVGIVADKFKAKALKNLENQNTWM

Query:  VNLSCQFQGFSSLVKDFGKQLPLFKTVTEHCIKLANFVNYKSQIRNCFHKFQLQEYGHAALLRVPPRDHEKLNFGPVFTLMEDILNFSRALQLVVLDETC
        VNLSCQ QGFSSLVKDF KQLPLFKTVTEHCIKLANFVNYKSQ+RNCFHKFQLQEYGHAALLRVPPRDHEKLN GPVFTLM+DIL+FSRALQLVVLDETC
Subjt:  VNLSCQFQGFSSLVKDFGKQLPLFKTVTEHCIKLANFVNYKSQIRNCFHKFQLQEYGHAALLRVPPRDHEKLNFGPVFTLMEDILNFSRALQLVVLDETC

Query:  KIASMDDPVAREVAELIGDVGFWNELEAVHSLVKLVTEMALEIEKERPLVGQCLPLWDQLRAKVKDWCSKFQIAEGPVEKVIEKRFKKNYHPAWAASFIL
        KIASM+DP+AREVA+LIGDVGFWNELEAVHSLVKLVTEMA+EIEK+RPLVGQCLPLWDQLRAKVKDWC KFQI EGPVEKVIEKRF KNYHPAWAA+FIL
Subjt:  KIASMDDPVAREVAELIGDVGFWNELEAVHSLVKLVTEMALEIEKERPLVGQCLPLWDQLRAKVKDWCSKFQIAEGPVEKVIEKRFKKNYHPAWAASFIL

Query:  DPLYLIRDTSGKYLPPFKCLTPDQEKDVDKLITRLVSREEAHIALMELMKWRTEGLDPVYARAVQMKERDPVTGKMRVANPQSSRLVWETYLTEFKSLGK
        DPLYL RDTSGKYLPPFKCLTPDQEKDVDKLITRLVSREEAHIALMELMKWRTEGLDPVYARAVQMKERDPV+GKMRVANPQSSRLVWETYLTEFKSLGK
Subjt:  DPLYLIRDTSGKYLPPFKCLTPDQEKDVDKLITRLVSREEAHIALMELMKWRTEGLDPVYARAVQMKERDPVTGKMRVANPQSSRLVWETYLTEFKSLGK

Query:  VAVRLIFLHATSCGFKCNWSLLRWVSSHTHQKAGMERAQKLIFISAHSKLERRDFSSDEDKDAELFSLANAKE
        VAVRLIFLHATSCGFKCNWS+LRW+SSHTHQ  GMERAQKLIFISAHSKLERRDFSSDEDKDAELFSL N ++
Subjt:  VAVRLIFLHATSCGFKCNWSLLRWVSSHTHQKAGMERAQKLIFISAHSKLERRDFSSDEDKDAELFSLANAKE

TrEMBL top hitse value%identityAlignment
A0A1S4DT49 uncharacterized protein LOC103482664 isoform X10.0e+0092.18Show/hide
Query:  MAATNAST----PPAPPAVESGSSSADELTAKAVHKRYEGLLMVRTKAIKGKGAWYWAHLEPILVHNTDTGLPKAVKLRCSLCDAVFSASNPSRTASEHL
        MAATNAST      A PAVESGSSSADE+TAKAV KRYEGL+MVRTKAIKGKGAWYWAHLEPILVHNTDTGLPKAVKLRCSLCDAVFSASNPSRTASEHL
Subjt:  MAATNAST----PPAPPAVESGSSSADELTAKAVHKRYEGLLMVRTKAIKGKGAWYWAHLEPILVHNTDTGLPKAVKLRCSLCDAVFSASNPSRTASEHL

Query:  KRGTCPNFNSLPKPISSVSPSSFLPP---SPPALLH-NNRKRTSSAAA------AGAGGSSYQVPPLAIVDPSRFCGELTYSQSVAAVAAAGGGGGGGGL
        KRGTCPNFNSLPKPISSVSPSSFLPP   SPP L H NNRKRTSSA A      AG GGSSYQVPPLAIVDPSRFCGELTYS SV       GGGGGGGL
Subjt:  KRGTCPNFNSLPKPISSVSPSSFLPP---SPPALLH-NNRKRTSSAAA------AGAGGSSYQVPPLAIVDPSRFCGELTYSQSVAAVAAAGGGGGGGGL

Query:  LAQQPHLMLSGGKEDLGALAMLEDSVKKLKSPKTSPGPTLSKTQIDCALDFLADWVYESGGSVSFSSLEHPKFRAFLNQVGLPAISRRDFTSSRLNLKFE
        L QQPHLMLSGGKEDLGALAMLEDSVKKLKSPKTSPGPTLSKTQIDCA+DFLADWVYESGGSVSFSSLEHPKFRAFLNQVGLPAISRRDFT+SRLN KFE
Subjt:  LAQQPHLMLSGGKEDLGALAMLEDSVKKLKSPKTSPGPTLSKTQIDCALDFLADWVYESGGSVSFSSLEHPKFRAFLNQVGLPAISRRDFTSSRLNLKFE

Query:  EAKAESEVKIRDAMFFQLASDGWKDKNYAVFGVDKLVNLTVNLPNGTSLYRRAVFVSGSVPSSYAQEILWETVADISGNVVQQCVGIVADKFKAKALKNL
        EAKAESEVKIRDAMFFQLASDGWKDKNYAVFG+DKLVNLTVNLPNGTSLYRRAVFVSGSVPSSYAQEILWETVADISGNVVQQCVGIVADKFKAKALKNL
Subjt:  EAKAESEVKIRDAMFFQLASDGWKDKNYAVFGVDKLVNLTVNLPNGTSLYRRAVFVSGSVPSSYAQEILWETVADISGNVVQQCVGIVADKFKAKALKNL

Query:  ENQNTWMVNLSCQFQGFSSLVKDFGKQLPLFKTVTEHCIKLANFVNYKSQIRNCFHKFQLQEYGHAALLRVPPRDHEKLNFGPVFTLMEDILNFSRALQL
        ENQN WMVNLSCQFQGFSSLVKDF KQLPLF TVTEHC+KLANFVNYKSQIRNCFHK QLQEYG+A LLRVP R+HEKLNFGPVFTLMEDIL+FSRALQL
Subjt:  ENQNTWMVNLSCQFQGFSSLVKDFGKQLPLFKTVTEHCIKLANFVNYKSQIRNCFHKFQLQEYGHAALLRVPPRDHEKLNFGPVFTLMEDILNFSRALQL

Query:  VVLDETCKIASMDDPVAREVAELIGDVGFWNELEAVHSLVKLVTEMALEIEKERPLVGQCLPLWDQLRAKVKDWCSKFQIAEGPVEKVIEKRFKKNYHPA
        VVLDET KIAS+DDP+AREVAELIGDVGFWNELEAVHSLVKL+T+MA+EIEKERPLVGQCLPLWDQLR KVKDWC KFQIAEGPVEKVIEKRFKKNYHPA
Subjt:  VVLDETCKIASMDDPVAREVAELIGDVGFWNELEAVHSLVKLVTEMALEIEKERPLVGQCLPLWDQLRAKVKDWCSKFQIAEGPVEKVIEKRFKKNYHPA

Query:  WAASFILDPLYLIRDTSGKYLPPFKCLTPDQEKDVDKLITRLVSREEAHIALMELMKWRTEGLDPVYARAVQMKERDPVTGKMRVANPQSSRLVWETYLT
        WAASFILDPLYLIRDTSGKYLPPFKCLTPDQEKDVDKLITRLVSREEAHIALMELMKWRTEGLDPVYARAVQMKERDP+TGKMRVANPQSSRLVWETYLT
Subjt:  WAASFILDPLYLIRDTSGKYLPPFKCLTPDQEKDVDKLITRLVSREEAHIALMELMKWRTEGLDPVYARAVQMKERDPVTGKMRVANPQSSRLVWETYLT

Query:  EFKSLGKVAVRLIFLHATSCGFKCNWSLLRWVSSHTHQKAGMERAQKLIFISAHSKLERRDFSSDEDKDAELFSLANAKE
        EFKSLGKVAVRLIFLHATSCGFKCNWSLLRWVSSHTHQKAGM+RAQKLIFISAHSKLERRDFS+DEDKDAELFSLAN ++
Subjt:  EFKSLGKVAVRLIFLHATSCGFKCNWSLLRWVSSHTHQKAGMERAQKLIFISAHSKLERRDFSSDEDKDAELFSLANAKE

A0A5A7THM6 F5O11.10 isoform 30.0e+0092.31Show/hide
Query:  MAATNAST----PPAPPAVESGSSSADELTAKAVHKRYEGLLMVRTKAIKGKGAWYWAHLEPILVHNTDTGLPKAVKLRCSLCDAVFSASNPSRTASEHL
        MAATNAST      A PAVESGSSSADE+TAKAV KRYEGL+MVRTKAIKGKGAWYWAHLEPILVHNTDTGLPKAVKLRCSLCDAVFSASNPSRTASEHL
Subjt:  MAATNAST----PPAPPAVESGSSSADELTAKAVHKRYEGLLMVRTKAIKGKGAWYWAHLEPILVHNTDTGLPKAVKLRCSLCDAVFSASNPSRTASEHL

Query:  KRGTCPNFNSLPKPISSVSPSSFLPP---SPPALLH-NNRKRTSSAAA------AGAGGSSYQVPPLAIVDPSRFCGELTYSQSVAAVAAAGGGGGGGGL
        KRGTCPNFNSLPKPISSVSPSSFLPP   SPP L H NNRKRTSSA A      AG GGSSYQVPPLAIVDPSRFCGELTYS SV       GGGGGGGL
Subjt:  KRGTCPNFNSLPKPISSVSPSSFLPP---SPPALLH-NNRKRTSSAAA------AGAGGSSYQVPPLAIVDPSRFCGELTYSQSVAAVAAAGGGGGGGGL

Query:  LAQQPHLMLSGGKEDLGALAMLEDSVKKLKSPKTSPGPTLSKTQIDCALDFLADWVYESGGSVSFSSLEHPKFRAFLNQVGLPAISRRDFTSSRLNLKFE
        L QQPHLMLSGGKEDLGALAMLEDSVKKLKSPKTSPGPTLSKTQIDCA+DFLADWVYESGGSVSFSSLEHPKFRAFLNQVGLPAISRRDFT+SRLN KFE
Subjt:  LAQQPHLMLSGGKEDLGALAMLEDSVKKLKSPKTSPGPTLSKTQIDCALDFLADWVYESGGSVSFSSLEHPKFRAFLNQVGLPAISRRDFTSSRLNLKFE

Query:  EAKAESEVKIRDAMFFQLASDGWKDKNYAVFGVDKLVNLTVNLPNGTSLYRRAVFVSGSVPSSYAQEILWETVADISGNVVQQCVGIVADKFKAKALKNL
        EAKAESEVKIRDAMFFQLASDGWKDKNYAVFG+DKLVNLTVNLPNGTSLYRRAVFVSGSVPSSYAQEILWETVADISGNVVQQCVGIVADKFKAKALKNL
Subjt:  EAKAESEVKIRDAMFFQLASDGWKDKNYAVFGVDKLVNLTVNLPNGTSLYRRAVFVSGSVPSSYAQEILWETVADISGNVVQQCVGIVADKFKAKALKNL

Query:  ENQNTWMVNLSCQFQGFSSLVKDFGKQLPLFKTVTEHCIKLANFVNYKSQIRNCFHKFQLQEYGHAALLRVPPRDHEKLNFGPVFTLMEDILNFSRALQL
        ENQN WMVNLSCQFQGFSSLVKDF KQLPLF TVTEHC+KLANFVNYKSQIRNCFHK QLQEYG+A LLRVP R+HEKLNFGPVFTLMEDIL+FSRALQL
Subjt:  ENQNTWMVNLSCQFQGFSSLVKDFGKQLPLFKTVTEHCIKLANFVNYKSQIRNCFHKFQLQEYGHAALLRVPPRDHEKLNFGPVFTLMEDILNFSRALQL

Query:  VVLDETCKIASMDDPVAREVAELIGDVGFWNELEAVHSLVKLVTEMALEIEKERPLVGQCLPLWDQLRAKVKDWCSKFQIAEGPVEKVIEKRFKKNYHPA
        VVLDET KIAS+DDP+AREVAELIGDVGFWNELEAVHSLVKL+T+MA+EIEKERPLVGQCLPLWDQLR KVKDWCSKFQIAEGPVEKVIEKRFKKNYHPA
Subjt:  VVLDETCKIASMDDPVAREVAELIGDVGFWNELEAVHSLVKLVTEMALEIEKERPLVGQCLPLWDQLRAKVKDWCSKFQIAEGPVEKVIEKRFKKNYHPA

Query:  WAASFILDPLYLIRDTSGKYLPPFKCLTPDQEKDVDKLITRLVSREEAHIALMELMKWRTEGLDPVYARAVQMKERDPVTGKMRVANPQSSRLVWETYLT
        WAASFILDPLYLIRDTSGKYLPPFKCLTPDQEKDVDKLITRLVSREEAHIALMELMKWRTEGLDPVYARAVQMKERDP+TGKMRVANPQSSRLVWETYLT
Subjt:  WAASFILDPLYLIRDTSGKYLPPFKCLTPDQEKDVDKLITRLVSREEAHIALMELMKWRTEGLDPVYARAVQMKERDPVTGKMRVANPQSSRLVWETYLT

Query:  EFKSLGKVAVRLIFLHATSCGFKCNWSLLRWVSSHTHQKAGMERAQKLIFISAHSKLERRDFSSDEDKDAELFSLANAKE
        EFKSLGKVAVRLIFLHATSCGFKCNWSLLRWVSSHTHQKAGM+RAQKLIFISAHSKLERRDFS+DEDKDAELFSLAN ++
Subjt:  EFKSLGKVAVRLIFLHATSCGFKCNWSLLRWVSSHTHQKAGMERAQKLIFISAHSKLERRDFSSDEDKDAELFSLANAKE

A0A6J1H1W5 uncharacterized protein LOC111459613 isoform X30.0e+0093.96Show/hide
Query:  MAATNASTPPAPPAVESGSSSADELTAKAVHKRYEGLLMVRTKAIKGKGAWYWAHLEPILVHNTDTGLPKAVKLRCSLCDAVFSASNPSRTASEHLKRGT
        MAATNASTPP P AVESGSSSADELTAKAVHKRYEGLL+VRTKAIKGKGAWYWAHLEPILVHNTDTGLPKAVKLRCSLCDAVFSASNPSRTASEHLKRGT
Subjt:  MAATNASTPPAPPAVESGSSSADELTAKAVHKRYEGLLMVRTKAIKGKGAWYWAHLEPILVHNTDTGLPKAVKLRCSLCDAVFSASNPSRTASEHLKRGT

Query:  CPNFNSLPKPISSVSPSSFL---PPSPPALLHNNRKRTSSAAAA----GAGGSSYQVPPLAIVDPSRFCGELTYSQSVAAVAAAGGGGGGGGLLAQQPHL
        CPNFNSLPKPISSVSPSSFL   P SPPAL H NRKRTSSAAAA    G+GGSSYQV PLAIVDPSRFCGEL YSQSV A AA+GGGG GGGLLAQQPHL
Subjt:  CPNFNSLPKPISSVSPSSFL---PPSPPALLHNNRKRTSSAAAA----GAGGSSYQVPPLAIVDPSRFCGELTYSQSVAAVAAAGGGGGGGGLLAQQPHL

Query:  MLSGGKEDLGALAMLEDSVKKLKSPKTSPGPTLSKTQIDCALDFLADWVYESGGSVSFSSLEHPKFRAFLNQVGLPAISRRDFTSSRLNLKFEEAKAESE
        MLSGGKEDLGALAMLEDSVKKLKSPKTSPGPTLSKTQIDCALDFLADWVYESGGSVSFSSLEHPKFRAFLNQVGLP++SRR+FTSSRLNLKFEEAKAESE
Subjt:  MLSGGKEDLGALAMLEDSVKKLKSPKTSPGPTLSKTQIDCALDFLADWVYESGGSVSFSSLEHPKFRAFLNQVGLPAISRRDFTSSRLNLKFEEAKAESE

Query:  VKIRDAMFFQLASDGWKDKNYAVFGVDKLVNLTVNLPNGTSLYRRAVFVSGSVPSSYAQEILWETVADISGNVVQQCVGIVADKFKAKALKNLENQNTWM
        VKIRDAMFFQLASDGWKDKNYAVFGVDKLVNLTVNLPNGTSLYRRAVFVSGSVPSSYAQEILWETVADISGNVVQQCVGIVADKFKAKALKNLENQN WM
Subjt:  VKIRDAMFFQLASDGWKDKNYAVFGVDKLVNLTVNLPNGTSLYRRAVFVSGSVPSSYAQEILWETVADISGNVVQQCVGIVADKFKAKALKNLENQNTWM

Query:  VNLSCQFQGFSSLVKDFGKQLPLFKTVTEHCIKLANFVNYKSQIRNCFHKFQLQEYGHAALLRVPPRDHEKLNFGPVFTLMEDILNFSRALQLVVLDETC
        VNLSCQ QGFSSLVKDF KQLPLFKTVTEHCIKLANFVNYKSQ+RNCFHKFQLQEYGHAALLRVPPRDHEKLN GPVFTLM+DIL+FSRALQLVVLDETC
Subjt:  VNLSCQFQGFSSLVKDFGKQLPLFKTVTEHCIKLANFVNYKSQIRNCFHKFQLQEYGHAALLRVPPRDHEKLNFGPVFTLMEDILNFSRALQLVVLDETC

Query:  KIASMDDPVAREVAELIGDVGFWNELEAVHSLVKLVTEMALEIEKERPLVGQCLPLWDQLRAKVKDWCSKFQIAEGPVEKVIEKRFKKNYHPAWAASFIL
        KIASM+DP+AREVA+LIGDVGFWNELEAVHSLVKLVTEMA+EIEK+RPLVGQCLPLWDQLRAKVKDWC KFQI EGPVEKVIEKRF KNYHPAWAA+FIL
Subjt:  KIASMDDPVAREVAELIGDVGFWNELEAVHSLVKLVTEMALEIEKERPLVGQCLPLWDQLRAKVKDWCSKFQIAEGPVEKVIEKRFKKNYHPAWAASFIL

Query:  DPLYLIRDTSGKYLPPFKCLTPDQEKDVDKLITRLVSREEAHIALMELMKWRTEGLDPVYARAVQMKERDPVTGKMRVANPQSSRLVWETYLTEFKSLGK
        DPLYLIRDTSGKYLPPFKCLTPDQEKDVDKLITRLVSREEAHIALMELMKWRTEGLDPVYARAVQMKERDPV+GKMRVANPQSSRLVWETYLTEFKSLGK
Subjt:  DPLYLIRDTSGKYLPPFKCLTPDQEKDVDKLITRLVSREEAHIALMELMKWRTEGLDPVYARAVQMKERDPVTGKMRVANPQSSRLVWETYLTEFKSLGK

Query:  VAVRLIFLHATSCGFKCNWSLLRWVSSHTHQKAGMERAQKLIFISAHSKLERRDFSSDEDKDAELFSLANAKEKQVLP
        VAVRLIFLHATSCGFKCNWS+LRWVSSHTHQ  GMERAQKLIFISAHSKLERRDFSSDEDKDAELFSL NAK+  VLP
Subjt:  VAVRLIFLHATSCGFKCNWSLLRWVSSHTHQKAGMERAQKLIFISAHSKLERRDFSSDEDKDAELFSLANAKEKQVLP

A0A6J1H3A1 uncharacterized protein LOC111459613 isoform X20.0e+0094.29Show/hide
Query:  MAATNASTPPAPPAVESGSSSADELTAKAVHKRYEGLLMVRTKAIKGKGAWYWAHLEPILVHNTDTGLPKAVKLRCSLCDAVFSASNPSRTASEHLKRGT
        MAATNASTPP P AVESGSSSADELTAKAVHKRYEGLL+VRTKAIKGKGAWYWAHLEPILVHNTDTGLPKAVKLRCSLCDAVFSASNPSRTASEHLKRGT
Subjt:  MAATNASTPPAPPAVESGSSSADELTAKAVHKRYEGLLMVRTKAIKGKGAWYWAHLEPILVHNTDTGLPKAVKLRCSLCDAVFSASNPSRTASEHLKRGT

Query:  CPNFNSLPKPISSVSPSSFL---PPSPPALLHNNRKRTSSAAAA----GAGGSSYQVPPLAIVDPSRFCGELTYSQSVAAVAAAGGGGGGGGLLAQQPHL
        CPNFNSLPKPISSVSPSSFL   P SPPAL H NRKRTSSAAAA    G+GGSSYQV PLAIVDPSRFCGEL YSQSV A AA+GGGG GGGLLAQQPHL
Subjt:  CPNFNSLPKPISSVSPSSFL---PPSPPALLHNNRKRTSSAAAA----GAGGSSYQVPPLAIVDPSRFCGELTYSQSVAAVAAAGGGGGGGGLLAQQPHL

Query:  MLSGGKEDLGALAMLEDSVKKLKSPKTSPGPTLSKTQIDCALDFLADWVYESGGSVSFSSLEHPKFRAFLNQVGLPAISRRDFTSSRLNLKFEEAKAESE
        MLSGGKEDLGALAMLEDSVKKLKSPKTSPGPTLSKTQIDCALDFLADWVYESGGSVSFSSLEHPKFRAFLNQVGLP++SRR+FTSSRLNLKFEEAKAESE
Subjt:  MLSGGKEDLGALAMLEDSVKKLKSPKTSPGPTLSKTQIDCALDFLADWVYESGGSVSFSSLEHPKFRAFLNQVGLPAISRRDFTSSRLNLKFEEAKAESE

Query:  VKIRDAMFFQLASDGWKDKNYAVFGVDKLVNLTVNLPNGTSLYRRAVFVSGSVPSSYAQEILWETVADISGNVVQQCVGIVADKFKAKALKNLENQNTWM
        VKIRDAMFFQLASDGWKDKNYAVFGVDKLVNLTVNLPNGTSLYRRAVFVSGSVPSSYAQEILWETVADISGNVVQQCVGIVADKFKAKALKNLENQN WM
Subjt:  VKIRDAMFFQLASDGWKDKNYAVFGVDKLVNLTVNLPNGTSLYRRAVFVSGSVPSSYAQEILWETVADISGNVVQQCVGIVADKFKAKALKNLENQNTWM

Query:  VNLSCQFQGFSSLVKDFGKQLPLFKTVTEHCIKLANFVNYKSQIRNCFHKFQLQEYGHAALLRVPPRDHEKLNFGPVFTLMEDILNFSRALQLVVLDETC
        VNLSCQ QGFSSLVKDF KQLPLFKTVTEHCIKLANFVNYKSQ+RNCFHKFQLQEYGHAALLRVPPRDHEKLN GPVFTLM+DIL+FSRALQLVVLDETC
Subjt:  VNLSCQFQGFSSLVKDFGKQLPLFKTVTEHCIKLANFVNYKSQIRNCFHKFQLQEYGHAALLRVPPRDHEKLNFGPVFTLMEDILNFSRALQLVVLDETC

Query:  KIASMDDPVAREVAELIGDVGFWNELEAVHSLVKLVTEMALEIEKERPLVGQCLPLWDQLRAKVKDWCSKFQIAEGPVEKVIEKRFKKNYHPAWAASFIL
        KIASM+DP+AREVA+LIGDVGFWNELEAVHSLVKLVTEMA+EIEK+RPLVGQCLPLWDQLRAKVKDWC KFQI EGPVEKVIEKRF KNYHPAWAA+FIL
Subjt:  KIASMDDPVAREVAELIGDVGFWNELEAVHSLVKLVTEMALEIEKERPLVGQCLPLWDQLRAKVKDWCSKFQIAEGPVEKVIEKRFKKNYHPAWAASFIL

Query:  DPLYLIRDTSGKYLPPFKCLTPDQEKDVDKLITRLVSREEAHIALMELMKWRTEGLDPVYARAVQMKERDPVTGKMRVANPQSSRLVWETYLTEFKSLGK
        DPLYLIRDTSGKYLPPFKCLTPDQEKDVDKLITRLVSREEAHIALMELMKWRTEGLDPVYARAVQMKERDPV+GKMRVANPQSSRLVWETYLTEFKSLGK
Subjt:  DPLYLIRDTSGKYLPPFKCLTPDQEKDVDKLITRLVSREEAHIALMELMKWRTEGLDPVYARAVQMKERDPVTGKMRVANPQSSRLVWETYLTEFKSLGK

Query:  VAVRLIFLHATSCGFKCNWSLLRWVSSHTHQKAGMERAQKLIFISAHSKLERRDFSSDEDKDAELFSLAN
        VAVRLIFLHATSCGFKCNWS+LRWVSSHTHQ  GMERAQKLIFISAHSKLERRDFSSDEDKDAELFSL N
Subjt:  VAVRLIFLHATSCGFKCNWSLLRWVSSHTHQKAGMERAQKLIFISAHSKLERRDFSSDEDKDAELFSLAN

A0A6J1H4W9 uncharacterized protein LOC111459613 isoform X10.0e+0093.92Show/hide
Query:  MAATNASTPPAPPAVESGSSSADELTAKAVHKRYEGLLMVRTKAIKGKGAWYWAHLEPILVHNTDTGLPKAVKLRCSLCDAVFSASNPSRTASEHLKRGT
        MAATNASTPP P AVESGSSSADELTAKAVHKRYEGLL+VRTKAIKGKGAWYWAHLEPILVHNTDTGLPKAVKLRCSLCDAVFSASNPSRTASEHLKRGT
Subjt:  MAATNASTPPAPPAVESGSSSADELTAKAVHKRYEGLLMVRTKAIKGKGAWYWAHLEPILVHNTDTGLPKAVKLRCSLCDAVFSASNPSRTASEHLKRGT

Query:  CPNFNSLPKPISSVSPSSFL---PPSPPALLHNNRKRTSSAAAA----GAGGSSYQVPPLAIVDPSRFCGELTYSQSVAAVAAAGGGGGGGGLLAQQPHL
        CPNFNSLPKPISSVSPSSFL   P SPPAL H NRKRTSSAAAA    G+GGSSYQV PLAIVDPSRFCGEL YSQSV A AA+GGGG GGGLLAQQPHL
Subjt:  CPNFNSLPKPISSVSPSSFL---PPSPPALLHNNRKRTSSAAAA----GAGGSSYQVPPLAIVDPSRFCGELTYSQSVAAVAAAGGGGGGGGLLAQQPHL

Query:  MLSGGKEDLGALAMLEDSVKKLKSPKTSPGPTLSKTQIDCALDFLADWVYESGGSVSFSSLEHPKFRAFLNQVGLPAISRRDFTSSRLNLKFEEAKAESE
        MLSGGKEDLGALAMLEDSVKKLKSPKTSPGPTLSKTQIDCALDFLADWVYESGGSVSFSSLEHPKFRAFLNQVGLP++SRR+FTSSRLNLKFEEAKAESE
Subjt:  MLSGGKEDLGALAMLEDSVKKLKSPKTSPGPTLSKTQIDCALDFLADWVYESGGSVSFSSLEHPKFRAFLNQVGLPAISRRDFTSSRLNLKFEEAKAESE

Query:  VKIRDAMFFQLASDGWKDKNYAVFGVDKLVNLTVNLPNGTSLYRRAVFVSGSVPSSYAQEILWETVADISGNVVQQCVGIVADKFKAKALKNLENQNTWM
        VKIRDAMFFQLASDGWKDKNYAVFGVDKLVNLTVNLPNGTSLYRRAVFVSGSVPSSYAQEILWETVADISGNVVQQCVGIVADKFKAKALKNLENQN WM
Subjt:  VKIRDAMFFQLASDGWKDKNYAVFGVDKLVNLTVNLPNGTSLYRRAVFVSGSVPSSYAQEILWETVADISGNVVQQCVGIVADKFKAKALKNLENQNTWM

Query:  VNLSCQFQGFSSLVKDFGKQLPLFKTVTEHCIKLANFVNYKSQIRNCFHKFQLQEYGHAALLRVPPRDHEKLNFGPVFTLMEDILNFSRALQLVVLDETC
        VNLSCQ QGFSSLVKDF KQLPLFKTVTEHCIKLANFVNYKSQ+RNCFHKFQLQEYGHAALLRVPPRDHEKLN GPVFTLM+DIL+FSRALQLVVLDETC
Subjt:  VNLSCQFQGFSSLVKDFGKQLPLFKTVTEHCIKLANFVNYKSQIRNCFHKFQLQEYGHAALLRVPPRDHEKLNFGPVFTLMEDILNFSRALQLVVLDETC

Query:  KIASMDDPVAREVAELIGDVGFWNELEAVHSLVKLVTEMALEIEKERPLVGQCLPLWDQLRAKVKDWCSKFQIAEGPVEKVIEKRFKKNYHPAWAASFIL
        KIASM+DP+AREVA+LIGDVGFWNELEAVHSLVKLVTEMA+EIEK+RPLVGQCLPLWDQLRAKVKDWC KFQI EGPVEKVIEKRF KNYHPAWAA+FIL
Subjt:  KIASMDDPVAREVAELIGDVGFWNELEAVHSLVKLVTEMALEIEKERPLVGQCLPLWDQLRAKVKDWCSKFQIAEGPVEKVIEKRFKKNYHPAWAASFIL

Query:  DPLYLIRDTSGKYLPPFKCLTPDQEKDVDKLITRLVSREEAHIALMELMKWRTEGLDPVYARAVQMKERDPVTGKMRVANPQSSRLVWETYLTEFKSLGK
        DPLYLIRDTSGKYLPPFKCLTPDQEKDVDKLITRLVSREEAHIALMELMKWRTEGLDPVYARAVQMKERDPV+GKMRVANPQSSRLVWETYLTEFKSLGK
Subjt:  DPLYLIRDTSGKYLPPFKCLTPDQEKDVDKLITRLVSREEAHIALMELMKWRTEGLDPVYARAVQMKERDPVTGKMRVANPQSSRLVWETYLTEFKSLGK

Query:  VAVRLIFLHATSCGFKCNWSLLRWVSSHTHQKAGMERAQKLIFISAHSKLERRDFSSDEDKDAELFSLANAKE
        VAVRLIFLHATSCGFKCNWS+LRWVSSHTHQ  GMERAQKLIFISAHSKLERRDFSSDEDKDAELFSL N ++
Subjt:  VAVRLIFLHATSCGFKCNWSLLRWVSSHTHQKAGMERAQKLIFISAHSKLERRDFSSDEDKDAELFSLANAKE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G12380.1 unknown protein3.6e-29665.17Show/hide
Query:  MAATNAST------------PPAPPAVESGSSSADELTAKAVHKRYEGLLMVRTKAIKGKGAWYWAHLEPILVHNTDTGLPKAVKLRCSLCDAVFSASNP
        MA TNA T            PPAP        + DELTAKA++KRYEGL+ VRTKA+KGKGAWYW HLEPILV NTDTGLPKAVKLRCSLCDAVFSASNP
Subjt:  MAATNAST------------PPAPPAVESGSSSADELTAKAVHKRYEGLLMVRTKAIKGKGAWYWAHLEPILVHNTDTGLPKAVKLRCSLCDAVFSASNP

Query:  SRTASEHLKRGTCPNFNSLPKPISSVSPSSFLPPSPPALLHNNRKRTSSAAAAGA---------GGSSYQVPPLAIVDPSRFC-GELTYSQSVAAVAAAG
        SRTASEHLKRGTCPNFNS+  PIS+++PS     S P   H  RKR SS A   A          G SY V P+ +VDPSRFC GEL YS          
Subjt:  SRTASEHLKRGTCPNFNSLPKPISSVSPSSFLPPSPPALLHNNRKRTSSAAAAGA---------GGSSYQVPPLAIVDPSRFC-GELTYSQSVAAVAAAG

Query:  GGGGGGGLLAQQPHLMLSGGKEDLGALAMLEDSVKKLKSPKTSPGPTLSKTQIDCALDFLADWVYESGGSVSFSSLEHPKFRAFLNQVGLPAISRRDFTS
                     HLMLSGGK+DLG LAMLEDSVKKLKSPK S   +L+++QI+ ALD L+DWV+ES GSVS S LEHPKFRAFL QVGLP IS+RDF +
Subjt:  GGGGGGGLLAQQPHLMLSGGKEDLGALAMLEDSVKKLKSPKTSPGPTLSKTQIDCALDFLADWVYESGGSVSFSSLEHPKFRAFLNQVGLPAISRRDFTS

Query:  SRLNLKFEEAKAESEVKIRDAMFFQLASDGWKDKNYAVFGVDKLVNLTVNLPNGTSLYRRAVFVSGSVPSSYAQEILWETVADISGNVVQQCVGIVADKF
        +RL+LK EEA+AE+E +IRDAMFFQ++SDGWK         + LVNL VNLPNGTSLYRRAV V+G+VPS+YA+E+L ETV  I GN  Q+CVGIV+DKF
Subjt:  SRLNLKFEEAKAESEVKIRDAMFFQLASDGWKDKNYAVFGVDKLVNLTVNLPNGTSLYRRAVFVSGSVPSSYAQEILWETVADISGNVVQQCVGIVADKF

Query:  KAKALKNLENQNTWMVNLSCQFQGFSSLVKDFGKQLPLFKTVTEHCIKLANFVNYKSQIRNCFHKFQLQEYGHAALLRVPPR---DHEKLN---------
        K KAL+NLE+Q+ WMVNLSCQFQG +SL+KDF K+LPLFK+V+++C++LA F+N  +QIRN   K+QLQE+G + +LR+P     D E+ +         
Subjt:  KAKALKNLENQNTWMVNLSCQFQGFSSLVKDFGKQLPLFKTVTEHCIKLANFVNYKSQIRNCFHKFQLQEYGHAALLRVPPR---DHEKLN---------

Query:  ----FGPVFTLMEDILNFSRALQLVVLDETCKIASMDDPVAREVAELIGDVGFWNELEAVHSLVKLVTEMALEIEKERPLVGQCLPLWDQLRAKVKDWCS
            + P+F L+ED+L+ +RA+QLVV D+ CK+  M+D +AREV E++GD GFWNE+EAVH+L+KLV EMA  IE+E+ LVGQCLPLWD+LRAKVKDW S
Subjt:  ----FGPVFTLMEDILNFSRALQLVVLDETCKIASMDDPVAREVAELIGDVGFWNELEAVHSLVKLVTEMALEIEKERPLVGQCLPLWDQLRAKVKDWCS

Query:  KFQIAEGPVEKVIEKRFKKNYHPAWAASFILDPLYLIRDTSGKYLPPFKCLTPDQEKDVDKLITRLVSREEAHIALMELMKWRTEGLDPVYARAVQMKER
        KF + EG VEKV+E+RFKK+YHPAWAA+FILDPLYLIRD+SGKYLPPFKCL+P+QEKDVDKLITRLVSR+EAHIALMELMKWRTEGLDP+YARAVQMKER
Subjt:  KFQIAEGPVEKVIEKRFKKNYHPAWAASFILDPLYLIRDTSGKYLPPFKCLTPDQEKDVDKLITRLVSREEAHIALMELMKWRTEGLDPVYARAVQMKER

Query:  DPVTGKMRVANPQSSRLVWETYLTEFKSLGKVAVRLIFLHATSCGFKCNWSLLRWVSSHTHQKAGMERAQKLIFISAHSKLERRDFSSDEDKDAELFSLA
        DPV+GKMR+ANPQSSRLVWETYL+EF+SLGKVAVRLIFLHAT+ GFKCN SLL+WV+S+    A ++RAQKLIFISA+SK ERRDFS++ED+DAEL ++A
Subjt:  DPVTGKMRVANPQSSRLVWETYLTEFKSLGKVAVRLIFLHATSCGFKCNWSLLRWVSSHTHQKAGMERAQKLIFISAHSKLERRDFSSDEDKDAELFSLA

Query:  NAKE
        N  +
Subjt:  NAKE

AT1G62870.1 unknown protein2.8e-30166.67Show/hide
Query:  TNASTPPAPPAVESGS----SSADELTAKAVHKRYEGLLMVRTKAIKGKGAWYWAHLEPILVHNTDTGLPKAVKLRCSLCDAVFSASNPSRTASEHLKRG
        TNA+T       ++ S     SA+EL  KA+ KRYEGL+MVRTKA+KGKGAWYW+HLEPIL+HNTDTG PKAVKLRCSLCDAVFSASNPSRTASEHLKRG
Subjt:  TNASTPPAPPAVESGS----SSADELTAKAVHKRYEGLLMVRTKAIKGKGAWYWAHLEPILVHNTDTGLPKAVKLRCSLCDAVFSASNPSRTASEHLKRG

Query:  TCPNFNSLPKPISSVSPSSFLPPSPPALLHNNRKRTSSAAAA---------GAGGSSYQVPPLAIVDPSRFCGELTYSQSVAAVAAAGGGGGGGGLLAQQ
        TCPNFNSLPKPIS++SPS   PP PP+  H  RKR SSA  A              SY V PL++VDPSRFCG+                      + QQ
Subjt:  TCPNFNSLPKPISSVSPSSFLPPSPPALLHNNRKRTSSAAAA---------GAGGSSYQVPPLAIVDPSRFCGELTYSQSVAAVAAAGGGGGGGGLLAQQ

Query:  PHLMLSGGKEDLGALAMLEDSVKKLKSPKTSPGPTLSKTQIDCALDFLADWVYESGGSVSFSSLEHPKFRAFLNQVGLPAISRRDFTSSRLNLKFEEAKA
        PHLMLSGGK+DLG LAMLEDSVKKLKSPKTS    L+K QID ALD L+DWV+ES GSVS S LEHPK RAFL QVGLP ISRRDF + RL+LK+E+++A
Subjt:  PHLMLSGGKEDLGALAMLEDSVKKLKSPKTSPGPTLSKTQIDCALDFLADWVYESGGSVSFSSLEHPKFRAFLNQVGLPAISRRDFTSSRLNLKFEEAKA

Query:  ESEVKIRDAMFFQLASDGWKDKNYAVFGVDKLVNLTVNLPNGTSLYRRAVFVSGSVPSSYAQEILWETVADISGNVVQQCVGIVADKFKAKALKNLENQN
        E+E +I DAMFFQ+ASDGWK  +      + LVNL VNLPNGTSLYRRAVFV+G+VPS+YA+E+LWETV  I GN  Q+CVGIV+D+F +KAL+NLE+Q+
Subjt:  ESEVKIRDAMFFQLASDGWKDKNYAVFGVDKLVNLTVNLPNGTSLYRRAVFVSGSVPSSYAQEILWETVADISGNVVQQCVGIVADKFKAKALKNLENQN

Query:  TWMVNLSCQFQGFSSLVKDFGKQLPLFKTVTEHCIKLANFVNYKSQIRNCFHKFQLQEYGHAALLRVPPRDHEKLNFGPVFTLMEDILNFSRALQLVVLD
         WMVNLSCQFQGF+SL++DF K+LPLFK+V++ C +L NFVN  +QIRN   K+QLQE G   +L +P    +   F P++ L+ED+L+F+RA+QLV+ D
Subjt:  TWMVNLSCQFQGFSSLVKDFGKQLPLFKTVTEHCIKLANFVNYKSQIRNCFHKFQLQEYGHAALLRVPPRDHEKLNFGPVFTLMEDILNFSRALQLVVLD

Query:  ETCKIASMDDPVAREVAELIGDVGFWNELEAVHSLVKLVTEMALEIEKERPLVGQCLPLWDQLRAKVKDWCSKFQIA-EGPVEKVIEKRFKKNYHPAWAA
        + CK   M+D +AREV E++GDVGFWNE+EAV+ L+KLV EMA  IE+ERPLVGQCLPLWD+LR+K+KDW +KF +  E  VEK++E+RFKK+YHPAWAA
Subjt:  ETCKIASMDDPVAREVAELIGDVGFWNELEAVHSLVKLVTEMALEIEKERPLVGQCLPLWDQLRAKVKDWCSKFQIA-EGPVEKVIEKRFKKNYHPAWAA

Query:  SFILDPLYLIRDTSGKYLPPFKCLTPDQEKDVDKLITRLVSREEAHIALMELMKWRTEGLDPVYARAVQMKERDPVTGKMRVANPQSSRLVWETYLTEFK
        +FILDPLYLI+D+SGKYLPPFKCL+P+QEKDVDKLITRLVSR+EAHIA+MELMKWRTEGLDPVYARAVQMKERDPV+GKMR+ANPQSSRLVWETYL+EF+
Subjt:  SFILDPLYLIRDTSGKYLPPFKCLTPDQEKDVDKLITRLVSREEAHIALMELMKWRTEGLDPVYARAVQMKERDPVTGKMRVANPQSSRLVWETYLTEFK

Query:  SLGKVAVRLIFLHATSCGFKCNWSLLRWVSSHTHQKAGMERAQKLIFISAHSKLERRDFSSDEDKDAELFSLANAKE
        SLG+VAVRLIFLHATSCGFKCN S+LRWV+S+   +A ++RAQKLIFISA+SK ERRDFS++E++DAEL ++AN ++
Subjt:  SLGKVAVRLIFLHATSCGFKCNWSLLRWVSSHTHQKAGMERAQKLIFISAHSKLERRDFSSDEDKDAELFSLANAKE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCACGACGTTGTTACAGGCCAAAATGAACTTTGGATCGTGAAAGAGAAAAAAGGGGTGTTACAGGCCAAACTTGTCGGAAGTTGGAAGCTATTTTTGTGTGTGTGTGA
GTGTGTGTTTGAGGCCGCAATGGCGGCTACTAACGCCTCTACTCCGCCGGCGCCTCCGGCAGTGGAGTCTGGCAGTTCCTCAGCTGATGAGCTGACTGCCAAAGCCGTAC
ACAAGCGATACGAAGGACTGCTGATGGTACGGACGAAGGCTATAAAGGGTAAAGGCGCTTGGTACTGGGCACACCTCGAGCCCATCCTCGTCCATAACACTGACACTGGT
TTGCCCAAAGCCGTTAAACTCAGGTGCTCCTTATGCGACGCCGTTTTCTCCGCCTCCAACCCTTCCAGAACCGCCTCCGAGCATCTCAAGCGCGGTACTTGCCCCAATTT
CAACTCTCTTCCTAAACCCATTTCCTCTGTTTCCCCTTCTTCTTTTTTACCTCCTTCTCCGCCTGCCCTTCTTCACAACAATCGCAAACGAACCTCCTCTGCCGCCGCAG
CCGGAGCTGGTGGGTCTTCTTACCAAGTCCCACCTCTTGCGATTGTCGATCCTTCGCGATTCTGTGGCGAGCTAACCTATTCTCAGTCGGTCGCGGCGGTGGCAGCTGCT
GGTGGCGGCGGCGGCGGTGGGGGCCTGTTAGCGCAGCAGCCGCATTTGATGTTATCGGGAGGGAAAGAGGATTTGGGGGCTTTGGCTATGTTGGAGGACAGTGTGAAGAA
GCTAAAGAGTCCAAAAACTTCGCCTGGGCCAACGCTGAGCAAGACGCAGATTGATTGCGCCCTCGATTTTCTCGCCGATTGGGTCTACGAATCCGGCGGCTCTGTGTCGT
TTTCGAGCCTGGAACATCCGAAATTCAGGGCGTTTCTTAACCAAGTTGGATTACCCGCGATTTCTCGCCGGGATTTTACGAGTTCGCGGTTGAATTTGAAATTCGAGGAG
GCGAAAGCAGAGTCGGAAGTTAAGATTCGTGATGCTATGTTTTTTCAGCTCGCCTCCGATGGGTGGAAGGATAAAAACTACGCCGTTTTTGGCGTCGATAAGCTTGTGAA
TCTCACCGTGAATCTCCCCAATGGGACTAGCTTGTACCGGCGGGCCGTGTTCGTCAGTGGCTCTGTTCCTTCTTCGTACGCTCAGGAAATTTTGTGGGAAACCGTCGCTG
ATATAAGTGGGAATGTGGTTCAACAGTGTGTAGGGATTGTAGCAGACAAGTTCAAGGCGAAGGCACTGAAGAATTTGGAGAATCAAAATACCTGGATGGTTAATCTCTCT
TGTCAGTTTCAGGGTTTTTCGAGTTTGGTTAAGGATTTCGGCAAACAGCTCCCACTGTTCAAGACTGTCACTGAGCATTGTATAAAGCTTGCTAATTTCGTCAATTACAA
GTCTCAGATTCGTAATTGCTTCCATAAATTTCAGCTACAAGAGTACGGCCACGCCGCTTTGCTCCGAGTGCCTCCTCGTGACCATGAGAAGCTCAACTTTGGCCCTGTTT
TCACTTTAATGGAGGACATTCTCAACTTCTCTCGAGCTCTGCAGCTGGTCGTGCTCGACGAGACCTGCAAAATCGCTTCCATGGACGACCCGGTCGCACGAGAAGTAGCC
GAGTTGATTGGAGATGTGGGATTTTGGAATGAATTGGAGGCAGTTCACTCTTTGGTTAAATTGGTAACAGAAATGGCTCTTGAGATTGAGAAAGAGAGGCCACTAGTTGG
GCAATGCCTCCCATTGTGGGATCAGCTGAGGGCCAAAGTCAAGGATTGGTGTTCAAAGTTTCAGATTGCTGAAGGGCCTGTTGAGAAAGTGATTGAAAAGAGGTTCAAAA
AGAACTACCACCCAGCTTGGGCAGCGTCGTTTATCCTTGATCCATTGTATTTGATAAGGGACACGAGCGGGAAGTACCTCCCGCCATTCAAATGTCTGACTCCTGATCAA
GAGAAGGATGTGGATAAGCTCATAACCCGGCTGGTGTCGAGGGAGGAAGCTCACATTGCATTGATGGAGCTTATGAAATGGAGGACAGAAGGGCTTGATCCAGTTTATGC
AAGAGCTGTACAAATGAAGGAAAGAGACCCAGTAACTGGGAAGATGAGAGTTGCAAATCCACAAAGCAGTAGGCTTGTTTGGGAAACTTATCTTACAGAATTCAAGTCAT
TAGGCAAAGTTGCAGTGAGGCTCATATTCCTTCATGCCACTTCCTGTGGATTCAAATGCAACTGGTCTTTGTTGAGATGGGTGTCTTCTCATACTCACCAAAAGGCTGGT
ATGGAGAGGGCTCAGAAGTTGATATTCATTTCTGCTCATTCGAAGCTCGAACGGAGGGATTTTTCCAGTGATGAAGACAAGGATGCCGAGCTATTCTCCCTAGCCAACGC
AAAAGAAAAGCAAGTTCTTCCTGCATTTACCAAACAAAAGTATGTCTTGTTTGATTCATCAAACAGTCACAAATTCTTCTACATTTTTCTCGTAACACGTCGCGTAATCG
CAAAGAAAATTGATGTTTTACCTCGAAGATAA
mRNA sequenceShow/hide mRNA sequence
AACACACAAACAACTTAGTTGAAACCAAAAACTGACCTTGACCCAGTTCCCCAGATCGCCGGCCTCCGCCATGGAAACTGGCCGGAAATTCCAGCGGCGGCTCCGGCCGG
TTCAATGCACGACGTTGTTACAGGCCAAAATGAACTTTGGATCGTGAAAGAGAAAAAAGGGGTGTTACAGGCCAAACTTGTCGGAAGTTGGAAGCTATTTTTGTGTGTGT
GTGAGTGTGTGTTTGAGGCCGCAATGGCGGCTACTAACGCCTCTACTCCGCCGGCGCCTCCGGCAGTGGAGTCTGGCAGTTCCTCAGCTGATGAGCTGACTGCCAAAGCC
GTACACAAGCGATACGAAGGACTGCTGATGGTACGGACGAAGGCTATAAAGGGTAAAGGCGCTTGGTACTGGGCACACCTCGAGCCCATCCTCGTCCATAACACTGACAC
TGGTTTGCCCAAAGCCGTTAAACTCAGGTGCTCCTTATGCGACGCCGTTTTCTCCGCCTCCAACCCTTCCAGAACCGCCTCCGAGCATCTCAAGCGCGGTACTTGCCCCA
ATTTCAACTCTCTTCCTAAACCCATTTCCTCTGTTTCCCCTTCTTCTTTTTTACCTCCTTCTCCGCCTGCCCTTCTTCACAACAATCGCAAACGAACCTCCTCTGCCGCC
GCAGCCGGAGCTGGTGGGTCTTCTTACCAAGTCCCACCTCTTGCGATTGTCGATCCTTCGCGATTCTGTGGCGAGCTAACCTATTCTCAGTCGGTCGCGGCGGTGGCAGC
TGCTGGTGGCGGCGGCGGCGGTGGGGGCCTGTTAGCGCAGCAGCCGCATTTGATGTTATCGGGAGGGAAAGAGGATTTGGGGGCTTTGGCTATGTTGGAGGACAGTGTGA
AGAAGCTAAAGAGTCCAAAAACTTCGCCTGGGCCAACGCTGAGCAAGACGCAGATTGATTGCGCCCTCGATTTTCTCGCCGATTGGGTCTACGAATCCGGCGGCTCTGTG
TCGTTTTCGAGCCTGGAACATCCGAAATTCAGGGCGTTTCTTAACCAAGTTGGATTACCCGCGATTTCTCGCCGGGATTTTACGAGTTCGCGGTTGAATTTGAAATTCGA
GGAGGCGAAAGCAGAGTCGGAAGTTAAGATTCGTGATGCTATGTTTTTTCAGCTCGCCTCCGATGGGTGGAAGGATAAAAACTACGCCGTTTTTGGCGTCGATAAGCTTG
TGAATCTCACCGTGAATCTCCCCAATGGGACTAGCTTGTACCGGCGGGCCGTGTTCGTCAGTGGCTCTGTTCCTTCTTCGTACGCTCAGGAAATTTTGTGGGAAACCGTC
GCTGATATAAGTGGGAATGTGGTTCAACAGTGTGTAGGGATTGTAGCAGACAAGTTCAAGGCGAAGGCACTGAAGAATTTGGAGAATCAAAATACCTGGATGGTTAATCT
CTCTTGTCAGTTTCAGGGTTTTTCGAGTTTGGTTAAGGATTTCGGCAAACAGCTCCCACTGTTCAAGACTGTCACTGAGCATTGTATAAAGCTTGCTAATTTCGTCAATT
ACAAGTCTCAGATTCGTAATTGCTTCCATAAATTTCAGCTACAAGAGTACGGCCACGCCGCTTTGCTCCGAGTGCCTCCTCGTGACCATGAGAAGCTCAACTTTGGCCCT
GTTTTCACTTTAATGGAGGACATTCTCAACTTCTCTCGAGCTCTGCAGCTGGTCGTGCTCGACGAGACCTGCAAAATCGCTTCCATGGACGACCCGGTCGCACGAGAAGT
AGCCGAGTTGATTGGAGATGTGGGATTTTGGAATGAATTGGAGGCAGTTCACTCTTTGGTTAAATTGGTAACAGAAATGGCTCTTGAGATTGAGAAAGAGAGGCCACTAG
TTGGGCAATGCCTCCCATTGTGGGATCAGCTGAGGGCCAAAGTCAAGGATTGGTGTTCAAAGTTTCAGATTGCTGAAGGGCCTGTTGAGAAAGTGATTGAAAAGAGGTTC
AAAAAGAACTACCACCCAGCTTGGGCAGCGTCGTTTATCCTTGATCCATTGTATTTGATAAGGGACACGAGCGGGAAGTACCTCCCGCCATTCAAATGTCTGACTCCTGA
TCAAGAGAAGGATGTGGATAAGCTCATAACCCGGCTGGTGTCGAGGGAGGAAGCTCACATTGCATTGATGGAGCTTATGAAATGGAGGACAGAAGGGCTTGATCCAGTTT
ATGCAAGAGCTGTACAAATGAAGGAAAGAGACCCAGTAACTGGGAAGATGAGAGTTGCAAATCCACAAAGCAGTAGGCTTGTTTGGGAAACTTATCTTACAGAATTCAAG
TCATTAGGCAAAGTTGCAGTGAGGCTCATATTCCTTCATGCCACTTCCTGTGGATTCAAATGCAACTGGTCTTTGTTGAGATGGGTGTCTTCTCATACTCACCAAAAGGC
TGGTATGGAGAGGGCTCAGAAGTTGATATTCATTTCTGCTCATTCGAAGCTCGAACGGAGGGATTTTTCCAGTGATGAAGACAAGGATGCCGAGCTATTCTCCCTAGCCA
ACGCAAAAGAAAAGCAAGTTCTTCCTGCATTTACCAAACAAAAGTATGTCTTGTTTGATTCATCAAACAGTCACAAATTCTTCTACATTTTTCTCGTAACACGTCGCGTA
ATCGCAAAGAAAATTGATGTTTTACCTCGAAGATAATTGTGGTAATTATGGTGAGTTTCTGTTTTTCTTTGCAGACATATTTGACAAATTTATCCTCGGACTTGAAATAC
CACCTTAACCATACACAACATGGTGAGTTTGTTGGATGAACTCAACGCCTAATGATTAGAGTTGAAAATTTCAGCATCGTCGAACGAAACGAGAAATGTCTGGTTGCGTA
TCGAACAGGGG
Protein sequenceShow/hide protein sequence
MHDVVTGQNELWIVKEKKGVLQAKLVGSWKLFLCVCECVFEAAMAATNASTPPAPPAVESGSSSADELTAKAVHKRYEGLLMVRTKAIKGKGAWYWAHLEPILVHNTDTG
LPKAVKLRCSLCDAVFSASNPSRTASEHLKRGTCPNFNSLPKPISSVSPSSFLPPSPPALLHNNRKRTSSAAAAGAGGSSYQVPPLAIVDPSRFCGELTYSQSVAAVAAA
GGGGGGGGLLAQQPHLMLSGGKEDLGALAMLEDSVKKLKSPKTSPGPTLSKTQIDCALDFLADWVYESGGSVSFSSLEHPKFRAFLNQVGLPAISRRDFTSSRLNLKFEE
AKAESEVKIRDAMFFQLASDGWKDKNYAVFGVDKLVNLTVNLPNGTSLYRRAVFVSGSVPSSYAQEILWETVADISGNVVQQCVGIVADKFKAKALKNLENQNTWMVNLS
CQFQGFSSLVKDFGKQLPLFKTVTEHCIKLANFVNYKSQIRNCFHKFQLQEYGHAALLRVPPRDHEKLNFGPVFTLMEDILNFSRALQLVVLDETCKIASMDDPVAREVA
ELIGDVGFWNELEAVHSLVKLVTEMALEIEKERPLVGQCLPLWDQLRAKVKDWCSKFQIAEGPVEKVIEKRFKKNYHPAWAASFILDPLYLIRDTSGKYLPPFKCLTPDQ
EKDVDKLITRLVSREEAHIALMELMKWRTEGLDPVYARAVQMKERDPVTGKMRVANPQSSRLVWETYLTEFKSLGKVAVRLIFLHATSCGFKCNWSLLRWVSSHTHQKAG
MERAQKLIFISAHSKLERRDFSSDEDKDAELFSLANAKEKQVLPAFTKQKYVLFDSSNSHKFFYIFLVTRRVIAKKIDVLPRR