; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0016206 (gene) of Snake gourd v1 genome

Gene IDTan0016206
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionSWIM-type domain-containing protein
Genome locationLG06:80289788..80337499
RNA-Seq ExpressionTan0016206
SyntenyTan0016206
Gene Ontology termsGO:0005515 - protein binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR000270 - PB1 domain
IPR004332 - Transposase, MuDR, plant
IPR006564 - Zinc finger, PMZ-type
IPR007527 - Zinc finger, SWIM-type
IPR018289 - MULE transposase domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004138593.1 uncharacterized protein LOC101203713 isoform X1 [Cucumis sativus]0.0e+0089.55Show/hide
Query:  MASSVQRSPRLDQIHGEIRDSFRVLSYTNTNTPANKRVELFDERGIKEPTTDDNVQ-----------------------------MVVEQTVEVGRQSAF
        MASSVQRSPRLDQIHGEIRDSFR + Y  TNTPANKRVELFDERGI E +TDDNVQ                             MVVEQT+E+GRQSAF
Subjt:  MASSVQRSPRLDQIHGEIRDSFRVLSYTNTNTPANKRVELFDERGIKEPTTDDNVQ-----------------------------MVVEQTVEVGRQSAF

Query:  TLEGQATATKKIIAICQSGGEFVKNKDGSLSYNGGEAYAIDIDQQTNLDDFKTEIAEMFNCSIDTMSLKYFLPGNKKTLISISKDKDLMRMVNFLKDSIT
        TLEGQATATKKIIAICQSGGEFVKNKDGSLSY GGEAYAIDIDQQTNL+DFKTE+AEMF+CSIDTMS+KYFLPGNKKTLIS+SKDKDL RMVNFLKDS+T
Subjt:  TLEGQATATKKIIAICQSGGEFVKNKDGSLSYNGGEAYAIDIDQQTNLDDFKTEIAEMFNCSIDTMSLKYFLPGNKKTLISISKDKDLMRMVNFLKDSIT

Query:  VDVFILSEEAAARNLSSMPASRSSRTTVSEAVVPVVEPVDVAVETIITIDQIGIDISNEVPLICVPSGSSDEKQRKAAQQWENAIIGVDQRFNSFSEFRE
         DVFILSEEAAARNLS+MPASRSSRTTVSEAVVPVVEPVDV VE IIT+DQIG+DIS+EVPLICVP+GSSDEK RKAAQQWENAIIGVDQRFNSFSEFRE
Subjt:  VDVFILSEEAAARNLSSMPASRSSRTTVSEAVVPVVEPVDVAVETIITIDQIGIDISNEVPLICVPSGSSDEKQRKAAQQWENAIIGVDQRFNSFSEFRE

Query:  ALHKYSIAHGFAYRYKKNDSHRVTVKCKYQGCPWRIYASRLSTTQLICIKKMNTNHSCEGAAAKAGYRATRGWVGNIIKEKLKVSPNYKPKDIADDIKRE
        ALHKYSIAHGFAYRYKKNDSHRVTVKCKYQGCPWRIYASRLSTTQLICIKKMNT+HSCEGAAAKAGYRATRGWVGNIIKEKLKVSPNYKPKDIADDIKRE
Subjt:  ALHKYSIAHGFAYRYKKNDSHRVTVKCKYQGCPWRIYASRLSTTQLICIKKMNTNHSCEGAAAKAGYRATRGWVGNIIKEKLKVSPNYKPKDIADDIKRE

Query:  YGIQLNYSQAWRAKEIAREQLQGSYKEAYNQLPYFCEKIKETNPGSVASFTTKDDSSFHRLFISFHASMTGFQQGCRPLLFLDSTPLNSKYQGVFLTATA
        YGIQLNYSQAWRAKEIAREQLQGSYKEAYNQLPYFCEKIKETNPGSVASFTTKDDSSFHRLF+SFHAS++GFQQGCRPLLFLDSTPLNSKYQG F TATA
Subjt:  YGIQLNYSQAWRAKEIAREQLQGSYKEAYNQLPYFCEKIKETNPGSVASFTTKDDSSFHRLFISFHASMTGFQQGCRPLLFLDSTPLNSKYQGVFLTATA

Query:  ADGEDDIFPAAFAVVDAETEENWQWFLMELKSAVKTSEQITFVADFQNGLNKSLAEIFDKSYHSYCLRHLAEKLNNDLKGQFSHEARRFMINDFYATALA
         DGED IFPAAFAVVDAETEENW WFL+ELKSAVK SEQITFVADFQNGLNKSL EIFDKSYHSYCLRHLAEKLNNDLKGQFSHEARRFMINDFYA ALA
Subjt:  ADGEDDIFPAAFAVVDAETEENWQWFLMELKSAVKTSEQITFVADFQNGLNKSLAEIFDKSYHSYCLRHLAEKLNNDLKGQFSHEARRFMINDFYATALA

Query:  PKLEDFQRCAESIKGISPDAYNWIIQSEPEHWANAFFGGARYNHITSNFGQQFYCSIAEAHELPITQMIDVLRGKMMETIYRRRVESDQWMTKLTPANEE
         KLEDFQRCAESIKGISPDAYNWIIQSEPEHWANAFFGGARYNHITSNFGQQFY SI+EAHELPITQMIDVLRGKMMETIY RRVESDQW+TKLTP NEE
Subjt:  PKLEDFQRCAESIKGISPDAYNWIIQSEPEHWANAFFGGARYNHITSNFGQQFYCSIAEAHELPITQMIDVLRGKMMETIYRRRVESDQWMTKLTPANEE

Query:  KLQKEISIAQSFQVLLSHGSTFEVRGESVDIVDVDHWDCSCKGWQLTGLPCCHAIAVFECIGRSPYDYCSRYFTVENYRLTYAESIHPVPNVDRLILGES
        KLQKEISIA+SFQV LSHG+ FEVRGESV  VDVD+WDCSCK WQLTGLPCCHAIAV ECIGRSPYDYC RYFTVE+YRLTYAESIHP+PNVDRLILGES
Subjt:  KLQKEISIAQSFQVLLSHGSTFEVRGESVDIVDVDHWDCSCKGWQLTGLPCCHAIAVFECIGRSPYDYCSRYFTVENYRLTYAESIHPVPNVDRLILGES

Query:  TQALVTVTPPPTRRPPGRPKMKQNESLEVVKRQLQCSKCKGLGHNKKTCKDS
        TQA+VTVTPPPTRRPPGRPKMKQNESLEVVKRQLQCSKCK LGHNKKTCKDS
Subjt:  TQALVTVTPPPTRRPPGRPKMKQNESLEVVKRQLQCSKCKGLGHNKKTCKDS

XP_008458283.1 PREDICTED: uncharacterized protein LOC103497748 isoform X1 [Cucumis melo]0.0e+0089.91Show/hide
Query:  MASSVQRSPRLDQIHGEIRDSFRVLSYTNTNTPANKRVELFDERGIKEPTTDDNVQ-----------------------------MVVEQTVEVGRQSAF
        MASSVQRSP+LDQIHGEIRDSFR + Y  TNTPANKRVELFDERGI E TTDDNVQ                             MVVEQT+E+GRQSAF
Subjt:  MASSVQRSPRLDQIHGEIRDSFRVLSYTNTNTPANKRVELFDERGIKEPTTDDNVQ-----------------------------MVVEQTVEVGRQSAF

Query:  TLEGQATATKKIIAICQSGGEFVKNKDGSLSYNGGEAYAIDIDQQTNLDDFKTEIAEMFNCSIDTMSLKYFLPGNKKTLISISKDKDLMRMVNFLKDSIT
        TLEGQATATKKIIAICQSGGEFVKNKDGSLSY GGEAYAIDIDQQTNL+DFKTE+AEMF+CSI+TMS+KYFLPGNKKTLIS+SKDKDL RMVNFLKDS+T
Subjt:  TLEGQATATKKIIAICQSGGEFVKNKDGSLSYNGGEAYAIDIDQQTNLDDFKTEIAEMFNCSIDTMSLKYFLPGNKKTLISISKDKDLMRMVNFLKDSIT

Query:  VDVFILSEEAAARNLSSMPASRSSRTTVSEAVVPVVEPVDVAVETIITIDQIGIDISNEVPLICVPSGSSDEKQRKAAQQWENAIIGVDQRFNSFSEFRE
         DVFILSEEAAARNLS+MPASRSSRTTVSEAVVPVVEPVDV VETIIT+DQIG+DIS+EVPLICVP+GS DEK RKAAQQWENAIIGVDQRFNSFSEFRE
Subjt:  VDVFILSEEAAARNLSSMPASRSSRTTVSEAVVPVVEPVDVAVETIITIDQIGIDISNEVPLICVPSGSSDEKQRKAAQQWENAIIGVDQRFNSFSEFRE

Query:  ALHKYSIAHGFAYRYKKNDSHRVTVKCKYQGCPWRIYASRLSTTQLICIKKMNTNHSCEGAAAKAGYRATRGWVGNIIKEKLKVSPNYKPKDIADDIKRE
        ALHKYSIAHGFAYRYKKNDSHRVTVKCKYQGCPWRIYASRLSTTQLICIKKMNTNHSCEGAAAKAGYRATRGWVGNIIKEKLKVSPNYKPKDIADDIKRE
Subjt:  ALHKYSIAHGFAYRYKKNDSHRVTVKCKYQGCPWRIYASRLSTTQLICIKKMNTNHSCEGAAAKAGYRATRGWVGNIIKEKLKVSPNYKPKDIADDIKRE

Query:  YGIQLNYSQAWRAKEIAREQLQGSYKEAYNQLPYFCEKIKETNPGSVASFTTKDDSSFHRLFISFHASMTGFQQGCRPLLFLDSTPLNSKYQGVFLTATA
        YGIQLNYSQAWRAKEIAREQLQGSYKEAYNQLPYFCEKIKETNPGSVASFTTKDDSSFHRLF+SFHAS++GFQQGCRPLLFLDSTPLNSKYQG FLTATA
Subjt:  YGIQLNYSQAWRAKEIAREQLQGSYKEAYNQLPYFCEKIKETNPGSVASFTTKDDSSFHRLFISFHASMTGFQQGCRPLLFLDSTPLNSKYQGVFLTATA

Query:  ADGEDDIFPAAFAVVDAETEENWQWFLMELKSAVKTSEQITFVADFQNGLNKSLAEIFDKSYHSYCLRHLAEKLNNDLKGQFSHEARRFMINDFYATALA
         DGED IFPAAFAVVDAETEENW+WFL+ELKSAVK SEQITFVADFQNGLNKSL EIFDKSYHSYCLRHLAEKLNNDLKGQFSHEARRFMINDFYA ALA
Subjt:  ADGEDDIFPAAFAVVDAETEENWQWFLMELKSAVKTSEQITFVADFQNGLNKSLAEIFDKSYHSYCLRHLAEKLNNDLKGQFSHEARRFMINDFYATALA

Query:  PKLEDFQRCAESIKGISPDAYNWIIQSEPEHWANAFFGGARYNHITSNFGQQFYCSIAEAHELPITQMIDVLRGKMMETIYRRRVESDQWMTKLTPANEE
         KLEDFQRCAESIKGISPDAYNWIIQSEPEHWANAFFGGARYNHITSNFG QFY SI+EAHELPITQMIDVLRGKMMETIY RRVESDQWMTKLTP NEE
Subjt:  PKLEDFQRCAESIKGISPDAYNWIIQSEPEHWANAFFGGARYNHITSNFGQQFYCSIAEAHELPITQMIDVLRGKMMETIYRRRVESDQWMTKLTPANEE

Query:  KLQKEISIAQSFQVLLSHGSTFEVRGESVDIVDVDHWDCSCKGWQLTGLPCCHAIAVFECIGRSPYDYCSRYFTVENYRLTYAESIHPVPNVDRLILGES
        KLQKEISIA+SFQV LSHGS FEVRGESV IVDVD+WDCSCK WQLTGLPCCHAIAV ECIGRSPYDYC RYFTVE+YRLTYAESIHP+PNVDRLILGES
Subjt:  KLQKEISIAQSFQVLLSHGSTFEVRGESVDIVDVDHWDCSCKGWQLTGLPCCHAIAVFECIGRSPYDYCSRYFTVENYRLTYAESIHPVPNVDRLILGES

Query:  TQALVTVTPPPTRRPPGRPKMKQNESLEVVKRQLQCSKCKGLGHNKKTCKDS
        TQA+VTVTPPPTRRPPGRPKMKQNESLEVVKRQLQCSKCK LGHNKKTCKDS
Subjt:  TQALVTVTPPPTRRPPGRPKMKQNESLEVVKRQLQCSKCKGLGHNKKTCKDS

XP_022959325.1 uncharacterized protein LOC111460335 isoform X1 [Cucurbita moschata]0.0e+0089.55Show/hide
Query:  MASSVQRSPRLDQIHGEIRDSFRVLSYTNTNTPANKRVELFDERGIKEPTTDDNVQ-----------------------------MVVEQTVEVGRQSAF
        MASSVQRSPRLDQIHGE+RDSFRVLSY  TN PAN+RVE+FDERGI EPTTDDN+Q                             MVVEQTVEVGRQS+ 
Subjt:  MASSVQRSPRLDQIHGEIRDSFRVLSYTNTNTPANKRVELFDERGIKEPTTDDNVQ-----------------------------MVVEQTVEVGRQSAF

Query:  TLEGQATATKKIIAICQSGGEFVKNKDGSLSYNGGEAYAIDIDQQTNLDDFKTEIAEMFNCSIDTMSLKYFLPGNKKTLISISKDKDLMRMVNFLKDSIT
        T +GQATATKKIIAICQSGGEFVKNKDG L YNGGEAYAIDIDQQTNLDDF+TEIAEMF+CSIDTMS+KYFLPGNKKTLISISKDKDL RM+NFLKDSIT
Subjt:  TLEGQATATKKIIAICQSGGEFVKNKDGSLSYNGGEAYAIDIDQQTNLDDFKTEIAEMFNCSIDTMSLKYFLPGNKKTLISISKDKDLMRMVNFLKDSIT

Query:  VDVFILSEEAAARNLSSMPASRSSRTTVSEAVVPVVEPVDVAVETIITIDQIGIDISNEVPLICVPSGSSDEKQRKAAQQWENAIIGVDQRFNSFSEFRE
        VDVFI+SEEA ARNLS MPASRSSRTTVSEAVVPVVEPVDVAVE II +DQIG+D+SNE+PLICVP  SSDEK RKAAQQWENAIIGVDQRFNSF+EFRE
Subjt:  VDVFILSEEAAARNLSSMPASRSSRTTVSEAVVPVVEPVDVAVETIITIDQIGIDISNEVPLICVPSGSSDEKQRKAAQQWENAIIGVDQRFNSFSEFRE

Query:  ALHKYSIAHGFAYRYKKNDSHRVTVKCKYQGCPWRIYASRLSTTQLICIKKMNTNHSCEGAAAKAGYRATRGWVGNIIKEKLKVSPNYKPKDIADDIKRE
        ALHKYSIAHGFAYRYKKNDSHRVTVKCKYQGCPWRIYASRLSTTQLICIKKMNTNHSCEGAAAKAGYRATRGWVGNIIKEKLKVSPNYKPKDIADDIKR+
Subjt:  ALHKYSIAHGFAYRYKKNDSHRVTVKCKYQGCPWRIYASRLSTTQLICIKKMNTNHSCEGAAAKAGYRATRGWVGNIIKEKLKVSPNYKPKDIADDIKRE

Query:  YGIQLNYSQAWRAKEIAREQLQGSYKEAYNQLPYFCEKIKETNPGSVASFTTKDDSSFHRLFISFHASMTGFQQGCRPLLFLDSTPLNSKYQGVFLTATA
        YGIQLNYSQAWRAKEIAREQLQGSYKEAYNQLPYFCE IKETNPGSVASFTTK+DSSFHRLF+SFHAS+ GFQ  CRPLLFLDSTPLNSKYQGVFLTATA
Subjt:  YGIQLNYSQAWRAKEIAREQLQGSYKEAYNQLPYFCEKIKETNPGSVASFTTKDDSSFHRLFISFHASMTGFQQGCRPLLFLDSTPLNSKYQGVFLTATA

Query:  ADGEDDIFPAAFAVVDAETEENWQWFLMELKSAVKTSEQITFVADFQNGLNKSLAEIFDKSYHSYCLRHLAEKLNNDLKGQFSHEARRFMINDFYATALA
         DGED IFPAAFAVVDAETEENW WFL+ELKSAVKTSEQITFVADFQ GLNKSLAEIFDKSYHSYCLRHLAEKLNNDLKGQFSHEARRFMINDFYA A A
Subjt:  ADGEDDIFPAAFAVVDAETEENWQWFLMELKSAVKTSEQITFVADFQNGLNKSLAEIFDKSYHSYCLRHLAEKLNNDLKGQFSHEARRFMINDFYATALA

Query:  PKLEDFQRCAESIKGISPDAYNWIIQSEPEHWANAFFGGARYNHITSNFGQQFYCSIAEAHELPITQMIDVLRGKMMETIYRRRVESDQWMTKLTPANEE
         KLEDFQRCAESIKGISPDAYNWIIQSEPEHWANAFFGGARYNHITSNFGQ+FYCSI+EAHELPITQMIDVLRGKMMETIYRRRVESDQWMTKLTPANEE
Subjt:  PKLEDFQRCAESIKGISPDAYNWIIQSEPEHWANAFFGGARYNHITSNFGQQFYCSIAEAHELPITQMIDVLRGKMMETIYRRRVESDQWMTKLTPANEE

Query:  KLQKEISIAQSFQVLLSHGSTFEVRGESVDIVDVDHWDCSCKGWQLTGLPCCHAIAVFECIGRSPYDYCSRYFTVENYRLTYAESIHPVPNVDRLILGES
        KLQKEISIA+SFQV LS GSTFEVRGESVDIVD+DHWDCSCKGWQLTGLPCCHAIAV ECIGRSPYDYC RYFTVE+YRLTYAESIHPVPNVDRLIL ES
Subjt:  KLQKEISIAQSFQVLLSHGSTFEVRGESVDIVDVDHWDCSCKGWQLTGLPCCHAIAVFECIGRSPYDYCSRYFTVENYRLTYAESIHPVPNVDRLILGES

Query:  TQALVTVTPPPTRRPPGRPKMKQNESLEVVKRQLQCSKCKGLGHNKKTCKDS
        T ALVTVTPPPTRRPPGRPK+KQNESLEVVKRQLQCSKCKGLGHNKKTCKDS
Subjt:  TQALVTVTPPPTRRPPGRPKMKQNESLEVVKRQLQCSKCKGLGHNKKTCKDS

XP_023006497.1 uncharacterized protein LOC111499213 isoform X1 [Cucurbita maxima]0.0e+0089.44Show/hide
Query:  MASSVQRSPRLDQIHGEIRDSFRVLSYTNTNTPANKRVELFDERGIKEPTTDDNVQ-----------------------------MVVEQTVEVGRQSAF
        MASSVQRSPRLDQIHGE+RDSFRVLSY  TN PAN+RV++FDERGI EPTTDDN+Q                             MVVEQTVEVGRQS+ 
Subjt:  MASSVQRSPRLDQIHGEIRDSFRVLSYTNTNTPANKRVELFDERGIKEPTTDDNVQ-----------------------------MVVEQTVEVGRQSAF

Query:  TLEGQATATKKIIAICQSGGEFVKNKDGSLSYNGGEAYAIDIDQQTNLDDFKTEIAEMFNCSIDTMSLKYFLPGNKKTLISISKDKDLMRMVNFLKDSIT
        T +GQATATKKIIAICQSGGEFVKNKDG L YNGGEAYAIDIDQQTNLDDF+TEIAEMF+CSIDTMS+KYFLPGNKKTLISISKDKDL RM+NFLKDSIT
Subjt:  TLEGQATATKKIIAICQSGGEFVKNKDGSLSYNGGEAYAIDIDQQTNLDDFKTEIAEMFNCSIDTMSLKYFLPGNKKTLISISKDKDLMRMVNFLKDSIT

Query:  VDVFILSEEAAARNLSSMPASRSSRTTVSEAVVPVVEPVDVAVETIITIDQIGIDISNEVPLICVPSGSSDEKQRKAAQQWENAIIGVDQRFNSFSEFRE
        VDVFI+SEEA ARNLS MPASRSSRTTVSEAVVPVVEPVDVAVE II +DQIG+D+SNE+PLICVP  SSDEK RKAAQQWENAIIGVDQRFNSF+EFRE
Subjt:  VDVFILSEEAAARNLSSMPASRSSRTTVSEAVVPVVEPVDVAVETIITIDQIGIDISNEVPLICVPSGSSDEKQRKAAQQWENAIIGVDQRFNSFSEFRE

Query:  ALHKYSIAHGFAYRYKKNDSHRVTVKCKYQGCPWRIYASRLSTTQLICIKKMNTNHSCEGAAAKAGYRATRGWVGNIIKEKLKVSPNYKPKDIADDIKRE
        ALHKYSIAHGFAYRYKKNDSHRVTVKCKYQGCPWRIYASRLSTTQLICIKKMNTNHSCEGAAAKAGYRATRGWVGNIIKEKLKVSPNYKPKDIADDIKR+
Subjt:  ALHKYSIAHGFAYRYKKNDSHRVTVKCKYQGCPWRIYASRLSTTQLICIKKMNTNHSCEGAAAKAGYRATRGWVGNIIKEKLKVSPNYKPKDIADDIKRE

Query:  YGIQLNYSQAWRAKEIAREQLQGSYKEAYNQLPYFCEKIKETNPGSVASFTTKDDSSFHRLFISFHASMTGFQQGCRPLLFLDSTPLNSKYQGVFLTATA
        YGIQLNYSQAWRAKEIAREQLQGSYKEAYNQLPYFCE IKETNPGSVASFTTK+DSSFHRLF+SFHAS+ GFQ  CRPLLFLDSTPLNSKYQGVFLTATA
Subjt:  YGIQLNYSQAWRAKEIAREQLQGSYKEAYNQLPYFCEKIKETNPGSVASFTTKDDSSFHRLFISFHASMTGFQQGCRPLLFLDSTPLNSKYQGVFLTATA

Query:  ADGEDDIFPAAFAVVDAETEENWQWFLMELKSAVKTSEQITFVADFQNGLNKSLAEIFDKSYHSYCLRHLAEKLNNDLKGQFSHEARRFMINDFYATALA
         DGED IFPAAFAVVDAETEENW WFL+ELKSAVKTSEQITFVADFQ GLNKSLAEIFDKSYHSYCLRHLAEKLNNDLKGQFSHEARRFMINDFYA A A
Subjt:  ADGEDDIFPAAFAVVDAETEENWQWFLMELKSAVKTSEQITFVADFQNGLNKSLAEIFDKSYHSYCLRHLAEKLNNDLKGQFSHEARRFMINDFYATALA

Query:  PKLEDFQRCAESIKGISPDAYNWIIQSEPEHWANAFFGGARYNHITSNFGQQFYCSIAEAHELPITQMIDVLRGKMMETIYRRRVESDQWMTKLTPANEE
         KLEDFQRCAESIKGISPDAYNWIIQSEPEHWANAFFGGARYNHITSNFGQ+FYCSI+EAHELPITQMIDVLRGKMMETIYRRRVESDQWMTKLTPANEE
Subjt:  PKLEDFQRCAESIKGISPDAYNWIIQSEPEHWANAFFGGARYNHITSNFGQQFYCSIAEAHELPITQMIDVLRGKMMETIYRRRVESDQWMTKLTPANEE

Query:  KLQKEISIAQSFQVLLSHGSTFEVRGESVDIVDVDHWDCSCKGWQLTGLPCCHAIAVFECIGRSPYDYCSRYFTVENYRLTYAESIHPVPNVDRLILGES
        KLQKEISIA+SFQV LS GSTFEVRGESVDIVD+DHWDCSCKGWQLTGLPCCHAIAV ECIGRSPYDYC RYFTVE+YRLTYAESIHPVPNVDRLIL ES
Subjt:  KLQKEISIAQSFQVLLSHGSTFEVRGESVDIVDVDHWDCSCKGWQLTGLPCCHAIAVFECIGRSPYDYCSRYFTVENYRLTYAESIHPVPNVDRLILGES

Query:  TQALVTVTPPPTRRPPGRPKMKQNESLEVVKRQLQCSKCKGLGHNKKTCKDS
        T ALVTVTPPPTRRPPGRPK+KQNESLEVVKRQLQCSKCKGLGHNKKTCKDS
Subjt:  TQALVTVTPPPTRRPPGRPKMKQNESLEVVKRQLQCSKCKGLGHNKKTCKDS

XP_038906599.1 uncharacterized protein LOC120092551 isoform X1 [Benincasa hispida]0.0e+0090.73Show/hide
Query:  MASSVQRSPRLDQIHGEIRDSFRVLSYTNTNTPANKRVELFDERGIKEPTTDDNVQ-----------------------------MVVEQTVEVGRQSAF
        MASSVQRSPRLDQIHGEIRDSFR L Y   NTPANKRVELFDERGI E TTDDNVQ                             MVVEQT+E+GRQSAF
Subjt:  MASSVQRSPRLDQIHGEIRDSFRVLSYTNTNTPANKRVELFDERGIKEPTTDDNVQ-----------------------------MVVEQTVEVGRQSAF

Query:  TLEGQATATKKIIAICQSGGEFVKNKDGSLSYNGGEAYAIDIDQQTNLDDFKTEIAEMFNCSIDTMSLKYFLPGNKKTLISISKDKDLMRMVNFLKDSIT
        TLE QATATKKIIAICQSGGEFVKNKDGSLSYNGGEAYAIDIDQQTNLDDFKTE+AEMF+CSIDTMS+KYFLP NKKTLISISKDKDL RM+NFLKDSIT
Subjt:  TLEGQATATKKIIAICQSGGEFVKNKDGSLSYNGGEAYAIDIDQQTNLDDFKTEIAEMFNCSIDTMSLKYFLPGNKKTLISISKDKDLMRMVNFLKDSIT

Query:  VDVFILSEEAAARNLSSMPASRSSRTTVSEAVVPVVEPVDVAVETIITIDQIGIDISNEVPLICVPSGSSDEKQRKAAQQWENAIIGVDQRFNSFSEFRE
        VDVFILSEEAAARNLS+MPASRSSRTTVSEAVVPVVEP DVAVET+I IDQIG+D+SNEVPLICVP+GS+DEK RKAAQQWENAIIGVDQRFNSFSEFRE
Subjt:  VDVFILSEEAAARNLSSMPASRSSRTTVSEAVVPVVEPVDVAVETIITIDQIGIDISNEVPLICVPSGSSDEKQRKAAQQWENAIIGVDQRFNSFSEFRE

Query:  ALHKYSIAHGFAYRYKKNDSHRVTVKCKYQGCPWRIYASRLSTTQLICIKKMNTNHSCEGAAAKAGYRATRGWVGNIIKEKLKVSPNYKPKDIADDIKRE
        ALHKYSIAHGFAYRYKKNDSHRVTVKCKYQGCPWRIYASRLSTTQLICIKKMNTNHSCEGAAAKAGYRATRGWVGNIIKEKLKVSPNYKPKDIADDIKRE
Subjt:  ALHKYSIAHGFAYRYKKNDSHRVTVKCKYQGCPWRIYASRLSTTQLICIKKMNTNHSCEGAAAKAGYRATRGWVGNIIKEKLKVSPNYKPKDIADDIKRE

Query:  YGIQLNYSQAWRAKEIAREQLQGSYKEAYNQLPYFCEKIKETNPGSVASFTTKDDSSFHRLFISFHASMTGFQQGCRPLLFLDSTPLNSKYQGVFLTATA
        YGIQLNYSQAWRAKEIAREQLQGSYKEAYNQLPYFCEKIKETNPGSVASFTTKDDSSFHRLF+SFHAS++GFQQGCRPLLFLDSTPLNSKYQG F TATA
Subjt:  YGIQLNYSQAWRAKEIAREQLQGSYKEAYNQLPYFCEKIKETNPGSVASFTTKDDSSFHRLFISFHASMTGFQQGCRPLLFLDSTPLNSKYQGVFLTATA

Query:  ADGEDDIFPAAFAVVDAETEENWQWFLMELKSAVKTSEQITFVADFQNGLNKSLAEIFDKSYHSYCLRHLAEKLNNDLKGQFSHEARRFMINDFYATALA
         DGED IFPAAFAVVDAETEENW WFL+ELKSAVK SEQITFVADFQNGLNK L EIFDKSYHSYCLRHLAEKLNNDLKGQFSHEARRFMINDFYA ALA
Subjt:  ADGEDDIFPAAFAVVDAETEENWQWFLMELKSAVKTSEQITFVADFQNGLNKSLAEIFDKSYHSYCLRHLAEKLNNDLKGQFSHEARRFMINDFYATALA

Query:  PKLEDFQRCAESIKGISPDAYNWIIQSEPEHWANAFFGGARYNHITSNFGQQFYCSIAEAHELPITQMIDVLRGKMMETIYRRRVESDQWMTKLTPANEE
         KLEDFQRCAESIKGISPDAYNWIIQSEPEHWANAFFGGARYNHITSNFGQQFY SI+EAHELPITQMIDVLRGKMMETIYRRRVESDQW TKLTPANEE
Subjt:  PKLEDFQRCAESIKGISPDAYNWIIQSEPEHWANAFFGGARYNHITSNFGQQFYCSIAEAHELPITQMIDVLRGKMMETIYRRRVESDQWMTKLTPANEE

Query:  KLQKEISIAQSFQVLLSHGSTFEVRGESVDIVDVDHWDCSCKGWQLTGLPCCHAIAVFECIGRSPYDYCSRYFTVENYRLTYAESIHPVPNVDRLILGES
        KLQKE SIA+SFQV LSHGSTFEVRGESVDIVDVDHWDCSCKGWQLTGLPCCHAIAV ECIGRSPYDYC RYFTVE+YRLTYAESIHP+PNVDRLILGES
Subjt:  KLQKEISIAQSFQVLLSHGSTFEVRGESVDIVDVDHWDCSCKGWQLTGLPCCHAIAVFECIGRSPYDYCSRYFTVENYRLTYAESIHPVPNVDRLILGES

Query:  TQALVTVTPPPTRRPPGRPKMKQNESLEVVKRQLQCSKCKGLGHNKKTCKDS
        TQA+VTVTPPPTRRPPGRPKMKQNESLEVVKRQLQCSKCKGLGHNKKTCKDS
Subjt:  TQALVTVTPPPTRRPPGRPKMKQNESLEVVKRQLQCSKCKGLGHNKKTCKDS

TrEMBL top hitse value%identityAlignment
A0A0A0KCW9 SWIM-type domain-containing protein0.0e+0089.55Show/hide
Query:  MASSVQRSPRLDQIHGEIRDSFRVLSYTNTNTPANKRVELFDERGIKEPTTDDNVQ-----------------------------MVVEQTVEVGRQSAF
        MASSVQRSPRLDQIHGEIRDSFR + Y  TNTPANKRVELFDERGI E +TDDNVQ                             MVVEQT+E+GRQSAF
Subjt:  MASSVQRSPRLDQIHGEIRDSFRVLSYTNTNTPANKRVELFDERGIKEPTTDDNVQ-----------------------------MVVEQTVEVGRQSAF

Query:  TLEGQATATKKIIAICQSGGEFVKNKDGSLSYNGGEAYAIDIDQQTNLDDFKTEIAEMFNCSIDTMSLKYFLPGNKKTLISISKDKDLMRMVNFLKDSIT
        TLEGQATATKKIIAICQSGGEFVKNKDGSLSY GGEAYAIDIDQQTNL+DFKTE+AEMF+CSIDTMS+KYFLPGNKKTLIS+SKDKDL RMVNFLKDS+T
Subjt:  TLEGQATATKKIIAICQSGGEFVKNKDGSLSYNGGEAYAIDIDQQTNLDDFKTEIAEMFNCSIDTMSLKYFLPGNKKTLISISKDKDLMRMVNFLKDSIT

Query:  VDVFILSEEAAARNLSSMPASRSSRTTVSEAVVPVVEPVDVAVETIITIDQIGIDISNEVPLICVPSGSSDEKQRKAAQQWENAIIGVDQRFNSFSEFRE
         DVFILSEEAAARNLS+MPASRSSRTTVSEAVVPVVEPVDV VE IIT+DQIG+DIS+EVPLICVP+GSSDEK RKAAQQWENAIIGVDQRFNSFSEFRE
Subjt:  VDVFILSEEAAARNLSSMPASRSSRTTVSEAVVPVVEPVDVAVETIITIDQIGIDISNEVPLICVPSGSSDEKQRKAAQQWENAIIGVDQRFNSFSEFRE

Query:  ALHKYSIAHGFAYRYKKNDSHRVTVKCKYQGCPWRIYASRLSTTQLICIKKMNTNHSCEGAAAKAGYRATRGWVGNIIKEKLKVSPNYKPKDIADDIKRE
        ALHKYSIAHGFAYRYKKNDSHRVTVKCKYQGCPWRIYASRLSTTQLICIKKMNT+HSCEGAAAKAGYRATRGWVGNIIKEKLKVSPNYKPKDIADDIKRE
Subjt:  ALHKYSIAHGFAYRYKKNDSHRVTVKCKYQGCPWRIYASRLSTTQLICIKKMNTNHSCEGAAAKAGYRATRGWVGNIIKEKLKVSPNYKPKDIADDIKRE

Query:  YGIQLNYSQAWRAKEIAREQLQGSYKEAYNQLPYFCEKIKETNPGSVASFTTKDDSSFHRLFISFHASMTGFQQGCRPLLFLDSTPLNSKYQGVFLTATA
        YGIQLNYSQAWRAKEIAREQLQGSYKEAYNQLPYFCEKIKETNPGSVASFTTKDDSSFHRLF+SFHAS++GFQQGCRPLLFLDSTPLNSKYQG F TATA
Subjt:  YGIQLNYSQAWRAKEIAREQLQGSYKEAYNQLPYFCEKIKETNPGSVASFTTKDDSSFHRLFISFHASMTGFQQGCRPLLFLDSTPLNSKYQGVFLTATA

Query:  ADGEDDIFPAAFAVVDAETEENWQWFLMELKSAVKTSEQITFVADFQNGLNKSLAEIFDKSYHSYCLRHLAEKLNNDLKGQFSHEARRFMINDFYATALA
         DGED IFPAAFAVVDAETEENW WFL+ELKSAVK SEQITFVADFQNGLNKSL EIFDKSYHSYCLRHLAEKLNNDLKGQFSHEARRFMINDFYA ALA
Subjt:  ADGEDDIFPAAFAVVDAETEENWQWFLMELKSAVKTSEQITFVADFQNGLNKSLAEIFDKSYHSYCLRHLAEKLNNDLKGQFSHEARRFMINDFYATALA

Query:  PKLEDFQRCAESIKGISPDAYNWIIQSEPEHWANAFFGGARYNHITSNFGQQFYCSIAEAHELPITQMIDVLRGKMMETIYRRRVESDQWMTKLTPANEE
         KLEDFQRCAESIKGISPDAYNWIIQSEPEHWANAFFGGARYNHITSNFGQQFY SI+EAHELPITQMIDVLRGKMMETIY RRVESDQW+TKLTP NEE
Subjt:  PKLEDFQRCAESIKGISPDAYNWIIQSEPEHWANAFFGGARYNHITSNFGQQFYCSIAEAHELPITQMIDVLRGKMMETIYRRRVESDQWMTKLTPANEE

Query:  KLQKEISIAQSFQVLLSHGSTFEVRGESVDIVDVDHWDCSCKGWQLTGLPCCHAIAVFECIGRSPYDYCSRYFTVENYRLTYAESIHPVPNVDRLILGES
        KLQKEISIA+SFQV LSHG+ FEVRGESV  VDVD+WDCSCK WQLTGLPCCHAIAV ECIGRSPYDYC RYFTVE+YRLTYAESIHP+PNVDRLILGES
Subjt:  KLQKEISIAQSFQVLLSHGSTFEVRGESVDIVDVDHWDCSCKGWQLTGLPCCHAIAVFECIGRSPYDYCSRYFTVENYRLTYAESIHPVPNVDRLILGES

Query:  TQALVTVTPPPTRRPPGRPKMKQNESLEVVKRQLQCSKCKGLGHNKKTCKDS
        TQA+VTVTPPPTRRPPGRPKMKQNESLEVVKRQLQCSKCK LGHNKKTCKDS
Subjt:  TQALVTVTPPPTRRPPGRPKMKQNESLEVVKRQLQCSKCKGLGHNKKTCKDS

A0A1S3C8R6 uncharacterized protein LOC103497748 isoform X10.0e+0089.91Show/hide
Query:  MASSVQRSPRLDQIHGEIRDSFRVLSYTNTNTPANKRVELFDERGIKEPTTDDNVQ-----------------------------MVVEQTVEVGRQSAF
        MASSVQRSP+LDQIHGEIRDSFR + Y  TNTPANKRVELFDERGI E TTDDNVQ                             MVVEQT+E+GRQSAF
Subjt:  MASSVQRSPRLDQIHGEIRDSFRVLSYTNTNTPANKRVELFDERGIKEPTTDDNVQ-----------------------------MVVEQTVEVGRQSAF

Query:  TLEGQATATKKIIAICQSGGEFVKNKDGSLSYNGGEAYAIDIDQQTNLDDFKTEIAEMFNCSIDTMSLKYFLPGNKKTLISISKDKDLMRMVNFLKDSIT
        TLEGQATATKKIIAICQSGGEFVKNKDGSLSY GGEAYAIDIDQQTNL+DFKTE+AEMF+CSI+TMS+KYFLPGNKKTLIS+SKDKDL RMVNFLKDS+T
Subjt:  TLEGQATATKKIIAICQSGGEFVKNKDGSLSYNGGEAYAIDIDQQTNLDDFKTEIAEMFNCSIDTMSLKYFLPGNKKTLISISKDKDLMRMVNFLKDSIT

Query:  VDVFILSEEAAARNLSSMPASRSSRTTVSEAVVPVVEPVDVAVETIITIDQIGIDISNEVPLICVPSGSSDEKQRKAAQQWENAIIGVDQRFNSFSEFRE
         DVFILSEEAAARNLS+MPASRSSRTTVSEAVVPVVEPVDV VETIIT+DQIG+DIS+EVPLICVP+GS DEK RKAAQQWENAIIGVDQRFNSFSEFRE
Subjt:  VDVFILSEEAAARNLSSMPASRSSRTTVSEAVVPVVEPVDVAVETIITIDQIGIDISNEVPLICVPSGSSDEKQRKAAQQWENAIIGVDQRFNSFSEFRE

Query:  ALHKYSIAHGFAYRYKKNDSHRVTVKCKYQGCPWRIYASRLSTTQLICIKKMNTNHSCEGAAAKAGYRATRGWVGNIIKEKLKVSPNYKPKDIADDIKRE
        ALHKYSIAHGFAYRYKKNDSHRVTVKCKYQGCPWRIYASRLSTTQLICIKKMNTNHSCEGAAAKAGYRATRGWVGNIIKEKLKVSPNYKPKDIADDIKRE
Subjt:  ALHKYSIAHGFAYRYKKNDSHRVTVKCKYQGCPWRIYASRLSTTQLICIKKMNTNHSCEGAAAKAGYRATRGWVGNIIKEKLKVSPNYKPKDIADDIKRE

Query:  YGIQLNYSQAWRAKEIAREQLQGSYKEAYNQLPYFCEKIKETNPGSVASFTTKDDSSFHRLFISFHASMTGFQQGCRPLLFLDSTPLNSKYQGVFLTATA
        YGIQLNYSQAWRAKEIAREQLQGSYKEAYNQLPYFCEKIKETNPGSVASFTTKDDSSFHRLF+SFHAS++GFQQGCRPLLFLDSTPLNSKYQG FLTATA
Subjt:  YGIQLNYSQAWRAKEIAREQLQGSYKEAYNQLPYFCEKIKETNPGSVASFTTKDDSSFHRLFISFHASMTGFQQGCRPLLFLDSTPLNSKYQGVFLTATA

Query:  ADGEDDIFPAAFAVVDAETEENWQWFLMELKSAVKTSEQITFVADFQNGLNKSLAEIFDKSYHSYCLRHLAEKLNNDLKGQFSHEARRFMINDFYATALA
         DGED IFPAAFAVVDAETEENW+WFL+ELKSAVK SEQITFVADFQNGLNKSL EIFDKSYHSYCLRHLAEKLNNDLKGQFSHEARRFMINDFYA ALA
Subjt:  ADGEDDIFPAAFAVVDAETEENWQWFLMELKSAVKTSEQITFVADFQNGLNKSLAEIFDKSYHSYCLRHLAEKLNNDLKGQFSHEARRFMINDFYATALA

Query:  PKLEDFQRCAESIKGISPDAYNWIIQSEPEHWANAFFGGARYNHITSNFGQQFYCSIAEAHELPITQMIDVLRGKMMETIYRRRVESDQWMTKLTPANEE
         KLEDFQRCAESIKGISPDAYNWIIQSEPEHWANAFFGGARYNHITSNFG QFY SI+EAHELPITQMIDVLRGKMMETIY RRVESDQWMTKLTP NEE
Subjt:  PKLEDFQRCAESIKGISPDAYNWIIQSEPEHWANAFFGGARYNHITSNFGQQFYCSIAEAHELPITQMIDVLRGKMMETIYRRRVESDQWMTKLTPANEE

Query:  KLQKEISIAQSFQVLLSHGSTFEVRGESVDIVDVDHWDCSCKGWQLTGLPCCHAIAVFECIGRSPYDYCSRYFTVENYRLTYAESIHPVPNVDRLILGES
        KLQKEISIA+SFQV LSHGS FEVRGESV IVDVD+WDCSCK WQLTGLPCCHAIAV ECIGRSPYDYC RYFTVE+YRLTYAESIHP+PNVDRLILGES
Subjt:  KLQKEISIAQSFQVLLSHGSTFEVRGESVDIVDVDHWDCSCKGWQLTGLPCCHAIAVFECIGRSPYDYCSRYFTVENYRLTYAESIHPVPNVDRLILGES

Query:  TQALVTVTPPPTRRPPGRPKMKQNESLEVVKRQLQCSKCKGLGHNKKTCKDS
        TQA+VTVTPPPTRRPPGRPKMKQNESLEVVKRQLQCSKCK LGHNKKTCKDS
Subjt:  TQALVTVTPPPTRRPPGRPKMKQNESLEVVKRQLQCSKCKGLGHNKKTCKDS

A0A6J1CAN4 uncharacterized protein LOC111009921 isoform X10.0e+0090.23Show/hide
Query:  VLSYTNTNTPANKRVELFDERGIKEPTTDDNVQ-----------------------------MVVEQTVEVGRQSAFTLEGQATATKKIIAICQSGGEFV
        +L +  TNT ANKR+ELF ERG  EPTTDDN++                             MVVEQTVEVGRQSA  LEGQATATKKIIAICQSGGEFV
Subjt:  VLSYTNTNTPANKRVELFDERGIKEPTTDDNVQ-----------------------------MVVEQTVEVGRQSAFTLEGQATATKKIIAICQSGGEFV

Query:  KNKDGSLSYNGGEAYAIDIDQQTNLDDFKTEIAEMFNCSIDTMSLKYFLPGNKKTLISISKDKDLMRMVNFLKDSITVDVFILSEEAAARNLSSMPASRS
        KNKDGSL YNGGEAYAIDIDQQTNLDDFKTE+AEMF+CSI TMS+KYFLPGNK+TLISISKDKDL RMV FL+DSITVDVFILSEEAAARNLS+MPASRS
Subjt:  KNKDGSLSYNGGEAYAIDIDQQTNLDDFKTEIAEMFNCSIDTMSLKYFLPGNKKTLISISKDKDLMRMVNFLKDSITVDVFILSEEAAARNLSSMPASRS

Query:  SRTTVSEAVVPVVEPVDVAVETIITIDQIGIDISNEVPLICVPSGSSDEKQRKAAQQWENAIIGVDQRFNSFSEFREALHKYSIAHGFAYRYKKNDSHRV
        SRTTVSEAVVPVV PVDVAVETII IDQIG+D+S+EVPLICVP+GSSDEK RKAAQQWENAIIGVDQRFNSFSEFREALHKYS+AHGFAYRYKKNDSHRV
Subjt:  SRTTVSEAVVPVVEPVDVAVETIITIDQIGIDISNEVPLICVPSGSSDEKQRKAAQQWENAIIGVDQRFNSFSEFREALHKYSIAHGFAYRYKKNDSHRV

Query:  TVKCKYQGCPWRIYASRLSTTQLICIKKMNTNHSCEGAAAKAGYRATRGWVGNIIKEKLKVSPNYKPKDIADDIKREYGIQLNYSQAWRAKEIAREQLQG
        TVKCKYQGCPWRIYASRLSTTQLICIKKMNT+HSCEGAAAKAGYRATRGWVGNIIKEKLKVSPNYKPKDIADDIKREYGIQLNYSQAWRAKEIAREQLQG
Subjt:  TVKCKYQGCPWRIYASRLSTTQLICIKKMNTNHSCEGAAAKAGYRATRGWVGNIIKEKLKVSPNYKPKDIADDIKREYGIQLNYSQAWRAKEIAREQLQG

Query:  SYKEAYNQLPYFCEKIKETNPGSVASFTTKDDSSFHRLFISFHASMTGFQQGCRPLLFLDSTPLNSKYQGVFLTATAADGEDDIFPAAFAVVDAETEENW
        SYKEAYNQLPYFC+KIKETNPGSVASFTTKDDSSFHRLF+SFHAS++GFQQGCRPLLFLDSTPLNSKYQGVFLTATAADGED IFPAAFAVVDAET+ENW
Subjt:  SYKEAYNQLPYFCEKIKETNPGSVASFTTKDDSSFHRLFISFHASMTGFQQGCRPLLFLDSTPLNSKYQGVFLTATAADGEDDIFPAAFAVVDAETEENW

Query:  QWFLMELKSAVKTSEQITFVADFQNGLNKSLAEIFDKSYHSYCLRHLAEKLNNDLKGQFSHEARRFMINDFYATALAPKLEDFQRCAESIKGISPDAYNW
         WFL++LKSAV+TSEQITFVADFQNGLNKSLAEIFDKSYHSYCLRHLAEKLNNDLKGQFSHEARRFMINDFYA ALA KLEDFQRCAESIKGISPDAYNW
Subjt:  QWFLMELKSAVKTSEQITFVADFQNGLNKSLAEIFDKSYHSYCLRHLAEKLNNDLKGQFSHEARRFMINDFYATALAPKLEDFQRCAESIKGISPDAYNW

Query:  IIQSEPEHWANAFFGGARYNHITSNFGQQFYCSIAEAHELPITQMIDVLRGKMMETIYRRRVESDQWMTKLTPANEEKLQKEISIAQSFQVLLSHGSTFE
        IIQSEPEHWANAFFGGARYNHITSNFGQQFYCS++EAHELPITQMIDVLRGKMMETIYRRRVESDQWMTKLTP+NEEKLQKEISIA+SFQVLLSHGSTFE
Subjt:  IIQSEPEHWANAFFGGARYNHITSNFGQQFYCSIAEAHELPITQMIDVLRGKMMETIYRRRVESDQWMTKLTPANEEKLQKEISIAQSFQVLLSHGSTFE

Query:  VRGESVDIVDVDHWDCSCKGWQLTGLPCCHAIAVFECIGRSPYDYCSRYFTVENYRLTYAESIHPVPNVDRLILGESTQALVTVTPPPTRRPPGRPKMKQ
        VRGESVDIVDVDHWDCSCKGWQLTGLPCCHAIAVF+CIGRSPYDYCSRYFTVENYRLTYAESIHPVPNVDR ILGESTQ LVTVTPPPTRRPPGRPKMKQ
Subjt:  VRGESVDIVDVDHWDCSCKGWQLTGLPCCHAIAVFECIGRSPYDYCSRYFTVENYRLTYAESIHPVPNVDRLILGESTQALVTVTPPPTRRPPGRPKMKQ

Query:  NESLEVVKRQLQCSKCKGLGHNKKTCKDS
        NESLEVVKRQLQCSKCKGLGHNKKTCKDS
Subjt:  NESLEVVKRQLQCSKCKGLGHNKKTCKDS

A0A6J1H5Z5 uncharacterized protein LOC111460335 isoform X10.0e+0089.55Show/hide
Query:  MASSVQRSPRLDQIHGEIRDSFRVLSYTNTNTPANKRVELFDERGIKEPTTDDNVQ-----------------------------MVVEQTVEVGRQSAF
        MASSVQRSPRLDQIHGE+RDSFRVLSY  TN PAN+RVE+FDERGI EPTTDDN+Q                             MVVEQTVEVGRQS+ 
Subjt:  MASSVQRSPRLDQIHGEIRDSFRVLSYTNTNTPANKRVELFDERGIKEPTTDDNVQ-----------------------------MVVEQTVEVGRQSAF

Query:  TLEGQATATKKIIAICQSGGEFVKNKDGSLSYNGGEAYAIDIDQQTNLDDFKTEIAEMFNCSIDTMSLKYFLPGNKKTLISISKDKDLMRMVNFLKDSIT
        T +GQATATKKIIAICQSGGEFVKNKDG L YNGGEAYAIDIDQQTNLDDF+TEIAEMF+CSIDTMS+KYFLPGNKKTLISISKDKDL RM+NFLKDSIT
Subjt:  TLEGQATATKKIIAICQSGGEFVKNKDGSLSYNGGEAYAIDIDQQTNLDDFKTEIAEMFNCSIDTMSLKYFLPGNKKTLISISKDKDLMRMVNFLKDSIT

Query:  VDVFILSEEAAARNLSSMPASRSSRTTVSEAVVPVVEPVDVAVETIITIDQIGIDISNEVPLICVPSGSSDEKQRKAAQQWENAIIGVDQRFNSFSEFRE
        VDVFI+SEEA ARNLS MPASRSSRTTVSEAVVPVVEPVDVAVE II +DQIG+D+SNE+PLICVP  SSDEK RKAAQQWENAIIGVDQRFNSF+EFRE
Subjt:  VDVFILSEEAAARNLSSMPASRSSRTTVSEAVVPVVEPVDVAVETIITIDQIGIDISNEVPLICVPSGSSDEKQRKAAQQWENAIIGVDQRFNSFSEFRE

Query:  ALHKYSIAHGFAYRYKKNDSHRVTVKCKYQGCPWRIYASRLSTTQLICIKKMNTNHSCEGAAAKAGYRATRGWVGNIIKEKLKVSPNYKPKDIADDIKRE
        ALHKYSIAHGFAYRYKKNDSHRVTVKCKYQGCPWRIYASRLSTTQLICIKKMNTNHSCEGAAAKAGYRATRGWVGNIIKEKLKVSPNYKPKDIADDIKR+
Subjt:  ALHKYSIAHGFAYRYKKNDSHRVTVKCKYQGCPWRIYASRLSTTQLICIKKMNTNHSCEGAAAKAGYRATRGWVGNIIKEKLKVSPNYKPKDIADDIKRE

Query:  YGIQLNYSQAWRAKEIAREQLQGSYKEAYNQLPYFCEKIKETNPGSVASFTTKDDSSFHRLFISFHASMTGFQQGCRPLLFLDSTPLNSKYQGVFLTATA
        YGIQLNYSQAWRAKEIAREQLQGSYKEAYNQLPYFCE IKETNPGSVASFTTK+DSSFHRLF+SFHAS+ GFQ  CRPLLFLDSTPLNSKYQGVFLTATA
Subjt:  YGIQLNYSQAWRAKEIAREQLQGSYKEAYNQLPYFCEKIKETNPGSVASFTTKDDSSFHRLFISFHASMTGFQQGCRPLLFLDSTPLNSKYQGVFLTATA

Query:  ADGEDDIFPAAFAVVDAETEENWQWFLMELKSAVKTSEQITFVADFQNGLNKSLAEIFDKSYHSYCLRHLAEKLNNDLKGQFSHEARRFMINDFYATALA
         DGED IFPAAFAVVDAETEENW WFL+ELKSAVKTSEQITFVADFQ GLNKSLAEIFDKSYHSYCLRHLAEKLNNDLKGQFSHEARRFMINDFYA A A
Subjt:  ADGEDDIFPAAFAVVDAETEENWQWFLMELKSAVKTSEQITFVADFQNGLNKSLAEIFDKSYHSYCLRHLAEKLNNDLKGQFSHEARRFMINDFYATALA

Query:  PKLEDFQRCAESIKGISPDAYNWIIQSEPEHWANAFFGGARYNHITSNFGQQFYCSIAEAHELPITQMIDVLRGKMMETIYRRRVESDQWMTKLTPANEE
         KLEDFQRCAESIKGISPDAYNWIIQSEPEHWANAFFGGARYNHITSNFGQ+FYCSI+EAHELPITQMIDVLRGKMMETIYRRRVESDQWMTKLTPANEE
Subjt:  PKLEDFQRCAESIKGISPDAYNWIIQSEPEHWANAFFGGARYNHITSNFGQQFYCSIAEAHELPITQMIDVLRGKMMETIYRRRVESDQWMTKLTPANEE

Query:  KLQKEISIAQSFQVLLSHGSTFEVRGESVDIVDVDHWDCSCKGWQLTGLPCCHAIAVFECIGRSPYDYCSRYFTVENYRLTYAESIHPVPNVDRLILGES
        KLQKEISIA+SFQV LS GSTFEVRGESVDIVD+DHWDCSCKGWQLTGLPCCHAIAV ECIGRSPYDYC RYFTVE+YRLTYAESIHPVPNVDRLIL ES
Subjt:  KLQKEISIAQSFQVLLSHGSTFEVRGESVDIVDVDHWDCSCKGWQLTGLPCCHAIAVFECIGRSPYDYCSRYFTVENYRLTYAESIHPVPNVDRLILGES

Query:  TQALVTVTPPPTRRPPGRPKMKQNESLEVVKRQLQCSKCKGLGHNKKTCKDS
        T ALVTVTPPPTRRPPGRPK+KQNESLEVVKRQLQCSKCKGLGHNKKTCKDS
Subjt:  TQALVTVTPPPTRRPPGRPKMKQNESLEVVKRQLQCSKCKGLGHNKKTCKDS

A0A6J1KW10 uncharacterized protein LOC111499213 isoform X10.0e+0089.44Show/hide
Query:  MASSVQRSPRLDQIHGEIRDSFRVLSYTNTNTPANKRVELFDERGIKEPTTDDNVQ-----------------------------MVVEQTVEVGRQSAF
        MASSVQRSPRLDQIHGE+RDSFRVLSY  TN PAN+RV++FDERGI EPTTDDN+Q                             MVVEQTVEVGRQS+ 
Subjt:  MASSVQRSPRLDQIHGEIRDSFRVLSYTNTNTPANKRVELFDERGIKEPTTDDNVQ-----------------------------MVVEQTVEVGRQSAF

Query:  TLEGQATATKKIIAICQSGGEFVKNKDGSLSYNGGEAYAIDIDQQTNLDDFKTEIAEMFNCSIDTMSLKYFLPGNKKTLISISKDKDLMRMVNFLKDSIT
        T +GQATATKKIIAICQSGGEFVKNKDG L YNGGEAYAIDIDQQTNLDDF+TEIAEMF+CSIDTMS+KYFLPGNKKTLISISKDKDL RM+NFLKDSIT
Subjt:  TLEGQATATKKIIAICQSGGEFVKNKDGSLSYNGGEAYAIDIDQQTNLDDFKTEIAEMFNCSIDTMSLKYFLPGNKKTLISISKDKDLMRMVNFLKDSIT

Query:  VDVFILSEEAAARNLSSMPASRSSRTTVSEAVVPVVEPVDVAVETIITIDQIGIDISNEVPLICVPSGSSDEKQRKAAQQWENAIIGVDQRFNSFSEFRE
        VDVFI+SEEA ARNLS MPASRSSRTTVSEAVVPVVEPVDVAVE II +DQIG+D+SNE+PLICVP  SSDEK RKAAQQWENAIIGVDQRFNSF+EFRE
Subjt:  VDVFILSEEAAARNLSSMPASRSSRTTVSEAVVPVVEPVDVAVETIITIDQIGIDISNEVPLICVPSGSSDEKQRKAAQQWENAIIGVDQRFNSFSEFRE

Query:  ALHKYSIAHGFAYRYKKNDSHRVTVKCKYQGCPWRIYASRLSTTQLICIKKMNTNHSCEGAAAKAGYRATRGWVGNIIKEKLKVSPNYKPKDIADDIKRE
        ALHKYSIAHGFAYRYKKNDSHRVTVKCKYQGCPWRIYASRLSTTQLICIKKMNTNHSCEGAAAKAGYRATRGWVGNIIKEKLKVSPNYKPKDIADDIKR+
Subjt:  ALHKYSIAHGFAYRYKKNDSHRVTVKCKYQGCPWRIYASRLSTTQLICIKKMNTNHSCEGAAAKAGYRATRGWVGNIIKEKLKVSPNYKPKDIADDIKRE

Query:  YGIQLNYSQAWRAKEIAREQLQGSYKEAYNQLPYFCEKIKETNPGSVASFTTKDDSSFHRLFISFHASMTGFQQGCRPLLFLDSTPLNSKYQGVFLTATA
        YGIQLNYSQAWRAKEIAREQLQGSYKEAYNQLPYFCE IKETNPGSVASFTTK+DSSFHRLF+SFHAS+ GFQ  CRPLLFLDSTPLNSKYQGVFLTATA
Subjt:  YGIQLNYSQAWRAKEIAREQLQGSYKEAYNQLPYFCEKIKETNPGSVASFTTKDDSSFHRLFISFHASMTGFQQGCRPLLFLDSTPLNSKYQGVFLTATA

Query:  ADGEDDIFPAAFAVVDAETEENWQWFLMELKSAVKTSEQITFVADFQNGLNKSLAEIFDKSYHSYCLRHLAEKLNNDLKGQFSHEARRFMINDFYATALA
         DGED IFPAAFAVVDAETEENW WFL+ELKSAVKTSEQITFVADFQ GLNKSLAEIFDKSYHSYCLRHLAEKLNNDLKGQFSHEARRFMINDFYA A A
Subjt:  ADGEDDIFPAAFAVVDAETEENWQWFLMELKSAVKTSEQITFVADFQNGLNKSLAEIFDKSYHSYCLRHLAEKLNNDLKGQFSHEARRFMINDFYATALA

Query:  PKLEDFQRCAESIKGISPDAYNWIIQSEPEHWANAFFGGARYNHITSNFGQQFYCSIAEAHELPITQMIDVLRGKMMETIYRRRVESDQWMTKLTPANEE
         KLEDFQRCAESIKGISPDAYNWIIQSEPEHWANAFFGGARYNHITSNFGQ+FYCSI+EAHELPITQMIDVLRGKMMETIYRRRVESDQWMTKLTPANEE
Subjt:  PKLEDFQRCAESIKGISPDAYNWIIQSEPEHWANAFFGGARYNHITSNFGQQFYCSIAEAHELPITQMIDVLRGKMMETIYRRRVESDQWMTKLTPANEE

Query:  KLQKEISIAQSFQVLLSHGSTFEVRGESVDIVDVDHWDCSCKGWQLTGLPCCHAIAVFECIGRSPYDYCSRYFTVENYRLTYAESIHPVPNVDRLILGES
        KLQKEISIA+SFQV LS GSTFEVRGESVDIVD+DHWDCSCKGWQLTGLPCCHAIAV ECIGRSPYDYC RYFTVE+YRLTYAESIHPVPNVDRLIL ES
Subjt:  KLQKEISIAQSFQVLLSHGSTFEVRGESVDIVDVDHWDCSCKGWQLTGLPCCHAIAVFECIGRSPYDYCSRYFTVENYRLTYAESIHPVPNVDRLILGES

Query:  TQALVTVTPPPTRRPPGRPKMKQNESLEVVKRQLQCSKCKGLGHNKKTCKDS
        T ALVTVTPPPTRRPPGRPK+KQNESLEVVKRQLQCSKCKGLGHNKKTCKDS
Subjt:  TQALVTVTPPPTRRPPGRPKMKQNESLEVVKRQLQCSKCKGLGHNKKTCKDS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G49920.1 MuDR family transposase4.2e-3624.25Show/hide
Query:  SGSSDEKQRKAAQQWENAIIGVDQRFNSFSEFREALHKYSIAHGFAYRYKKNDSHRVTVKCKYQGCPWRIYASRLSTTQLICIKKMNTNHSCEGAAAKAG
        SGS       +    E   + V   F   +E ++A+   SI        ++ +     V+C+   C W I ASR     L  I + +  H C        
Subjt:  SGSSDEKQRKAAQQWENAIIGVDQRFNSFSEFREALHKYSIAHGFAYRYKKNDSHRVTVKCKYQGCPWRIYASRLSTTQLICIKKMNTNHSCEGAAAKAG

Query:  YRATRGWVGNIIKEKLKVSPNYKPKDIADDIKREYGIQLNY-------SQAWRAKEIAREQLQGSYKEAYNQLPYFCEKIKETN----PGSVASFT-TKD
               +   I+  ++V P     ++    ++++G  L+             AK  A ++  G + +++  +P     +  +N         S T   +
Subjt:  YRATRGWVGNIIKEKLKVSPNYKPKDIADDIKREYGIQLNY-------SQAWRAKEIAREQLQGSYKEAYNQLPYFCEKIKETN----PGSVASFT-TKD

Query:  DSSFHRLFISFHASMTGFQQGCRPLLFLDSTPLNSKYQGVFLTATAADGEDDIFPAAFAVVDAETEENWQWFLMELKSAVKTSEQITFVA----DFQNGL
         +SF  LF +F  S+ GFQ  CRPL+ +D+  L  KY+   + A+A D  +  FP AFAV    + ++W+WFL  ++  V   + I  ++    D    +
Subjt:  DSSFHRLFISFHASMTGFQQGCRPLLFLDSTPLNSKYQGVFLTATAADGEDDIFPAAFAVVDAETEENWQWFLMELKSAVKTSEQITFVA----DFQNGL

Query:  NKSLAEIFDK-SYHSYCLRHLAEKLNNDLKGQFSHEARRFMINDFYATALAPKLEDFQRCAESIKGISPDAYNWIIQSEPEHWANAFFGGARYNHITSN-
        N+  ++  +  +YH +CL HL  KL +   G F +    F++++      + + E+F    + IK  +P+A+ W+ Q  P  WA A   G RY  +  + 
Subjt:  NKSLAEIFDK-SYHSYCLRHLAEKLNNDLKGQFSHEARRFMINDFYATALAPKLEDFQRCAESIKGISPDAYNWIIQSEPEHWANAFFGGARYNHITSN-

Query:  ----------------------FGQQFYCSIAEAHELPITQMI--DVLRGKMMETIYRRRVESDQWMTKLTPANEEKLQKEISIAQSFQVLLSHGSTFEV
                              FG Q   + AE+ +L    +   DV    +ME +     +SD W+  +TP   +  Q  +S+A   +  L   S    
Subjt:  ----------------------FGQQFYCSIAEAHELPITQMI--DVLRGKMMETIYRRRVESDQWMTKLTPANEEKLQKEISIAQSFQVLLSHGSTFEV

Query:  RGESVDIVDVDHWDCSCKGWQLTGLPCCHAIAVFECIGRSPYDYCSRYFTVENYRLTYAESIHPVPNVDRLILGESTQALV-TVTPPPTRRPPGRPKMKQ
         G    IV ++   C+C  +Q    PC HA+AV + +  +P  Y    +TVE Y  TY+    PVP +           L+  V  PP  +  G+ K K 
Subjt:  RGESVDIVDVDHWDCSCKGWQLTGLPCCHAIAVFECIGRSPYDYCSRYFTVENYRLTYAESIHPVPNVDRLILGESTQALV-TVTPPPTRRPPGRPKMKQ

Query:  NE
         E
Subjt:  NE

AT1G64255.1 MuDR family transposase5.5e-2823.7Show/hide
Query:  KCKYQGCPWRIYASRLSTTQLICIKKMNTNHSCEGAAAKAGYRATRGWVGNIIKEKLKVSPNYKPKDIADDI----KREYGIQLNYSQAWRAKEIAREQL
        +C    C W + A+R+    L+ I K    H+C     +        +      ++++ +  Y P     ++    K++ G +L  S    AKE A +++
Subjt:  KCKYQGCPWRIYASRLSTTQLICIKKMNTNHSCEGAAAKAGYRATRGWVGNIIKEKLKVSPNYKPKDIADDI----KREYGIQLNYSQAWRAKEIAREQL

Query:  QGSYKEAYNQLPYFCEKIKETN----PGSVASFTTKDDSSFHRLFISFHASMTGFQQGCRPLLFLDSTPLNSKYQGVFLTATAADGEDDIFPAAFAVVDA
         G + +++   P     +  +N          F   + +SF  +F +F  S+ GFQ  CRPL+ +D+  LN +YQ   + A+  D  +  FP AFAV   
Subjt:  QGSYKEAYNQLPYFCEKIKETN----PGSVASFTTKDDSSFHRLFISFHASMTGFQQGCRPLLFLDSTPLNSKYQGVFLTATAADGEDDIFPAAFAVVDA

Query:  ETEENWQWFLMELKSAVKTSEQITFVA----DFQNGLNKSLAEIFDK-SYHSYCLRHLAEKLNNDLKGQFSHEARRFMINDFYATALAPKLED-FQRCAE
         + + W+WFL  ++  V   + +  ++    D    +N+S ++  +  +YH + L H           QFS     F +      A +   +D F     
Subjt:  ETEENWQWFLMELKSAVKTSEQITFVA----DFQNGLNKSLAEIFDK-SYHSYCLRHLAEKLNNDLKGQFSHEARRFMINDFYATALAPKLED-FQRCAE

Query:  SIKGISPDAYNWIIQSEPEHWANAFFGGARYNHITSNFGQQF-YCSIAEAHELPIT----QMIDVLRGKMMETIYRRRVESDQWMTKLTPANEEKLQ-KE
         IK  +P+A  W+ Q     WA A   G RY  +  N    F  C+  E     +T     + D LR K  ++    R   +       P  ++  + + 
Subjt:  SIKGISPDAYNWIIQSEPEHWANAFFGGARYNHITSNFGQQF-YCSIAEAHELPIT----QMIDVLRGKMMETIYRRRVESDQWMTKLTPANEEKLQ-KE

Query:  ISIAQSFQVLLSHGSTFEV-----RGESVDIVDVDHWDCSCKGWQLTGLPCCHAIAVFECIGRSPYDYCSRYFTVENYRLTYAESIHPVPNVDRLILGES
          +  S+ V     + F+V     +GE   IV +    C+C  +Q    PC HA+AV + +  +P  Y    +T+E  + TYA     VP +        
Subjt:  ISIAQSFQVLLSHGSTFEV-----RGESVDIVDVDHWDCSCKGWQLTGLPCCHAIAVFECIGRSPYDYCSRYFTVENYRLTYAESIHPVPNVDRLILGES

Query:  TQALVTVTPPPTRRPPGRP
           L+    PP+  PP  P
Subjt:  TQALVTVTPPPTRRPPGRP

AT1G64260.1 MuDR family transposase9.4e-3623.27Show/hide
Query:  TATKKIIAICQSGGEFVKNKDGSLSYNGGEAYAIDIDQQTNLDDFKTEIAEMFNCSIDTMSLKYFLPGNKKTLISISKDKDLMRMVNF--LKDSITVDVF
        T  +K++ +C   G      DG + Y G     I I ++T L      + ++F         K F     K  + +S   DL   V+F  L DS    + 
Subjt:  TATKKIIAICQSGGEFVKNKDGSLSYNGGEAYAIDIDQQTNLDDFKTEIAEMFNCSIDTMSLKYFLPGNKKTLISISKDKDLMRMVNF--LKDSITVDVF

Query:  ILSEEAAARN-----LSSMPASRSSRTTVSEAVVPVVEPVDVAVETIITIDQIGIDISNEVPLICVPSGSSDEKQRKAAQQW--ENAIIGVDQRFNSFSE
         +     + N     L + P S +    V+    P+  P   +    IT  +I  + S    ++     S   K     + W  ++  + +   F    E
Subjt:  ILSEEAAARN-----LSSMPASRSSRTTVSEAVVPVVEPVDVAVETIITIDQIGIDISNEVPLICVPSGSSDEKQRKAAQQW--ENAIIGVDQRFNSFSE

Query:  FREALHKYSIAHGFAYRYKKNDSHRVTVKCKYQGCPWRIYASRLSTTQLICIKKMNTNHSCEGAAAKAGYRATRGWVGNIIKEKLKVSPNYKPKDIADDI
         ++A+  + I        ++ +    T +C    C W + A+R+    L+ I K    H+C   + +        +  + I+  +++ P     ++    
Subjt:  FREALHKYSIAHGFAYRYKKNDSHRVTVKCKYQGCPWRIYASRLSTTQLICIKKMNTNHSCEGAAAKAGYRATRGWVGNIIKEKLKVSPNYKPKDIADDI

Query:  KREYGIQLNYSQAWRAKEIAREQLQGSYKEAYNQLPYFCEKIKETN----PGSVASFTTKDDSSFHRLFISFHASMTGFQQGCRPLLFLDSTPLNSKYQG
        K + G +L  S+    K    +++ G   +++  +P        +N          F   D +SF  +F SF  S+ GFQ  CRPL+ +D+  LN KYQ 
Subjt:  KREYGIQLNYSQAWRAKEIAREQLQGSYKEAYNQLPYFCEKIKETN----PGSVASFTTKDDSSFHRLFISFHASMTGFQQGCRPLLFLDSTPLNSKYQG

Query:  VFLTATAADGEDDIFPAAFAVVDAETEENWQWFLMELKSAVKTSEQITFVADFQNGLNKSLAEIFDK--------SYHSYCLRHLAEKLNNDLKGQFSHE
          + A+  D  +  FP AFAV    + ++W+WF  +++  V   + +  ++   + L   +A + +         ++H +CL H        L+ QF   
Subjt:  VFLTATAADGEDDIFPAAFAVVDAETEENWQWFLMELKSAVKTSEQITFVADFQNGLNKSLAEIFDK--------SYHSYCLRHLAEKLNNDLKGQFSHE

Query:  ARRFMINDFYATA-LAPKLEDFQRCAESIKGISPDAYNWIIQSEPEHWANAFFGGARYNHITSNFGQQF-------YCSIAEAHELPITQMIDVLRG---
         R + +      A    + E+F      IK  +P+A+ W+ Q     WA A   G RY  I  +    F       YC++A    + +  M D LR    
Subjt:  ARRFMINDFYATA-LAPKLEDFQRCAESIKGISPDAYNWIIQSEPEHWANAFFGGARYNHITSNFGQQF-------YCSIAEAHELPITQMIDVLRG---

Query:  KMMETIY----RRRVESDQWMTKLTPANEEKLQKEISIAQSFQVLLSHGSTFEVRGESVDIVDVDHWDCSCKGWQLTGLPCCHAIAVFECIGRSPYDYCS
        K + +IY    R  V ++ +M KL     + +   I+  +     +S  S  E       IV ++   C+C+ +Q    PC HA+AVFE +  +P  Y  
Subjt:  KMMETIY----RRRVESDQWMTKLTPANEEKLQKEISIAQSFQVLLSHGSTFEVRGESVDIVDVDHWDCSCKGWQLTGLPCCHAIAVFECIGRSPYDYCS

Query:  RYFTVENYRLTYAESIHPVPNV
          +TVE Y  TYA +  PVP+V
Subjt:  RYFTVENYRLTYAESIHPVPNV

AT3G18230.1 Octicosapeptide/Phox/Bem1p family protein7.3e-0429.76Show/hide
Query:  KIIAICQSGGEFV-KNKDGSLSYNGGEAYAIDIDQQTNLDDFKTEIAEMFNCSIDTMSLKYFLPG-NKKTLISISKDKDLMRMV
        K+  +C  GG  + +  D SL+Y+GGE   + +D++ +L   ++ ++ M   +  + +LKY LP  +  +L++I+ D+DL  M+
Subjt:  KIIAICQSGGEFV-KNKDGSLSYNGGEAYAIDIDQQTNLDDFKTEIAEMFNCSIDTMSLKYFLPG-NKKTLISISKDKDLMRMV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTAGCAGTGTGCAGCGGAGCCCTCGGTTAGATCAGATCCATGGTGAAATTCGTGACAGTTTTCGTGTCCTTTCGTATACTAATACCAATACTCCCGCTAATAAGAG
GGTAGAGCTGTTTGATGAGAGAGGCATCAAGGAACCTACAACTGATGATAATGTTCAGATGGTAGTCGAACAAACCGTTGAAGTGGGAAGGCAATCTGCTTTTACATTGG
AAGGCCAAGCCACAGCTACGAAGAAAATAATTGCTATATGCCAATCAGGTGGTGAGTTTGTGAAAAATAAAGATGGATCCCTGTCTTACAATGGGGGTGAGGCCTATGCT
ATAGATATTGATCAACAAACAAATTTGGATGATTTCAAGACGGAAATAGCAGAAATGTTTAATTGTAGTATTGATACAATGTCCCTCAAGTATTTTCTTCCTGGTAATAA
GAAAACTCTCATTTCGATATCCAAGGACAAGGATCTAATGCGGATGGTGAACTTTTTAAAGGATTCTATCACAGTTGATGTGTTTATCTTGTCAGAGGAAGCTGCTGCTC
GAAATTTATCTAGTATGCCTGCTAGTAGGTCAAGTAGAACAACAGTTTCTGAGGCAGTAGTTCCTGTAGTTGAGCCTGTAGATGTTGCAGTTGAGACTATCATCACCATT
GATCAGATTGGGATTGACATATCCAATGAAGTCCCTTTAATTTGTGTTCCTTCTGGGTCTAGTGATGAGAAACAACGAAAAGCTGCACAACAATGGGAAAATGCCATTAT
TGGTGTAGACCAAAGGTTTAACAGCTTTAGTGAATTCCGAGAAGCCTTGCATAAGTATTCAATTGCACATGGTTTTGCTTACCGGTATAAGAAAAATGACAGCCATCGTG
TCACTGTTAAATGCAAATACCAAGGTTGCCCCTGGCGAATATATGCTTCAAGGCTGTCTACTACCCAACTGATTTGTATTAAGAAAATGAACACTAATCATTCATGTGAA
GGGGCTGCTGCTAAAGCAGGGTATCGGGCTACTAGGGGATGGGTGGGAAATATCATAAAGGAAAAGTTAAAGGTCTCCCCAAACTACAAGCCAAAAGATATTGCTGATGA
CATAAAACGAGAATATGGAATTCAATTGAACTATTCACAGGCATGGCGTGCAAAAGAGATTGCAAGAGAGCAGCTTCAAGGTTCCTACAAGGAAGCATATAATCAGTTGC
CATACTTTTGTGAGAAGATCAAGGAAACTAACCCTGGAAGTGTTGCTTCATTCACCACCAAAGATGACTCAAGCTTTCACCGCCTATTTATATCATTTCATGCATCAATG
ACTGGTTTCCAGCAAGGTTGTCGACCTCTCCTTTTCCTCGATAGTACTCCATTGAACTCAAAGTATCAGGGGGTGTTTTTGACTGCCACAGCTGCAGATGGGGAAGATGA
CATATTTCCAGCAGCTTTTGCAGTTGTAGATGCTGAGACTGAGGAAAATTGGCAATGGTTTCTAATGGAATTGAAATCTGCAGTGAAAACATCAGAGCAGATTACATTTG
TTGCAGATTTTCAGAATGGACTGAACAAGTCTTTGGCTGAAATATTTGACAAGTCTTACCATAGCTACTGTTTGCGCCATCTAGCTGAAAAGCTTAACAACGATTTAAAG
GGCCAATTTTCTCATGAGGCCAGGCGATTTATGATTAATGATTTCTATGCTACTGCTCTTGCACCTAAACTTGAAGATTTCCAGCGTTGTGCTGAAAGTATAAAAGGAAT
TTCACCTGATGCTTACAATTGGATCATACAAAGTGAACCAGAACACTGGGCAAATGCATTCTTTGGAGGAGCAAGGTACAACCATATCACATCAAATTTTGGACAGCAGT
TCTATTGTTCCATAGCTGAAGCACATGAATTGCCGATAACACAGATGATTGATGTTTTACGGGGAAAAATGATGGAAACAATATATAGACGCAGGGTTGAATCAGATCAA
TGGATGACAAAATTAACACCAGCCAATGAGGAAAAGTTACAAAAAGAAATATCAATTGCACAGTCGTTTCAGGTTTTACTATCCCATGGAAGCACATTTGAGGTTCGGGG
GGAATCTGTTGACATTGTTGATGTTGATCATTGGGATTGTAGCTGTAAGGGATGGCAGCTTACTGGTTTGCCTTGTTGCCATGCTATTGCTGTCTTTGAATGCATTGGTA
GGAGCCCATATGATTATTGCTCCAGATATTTCACAGTTGAGAATTACCGCCTGACATATGCTGAGTCAATCCATCCTGTCCCAAATGTGGACAGACTAATCCTGGGTGAA
TCCACTCAAGCACTAGTCACAGTAACCCCTCCACCTACTAGAAGACCACCTGGTCGTCCAAAAATGAAGCAGAATGAATCATTAGAAGTAGTTAAGCGCCAGCTCCAATG
TAGCAAGTGTAAAGGCCTTGGCCACAATAAGAAGACATGCAAAGATTCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCTAGCAGTGTGCAGCGGAGCCCTCGGTTAGATCAGATCCATGGTGAAATTCGTGACAGTTTTCGTGTCCTTTCGTATACTAATACCAATACTCCCGCTAATAAGAG
GGTAGAGCTGTTTGATGAGAGAGGCATCAAGGAACCTACAACTGATGATAATGTTCAGATGGTAGTCGAACAAACCGTTGAAGTGGGAAGGCAATCTGCTTTTACATTGG
AAGGCCAAGCCACAGCTACGAAGAAAATAATTGCTATATGCCAATCAGGTGGTGAGTTTGTGAAAAATAAAGATGGATCCCTGTCTTACAATGGGGGTGAGGCCTATGCT
ATAGATATTGATCAACAAACAAATTTGGATGATTTCAAGACGGAAATAGCAGAAATGTTTAATTGTAGTATTGATACAATGTCCCTCAAGTATTTTCTTCCTGGTAATAA
GAAAACTCTCATTTCGATATCCAAGGACAAGGATCTAATGCGGATGGTGAACTTTTTAAAGGATTCTATCACAGTTGATGTGTTTATCTTGTCAGAGGAAGCTGCTGCTC
GAAATTTATCTAGTATGCCTGCTAGTAGGTCAAGTAGAACAACAGTTTCTGAGGCAGTAGTTCCTGTAGTTGAGCCTGTAGATGTTGCAGTTGAGACTATCATCACCATT
GATCAGATTGGGATTGACATATCCAATGAAGTCCCTTTAATTTGTGTTCCTTCTGGGTCTAGTGATGAGAAACAACGAAAAGCTGCACAACAATGGGAAAATGCCATTAT
TGGTGTAGACCAAAGGTTTAACAGCTTTAGTGAATTCCGAGAAGCCTTGCATAAGTATTCAATTGCACATGGTTTTGCTTACCGGTATAAGAAAAATGACAGCCATCGTG
TCACTGTTAAATGCAAATACCAAGGTTGCCCCTGGCGAATATATGCTTCAAGGCTGTCTACTACCCAACTGATTTGTATTAAGAAAATGAACACTAATCATTCATGTGAA
GGGGCTGCTGCTAAAGCAGGGTATCGGGCTACTAGGGGATGGGTGGGAAATATCATAAAGGAAAAGTTAAAGGTCTCCCCAAACTACAAGCCAAAAGATATTGCTGATGA
CATAAAACGAGAATATGGAATTCAATTGAACTATTCACAGGCATGGCGTGCAAAAGAGATTGCAAGAGAGCAGCTTCAAGGTTCCTACAAGGAAGCATATAATCAGTTGC
CATACTTTTGTGAGAAGATCAAGGAAACTAACCCTGGAAGTGTTGCTTCATTCACCACCAAAGATGACTCAAGCTTTCACCGCCTATTTATATCATTTCATGCATCAATG
ACTGGTTTCCAGCAAGGTTGTCGACCTCTCCTTTTCCTCGATAGTACTCCATTGAACTCAAAGTATCAGGGGGTGTTTTTGACTGCCACAGCTGCAGATGGGGAAGATGA
CATATTTCCAGCAGCTTTTGCAGTTGTAGATGCTGAGACTGAGGAAAATTGGCAATGGTTTCTAATGGAATTGAAATCTGCAGTGAAAACATCAGAGCAGATTACATTTG
TTGCAGATTTTCAGAATGGACTGAACAAGTCTTTGGCTGAAATATTTGACAAGTCTTACCATAGCTACTGTTTGCGCCATCTAGCTGAAAAGCTTAACAACGATTTAAAG
GGCCAATTTTCTCATGAGGCCAGGCGATTTATGATTAATGATTTCTATGCTACTGCTCTTGCACCTAAACTTGAAGATTTCCAGCGTTGTGCTGAAAGTATAAAAGGAAT
TTCACCTGATGCTTACAATTGGATCATACAAAGTGAACCAGAACACTGGGCAAATGCATTCTTTGGAGGAGCAAGGTACAACCATATCACATCAAATTTTGGACAGCAGT
TCTATTGTTCCATAGCTGAAGCACATGAATTGCCGATAACACAGATGATTGATGTTTTACGGGGAAAAATGATGGAAACAATATATAGACGCAGGGTTGAATCAGATCAA
TGGATGACAAAATTAACACCAGCCAATGAGGAAAAGTTACAAAAAGAAATATCAATTGCACAGTCGTTTCAGGTTTTACTATCCCATGGAAGCACATTTGAGGTTCGGGG
GGAATCTGTTGACATTGTTGATGTTGATCATTGGGATTGTAGCTGTAAGGGATGGCAGCTTACTGGTTTGCCTTGTTGCCATGCTATTGCTGTCTTTGAATGCATTGGTA
GGAGCCCATATGATTATTGCTCCAGATATTTCACAGTTGAGAATTACCGCCTGACATATGCTGAGTCAATCCATCCTGTCCCAAATGTGGACAGACTAATCCTGGGTGAA
TCCACTCAAGCACTAGTCACAGTAACCCCTCCACCTACTAGAAGACCACCTGGTCGTCCAAAAATGAAGCAGAATGAATCATTAGAAGTAGTTAAGCGCCAGCTCCAATG
TAGCAAGTGTAAAGGCCTTGGCCACAATAAGAAGACATGCAAAGATTCTTAG
Protein sequenceShow/hide protein sequence
MASSVQRSPRLDQIHGEIRDSFRVLSYTNTNTPANKRVELFDERGIKEPTTDDNVQMVVEQTVEVGRQSAFTLEGQATATKKIIAICQSGGEFVKNKDGSLSYNGGEAYA
IDIDQQTNLDDFKTEIAEMFNCSIDTMSLKYFLPGNKKTLISISKDKDLMRMVNFLKDSITVDVFILSEEAAARNLSSMPASRSSRTTVSEAVVPVVEPVDVAVETIITI
DQIGIDISNEVPLICVPSGSSDEKQRKAAQQWENAIIGVDQRFNSFSEFREALHKYSIAHGFAYRYKKNDSHRVTVKCKYQGCPWRIYASRLSTTQLICIKKMNTNHSCE
GAAAKAGYRATRGWVGNIIKEKLKVSPNYKPKDIADDIKREYGIQLNYSQAWRAKEIAREQLQGSYKEAYNQLPYFCEKIKETNPGSVASFTTKDDSSFHRLFISFHASM
TGFQQGCRPLLFLDSTPLNSKYQGVFLTATAADGEDDIFPAAFAVVDAETEENWQWFLMELKSAVKTSEQITFVADFQNGLNKSLAEIFDKSYHSYCLRHLAEKLNNDLK
GQFSHEARRFMINDFYATALAPKLEDFQRCAESIKGISPDAYNWIIQSEPEHWANAFFGGARYNHITSNFGQQFYCSIAEAHELPITQMIDVLRGKMMETIYRRRVESDQ
WMTKLTPANEEKLQKEISIAQSFQVLLSHGSTFEVRGESVDIVDVDHWDCSCKGWQLTGLPCCHAIAVFECIGRSPYDYCSRYFTVENYRLTYAESIHPVPNVDRLILGE
STQALVTVTPPPTRRPPGRPKMKQNESLEVVKRQLQCSKCKGLGHNKKTCKDS