; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0029411 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0029411
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionTranscription factor TFIIIB component B''-like isoform X3
Genome locationchr8:38709968..38722974
RNA-Seq ExpressionLag0029411
SyntenyLag0029411
Gene Ontology termsGO:0070898 - RNA polymerase III preinitiation complex assembly (biological process)
GO:0000126 - transcription factor TFIIIB complex (cellular component)
GO:0001156 - TFIIIC-class transcription factor complex binding (molecular function)
InterPro domainsIPR001005 - SANT/Myb domain
IPR009057 - Homeobox-like domain superfamily
IPR017884 - SANT domain
IPR039467 - Transcription factor TFIIIB component B'', Myb domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6579077.1 hypothetical protein SDJN03_23525, partial [Cucurbita argyrosperma subsp. sororia]1.9e-30866.67Show/hide
Query:  TPRLFKRKSLKAVSLQRILVVVGQASLMDPLDEILSDSGATSRAGNRFQPKIKPRPKKQTSAPPSATPSQDKKETILDAKSCHDGRGNTKLIQSSSQLPV
        T  L +  S K  +    LV + +AS MDP DEILSD G TSR G RFQPKIKPRPKKQT AP  +T SQDKK T  DAKSCHDGRGNTK I+SSSQLPV
Subjt:  TPRLFKRKSLKAVSLQRILVVVGQASLMDPLDEILSDSGATSRAGNRFQPKIKPRPKKQTSAPPSATPSQDKKETILDAKSCHDGRGNTKLIQSSSQLPV

Query:  IKENKESEDDLLLATARSDFIGCLHPTSVESAKMVDSMQFDLDSCRSILPSGSTTDVQQLQGWITRPSTFEMVICALSIELCSGWQMKNICLVETCIYNA
        I+ENKESEDDLLLAT RSDFIGC H TSVESA MVDS Q DLDSC  ILPSGST D                                            
Subjt:  IKENKESEDDLLLATARSDFIGCLHPTSVESAKMVDSMQFDLDSCRSILPSGSTTDVQQLQGWITRPSTFEMVICALSIELCSGWQMKNICLVETCIYNA

Query:  TFKKFLEHPHDILLKDAPDFFKEFEGLPLGHNPFLDPCGGEVYFFSLFWALCGVCRVLERLMRNFLWEGFEEGRGAHLVSWEVVEKPVGLGGLELGLVGV
                                                                                                       G VGV
Subjt:  TFKKFLEHPHDILLKDAPDFFKEFEGLPLGHNPFLDPCGGEVYFFSLFWALCGVCRVLERLMRNFLWEGFEEGRGAHLVSWEVVEKPVGLGGLELGLVGV

Query:  KNLTDDAKNLGILNYSHPSASTV-PAPVLDQSGLGSIQTEDRHFGDGNIAGENIDIFYELECLDDFHNQPKNEADHATLKQATISNEDGDLDKQNLEIEE
        +N TDD+KN GILNYSH SAS    A VL QSGLGSIQ ED H  DG IAG+N D+F ELE LDDFHNQPKNEAD ++LKQATISNEDGDLD Q LE+EE
Subjt:  KNLTDDAKNLGILNYSHPSASTV-PAPVLDQSGLGSIQTEDRHFGDGNIAGENIDIFYELECLDDFHNQPKNEADHATLKQATISNEDGDLDKQNLEIEE

Query:  CESGADITMDTISSRATNPPEQPACKYTPKPKTRSAKEDTCTQISQPEISNMLPPSPQVTSCDTRYMHEASIGTHSDGILNDSSINFDGYTSVNQHTEAP
        CE+GA+IT D ISS  T P EQPACKY PKPK R A  D+CTQISQPEISN LPPSPQ  SCDTR MHEASIGTHSDG+LNDSSINFD Y+ VNQH EAP
Subjt:  CESGADITMDTISSRATNPPEQPACKYTPKPKTRSAKEDTCTQISQPEISNMLPPSPQVTSCDTRYMHEASIGTHSDGILNDSSINFDGYTSVNQHTEAP

Query:  VNVESLEFDSYGDILVDDFNSDDQDEMSREVSGKSSEEKPSMESNISQQQEMFSQVGEEIEHRKTPRKMRKQVSHQLDEPEDGVDENRNFPNEPSSNCDI
        VNVESL +DSYGDILVDDFNSDDQDEM RE  GK+ EE P  +SN+SQQQE+F  VGEEI+H KT RK+R+QVSHQ D+PEDGVD+   FP+E  SNCDI
Subjt:  VNVESLEFDSYGDILVDDFNSDDQDEMSREVSGKSSEEKPSMESNISQQQEMFSQVGEEIEHRKTPRKMRKQVSHQLDEPEDGVDENRNFPNEPSSNCDI

Query:  DGDGYNKNETSKGGTSRGKKATKKSSKPSSDNEKPTRKRKEANKAVPDSQAEKRPKKFSHSTRRNKRQVNKVLLETPEDEIDFQKISFKDLIIYHEYKEK
         GDG+ KN TSKG   RG K   KS KPSSDNEKP RKRKEANKAVPD QAEKRPKKFSHSTRRN+RQVNKVLLETPEDEIDFQKISF+DLIIYHE+KEK
Subjt:  DGDGYNKNETSKGGTSRGKKATKKSSKPSSDNEKPTRKRKEANKAVPDSQAEKRPKKFSHSTRRNKRQVNKVLLETPEDEIDFQKISFKDLIIYHEYKEK

Query:  LEKKEASTRKSATNQRTDTVGEEIFNDGEESLASEQGRGTDDDETPDVVDMTSAYFNYQSFMDKARRTKWSKKDTELFYQAVRQFGTDFCMIQQLFPGRT
        LEKKEA+TR+SATNQRTDTVGEEI+NDGEESLA EQGRGTDDDETPDVVDMTSAYFNY SFMDK  RTKWSK DTE FY+AVRQFGTDFCMIQQLFPGRT
Subjt:  LEKKEASTRKSATNQRTDTVGEEIFNDGEESLASEQGRGTDDDETPDVVDMTSAYFNYQSFMDKARRTKWSKKDTELFYQAVRQFGTDFCMIQQLFPGRT

Query:  RRQIKLKFKSEERHHPFRLSDAITNRAKDHSQFLSLIGLLKEAANKAKQESNQDELTENTGDEELQELSPETNEEEVAKPDDGGVQETEKEECIGEIHSP
        R QIKLKFK+EERHHPFRLSDAITNRAKDHSQFLSLIG L+EAANKAK ESN+DELTENTGDEEL ELSPE NEEEVAKP  G V++T+ EE +GEIHSP
Subjt:  RRQIKLKFKSEERHHPFRLSDAITNRAKDHSQFLSLIGLLKEAANKAKQESNQDELTENTGDEELQELSPETNEEEVAKPDDGGVQETEKEECIGEIHSP

Query:  LKADESDDDDPHRWDEYKFDY
        LKADESDDDDPHRWDEYKFDY
Subjt:  LKADESDDDDPHRWDEYKFDY

KAG6601983.1 hypothetical protein SDJN03_07216, partial [Cucurbita argyrosperma subsp. sororia]1.1e-30869.08Show/hide
Query:  MDPLDEILSDSGATSRAGNRFQPKIKPRPKKQTSAPPSATPSQDKKETILDAKSCHDGRGNTKLIQSSSQLPVIKENKESEDDLLLATARSDFIGCLHPT
        MDP D+ILSD   +SR G+RFQPKIKPRPKK T AP S T SQDKKET+ DAKSCHDGRGNTK IQSSSQLPVI+ENKESEDDLLLATARSDFI   HP 
Subjt:  MDPLDEILSDSGATSRAGNRFQPKIKPRPKKQTSAPPSATPSQDKKETILDAKSCHDGRGNTKLIQSSSQLPVIKENKESEDDLLLATARSDFIGCLHPT

Query:  SVESAKMVDSMQFDLDSCRSILPSGSTTDVQQLQGWITRPSTFEMVICALSIELCSGWQMKNICLVETCIYNATFKKFLEHPHDILLKDAPDFFKEFEGL
        SVESAK VDSMQ D DSC SILPSGSTTD                                                                       
Subjt:  SVESAKMVDSMQFDLDSCRSILPSGSTTDVQQLQGWITRPSTFEMVICALSIELCSGWQMKNICLVETCIYNATFKKFLEHPHDILLKDAPDFFKEFEGL

Query:  PLGHNPFLDPCGGEVYFFSLFWALCGVCRVLERLMRNFLWEGFEEGRGAHLVSWEVVEKPVGLGGLELGLVGVKNLTDDAKNLGILNYSHPSASTV-PAP
                DP                                                            VGVK LT D KN GILNYS PS+ST   A 
Subjt:  PLGHNPFLDPCGGEVYFFSLFWALCGVCRVLERLMRNFLWEGFEEGRGAHLVSWEVVEKPVGLGGLELGLVGVKNLTDDAKNLGILNYSHPSASTV-PAP

Query:  VLDQSGLGSIQTEDRHFGDGNIAGENIDIFYELECLDDFHNQPKNEADHATLKQATISNEDGDLDKQNLEIEECESGADITMDTISSRATNPPEQPACKY
        VLDQSGLGSIQTEDR+F DG I  ENIDIFYELE LDDFHNQPKNEAD A+L QATIS+EDGDLDKQ LEIEE  +GADITMDTISS  T+P EQPACKY
Subjt:  VLDQSGLGSIQTEDRHFGDGNIAGENIDIFYELECLDDFHNQPKNEADHATLKQATISNEDGDLDKQNLEIEECESGADITMDTISSRATNPPEQPACKY

Query:  TPKPKTRSAKEDTCTQISQPEISNMLPPSPQVTSCDTRYMHEASIGTHSDGILNDSSINFDGYTSVNQHTEAPVNVESLEFDSYGDILVDDFNSDDQDEM
        T KPK R+A+ED C +ISQPEISNMLPPSPQV S DTR MHE SIG+  DGILNDSSINFDGYT VNQHTE PVNVE L FDSY DILVDDFNSDDQD  
Subjt:  TPKPKTRSAKEDTCTQISQPEISNMLPPSPQVTSCDTRYMHEASIGTHSDGILNDSSINFDGYTSVNQHTEAPVNVESLEFDSYGDILVDDFNSDDQDEM

Query:  SREVSGKSSEEKPSMESNISQQQEMFSQVGEEIEHRKTPRKMRKQVSHQLDEPEDGVDENRNFPNEPSSNCDIDGDGYNKNETSKGGTSRGKKATKKSSK
        SREVSGK+ E KPSMESNISQ QEMFSQVGEEIE  KT RK RK+VSHQ DEPEDGVDENRNFP                NETSK GTSRGKKA KKSSK
Subjt:  SREVSGKSSEEKPSMESNISQQQEMFSQVGEEIEHRKTPRKMRKQVSHQLDEPEDGVDENRNFPNEPSSNCDIDGDGYNKNETSKGGTSRGKKATKKSSK

Query:  PSSDNEKPTRKRKEANKAVPDSQAEKRPKKFSHSTRRNKRQVNKVLLETPEDEIDFQKISFKDLIIYHEYKEKLEKKEASTRKSATNQRTDTVGEEIFND
        PSSD+EKP RKRK+ANK+VPD QAEKRPKKFSHSTRRN+RQVNKVLLETPEDEIDFQKISF+DLIIYHEYKEKLEKKEAS RKSATNQRTDT GEEIFND
Subjt:  PSSDNEKPTRKRKEANKAVPDSQAEKRPKKFSHSTRRNKRQVNKVLLETPEDEIDFQKISFKDLIIYHEYKEKLEKKEASTRKSATNQRTDTVGEEIFND

Query:  GEESLASEQGRGTDDDETPDVVDMTSAYFNYQSFMDKARRTKWSKKDTELFYQAVRQFGTDFCMIQQLFPGRTRRQIKLKFKSEERHHPFRLSDAITNRA
        GEESLASEQGRGTDD+ETPDVVDMTSAYFNY SFMDK  R KWSK DTE FY+AVRQFGTDFCMIQQLFPG+TR QIKLKFK+EERHHPFRLSDA+TNRA
Subjt:  GEESLASEQGRGTDDDETPDVVDMTSAYFNYQSFMDKARRTKWSKKDTELFYQAVRQFGTDFCMIQQLFPGRTRRQIKLKFKSEERHHPFRLSDAITNRA

Query:  KDHSQFLSLIGLLKEAANKAKQESNQDELTENTGDEELQELSPETNEEEVAKPDDGGVQETEKEECIGEIHSPLKAD--ESDDDDPHRWDEYKFDY
         DH+QFLSLIG LKEAANKAKQESNQDELTENTGDEEL ELSPETN EEVAKP+   VQET KEEC+GEIHSPLKAD  ESD+DDPHRWDEYKFD+
Subjt:  KDHSQFLSLIGLLKEAANKAKQESNQDELTENTGDEELQELSPETNEEEVAKPDDGGVQETEKEECIGEIHSPLKAD--ESDDDDPHRWDEYKFDY

XP_038875902.1 uncharacterized protein LOC120068262 isoform X1 [Benincasa hispida]0.0e+0069Show/hide
Query:  MDPLDEILSDSGATSRAGNRFQPKIKPRPKKQTSAPPSATPSQDKKETILDAKSCHDGRGNTKLIQSSSQLPVIKENKESEDDLLLATARSDFIGCLHPT
        MDP DEI SD G TSRAG RFQPKIKPRPKKQT  PP +T S DKK TI D KSC DG GN+K I SSSQLPV +E +ESEDDLLLATARSDFIG  HPT
Subjt:  MDPLDEILSDSGATSRAGNRFQPKIKPRPKKQTSAPPSATPSQDKKETILDAKSCHDGRGNTKLIQSSSQLPVIKENKESEDDLLLATARSDFIGCLHPT

Query:  SVESAKMVDSMQFDLDSCRSILPSGSTTDVQQLQGWITRPSTFEMVICALSIELCSGWQMKNICLVETCIYNATFKKFLEHPHDILLKDAPDFFKEFEGL
        SVESAKMVDS QFDLDS   ILPSGST                                                                         
Subjt:  SVESAKMVDSMQFDLDSCRSILPSGSTTDVQQLQGWITRPSTFEMVICALSIELCSGWQMKNICLVETCIYNATFKKFLEHPHDILLKDAPDFFKEFEGL

Query:  PLGHNPFLDPCGGEVYFFSLFWALCGVCRVLERLMRNFLWEGFEEGRGAHLVSWEVVEKPVGLGGLELGLVGVKNLTDDAKNLGILNYSHPSASTV-PAP
                                                   E+G          V   + L    LG VGVKNL DD KNL +LNYSHPSAS+   A 
Subjt:  PLGHNPFLDPCGGEVYFFSLFWALCGVCRVLERLMRNFLWEGFEEGRGAHLVSWEVVEKPVGLGGLELGLVGVKNLTDDAKNLGILNYSHPSASTV-PAP

Query:  VLDQSGLGSIQTEDRHFGDGNIAGENIDIFYELECLDDFHNQPKNEADHATLKQATISNEDGDLDKQNLEIE----ECESGADITMDTISSRATNPPEQP
        VLDQ GLGSIQ+E  HF DG IAG+NID+FYELECLDDFHNQPKNEAD ++LK ATISNEDGDLDKQ LEIE    EC +GA+ITMDTISS  T P EQP
Subjt:  VLDQSGLGSIQTEDRHFGDGNIAGENIDIFYELECLDDFHNQPKNEADHATLKQATISNEDGDLDKQNLEIE----ECESGADITMDTISSRATNPPEQP

Query:  ACKYTPKPKTRSAKEDTCTQISQPEISNMLPPSPQVTSCDTRYMHEASIGTHSDGILNDSSINFDGYTSVNQHTEAPVNVESLEFDSYGDILVDDFNSDD
        ACKY PKPK R+A  D CTQISQPEISNMLP SPQV SCDT  MHEASIGTH DG LNDSSI+FDGY  VNQHTE PVNVESL +DSYGDIL+DDFNSDD
Subjt:  ACKYTPKPKTRSAKEDTCTQISQPEISNMLPPSPQVTSCDTRYMHEASIGTHSDGILNDSSINFDGYTSVNQHTEAPVNVESLEFDSYGDILVDDFNSDD

Query:  QDEMSREVSGKSSEEKPSMESNISQQQEMFSQVGEEIEHRKTPRKMRKQVSHQLDEPEDGVDENRNFPNEPSSNCDIDGDGYNKNETSKGGTSRGKKATK
        +DEM RE  GK+ EE+PS ESNISQQQ+MF  VGEEIEH KT RK+RK+VSHQLDEPEDGVDENRNFPNEPSSN D+ GDGYNKNE  KGG  RGKK + 
Subjt:  QDEMSREVSGKSSEEKPSMESNISQQQEMFSQVGEEIEHRKTPRKMRKQVSHQLDEPEDGVDENRNFPNEPSSNCDIDGDGYNKNETSKGGTSRGKKATK

Query:  KSSKPSSDNEKPTRKRKEANKAVPDSQAEKRPKKFSHSTRRNKRQVNKVLLETPEDEIDFQKISFKDLIIYHEYKEKLEKKEASTRKSATNQRTDTVGEE
        KSSKPS+DNEKPTRKRKEANKAVPD QAEKRPKKFSHSTRRN+RQVNKVLLETPEDEIDFQKISF+DLIIYHE+KEKLEKK ASTRKSATNQRTDT GEE
Subjt:  KSSKPSSDNEKPTRKRKEANKAVPDSQAEKRPKKFSHSTRRNKRQVNKVLLETPEDEIDFQKISFKDLIIYHEYKEKLEKKEASTRKSATNQRTDTVGEE

Query:  IFNDGEESLASEQGRGTDDDETPDVVDMTSAYFNYQSFMDKARRTKWSKKDTELFYQAVRQFGTDFCMIQQLFPGRTRRQIKLKFKSEERHHPFRLSDAI
        ++NDGEE+LASEQGRGTDDDETPDVVDMTSAYFNYQSFMDK  RTKWSK+DTE FY+AVRQFGTDFCMIQQLFPGRTR QIKLKFKSEERHHPFRLSDAI
Subjt:  IFNDGEESLASEQGRGTDDDETPDVVDMTSAYFNYQSFMDKARRTKWSKKDTELFYQAVRQFGTDFCMIQQLFPGRTRRQIKLKFKSEERHHPFRLSDAI

Query:  TNRAKDHSQFLSLIGLLKEAANKAKQESNQDELTENTGDEELQELSPETNEEEVAKPDDGGVQETEKEECI-GEIHSPLKADES-DDDDPHRWDEYKFDY
         NRAKDHSQFL LI  LKEAANKAK ESNQDELTEN+GDEE +ELSPETNEEEVAKP+  G++ET KEE + GE+HSPLKADES DDDDP+RWDEYKFDY
Subjt:  TNRAKDHSQFLSLIGLLKEAANKAKQESNQDELTENTGDEELQELSPETNEEEVAKPDDGGVQETEKEECI-GEIHSPLKADES-DDDDPHRWDEYKFDY

XP_038875904.1 uncharacterized protein LOC120068262 isoform X2 [Benincasa hispida]2.1e-30968.89Show/hide
Query:  MDPLDEILSDSGATSRAGNRFQPKIKPRPKKQTSAPPSATPSQDKKETILDAKSCHDGRGNTKLIQSSSQLPVIKENKESEDDLLLATARSDFIGCLHPT
        MDP DEI SD G TSRAG RFQPKIKPRPKKQT  PP +T S DKK TI D KSC DG GN+K I SSSQLPV +E +ESEDDLLLATARSDFIG  HPT
Subjt:  MDPLDEILSDSGATSRAGNRFQPKIKPRPKKQTSAPPSATPSQDKKETILDAKSCHDGRGNTKLIQSSSQLPVIKENKESEDDLLLATARSDFIGCLHPT

Query:  SVESAKMVDSMQFDLDSCRSILPSGSTTDVQQLQGWITRPSTFEMVICALSIELCSGWQMKNICLVETCIYNATFKKFLEHPHDILLKDAPDFFKEFEGL
        SVESAKMVDS QFDLDS   ILPSGST                                                                         
Subjt:  SVESAKMVDSMQFDLDSCRSILPSGSTTDVQQLQGWITRPSTFEMVICALSIELCSGWQMKNICLVETCIYNATFKKFLEHPHDILLKDAPDFFKEFEGL

Query:  PLGHNPFLDPCGGEVYFFSLFWALCGVCRVLERLMRNFLWEGFEEGRGAHLVSWEVVEKPVGLGGLELGLVGVKNLTDDAKNLGILNYSHPSASTV-PAP
                                                   E+G          V   + L    LG VGVKNL DD KNL +LNYSHPSAS+   A 
Subjt:  PLGHNPFLDPCGGEVYFFSLFWALCGVCRVLERLMRNFLWEGFEEGRGAHLVSWEVVEKPVGLGGLELGLVGVKNLTDDAKNLGILNYSHPSASTV-PAP

Query:  VLDQSGLGSIQTEDRHFGDGNIAGENIDIFYELECLDDFHNQPKNEADHATLKQATISNEDGDLDKQNLEIE----ECESGADITMDTISSRATNPPEQP
        VLDQ GLGSIQ+E  HF DG IAG+NID+FYELECLDDFHNQPKNEAD ++LK ATISNEDGDLDKQ LEIE    EC +GA+ITMDTISS  T P EQP
Subjt:  VLDQSGLGSIQTEDRHFGDGNIAGENIDIFYELECLDDFHNQPKNEADHATLKQATISNEDGDLDKQNLEIE----ECESGADITMDTISSRATNPPEQP

Query:  ACKYTPKPKTRSAKEDTCTQISQPEISNMLPPSPQVTSCDTRYMHEASIGTHSDGILNDSSINFDGYTSVNQHTEAPVNVESLEFDSYGDILVDDFNSDD
        ACKY PKPK R+A  D CTQISQPEISNMLP SPQV SCDT  MHEASIGTH DG LNDSSI+FDGY  VNQHTE PVNVESL +DSYGDIL+DDFNSDD
Subjt:  ACKYTPKPKTRSAKEDTCTQISQPEISNMLPPSPQVTSCDTRYMHEASIGTHSDGILNDSSINFDGYTSVNQHTEAPVNVESLEFDSYGDILVDDFNSDD

Query:  QDEMSREVSGKSSEEKPSMESNISQQQEMFSQVGEEIEHRKTPRKMRKQVSHQLDEPEDGVDENRNFPNEPSSNCDIDGDGYNKNETSKGGTSRGKKATK
        +DEM RE  GK+ EE+PS ESNISQQQ+MF  VGEEIEH KT RK+RK+VSHQLDEPEDGVDENRNFPNEPSSN D+ GDGYNKNE  KGG  RGKK + 
Subjt:  QDEMSREVSGKSSEEKPSMESNISQQQEMFSQVGEEIEHRKTPRKMRKQVSHQLDEPEDGVDENRNFPNEPSSNCDIDGDGYNKNETSKGGTSRGKKATK

Query:  KSSKPSSDNEKPTRKRKEANKAVPDSQAEKRPKKFSHSTRRNKRQVNKVLLETPEDEIDFQKISFKDLIIYHEYKEKLEKKEASTRKSATNQRTDTVGEE
        KSSKPS+DNEKPTRKRKEANKAVPD QAEKRPKKFSHSTRRN+RQVNKVLLETPEDEIDFQKISF+DLIIYHE+KEKLE K ASTRKSATNQRTDT GEE
Subjt:  KSSKPSSDNEKPTRKRKEANKAVPDSQAEKRPKKFSHSTRRNKRQVNKVLLETPEDEIDFQKISFKDLIIYHEYKEKLEKKEASTRKSATNQRTDTVGEE

Query:  IFNDGEESLASEQGRGTDDDETPDVVDMTSAYFNYQSFMDKARRTKWSKKDTELFYQAVRQFGTDFCMIQQLFPGRTRRQIKLKFKSEERHHPFRLSDAI
        ++NDGEE+LASEQGRGTDDDETPDVVDMTSAYFNYQSFMDK  RTKWSK+DTE FY+AVRQFGTDFCMIQQLFPGRTR QIKLKFKSEERHHPFRLSDAI
Subjt:  IFNDGEESLASEQGRGTDDDETPDVVDMTSAYFNYQSFMDKARRTKWSKKDTELFYQAVRQFGTDFCMIQQLFPGRTRRQIKLKFKSEERHHPFRLSDAI

Query:  TNRAKDHSQFLSLIGLLKEAANKAKQESNQDELTENTGDEELQELSPETNEEEVAKPDDGGVQETEKEECI-GEIHSPLKADES-DDDDPHRWDEYKFDY
         NRAKDHSQFL LI  LKEAANKAK ESNQDELTEN+GDEE +ELSPETNEEEVAKP+  G++ET KEE + GE+HSPLKADES DDDDP+RWDEYKFDY
Subjt:  TNRAKDHSQFLSLIGLLKEAANKAKQESNQDELTENTGDEELQELSPETNEEEVAKPDDGGVQETEKEECI-GEIHSPLKADES-DDDDPHRWDEYKFDY

XP_038875905.1 uncharacterized protein LOC120068262 isoform X3 [Benincasa hispida]0.0e+0069.31Show/hide
Query:  MDPLDEILSDSGATSRAGNRFQPKIKPRPKKQTSAPPSATPSQDKKETILDAKSCHDGRGNTKLIQSSSQLPVIKENKESEDDLLLATARSDFIGCLHPT
        MDP DEI SD G TSRAG RFQPKIKPRPKKQT  PP +T S DKK TI D KSC DG GN+K I SSSQLPV +E +ESEDDLLLATARSDFIG  HPT
Subjt:  MDPLDEILSDSGATSRAGNRFQPKIKPRPKKQTSAPPSATPSQDKKETILDAKSCHDGRGNTKLIQSSSQLPVIKENKESEDDLLLATARSDFIGCLHPT

Query:  SVESAKMVDSMQFDLDSCRSILPSGSTTDVQQLQGWITRPSTFEMVICALSIELCSGWQMKNICLVETCIYNATFKKFLEHPHDILLKDAPDFFKEFEGL
        SVESAKMVDS QFDLDS   ILPSGST                                                                         
Subjt:  SVESAKMVDSMQFDLDSCRSILPSGSTTDVQQLQGWITRPSTFEMVICALSIELCSGWQMKNICLVETCIYNATFKKFLEHPHDILLKDAPDFFKEFEGL

Query:  PLGHNPFLDPCGGEVYFFSLFWALCGVCRVLERLMRNFLWEGFEEGRGAHLVSWEVVEKPVGLGGLELGLVGVKNLTDDAKNLGILNYSHPSASTV-PAP
                                                   E+G          V   + L    LG VGVKNL DD KNL +LNYSHPSAS+   A 
Subjt:  PLGHNPFLDPCGGEVYFFSLFWALCGVCRVLERLMRNFLWEGFEEGRGAHLVSWEVVEKPVGLGGLELGLVGVKNLTDDAKNLGILNYSHPSASTV-PAP

Query:  VLDQSGLGSIQTEDRHFGDGNIAGENIDIFYELECLDDFHNQPKNEADHATLKQATISNEDGDLDKQNLEIEECESGADITMDTISSRATNPPEQPACKY
        VLDQ GLGSIQ+E  HF DG IAG+NID+FYELECLDDFHNQPKNEAD ++LK ATISNEDGDLDKQ LEIEEC +GA+ITMDTISS  T P EQPACKY
Subjt:  VLDQSGLGSIQTEDRHFGDGNIAGENIDIFYELECLDDFHNQPKNEADHATLKQATISNEDGDLDKQNLEIEECESGADITMDTISSRATNPPEQPACKY

Query:  TPKPKTRSAKEDTCTQISQPEISNMLPPSPQVTSCDTRYMHEASIGTHSDGILNDSSINFDGYTSVNQHTEAPVNVESLEFDSYGDILVDDFNSDDQDEM
         PKPK R+A  D CTQISQPEISNMLP SPQV SCDT  MHEASIGTH DG LNDSSI+FDGY  VNQHTE PVNVESL +DSYGDIL+DDFNSDD+DEM
Subjt:  TPKPKTRSAKEDTCTQISQPEISNMLPPSPQVTSCDTRYMHEASIGTHSDGILNDSSINFDGYTSVNQHTEAPVNVESLEFDSYGDILVDDFNSDDQDEM

Query:  SREVSGKSSEEKPSMESNISQQQEMFSQVGEEIEHRKTPRKMRKQVSHQLDEPEDGVDENRNFPNEPSSNCDIDGDGYNKNETSKGGTSRGKKATKKSSK
         RE  GK+ EE+PS ESNISQQQ+MF  VGEEIEH KT RK+RK+VSHQLDEPEDGVDENRNFPNEPSSN D+ GDGYNKNE  KGG  RGKK + KSSK
Subjt:  SREVSGKSSEEKPSMESNISQQQEMFSQVGEEIEHRKTPRKMRKQVSHQLDEPEDGVDENRNFPNEPSSNCDIDGDGYNKNETSKGGTSRGKKATKKSSK

Query:  PSSDNEKPTRKRKEANKAVPDSQAEKRPKKFSHSTRRNKRQVNKVLLETPEDEIDFQKISFKDLIIYHEYKEKLEKKEASTRKSATNQRTDTVGEEIFND
        PS+DNEKPTRKRKEANKAVPD QAEKRPKKFSHSTRRN+RQVNKVLLETPEDEIDFQKISF+DLIIYHE+KEKLEKK ASTRKSATNQRTDT GEE++ND
Subjt:  PSSDNEKPTRKRKEANKAVPDSQAEKRPKKFSHSTRRNKRQVNKVLLETPEDEIDFQKISFKDLIIYHEYKEKLEKKEASTRKSATNQRTDTVGEEIFND

Query:  GEESLASEQGRGTDDDETPDVVDMTSAYFNYQSFMDKARRTKWSKKDTELFYQAVRQFGTDFCMIQQLFPGRTRRQIKLKFKSEERHHPFRLSDAITNRA
        GEE+LASEQGRGTDDDETPDVVDMTSAYFNYQSFMDK  RTKWSK+DTE FY+AVRQFGTDFCMIQQLFPGRTR QIKLKFKSEERHHPFRLSDAI NRA
Subjt:  GEESLASEQGRGTDDDETPDVVDMTSAYFNYQSFMDKARRTKWSKKDTELFYQAVRQFGTDFCMIQQLFPGRTRRQIKLKFKSEERHHPFRLSDAITNRA

Query:  KDHSQFLSLIGLLKEAANKAKQESNQDELTENTGDEELQELSPETNEEEVAKPDDGGVQETEKEECI-GEIHSPLKADES-DDDDPHRWDEYKFDY
        KDHSQFL LI  LKEAANKAK ESNQDELTEN+GDEE +ELSPETNEEEVAKP+  G++ET KEE + GE+HSPLKADES DDDDP+RWDEYKFDY
Subjt:  KDHSQFLSLIGLLKEAANKAKQESNQDELTENTGDEELQELSPETNEEEVAKPDDGGVQETEKEECI-GEIHSPLKADES-DDDDPHRWDEYKFDY

TrEMBL top hitse value%identityAlignment
A0A6J1CIL9 uncharacterized protein LOC111011932 isoform X13.5e-30567.44Show/hide
Query:  MDPLDEILSDSGATSRAGNRFQPKIKPRPKKQTSAPPSATPSQDKKETILDAKSCHDGRGNTKLIQSSSQLPVIKENKESEDDLLLATARSDFIGCLHPT
        MDP DEILSD G T+R G RFQPKIKPRPKKQT AP  +T SQDKKETI D+ SCHDG G+TK I+SSSQLPVI+ENKESEDDLLLAT  SDF  C  P 
Subjt:  MDPLDEILSDSGATSRAGNRFQPKIKPRPKKQTSAPPSATPSQDKKETILDAKSCHDGRGNTKLIQSSSQLPVIKENKESEDDLLLATARSDFIGCLHPT

Query:  SVESAKMVDSMQFDLDSCRSILPSGSTTDVQQLQGWITRPSTFEMVICALSIELCSGWQMKNICLVETCIYNATFKKFLEHPHDILLKDAPDFFKEFEGL
        SVESA+ VDSMQFDLDSC SILPS STTD                                                                       
Subjt:  SVESAKMVDSMQFDLDSCRSILPSGSTTDVQQLQGWITRPSTFEMVICALSIELCSGWQMKNICLVETCIYNATFKKFLEHPHDILLKDAPDFFKEFEGL

Query:  PLGHNPFLDPCGGEVYFFSLFWALCGVCRVLERLMRNFLWEGFEEGRGAHLVSWEVVEKPVGLGGLELGLVGVKNLTDDAKNLGILNYSHPSASTV-PAP
                                                                            G  G +NLTDD KNLG+LN SHPS ST   A 
Subjt:  PLGHNPFLDPCGGEVYFFSLFWALCGVCRVLERLMRNFLWEGFEEGRGAHLVSWEVVEKPVGLGGLELGLVGVKNLTDDAKNLGILNYSHPSASTV-PAP

Query:  VLDQSGLGSIQTEDRHFGDGNIAGENIDIFYELECLDDFHNQPKNEADHATLKQATISNEDGDLDKQNLEIEECESGADITMDTISSRATNPPEQPACKY
        VLDQSGL   QTED HF DG IAGENID+FY LECLDDFHNQPKNEAD A L+QAT S+EDGDL+KQ LEIEEC  GADI  D ++S AT P     CKY
Subjt:  VLDQSGLGSIQTEDRHFGDGNIAGENIDIFYELECLDDFHNQPKNEADHATLKQATISNEDGDLDKQNLEIEECESGADITMDTISSRATNPPEQPACKY

Query:  TPKPKTRSAKEDTCTQISQPEISNMLPPSPQVTSCDTRYMHEASIGTHSDGILNDSSINFDGYTSVNQHTEAPVNVESLEFDSYGDILVDDFNSDDQDEM
         PKP+ R+ KEDT TQISQPEISNMLPPSPQVTSCD R ++EASI THSDGILNDSSINFDGYTSVNQ TEAPVNVESLE+DSYGDILV+DFNSDDQDEM
Subjt:  TPKPKTRSAKEDTCTQISQPEISNMLPPSPQVTSCDTRYMHEASIGTHSDGILNDSSINFDGYTSVNQHTEAPVNVESLEFDSYGDILVDDFNSDDQDEM

Query:  SREVSGKSSEEKPSMESNISQQQEMFSQVGEEIEHRKTPRKMRKQVSHQLDEPEDGVDENRNFPNEPSSNCDIDGDGYNKNETSKGGTSRGKKATKKSSK
        SR VSGKS  EKPS+E NISQQQE F QV EE EH+KT RK+RKQVSH LD+PEDGVDEN+NFPNEPS     +GDG NKNETS    SRGKKA+KKSSK
Subjt:  SREVSGKSSEEKPSMESNISQQQEMFSQVGEEIEHRKTPRKMRKQVSHQLDEPEDGVDENRNFPNEPSSNCDIDGDGYNKNETSKGGTSRGKKATKKSSK

Query:  PSSDNEKPTRKRKEANKAVPDSQAEKRPKKFSHSTRRNKRQVNKVLLETPEDEIDFQKISFKDLIIYHEYKEKLEKKEASTRKSATNQRTDTVGEEIFND
         SS+NEKPTRKRKEANKAVPD QAEKRPKKFSHSTRRN+RQVNKVLLETPEDEIDFQKISF+DLIIYHEYKEKLEKKEASTR+ ATNQRTDTV EE +ND
Subjt:  PSSDNEKPTRKRKEANKAVPDSQAEKRPKKFSHSTRRNKRQVNKVLLETPEDEIDFQKISFKDLIIYHEYKEKLEKKEASTRKSATNQRTDTVGEEIFND

Query:  GEESLASEQGRGTDDDETPDVVDMTSAYFNYQSFMDKARRTKWSKKDTELFYQAVRQFGTDFCMIQQLFPGRTRRQIKLKFKSEERHHPFRLSDAITNRA
         EES+ASEQGRG+DDDETPDVVDMTSAYFNYQSFMDK  RTKWSK+DTELFY AVRQFGTDFCMIQQLFPGRTRRQIKLKFKSEERHHP RLSDAITNRA
Subjt:  GEESLASEQGRGTDDDETPDVVDMTSAYFNYQSFMDKARRTKWSKKDTELFYQAVRQFGTDFCMIQQLFPGRTRRQIKLKFKSEERHHPFRLSDAITNRA

Query:  KDHSQFLSLIGLLKEAANKAKQESNQDELTENTGDEELQELSPETNEEEVAKP--------DDGGVQETEKEECIGEIHSPLKAD-ESDDDDPHRWDEYK
         DHS FLS+IG LKEAANKAKQE+NQDELTENTGDEEL ELSPETNEEEVAKP         +G  Q TEKEE +GEIHSPLK+D  SDDDDPHRWDEYK
Subjt:  KDHSQFLSLIGLLKEAANKAKQESNQDELTENTGDEELQELSPETNEEEVAKP--------DDGGVQETEKEECIGEIHSPLKAD-ESDDDDPHRWDEYK

Query:  FDY
        FD+
Subjt:  FDY

A0A6J1FJU1 uncharacterized protein LOC111444681 isoform X11.0e-30467.23Show/hide
Query:  MDPLDEILSDSGATSRAGNRFQPKIKPRPKKQTSAPPSATPSQDKKETILDAKSCHDGRGNTKLIQSSSQLPVIKENKESEDDLLLATARSDFIGCLHPT
        MDP DEILSD G TSR G RFQPKIKPRPKKQT AP  +T SQDKK T  DAKSCHDGRGNTK I+SSSQLPVI+ENKESEDDLLLAT RSDFIGC H T
Subjt:  MDPLDEILSDSGATSRAGNRFQPKIKPRPKKQTSAPPSATPSQDKKETILDAKSCHDGRGNTKLIQSSSQLPVIKENKESEDDLLLATARSDFIGCLHPT

Query:  SVESAKMVDSMQFDLDSCRSILPSGSTTDVQQLQGWITRPSTFEMVICALSIELCSGWQMKNICLVETCIYNATFKKFLEHPHDILLKDAPDFFKEFEGL
        SVESA MVDS Q DLDSC  ILPSGST D                                                                       
Subjt:  SVESAKMVDSMQFDLDSCRSILPSGSTTDVQQLQGWITRPSTFEMVICALSIELCSGWQMKNICLVETCIYNATFKKFLEHPHDILLKDAPDFFKEFEGL

Query:  PLGHNPFLDPCGGEVYFFSLFWALCGVCRVLERLMRNFLWEGFEEGRGAHLVSWEVVEKPVGLGGLELGLVGVKNLTDDAKNLGILNYSHPSASTV-PAP
                                                                            G VGV+N TDD+KN GILNYSH SAS    A 
Subjt:  PLGHNPFLDPCGGEVYFFSLFWALCGVCRVLERLMRNFLWEGFEEGRGAHLVSWEVVEKPVGLGGLELGLVGVKNLTDDAKNLGILNYSHPSASTV-PAP

Query:  VLDQSGLGSIQTEDRHFGDGNIAGENIDIFYELECLDDFHNQPKNEADHATLKQATISNEDGDLDKQNLEIEECESGADITMDTISSRATNPPEQPACKY
        VL QSGLGSIQ ED H  DG IAG+N D+F ELE LDDFHNQPKNEAD ++LKQATISNEDGDLD Q LE+EEC +GA+IT D ISS  T P EQPACKY
Subjt:  VLDQSGLGSIQTEDRHFGDGNIAGENIDIFYELECLDDFHNQPKNEADHATLKQATISNEDGDLDKQNLEIEECESGADITMDTISSRATNPPEQPACKY

Query:  TPKPKTRSAKEDTCTQISQPEISNMLPPSPQVTSCDTRYMHEASIGTHSDGILNDSSINFDGYTSVNQHTEAPVNVESLEFDSYGDILVDDFNSDDQDEM
         PKPK R A  D+CTQISQPEISN LPPSPQV SCDTR MHEASIGTHSDG+LNDSSINFD Y+ VNQH EAPVNVESL +DSYGDILVDDFNSDDQDEM
Subjt:  TPKPKTRSAKEDTCTQISQPEISNMLPPSPQVTSCDTRYMHEASIGTHSDGILNDSSINFDGYTSVNQHTEAPVNVESLEFDSYGDILVDDFNSDDQDEM

Query:  SREVSGKSSEEKPSMESNISQQQEMFSQVGEEIEHRKTPRKMRKQVSHQLDEPEDGVDENRNFPNEPSSNCDIDGDGYNKNETSKGGTSRGKKATKKSSK
         RE  GK+ EE P  +SN+SQQQEMF  VGEEI+H KT RK+R+QVSHQLD+PEDGVD+   FP+E  SN DI GDGY KN        RG K   KS K
Subjt:  SREVSGKSSEEKPSMESNISQQQEMFSQVGEEIEHRKTPRKMRKQVSHQLDEPEDGVDENRNFPNEPSSNCDIDGDGYNKNETSKGGTSRGKKATKKSSK

Query:  PSSDNEKPTRKRKEANKAVPDSQAEKRPKKFSHSTRRNKRQVNKVLLETPEDEIDFQKISFKDLIIYHEYKEKLEKKEASTRKSATNQRTDTVGEEIFND
        PSSDNEKP RKRKEANKAVPD QAEKRPKKFSHSTRRN+RQVNKVLLETPEDEIDFQKISF+DLIIYHE+KEKLEKKEA+TR+ ATNQRTDTVGEEI+ND
Subjt:  PSSDNEKPTRKRKEANKAVPDSQAEKRPKKFSHSTRRNKRQVNKVLLETPEDEIDFQKISFKDLIIYHEYKEKLEKKEASTRKSATNQRTDTVGEEIFND

Query:  GEESLASEQGRGTDDDETPDVVDMTSAYFNYQSFMDKARRTKWSKKDTELFYQAVRQFGTDFCMIQQLFPGRTRRQIKLKFKSEERHHPFRLSDAITNRA
        GEESLA EQGRGTDDDETPDVVDMTSAYFNY SFMDK  RTKWSK DTE FY+AVRQFGTDFCMIQQLFPGRTR QIKLKFK+EERHHPFRLSDAITNRA
Subjt:  GEESLASEQGRGTDDDETPDVVDMTSAYFNYQSFMDKARRTKWSKKDTELFYQAVRQFGTDFCMIQQLFPGRTRRQIKLKFKSEERHHPFRLSDAITNRA

Query:  KDHSQFLSLIGLLKEAANKAKQESNQDELTENTGDEELQELSPETNEEEVAKPDDGGVQETEKEECIGEIHSPLKADESDDDDPHRWDEYKFDY
        KDHSQFLSLIG L+EAANKAK ESN+DELTEN+GDEEL EL+PETNEEEVAKP  G V++T+ EE +GEIHSPLKAD SDDDDPHRWDEYKFDY
Subjt:  KDHSQFLSLIGLLKEAANKAKQESNQDELTENTGDEELQELSPETNEEEVAKPDDGGVQETEKEECIGEIHSPLKADESDDDDPHRWDEYKFDY

A0A6J1GL69 transcription factor TFIIIB component B''-like3.4e-30868.86Show/hide
Query:  MDPLDEILSDSGATSRAGNRFQPKIKPRPKKQTSAPPSATPSQDKKETILDAKSCHDGRGNTKLIQSSSQLPVIKENKESEDDLLLATARSDFIGCLHPT
        MDP D+ILSD   +SR G+RFQPKIKPRPKK T AP S T SQDKKET+ DAKSCHDGRGNTK IQSSSQLPVI+ENKESEDDLLLATARSDFI   HP 
Subjt:  MDPLDEILSDSGATSRAGNRFQPKIKPRPKKQTSAPPSATPSQDKKETILDAKSCHDGRGNTKLIQSSSQLPVIKENKESEDDLLLATARSDFIGCLHPT

Query:  SVESAKMVDSMQFDLDSCRSILPSGSTTDVQQLQGWITRPSTFEMVICALSIELCSGWQMKNICLVETCIYNATFKKFLEHPHDILLKDAPDFFKEFEGL
        SVESAK VDSMQ D DSC SILPSGSTTD                                                                       
Subjt:  SVESAKMVDSMQFDLDSCRSILPSGSTTDVQQLQGWITRPSTFEMVICALSIELCSGWQMKNICLVETCIYNATFKKFLEHPHDILLKDAPDFFKEFEGL

Query:  PLGHNPFLDPCGGEVYFFSLFWALCGVCRVLERLMRNFLWEGFEEGRGAHLVSWEVVEKPVGLGGLELGLVGVKNLTDDAKNLGILNYSHPSASTV-PAP
                DP                                                            VGVK LT D KN GILNYS PS+ST   A 
Subjt:  PLGHNPFLDPCGGEVYFFSLFWALCGVCRVLERLMRNFLWEGFEEGRGAHLVSWEVVEKPVGLGGLELGLVGVKNLTDDAKNLGILNYSHPSASTV-PAP

Query:  VLDQSGLGSIQTEDRHFGDGNIAGENIDIFYELECLDDFHNQPKNEADHATLKQATISNEDGDLDKQNLEIEECESGADITMDTISSRATNPPEQPACKY
        VLDQSGLGSIQTEDRHF DG I  ENIDIFYELE LDDFHNQPKNEAD A+L QATIS+EDGDLDKQ LEIEE  +GADITMDTISS  T+P EQPACKY
Subjt:  VLDQSGLGSIQTEDRHFGDGNIAGENIDIFYELECLDDFHNQPKNEADHATLKQATISNEDGDLDKQNLEIEECESGADITMDTISSRATNPPEQPACKY

Query:  TPKPKTRSAKEDTCTQISQPEISNMLPPSPQVTSCDTRYMHEASIGTHSDGILNDSSINFDGYTSVNQHTEAPVNVESLEFDSYGDILVDDFNSDDQDEM
        T KPK R+A+ED C +ISQPEISNMLPPSPQV S DTR MHE SIG+  DGILNDSSINFDGYT  NQHTE PVNVE L FDSY DILVDDFN DDQD  
Subjt:  TPKPKTRSAKEDTCTQISQPEISNMLPPSPQVTSCDTRYMHEASIGTHSDGILNDSSINFDGYTSVNQHTEAPVNVESLEFDSYGDILVDDFNSDDQDEM

Query:  SREVSGKSSEEKPSMESNISQQQEMFSQVGEEIEHRKTPRKMRKQVSHQLDEPEDGVDENRNFPNEPSSNCDIDGDGYNKNETSKGGTSRGKKATKKSSK
        SREVSGK+ E KPSMESNISQ QEMFSQVGEEIE  KT RK RK+VSHQ DEPEDGVDENRNFP                NETSK GTSRGKKA+KKSSK
Subjt:  SREVSGKSSEEKPSMESNISQQQEMFSQVGEEIEHRKTPRKMRKQVSHQLDEPEDGVDENRNFPNEPSSNCDIDGDGYNKNETSKGGTSRGKKATKKSSK

Query:  PSSDNEKPTRKRKEANKAVPDSQAEKRPKKFSHSTRRNKRQVNKVLLETPEDEIDFQKISFKDLIIYHEYKEKLEKKEASTRKSATNQRTDTVGEEIFND
        PSSD+EKP RKRK+ANK+VPD QAEKRPKKFSHSTRRN+RQVNKVLLETPEDEIDFQKISF+DLIIYHEYKEKLEKKEAS RKSATNQRTDT GEEIFND
Subjt:  PSSDNEKPTRKRKEANKAVPDSQAEKRPKKFSHSTRRNKRQVNKVLLETPEDEIDFQKISFKDLIIYHEYKEKLEKKEASTRKSATNQRTDTVGEEIFND

Query:  GEESLASEQGRGTDDDETPDVVDMTSAYFNYQSFMDKARRTKWSKKDTELFYQAVRQFGTDFCMIQQLFPGRTRRQIKLKFKSEERHHPFRLSDAITNRA
        GEESLASEQGRGTDD+ETPDVVDMTSAYFNY SFMDK  R KWSK DTE FY+AVRQFGTDFCMIQQLFPG+TR QIKLKFK+EERHHPFRLSDA+T RA
Subjt:  GEESLASEQGRGTDDDETPDVVDMTSAYFNYQSFMDKARRTKWSKKDTELFYQAVRQFGTDFCMIQQLFPGRTRRQIKLKFKSEERHHPFRLSDAITNRA

Query:  KDHSQFLSLIGLLKEAANKAKQESNQDELTENTGDEELQELSPETNEEEVAKPDDGGVQETEKEECIGEIHSPLKAD--ESDDDDPHRWDEYKFDY
         DH+QFLSLIG LKEAANKAKQESNQDELTENTGDEEL ELSPETN EEVAKP+   VQET KEEC+GEIHSPLKAD  ESD+DDPHRWDEYKFD+
Subjt:  KDHSQFLSLIGLLKEAANKAKQESNQDELTENTGDEELQELSPETNEEEVAKPDDGGVQETEKEECIGEIHSPLKAD--ESDDDDPHRWDEYKFDY

A0A6J1JUR8 uncharacterized protein LOC1114876814.9e-30768.42Show/hide
Query:  MDPLDEILSDSGATSRAGNRFQPKIKPRPKKQTSAPPSATPSQDKKETILDAKSCHDGRGNTKLIQSSSQLPVIKENKESEDDLLLATARSDFIGCLHPT
        MDP D+ILSD   +SR G+RFQPKIKPRPKK T AP S T SQDKKET+ DAKSCHDGRGNTK IQSSSQLPVI+ENKESEDD+LLATARSDFI   HP 
Subjt:  MDPLDEILSDSGATSRAGNRFQPKIKPRPKKQTSAPPSATPSQDKKETILDAKSCHDGRGNTKLIQSSSQLPVIKENKESEDDLLLATARSDFIGCLHPT

Query:  SVESAKMVDSMQFDLDSCRSILPSGSTTDVQQLQGWITRPSTFEMVICALSIELCSGWQMKNICLVETCIYNATFKKFLEHPHDILLKDAPDFFKEFEGL
        SVESAKMVDSMQ D DSC  ILPSGSTTD                                                                       
Subjt:  SVESAKMVDSMQFDLDSCRSILPSGSTTDVQQLQGWITRPSTFEMVICALSIELCSGWQMKNICLVETCIYNATFKKFLEHPHDILLKDAPDFFKEFEGL

Query:  PLGHNPFLDPCGGEVYFFSLFWALCGVCRVLERLMRNFLWEGFEEGRGAHLVSWEVVEKPVGLGGLELGLVGVKNLTDDAKNLGILNYSHPSASTV-PAP
                DP                                                             GVK LT D +N GILNYS PS+ST   A 
Subjt:  PLGHNPFLDPCGGEVYFFSLFWALCGVCRVLERLMRNFLWEGFEEGRGAHLVSWEVVEKPVGLGGLELGLVGVKNLTDDAKNLGILNYSHPSASTV-PAP

Query:  VLDQSGLGSIQTEDRHFGDGNIAGENIDIFYELECLDDFHNQPKNEADHATLKQATISNEDGDLDKQNLEIEECESGADITMDTISSRATNPPEQPACKY
        VLDQSGLGSIQTEDRHF DG I  ENIDIFYELE LDDFHNQPKNEAD A+LKQATIS++DGDLDKQ LEIEE  +GADITMDTISS  T+P EQP CKY
Subjt:  VLDQSGLGSIQTEDRHFGDGNIAGENIDIFYELECLDDFHNQPKNEADHATLKQATISNEDGDLDKQNLEIEECESGADITMDTISSRATNPPEQPACKY

Query:  TPKPKTRSAKEDTCTQISQPEISNMLPPSPQVTSCDTRYMHEASIGTHSDGILNDSSINFDGYTSVNQHTEAPVNVESLEFDSYGDILVDDFNSDDQDEM
        T KPK R+A+ED C +ISQPEISNMLPPSPQV S DTR MH+ SIG+  DGILNDSSINFDGYT VNQHTEAPVNVE L FDSY DILVDDFNSDDQD  
Subjt:  TPKPKTRSAKEDTCTQISQPEISNMLPPSPQVTSCDTRYMHEASIGTHSDGILNDSSINFDGYTSVNQHTEAPVNVESLEFDSYGDILVDDFNSDDQDEM

Query:  SREVSGKSSEEKPSMESNISQQQEMFSQVGEEIEHRKTPRKMRKQVSHQLDEPEDGVDENRNFPNEPSSNCDIDGDGYNKNETSKGGTSRGKKATKKSSK
        SREVSGK+ E KPSMESNISQ QEMFSQVGEEIE  KT RK RK+VSHQ DEPEDGVDENRNFP                NETSK GTSRGKKA+KKSSK
Subjt:  SREVSGKSSEEKPSMESNISQQQEMFSQVGEEIEHRKTPRKMRKQVSHQLDEPEDGVDENRNFPNEPSSNCDIDGDGYNKNETSKGGTSRGKKATKKSSK

Query:  PSSDNEKPTRKRKEANKAVPDSQAEKRPKKFSHSTRRNKRQVNKVLLETPEDEIDFQKISFKDLIIYHEYKEKLEKKEASTRKSATNQRTDTVGEEIFND
        PSSD+EKP RKRK++NK+VPD+QAEKRPKKFSHSTRRN+RQVNKVLLETPEDEIDFQKISF+DLIIYHEYKEKLEKKEAS RKSATNQRTDT G E FND
Subjt:  PSSDNEKPTRKRKEANKAVPDSQAEKRPKKFSHSTRRNKRQVNKVLLETPEDEIDFQKISFKDLIIYHEYKEKLEKKEASTRKSATNQRTDTVGEEIFND

Query:  GEESLASEQGRGTDDDETPDVVDMTSAYFNYQSFMDKARRTKWSKKDTELFYQAVRQFGTDFCMIQQLFPGRTRRQIKLKFKSEERHHPFRLSDAITNRA
        GEESLASEQGRGTDD+ETPDVVDMTSAYFNY SFMDK  R KWSK DTE FY+AVRQFGTDFCMIQQLFPG+TR QIKLKFK+EERHHPFRLSDA+TNRA
Subjt:  GEESLASEQGRGTDDDETPDVVDMTSAYFNYQSFMDKARRTKWSKKDTELFYQAVRQFGTDFCMIQQLFPGRTRRQIKLKFKSEERHHPFRLSDAITNRA

Query:  KDHSQFLSLIGLLKEAANKAKQESNQDELTENTGDEELQELSPETNEEEVAKPDDGGVQETEKEECIGEIHSPLKAD--ESDDDDPHRWDEYKFDY
         DHSQFLSLIG LKEAANKAKQESNQDELTENTGDEEL ELSPETN EEVAKP+   VQET KEEC+GE+HSPLKAD  ESD+DDPHRWDEYKFD+
Subjt:  KDHSQFLSLIGLLKEAANKAKQESNQDELTENTGDEELQELSPETNEEEVAKPDDGGVQETEKEECIGEIHSPLKAD--ESDDDDPHRWDEYKFDY

A0A6J1JVG9 uncharacterized protein LOC111489243 isoform X14.5e-30868.12Show/hide
Query:  MDPLDEILSDSGATSRAGNRFQPKIKPRPKKQTSAPPSATPSQDKKETILDAKSCHDGRGNTKLIQSSSQLPVIKENKESEDDLLLATARSDFIGCLHPT
        MDP DEILSD G TSR G RFQPKIKPRPKKQT AP  +T SQDKK T  DAKSCHD RGNTK I+SSSQL VI+ENKESEDDLLLAT RSDFIGC H T
Subjt:  MDPLDEILSDSGATSRAGNRFQPKIKPRPKKQTSAPPSATPSQDKKETILDAKSCHDGRGNTKLIQSSSQLPVIKENKESEDDLLLATARSDFIGCLHPT

Query:  SVESAKMVDSMQFDLDSCRSILPSGSTTDVQQLQGWITRPSTFEMVICALSIELCSGWQMKNICLVETCIYNATFKKFLEHPHDILLKDAPDFFKEFEGL
        SVESA MVDS Q DLDSC  ILPSGST D                                                                       
Subjt:  SVESAKMVDSMQFDLDSCRSILPSGSTTDVQQLQGWITRPSTFEMVICALSIELCSGWQMKNICLVETCIYNATFKKFLEHPHDILLKDAPDFFKEFEGL

Query:  PLGHNPFLDPCGGEVYFFSLFWALCGVCRVLERLMRNFLWEGFEEGRGAHLVSWEVVEKPVGLGGLELGLVGVKNLTDDAKNLGILNYSHPSASTV-PAP
                                                                            G VGV+NLTDD+KN GILN SH SAS    A 
Subjt:  PLGHNPFLDPCGGEVYFFSLFWALCGVCRVLERLMRNFLWEGFEEGRGAHLVSWEVVEKPVGLGGLELGLVGVKNLTDDAKNLGILNYSHPSASTV-PAP

Query:  VLDQSGLGSIQTEDRHFGDGNIAGENIDIFYELECLDDFHNQPKNEADHATLKQATISNEDGDLDKQNLEIEECESGADITMDTISSRATNPPEQPACKY
        VL QSGLGSIQ ED H  DG IAG+N D+FY+LE LDDFHNQPKNEAD ++LKQATISNEDGDLDKQ LE+EEC +GA+IT DTISS  T   EQPACKY
Subjt:  VLDQSGLGSIQTEDRHFGDGNIAGENIDIFYELECLDDFHNQPKNEADHATLKQATISNEDGDLDKQNLEIEECESGADITMDTISSRATNPPEQPACKY

Query:  TPKPKTRSAKEDTCTQISQPEISNMLPPSPQVTSCDTRYMHEASIGTHSDGILNDSSINFDGYTSVNQHTEAPVNVESLEFDSYGDILVDDFNSDDQDEM
         PKPK R A  D+CTQISQPEISN LPPSPQV SCDT+ MHEASIGTHSDG+LNDSSINFD Y+ VNQH EAPVNVESL +DSYGDILVDDFNSDDQDEM
Subjt:  TPKPKTRSAKEDTCTQISQPEISNMLPPSPQVTSCDTRYMHEASIGTHSDGILNDSSINFDGYTSVNQHTEAPVNVESLEFDSYGDILVDDFNSDDQDEM

Query:  SREVSGKSSEEKPSMESNISQQQEMFSQVGEEIEHRKTPRKMRKQVSHQLDEPEDGVDENRNFPNEPSSNCDIDGDGYNKNETSKGGTSRGKKATKKSSK
         RE  GK+ EE PS  SN+SQQQEMF  VGEEIEH KT RK+R+QVSHQL +PEDGVD+   FP+E  SNCDI GDGY KNETSKG   RG K   KS K
Subjt:  SREVSGKSSEEKPSMESNISQQQEMFSQVGEEIEHRKTPRKMRKQVSHQLDEPEDGVDENRNFPNEPSSNCDIDGDGYNKNETSKGGTSRGKKATKKSSK

Query:  PSSDNEKPTRKRKEANKAVPDSQAEKRPKKFSHSTRRNKRQVNKVLLETPEDEIDFQKISFKDLIIYHEYKEKLEKKEASTRKSATNQRTDTVGEEIFND
        PSSDNEKPTRKRKEANKAVPD QAEKRPKKFSHSTRRN+RQVNKVLLETPEDEIDFQKISF+DLIIYHE+KEKLEKKEA+TR+SATNQRTDTVGEEI+ND
Subjt:  PSSDNEKPTRKRKEANKAVPDSQAEKRPKKFSHSTRRNKRQVNKVLLETPEDEIDFQKISFKDLIIYHEYKEKLEKKEASTRKSATNQRTDTVGEEIFND

Query:  GEESLASEQGRGTDDDETPDVVDMTSAYFNYQSFMDKARRTKWSKKDTELFYQAVRQFGTDFCMIQQLFPGRTRRQIKLKFKSEERHHPFRLSDAITNRA
        GEESLA EQGRGTDDDETPDVVDMTSAYFNY SFMDK  RTKWSK DTE FY+AVRQFGTDFCMIQQLFPGRTR QIKLKFK+EERHHPFRLSDAITNRA
Subjt:  GEESLASEQGRGTDDDETPDVVDMTSAYFNYQSFMDKARRTKWSKKDTELFYQAVRQFGTDFCMIQQLFPGRTRRQIKLKFKSEERHHPFRLSDAITNRA

Query:  KDHSQFLSLIGLLKEAANKAKQESNQDELTENTGDEELQELSPETNEEEVAKPDDGGVQETEKEECIGEIHSPLKADESDDDDPHRWDEYKFDY
        KDHSQFLSLIG L+EAANKAK ESN+DELTENTG+EEL ELSPE NEEEVAKP  G V++T+ EE +GEIHSPLKADESDDDDPHRWDEYKFDY
Subjt:  KDHSQFLSLIGLLKEAANKAKQESNQDELTENTGDEELQELSPETNEEEVAKPDDGGVQETEKEECIGEIHSPLKADESDDDDPHRWDEYKFDY

SwissProt top hitse value%identityAlignment
O94481 Transcription factor TFIIIB component B''3.0e-1152.46Show/hide
Query:  DKARRTKWSKKDTELFYQAVRQFGTDFCMIQQLFPGRTRRQIKLKFKSEERHHPFRLSDAI
        ++ +  KW+  DTE FY+A+ Q+GTDF +I  +FP R RRQIKLKFK EER +P R++ A+
Subjt:  DKARRTKWSKKDTELFYQAVRQFGTDFCMIQQLFPGRTRRQIKLKFKSEERHHPFRLSDAI

Arabidopsis top hitse value%identityAlignment
AT4G39160.1 Homeodomain-like superfamily protein2.0e-3433.51Show/hide
Query:  EHRKTPRKMRKQVSHQLDEPEDGVDENRNFPNEPSSNCDIDGDGYNKNETSKGGTSRGKKATKKSSKPSSDNEKPTRKRKEANKAVPDSQAEKRPKKFSH
        E    P   R+ V     +   G  E       P  N  + G+     E   G T   ++ +K+ SK        +RKRK+ ++    S  +   KKF H
Subjt:  EHRKTPRKMRKQVSHQLDEPEDGVDENRNFPNEPSSNCDIDGDGYNKNETSKGGTSRGKKATKKSSKPSSDNEKPTRKRKEANKAVPDSQAEKRPKKFSH

Query:  STRRNKRQVNKVLLETPEDEIDFQKISFKDLIIYHEYKEKLEKKEASTRKSATNQRTDTV---GEEIFNDG--EESLASEQGRGTDDDETPDVVDMTSAY
        S+RR KR + K LLETP+ EI  + +  +D++   EYKE ++KKEA       +Q ++ +   G +  + G  EE    + G  + + +  +VV   S  
Subjt:  STRRNKRQVNKVLLETPEDEIDFQKISFKDLIIYHEYKEKLEKKEASTRKSATNQRTDTV---GEEIFNDG--EESLASEQGRGTDDDETPDVVDMTSAY

Query:  FNYQSFMDKARRTKWSKKDTELFYQAVRQFGTDFCMIQQLFPGRTRRQIKLKFKSEERHHPFRLSDAITNRAKDHSQFLSLIGLLKEAANKAKQESNQDE
         NYQ++M+K  RT+WSK+DTELFY+ +++FG++  MIQQLFP RTR Q+KLKFK EER +P +L+DA+++R+K  + F ++I  L++ A  AK+   ++E
Subjt:  FNYQSFMDKARRTKWSKKDTELFYQAVRQFGTDFCMIQQLFPGRTRRQIKLKFKSEERHHPFRLSDAITNRAKDHSQFLSLIGLLKEAANKAKQESNQDE

Query:  LTENTGDEELQELSPETNEEEVAKPDDGGVQETEKEECIGEIHSPLKADESD--DDDPHRWDEYKFD
                ++ E       EE  +  D GV    KE   G++ + +++D  D  DDD   W+ YK D
Subjt:  LTENTGDEELQELSPETNEEEVAKPDDGGVQETEKEECIGEIHSPLKADESD--DDDPHRWDEYKFD

AT4G39160.2 Homeodomain-like superfamily protein2.0e-3433.51Show/hide
Query:  EHRKTPRKMRKQVSHQLDEPEDGVDENRNFPNEPSSNCDIDGDGYNKNETSKGGTSRGKKATKKSSKPSSDNEKPTRKRKEANKAVPDSQAEKRPKKFSH
        E    P   R+ V     +   G  E       P  N  + G+     E   G T   ++ +K+ SK        +RKRK+ ++    S  +   KKF H
Subjt:  EHRKTPRKMRKQVSHQLDEPEDGVDENRNFPNEPSSNCDIDGDGYNKNETSKGGTSRGKKATKKSSKPSSDNEKPTRKRKEANKAVPDSQAEKRPKKFSH

Query:  STRRNKRQVNKVLLETPEDEIDFQKISFKDLIIYHEYKEKLEKKEASTRKSATNQRTDTV---GEEIFNDG--EESLASEQGRGTDDDETPDVVDMTSAY
        S+RR KR + K LLETP+ EI  + +  +D++   EYKE ++KKEA       +Q ++ +   G +  + G  EE    + G  + + +  +VV   S  
Subjt:  STRRNKRQVNKVLLETPEDEIDFQKISFKDLIIYHEYKEKLEKKEASTRKSATNQRTDTV---GEEIFNDG--EESLASEQGRGTDDDETPDVVDMTSAY

Query:  FNYQSFMDKARRTKWSKKDTELFYQAVRQFGTDFCMIQQLFPGRTRRQIKLKFKSEERHHPFRLSDAITNRAKDHSQFLSLIGLLKEAANKAKQESNQDE
         NYQ++M+K  RT+WSK+DTELFY+ +++FG++  MIQQLFP RTR Q+KLKFK EER +P +L+DA+++R+K  + F ++I  L++ A  AK+   ++E
Subjt:  FNYQSFMDKARRTKWSKKDTELFYQAVRQFGTDFCMIQQLFPGRTRRQIKLKFKSEERHHPFRLSDAITNRAKDHSQFLSLIGLLKEAANKAKQESNQDE

Query:  LTENTGDEELQELSPETNEEEVAKPDDGGVQETEKEECIGEIHSPLKADESD--DDDPHRWDEYKFD
                ++ E       EE  +  D GV    KE   G++ + +++D  D  DDD   W+ YK D
Subjt:  LTENTGDEELQELSPETNEEEVAKPDDGGVQETEKEECIGEIHSPLKADESD--DDDPHRWDEYKFD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCTCACACCGCCTCGAGGCTTACGCCTCGCCTCTTCAAGAGAAAAAGCCTCAAGGCCGTGAGTTTACAAAGGATTCTTGTTGTTGTCGGACAGGCTTCACTTATGGA
CCCTTTGGATGAGATTCTTTCTGATTCTGGTGCCACTTCCCGAGCTGGAAATAGATTTCAACCAAAGATCAAACCACGTCCTAAAAAACAAACTTCGGCACCACCATCGG
CCACACCATCTCAGGATAAGAAGGAAACAATATTGGACGCTAAATCTTGTCATGATGGTAGAGGAAATACAAAGTTAATCCAATCATCATCCCAACTTCCTGTGATAAAG
GAGAACAAAGAATCTGAAGATGATTTGCTCTTGGCTACCGCAAGATCTGATTTCATTGGTTGTTTACATCCCACCTCTGTGGAAAGTGCTAAAATGGTAGACTCTATGCA
GTTTGATTTGGATTCTTGTCGTAGTATTCTTCCTTCGGGTTCTACTACTGATGTTCAACAATTGCAGGGGTGGATAACCAGACCATCAACATTTGAGATGGTAATTTGTG
CCTTATCCATTGAGCTATGCTCGGGTTGGCAAATGAAAAATATATGTCTCGTTGAGACTTGCATTTATAATGCCACTTTTAAAAAATTCTTGGAACACCCTCATGATATC
CTCCTCAAGGATGCTCCAGATTTTTTTAAAGAATTTGAAGGTCTTCCTTTAGGTCACAACCCGTTTTTGGACCCATGTGGTGGAGAAGTTTACTTTTTTTCGCTATTTTG
GGCTCTGTGTGGGGTGTGTCGGGTTCTTGAGAGACTCATGAGGAACTTTTTATGGGAGGGTTTCGAAGAGGGGAGAGGGGCGCATCTTGTAAGTTGGGAGGTAGTAGAGA
AACCTGTGGGTCTTGGTGGGCTTGAGCTAGGTCTAGTTGGAGTGAAAAATCTGACTGATGATGCCAAAAATTTGGGAATATTGAATTATTCCCATCCAAGTGCTTCTACA
GTTCCGGCTCCGGTTTTGGATCAAAGTGGACTAGGTTCAATTCAAACTGAGGACAGACATTTCGGTGATGGTAATATAGCAGGAGAGAATATCGATATATTTTATGAATT
GGAATGTCTAGATGATTTTCATAACCAACCAAAGAATGAAGCAGATCATGCAACCCTTAAGCAAGCAACAATCTCCAATGAGGATGGGGATTTGGATAAACAAAACTTGG
AAATAGAGGAATGTGAGTCAGGGGCTGATATCACTATGGATACAATAAGTTCTAGGGCCACTAATCCCCCCGAACAACCTGCTTGCAAGTATACACCAAAGCCCAAAACG
AGAAGTGCAAAAGAAGACACTTGCACACAAATCTCTCAGCCAGAAATCTCTAATATGCTTCCACCATCTCCACAAGTTACTTCTTGTGATACTAGGTACATGCATGAAGC
CTCAATTGGTACCCATTCAGATGGCATTTTGAATGATTCATCGATCAACTTTGATGGTTACACTTCTGTCAACCAACATACTGAAGCTCCGGTTAATGTAGAGTCATTAG
AATTTGACTCTTATGGTGACATACTGGTGGATGATTTTAATTCCGATGATCAGGATGAAATGTCAAGAGAAGTGAGTGGTAAGAGCAGTGAAGAAAAACCTTCGATGGAA
TCAAATATTTCTCAACAGCAAGAGATGTTCTCCCAAGTTGGTGAAGAAATTGAGCATAGAAAAACTCCTAGGAAGATGAGAAAGCAGGTTTCTCATCAACTTGATGAGCC
AGAAGATGGTGTTGATGAGAATAGGAACTTCCCTAATGAACCCTCGAGTAACTGTGACATAGATGGAGATGGCTATAACAAAAATGAAACCTCAAAAGGAGGTACATCTC
GAGGAAAAAAAGCGACAAAAAAGTCTTCAAAACCTTCTAGTGATAATGAAAAACCAACTCGGAAGCGCAAGGAGGCTAATAAAGCAGTTCCAGATTCGCAAGCTGAAAAG
CGTCCTAAGAAGTTCTCTCATTCAACTCGCCGGAATAAAAGGCAAGTGAACAAGGTTTTGCTTGAAACTCCGGAGGATGAAATTGACTTTCAAAAGATAAGTTTTAAGGA
TCTCATTATTTATCACGAGTACAAGGAGAAGTTAGAGAAGAAAGAGGCAAGCACAAGAAAGTCAGCAACCAATCAAAGAACTGACACTGTCGGTGAGGAAATTTTTAATG
ATGGAGAGGAAAGCCTTGCTTCTGAACAAGGTAGAGGTACCGATGATGATGAAACACCTGATGTAGTTGACATGACTTCTGCTTACTTTAATTACCAGTCTTTCATGGAC
AAAGCACGACGTACAAAGTGGTCAAAGAAGGATACAGAGCTTTTTTATCAGGCTGTTCGACAATTCGGAACAGATTTTTGTATGATACAACAGTTGTTTCCAGGCCGAAC
ACGCCGTCAAATTAAGCTAAAATTCAAAAGTGAAGAACGTCATCATCCATTCCGCCTTTCTGATGCTATAACTAATCGTGCCAAAGACCACTCCCAATTTTTATCCTTGA
TTGGGCTGCTGAAAGAAGCTGCTAATAAGGCAAAACAGGAGTCCAATCAAGATGAGTTGACTGAAAATACTGGGGATGAGGAGCTGCAAGAGTTATCTCCTGAAACTAAC
GAGGAAGAAGTGGCAAAACCTGATGATGGAGGAGTGCAGGAGACAGAGAAGGAAGAATGTATTGGTGAAATTCACAGTCCATTAAAGGCTGATGAAAGTGATGATGATGA
TCCTCATAGGTGGGATGAATATAAATTTGATTATTAA
mRNA sequenceShow/hide mRNA sequence
ATGCCTCACACCGCCTCGAGGCTTACGCCTCGCCTCTTCAAGAGAAAAAGCCTCAAGGCCGTGAGTTTACAAAGGATTCTTGTTGTTGTCGGACAGGCTTCACTTATGGA
CCCTTTGGATGAGATTCTTTCTGATTCTGGTGCCACTTCCCGAGCTGGAAATAGATTTCAACCAAAGATCAAACCACGTCCTAAAAAACAAACTTCGGCACCACCATCGG
CCACACCATCTCAGGATAAGAAGGAAACAATATTGGACGCTAAATCTTGTCATGATGGTAGAGGAAATACAAAGTTAATCCAATCATCATCCCAACTTCCTGTGATAAAG
GAGAACAAAGAATCTGAAGATGATTTGCTCTTGGCTACCGCAAGATCTGATTTCATTGGTTGTTTACATCCCACCTCTGTGGAAAGTGCTAAAATGGTAGACTCTATGCA
GTTTGATTTGGATTCTTGTCGTAGTATTCTTCCTTCGGGTTCTACTACTGATGTTCAACAATTGCAGGGGTGGATAACCAGACCATCAACATTTGAGATGGTAATTTGTG
CCTTATCCATTGAGCTATGCTCGGGTTGGCAAATGAAAAATATATGTCTCGTTGAGACTTGCATTTATAATGCCACTTTTAAAAAATTCTTGGAACACCCTCATGATATC
CTCCTCAAGGATGCTCCAGATTTTTTTAAAGAATTTGAAGGTCTTCCTTTAGGTCACAACCCGTTTTTGGACCCATGTGGTGGAGAAGTTTACTTTTTTTCGCTATTTTG
GGCTCTGTGTGGGGTGTGTCGGGTTCTTGAGAGACTCATGAGGAACTTTTTATGGGAGGGTTTCGAAGAGGGGAGAGGGGCGCATCTTGTAAGTTGGGAGGTAGTAGAGA
AACCTGTGGGTCTTGGTGGGCTTGAGCTAGGTCTAGTTGGAGTGAAAAATCTGACTGATGATGCCAAAAATTTGGGAATATTGAATTATTCCCATCCAAGTGCTTCTACA
GTTCCGGCTCCGGTTTTGGATCAAAGTGGACTAGGTTCAATTCAAACTGAGGACAGACATTTCGGTGATGGTAATATAGCAGGAGAGAATATCGATATATTTTATGAATT
GGAATGTCTAGATGATTTTCATAACCAACCAAAGAATGAAGCAGATCATGCAACCCTTAAGCAAGCAACAATCTCCAATGAGGATGGGGATTTGGATAAACAAAACTTGG
AAATAGAGGAATGTGAGTCAGGGGCTGATATCACTATGGATACAATAAGTTCTAGGGCCACTAATCCCCCCGAACAACCTGCTTGCAAGTATACACCAAAGCCCAAAACG
AGAAGTGCAAAAGAAGACACTTGCACACAAATCTCTCAGCCAGAAATCTCTAATATGCTTCCACCATCTCCACAAGTTACTTCTTGTGATACTAGGTACATGCATGAAGC
CTCAATTGGTACCCATTCAGATGGCATTTTGAATGATTCATCGATCAACTTTGATGGTTACACTTCTGTCAACCAACATACTGAAGCTCCGGTTAATGTAGAGTCATTAG
AATTTGACTCTTATGGTGACATACTGGTGGATGATTTTAATTCCGATGATCAGGATGAAATGTCAAGAGAAGTGAGTGGTAAGAGCAGTGAAGAAAAACCTTCGATGGAA
TCAAATATTTCTCAACAGCAAGAGATGTTCTCCCAAGTTGGTGAAGAAATTGAGCATAGAAAAACTCCTAGGAAGATGAGAAAGCAGGTTTCTCATCAACTTGATGAGCC
AGAAGATGGTGTTGATGAGAATAGGAACTTCCCTAATGAACCCTCGAGTAACTGTGACATAGATGGAGATGGCTATAACAAAAATGAAACCTCAAAAGGAGGTACATCTC
GAGGAAAAAAAGCGACAAAAAAGTCTTCAAAACCTTCTAGTGATAATGAAAAACCAACTCGGAAGCGCAAGGAGGCTAATAAAGCAGTTCCAGATTCGCAAGCTGAAAAG
CGTCCTAAGAAGTTCTCTCATTCAACTCGCCGGAATAAAAGGCAAGTGAACAAGGTTTTGCTTGAAACTCCGGAGGATGAAATTGACTTTCAAAAGATAAGTTTTAAGGA
TCTCATTATTTATCACGAGTACAAGGAGAAGTTAGAGAAGAAAGAGGCAAGCACAAGAAAGTCAGCAACCAATCAAAGAACTGACACTGTCGGTGAGGAAATTTTTAATG
ATGGAGAGGAAAGCCTTGCTTCTGAACAAGGTAGAGGTACCGATGATGATGAAACACCTGATGTAGTTGACATGACTTCTGCTTACTTTAATTACCAGTCTTTCATGGAC
AAAGCACGACGTACAAAGTGGTCAAAGAAGGATACAGAGCTTTTTTATCAGGCTGTTCGACAATTCGGAACAGATTTTTGTATGATACAACAGTTGTTTCCAGGCCGAAC
ACGCCGTCAAATTAAGCTAAAATTCAAAAGTGAAGAACGTCATCATCCATTCCGCCTTTCTGATGCTATAACTAATCGTGCCAAAGACCACTCCCAATTTTTATCCTTGA
TTGGGCTGCTGAAAGAAGCTGCTAATAAGGCAAAACAGGAGTCCAATCAAGATGAGTTGACTGAAAATACTGGGGATGAGGAGCTGCAAGAGTTATCTCCTGAAACTAAC
GAGGAAGAAGTGGCAAAACCTGATGATGGAGGAGTGCAGGAGACAGAGAAGGAAGAATGTATTGGTGAAATTCACAGTCCATTAAAGGCTGATGAAAGTGATGATGATGA
TCCTCATAGGTGGGATGAATATAAATTTGATTATTAA
Protein sequenceShow/hide protein sequence
MPHTASRLTPRLFKRKSLKAVSLQRILVVVGQASLMDPLDEILSDSGATSRAGNRFQPKIKPRPKKQTSAPPSATPSQDKKETILDAKSCHDGRGNTKLIQSSSQLPVIK
ENKESEDDLLLATARSDFIGCLHPTSVESAKMVDSMQFDLDSCRSILPSGSTTDVQQLQGWITRPSTFEMVICALSIELCSGWQMKNICLVETCIYNATFKKFLEHPHDI
LLKDAPDFFKEFEGLPLGHNPFLDPCGGEVYFFSLFWALCGVCRVLERLMRNFLWEGFEEGRGAHLVSWEVVEKPVGLGGLELGLVGVKNLTDDAKNLGILNYSHPSAST
VPAPVLDQSGLGSIQTEDRHFGDGNIAGENIDIFYELECLDDFHNQPKNEADHATLKQATISNEDGDLDKQNLEIEECESGADITMDTISSRATNPPEQPACKYTPKPKT
RSAKEDTCTQISQPEISNMLPPSPQVTSCDTRYMHEASIGTHSDGILNDSSINFDGYTSVNQHTEAPVNVESLEFDSYGDILVDDFNSDDQDEMSREVSGKSSEEKPSME
SNISQQQEMFSQVGEEIEHRKTPRKMRKQVSHQLDEPEDGVDENRNFPNEPSSNCDIDGDGYNKNETSKGGTSRGKKATKKSSKPSSDNEKPTRKRKEANKAVPDSQAEK
RPKKFSHSTRRNKRQVNKVLLETPEDEIDFQKISFKDLIIYHEYKEKLEKKEASTRKSATNQRTDTVGEEIFNDGEESLASEQGRGTDDDETPDVVDMTSAYFNYQSFMD
KARRTKWSKKDTELFYQAVRQFGTDFCMIQQLFPGRTRRQIKLKFKSEERHHPFRLSDAITNRAKDHSQFLSLIGLLKEAANKAKQESNQDELTENTGDEELQELSPETN
EEEVAKPDDGGVQETEKEECIGEIHSPLKADESDDDDPHRWDEYKFDY