; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC06g0008 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC06g0008
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionWD_REPEATS_REGION domain-containing protein
Genome locationMC06:79499..100387
RNA-Seq ExpressionMC06g0008
SyntenyMC06g0008
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0003677 - DNA binding (molecular function)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR001471 - AP2/ERF domain
IPR001680 - WD40 repeat
IPR015943 - WD40/YVTN repeat-like-containing domain superfamily
IPR016177 - DNA-binding domain superfamily
IPR036322 - WD40-repeat-containing domain superfamily
IPR036955 - AP2/ERF domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022134731.1 uncharacterized protein LOC111006933 isoform X1 [Momordica charantia]0.092.48Show/hide
Query:  MTQPLKKPKVEELSVSDAQPTAIDAVDECNSTAEEQEAELLALVEHRTREVQHLQHRISYYTRQLEEAEKRLQESESMLARFRGRRYTLSSKSSQDSGFK
        MTQPLKKPKVEELSVSDAQPTAIDAVDECNSTAEEQEAELLALVEHRTREVQHLQHRISYYTRQLEEAEKRLQESESMLARFRGRRYTLSSKSSQDSGFK
Subjt:  MTQPLKKPKVEELSVSDAQPTAIDAVDECNSTAEEQEAELLALVEHRTREVQHLQHRISYYTRQLEEAEKRLQESESMLARFRGRRYTLSSKSSQDSGFK

Query:  CVEAEPRPASPIHANGGSKAITILGSSAQKSPSILNLAMTAEQDRPCTVSSTEGVVEDESDRRKRKFGNTSTFRWVTLQCYLLHATSLVSYSIQYLRSLI
        CVEAEPRPASPIHANGGSKAITILGSSAQKSPSILNLAMTAEQDRPCTVSSTEGVVEDESDRRKRKFGNTSTFRW                         
Subjt:  CVEAEPRPASPIHANGGSKAITILGSSAQKSPSILNLAMTAEQDRPCTVSSTEGVVEDESDRRKRKFGNTSTFRWVTLQCYLLHATSLVSYSIQYLRSLI

Query:  LLFGLVMFACIPEQKDHKELIPLVRSSSSPLTAQCDRSYYFSSQHKRKMRSLAPCPVNDQLFVTSALDGMINLWQVQYKGSSASLLCTTDCMSQKQRRWP
                      KDHKELIPLVRSSSSPLTAQCDRSYYFSSQHKRKMRSLAPCPVNDQLFVTSALDGMINLWQVQYKGSSASLLCTTDCMSQKQRRWP
Subjt:  LLFGLVMFACIPEQKDHKELIPLVRSSSSPLTAQCDRSYYFSSQHKRKMRSLAPCPVNDQLFVTSALDGMINLWQVQYKGSSASLLCTTDCMSQKQRRWP

Query:  EDIAWHPDGNSLFSVYSADSGDSQISILKFNKTKERASVIFLEDKPYVKGIINSISFLPWESVPFITGGSDHAVILWSMKDKKNTWKPELLHRDLHSSAV
        EDIAWHPDGNSLFSVYSADSGDSQISILKFNKTKERASVIFLEDKPYVKGIINSISFLPWESVPFITGGSDHAVILWSMKDKKNTWKPELLHRDLHSSAV
Subjt:  EDIAWHPDGNSLFSVYSADSGDSQISILKFNKTKERASVIFLEDKPYVKGIINSISFLPWESVPFITGGSDHAVILWSMKDKKNTWKPELLHRDLHSSAV

Query:  MGVSGMQLKQIVLSAGADKRILGFDVQVGSVLFKHQLESKCMSVLPNPCDFNLFMVQTGSPEKQLRLFDIRSKQKEVHGFGWKQESSESQSALINQAWSP
        MGVSGMQLKQIVLSAGADKRILGFDVQVGSVLFKHQLESKCMSVLPNPCDFNLFMVQTGSPEKQLRLFDIRSKQKEVHGFGWKQESSESQSALINQAWSP
Subjt:  MGVSGMQLKQIVLSAGADKRILGFDVQVGSVLFKHQLESKCMSVLPNPCDFNLFMVQTGSPEKQLRLFDIRSKQKEVHGFGWKQESSESQSALINQAWSP

Query:  DGLHLTSGSADPVIHVFDIRYNSHMPSCSIKAHQKRVFKAVWLQA
        DGLHLTSGSADPVIHVFDIRYNSHMPSCSIKAHQKRVFKAVWL +
Subjt:  DGLHLTSGSADPVIHVFDIRYNSHMPSCSIKAHQKRVFKAVWLQA

XP_022134734.1 uncharacterized protein LOC111006933 isoform X2 [Momordica charantia]0.091.93Show/hide
Query:  MTQPLKKPKVEELSVSDAQPTAIDAVDECNSTAEEQEAELLALVEHRTREVQHLQHRISYYTRQLEEAEKRLQESESMLARFRGRRYTLSSKSSQDSGFK
        MTQPLKKPKVEELSVSDAQPTAIDAVDECNSTAEEQEAELLALVEHRTREVQHLQHRISYYTRQLEEAEKRLQESESMLARFRGRRYTLSSKSSQDSGFK
Subjt:  MTQPLKKPKVEELSVSDAQPTAIDAVDECNSTAEEQEAELLALVEHRTREVQHLQHRISYYTRQLEEAEKRLQESESMLARFRGRRYTLSSKSSQDSGFK

Query:  CVEAEPRPASPIHANGGSKAITILGSSAQKSPSILNLAMTAEQDRPCTVSSTEGVVEDESDRRKRKFGNTSTFRWVTLQCYLLHATSLVSYSIQYLRSLI
        CVEAEPRPASPIHANG   AITILGSSAQKSPSILNLAMTAEQDRPCTVSSTEGVVEDESDRRKRKFGNTSTFRW                         
Subjt:  CVEAEPRPASPIHANGGSKAITILGSSAQKSPSILNLAMTAEQDRPCTVSSTEGVVEDESDRRKRKFGNTSTFRWVTLQCYLLHATSLVSYSIQYLRSLI

Query:  LLFGLVMFACIPEQKDHKELIPLVRSSSSPLTAQCDRSYYFSSQHKRKMRSLAPCPVNDQLFVTSALDGMINLWQVQYKGSSASLLCTTDCMSQKQRRWP
                      KDHKELIPLVRSSSSPLTAQCDRSYYFSSQHKRKMRSLAPCPVNDQLFVTSALDGMINLWQVQYKGSSASLLCTTDCMSQKQRRWP
Subjt:  LLFGLVMFACIPEQKDHKELIPLVRSSSSPLTAQCDRSYYFSSQHKRKMRSLAPCPVNDQLFVTSALDGMINLWQVQYKGSSASLLCTTDCMSQKQRRWP

Query:  EDIAWHPDGNSLFSVYSADSGDSQISILKFNKTKERASVIFLEDKPYVKGIINSISFLPWESVPFITGGSDHAVILWSMKDKKNTWKPELLHRDLHSSAV
        EDIAWHPDGNSLFSVYSADSGDSQISILKFNKTKERASVIFLEDKPYVKGIINSISFLPWESVPFITGGSDHAVILWSMKDKKNTWKPELLHRDLHSSAV
Subjt:  EDIAWHPDGNSLFSVYSADSGDSQISILKFNKTKERASVIFLEDKPYVKGIINSISFLPWESVPFITGGSDHAVILWSMKDKKNTWKPELLHRDLHSSAV

Query:  MGVSGMQLKQIVLSAGADKRILGFDVQVGSVLFKHQLESKCMSVLPNPCDFNLFMVQTGSPEKQLRLFDIRSKQKEVHGFGWKQESSESQSALINQAWSP
        MGVSGMQLKQIVLSAGADKRILGFDVQVGSVLFKHQLESKCMSVLPNPCDFNLFMVQTGSPEKQLRLFDIRSKQKEVHGFGWKQESSESQSALINQAWSP
Subjt:  MGVSGMQLKQIVLSAGADKRILGFDVQVGSVLFKHQLESKCMSVLPNPCDFNLFMVQTGSPEKQLRLFDIRSKQKEVHGFGWKQESSESQSALINQAWSP

Query:  DGLHLTSGSADPVIHVFDIRYNSHMPSCSIKAHQKRVFKAVWLQA
        DGLHLTSGSADPVIHVFDIRYNSHMPSCSIKAHQKRVFKAVWL +
Subjt:  DGLHLTSGSADPVIHVFDIRYNSHMPSCSIKAHQKRVFKAVWLQA

XP_022134735.1 uncharacterized protein LOC111006933 isoform X3 [Momordica charantia]0.091.38Show/hide
Query:  MTQPLKKPKVEELSVSDAQPTAIDAVDECNSTAEEQEAELLALVEHRTREVQHLQHRISYYTRQLEEAEKRLQESESMLARFRGRRYTLSSKSSQDSGFK
        MTQPLKKPKVEELSVSDAQPTAIDAVDECNSTAEEQEAELLALVEHRTREVQHLQHRISYYTRQLEEAEKRLQESESMLARFRGRRYTLSSKSSQDSGFK
Subjt:  MTQPLKKPKVEELSVSDAQPTAIDAVDECNSTAEEQEAELLALVEHRTREVQHLQHRISYYTRQLEEAEKRLQESESMLARFRGRRYTLSSKSSQDSGFK

Query:  CVEAEPRPASPIHANGGSKAITILGSSAQKSPSILNLAMTAEQDRPCTVSSTEGVVEDESDRRKRKFGNTSTFRWVTLQCYLLHATSLVSYSIQYLRSLI
        CVEAEPRPASPIHANGGSKAITILGSSAQKSPSILNLAMTAEQDRPCTVSSTEGVVEDESDRRKRKF                                 
Subjt:  CVEAEPRPASPIHANGGSKAITILGSSAQKSPSILNLAMTAEQDRPCTVSSTEGVVEDESDRRKRKFGNTSTFRWVTLQCYLLHATSLVSYSIQYLRSLI

Query:  LLFGLVMFACIPEQKDHKELIPLVRSSSSPLTAQCDRSYYFSSQHKRKMRSLAPCPVNDQLFVTSALDGMINLWQVQYKGSSASLLCTTDCMSQKQRRWP
                    EQKDHKELIPLVRSSSSPLTAQCDRSYYFSSQHKRKMRSLAPCPVNDQLFVTSALDGMINLWQVQYKGSSASLLCTTDCMSQKQRRWP
Subjt:  LLFGLVMFACIPEQKDHKELIPLVRSSSSPLTAQCDRSYYFSSQHKRKMRSLAPCPVNDQLFVTSALDGMINLWQVQYKGSSASLLCTTDCMSQKQRRWP

Query:  EDIAWHPDGNSLFSVYSADSGDSQISILKFNKTKERASVIFLEDKPYVKGIINSISFLPWESVPFITGGSDHAVILWSMKDKKNTWKPELLHRDLHSSAV
        EDIAWHPDGNSLFSVYSADSGDSQISILKFNKTKERASVIFLEDKPYVKGIINSISFLPWESVPFITGGSDHAVILWSMKDKKNTWKPELLHRDLHSSAV
Subjt:  EDIAWHPDGNSLFSVYSADSGDSQISILKFNKTKERASVIFLEDKPYVKGIINSISFLPWESVPFITGGSDHAVILWSMKDKKNTWKPELLHRDLHSSAV

Query:  MGVSGMQLKQIVLSAGADKRILGFDVQVGSVLFKHQLESKCMSVLPNPCDFNLFMVQTGSPEKQLRLFDIRSKQKEVHGFGWKQESSESQSALINQAWSP
        MGVSGMQLKQIVLSAGADKRILGFDVQVGSVLFKHQLESKCMSVLPNPCDFNLFMVQTGSPEKQLRLFDIRSKQKEVHGFGWKQESSESQSALINQAWSP
Subjt:  MGVSGMQLKQIVLSAGADKRILGFDVQVGSVLFKHQLESKCMSVLPNPCDFNLFMVQTGSPEKQLRLFDIRSKQKEVHGFGWKQESSESQSALINQAWSP

Query:  DGLHLTSGSADPVIHVFDIRYNSHMPSCSIKAHQKRVFKAVWLQA
        DGLHLTSGSADPVIHVFDIRYNSHMPSCSIKAHQKRVFKAVWL +
Subjt:  DGLHLTSGSADPVIHVFDIRYNSHMPSCSIKAHQKRVFKAVWLQA

XP_022134736.1 uncharacterized protein LOC111006933 isoform X4 [Momordica charantia]0.088.07Show/hide
Query:  MTQPLKKPKVEELSVSDAQPTAIDAVDECNSTAEEQEAELLALVEHRTREVQHLQHRISYYTRQLEEAEKRLQESESMLARFRGRRYTLSSKSSQDSGFK
        MTQPLKKPKVEELSVSDAQPTAIDAVDECNSTAEEQEAELLALVEHRTREVQHLQHRISYYTRQLEEAEKRLQESESMLARFRGRRYTLSSKSSQDSGFK
Subjt:  MTQPLKKPKVEELSVSDAQPTAIDAVDECNSTAEEQEAELLALVEHRTREVQHLQHRISYYTRQLEEAEKRLQESESMLARFRGRRYTLSSKSSQDSGFK

Query:  CVEAEPRPASPIHANGGSKAITILGSSAQKSPSILNLAMTAEQDRPCTVSSTEGVVEDESDRRKRKFGNTSTFRWVTLQCYLLHATSLVSYSIQYLRSLI
        CVEAEPRPASPIHANG                        AEQDRPCTVSSTEGVVEDESDRRKRKFGNTSTFRW                         
Subjt:  CVEAEPRPASPIHANGGSKAITILGSSAQKSPSILNLAMTAEQDRPCTVSSTEGVVEDESDRRKRKFGNTSTFRWVTLQCYLLHATSLVSYSIQYLRSLI

Query:  LLFGLVMFACIPEQKDHKELIPLVRSSSSPLTAQCDRSYYFSSQHKRKMRSLAPCPVNDQLFVTSALDGMINLWQVQYKGSSASLLCTTDCMSQKQRRWP
                      KDHKELIPLVRSSSSPLTAQCDRSYYFSSQHKRKMRSLAPCPVNDQLFVTSALDGMINLWQVQYKGSSASLLCTTDCMSQKQRRWP
Subjt:  LLFGLVMFACIPEQKDHKELIPLVRSSSSPLTAQCDRSYYFSSQHKRKMRSLAPCPVNDQLFVTSALDGMINLWQVQYKGSSASLLCTTDCMSQKQRRWP

Query:  EDIAWHPDGNSLFSVYSADSGDSQISILKFNKTKERASVIFLEDKPYVKGIINSISFLPWESVPFITGGSDHAVILWSMKDKKNTWKPELLHRDLHSSAV
        EDIAWHPDGNSLFSVYSADSGDSQISILKFNKTKERASVIFLEDKPYVKGIINSISFLPWESVPFITGGSDHAVILWSMKDKKNTWKPELLHRDLHSSAV
Subjt:  EDIAWHPDGNSLFSVYSADSGDSQISILKFNKTKERASVIFLEDKPYVKGIINSISFLPWESVPFITGGSDHAVILWSMKDKKNTWKPELLHRDLHSSAV

Query:  MGVSGMQLKQIVLSAGADKRILGFDVQVGSVLFKHQLESKCMSVLPNPCDFNLFMVQTGSPEKQLRLFDIRSKQKEVHGFGWKQESSESQSALINQAWSP
        MGVSGMQLKQIVLSAGADKRILGFDVQVGSVLFKHQLESKCMSVLPNPCDFNLFMVQTGSPEKQLRLFDIRSKQKEVHGFGWKQESSESQSALINQAWSP
Subjt:  MGVSGMQLKQIVLSAGADKRILGFDVQVGSVLFKHQLESKCMSVLPNPCDFNLFMVQTGSPEKQLRLFDIRSKQKEVHGFGWKQESSESQSALINQAWSP

Query:  DGLHLTSGSADPVIHVFDIRYNSHMPSCSIKAHQKRVFKAVWLQA
        DGLHLTSGSADPVIHVFDIRYNSHMPSCSIKAHQKRVFKAVWL +
Subjt:  DGLHLTSGSADPVIHVFDIRYNSHMPSCSIKAHQKRVFKAVWLQA

XP_022134737.1 uncharacterized protein LOC111006933 isoform X5 [Momordica charantia]0.086.97Show/hide
Query:  MTQPLKKPKVEELSVSDAQPTAIDAVDECNSTAEEQEAELLALVEHRTREVQHLQHRISYYTRQLEEAEKRLQESESMLARFRGRRYTLSSKSSQDSGFK
        MTQPLKKPKVEELSVSDAQPTAIDAVDECNSTAEEQEAELLALVEHRTREVQHLQHRISYYTRQLEEAEKRLQESESMLARFRGRRYTLSSKSSQDSGFK
Subjt:  MTQPLKKPKVEELSVSDAQPTAIDAVDECNSTAEEQEAELLALVEHRTREVQHLQHRISYYTRQLEEAEKRLQESESMLARFRGRRYTLSSKSSQDSGFK

Query:  CVEAEPRPASPIHANGGSKAITILGSSAQKSPSILNLAMTAEQDRPCTVSSTEGVVEDESDRRKRKFGNTSTFRWVTLQCYLLHATSLVSYSIQYLRSLI
        CVEAEPRPASPIHANG                        AEQDRPCTVSSTEGVVEDESDRRKRKF                                 
Subjt:  CVEAEPRPASPIHANGGSKAITILGSSAQKSPSILNLAMTAEQDRPCTVSSTEGVVEDESDRRKRKFGNTSTFRWVTLQCYLLHATSLVSYSIQYLRSLI

Query:  LLFGLVMFACIPEQKDHKELIPLVRSSSSPLTAQCDRSYYFSSQHKRKMRSLAPCPVNDQLFVTSALDGMINLWQVQYKGSSASLLCTTDCMSQKQRRWP
                    EQKDHKELIPLVRSSSSPLTAQCDRSYYFSSQHKRKMRSLAPCPVNDQLFVTSALDGMINLWQVQYKGSSASLLCTTDCMSQKQRRWP
Subjt:  LLFGLVMFACIPEQKDHKELIPLVRSSSSPLTAQCDRSYYFSSQHKRKMRSLAPCPVNDQLFVTSALDGMINLWQVQYKGSSASLLCTTDCMSQKQRRWP

Query:  EDIAWHPDGNSLFSVYSADSGDSQISILKFNKTKERASVIFLEDKPYVKGIINSISFLPWESVPFITGGSDHAVILWSMKDKKNTWKPELLHRDLHSSAV
        EDIAWHPDGNSLFSVYSADSGDSQISILKFNKTKERASVIFLEDKPYVKGIINSISFLPWESVPFITGGSDHAVILWSMKDKKNTWKPELLHRDLHSSAV
Subjt:  EDIAWHPDGNSLFSVYSADSGDSQISILKFNKTKERASVIFLEDKPYVKGIINSISFLPWESVPFITGGSDHAVILWSMKDKKNTWKPELLHRDLHSSAV

Query:  MGVSGMQLKQIVLSAGADKRILGFDVQVGSVLFKHQLESKCMSVLPNPCDFNLFMVQTGSPEKQLRLFDIRSKQKEVHGFGWKQESSESQSALINQAWSP
        MGVSGMQLKQIVLSAGADKRILGFDVQVGSVLFKHQLESKCMSVLPNPCDFNLFMVQTGSPEKQLRLFDIRSKQKEVHGFGWKQESSESQSALINQAWSP
Subjt:  MGVSGMQLKQIVLSAGADKRILGFDVQVGSVLFKHQLESKCMSVLPNPCDFNLFMVQTGSPEKQLRLFDIRSKQKEVHGFGWKQESSESQSALINQAWSP

Query:  DGLHLTSGSADPVIHVFDIRYNSHMPSCSIKAHQKRVFKAVWLQA
        DGLHLTSGSADPVIHVFDIRYNSHMPSCSIKAHQKRVFKAVWL +
Subjt:  DGLHLTSGSADPVIHVFDIRYNSHMPSCSIKAHQKRVFKAVWLQA

TrEMBL top hitse value%identityAlignment
A0A6J1BYM9 uncharacterized protein LOC111006933 isoform X40.088.07Show/hide
Query:  MTQPLKKPKVEELSVSDAQPTAIDAVDECNSTAEEQEAELLALVEHRTREVQHLQHRISYYTRQLEEAEKRLQESESMLARFRGRRYTLSSKSSQDSGFK
        MTQPLKKPKVEELSVSDAQPTAIDAVDECNSTAEEQEAELLALVEHRTREVQHLQHRISYYTRQLEEAEKRLQESESMLARFRGRRYTLSSKSSQDSGFK
Subjt:  MTQPLKKPKVEELSVSDAQPTAIDAVDECNSTAEEQEAELLALVEHRTREVQHLQHRISYYTRQLEEAEKRLQESESMLARFRGRRYTLSSKSSQDSGFK

Query:  CVEAEPRPASPIHANGGSKAITILGSSAQKSPSILNLAMTAEQDRPCTVSSTEGVVEDESDRRKRKFGNTSTFRWVTLQCYLLHATSLVSYSIQYLRSLI
        CVEAEPRPASPIHANG                        AEQDRPCTVSSTEGVVEDESDRRKRKFGNTSTFRW                         
Subjt:  CVEAEPRPASPIHANGGSKAITILGSSAQKSPSILNLAMTAEQDRPCTVSSTEGVVEDESDRRKRKFGNTSTFRWVTLQCYLLHATSLVSYSIQYLRSLI

Query:  LLFGLVMFACIPEQKDHKELIPLVRSSSSPLTAQCDRSYYFSSQHKRKMRSLAPCPVNDQLFVTSALDGMINLWQVQYKGSSASLLCTTDCMSQKQRRWP
                      KDHKELIPLVRSSSSPLTAQCDRSYYFSSQHKRKMRSLAPCPVNDQLFVTSALDGMINLWQVQYKGSSASLLCTTDCMSQKQRRWP
Subjt:  LLFGLVMFACIPEQKDHKELIPLVRSSSSPLTAQCDRSYYFSSQHKRKMRSLAPCPVNDQLFVTSALDGMINLWQVQYKGSSASLLCTTDCMSQKQRRWP

Query:  EDIAWHPDGNSLFSVYSADSGDSQISILKFNKTKERASVIFLEDKPYVKGIINSISFLPWESVPFITGGSDHAVILWSMKDKKNTWKPELLHRDLHSSAV
        EDIAWHPDGNSLFSVYSADSGDSQISILKFNKTKERASVIFLEDKPYVKGIINSISFLPWESVPFITGGSDHAVILWSMKDKKNTWKPELLHRDLHSSAV
Subjt:  EDIAWHPDGNSLFSVYSADSGDSQISILKFNKTKERASVIFLEDKPYVKGIINSISFLPWESVPFITGGSDHAVILWSMKDKKNTWKPELLHRDLHSSAV

Query:  MGVSGMQLKQIVLSAGADKRILGFDVQVGSVLFKHQLESKCMSVLPNPCDFNLFMVQTGSPEKQLRLFDIRSKQKEVHGFGWKQESSESQSALINQAWSP
        MGVSGMQLKQIVLSAGADKRILGFDVQVGSVLFKHQLESKCMSVLPNPCDFNLFMVQTGSPEKQLRLFDIRSKQKEVHGFGWKQESSESQSALINQAWSP
Subjt:  MGVSGMQLKQIVLSAGADKRILGFDVQVGSVLFKHQLESKCMSVLPNPCDFNLFMVQTGSPEKQLRLFDIRSKQKEVHGFGWKQESSESQSALINQAWSP

Query:  DGLHLTSGSADPVIHVFDIRYNSHMPSCSIKAHQKRVFKAVWLQA
        DGLHLTSGSADPVIHVFDIRYNSHMPSCSIKAHQKRVFKAVWL +
Subjt:  DGLHLTSGSADPVIHVFDIRYNSHMPSCSIKAHQKRVFKAVWLQA

A0A6J1BZ58 uncharacterized protein LOC111006933 isoform X30.091.38Show/hide
Query:  MTQPLKKPKVEELSVSDAQPTAIDAVDECNSTAEEQEAELLALVEHRTREVQHLQHRISYYTRQLEEAEKRLQESESMLARFRGRRYTLSSKSSQDSGFK
        MTQPLKKPKVEELSVSDAQPTAIDAVDECNSTAEEQEAELLALVEHRTREVQHLQHRISYYTRQLEEAEKRLQESESMLARFRGRRYTLSSKSSQDSGFK
Subjt:  MTQPLKKPKVEELSVSDAQPTAIDAVDECNSTAEEQEAELLALVEHRTREVQHLQHRISYYTRQLEEAEKRLQESESMLARFRGRRYTLSSKSSQDSGFK

Query:  CVEAEPRPASPIHANGGSKAITILGSSAQKSPSILNLAMTAEQDRPCTVSSTEGVVEDESDRRKRKFGNTSTFRWVTLQCYLLHATSLVSYSIQYLRSLI
        CVEAEPRPASPIHANGGSKAITILGSSAQKSPSILNLAMTAEQDRPCTVSSTEGVVEDESDRRKRKF                                 
Subjt:  CVEAEPRPASPIHANGGSKAITILGSSAQKSPSILNLAMTAEQDRPCTVSSTEGVVEDESDRRKRKFGNTSTFRWVTLQCYLLHATSLVSYSIQYLRSLI

Query:  LLFGLVMFACIPEQKDHKELIPLVRSSSSPLTAQCDRSYYFSSQHKRKMRSLAPCPVNDQLFVTSALDGMINLWQVQYKGSSASLLCTTDCMSQKQRRWP
                    EQKDHKELIPLVRSSSSPLTAQCDRSYYFSSQHKRKMRSLAPCPVNDQLFVTSALDGMINLWQVQYKGSSASLLCTTDCMSQKQRRWP
Subjt:  LLFGLVMFACIPEQKDHKELIPLVRSSSSPLTAQCDRSYYFSSQHKRKMRSLAPCPVNDQLFVTSALDGMINLWQVQYKGSSASLLCTTDCMSQKQRRWP

Query:  EDIAWHPDGNSLFSVYSADSGDSQISILKFNKTKERASVIFLEDKPYVKGIINSISFLPWESVPFITGGSDHAVILWSMKDKKNTWKPELLHRDLHSSAV
        EDIAWHPDGNSLFSVYSADSGDSQISILKFNKTKERASVIFLEDKPYVKGIINSISFLPWESVPFITGGSDHAVILWSMKDKKNTWKPELLHRDLHSSAV
Subjt:  EDIAWHPDGNSLFSVYSADSGDSQISILKFNKTKERASVIFLEDKPYVKGIINSISFLPWESVPFITGGSDHAVILWSMKDKKNTWKPELLHRDLHSSAV

Query:  MGVSGMQLKQIVLSAGADKRILGFDVQVGSVLFKHQLESKCMSVLPNPCDFNLFMVQTGSPEKQLRLFDIRSKQKEVHGFGWKQESSESQSALINQAWSP
        MGVSGMQLKQIVLSAGADKRILGFDVQVGSVLFKHQLESKCMSVLPNPCDFNLFMVQTGSPEKQLRLFDIRSKQKEVHGFGWKQESSESQSALINQAWSP
Subjt:  MGVSGMQLKQIVLSAGADKRILGFDVQVGSVLFKHQLESKCMSVLPNPCDFNLFMVQTGSPEKQLRLFDIRSKQKEVHGFGWKQESSESQSALINQAWSP

Query:  DGLHLTSGSADPVIHVFDIRYNSHMPSCSIKAHQKRVFKAVWLQA
        DGLHLTSGSADPVIHVFDIRYNSHMPSCSIKAHQKRVFKAVWL +
Subjt:  DGLHLTSGSADPVIHVFDIRYNSHMPSCSIKAHQKRVFKAVWLQA

A0A6J1BZL7 uncharacterized protein LOC111006933 isoform X20.091.93Show/hide
Query:  MTQPLKKPKVEELSVSDAQPTAIDAVDECNSTAEEQEAELLALVEHRTREVQHLQHRISYYTRQLEEAEKRLQESESMLARFRGRRYTLSSKSSQDSGFK
        MTQPLKKPKVEELSVSDAQPTAIDAVDECNSTAEEQEAELLALVEHRTREVQHLQHRISYYTRQLEEAEKRLQESESMLARFRGRRYTLSSKSSQDSGFK
Subjt:  MTQPLKKPKVEELSVSDAQPTAIDAVDECNSTAEEQEAELLALVEHRTREVQHLQHRISYYTRQLEEAEKRLQESESMLARFRGRRYTLSSKSSQDSGFK

Query:  CVEAEPRPASPIHANGGSKAITILGSSAQKSPSILNLAMTAEQDRPCTVSSTEGVVEDESDRRKRKFGNTSTFRWVTLQCYLLHATSLVSYSIQYLRSLI
        CVEAEPRPASPIHANG   AITILGSSAQKSPSILNLAMTAEQDRPCTVSSTEGVVEDESDRRKRKFGNTSTFRW                         
Subjt:  CVEAEPRPASPIHANGGSKAITILGSSAQKSPSILNLAMTAEQDRPCTVSSTEGVVEDESDRRKRKFGNTSTFRWVTLQCYLLHATSLVSYSIQYLRSLI

Query:  LLFGLVMFACIPEQKDHKELIPLVRSSSSPLTAQCDRSYYFSSQHKRKMRSLAPCPVNDQLFVTSALDGMINLWQVQYKGSSASLLCTTDCMSQKQRRWP
                      KDHKELIPLVRSSSSPLTAQCDRSYYFSSQHKRKMRSLAPCPVNDQLFVTSALDGMINLWQVQYKGSSASLLCTTDCMSQKQRRWP
Subjt:  LLFGLVMFACIPEQKDHKELIPLVRSSSSPLTAQCDRSYYFSSQHKRKMRSLAPCPVNDQLFVTSALDGMINLWQVQYKGSSASLLCTTDCMSQKQRRWP

Query:  EDIAWHPDGNSLFSVYSADSGDSQISILKFNKTKERASVIFLEDKPYVKGIINSISFLPWESVPFITGGSDHAVILWSMKDKKNTWKPELLHRDLHSSAV
        EDIAWHPDGNSLFSVYSADSGDSQISILKFNKTKERASVIFLEDKPYVKGIINSISFLPWESVPFITGGSDHAVILWSMKDKKNTWKPELLHRDLHSSAV
Subjt:  EDIAWHPDGNSLFSVYSADSGDSQISILKFNKTKERASVIFLEDKPYVKGIINSISFLPWESVPFITGGSDHAVILWSMKDKKNTWKPELLHRDLHSSAV

Query:  MGVSGMQLKQIVLSAGADKRILGFDVQVGSVLFKHQLESKCMSVLPNPCDFNLFMVQTGSPEKQLRLFDIRSKQKEVHGFGWKQESSESQSALINQAWSP
        MGVSGMQLKQIVLSAGADKRILGFDVQVGSVLFKHQLESKCMSVLPNPCDFNLFMVQTGSPEKQLRLFDIRSKQKEVHGFGWKQESSESQSALINQAWSP
Subjt:  MGVSGMQLKQIVLSAGADKRILGFDVQVGSVLFKHQLESKCMSVLPNPCDFNLFMVQTGSPEKQLRLFDIRSKQKEVHGFGWKQESSESQSALINQAWSP

Query:  DGLHLTSGSADPVIHVFDIRYNSHMPSCSIKAHQKRVFKAVWLQA
        DGLHLTSGSADPVIHVFDIRYNSHMPSCSIKAHQKRVFKAVWL +
Subjt:  DGLHLTSGSADPVIHVFDIRYNSHMPSCSIKAHQKRVFKAVWLQA

A0A6J1C0F9 uncharacterized protein LOC111006933 isoform X50.086.97Show/hide
Query:  MTQPLKKPKVEELSVSDAQPTAIDAVDECNSTAEEQEAELLALVEHRTREVQHLQHRISYYTRQLEEAEKRLQESESMLARFRGRRYTLSSKSSQDSGFK
        MTQPLKKPKVEELSVSDAQPTAIDAVDECNSTAEEQEAELLALVEHRTREVQHLQHRISYYTRQLEEAEKRLQESESMLARFRGRRYTLSSKSSQDSGFK
Subjt:  MTQPLKKPKVEELSVSDAQPTAIDAVDECNSTAEEQEAELLALVEHRTREVQHLQHRISYYTRQLEEAEKRLQESESMLARFRGRRYTLSSKSSQDSGFK

Query:  CVEAEPRPASPIHANGGSKAITILGSSAQKSPSILNLAMTAEQDRPCTVSSTEGVVEDESDRRKRKFGNTSTFRWVTLQCYLLHATSLVSYSIQYLRSLI
        CVEAEPRPASPIHANG                        AEQDRPCTVSSTEGVVEDESDRRKRKF                                 
Subjt:  CVEAEPRPASPIHANGGSKAITILGSSAQKSPSILNLAMTAEQDRPCTVSSTEGVVEDESDRRKRKFGNTSTFRWVTLQCYLLHATSLVSYSIQYLRSLI

Query:  LLFGLVMFACIPEQKDHKELIPLVRSSSSPLTAQCDRSYYFSSQHKRKMRSLAPCPVNDQLFVTSALDGMINLWQVQYKGSSASLLCTTDCMSQKQRRWP
                    EQKDHKELIPLVRSSSSPLTAQCDRSYYFSSQHKRKMRSLAPCPVNDQLFVTSALDGMINLWQVQYKGSSASLLCTTDCMSQKQRRWP
Subjt:  LLFGLVMFACIPEQKDHKELIPLVRSSSSPLTAQCDRSYYFSSQHKRKMRSLAPCPVNDQLFVTSALDGMINLWQVQYKGSSASLLCTTDCMSQKQRRWP

Query:  EDIAWHPDGNSLFSVYSADSGDSQISILKFNKTKERASVIFLEDKPYVKGIINSISFLPWESVPFITGGSDHAVILWSMKDKKNTWKPELLHRDLHSSAV
        EDIAWHPDGNSLFSVYSADSGDSQISILKFNKTKERASVIFLEDKPYVKGIINSISFLPWESVPFITGGSDHAVILWSMKDKKNTWKPELLHRDLHSSAV
Subjt:  EDIAWHPDGNSLFSVYSADSGDSQISILKFNKTKERASVIFLEDKPYVKGIINSISFLPWESVPFITGGSDHAVILWSMKDKKNTWKPELLHRDLHSSAV

Query:  MGVSGMQLKQIVLSAGADKRILGFDVQVGSVLFKHQLESKCMSVLPNPCDFNLFMVQTGSPEKQLRLFDIRSKQKEVHGFGWKQESSESQSALINQAWSP
        MGVSGMQLKQIVLSAGADKRILGFDVQVGSVLFKHQLESKCMSVLPNPCDFNLFMVQTGSPEKQLRLFDIRSKQKEVHGFGWKQESSESQSALINQAWSP
Subjt:  MGVSGMQLKQIVLSAGADKRILGFDVQVGSVLFKHQLESKCMSVLPNPCDFNLFMVQTGSPEKQLRLFDIRSKQKEVHGFGWKQESSESQSALINQAWSP

Query:  DGLHLTSGSADPVIHVFDIRYNSHMPSCSIKAHQKRVFKAVWLQA
        DGLHLTSGSADPVIHVFDIRYNSHMPSCSIKAHQKRVFKAVWL +
Subjt:  DGLHLTSGSADPVIHVFDIRYNSHMPSCSIKAHQKRVFKAVWLQA

A0A6J1C2U7 uncharacterized protein LOC111006933 isoform X10.092.48Show/hide
Query:  MTQPLKKPKVEELSVSDAQPTAIDAVDECNSTAEEQEAELLALVEHRTREVQHLQHRISYYTRQLEEAEKRLQESESMLARFRGRRYTLSSKSSQDSGFK
        MTQPLKKPKVEELSVSDAQPTAIDAVDECNSTAEEQEAELLALVEHRTREVQHLQHRISYYTRQLEEAEKRLQESESMLARFRGRRYTLSSKSSQDSGFK
Subjt:  MTQPLKKPKVEELSVSDAQPTAIDAVDECNSTAEEQEAELLALVEHRTREVQHLQHRISYYTRQLEEAEKRLQESESMLARFRGRRYTLSSKSSQDSGFK

Query:  CVEAEPRPASPIHANGGSKAITILGSSAQKSPSILNLAMTAEQDRPCTVSSTEGVVEDESDRRKRKFGNTSTFRWVTLQCYLLHATSLVSYSIQYLRSLI
        CVEAEPRPASPIHANGGSKAITILGSSAQKSPSILNLAMTAEQDRPCTVSSTEGVVEDESDRRKRKFGNTSTFRW                         
Subjt:  CVEAEPRPASPIHANGGSKAITILGSSAQKSPSILNLAMTAEQDRPCTVSSTEGVVEDESDRRKRKFGNTSTFRWVTLQCYLLHATSLVSYSIQYLRSLI

Query:  LLFGLVMFACIPEQKDHKELIPLVRSSSSPLTAQCDRSYYFSSQHKRKMRSLAPCPVNDQLFVTSALDGMINLWQVQYKGSSASLLCTTDCMSQKQRRWP
                      KDHKELIPLVRSSSSPLTAQCDRSYYFSSQHKRKMRSLAPCPVNDQLFVTSALDGMINLWQVQYKGSSASLLCTTDCMSQKQRRWP
Subjt:  LLFGLVMFACIPEQKDHKELIPLVRSSSSPLTAQCDRSYYFSSQHKRKMRSLAPCPVNDQLFVTSALDGMINLWQVQYKGSSASLLCTTDCMSQKQRRWP

Query:  EDIAWHPDGNSLFSVYSADSGDSQISILKFNKTKERASVIFLEDKPYVKGIINSISFLPWESVPFITGGSDHAVILWSMKDKKNTWKPELLHRDLHSSAV
        EDIAWHPDGNSLFSVYSADSGDSQISILKFNKTKERASVIFLEDKPYVKGIINSISFLPWESVPFITGGSDHAVILWSMKDKKNTWKPELLHRDLHSSAV
Subjt:  EDIAWHPDGNSLFSVYSADSGDSQISILKFNKTKERASVIFLEDKPYVKGIINSISFLPWESVPFITGGSDHAVILWSMKDKKNTWKPELLHRDLHSSAV

Query:  MGVSGMQLKQIVLSAGADKRILGFDVQVGSVLFKHQLESKCMSVLPNPCDFNLFMVQTGSPEKQLRLFDIRSKQKEVHGFGWKQESSESQSALINQAWSP
        MGVSGMQLKQIVLSAGADKRILGFDVQVGSVLFKHQLESKCMSVLPNPCDFNLFMVQTGSPEKQLRLFDIRSKQKEVHGFGWKQESSESQSALINQAWSP
Subjt:  MGVSGMQLKQIVLSAGADKRILGFDVQVGSVLFKHQLESKCMSVLPNPCDFNLFMVQTGSPEKQLRLFDIRSKQKEVHGFGWKQESSESQSALINQAWSP

Query:  DGLHLTSGSADPVIHVFDIRYNSHMPSCSIKAHQKRVFKAVWLQA
        DGLHLTSGSADPVIHVFDIRYNSHMPSCSIKAHQKRVFKAVWL +
Subjt:  DGLHLTSGSADPVIHVFDIRYNSHMPSCSIKAHQKRVFKAVWLQA

SwissProt top hitse value%identityAlignment
P93007 Ethylene-responsive transcription factor ERF1121.6e-2345.75Show/hide
Query:  EKALFPMYSARSQHDMSAMVCALAEVIKSNRTPSDHTTSQPQPPQPQPHEDNDQQGRRSGGSCSRRRHYRGVRQRPWGKWAAEIRDPKKAARVWLGTFDT
        ++ L P   A ++ +   + C       S    SD  +     P P   +D          S SR+R+YRGVRQRPWGKWAAEIRDP KAARVWLGTFDT
Subjt:  EKALFPMYSARSQHDMSAMVCALAEVIKSNRTPSDHTTSQPQPPQPQPHEDNDQQGRRSGGSCSRRRHYRGVRQRPWGKWAAEIRDPKKAARVWLGTFDT

Query:  AEAAALAYDQAALSFKGTKAKLNFPERLQPQPHNFFFNAPPPHPHNLFFNAPP
        AE AALAYD+AA  F+G KAKLNFPE ++  P   +    P   H+     PP
Subjt:  AEAAALAYDQAALSFKGTKAKLNFPERLQPQPHNFFFNAPPPHPHNLFFNAPP

Q70II3 Ethylene-responsive transcription factor ERF1109.5e-2478.57Show/hide
Query:  RRRHYRGVRQRPWGKWAAEIRDPKKAARVWLGTFDTAEAAALAYDQAALSFKGTKAKLNFPE--RLQPQP
        ++R YRGVRQRPWGKWAAEIRDP +AARVWLGTFDTAEAAA AYD+AAL F+G KAKLNFPE  R+ P P
Subjt:  RRRHYRGVRQRPWGKWAAEIRDPKKAARVWLGTFDTAEAAALAYDQAALSFKGTKAKLNFPE--RLQPQP

Q9FH54 Ethylene-responsive transcription factor ERF1144.4e-3750.52Show/hide
Query:  MHGKRTVCSD--DESEEEKALFPMYSARSQHDMSAMVCALAEVIKSNRTPSDHTTS----------QPQPPQPQPHEDNDQQGRRSGGSCSRRRHYRGVR
        M+GKR    D  +E EE++ LFP++SARSQHDM  MV AL +VI + ++ S    S           PQ P  Q    +  QG        RRRHYRGVR
Subjt:  MHGKRTVCSD--DESEEEKALFPMYSARSQHDMSAMVCALAEVIKSNRTPSDHTTS----------QPQPPQPQPHEDNDQQGRRSGGSCSRRRHYRGVR

Query:  QRPWGKWAAEIRDPKKAARVWLGTFDTAEAAALAYDQAALSFKGTKAKLNFPERLQPQPHNFFFNA---PPPHPHNLFFNAPPPTFNSSHQD
        QRPWGKWAAEIRDPKKAARVWLGTF+TAE+AALAYD+AAL FKG+KAKLNFPER+Q   ++ ++++   P   P ++      P +N  + D
Subjt:  QRPWGKWAAEIRDPKKAARVWLGTFDTAEAAALAYDQAALSFKGTKAKLNFPERLQPQPHNFFFNA---PPPHPHNLFFNAPPPTFNSSHQD

Q9LY29 Ethylene-responsive transcription factor ERF1151.1e-3551.38Show/hide
Query:  HGKRTVCSDDESEEEKA-----LFPMYSARSQHDMSAMVCALAEVIKSNRTPSDHTTSQP-----QPPQPQPHEDNDQQGRRSGGSCSRRRHYRGVRQRP
        +GKR    D+  E+++A     +FP +SARSQ+DM AMV AL +VI +  +  D+   QP     Q P P      DQ          R+RHYRGVRQRP
Subjt:  HGKRTVCSDDESEEEKA-----LFPMYSARSQHDMSAMVCALAEVIKSNRTPSDHTTSQP-----QPPQPQPHEDNDQQGRRSGGSCSRRRHYRGVRQRP

Query:  WGKWAAEIRDPKKAARVWLGTFDTAEAAALAYDQAALSFKGTKAKLNFPERLQPQPHNFFFNAPPPH--PHNLFFNAPPPT
        WGKWAAEIRDP+KAARVWLGTF+TAEAAALAYD AAL FKG+KAKLNFPER Q   +      PP +   +N  + + P T
Subjt:  WGKWAAEIRDPKKAARVWLGTFDTAEAAALAYDQAALSFKGTKAKLNFPERLQPQPHNFFFNAPPPH--PHNLFFNAPPPT

Q9LYU3 Ethylene-responsive transcription factor ERF1133.4e-2954.61Show/hide
Query:  MVCALAEVIKSNRTPSDHTTSQPQPPQPQPHEDNDQQGRRSGGSCSRRRHYRGVRQRPWGKWAAEIRDPKKAARVWLGTFDTAEAAALAYDQAALSFKGT
        MV AL+ VI++   P+D    Q      Q   D DQ          RRRHYRGVRQRPWGKWAAEIRDPKKAARVWLGTF+TAE AALAYD+AAL FKGT
Subjt:  MVCALAEVIKSNRTPSDHTTSQPQPPQPQPHEDNDQQGRRSGGSCSRRRHYRGVRQRPWGKWAAEIRDPKKAARVWLGTFDTAEAAALAYDQAALSFKGT

Query:  KAKLNFPERLQ-----------PQPHNFFFNAPPPHPHNLFFNAPPPTFNSS
        KAKLNFPER+Q           P+  +   N+PPP P       PP T  +S
Subjt:  KAKLNFPERLQ-----------PQPHNFFFNAPPPHPHNLFFNAPPPTFNSS

Arabidopsis top hitse value%identityAlignment
AT5G07310.1 Integrase-type DNA-binding superfamily protein7.6e-3751.38Show/hide
Query:  HGKRTVCSDDESEEEKA-----LFPMYSARSQHDMSAMVCALAEVIKSNRTPSDHTTSQP-----QPPQPQPHEDNDQQGRRSGGSCSRRRHYRGVRQRP
        +GKR    D+  E+++A     +FP +SARSQ+DM AMV AL +VI +  +  D+   QP     Q P P      DQ          R+RHYRGVRQRP
Subjt:  HGKRTVCSDDESEEEKA-----LFPMYSARSQHDMSAMVCALAEVIKSNRTPSDHTTSQP-----QPPQPQPHEDNDQQGRRSGGSCSRRRHYRGVRQRP

Query:  WGKWAAEIRDPKKAARVWLGTFDTAEAAALAYDQAALSFKGTKAKLNFPERLQPQPHNFFFNAPPPH--PHNLFFNAPPPT
        WGKWAAEIRDP+KAARVWLGTF+TAEAAALAYD AAL FKG+KAKLNFPER Q   +      PP +   +N  + + P T
Subjt:  WGKWAAEIRDPKKAARVWLGTFDTAEAAALAYDQAALSFKGTKAKLNFPERLQPQPHNFFFNAPPPH--PHNLFFNAPPPT

AT5G13330.1 related to AP2 6l2.4e-3054.61Show/hide
Query:  MVCALAEVIKSNRTPSDHTTSQPQPPQPQPHEDNDQQGRRSGGSCSRRRHYRGVRQRPWGKWAAEIRDPKKAARVWLGTFDTAEAAALAYDQAALSFKGT
        MV AL+ VI++   P+D    Q      Q   D DQ          RRRHYRGVRQRPWGKWAAEIRDPKKAARVWLGTF+TAE AALAYD+AAL FKGT
Subjt:  MVCALAEVIKSNRTPSDHTTSQPQPPQPQPHEDNDQQGRRSGGSCSRRRHYRGVRQRPWGKWAAEIRDPKKAARVWLGTFDTAEAAALAYDQAALSFKGT

Query:  KAKLNFPERLQ-----------PQPHNFFFNAPPPHPHNLFFNAPPPTFNSS
        KAKLNFPER+Q           P+  +   N+PPP P       PP T  +S
Subjt:  KAKLNFPERLQ-----------PQPHNFFFNAPPPHPHNLFFNAPPPTFNSS

AT5G19920.1 Transducin/WD40 repeat-like superfamily protein7.0e-10755.76Show/hide
Query:  EQKDHKELIPLVRSSSSPLTAQCDRSYYFSSQHKRKMRSLAPCPVNDQLFVTSALDGMINLWQVQYKGSSASLLCTTDCMSQKQRRWPEDIAWHPDGNSL
        E + H ELI L+  SS   T +   +    S H ++MRSLA  P N +LF TSALDG ++ W++Q   S+A+L  T + ++  Q++W EDIAWHP  N+L
Subjt:  EQKDHKELIPLVRSSSSPLTAQCDRSYYFSSQHKRKMRSLAPCPVNDQLFVTSALDGMINLWQVQYKGSSASLLCTTDCMSQKQRRWPEDIAWHPDGNSL

Query:  FSVYSADSGDSQISILKFNKTKERASVIFLEDKPYVKGIINSISFLPWESVPFITGGSDHAVILWSMKDKKNTWKPELLHRDLHSSAVMGVSGMQLKQIV
        FSVY+AD G  QIS +  N+  ER    F+ED+P+ KG+IN I F PW+   FITGGSDHAV+LW  + + N WKP LLHRDLHSSAVMGV+GM+    V
Subjt:  FSVYSADSGDSQISILKFNKTKERASVIFLEDKPYVKGIINSISFLPWESVPFITGGSDHAVILWSMKDKKNTWKPELLHRDLHSSAVMGVSGMQLKQIV

Query:  LSAGADKRILGFDVQVGSVLFKHQLESKCMSVLPNPCDFNLFMVQTGSPEKQLRLFDIRSKQKEVHGFGWKQESSESQSALINQAWSPDGLHLTSGSADP
        LS G D+R +GFD +   V FKH+L+++C +++PNP D NL MV T   ++QLRL+D+R  Q E+  FGWKQESSESQSALINQ+WSPDGLH++SGSADP
Subjt:  LSAGADKRILGFDVQVGSVLFKHQLESKCMSVLPNPCDFNLFMVQTGSPEKQLRLFDIRSKQKEVHGFGWKQESSESQSALINQAWSPDGLHLTSGSADP

Query:  VIHVFDIRYNSHMPSCSIKAHQKRVFKAVW
        VIH+FDIRYN+  PS S+KAH+KRVFKA W
Subjt:  VIHVFDIRYNSHMPSCSIKAHQKRVFKAVW

AT5G50970.1 transducin family protein / WD-40 repeat family protein2.1e-14351.55Show/hide
Query:  KKPKVEELSVSDAQPTAIDAVDECN-STAEEQEAELLALVEHRTREVQHLQHRISYYTRQLEEAEKRLQESESMLARFRGRRY-TLSSKSSQDSGFKCVE
        KKPK++E           D  +E N S  EEQE  L+ALVEHR+ E++ L + IS Y  +L EAE+ LQ S++ LA+ RG    ++S         K + 
Subjt:  KKPKVEELSVSDAQPTAIDAVDECN-STAEEQEAELLALVEHRTREVQHLQHRISYYTRQLEEAEKRLQESESMLARFRGRRY-TLSSKSSQDSGFKCVE

Query:  --------AEPRPASPIHANGGS-KAITILGSSAQKSPSILNLAMTAEQDRPCTVSSTEGVVEDESDRRKRKFGNTSTFRWVTLQCYLLHATSLVSYSIQ
                A P P+  +  +  S    +  GSS  K+ +++ +   +E  R        G+        KRKF                           
Subjt:  --------AEPRPASPIHANGGS-KAITILGSSAQKSPSILNLAMTAEQDRPCTVSSTEGVVEDESDRRKRKFGNTSTFRWVTLQCYLLHATSLVSYSIQ

Query:  YLRSLILLFGLVMFACIPEQKDHKELIPLVRSSSSPLTAQCDRSYYFSSQHKRKMRSLAPCPVNDQLFVTSALDGMINLWQVQYKGSSASLLCTTDCMSQ
                          EQK+HKELI L+  +SSP T +C  S   SSQHKRK+RSL  CPVN+QLF TS+LDGM++LWQ+Q     ASLL TTDC+S+
Subjt:  YLRSLILLFGLVMFACIPEQKDHKELIPLVRSSSSPLTAQCDRSYYFSSQHKRKMRSLAPCPVNDQLFVTSALDGMINLWQVQYKGSSASLLCTTDCMSQ

Query:  KQRRWPEDIAWHPDGNSLFSVYSADSGDSQISILKFNKTKERASVIFLEDKPYVKGIINSISFLPWESVPFITGGSDHAVILWS-MKDKKNTWKPELLHR
        KQRRW ED+AWHP GN+LFSVY+AD GDSQISIL  NKT+E   V FLE+KP+VKGIIN+I F+PWE+  F+TGGSDHAV+LW+   D++N WK + LHR
Subjt:  KQRRWPEDIAWHPDGNSLFSVYSADSGDSQISILKFNKTKERASVIFLEDKPYVKGIINSISFLPWESVPFITGGSDHAVILWS-MKDKKNTWKPELLHR

Query:  DLHSSAVMGVSGMQLKQIVLSAGADKRILGFDVQVGSVLFKHQLESKCMSVLPNPCDFNLFMVQTGSPEKQLRLFDIRSKQKEVHGFGWKQESSESQSAL
        +LHS+AVMGV GM+ K ++LS GADKRI GFDVQVG   +KHQ++ KCMSVL NPCDFNLFMVQ+G PEKQLRLFDIR ++ E+H FGWKQ+SSESQSAL
Subjt:  DLHSSAVMGVSGMQLKQIVLSAGADKRILGFDVQVGSVLFKHQLESKCMSVLPNPCDFNLFMVQTGSPEKQLRLFDIRSKQKEVHGFGWKQESSESQSAL

Query:  INQAWSPDGLHLTSGSADPVIHVFDIRYNSHMPSCSIKAHQKRVFKAVW
        INQ+WSPDGL++TSGS DPVIHVFDIRYN+  P+ SIKAHQKRVFKA W
Subjt:  INQAWSPDGLHLTSGSADPVIHVFDIRYNSHMPSCSIKAHQKRVFKAVW

AT5G61890.1 Integrase-type DNA-binding superfamily protein3.1e-3850.52Show/hide
Query:  MHGKRTVCSD--DESEEEKALFPMYSARSQHDMSAMVCALAEVIKSNRTPSDHTTS----------QPQPPQPQPHEDNDQQGRRSGGSCSRRRHYRGVR
        M+GKR    D  +E EE++ LFP++SARSQHDM  MV AL +VI + ++ S    S           PQ P  Q    +  QG        RRRHYRGVR
Subjt:  MHGKRTVCSD--DESEEEKALFPMYSARSQHDMSAMVCALAEVIKSNRTPSDHTTS----------QPQPPQPQPHEDNDQQGRRSGGSCSRRRHYRGVR

Query:  QRPWGKWAAEIRDPKKAARVWLGTFDTAEAAALAYDQAALSFKGTKAKLNFPERLQPQPHNFFFNA---PPPHPHNLFFNAPPPTFNSSHQD
        QRPWGKWAAEIRDPKKAARVWLGTF+TAE+AALAYD+AAL FKG+KAKLNFPER+Q   ++ ++++   P   P ++      P +N  + D
Subjt:  QRPWGKWAAEIRDPKKAARVWLGTFDTAEAAALAYDQAALSFKGTKAKLNFPERLQPQPHNFFFNA---PPPHPHNLFFNAPPPTFNSSHQD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACCCAGCCTCTGAAGAAACCAAAAGTGGAGGAACTTTCCGTTTCCGATGCCCAACCAACTGCAATCGACGCGGTGGATGAATGCAATAGCACCGCGGAAGAGCAAGA
GGCGGAGCTCTTGGCACTAGTCGAGCATCGTACTCGGGAAGTTCAACATCTGCAGCACCGTATTTCCTACTACACTCGTCAGCTAGAGGAAGCAGAGAAGCGATTGCAGG
AATCTGAATCTATGTTGGCACGCTTTCGAGGTCGGCGTTATACTCTTTCATCAAAAAGTTCTCAAGACTCTGGATTCAAATGTGTGGAGGCTGAGCCTAGACCAGCCAGT
CCTATTCATGCCAATGGAGGTTCCAAAGCAATAACAATTCTTGGTTCCAGTGCTCAAAAGAGCCCTTCAATTCTCAATCTTGCCATGACAGCTGAACAAGACAGGCCTTG
TACAGTATCTTCTACTGAGGGAGTTGTTGAAGATGAAAGTGATAGAAGAAAAAGAAAGTTTGGTAATACTTCAACCTTCCGTTGGGTAACCTTGCAGTGCTATCTTCTTC
ATGCAACTTCGTTAGTTTCATATTCTATACAGTACCTCAGGTCCCTCATTCTTTTGTTTGGACTCGTGATGTTTGCATGTATACCAGAGCAGAAAGATCACAAAGAATTG
ATTCCATTAGTACGTAGTAGCTCATCGCCATTAACTGCCCAATGTGATAGAAGTTATTATTTCTCAAGTCAGCACAAGAGAAAAATGAGAAGTCTTGCTCCATGTCCAGT
TAATGACCAGCTTTTTGTGACCAGTGCTTTGGATGGAATGATCAACTTGTGGCAAGTTCAGTACAAGGGGTCATCGGCCTCTCTACTTTGTACTACTGACTGTATGTCTC
AAAAGCAGAGAAGATGGCCTGAAGATATAGCTTGGCACCCGGATGGAAACAGCCTATTTTCTGTGTACAGTGCCGATAGTGGTGACTCTCAAATATCGATCTTGAAATTC
AACAAGACTAAAGAGAGGGCCTCTGTGATTTTCTTAGAGGATAAGCCTTATGTTAAAGGCATTATCAACAGCATCAGTTTCTTGCCCTGGGAATCTGTCCCTTTCATTAC
TGGTGGCAGTGACCATGCTGTCATTCTATGGAGTATGAAAGATAAAAAGAATACATGGAAGCCAGAACTGTTGCACAGAGACCTGCATTCTTCAGCTGTCATGGGCGTCT
CTGGGATGCAGCTAAAGCAGATCGTACTTTCTGCTGGTGCAGACAAGAGGATCCTTGGTTTTGATGTTCAAGTAGGAAGCGTACTCTTCAAGCATCAATTAGAGAGCAAA
TGTATGAGTGTCTTGCCAAATCCATGTGATTTTAACCTGTTCATGGTCCAAACAGGGAGTCCAGAAAAGCAACTTCGGCTGTTTGACATTAGGTCAAAACAGAAAGAAGT
CCACGGTTTTGGGTGGAAGCAAGAAAGCAGTGAATCTCAATCAGCTCTGATAAACCAGGCGTGGTCTCCTGATGGTTTGCACCTAACATCTGGTTCAGCTGATCCTGTAA
TTCATGTTTTTGATATCAGATATAATTCTCACATGCCATCTTGTTCAATTAAAGCTCATCAGAAACGTGTCTTTAAAGCTGTTTGGCTCCAGGCTCCGATTGGAAAACAA
ATGCATGGGAAGAGAACGGTATGTTCGGATGATGAATCGGAGGAAGAGAAGGCGTTGTTTCCGATGTACTCGGCTAGATCTCAACACGACATGTCGGCCATGGTGTGTGC
CCTAGCTGAAGTCATCAAATCCAACCGAACCCCTTCAGATCATACAACTTCACAACCGCAACCACCACAACCGCAGCCACATGAAGATAATGATCAACAAGGCAGGCGTA
GCGGCGGAAGCTGCAGCAGAAGAAGACACTATCGTGGAGTAAGGCAGAGACCGTGGGGCAAGTGGGCGGCCGAGATCCGTGATCCGAAAAAGGCAGCTAGAGTTTGGCTG
GGGACTTTCGACACTGCCGAGGCTGCCGCACTTGCTTACGACCAAGCTGCCCTCAGCTTCAAAGGAACCAAAGCAAAGCTCAATTTCCCAGAGAGGCTTCAGCCTCAGCC
TCACAACTTCTTCTTCAACGCTCCTCCTCCTCACCCTCACAACCTCTTCTTCAACGCTCCTCCTCCCACTTTTAATTCTTCTCATCAGGATCACGAACAGGAACAGGAAC
AAGATGCTGCACCCCCCAACTGA
mRNA sequenceShow/hide mRNA sequence
ATTTTGAAATTTTAAGCACTAAAATAAAACAAACTTAAATTAAATTTAAACATTCCTCCGAAAAAAAAAAATTAAATCTAAACATTTATATTTCTGAAAGCAATCACTGT
TTTCTGGCCTGGAATTCATTCGACGCTGTTGTTCTACTTCTCAACTCAACTTCGGTTAAAAAATCTCTGCTGCGATGGAGCACCGCGTTTCATTGTTCTGCAAATTCGTG
ACCTCAGGAGTGGATGTATTCCTCGGAACTTCTCTCGGCATGCCTTCACCCCCTCACCCAAACGGCATAGAAACTTAAACATGACGCACACGATCCTCTCTATGCAGATA
TACCCAAACCCTGTACGAGGAAGTCCGAAAGAAAACCCTATACCACTGATACGAAGCTTTTGATTCTGAGGGCAAAAGAGGAGAGAGAAGCCAGAAAGTCCCCCCCCCNT
GCTGGTGCCTGATCTTGTCCACGTAGCTCAAAGACTCTACCTAGCTTGGAAATCGCTCCTCTTCGGCATGCCGAGGCTCGTGGACGCCATTCCCGTTCATCTATGTAGTT
GCCTGTTGTACGATTCTACATTATCTGTATGATTCATCTCCTCTAAGTGTGAGTCATCCTTCTTCTCCTTTGCTTATCCTGAAAAATATAATTATATCTGGAAGGACCAT
TACGGAGCTGTGATACATAGGAATTGAAGGAGCAGTTACCTACGAATGCAAGCATAACTTTATTGATATGTGAAAAGTTACAATGTGTATGAAAAACTGCAATAAGAAGA
TTACAATAAAACATGGCAATTACAACAAATATCACTATTAGAGTAACTAAGCTACTAAATGCTCTCGATTTGACACAAAAAGAGGTTATTTGGAATTTAACTAATTCCAC
CAAATCCACTCAGCTAGAAAACTTGTTCAATCTGCCCATTAAATCTGGCACTGAGTTTTATTTGTCTGTCCGCCAGTGGTATTGAACTTCCATTTTGCAGACTACCTTGC
GGGTGAAATGGCCCTCTTGGGAGACTTGACTGACCATCAGTGTTACAGTTTGGTTGATTTGATCTAGAGGTGGGTGAGTTGTAGTTCTGAAGCCCCTTATTTGGTGGGAA
AGAAAGGCTCAAGAAATTTGCCTGTGCTATCTAGTTCATAGGATTTTGTTTCTAAAGACAGAAATCAATGCTTGTGGTGTCTCACATCTTTAGAAGAATGTCAAATTTAA
TAGGAGATCTTTAAGAGTGGCTCACTTTAGAAGACTCGTAGCTTTTGGCATTTCCTTCCTCCGTAGAACCTAGTAAAGCTGATGAGACATGCTAAACAATTCTCCTGTGG
AAGTAAAAGATGGAAAGAAGATTAAGTTTTGGGAAGACGCTGGTGGACACTAAACCATTAAGCTTCTCTGACCACTATGCAATGGTGGAGCCTAAGCTAAAGACAGTTAA
TAGACTTTGGGATGGGATTTCGTGAAGTTGGAATATAGTTACTCGAAGGTAGTGAATTTGGCGAACTTAATGACTTGAAGGTGTTGGAATTAAACAGACAGGTGGCGGAT
AGGGTGAGAGATTATATATTTTTAAGGTTGGGAGATGTCTCATCTTCTCCCTGCTATAAAAATTGTTTCTCATATAGAGGGGTAACATATCTGTTTATCAAAAGCAAATC
ACATTGATGGGCTTAAATTTGACCTGCAAACTGGGATTATACTATTATTGGTTGAATGTTCTGCCCAATGATATATACATGTTGGCTGAGTTAAGGATCTGATAGACTTA
CTTGCTGAAGGTTGTTTCCCTCAGGTTCTGCTTCAAGGTTCATATTGGACATGAAGGTCATGAAATTTGAACCTGCAGTGGTGCGAAGAGTGGGTTTTGAAATGCTACTC
ACAACTGGAGGAAGAGAGGAGTTCACGTTTTTTCCCCCAAGTGTTGCCATCTTTATGATCGTGTAGGGAAACCAAGAGCCAGGCATGGCGAGAGGTGTGGTGTACAAAGG
ATACCTGCCATATTGGAGCTACGCATCCAAGCTGGTGTTGATCTTGAAAAAGTACCCCTCTAAAAGACGAACAAAACCTGTATATAGTATTGAAGGGAGAATTGCAGAAT
TTGAGTTGGTGAAGAAAAGAACGAGGTAGGGACATGCCATTTCACCAATAGTAGTGATAATTTTGGAAAGTCTGATTTCGCGATATATCTCAAAGAAATGAGTAGGTGTT
CATTCTTATGGAATATTATGGATCGGCAGAATGAAGAAGAGGCTAAGGTCTTGGGCATCTTCCTTCGAGAATGGGAATATCAAGGACAGTGTCGGGCGGCATAGGTGTGC
CGGCATTGTGCCTCGTCCCATGGGCTAGAGGTTGCACATAGGGGCCGTTATGCGATACGGTATGCTCGAAAGACAAGGCACGGCCGCTGGAGTCGTTGTGCCTCGATTAT
GTCCGGATGGATGGATGTTCGAAAGGTCCCGATCATAAATACAAGTCCATGGGTCGATTCTCGGTAAGTGTGCTCAAGTGTCAGCGAAGCCCGATGACATCTTGATGCTC
GAGGTGGCATCCTACAGTACCGGTGGGCGCTCGGGAAGTGAGAAAAAGCTAGTCTTGGCCTGGTAAGCGGCAAGACTCATCTCGGTTGTCAAGTGTAGTCATTCCTTGTC
CGGATCTCGGCAGGAATCATCCCGGCAGTACCCGGTAAGGCCGTTTAGCCCGTTATTGACTAGTTTCTCGGCAGGAATGGTCTTAGAGGCGTCCGATGGGTCCGATAAGT
TAGTTTTGACATAGTTAAGCGGCAGGAATCTCCCGGTTGTACCCGATTGGCATTTTAAGCTCGTTCTGACCCTGTTATAGGTCAGAACTCATCCCGGTAAGATCTTGGCT
TATCGTAGGAGTGGGACGGCCGAGTGGCTGAGAACCAAGATGGTTGAAAACAAGTTGCGACGTCGCGTTTAGAAGACCTTGGCCTAGAGTAGAGGCAAGGTAGAGTCGTG
GGGCTTGGCTTAGAGTAGAGGCAAGTCGGCTAGAGTCGCGATAGAGTTTAAATGCACGTACGGGCGACGTGCATTGCCAAGATAAGTTAGAATTCCGCTGCAAGTGAAGC
TCGATCTTAGGAAGAAAGAACCTCGCCTAAGGTCTGACGAAGCCACAAAAGGCGCAAAAAGTGGATTGGGGCTAGAGTTCCCCGTAAGACTGACCATGGTCGAGCTTAGA
TCATGGCGCCGCACGGTTGAAAAATGTACGACCGTGACACATGGCAGGAGGCAAAGACAGACAAATGATGTTGTGGGTGCCAATCATATATGGCATGTGAGAAAGGGCCT
CCTCTAGATAACAAGTTGAAGAGATTTTACGGCAAAATTTCGGCGGTGGTTGAACTGTGCGTGCAAGCAACTGCACCTATTTCAAATCAGCATAGGAGCATGATGAGGCT
CGATGTTGTTCTTCCACACCATGATGACATTGCTCTTAATTCCTCAAATGGAGAATCTGTTGAGCGTCTCACACTCAGAGAGATGTTGAATCACCATAATATGGAGATCA
GAGTTCGGTAGTGCAGTGCAACGACTTCTTGTGGATATGGGATATGGGATGGTTTGGTTGTAGCCTATCTTGATTTCGCTTGGTGTGTCTCTTCACTTGGCCGGATTCCC
ACGATCAATGGTGGAGAATCTTTGTTCGCCTGGTCTATAGGGCAAACCCAAGCGGAATACGAATGACCCAGCCTCTGAAGAAACCAAAAGTGGAGGAACTTTCCGTTTCC
GATGCCCAACCAACTGCAATCGACGCGGTGGATGAATGCAATAGCACCGCGGAAGAGCAAGAGGCGGAGCTCTTGGCACTAGTCGAGCATCGTACTCGGGAAGTTCAACA
TCTGCAGCACCGTATTTCCTACTACACTCGTCAGCTAGAGGAAGCAGAGAAGCGATTGCAGGAATCTGAATCTATGTTGGCACGCTTTCGAGGTCGGCGTTATACTCTTT
CATCAAAAAGTTCTCAAGACTCTGGATTCAAATGTGTGGAGGCTGAGCCTAGACCAGCCAGTCCTATTCATGCCAATGGAGGTTCCAAAGCAATAACAATTCTTGGTTCC
AGTGCTCAAAAGAGCCCTTCAATTCTCAATCTTGCCATGACAGCTGAACAAGACAGGCCTTGTACAGTATCTTCTACTGAGGGAGTTGTTGAAGATGAAAGTGATAGAAG
AAAAAGAAAGTTTGGTAATACTTCAACCTTCCGTTGGGTAACCTTGCAGTGCTATCTTCTTCATGCAACTTCGTTAGTTTCATATTCTATACAGTACCTCAGGTCCCTCA
TTCTTTTGTTTGGACTCGTGATGTTTGCATGTATACCAGAGCAGAAAGATCACAAAGAATTGATTCCATTAGTACGTAGTAGCTCATCGCCATTAACTGCCCAATGTGAT
AGAAGTTATTATTTCTCAAGTCAGCACAAGAGAAAAATGAGAAGTCTTGCTCCATGTCCAGTTAATGACCAGCTTTTTGTGACCAGTGCTTTGGATGGAATGATCAACTT
GTGGCAAGTTCAGTACAAGGGGTCATCGGCCTCTCTACTTTGTACTACTGACTGTATGTCTCAAAAGCAGAGAAGATGGCCTGAAGATATAGCTTGGCACCCGGATGGAA
ACAGCCTATTTTCTGTGTACAGTGCCGATAGTGGTGACTCTCAAATATCGATCTTGAAATTCAACAAGACTAAAGAGAGGGCCTCTGTGATTTTCTTAGAGGATAAGCCT
TATGTTAAAGGCATTATCAACAGCATCAGTTTCTTGCCCTGGGAATCTGTCCCTTTCATTACTGGTGGCAGTGACCATGCTGTCATTCTATGGAGTATGAAAGATAAAAA
GAATACATGGAAGCCAGAACTGTTGCACAGAGACCTGCATTCTTCAGCTGTCATGGGCGTCTCTGGGATGCAGCTAAAGCAGATCGTACTTTCTGCTGGTGCAGACAAGA
GGATCCTTGGTTTTGATGTTCAAGTAGGAAGCGTACTCTTCAAGCATCAATTAGAGAGCAAATGTATGAGTGTCTTGCCAAATCCATGTGATTTTAACCTGTTCATGGTC
CAAACAGGGAGTCCAGAAAAGCAACTTCGGCTGTTTGACATTAGGTCAAAACAGAAAGAAGTCCACGGTTTTGGGTGGAAGCAAGAAAGCAGTGAATCTCAATCAGCTCT
GATAAACCAGGCGTGGTCTCCTGATGGTTTGCACCTAACATCTGGTTCAGCTGATCCTGTAATTCATGTTTTTGATATCAGATATAATTCTCACATGCCATCTTGTTCAA
TTAAAGCTCATCAGAAACGTGTCTTTAAAGCTGTTTGGCTCCAGGCTCCGATTGGAAAACAAATGCATGGGAAGAGAACGGTATGTTCGGATGATGAATCGGAGGAAGAG
AAGGCGTTGTTTCCGATGTACTCGGCTAGATCTCAACACGACATGTCGGCCATGGTGTGTGCCCTAGCTGAAGTCATCAAATCCAACCGAACCCCTTCAGATCATACAAC
TTCACAACCGCAACCACCACAACCGCAGCCACATGAAGATAATGATCAACAAGGCAGGCGTAGCGGCGGAAGCTGCAGCAGAAGAAGACACTATCGTGGAGTAAGGCAGA
GACCGTGGGGCAAGTGGGCGGCCGAGATCCGTGATCCGAAAAAGGCAGCTAGAGTTTGGCTGGGGACTTTCGACACTGCCGAGGCTGCCGCACTTGCTTACGACCAAGCT
GCCCTCAGCTTCAAAGGAACCAAAGCAAAGCTCAATTTCCCAGAGAGGCTTCAGCCTCAGCCTCACAACTTCTTCTTCAACGCTCCTCCTCCTCACCCTCACAACCTCTT
CTTCAACGCTCCTCCTCCCACTTTTAATTCTTCTCATCAGGATCACGAACAGGAACAGGAACAAGATGCTGCACCCCCCAACTGATTCTACTCTCTATCCCATTGTTGTC
CGCGTCTTTTCTTTTCATTTTTCAACGTCTCAACATTTCCATCGACATCCATATGTATCCAATGGGTTCAACATCAACCTAACAAGTAAATCGAATATTTCCGTAATATT
ACAACAAAATATCACATAATAATATTAGTATGCATCATTTTAAACTTATTTAACTAGTATAATGCTTATTTGATGAAAACTTCTTATTTTTATAGTCTTTCAAGAGATAT
CCATCGACATCGACATTTAAAATCTAGCTTCCATATAAGGTCATGCGAATAGAACCCACTTCGTACTTTATCTAACTTAATAATTTAAC
Protein sequenceShow/hide protein sequence
MTQPLKKPKVEELSVSDAQPTAIDAVDECNSTAEEQEAELLALVEHRTREVQHLQHRISYYTRQLEEAEKRLQESESMLARFRGRRYTLSSKSSQDSGFKCVEAEPRPAS
PIHANGGSKAITILGSSAQKSPSILNLAMTAEQDRPCTVSSTEGVVEDESDRRKRKFGNTSTFRWVTLQCYLLHATSLVSYSIQYLRSLILLFGLVMFACIPEQKDHKEL
IPLVRSSSSPLTAQCDRSYYFSSQHKRKMRSLAPCPVNDQLFVTSALDGMINLWQVQYKGSSASLLCTTDCMSQKQRRWPEDIAWHPDGNSLFSVYSADSGDSQISILKF
NKTKERASVIFLEDKPYVKGIINSISFLPWESVPFITGGSDHAVILWSMKDKKNTWKPELLHRDLHSSAVMGVSGMQLKQIVLSAGADKRILGFDVQVGSVLFKHQLESK
CMSVLPNPCDFNLFMVQTGSPEKQLRLFDIRSKQKEVHGFGWKQESSESQSALINQAWSPDGLHLTSGSADPVIHVFDIRYNSHMPSCSIKAHQKRVFKAVWLQAPIGKQ
MHGKRTVCSDDESEEEKALFPMYSARSQHDMSAMVCALAEVIKSNRTPSDHTTSQPQPPQPQPHEDNDQQGRRSGGSCSRRRHYRGVRQRPWGKWAAEIRDPKKAARVWL
GTFDTAEAAALAYDQAALSFKGTKAKLNFPERLQPQPHNFFFNAPPPHPHNLFFNAPPPTFNSSHQDHEQEQEQDAAPPN