; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS000416 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS000416
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionC2H2-type domain-containing protein
Genome locationscaffold44:1551528..1556591
RNA-Seq ExpressionMS000416
SyntenyMS000416
Gene Ontology termsNA
InterPro domainsIPR013087 - Zinc finger C2H2-type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6576717.1 U-box domain-containing protein 32, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0085.5Show/hide
Query:  TLSASAPITIRSVAFSLSWNRSLFFLLMPVAKLKTSNYPDVMKSEEGNDSLDTIIRQAIGKEPFLSFSRAGESPVQWIQLLHALDQQELPGWPLLSPLKV
        TLS SAP TIRSVAF LSWNRSL FLLMPV+KLKTSNYPDVMKSEEGNDSLDTIIRQAIGKEPFLSFSRAGESPVQWIQLLHALDQQELPGWPLLSPLK+
Subjt:  TLSASAPITIRSVAFSLSWNRSLFFLLMPVAKLKTSNYPDVMKSEEGNDSLDTIIRQAIGKEPFLSFSRAGESPVQWIQLLHALDQQELPGWPLLSPLKV

Query:  QMQKCEKCAREFCSVINYRRHIRVHHRLKKLDKDSAKSRDLLAAFWDKLSWEEAKEVVSFKNVLFEASHSSGIQGSAIIKNLTALIGKPGFSALPHVYLR
        QMQKCEKCAREFCSVINYRRHIRVHHRLKKLDKDSAKSRDLLAAFWDKL+ EE KE VSFKNV  E     GIQGSA+IKNLT +IGKPGFSALPHVYLR
Subjt:  QMQKCEKCAREFCSVINYRRHIRVHHRLKKLDKDSAKSRDLLAAFWDKLSWEEAKEVVSFKNVLFEASHSSGIQGSAIIKNLTALIGKPGFSALPHVYLR

Query:  AGSALLDIVQGRPSRFPMTSQELFEILDNASEKTFLCGTAVSMQKYVFDGEAAKIGLETKNLVACMSFLVEQKVVKAWLADKDAEALRCQKLLVEEEEAA
        AGSALLDIVQGRPSRFP++SQE FEILDNASEKTFLCGTAVSMQKY+FDGEA KIGLETKNLVACMSF++EQK+VK WLADKDAEALRCQKLLVEEEEAA
Subjt:  AGSALLDIVQGRPSRFPMTSQELFEILDNASEKTFLCGTAVSMQKYVFDGEAAKIGLETKNLVACMSFLVEQKVVKAWLADKDAEALRCQKLLVEEEEAA

Query:  QRRQAELLERKRQKKLRQKEQRSKEQKHEEKGDMEGSVDETIEDVSPEESSSPQTEYHSEGESRGILPDHIPSSFEASQQSLTDEDEDSESHSGFRSLYP
        QRRQAELLERKRQKKLRQKEQR KEQKHEEK DMEGSVDET EDV  EESSSPQTE HSE +S GIL DH PSS E SQQSLTDEDEDSESHSGFRS YP
Subjt:  QRRQAELLERKRQKKLRQKEQRSKEQKHEEKGDMEGSVDETIEDVSPEESSSPQTEYHSEGESRGILPDHIPSSFEASQQSLTDEDEDSESHSGFRSLYP

Query:  EYLPIDQNGEQQKVQMNGHKHVIAQWQALPKIQRGLSNGFRADQNYQGLKNGDMRRHGNHVQSRAAPIVNGKKVWSRKPKPERDGDRFQARIQEEPTAQA
        E+LPID NGE QK  MNGHKHVIAQWQALPK QRGLSNGFRA+QNYQGLKNGDMRRHGNHVQSRAAPIVNGKKVWSRKPKPERDGDRFQARIQEE TAQA
Subjt:  EYLPIDQNGEQQKVQMNGHKHVIAQWQALPKIQRGLSNGFRADQNYQGLKNGDMRRHGNHVQSRAAPIVNGKKVWSRKPKPERDGDRFQARIQEEPTAQA

Query:  EEIKSHEVLIGSISVALGNCNQESKEPAGAQDDCQDGHQTPRKI-NHLEKFIKPDSTQTATNRVMVKLWRPVSRNGSKHAMPDQSENGESEAEVITEKLE
        EEIKSHEVLIGSISVAL NC QESKE  GA D CQDGHQTP+KI NHLEKFIK +S QTATNR MVK WRPVSRNG+K AMPDQSE+GESEAE+IT+K E
Subjt:  EEIKSHEVLIGSISVALGNCNQESKEPAGAQDDCQDGHQTPRKI-NHLEKFIKPDSTQTATNRVMVKLWRPVSRNGSKHAMPDQSENGESEAEVITEKLE

Query:  DQALLNTYSPRSSSLDGDTGDFGNNSSLVQEEEPVQPVGLEFSSRAAKAFLAQRWKEAITAEHVKLNLPSDSESSGCFEVKNETETSSGLFQSSNIEKRN
        DQALLNTYSPRSSSLDGDTGD G NS +   EEP QPV LEFSS AAKAFLAQRWKEAITA+HVKLNLPSDSESS CFE +N+ ETSS  FQ SN ++  
Subjt:  DQALLNTYSPRSSSLDGDTGDFGNNSSLVQEEEPVQPVGLEFSSRAAKAFLAQRWKEAITAEHVKLNLPSDSESSGCFEVKNETETSSGLFQSSNIEKRN

Query:  VLGNAIVINLEAPKCSANEGGKTKFRT---KGAKIKYIPKLRTTT
           +AI I LE PK SANE GKTK RT   KGAKIKYIPK+RTTT
Subjt:  VLGNAIVINLEAPKCSANEGGKTKFRT---KGAKIKYIPKLRTTT

KAG7014763.1 hypothetical protein SDJN02_22392, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0084.87Show/hide
Query:  FFSLCGNS-PKKQLLSSPPITFSNFPGPLFLRLPETLSASAPITIRSVAFSLSWNRSLFFLLMPVAKLKTSNYPDVMKSEEGNDSLDTIIRQAIGKEPFL
        FFSL   +  +KQLLSSP ITFSNFPGP       TLS SAP TIRSVAF LSWNRSL FLLMPV+KLKTSNYPDVMKSEEGNDSLDTIIRQAIGKEPFL
Subjt:  FFSLCGNS-PKKQLLSSPPITFSNFPGPLFLRLPETLSASAPITIRSVAFSLSWNRSLFFLLMPVAKLKTSNYPDVMKSEEGNDSLDTIIRQAIGKEPFL

Query:  SFSRAGESPVQWIQLLHALDQQELPGWPLLSPLKVQMQKCEKCAREFCSVINYRRHIRVHHRLKKLDKDSAKSRDLLAAFWDKLSWEEAKEVVSFKNVLF
        SFSRAGESPVQWIQLLHALDQQELPGWPLLSPLK+QMQKCEKCAREFCSVINYRRHIRVHHRLKKLDKDSAKSRDLLAAFWDKL+ EE KE VSFKNV  
Subjt:  SFSRAGESPVQWIQLLHALDQQELPGWPLLSPLKVQMQKCEKCAREFCSVINYRRHIRVHHRLKKLDKDSAKSRDLLAAFWDKLSWEEAKEVVSFKNVLF

Query:  EASHSSGIQGSAIIKNLTALIGKPGFSALPHVYLRAGSALLDIVQGRPSRFPMTSQELFEILDNASEKTFLCGTAVSMQKYVFDGEAAKIGLETKNLVAC
        EASHSSGIQGSA+IKNLT +IGKPGFSALPHVYLRAGSALLDIVQGRPSRFP++SQE FEILDNASEKTFLCGTAVSMQKY+FDGEA KIGLETKNLVAC
Subjt:  EASHSSGIQGSAIIKNLTALIGKPGFSALPHVYLRAGSALLDIVQGRPSRFPMTSQELFEILDNASEKTFLCGTAVSMQKYVFDGEAAKIGLETKNLVAC

Query:  MSFLVEQKVVKAWLADKDAEALRCQKLLVEEEEAAQRRQAELLERKRQKKLRQKEQRSKEQKHEEKGDMEGSVDETIEDVSPEESSSPQTEYHSEGESRG
        MSF++EQK+VK WLADKDAEALRCQKLLVEEEEAAQRRQAELLERKRQKKLRQKEQR KEQKHEEK DMEGSVDET EDV  EESSSPQTE HSE +S G
Subjt:  MSFLVEQKVVKAWLADKDAEALRCQKLLVEEEEAAQRRQAELLERKRQKKLRQKEQRSKEQKHEEKGDMEGSVDETIEDVSPEESSSPQTEYHSEGESRG

Query:  ILPDHIPSSFEASQQSLTDEDEDSESHSGFRSLYPEYLPIDQNGEQQKVQMNGHKHVIAQWQALPKIQRGLSNGFRADQNYQGLKNGDMRRHGNHVQSRA
        IL DH PSS E SQQSLTDEDEDSESHSGFRS YPE+LPID NGE QK  MNGHKHVIAQWQALPK QRGLSNGFRA+QNYQGLKNGDMRRHGNHVQSRA
Subjt:  ILPDHIPSSFEASQQSLTDEDEDSESHSGFRSLYPEYLPIDQNGEQQKVQMNGHKHVIAQWQALPKIQRGLSNGFRADQNYQGLKNGDMRRHGNHVQSRA

Query:  APIVNGKKVWSRKPKPERDGDRFQARIQEEPTAQAEEIKSHEVLIGSISVALGNCNQESKEPAGAQDDCQDGHQTPRKI-NHLEKFIKPDSTQTATNRVM
        APIVNGKKVWSRKPKPERDGDRFQARIQEE TAQAEEIKSHEVLIGSISVAL NC QESKE  GA D CQDGHQTP+KI NHLEKFIK +S QTATNR M
Subjt:  APIVNGKKVWSRKPKPERDGDRFQARIQEEPTAQAEEIKSHEVLIGSISVALGNCNQESKEPAGAQDDCQDGHQTPRKI-NHLEKFIKPDSTQTATNRVM

Query:  VKLWRPVSRNGSKHAMPDQSENGESEAEVITEKLEDQALLNTYSPRSSSLDGDTGDFGNNSSLVQEEEPVQPVGLEFSSRAAKAFLAQRWKEAITAEHVK
        VK WRPVSRNG+K AMPDQSE+GESEAE+IT+K EDQALLNTYSPRSSSLDGDTGD G NS +   EEP QPV LEFSS AAKAFLAQRWKEAITA+HVK
Subjt:  VKLWRPVSRNGSKHAMPDQSENGESEAEVITEKLEDQALLNTYSPRSSSLDGDTGDFGNNSSLVQEEEPVQPVGLEFSSRAAKAFLAQRWKEAITAEHVK

Query:  LNLPSDSESSGCFEVKNETETSSGLFQSSNIEKRNVLGNAIVINLEAPKCSANEGGKTKFRT---KGAKIKYIPKLRTTT
        LNLPSDSESS CFE +N+ ETSS  FQ SN ++     +AI I LE PK SANE GKTK RT   KGAKIKYIPK+RTTT
Subjt:  LNLPSDSESSGCFEVKNETETSSGLFQSSNIEKRNVLGNAIVINLEAPKCSANEGGKTKFRT---KGAKIKYIPKLRTTT

XP_022141004.1 uncharacterized protein LOC111011517 isoform X1 [Momordica charantia]0.0e+0099.02Show/hide
Query:  MPVAKLKTSNYPDVMKSEEGNDSLDTIIRQAIGKEPFLSFSRAGESPVQWIQLLHALDQQELPGWPLLSPLKVQMQKCEKCAREFCSVINYRRHIRVHHR
        MPVAKLKTSNYPDVMKSEEGNDSLDTIIRQAIGKEPFLSFSRAGESPVQWIQLLHALDQQELPGWPLLSPLKVQMQKCEKCAREFCSVINYRRHIRVHHR
Subjt:  MPVAKLKTSNYPDVMKSEEGNDSLDTIIRQAIGKEPFLSFSRAGESPVQWIQLLHALDQQELPGWPLLSPLKVQMQKCEKCAREFCSVINYRRHIRVHHR

Query:  LKKLDKDSAKSRDLLAAFWDKLSWEEAKEVVSFKNVLFEASHSSGIQGSAIIKNLTALIGKPGFSALPHVYLRAGSALLDIVQGRPSRFPMTSQELFEIL
        LKKLDKDSAKSRDLLAAFWDKLSWEEAKEVVSFKNVLFE     GIQGSAIIKNLTALIGKPGFSALPHVYLRAGSALLDIVQGRPSRFPMTSQELFEIL
Subjt:  LKKLDKDSAKSRDLLAAFWDKLSWEEAKEVVSFKNVLFEASHSSGIQGSAIIKNLTALIGKPGFSALPHVYLRAGSALLDIVQGRPSRFPMTSQELFEIL

Query:  DNASEKTFLCGTAVSMQKYVFDGEAAKIGLETKNLVACMSFLVEQKVVKAWLADKDAEALRCQKLLVEEEEAAQRRQAELLERKRQKKLRQKEQRSKEQK
        DNASEKTFLCGTAVSMQKYVFDGEA KIGLETKNLVACMSFLVEQKVVKAWLADKDAEALRCQKLLVEEEEAAQRRQAELLERKRQKKLRQKEQRSKEQK
Subjt:  DNASEKTFLCGTAVSMQKYVFDGEAAKIGLETKNLVACMSFLVEQKVVKAWLADKDAEALRCQKLLVEEEEAAQRRQAELLERKRQKKLRQKEQRSKEQK

Query:  HEEKGDMEGSVDETIEDVSPEESSSPQTEYHSEGESRGILPDHIPSSFEASQQSLTDEDEDSESHSGFRSLYPEYLPIDQNGEQQKVQMNGHKHVIAQWQ
        HEEKGDMEGSVDETIEDVSPEESSSPQTEYHSEGESRGILPDHIPSSFEASQQSLTDEDEDSESHSGFRSLYPEYLPIDQNGEQQKVQMNGHKHVIAQWQ
Subjt:  HEEKGDMEGSVDETIEDVSPEESSSPQTEYHSEGESRGILPDHIPSSFEASQQSLTDEDEDSESHSGFRSLYPEYLPIDQNGEQQKVQMNGHKHVIAQWQ

Query:  ALPKIQRGLSNGFRADQNYQGLKNGDMRRHGNHVQSRAAPIVNGKKVWSRKPKPERDGDRFQARIQEEPTAQAEEIKSHEVLIGSISVALGNCNQESKEP
        ALPKIQRGLSNGFRADQNYQGLKNGDMRRHGNHVQSRAAPIVNGKKVWSRKPKPERDGDRF ARIQEEPTAQAEEIKSHEVLIGSISVALGNCNQESKEP
Subjt:  ALPKIQRGLSNGFRADQNYQGLKNGDMRRHGNHVQSRAAPIVNGKKVWSRKPKPERDGDRFQARIQEEPTAQAEEIKSHEVLIGSISVALGNCNQESKEP

Query:  AGAQDDCQDGHQTPRKINHLEKFIKPDSTQTATNRVMVKLWRPVSRNGSKHAMPDQSENGESEAEVITEKLEDQALLNTYSPRSSSLDGDTGDFGNNSSL
        AGAQDDCQDGHQTPRKINHLEKFIKPDSTQTATNRVMVKLWRPVSRNGSKHAMPDQSENGESEAEVITEKLEDQALLNTYSPRSSSLDGDTGDFGNNSSL
Subjt:  AGAQDDCQDGHQTPRKINHLEKFIKPDSTQTATNRVMVKLWRPVSRNGSKHAMPDQSENGESEAEVITEKLEDQALLNTYSPRSSSLDGDTGDFGNNSSL

Query:  VQEEEPVQPVGLEFSSRAAKAFLAQRWKEAITAEHVKLNLPSDSESSGCFEVKNETETSSGLFQSSNIEKRNVLGNAIVINLEAPKCSANEGGKTKFRTK
        VQEEEPVQPVGLEFSSRAAKAFLAQRWKEAITAEHVKLNLPSDSESSGCFEVKNETETSSGLFQSSNIEKRNVLGNAIVINLEAPKCSANEGGKTKFRTK
Subjt:  VQEEEPVQPVGLEFSSRAAKAFLAQRWKEAITAEHVKLNLPSDSESSGCFEVKNETETSSGLFQSSNIEKRNVLGNAIVINLEAPKCSANEGGKTKFRTK

Query:  GAKIKYIPKLRTTT
        GAKIKYIPKLRTTT
Subjt:  GAKIKYIPKLRTTT

XP_023552425.1 uncharacterized protein LOC111810086 [Cucurbita pepo subsp. pepo]0.0e+0085.13Show/hide
Query:  FFSLCGNS-PKKQLLSSPPITFSNFPGPLFLRLPETLSASAPITIRSVAFSLSWNRSLFFLLMPVAKLKTSNYPDVMKSEEGNDSLDTIIRQAIGKEPFL
        FFSL   +  +KQLLSSP ITFSNFPGP       TLS SAP TIRSVAF LSWNRSL FLLMPV+KLKTSNYPDVMKSEEGNDSLDTIIRQAIGKEPFL
Subjt:  FFSLCGNS-PKKQLLSSPPITFSNFPGPLFLRLPETLSASAPITIRSVAFSLSWNRSLFFLLMPVAKLKTSNYPDVMKSEEGNDSLDTIIRQAIGKEPFL

Query:  SFSRAGESPVQWIQLLHALDQQELPGWPLLSPLKVQMQKCEKCAREFCSVINYRRHIRVHHRLKKLDKDSAKSRDLLAAFWDKLSWEEAKEVVSFKNVLF
        SFSRAGESPVQWIQLLHALDQQELPGWPLLSPLK+QMQKCEKCAREFCSVINYRRHIRVHHRLKKLDKDSAKSRDLLAAFWDKL+WEE KE VSFKNV  
Subjt:  SFSRAGESPVQWIQLLHALDQQELPGWPLLSPLKVQMQKCEKCAREFCSVINYRRHIRVHHRLKKLDKDSAKSRDLLAAFWDKLSWEEAKEVVSFKNVLF

Query:  EASHSSGIQGSAIIKNLTALIGKPGFSALPHVYLRAGSALLDIVQGRPSRFPMTSQELFEILDNASEKTFLCGTAVSMQKYVFDGEAAKIGLETKNLVAC
        E     GIQGSA+IKNLT +IGKPGFSALPHVYLRAGSALLDIVQGRPSRFP++SQELFEILDNASEKTFLCGTAVSMQKY+FDGEA KIGLETKNLVAC
Subjt:  EASHSSGIQGSAIIKNLTALIGKPGFSALPHVYLRAGSALLDIVQGRPSRFPMTSQELFEILDNASEKTFLCGTAVSMQKYVFDGEAAKIGLETKNLVAC

Query:  MSFLVEQKVVKAWLADKDAEALRCQKLLVEEEEAAQRRQAELLERKRQKKLRQKEQRSKEQKHEEKGDMEGSVDETIEDVSPEESSSPQTEYHSEGESRG
        MSF++EQK+VK WLADKDAEALRCQKLLVEEEEAAQRRQAELLERKRQKKLRQKEQR KEQKHEEK DMEGSVDETIEDV  EESSSPQTE HSE +S G
Subjt:  MSFLVEQKVVKAWLADKDAEALRCQKLLVEEEEAAQRRQAELLERKRQKKLRQKEQRSKEQKHEEKGDMEGSVDETIEDVSPEESSSPQTEYHSEGESRG

Query:  ILPDHIPSSFEASQQSLTDEDEDSESHSGFRSLYPEYLPIDQNGEQQKVQMNGHKHVIAQWQALPKIQRGLSNGFRADQNYQGLKNGDMRRHGNHVQSRA
        IL DH PSS E SQQSLTDEDEDSESHSGFRS YPE+LPID NGE QK  MNGHKHVIAQWQALPK QRGLSNGFRA+QNYQGLKNGDMRRHGNHVQSRA
Subjt:  ILPDHIPSSFEASQQSLTDEDEDSESHSGFRSLYPEYLPIDQNGEQQKVQMNGHKHVIAQWQALPKIQRGLSNGFRADQNYQGLKNGDMRRHGNHVQSRA

Query:  APIVNGKKVWSRKPKPERDGDRFQARIQEEPTAQAEEIKSHEVLIGSISVALGNCNQESKEPAGAQDDCQDGHQTPRKI-NHLEKFIKPDSTQTATNRVM
        APIVNGKKVWSRKPKPERDGDRFQARIQEE TAQAEEIKSHEVLIGSISVAL NC QESKEP GA D CQDGHQTP+KI NHLEKFIK +S QTATNR M
Subjt:  APIVNGKKVWSRKPKPERDGDRFQARIQEEPTAQAEEIKSHEVLIGSISVALGNCNQESKEPAGAQDDCQDGHQTPRKI-NHLEKFIKPDSTQTATNRVM

Query:  VKLWRPVSRNGSKHAMPDQSENGESEAEVITEKLEDQALLNTYSPRSSSLDGDTGDFGNNSSLVQEEEPVQPVGLEFSSRAAKAFLAQRWKEAITAEHVK
        VK WRPVSRNG+K AMPDQSE+GESEAE+ITEKLEDQALLNTYSPRSSSLDGDTGD GNNS +   EEP QPV LEFSS AAKAFLAQRWKEAITA+HVK
Subjt:  VKLWRPVSRNGSKHAMPDQSENGESEAEVITEKLEDQALLNTYSPRSSSLDGDTGDFGNNSSLVQEEEPVQPVGLEFSSRAAKAFLAQRWKEAITAEHVK

Query:  LNLPSDSESSGCFEVKNETETSSGLFQSSNIEKRNVLGNAIVINLEAPKCSANEGGKTKFRT---KGAKIKYIPKLRTTT
        LNLPSDSESS CFE +N+ ETSS  FQ SN ++     +AI I LE PK SANE GKTK RT   KGAKIKYIPK+RTTT
Subjt:  LNLPSDSESSGCFEVKNETETSSGLFQSSNIEKRNVLGNAIVINLEAPKCSANEGGKTKFRT---KGAKIKYIPKLRTTT

XP_038896730.1 uncharacterized protein LOC120084984 [Benincasa hispida]0.0e+0086.25Show/hide
Query:  MPVAKLKTSNYPDVMKSEEGNDSLDTIIRQAIGKEPFLSFSRAGESPVQWIQLLHALDQQELPGWPLLSPLKVQMQKCEKCAREFCSVINYRRHIRVHHR
        MPVAKLK SNYPDVMKSEEGNDSLDTIIRQAIGKEPFLSFSRAGESPVQWIQL HALDQQELPGWPLLSPLKVQMQKCEKCAREFCSVINYRRHIRVHHR
Subjt:  MPVAKLKTSNYPDVMKSEEGNDSLDTIIRQAIGKEPFLSFSRAGESPVQWIQLLHALDQQELPGWPLLSPLKVQMQKCEKCAREFCSVINYRRHIRVHHR

Query:  LKKLDKDSAKSRDLLAAFWDKLSWEEAKEVVSFKNVLFEASHSSGIQGSAIIKNLTALIGKPGFSALPHVYLRAGSALLDIVQGRPSRFPMTSQELFEIL
        LKKLDKDSAK+RDLLAAFWDKLSWEE KE VSFKNV  E     GIQGSA+IKNLTA+IGKPGFSALPHVYLRAGSALLDIVQGRPSRFP++SQELFEIL
Subjt:  LKKLDKDSAKSRDLLAAFWDKLSWEEAKEVVSFKNVLFEASHSSGIQGSAIIKNLTALIGKPGFSALPHVYLRAGSALLDIVQGRPSRFPMTSQELFEIL

Query:  DNASEKTFLCGTAVSMQKYVFDGEAAKIGLETKNLVACMSFLVEQKVVKAWLADKDAEALRCQKLLVEEEEAAQRRQAELLERKRQKKLRQKEQRSKEQK
        DNASEKTFLCGTAVSMQKY+FDGEA KIGLETKNLVACMSFL+E+K+VK WLADKDAEALRCQKLLVEEEEAAQRRQAELLERKRQKKLRQKEQRSKEQK
Subjt:  DNASEKTFLCGTAVSMQKYVFDGEAAKIGLETKNLVACMSFLVEQKVVKAWLADKDAEALRCQKLLVEEEEAAQRRQAELLERKRQKKLRQKEQRSKEQK

Query:  HEEKGDMEGSVDETIEDVSPEESSSPQTEYHSEGESRGILPDHIPSSFEASQQSLTDEDEDSESHSGFRSLYPEYLPIDQNGEQQKVQMNGHKHVIAQWQ
         EEK D+EGSVDETIEDV  EESSSPQT+ HSE +S GILPDH PSS E SQ SLTDEDEDSESHSGF + YPE+ P D NGEQQK+QMNGHKHVI+QWQ
Subjt:  HEEKGDMEGSVDETIEDVSPEESSSPQTEYHSEGESRGILPDHIPSSFEASQQSLTDEDEDSESHSGFRSLYPEYLPIDQNGEQQKVQMNGHKHVIAQWQ

Query:  ALPKIQRGLSNGFRADQNYQGLKNGDMRRHGNHVQSRAAPIVNGKKVWSRKPKPERDGDRFQARIQEEPTAQAEEIKSHEVLIGSISVALGNCNQESKEP
        ALPK QRGLS+GFRADQNYQGLKNGDMRRHGNHVQSR  PIVNGKKVWSRKPKPERDGDRFQARIQEE TAQAEEIKSHEVLIGSISVALGNCNQESK+P
Subjt:  ALPKIQRGLSNGFRADQNYQGLKNGDMRRHGNHVQSRAAPIVNGKKVWSRKPKPERDGDRFQARIQEEPTAQAEEIKSHEVLIGSISVALGNCNQESKEP

Query:  AGAQDDCQDGHQTPRKI-NHLEKFIKPDSTQTATNRVMVKLWRPVSRNGSKHAMPDQSENGESEAEVITEKLEDQALLNTYSPRSSSLDGDTGDFGNNSS
         G  DDCQDGHQTP+KI NH EKFIK DS QTATNRVMVKLWRPVSRNG+K+AMPDQSENGESEAEVITEK+EDQALLN+YSP+  SLDGDTGDFGNNSS
Subjt:  AGAQDDCQDGHQTPRKI-NHLEKFIKPDSTQTATNRVMVKLWRPVSRNGSKHAMPDQSENGESEAEVITEKLEDQALLNTYSPRSSSLDGDTGDFGNNSS

Query:  LVQEEEPVQPVGLEFSSRAAKAFLAQRWKEAITAEHVKLNLPSDSESSGCFEVKNETETSSGLFQSSNIEKRN-VLGNAIVINLEAPKCSANEGGK-TKF
        +  +EEP QPVGLEFSSRAAKAFLAQRWKEAITA+HVKLNLPSDSESSGCF+++N+TET          ++RN V+GN I+INLE PK SANE GK TKF
Subjt:  LVQEEEPVQPVGLEFSSRAAKAFLAQRWKEAITAEHVKLNLPSDSESSGCFEVKNETETSSGLFQSSNIEKRN-VLGNAIVINLEAPKCSANEGGK-TKF

Query:  RT---KGAKIKYIPKLRTTT
        RT   KGAKIKYIPKLRTTT
Subjt:  RT---KGAKIKYIPKLRTTT

TrEMBL top hitse value%identityAlignment
A0A0A0LBG6 C2H2-type domain-containing protein0.0e+0082.31Show/hide
Query:  NRAFLLFYAFIFSHTNLRF-FSLCGNS-----------PKKQLLSSPPITFSNFPGPLFLRLPETLSASAPITIRSVAFSLSWNRSLFFLLMPVAKLKTS
        NR  L F AFIFS TNLRF FSL   S            KKQLLSSPPIT SNFPGPL    P TL  S+P TIRSV   LS NRS  FLLMPVAKLK S
Subjt:  NRAFLLFYAFIFSHTNLRF-FSLCGNS-----------PKKQLLSSPPITFSNFPGPLFLRLPETLSASAPITIRSVAFSLSWNRSLFFLLMPVAKLKTS

Query:  NYPDVMKSEEGNDSLDTIIRQAIGKEPFLSFSRAGESPVQWIQLLHALDQQELPGWPLLSPLKVQMQKCEKCAREFCSVINYRRHIRVHHRLKKLDKDSA
        NYPDVMK EEGNDSLDTIIRQAIGKEPFLSFSRAGESPVQWIQLLHALDQQELPGWPLLSPLK+QMQKCEKCAREFCSVINYRRHIRVHHRLKKLDKDSA
Subjt:  NYPDVMKSEEGNDSLDTIIRQAIGKEPFLSFSRAGESPVQWIQLLHALDQQELPGWPLLSPLKVQMQKCEKCAREFCSVINYRRHIRVHHRLKKLDKDSA

Query:  KSRDLLAAFWDKLSWEEAKEVVSFKNVLFEASHSSGIQGSAIIKNLTALIGKPGFSALPHVYLRAGSALLDIVQGRPSRFPMTSQELFEILDNASEKTFL
        KSRDLLAAFWDKL+WEE KE VSFKNV  E     GIQGSA+IKNLTA+IGKPGFSALPHVYLRAGSALLDIVQGRPSRFP++SQELFEILDNASEKTFL
Subjt:  KSRDLLAAFWDKLSWEEAKEVVSFKNVLFEASHSSGIQGSAIIKNLTALIGKPGFSALPHVYLRAGSALLDIVQGRPSRFPMTSQELFEILDNASEKTFL

Query:  CGTAVSMQKYVFDGEAAKIGLETKNLVACMSFLVEQKVVKAWLADKDAEALRCQKLLVEEEEAAQRRQAELLERKRQKKLRQKEQRSKEQKHEEKGDMEG
        CGTAVSMQKY+FDG+A KIGLETKNLVACMSFL+E+K+VK WLADKDAEALRCQKLLVEEEEAAQRRQAELLERKRQKKLRQKEQRSKEQK EEK D+EG
Subjt:  CGTAVSMQKYVFDGEAAKIGLETKNLVACMSFLVEQKVVKAWLADKDAEALRCQKLLVEEEEAAQRRQAELLERKRQKKLRQKEQRSKEQKHEEKGDMEG

Query:  SVDETIEDVSPEESSSPQTEYHSEGESRGILPDHIPSSFEASQQSLTDEDEDSESHSGFRSLYPEYLPIDQNGEQQKVQMNGHKHVIAQWQALPKIQRGL
        SVDE IED   EESSSPQTE HSE +S GILPDH PSS E SQ SLTDEDEDSESHSGF + YPE+LP D NGEQQK+QMNGHKHVI+QWQALPK QRGL
Subjt:  SVDETIEDVSPEESSSPQTEYHSEGESRGILPDHIPSSFEASQQSLTDEDEDSESHSGFRSLYPEYLPIDQNGEQQKVQMNGHKHVIAQWQALPKIQRGL

Query:  SNGFRADQNYQGLKNGDMRRHGNHVQSRAAPIVNGKKVWSRKPKPERDGDRFQARIQEEPTAQAEEIKSHEVLIGSISVALGNCNQESKEPAGAQDDCQD
        SNG+RADQNYQGLKNGDMRRHGNHVQSRAAPIVNGKKVWSRKPKPERDGDRFQARIQEE T QAEEIKSHEVLIGSISVALGNCNQESK+P G  DD QD
Subjt:  SNGFRADQNYQGLKNGDMRRHGNHVQSRAAPIVNGKKVWSRKPKPERDGDRFQARIQEEPTAQAEEIKSHEVLIGSISVALGNCNQESKEPAGAQDDCQD

Query:  GHQTPRKI-NHLEKFIKPDSTQTATNRVMVKLWRPVSRNGSKHAMPDQSENGESEAEVITEKLEDQALLNTYSPRSSSLDGDTGDFGNNSSLVQEEEPVQ
        GHQTP+KI NHLEKF+KPDS QTATNRVMVKLWRPVSRNG+K+AMPDQSENGESEAEV TEKLEDQALLN YSP   SLDGDT DFGN+S +  +EEP  
Subjt:  GHQTPRKI-NHLEKFIKPDSTQTATNRVMVKLWRPVSRNGSKHAMPDQSENGESEAEVITEKLEDQALLNTYSPRSSSLDGDTGDFGNNSSLVQEEEPVQ

Query:  PVGLEFSSRAAKAFLAQRWKEAITAEHVKLNLPSDSESSGCFEVKNETETSSGLFQSSNIEKRNVL--GNAIVINLEAPKCSANE-GGK--TKFRT---K
        PVGLEFSSRAAKAFLAQRWKEAITA+HVKLNLPSDSESSGCF+++NE ET        N ++  V+  GN I+INLEAPK SANE  GK  TKFRT   K
Subjt:  PVGLEFSSRAAKAFLAQRWKEAITAEHVKLNLPSDSESSGCFEVKNETETSSGLFQSSNIEKRNVL--GNAIVINLEAPKCSANE-GGK--TKFRT---K

Query:  GAKIKYIPKLRTTT
        GAKIKYIPKLRTTT
Subjt:  GAKIKYIPKLRTTT

A0A6J1CHD0 uncharacterized protein LOC111011517 isoform X10.0e+0099.02Show/hide
Query:  MPVAKLKTSNYPDVMKSEEGNDSLDTIIRQAIGKEPFLSFSRAGESPVQWIQLLHALDQQELPGWPLLSPLKVQMQKCEKCAREFCSVINYRRHIRVHHR
        MPVAKLKTSNYPDVMKSEEGNDSLDTIIRQAIGKEPFLSFSRAGESPVQWIQLLHALDQQELPGWPLLSPLKVQMQKCEKCAREFCSVINYRRHIRVHHR
Subjt:  MPVAKLKTSNYPDVMKSEEGNDSLDTIIRQAIGKEPFLSFSRAGESPVQWIQLLHALDQQELPGWPLLSPLKVQMQKCEKCAREFCSVINYRRHIRVHHR

Query:  LKKLDKDSAKSRDLLAAFWDKLSWEEAKEVVSFKNVLFEASHSSGIQGSAIIKNLTALIGKPGFSALPHVYLRAGSALLDIVQGRPSRFPMTSQELFEIL
        LKKLDKDSAKSRDLLAAFWDKLSWEEAKEVVSFKNVLFE     GIQGSAIIKNLTALIGKPGFSALPHVYLRAGSALLDIVQGRPSRFPMTSQELFEIL
Subjt:  LKKLDKDSAKSRDLLAAFWDKLSWEEAKEVVSFKNVLFEASHSSGIQGSAIIKNLTALIGKPGFSALPHVYLRAGSALLDIVQGRPSRFPMTSQELFEIL

Query:  DNASEKTFLCGTAVSMQKYVFDGEAAKIGLETKNLVACMSFLVEQKVVKAWLADKDAEALRCQKLLVEEEEAAQRRQAELLERKRQKKLRQKEQRSKEQK
        DNASEKTFLCGTAVSMQKYVFDGEA KIGLETKNLVACMSFLVEQKVVKAWLADKDAEALRCQKLLVEEEEAAQRRQAELLERKRQKKLRQKEQRSKEQK
Subjt:  DNASEKTFLCGTAVSMQKYVFDGEAAKIGLETKNLVACMSFLVEQKVVKAWLADKDAEALRCQKLLVEEEEAAQRRQAELLERKRQKKLRQKEQRSKEQK

Query:  HEEKGDMEGSVDETIEDVSPEESSSPQTEYHSEGESRGILPDHIPSSFEASQQSLTDEDEDSESHSGFRSLYPEYLPIDQNGEQQKVQMNGHKHVIAQWQ
        HEEKGDMEGSVDETIEDVSPEESSSPQTEYHSEGESRGILPDHIPSSFEASQQSLTDEDEDSESHSGFRSLYPEYLPIDQNGEQQKVQMNGHKHVIAQWQ
Subjt:  HEEKGDMEGSVDETIEDVSPEESSSPQTEYHSEGESRGILPDHIPSSFEASQQSLTDEDEDSESHSGFRSLYPEYLPIDQNGEQQKVQMNGHKHVIAQWQ

Query:  ALPKIQRGLSNGFRADQNYQGLKNGDMRRHGNHVQSRAAPIVNGKKVWSRKPKPERDGDRFQARIQEEPTAQAEEIKSHEVLIGSISVALGNCNQESKEP
        ALPKIQRGLSNGFRADQNYQGLKNGDMRRHGNHVQSRAAPIVNGKKVWSRKPKPERDGDRF ARIQEEPTAQAEEIKSHEVLIGSISVALGNCNQESKEP
Subjt:  ALPKIQRGLSNGFRADQNYQGLKNGDMRRHGNHVQSRAAPIVNGKKVWSRKPKPERDGDRFQARIQEEPTAQAEEIKSHEVLIGSISVALGNCNQESKEP

Query:  AGAQDDCQDGHQTPRKINHLEKFIKPDSTQTATNRVMVKLWRPVSRNGSKHAMPDQSENGESEAEVITEKLEDQALLNTYSPRSSSLDGDTGDFGNNSSL
        AGAQDDCQDGHQTPRKINHLEKFIKPDSTQTATNRVMVKLWRPVSRNGSKHAMPDQSENGESEAEVITEKLEDQALLNTYSPRSSSLDGDTGDFGNNSSL
Subjt:  AGAQDDCQDGHQTPRKINHLEKFIKPDSTQTATNRVMVKLWRPVSRNGSKHAMPDQSENGESEAEVITEKLEDQALLNTYSPRSSSLDGDTGDFGNNSSL

Query:  VQEEEPVQPVGLEFSSRAAKAFLAQRWKEAITAEHVKLNLPSDSESSGCFEVKNETETSSGLFQSSNIEKRNVLGNAIVINLEAPKCSANEGGKTKFRTK
        VQEEEPVQPVGLEFSSRAAKAFLAQRWKEAITAEHVKLNLPSDSESSGCFEVKNETETSSGLFQSSNIEKRNVLGNAIVINLEAPKCSANEGGKTKFRTK
Subjt:  VQEEEPVQPVGLEFSSRAAKAFLAQRWKEAITAEHVKLNLPSDSESSGCFEVKNETETSSGLFQSSNIEKRNVLGNAIVINLEAPKCSANEGGKTKFRTK

Query:  GAKIKYIPKLRTTT
        GAKIKYIPKLRTTT
Subjt:  GAKIKYIPKLRTTT

A0A6J1E534 uncharacterized protein LOC1114308420.0e+0085.79Show/hide
Query:  MPVAKLKTSNYPDVMKSEEGNDSLDTIIRQAIGKEPFLSFSRAGESPVQWIQLLHALDQQELPGWPLLSPLKVQMQKCEKCAREFCSVINYRRHIRVHHR
        MPV+KLKTSNYPDVMKSEEGNDSLDTIIRQAIGKEPFLSFSRAGESPVQWIQLLHALDQQELPGWPLLSPLK+QMQKCEKCAREFCSVINYRRHIRVHHR
Subjt:  MPVAKLKTSNYPDVMKSEEGNDSLDTIIRQAIGKEPFLSFSRAGESPVQWIQLLHALDQQELPGWPLLSPLKVQMQKCEKCAREFCSVINYRRHIRVHHR

Query:  LKKLDKDSAKSRDLLAAFWDKLSWEEAKEVVSFKNVLFEASHSSGIQGSAIIKNLTALIGKPGFSALPHVYLRAGSALLDIVQGRPSRFPMTSQELFEIL
        LKKLDKDSAKSRDLLAAFWDKL+WEE KE VSFKNV  E     GIQGSA+IKNLT +IGKPGFSALPHVYLRAGSALLDIVQGRPSRFP++SQE FEIL
Subjt:  LKKLDKDSAKSRDLLAAFWDKLSWEEAKEVVSFKNVLFEASHSSGIQGSAIIKNLTALIGKPGFSALPHVYLRAGSALLDIVQGRPSRFPMTSQELFEIL

Query:  DNASEKTFLCGTAVSMQKYVFDGEAAKIGLETKNLVACMSFLVEQKVVKAWLADKDAEALRCQKLLVEEEEAAQRRQAELLERKRQKKLRQKEQRSKEQK
        DNASEKTFLCGTAVSMQKYVFDGEA KIGLETKNLVACMSF++EQK+VK WLADKDAEALRCQKLLVEEEEAAQRRQAELLERKRQKKLRQKEQR KEQK
Subjt:  DNASEKTFLCGTAVSMQKYVFDGEAAKIGLETKNLVACMSFLVEQKVVKAWLADKDAEALRCQKLLVEEEEAAQRRQAELLERKRQKKLRQKEQRSKEQK

Query:  HEEKGDMEGSVDETIEDVSPEESSSPQTEYHSEGESRGILPDHIPSSFEASQQSLTDEDEDSESHSGFRSLYPEYLPIDQNGEQQKVQMNGHKHVIAQWQ
        HEEK DMEGSVDETIEDV  EESSSPQTE HSE +S GIL DH PSS E SQQSLTDEDEDSESHSGFRS YPE+LPID NGE QK  MNGHKHVIAQWQ
Subjt:  HEEKGDMEGSVDETIEDVSPEESSSPQTEYHSEGESRGILPDHIPSSFEASQQSLTDEDEDSESHSGFRSLYPEYLPIDQNGEQQKVQMNGHKHVIAQWQ

Query:  ALPKIQRGLSNGFRADQNYQGLKNGDMRRHGNHVQSRAAPIVNGKKVWSRKPKPERDGDRFQARIQEEPTAQAEEIKSHEVLIGSISVALGNCNQESKEP
        ALPK QR LSNGFRA+QNYQGLKNGDMRRHGNHVQ RAAP+VNGKKVWSRKPKPERDGDRFQARIQEE TAQAEEIKSHEVLIGSISVAL NC QESKEP
Subjt:  ALPKIQRGLSNGFRADQNYQGLKNGDMRRHGNHVQSRAAPIVNGKKVWSRKPKPERDGDRFQARIQEEPTAQAEEIKSHEVLIGSISVALGNCNQESKEP

Query:  AGAQDDCQDGHQTPRKI-NHLEKFIKPDSTQTATNRVMVKLWRPVSRNGSKHAMPDQSENGESEAEVITEKLEDQALLNTYSPRSSSLDGDTGDFGNNSS
         GA DDCQDG QTP+KI NHLEKFIK +S QTATNR MVK WRPVSRNG+K AMP QSE+GESEAE+ITEKLEDQALLNTYSPRSSSLDGDTGD GNNS 
Subjt:  AGAQDDCQDGHQTPRKI-NHLEKFIKPDSTQTATNRVMVKLWRPVSRNGSKHAMPDQSENGESEAEVITEKLEDQALLNTYSPRSSSLDGDTGDFGNNSS

Query:  LVQEEEPVQPVGLEFSSRAAKAFLAQRWKEAITAEHVKLNLPSDSESSGCFEVKNETETSSGLFQSSNIEKRNVLGNAIVINLEAPKCSANEGGKTKFRT
        +   EEP QPV LEFSS AAKAFLAQRWKEAITA+HVKLNLPSDSE S CFE +N+ ETSS  FQ SN ++     +AI I LE PK SANE GKTK RT
Subjt:  LVQEEEPVQPVGLEFSSRAAKAFLAQRWKEAITAEHVKLNLPSDSESSGCFEVKNETETSSGLFQSSNIEKRNVLGNAIVINLEAPKCSANEGGKTKFRT

Query:  ---KGAKIKYIPKLRTTT
           KGAKIKYIPK+RTTT
Subjt:  ---KGAKIKYIPKLRTTT

A0A6J1FZU0 uncharacterized protein LOC111449426 isoform X10.0e+0078.95Show/hide
Query:  FLLFYAFIFSHTNLRFFSLCGNSPKKQLLSSPPITFSNFPGPLFLRLPETLSAS---APITIRSVAFSLSWNRSLFFLLMPVAKLKTSNYPDVMKSEEGN
        + L   F++SH+  R      NSPKK         F  F  P F  L  TL  S    P TIRS AF  S NRSL  LLMPVAKLK SNYPDVMKSEEGN
Subjt:  FLLFYAFIFSHTNLRFFSLCGNSPKKQLLSSPPITFSNFPGPLFLRLPETLSAS---APITIRSVAFSLSWNRSLFFLLMPVAKLKTSNYPDVMKSEEGN

Query:  DSLDTIIRQAIGKEPFLSFSRAGESPVQWIQLLHALDQQELPGWPLLSPLKVQMQKCEKCAREFCSVINYRRHIRVHHRLKKLDKDSAKSRDLLAAFWDK
        DSLDTIIRQAIGKEPFLSFSRAGESPVQWIQLLHALD  ELPGWPLLSPLKVQMQKC+KC  EF S INYRRHIRV+HR+KK DKDSAK+RDLLAAFWDK
Subjt:  DSLDTIIRQAIGKEPFLSFSRAGESPVQWIQLLHALDQQELPGWPLLSPLKVQMQKCEKCAREFCSVINYRRHIRVHHRLKKLDKDSAKSRDLLAAFWDK

Query:  LSWEEAKEVVSFKNVLFEASHSSGIQGSAIIKNLTALIGKPGFSALPHVYLRAGSALLDIVQGRPSRFPMTSQELFEILDNASEKTFLCGTAVSMQKYVF
        LSWEE KE VSFKNV  E     G+QGSA+IKNLTA+IGKP FSALPHVYLRAGSALLDIVQGRPSRFP++SQELFEILDNASEKTFLCGT+ SMQKY+F
Subjt:  LSWEEAKEVVSFKNVLFEASHSSGIQGSAIIKNLTALIGKPGFSALPHVYLRAGSALLDIVQGRPSRFPMTSQELFEILDNASEKTFLCGTAVSMQKYVF

Query:  DGEAAKIGLETKNLVACMSFLVEQKVVKAWLADKDAEALRCQKLLVEEEEAAQRRQAELLERKRQKKLRQKEQRSKEQKHEEKGDMEGSVDETIEDVSPE
        DG   KIGLETKNLVACMSFL+E+K+VK WLADKDAEALRCQKLLVEEEEAAQRRQAELLERK+QKKLRQKEQRSKEQK EEK DME SV+ETIEDV PE
Subjt:  DGEAAKIGLETKNLVACMSFLVEQKVVKAWLADKDAEALRCQKLLVEEEEAAQRRQAELLERKRQKKLRQKEQRSKEQKHEEKGDMEGSVDETIEDVSPE

Query:  ESSSPQTEYHSEGESRGILPDHIPSSFEASQQSLTDEDEDSESHSGFRSLYPEYLPIDQNGEQQKVQMNGHKHVIAQWQALPKIQRGLSNGFRADQNYQG
        ESSSPQTE HSEG+S  IL DH+PSS E SQQSLTDEDEDSES  G  S YPE+LPID NGE+QK+QMNGHKHVIAQWQALPK QRGLSNG+ A+QNYQG
Subjt:  ESSSPQTEYHSEGESRGILPDHIPSSFEASQQSLTDEDEDSESHSGFRSLYPEYLPIDQNGEQQKVQMNGHKHVIAQWQALPKIQRGLSNGFRADQNYQG

Query:  LKNGDMRRHGNHVQSRAAPIVNGKKVWSRKPKPERDGDRFQARIQEEPTAQAEEIKSHEVLIGSISVALGNCNQESKEPAGAQDDCQDGHQTPRKI-NHL
         KNGDMRRHGNHV SRAAP+ NGKKVWSRKPKPERDG R+QARI EE TAQAEEIKSHEVLIGSISVALGNCNQESK P GA DDCQDGHQTP+KI NH+
Subjt:  LKNGDMRRHGNHVQSRAAPIVNGKKVWSRKPKPERDGDRFQARIQEEPTAQAEEIKSHEVLIGSISVALGNCNQESKEPAGAQDDCQDGHQTPRKI-NHL

Query:  EKFIKPDSTQTATNRVMVKLWRPVSRNGSKHAMPDQSENGESEAEVITEKLEDQALLNTYSPRSSSLDGDTGDFGNNSSLVQEEEPVQPVGLEFSSRAAK
        +KFIKPDS QTATNRVMVKLWRPV RNG+K+AM +QS+N ESEAE ITEKLED+ALLNTYSPRSSSLDGD GDFGNN+SL+Q EEP Q VGLEFSSRAAK
Subjt:  EKFIKPDSTQTATNRVMVKLWRPVSRNGSKHAMPDQSENGESEAEVITEKLEDQALLNTYSPRSSSLDGDTGDFGNNSSLVQEEEPVQPVGLEFSSRAAK

Query:  AFLAQRWKEAITAEHVKLNLPSDSESSGCFEVKNETETSSGLFQSSNIEKRNVLGNAIVINLEAPK-CSANEGGKTKFRT---KGAKIKYIPKLRTTT
        AFLAQRWKEAITA+HVKLNLPSDSESS C E++N+TETS+  FQ SN ++RNVLGNA+ IN++ PK  SANE GK+KFRT   KGAKIKYIPKL TTT
Subjt:  AFLAQRWKEAITAEHVKLNLPSDSESSGCFEVKNETETSSGLFQSSNIEKRNVLGNAIVINLEAPK-CSANEGGKTKFRT---KGAKIKYIPKLRTTT

A0A6J1JA46 uncharacterized protein LOC1114830980.0e+0086.21Show/hide
Query:  MPVAKLKTSNYPDVMKSEEGNDSLDTIIRQAIGKEPFLSFSRAGESPVQWIQLLHALDQQELPGWPLLSPLKVQMQKCEKCAREFCSVINYRRHIRVHHR
        MPV+KLKTSNYPDVMKSEEGNDSLDTIIRQAIGKEPFLSFSRAGESPVQWIQLLHALDQQELPGWPLLSPLK+QMQKCEKCAREFCSVINYRRHIRVHHR
Subjt:  MPVAKLKTSNYPDVMKSEEGNDSLDTIIRQAIGKEPFLSFSRAGESPVQWIQLLHALDQQELPGWPLLSPLKVQMQKCEKCAREFCSVINYRRHIRVHHR

Query:  LKKLDKDSAKSRDLLAAFWDKLSWEEAKEVVSFKNVLFEASHSSGIQGSAIIKNLTALIGKPGFSALPHVYLRAGSALLDIVQGRPSRFPMTSQELFEIL
        LKKLDKDSAKSRDLLAAFWDKL+WEE KE VSFKNV  E     GIQGSA+IKNLT +IGKPGFSALPHVYLRAGSALLDIVQGRPSRFP++SQELFEIL
Subjt:  LKKLDKDSAKSRDLLAAFWDKLSWEEAKEVVSFKNVLFEASHSSGIQGSAIIKNLTALIGKPGFSALPHVYLRAGSALLDIVQGRPSRFPMTSQELFEIL

Query:  DNASEKTFLCGTAVSMQKYVFDGEAAKIGLETKNLVACMSFLVEQKVVKAWLADKDAEALRCQKLLVEEEEAAQRRQAELLERKRQKKLRQKEQRSKEQK
        DNASEKTFLCGTAVSMQKY+FDGEA KIGLETKNLVACMSF++EQK+VK WLADKDAEALRCQKLLVEEEEAAQRRQAELLERKRQKKLRQKEQR KEQK
Subjt:  DNASEKTFLCGTAVSMQKYVFDGEAAKIGLETKNLVACMSFLVEQKVVKAWLADKDAEALRCQKLLVEEEEAAQRRQAELLERKRQKKLRQKEQRSKEQK

Query:  HEEKGDMEGSVDETIEDVSPEESSSPQTEYHSEGESRGILPDHIPSSFEASQQSLTDEDEDSESHSGFRSLYPEYLPIDQNGEQQKVQMNGHKHVIAQWQ
        HEEK DMEGSVDETIEDV  EESSSPQTE HSE +S GIL DH PSS E SQQSLTDEDEDSESHSGFRS YPE+LPID NGE QK  MNGHKHVIAQWQ
Subjt:  HEEKGDMEGSVDETIEDVSPEESSSPQTEYHSEGESRGILPDHIPSSFEASQQSLTDEDEDSESHSGFRSLYPEYLPIDQNGEQQKVQMNGHKHVIAQWQ

Query:  ALPKIQRGLSNGFRADQNYQGLKNGDMRRHGNHVQSRAAPIVNGKKVWSRKPKPERDGDRFQARIQEEPTAQAEEIKSHEVLIGSISVALGNCNQESKEP
        ALPK QRGLSNGFRA+QNYQGLKNGDMRRHGNHVQ RAAPIVNGKKVWSRKPKPERDGDRFQARIQEE TAQAE+IKSHEVLIGSISVAL NC QESKEP
Subjt:  ALPKIQRGLSNGFRADQNYQGLKNGDMRRHGNHVQSRAAPIVNGKKVWSRKPKPERDGDRFQARIQEEPTAQAEEIKSHEVLIGSISVALGNCNQESKEP

Query:  AGAQDDCQDGHQTPRKI-NHLEKFIKPDSTQTATNRVMVKLWRPVSRNGSKHAMPDQSENGESEAEVITEKLEDQALLNTYSPRSSSLDGDTGDFGNNSS
         GA DDCQDG QTP+KI NHLEKFIK +S QTATNR MVK WRPVSRNG+K AMP QSE+GESEAE+ITEKLEDQALLNTYSPRSSSLDGDTGD GNNS 
Subjt:  AGAQDDCQDGHQTPRKI-NHLEKFIKPDSTQTATNRVMVKLWRPVSRNGSKHAMPDQSENGESEAEVITEKLEDQALLNTYSPRSSSLDGDTGDFGNNSS

Query:  LVQEEEPVQPVGLEFSSRAAKAFLAQRWKEAITAEHVKLNLPSDSESSGCFEVKNETETSSGLFQSSNIEKRNVLGNAIVINLEAPKCSANEGGKTKFRT
        +   EEP QPV LEFSS AAKAFLAQRWKEAITA+HVKLNLPSDSESSGCFE +N+ ETSS  FQ SN ++     +AI I LE PK SANE GKTK RT
Subjt:  LVQEEEPVQPVGLEFSSRAAKAFLAQRWKEAITAEHVKLNLPSDSESSGCFEVKNETETSSGLFQSSNIEKRNVLGNAIVINLEAPKCSANEGGKTKFRT

Query:  ---KGAKIKYIPKLRTTT
           KGAKIKYIPK+RTTT
Subjt:  ---KGAKIKYIPKLRTTT

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G25610.1 C2H2-like zinc finger protein7.0e-12142.2Show/hide
Query:  EEGNDSLDTIIRQAIGKEPFLSFSRAGESPVQWIQLLHALDQQELPGWPLLSPLKVQMQKCEKCAREFCSVINYRRHIRVHHRLKKLDKDSAKSRDLLAA
        ++GNDSLD +IR+A+GK+PFLSF R   +PVQ  QLLH L   E PGWPLL+PLK+QMQKCEKC+REFCS +N+RRH R+H R +K +KD  K RD L A
Subjt:  EEGNDSLDTIIRQAIGKEPFLSFSRAGESPVQWIQLLHALDQQELPGWPLLSPLKVQMQKCEKCAREFCSVINYRRHIRVHHRLKKLDKDSAKSRDLLAA

Query:  FWDKLSWEEAKEVVSFKNVLFEASHSSGIQGSAIIKNLTALIGKPGFSALPHVYLRAGSALLDIVQGRPSRFPMTSQELFEILDNASEKTFLCGTAVSMQ
        FW+KLS  +AKE++S K+++ E      I G+++   L +LI KPG++ALP  YLRAGS LLD++Q RP R P++SQELF ILD+ASEKTFL   A  MQ
Subjt:  FWDKLSWEEAKEVVSFKNVLFEASHSSGIQGSAIIKNLTALIGKPGFSALPHVYLRAGSALLDIVQGRPSRFPMTSQELFEILDNASEKTFLCGTAVSMQ

Query:  KYVFDGEAAKIGLETKNLVACMSFLVEQKVVKAWLADKDAEALRCQKLLVEEEEAAQRRQAELLERKRQKKLRQKEQRSKEQKHEEKGDMEGSVDETIED
        KY+FDGE  K  LE KN+VAC SFL+EQ+++KAWLADKDAEALRCQ LLVEEEEAA+RR+AELLERK++KKLRQKEQR K+QK + K D E +  E  E 
Subjt:  KYVFDGEAAKIGLETKNLVACMSFLVEQKVVKAWLADKDAEALRCQKLLVEEEEAAQRRQAELLERKRQKKLRQKEQRSKEQKHEEKGDMEGSVDETIED

Query:  VSPEESSSP-QTEYHSEGESRGILPDHIPSSFEASQQSLTDEDEDSESHSGFRSLYPEYLPIDQNGEQQKVQMNGHKHVIAQWQALPKIQRGLSNGFRAD
          P E SSP      SE ++   LP    SS E  Q   T+   +SE+        P    +D NG+  + + +G +        + + Q+G+ NGF AD
Subjt:  VSPEESSSP-QTEYHSEGESRGILPDHIPSSFEASQQSLTDEDEDSESHSGFRSLYPEYLPIDQNGEQQKVQMNGHKHVIAQWQALPKIQRGLSNGFRAD

Query:  QNYQGLKNGDMRRHGNHVQSRAAPIVNGKKVWSRKPKPERDGDRFQARIQEEPTAQAEEIKSHEVLIGSISVALGNCNQESKEPAGAQDDCQDGHQTPRK
                G MR++G +  +RA    N  KVWSRK       D  +   Q     Q ++ KS E ++GS+SV++ N  + +      Q  C +G +  + 
Subjt:  QNYQGLKNGDMRRHGNHVQSRAAPIVNGKKVWSRKPKPERDGDRFQARIQEEPTAQAEEIKSHEVLIGSISVALGNCNQESKEPAGAQDDCQDGHQTPRK

Query:  INHLEKFIKPDSTQTATNRVMVKLWRPVSRNGSKHAMPDQSENGESEAEVITEKLEDQALLNTYSPRSSSLDGDTGDFGNNSS-LVQEEEPVQPVGLEFS
        +      +KP S Q+      VK+WRPVS  G K                                 +S+++G+T      S+    E +    + L+F+
Subjt:  INHLEKFIKPDSTQTATNRVMVKLWRPVSRNGSKHAMPDQSENGESEAEVITEKLEDQALLNTYSPRSSSLDGDTGDFGNNSS-LVQEEEPVQPVGLEFS

Query:  SRAAKAFLAQRWKEAITAEHVKLNLPSDSESSGCFEVKNETETSSGLFQSSNIEKRNVLGNAIVINLEAPKCSANEGGKTKFRT---KGAKIKYIPKLR
        +  AKAFLA+RWKEA +AEHV L L  +++ SG     N  E+S+G+  +                            ++K RT   KG K+KY+PK R
Subjt:  SRAAKAFLAQRWKEAITAEHVKLNLPSDSESSGCFEVKNETETSSGLFQSSNIEKRNVLGNAIVINLEAPKCSANEGGKTKFRT---KGAKIKYIPKLR

AT4G25610.2 C2H2-like zinc finger protein1.7e-11143.77Show/hide
Query:  EEGNDSLDTIIRQAIGKEPFLSFSRAGESPVQWIQLLHALDQQELPGWPLLSPLKVQMQKCEKCAREFCSVINYRRHIRVHHRLKKLDKDSAKSRDLLAA
        ++GNDSLD +IR+A+GK+PFLSF R   +PVQ  QLLH L   E PGWPLL+PLK+QMQKCEKC+REFCS +N+RRH R+H R +K +KD  K RD L A
Subjt:  EEGNDSLDTIIRQAIGKEPFLSFSRAGESPVQWIQLLHALDQQELPGWPLLSPLKVQMQKCEKCAREFCSVINYRRHIRVHHRLKKLDKDSAKSRDLLAA

Query:  FWDKLSWEEAKEVVSFKNVLFEASHSSGIQGSAIIKNLTALIGKPGFSALPHVYLRAGSALLDIVQGRPSRFPMTSQELFEILDNASEKTFLCGTAVSMQ
        FW+KLS  +AKE++S K+++ E      I G+++   L +LI KPG++ALP  YLRAGS LLD++Q RP R P++SQELF ILD+ASEKTFL   A  MQ
Subjt:  FWDKLSWEEAKEVVSFKNVLFEASHSSGIQGSAIIKNLTALIGKPGFSALPHVYLRAGSALLDIVQGRPSRFPMTSQELFEILDNASEKTFLCGTAVSMQ

Query:  KYVFDGEAAKIGLETKNLVACMSFLVEQKVVKAWLADKDAEALRCQKLLVEEEEAAQRRQAELLERKRQKKLRQKEQRSKEQKHEEKGDMEGSVDETIED
        KY+FDGE  K  LE KN+VAC SFL+EQ+++KAWLADKDAEALRCQ LLVEEEEAA+RR+AELLERK++KKLRQKEQR K+QK + K D E +  E  E 
Subjt:  KYVFDGEAAKIGLETKNLVACMSFLVEQKVVKAWLADKDAEALRCQKLLVEEEEAAQRRQAELLERKRQKKLRQKEQRSKEQKHEEKGDMEGSVDETIED

Query:  VSPEESSSP-QTEYHSEGESRGILPDHIPSSFEASQQSLTDEDEDSESHSGFRSLYPEYLPIDQNGEQQKVQMNGHKHVIAQWQALPKIQRGLSNGFRAD
          P E SSP      SE ++   LP    SS E  Q   T+   +SE+        P    +D NG+  + + +G +        + + Q+G+ NGF AD
Subjt:  VSPEESSSP-QTEYHSEGESRGILPDHIPSSFEASQQSLTDEDEDSESHSGFRSLYPEYLPIDQNGEQQKVQMNGHKHVIAQWQALPKIQRGLSNGFRAD

Query:  QNYQGLKNGDMRRHGNHVQSRAAPIVNGKKVWSRKPKPERDGDRFQARIQEEPTAQAEEIKSHEVLIGSISVALGNCNQESKEPAGAQDDCQDGHQTPRK
                G MR++G +  +RA    N  KVWSRK       D  +   Q     Q ++ KS E ++GS+SV++ N  + +      Q  C +G +  + 
Subjt:  QNYQGLKNGDMRRHGNHVQSRAAPIVNGKKVWSRKPKPERDGDRFQARIQEEPTAQAEEIKSHEVLIGSISVALGNCNQESKEPAGAQDDCQDGHQTPRK

Query:  INHLEKFIKPDSTQTATNRVMVKLWRPVSRNGSKHAMPDQSENGESEAEVITEKLEDQALLNTYSPRSSSLDGDTGDFGNNSS-LVQEEEPVQPVGLEFS
        +      +KP S Q+      VK+WRPVS  G K                                 +S+++G+T      S+    E +    + L+F+
Subjt:  INHLEKFIKPDSTQTATNRVMVKLWRPVSRNGSKHAMPDQSENGESEAEVITEKLEDQALLNTYSPRSSSLDGDTGDFGNNSS-LVQEEEPVQPVGLEFS

Query:  SRAAKAFLAQ
        +  AKAFLA+
Subjt:  SRAAKAFLAQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
AACAGGGCTTTCCTTCTTTTTTATGCCTTTATATTTTCTCATACAAACCTTCGTTTTTTCTCTCTCTGTGGAAACTCTCCCAAAAAACAGCTTCTCTCTTCTCCACCGAT
CACTTTCTCGAATTTTCCAGGCCCCCTTTTCCTCCGGCTACCTGAAACTCTCTCCGCATCGGCTCCGATTACGATCCGATCTGTCGCCTTCTCTCTTTCTTGGAATCGAT
CTTTGTTCTTCTTACTGATGCCAGTTGCGAAACTCAAGACCTCTAACTATCCAGATGTGATGAAATCGGAGGAGGGAAATGATTCTCTAGACACTATTATTAGACAAGCA
ATAGGAAAAGAGCCTTTTCTTTCTTTCTCGAGAGCTGGTGAGAGCCCAGTGCAGTGGATTCAACTTCTTCATGCGCTAGATCAACAAGAACTTCCAGGTTGGCCATTGCT
CTCTCCTTTAAAGGTTCAAATGCAAAAGTGTGAGAAGTGTGCCCGGGAATTCTGCTCGGTTATCAACTATAGAAGACATATACGAGTGCATCATAGGCTGAAAAAGCTAG
ATAAGGATTCTGCCAAAAGTAGGGATCTACTAGCAGCATTTTGGGACAAGCTGTCTTGGGAAGAGGCGAAGGAGGTTGTATCATTCAAGAACGTCCTATTCGAGGCAAGT
CATTCCTCAGGAATACAAGGATCGGCTATAATCAAGAACCTAACGGCTCTTATTGGAAAACCTGGATTCTCTGCCCTACCACATGTTTATTTGAGGGCAGGTTCTGCGCT
TTTGGACATTGTACAAGGTAGACCATCTAGGTTTCCGATGACATCTCAGGAGTTATTTGAAATTCTTGACAATGCAAGCGAAAAAACGTTTCTATGTGGAACGGCTGTCT
CTATGCAAAAATACGTATTTGATGGGGAGGCTGCAAAGATAGGTCTTGAAACTAAGAACTTAGTTGCTTGCATGAGCTTCTTGGTGGAACAGAAAGTGGTCAAAGCATGG
CTTGCTGACAAGGATGCTGAAGCTTTGAGGTGCCAGAAGTTGCTGGTAGAGGAGGAAGAAGCTGCTCAAAGGAGACAAGCGGAGCTGTTGGAAAGAAAAAGGCAGAAAAA
GCTAAGGCAGAAAGAACAAAGGTCTAAGGAGCAAAAACACGAGGAGAAGGGTGATATGGAAGGGAGTGTAGACGAAACAATTGAAGATGTGTCACCTGAAGAATCATCGA
GCCCTCAGACTGAGTACCATTCAGAAGGGGAGTCTCGGGGCATACTGCCTGATCATATTCCGTCATCTTTTGAAGCATCTCAACAATCACTAACTGATGAAGATGAGGAT
TCTGAGTCTCATTCTGGGTTTCGCAGTTTGTACCCTGAATATCTTCCTATTGATCAGAATGGTGAACAGCAGAAAGTGCAAATGAATGGTCACAAGCATGTCATTGCCCA
ATGGCAGGCGTTGCCTAAGATACAAAGGGGACTTTCCAATGGTTTTCGGGCAGATCAGAATTATCAGGGACTCAAAAATGGAGATATGCGCAGGCATGGAAACCATGTGC
AATCAAGAGCTGCTCCCATTGTTAATGGAAAAAAAGTATGGAGCCGGAAGCCTAAGCCAGAAAGGGATGGAGATCGTTTTCAAGCCAGGATTCAGGAAGAGCCTACGGCC
CAGGCAGAGGAAATTAAGAGCCATGAGGTTTTGATTGGTTCTATTTCAGTGGCGTTAGGAAATTGCAACCAAGAGAGTAAAGAGCCAGCTGGAGCTCAAGATGATTGCCA
GGATGGTCATCAAACGCCAAGGAAGATTAATCATTTGGAGAAATTCATTAAGCCAGATTCTACTCAAACTGCAACAAACCGAGTGATGGTTAAGCTTTGGAGGCCAGTAA
GTCGTAATGGATCCAAACATGCAATGCCAGATCAAAGTGAAAATGGCGAATCTGAAGCCGAAGTGATAACTGAAAAGCTGGAAGATCAGGCCCTGCTGAATACATATTCG
CCAAGATCCTCTTCCTTGGATGGTGATACTGGAGACTTTGGAAACAACTCCTCTTTGGTTCAGGAAGAAGAACCTGTGCAACCAGTTGGCTTGGAGTTCTCTAGCCGTGC
TGCCAAGGCTTTCCTTGCACAGAGATGGAAGGAGGCTATAACAGCCGAGCATGTTAAATTGAATCTACCCTCGGATTCTGAGTCTTCTGGATGCTTTGAAGTTAAAAATG
AGACTGAAACCTCCTCTGGATTATTCCAATCGTCAAACATTGAGAAACGCAATGTTCTTGGAAATGCAATAGTGATCAACTTGGAGGCTCCCAAGTGCTCAGCCAATGAA
GGCGGCAAGACCAAGTTCAGGACAAAGGGTGCGAAGATAAAGTACATTCCCAAACTTCGAACTACTACC
mRNA sequenceShow/hide mRNA sequence
AACAGGGCTTTCCTTCTTTTTTATGCCTTTATATTTTCTCATACAAACCTTCGTTTTTTCTCTCTCTGTGGAAACTCTCCCAAAAAACAGCTTCTCTCTTCTCCACCGAT
CACTTTCTCGAATTTTCCAGGCCCCCTTTTCCTCCGGCTACCTGAAACTCTCTCCGCATCGGCTCCGATTACGATCCGATCTGTCGCCTTCTCTCTTTCTTGGAATCGAT
CTTTGTTCTTCTTACTGATGCCAGTTGCGAAACTCAAGACCTCTAACTATCCAGATGTGATGAAATCGGAGGAGGGAAATGATTCTCTAGACACTATTATTAGACAAGCA
ATAGGAAAAGAGCCTTTTCTTTCTTTCTCGAGAGCTGGTGAGAGCCCAGTGCAGTGGATTCAACTTCTTCATGCGCTAGATCAACAAGAACTTCCAGGTTGGCCATTGCT
CTCTCCTTTAAAGGTTCAAATGCAAAAGTGTGAGAAGTGTGCCCGGGAATTCTGCTCGGTTATCAACTATAGAAGACATATACGAGTGCATCATAGGCTGAAAAAGCTAG
ATAAGGATTCTGCCAAAAGTAGGGATCTACTAGCAGCATTTTGGGACAAGCTGTCTTGGGAAGAGGCGAAGGAGGTTGTATCATTCAAGAACGTCCTATTCGAGGCAAGT
CATTCCTCAGGAATACAAGGATCGGCTATAATCAAGAACCTAACGGCTCTTATTGGAAAACCTGGATTCTCTGCCCTACCACATGTTTATTTGAGGGCAGGTTCTGCGCT
TTTGGACATTGTACAAGGTAGACCATCTAGGTTTCCGATGACATCTCAGGAGTTATTTGAAATTCTTGACAATGCAAGCGAAAAAACGTTTCTATGTGGAACGGCTGTCT
CTATGCAAAAATACGTATTTGATGGGGAGGCTGCAAAGATAGGTCTTGAAACTAAGAACTTAGTTGCTTGCATGAGCTTCTTGGTGGAACAGAAAGTGGTCAAAGCATGG
CTTGCTGACAAGGATGCTGAAGCTTTGAGGTGCCAGAAGTTGCTGGTAGAGGAGGAAGAAGCTGCTCAAAGGAGACAAGCGGAGCTGTTGGAAAGAAAAAGGCAGAAAAA
GCTAAGGCAGAAAGAACAAAGGTCTAAGGAGCAAAAACACGAGGAGAAGGGTGATATGGAAGGGAGTGTAGACGAAACAATTGAAGATGTGTCACCTGAAGAATCATCGA
GCCCTCAGACTGAGTACCATTCAGAAGGGGAGTCTCGGGGCATACTGCCTGATCATATTCCGTCATCTTTTGAAGCATCTCAACAATCACTAACTGATGAAGATGAGGAT
TCTGAGTCTCATTCTGGGTTTCGCAGTTTGTACCCTGAATATCTTCCTATTGATCAGAATGGTGAACAGCAGAAAGTGCAAATGAATGGTCACAAGCATGTCATTGCCCA
ATGGCAGGCGTTGCCTAAGATACAAAGGGGACTTTCCAATGGTTTTCGGGCAGATCAGAATTATCAGGGACTCAAAAATGGAGATATGCGCAGGCATGGAAACCATGTGC
AATCAAGAGCTGCTCCCATTGTTAATGGAAAAAAAGTATGGAGCCGGAAGCCTAAGCCAGAAAGGGATGGAGATCGTTTTCAAGCCAGGATTCAGGAAGAGCCTACGGCC
CAGGCAGAGGAAATTAAGAGCCATGAGGTTTTGATTGGTTCTATTTCAGTGGCGTTAGGAAATTGCAACCAAGAGAGTAAAGAGCCAGCTGGAGCTCAAGATGATTGCCA
GGATGGTCATCAAACGCCAAGGAAGATTAATCATTTGGAGAAATTCATTAAGCCAGATTCTACTCAAACTGCAACAAACCGAGTGATGGTTAAGCTTTGGAGGCCAGTAA
GTCGTAATGGATCCAAACATGCAATGCCAGATCAAAGTGAAAATGGCGAATCTGAAGCCGAAGTGATAACTGAAAAGCTGGAAGATCAGGCCCTGCTGAATACATATTCG
CCAAGATCCTCTTCCTTGGATGGTGATACTGGAGACTTTGGAAACAACTCCTCTTTGGTTCAGGAAGAAGAACCTGTGCAACCAGTTGGCTTGGAGTTCTCTAGCCGTGC
TGCCAAGGCTTTCCTTGCACAGAGATGGAAGGAGGCTATAACAGCCGAGCATGTTAAATTGAATCTACCCTCGGATTCTGAGTCTTCTGGATGCTTTGAAGTTAAAAATG
AGACTGAAACCTCCTCTGGATTATTCCAATCGTCAAACATTGAGAAACGCAATGTTCTTGGAAATGCAATAGTGATCAACTTGGAGGCTCCCAAGTGCTCAGCCAATGAA
GGCGGCAAGACCAAGTTCAGGACAAAGGGTGCGAAGATAAAGTACATTCCCAAACTTCGAACTACTACC
Protein sequenceShow/hide protein sequence
NRAFLLFYAFIFSHTNLRFFSLCGNSPKKQLLSSPPITFSNFPGPLFLRLPETLSASAPITIRSVAFSLSWNRSLFFLLMPVAKLKTSNYPDVMKSEEGNDSLDTIIRQA
IGKEPFLSFSRAGESPVQWIQLLHALDQQELPGWPLLSPLKVQMQKCEKCAREFCSVINYRRHIRVHHRLKKLDKDSAKSRDLLAAFWDKLSWEEAKEVVSFKNVLFEAS
HSSGIQGSAIIKNLTALIGKPGFSALPHVYLRAGSALLDIVQGRPSRFPMTSQELFEILDNASEKTFLCGTAVSMQKYVFDGEAAKIGLETKNLVACMSFLVEQKVVKAW
LADKDAEALRCQKLLVEEEEAAQRRQAELLERKRQKKLRQKEQRSKEQKHEEKGDMEGSVDETIEDVSPEESSSPQTEYHSEGESRGILPDHIPSSFEASQQSLTDEDED
SESHSGFRSLYPEYLPIDQNGEQQKVQMNGHKHVIAQWQALPKIQRGLSNGFRADQNYQGLKNGDMRRHGNHVQSRAAPIVNGKKVWSRKPKPERDGDRFQARIQEEPTA
QAEEIKSHEVLIGSISVALGNCNQESKEPAGAQDDCQDGHQTPRKINHLEKFIKPDSTQTATNRVMVKLWRPVSRNGSKHAMPDQSENGESEAEVITEKLEDQALLNTYS
PRSSSLDGDTGDFGNNSSLVQEEEPVQPVGLEFSSRAAKAFLAQRWKEAITAEHVKLNLPSDSESSGCFEVKNETETSSGLFQSSNIEKRNVLGNAIVINLEAPKCSANE
GGKTKFRTKGAKIKYIPKLRTTT