; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC05g0792 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC05g0792
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionMis18-binding protein 1-like isoform X1
Genome locationMC05:6649785..6658644
RNA-Seq ExpressionMC05g0792
SyntenyMC05g0792
Gene Ontology termsGO:0000387 - spliceosomal snRNP assembly (biological process)
GO:0005634 - nucleus (cellular component)
GO:0032797 - SMN complex (cellular component)
GO:0016747 - transferase activity, transferring acyl groups other than amino-acyl groups (molecular function)
InterPro domainsIPR035426 - Gemin2/Brr1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KGN53109.2 hypothetical protein Csa_015143 [Cucumis sativus]0.069.27Show/hide
Query:  MADEISSAGTGDGFNRRFSAAENPQFPLQPIDSASEISTDENKFPSIASNQKQDFEVMINGKNSASGFSANVFHCAAEENPEITETSVEKMVVCDSA---
        MADEISS    DGFN +F ++E PQ P + +DSA +IS D + FP I SNQ  D EV IN   SAS          A+E+PE   TSV+KMV+CDSA   
Subjt:  MADEISSAGTGDGFNRRFSAAENPQFPLQPIDSASEISTDENKFPSIASNQKQDFEVMINGKNSASGFSANVFHCAAEENPEITETSVEKMVVCDSA---

Query:  SENGGSITSLV-DEVRNLDSGLEIEKESSKVDDVRHFETFDAVEDGNQEVAVDEVEEKDFARSMPSLDRNQDFAKKELVQEVQLSAAIEADGKEAFARSE
        SENGG++ SLV  +++NLD  LE+ KE  KVD V  F T D  EDG Q+VAVDEV+ KDFARS+ SLD NQD AK+ELV+E QL+A      KEAFAR+E
Subjt:  SENGGSITSLV-DEVRNLDSGLEIEKESSKVDDVRHFETFDAVEDGNQEVAVDEVEEKDFARSMPSLDRNQDFAKKELVQEVQLSAAIEADGKEAFARSE

Query:  ELLRKEAASASFLEMKKKLLLEELEAMLV------LEEKGKNPPNSEGIVDNCSMIPIV-KGKIADQQNV-SENMNVLRRSNLSLRNSLKIEVIDETALV
        +LL+KE  S S LEMKKKLLLE+++AMLV      L+E G NPP+S GIVD C    ++ + KIADQQN  SE MNVLRRS+LSLRNSLKIEVIDETALV
Subjt:  ELLRKEAASASFLEMKKKLLLEELEAMLV------LEEKGKNPPNSEGIVDNCSMIPIV-KGKIADQQNV-SENMNVLRRSNLSLRNSLKIEVIDETALV

Query:  EPVHVSEIGNGEEMGIVCPSRSMQIKLNKYVEPEKGGKKAKRSRRRAREAKIPEIHGKLWNVNELDKFNARQKNMEGNKIVYSRKDMEALRFVNVAEQRR
        EPVHVS IGNGE +GIVCP+RSMQ+K+NK  EP+KGGKKAK+SRR+ARE K+ E+H  + N+NE+DK N RQ+N EGNKIVYSRKDMEALRFVNVAEQ+R
Subjt:  EPVHVSEIGNGEEMGIVCPSRSMQIKLNKYVEPEKGGKKAKRSRRRAREAKIPEIHGKLWNVNELDKFNARQKNMEGNKIVYSRKDMEALRFVNVAEQRR

Query:  LWKAICKELMPVVAREYSSLTSSNSQMKIGSTSDSKQHLEKRDEASSIIREGCSESLDGEIEDMEGDSEIKNFVSSDPSCSLSVSEDGNTILDECSGDSD
        LWKAICKEL+PVVAREYSSLT     +K GSTSD +Q L KR+EASSIIREGCSESLDGEIEDM GD EI NFV S+PSCSLS              DSD
Subjt:  LWKAICKELMPVVAREYSSLTSSNSQMKIGSTSDSKQHLEKRDEASSIIREGCSESLDGEIEDMEGDSEIKNFVSSDPSCSLSVSEDGNTILDECSGDSD

Query:  DDKYYHSIQRPAFLVEGEPDFDSGPPEDGLEYLRRVRWEASRIPNVTVAKVDTSNLKKEQSVYMPVIPAIADCPQHLLPSKQWEDAFLADFSQLRQALSQ
        DDKYYHSIQRPAF VEGEP+FDSGPPEDGLEYLRRVRWEAS IPNVTVAKVD SN KKEQSVYMPVIPAIA CP+HLLPSK+WE+AFLADFS+LRQALS 
Subjt:  DDKYYHSIQRPAFLVEGEPDFDSGPPEDGLEYLRRVRWEASRIPNVTVAKVDTSNLKKEQSVYMPVIPAIADCPQHLLPSKQWEDAFLADFSQLRQALSQ

Query:  SEG-FIQSDFILHEKIDLISPNLIDQPSVLPANNIDSQQPEKPIANTIAKESNCMDHPSVSAISKMSSVFRVSSLRKRINLVETHTTLSRADCAWLFALS
        SE   ++SDFILHEKID + PNLI QPSVLPAN+ DS Q ++   +T AKE +C D+PS+SAISKM+ +FRVSSLRKRIN  ET TTLSRADC WLFALS
Subjt:  SEG-FIQSDFILHEKIDLISPNLIDQPSVLPANNIDSQQPEKPIANTIAKESNCMDHPSVSAISKMSSVFRVSSLRKRINLVETHTTLSRADCAWLFALS

Query:  AAIDTPLDADTCAAFRSLLRKCASLRAEKTELDDEVIMLNILAAISGRYFGQSEN
        AA+DTPLD DTCAAFRSLLRKCASLRAEKTE+D+EVIMLNIL+ ISGRYF QSEN
Subjt:  AAIDTPLDADTCAAFRSLLRKCASLRAEKTELDDEVIMLNILAAISGRYFGQSEN

XP_008454478.1 PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC103494875 [Cucumis melo]0.069.36Show/hide
Query:  MADEISSAGTGDGFNRRFSAAENPQFPLQPIDSASEISTDENKFPSIASNQKQDFEVMINGKNSASGFSANVFHCAAEENPEITETSVEKMVVCDSA---
        MADEI+S    DGFN +F ++ENPQ P +P+DSA  IS D + FP I SN+  D EV IN   SAS           +ENPE   +SV+KMV+CDSA   
Subjt:  MADEISSAGTGDGFNRRFSAAENPQFPLQPIDSASEISTDENKFPSIASNQKQDFEVMINGKNSASGFSANVFHCAAEENPEITETSVEKMVVCDSA---

Query:  SENGGSITSLV-DEVRNLDSGLEIEKESSKVDDVRHFETFDAVEDGNQEVAVDEVEEKDFARSMPSLDRNQDFAKKELVQEVQLSAAIEADGKEAFARSE
        SENGGS+ SLV  +++NLD  +E+ KES KVD V  FET D  ED  QEVAVDEV+ KDFARS+ S D NQD AK+ELVQE QL+A      KEAFAR+E
Subjt:  SENGGSITSLV-DEVRNLDSGLEIEKESSKVDDVRHFETFDAVEDGNQEVAVDEVEEKDFARSMPSLDRNQDFAKKELVQEVQLSAAIEADGKEAFARSE

Query:  ELLRKEAASASFLEMKKKLLLEELEAMLV------LEEKGKNPPNSEGIVDNCSMIPIV-KGKIADQQNVSENMNVLRRSNLSLRNSLKIEVIDETALVE
        +LL+KE  S S LEMKKKLLLE+++AMLV      L+E G NPP+S GIVD C    ++ + KIADQQN SE MNVLRRS+LSLRNSLKIEVIDETALVE
Subjt:  ELLRKEAASASFLEMKKKLLLEELEAMLV------LEEKGKNPPNSEGIVDNCSMIPIV-KGKIADQQNVSENMNVLRRSNLSLRNSLKIEVIDETALVE

Query:  PVHVSEIGNGEEMGIVCPSRSMQIKLNKYVEPEKGGKKAKRSRRRAREAKIPEIHGKLWNVNELDKFNARQKNMEGNKIVYSRKDMEALRFVNVAEQRRL
        PVHVS IGNG+ +GIVCP+RSMQ+++ K  EP+KGGKK  +SRR+ARE K+ E+H  +WNVNE+DK + RQ+N EGNKI+YSRKDMEALRFVNVAEQ+RL
Subjt:  PVHVSEIGNGEEMGIVCPSRSMQIKLNKYVEPEKGGKKAKRSRRRAREAKIPEIHGKLWNVNELDKFNARQKNMEGNKIVYSRKDMEALRFVNVAEQRRL

Query:  WKAICKELMPVVAREYSSLTSSNSQMKIGSTSDSKQHLEKRDEASSIIREGCSESLDGEIEDMEGDSEIKNFVSSDPSCSLSVSEDGNTILDECSGDSDD
        WKAICKEL+PVVAREYSSLT     +K GSTSD +Q L KR+EASSIIREGCSESLDGEIEDMEGD+EI NFV S+PSCSLS              DSDD
Subjt:  WKAICKELMPVVAREYSSLTSSNSQMKIGSTSDSKQHLEKRDEASSIIREGCSESLDGEIEDMEGDSEIKNFVSSDPSCSLSVSEDGNTILDECSGDSDD

Query:  DKYYHSIQRPAFLVEGEPDFDSGPPEDGLEYLRRVRWEASRIPNVTVAKVDTSNLKKEQSVYMPVIPAIADCPQHLLPSKQWEDAFLADFSQLRQALSQS
        DKYYHSIQRPAFLVEGEP+FDSGPPEDGLEYLRRVRWEAS IPNVTVAKVD SN KKEQSVYMPVIPAIA CP+HLLPSK+WE+AFLADFS+LRQALS S
Subjt:  DKYYHSIQRPAFLVEGEPDFDSGPPEDGLEYLRRVRWEASRIPNVTVAKVDTSNLKKEQSVYMPVIPAIADCPQHLLPSKQWEDAFLADFSQLRQALSQS

Query:  EG-FIQSDFILHEKIDLISPNLIDQPSVLPANNIDSQQPEKPIANTIAKESNCMDHPSVSAISKMSSVFRVSSLRKRINLVETHTTLSRADCAWLFALSA
        E   ++SDFILHEKID + PNLI QPSVLPA++ D  QPE+   +T AKE +C D+PS+SAISKM+ +FRVSSLRKRIN  ET TTLSRADC WLFALSA
Subjt:  EG-FIQSDFILHEKIDLISPNLIDQPSVLPANNIDSQQPEKPIANTIAKESNCMDHPSVSAISKMSSVFRVSSLRKRINLVETHTTLSRADCAWLFALSA

Query:  AIDTPLDADTCAAFRSLLRKCASLRAEKTELDDEVIMLNILAAISGRYFGQSEN
        A+DTPLD DTCAAFRSLLRKCASLRAEKTE+DDEVIMLNIL+ ISGRYFGQSEN
Subjt:  AIDTPLDADTCAAFRSLLRKCASLRAEKTELDDEVIMLNILAAISGRYFGQSEN

XP_022145184.1 uncharacterized protein LOC111014690 isoform X1 [Momordica charantia]0.0100Show/hide
Query:  MADEISSAGTGDGFNRRFSAAENPQFPLQPIDSASEISTDENKFPSIASNQKQDFEVMINGKNSASGFSANVFHCAAEENPEITETSVEKMVVCDSASEN
        MADEISSAGTGDGFNRRFSAAENPQFPLQPIDSASEISTDENKFPSIASNQKQDFEVMINGKNSASGFSANVFHCAAEENPEITETSVEKMVVCDSASEN
Subjt:  MADEISSAGTGDGFNRRFSAAENPQFPLQPIDSASEISTDENKFPSIASNQKQDFEVMINGKNSASGFSANVFHCAAEENPEITETSVEKMVVCDSASEN

Query:  GGSITSLVDEVRNLDSGLEIEKESSKVDDVRHFETFDAVEDGNQEVAVDEVEEKDFARSMPSLDRNQDFAKKELVQEVQLSAAIEADGKEAFARSEELLR
        GGSITSLVDEVRNLDSGLEIEKESSKVDDVRHFETFDAVEDGNQEVAVDEVEEKDFARSMPSLDRNQDFAKKELVQEVQLSAAIEADGKEAFARSEELLR
Subjt:  GGSITSLVDEVRNLDSGLEIEKESSKVDDVRHFETFDAVEDGNQEVAVDEVEEKDFARSMPSLDRNQDFAKKELVQEVQLSAAIEADGKEAFARSEELLR

Query:  KEAASASFLEMKKKLLLEELEAMLVLEEKGKNPPNSEGIVDNCSMIPIVKGKIADQQNVSENMNVLRRSNLSLRNSLKIEVIDETALVEPVHVSEIGNGE
        KEAASASFLEMKKKLLLEELEAMLVLEEKGKNPPNSEGIVDNCSMIPIVKGKIADQQNVSENMNVLRRSNLSLRNSLKIEVIDETALVEPVHVSEIGNGE
Subjt:  KEAASASFLEMKKKLLLEELEAMLVLEEKGKNPPNSEGIVDNCSMIPIVKGKIADQQNVSENMNVLRRSNLSLRNSLKIEVIDETALVEPVHVSEIGNGE

Query:  EMGIVCPSRSMQIKLNKYVEPEKGGKKAKRSRRRAREAKIPEIHGKLWNVNELDKFNARQKNMEGNKIVYSRKDMEALRFVNVAEQRRLWKAICKELMPV
        EMGIVCPSRSMQIKLNKYVEPEKGGKKAKRSRRRAREAKIPEIHGKLWNVNELDKFNARQKNMEGNKIVYSRKDMEALRFVNVAEQRRLWKAICKELMPV
Subjt:  EMGIVCPSRSMQIKLNKYVEPEKGGKKAKRSRRRAREAKIPEIHGKLWNVNELDKFNARQKNMEGNKIVYSRKDMEALRFVNVAEQRRLWKAICKELMPV

Query:  VAREYSSLTSSNSQMKIGSTSDSKQHLEKRDEASSIIREGCSESLDGEIEDMEGDSEIKNFVSSDPSCSLSVSEDGNTILDECSGDSDDDKYYHSIQRPA
        VAREYSSLTSSNSQMKIGSTSDSKQHLEKRDEASSIIREGCSESLDGEIEDMEGDSEIKNFVSSDPSCSLSVSEDGNTILDECSGDSDDDKYYHSIQRPA
Subjt:  VAREYSSLTSSNSQMKIGSTSDSKQHLEKRDEASSIIREGCSESLDGEIEDMEGDSEIKNFVSSDPSCSLSVSEDGNTILDECSGDSDDDKYYHSIQRPA

Query:  FLVEGEPDFDSGPPEDGLEYLRRVRWEASRIPNVTVAKVDTSNLKKEQSVYMPVIPAIADCPQHLLPSKQWEDAFLADFSQLRQALSQSEGFIQSDFILH
        FLVEGEPDFDSGPPEDGLEYLRRVRWEASRIPNVTVAKVDTSNLKKEQSVYMPVIPAIADCPQHLLPSKQWEDAFLADFSQLRQALSQSEGFIQSDFILH
Subjt:  FLVEGEPDFDSGPPEDGLEYLRRVRWEASRIPNVTVAKVDTSNLKKEQSVYMPVIPAIADCPQHLLPSKQWEDAFLADFSQLRQALSQSEGFIQSDFILH

Query:  EKIDLISPNLIDQPSVLPANNIDSQQPEKPIANTIAKESNCMDHPSVSAISKMSSVFRVSSLRKRINLVETHTTLSRADCAWLFALSAAIDTPLDADTCA
        EKIDLISPNLIDQPSVLPANNIDSQQPEKPIANTIAKESNCMDHPSVSAISKMSSVFRVSSLRKRINLVETHTTLSRADCAWLFALSAAIDTPLDADTCA
Subjt:  EKIDLISPNLIDQPSVLPANNIDSQQPEKPIANTIAKESNCMDHPSVSAISKMSSVFRVSSLRKRINLVETHTTLSRADCAWLFALSAAIDTPLDADTCA

Query:  AFRSLLRKCASLRAEKTELDDEVIMLNILAAISGRYFGQSEN
        AFRSLLRKCASLRAEKTELDDEVIMLNILAAISGRYFGQSEN
Subjt:  AFRSLLRKCASLRAEKTELDDEVIMLNILAAISGRYFGQSEN

XP_022145185.1 uncharacterized protein LOC111014690 isoform X2 [Momordica charantia]0.099.85Show/hide
Query:  MADEISSAGTGDGFNRRFSAAENPQFPLQPIDSASEISTDENKFPSIASNQKQDFEVMINGKNSASGFSANVFHCAAEENPEITETSVEKMVVCDSASEN
        MADEISSAGTGDGFNRRFSAAENPQFPLQPIDSASEISTDENKFPSIASNQKQDFEVMINGKNSASGFSANVFHCAAEENPEITETSVEKMVVCDSASEN
Subjt:  MADEISSAGTGDGFNRRFSAAENPQFPLQPIDSASEISTDENKFPSIASNQKQDFEVMINGKNSASGFSANVFHCAAEENPEITETSVEKMVVCDSASEN

Query:  GGSITSLVDEVRNLDSGLEIEKESSKVDDVRHFETFDAVEDGNQEVAVDEVEEKDFARSMPSLDRNQDFAKKELVQEVQLSAAIEADGKEAFARSEELLR
        GGSITSLVDEVRNLDSGLEIEKESSKVDDVRHFETFDAVEDGNQEVAVDEVEEKDFARSMPSLDRNQDFAKKELVQEVQLSAAIEADGKEAFARSEELLR
Subjt:  GGSITSLVDEVRNLDSGLEIEKESSKVDDVRHFETFDAVEDGNQEVAVDEVEEKDFARSMPSLDRNQDFAKKELVQEVQLSAAIEADGKEAFARSEELLR

Query:  KEAASASFLEMKKKLLLEELEAMLVLEEKGKNPPNSEGIVDNCSMIPIVKGKIADQQNVSENMNVLRRSNLSLRNSLKIEVIDETALVEPVHVSEIGNGE
        KEAASASFLEMKKKLLLEELEAMLVLEEKGKNPPNSEGIVDNCSMIPIVKGKIADQQNVSENMNVLRRSNLSLRNSLKIEVIDETALVEPVHVSEIGNGE
Subjt:  KEAASASFLEMKKKLLLEELEAMLVLEEKGKNPPNSEGIVDNCSMIPIVKGKIADQQNVSENMNVLRRSNLSLRNSLKIEVIDETALVEPVHVSEIGNGE

Query:  EMGIVCPSRSMQIKLNKYVEPEKGGKKAKRSRRRAREAKIPEIHGKLWNVNELDKFNARQKNMEGNKIVYSRKDMEALRFVNVAEQRRLWKAICKELMPV
        EMGIVCPSRSMQIKLNKYVEPEKGGKKAKRSRRRAREAKIPEIHGKLWNVNELDKFNARQKNMEGNKIVYSRKDMEALRFVNVAEQRRLWKAICKELMPV
Subjt:  EMGIVCPSRSMQIKLNKYVEPEKGGKKAKRSRRRAREAKIPEIHGKLWNVNELDKFNARQKNMEGNKIVYSRKDMEALRFVNVAEQRRLWKAICKELMPV

Query:  VAREYSSLTSSNSQMKIGSTSDSKQHLEKRDEASSIIREGCSESLDGEIEDMEGDSEIKNFVSSDPSCSLSVSEDGNTILDECSGDSDDDKYYHSIQRPA
        VAREYSSLTSSNSQMKIGSTSDSKQHLEKRDEASSIIREGCSESLDGEIEDMEGDSEIKNFVSSDPSCSLSVSEDGNTILDECSGDSDDDKYYHSIQRPA
Subjt:  VAREYSSLTSSNSQMKIGSTSDSKQHLEKRDEASSIIREGCSESLDGEIEDMEGDSEIKNFVSSDPSCSLSVSEDGNTILDECSGDSDDDKYYHSIQRPA

Query:  FLVEGEPDFDSGPPEDGLEYLRRVRWEASRIPNVTVAKVDTSNLKKEQSVYMPVIPAIADCPQHLLPSKQWEDAFLADFSQLRQALSQSEGFIQSDFILH
        FLVEGEPDFDSGPPEDGLEYLRRVRWEASRIPNVTVAKVDTSNLKKEQSVYMPVIPAIADCPQHLLPSKQWEDAFLADFSQLRQALSQSEGFIQSDFILH
Subjt:  FLVEGEPDFDSGPPEDGLEYLRRVRWEASRIPNVTVAKVDTSNLKKEQSVYMPVIPAIADCPQHLLPSKQWEDAFLADFSQLRQALSQSEGFIQSDFILH

Query:  EKIDLISPNLIDQPSVLPANNIDSQQPEKPIANTIAKESNCMDHPSVSAISKMSSVFRVSSLRKRINLV
        EKIDLISPNLIDQPSVLPANNIDSQQPEKPIANTIAKESNCMDHPSVSAISKMSSVFRVSSLRKRINL+
Subjt:  EKIDLISPNLIDQPSVLPANNIDSQQPEKPIANTIAKESNCMDHPSVSAISKMSSVFRVSSLRKRINLV

XP_038901998.1 uncharacterized protein LOC120088652 [Benincasa hispida]0.071.3Show/hide
Query:  MADEISSAGTGDGFNRRFSAAENPQFPLQPIDSASEISTDENKFPSIASNQKQDFEVMINGKNSASGFSANVFHCAAEENPEITETSVEKMVVCDSASEN
        MADEISS   GDGFN +FS +EN Q   +PIDSA +IS D+  FP I SNQ QD EV IN   SAS           +ENPE   TSV K   C S SEN
Subjt:  MADEISSAGTGDGFNRRFSAAENPQFPLQPIDSASEISTDENKFPSIASNQKQDFEVMINGKNSASGFSANVFHCAAEENPEITETSVEKMVVCDSASEN

Query:  GGSITSLV-DEVRNLDSGLEIEKESSKVDDVRHFETFDAVEDGNQEVAVDEVEEKDFARSMPSLDRNQDFAKKELVQEVQLSAAIEADGKEAFARSEELL
        GG++ SLV  +++NLD  +E+ KE  KVD V  FET DAVEDG Q+VA+DEVE KDFARS+ S D N D +K+ELVQEVQL+A      KEAFAR+EELL
Subjt:  GGSITSLV-DEVRNLDSGLEIEKESSKVDDVRHFETFDAVEDGNQEVAVDEVEEKDFARSMPSLDRNQDFAKKELVQEVQLSAAIEADGKEAFARSEELL

Query:  RKEAASASFLEMKKKLLLEELEAMLVLE-----EKGKNPPNSEGIVDNCSM-IPIVKGKIADQQNVSENMNVLRRSNLSLRNSLKIEVIDETALVEPVHV
        +KE    S LE+KKKLLLEEL+AMLV       EKG NPP+S G VD+CS  I I + KIAD+QN SE MNVLRRS+LSLRNSLKIEVIDETALVEPVHV
Subjt:  RKEAASASFLEMKKKLLLEELEAMLVLE-----EKGKNPPNSEGIVDNCSM-IPIVKGKIADQQNVSENMNVLRRSNLSLRNSLKIEVIDETALVEPVHV

Query:  SEIGNGEEMGIVCPSRSMQIKLNKYVEPEKGGKKAKRSRRRAREAKIPEIHGKLWNVNELDKFNARQKNMEGNKIVYSRKDMEALRFVNVAEQRRLWKAI
        S+IGNGE +GIVCP RSMQ+K+NK  EP++GGKKAKRSRR+AREAK+ E++  L NVNELDK N RQK  EGNKIVYSRKDMEALRFVNVAEQRRLWKAI
Subjt:  SEIGNGEEMGIVCPSRSMQIKLNKYVEPEKGGKKAKRSRRRAREAKIPEIHGKLWNVNELDKFNARQKNMEGNKIVYSRKDMEALRFVNVAEQRRLWKAI

Query:  CKELMPVVAREYSSLTSSNSQMKIGSTSDSKQHLEKRDEASSIIREGCSESLDGEIEDMEGDSEIKNFVSSDPSCSLSVSEDGNTILDECSGDSDDDKYY
        CKEL+P VAREYSSLTSSN  MKIGSTSD +Q L KR+EASSIIREGCSESLDGEIEDMEGD+E  NFV  +PSCS SVSED            D+DKYY
Subjt:  CKELMPVVAREYSSLTSSNSQMKIGSTSDSKQHLEKRDEASSIIREGCSESLDGEIEDMEGDSEIKNFVSSDPSCSLSVSEDGNTILDECSGDSDDDKYY

Query:  HSIQRPAFLVEGEPDFDSGPPEDGLEYLRRVRWEASRIPNVTVAKVDTSNLKKEQSVYMPVIPAIADCPQHLLPSKQWEDAFLADFSQLRQALSQSEGFI
         SIQRPAFLVEGEP+FDSGPPEDGLEYLRRVRWEAS IPNVT+AKVD SN KKEQSVYMPVIP IA CP HLLPSK+WE+AFLADFS LR+ALS SE F 
Subjt:  HSIQRPAFLVEGEPDFDSGPPEDGLEYLRRVRWEASRIPNVTVAKVDTSNLKKEQSVYMPVIPAIADCPQHLLPSKQWEDAFLADFSQLRQALSQSEGFI

Query:  QSDFILHEKIDLISPNLIDQPSVLPANNIDSQQPEKPIANTIAKESNCMDHPSVSAISKMSSVFRVSSLRKRINLVETHTTLSRADCAWLFALSAAIDTP
        QSDFILHEKID   P+LI QP VLPA NIDS Q E+   +T AKE++C D+PS+SAISKM+SVFRVSSL+KRIN +ET TTLS+ DC WLFALSAA+DTP
Subjt:  QSDFILHEKIDLISPNLIDQPSVLPANNIDSQQPEKPIANTIAKESNCMDHPSVSAISKMSSVFRVSSLRKRINLVETHTTLSRADCAWLFALSAAIDTP

Query:  LDADTCAAFRSLLRKCASLRAEKTELDDEVIMLNILAAISGRYFGQSEN
        LDADTCAAFRSLLRKCASLRA+KTELDDEVIMLNIL+ ISGRYFGQSEN
Subjt:  LDADTCAAFRSLLRKCASLRAEKTELDDEVIMLNILAAISGRYFGQSEN

TrEMBL top hitse value%identityAlignment
A0A0A0KXG5 Uncharacterized protein0.069.27Show/hide
Query:  MADEISSAGTGDGFNRRFSAAENPQFPLQPIDSASEISTDENKFPSIASNQKQDFEVMINGKNSASGFSANVFHCAAEENPEITETSVEKMVVCDSA---
        MADEISS    DGFN +F ++E PQ P + +DSA +IS D + FP I SNQ  D EV IN   SAS          A+E+PE   TSV+KMV+CDSA   
Subjt:  MADEISSAGTGDGFNRRFSAAENPQFPLQPIDSASEISTDENKFPSIASNQKQDFEVMINGKNSASGFSANVFHCAAEENPEITETSVEKMVVCDSA---

Query:  SENGGSITSLV-DEVRNLDSGLEIEKESSKVDDVRHFETFDAVEDGNQEVAVDEVEEKDFARSMPSLDRNQDFAKKELVQEVQLSAAIEADGKEAFARSE
        SENGG++ SLV  +++NLD  LE+ KE  KVD V  F T D  EDG Q+VAVDEV+ KDFARS+ SLD NQD AK+ELV+E QL+A      KEAFAR+E
Subjt:  SENGGSITSLV-DEVRNLDSGLEIEKESSKVDDVRHFETFDAVEDGNQEVAVDEVEEKDFARSMPSLDRNQDFAKKELVQEVQLSAAIEADGKEAFARSE

Query:  ELLRKEAASASFLEMKKKLLLEELEAMLV------LEEKGKNPPNSEGIVDNCSMIPIV-KGKIADQQNV-SENMNVLRRSNLSLRNSLKIEVIDETALV
        +LL+KE  S S LEMKKKLLLE+++AMLV      L+E G NPP+S GIVD C    ++ + KIADQQN  SE MNVLRRS+LSLRNSLKIEVIDETALV
Subjt:  ELLRKEAASASFLEMKKKLLLEELEAMLV------LEEKGKNPPNSEGIVDNCSMIPIV-KGKIADQQNV-SENMNVLRRSNLSLRNSLKIEVIDETALV

Query:  EPVHVSEIGNGEEMGIVCPSRSMQIKLNKYVEPEKGGKKAKRSRRRAREAKIPEIHGKLWNVNELDKFNARQKNMEGNKIVYSRKDMEALRFVNVAEQRR
        EPVHVS IGNGE +GIVCP+RSMQ+K+NK  EP+KGGKKAK+SRR+ARE K+ E+H  + N+NE+DK N RQ+N EGNKIVYSRKDMEALRFVNVAEQ+R
Subjt:  EPVHVSEIGNGEEMGIVCPSRSMQIKLNKYVEPEKGGKKAKRSRRRAREAKIPEIHGKLWNVNELDKFNARQKNMEGNKIVYSRKDMEALRFVNVAEQRR

Query:  LWKAICKELMPVVAREYSSLTSSNSQMKIGSTSDSKQHLEKRDEASSIIREGCSESLDGEIEDMEGDSEIKNFVSSDPSCSLSVSEDGNTILDECSGDSD
        LWKAICKEL+PVVAREYSSLT     +K GSTSD +Q L KR+EASSIIREGCSESLDGEIEDM GD EI NFV S+PSCSLS              DSD
Subjt:  LWKAICKELMPVVAREYSSLTSSNSQMKIGSTSDSKQHLEKRDEASSIIREGCSESLDGEIEDMEGDSEIKNFVSSDPSCSLSVSEDGNTILDECSGDSD

Query:  DDKYYHSIQRPAFLVEGEPDFDSGPPEDGLEYLRRVRWEASRIPNVTVAKVDTSNLKKEQSVYMPVIPAIADCPQHLLPSKQWEDAFLADFSQLRQALSQ
        DDKYYHSIQRPAF VEGEP+FDSGPPEDGLEYLRRVRWEAS IPNVTVAKVD SN KKEQSVYMPVIPAIA CP+HLLPSK+WE+AFLADFS+LRQALS 
Subjt:  DDKYYHSIQRPAFLVEGEPDFDSGPPEDGLEYLRRVRWEASRIPNVTVAKVDTSNLKKEQSVYMPVIPAIADCPQHLLPSKQWEDAFLADFSQLRQALSQ

Query:  SEG-FIQSDFILHEKIDLISPNLIDQPSVLPANNIDSQQPEKPIANTIAKESNCMDHPSVSAISKMSSVFRVSSLRKRINLVETHTTLSRADCAWLFALS
        SE   ++SDFILHEKID + PNLI QPSVLPAN+ DS Q ++   +T AKE +C D+PS+SAISKM+ +FRVSSLRKRIN  ET TTLSRADC WLFALS
Subjt:  SEG-FIQSDFILHEKIDLISPNLIDQPSVLPANNIDSQQPEKPIANTIAKESNCMDHPSVSAISKMSSVFRVSSLRKRINLVETHTTLSRADCAWLFALS

Query:  AAIDTPLDADTCAAFRSLLRKCASLRAEKTELDDEVIMLNILAAISGRYFGQSEN
        AA+DTPLD DTCAAFRSLLRKCASLRAEKTE+D+EVIMLNIL+ ISGRYF QSEN
Subjt:  AAIDTPLDADTCAAFRSLLRKCASLRAEKTELDDEVIMLNILAAISGRYFGQSEN

A0A1S3BZY0 LOW QUALITY PROTEIN: uncharacterized protein LOC1034948750.069.36Show/hide
Query:  MADEISSAGTGDGFNRRFSAAENPQFPLQPIDSASEISTDENKFPSIASNQKQDFEVMINGKNSASGFSANVFHCAAEENPEITETSVEKMVVCDSA---
        MADEI+S    DGFN +F ++ENPQ P +P+DSA  IS D + FP I SN+  D EV IN   SAS           +ENPE   +SV+KMV+CDSA   
Subjt:  MADEISSAGTGDGFNRRFSAAENPQFPLQPIDSASEISTDENKFPSIASNQKQDFEVMINGKNSASGFSANVFHCAAEENPEITETSVEKMVVCDSA---

Query:  SENGGSITSLV-DEVRNLDSGLEIEKESSKVDDVRHFETFDAVEDGNQEVAVDEVEEKDFARSMPSLDRNQDFAKKELVQEVQLSAAIEADGKEAFARSE
        SENGGS+ SLV  +++NLD  +E+ KES KVD V  FET D  ED  QEVAVDEV+ KDFARS+ S D NQD AK+ELVQE QL+A      KEAFAR+E
Subjt:  SENGGSITSLV-DEVRNLDSGLEIEKESSKVDDVRHFETFDAVEDGNQEVAVDEVEEKDFARSMPSLDRNQDFAKKELVQEVQLSAAIEADGKEAFARSE

Query:  ELLRKEAASASFLEMKKKLLLEELEAMLV------LEEKGKNPPNSEGIVDNCSMIPIV-KGKIADQQNVSENMNVLRRSNLSLRNSLKIEVIDETALVE
        +LL+KE  S S LEMKKKLLLE+++AMLV      L+E G NPP+S GIVD C    ++ + KIADQQN SE MNVLRRS+LSLRNSLKIEVIDETALVE
Subjt:  ELLRKEAASASFLEMKKKLLLEELEAMLV------LEEKGKNPPNSEGIVDNCSMIPIV-KGKIADQQNVSENMNVLRRSNLSLRNSLKIEVIDETALVE

Query:  PVHVSEIGNGEEMGIVCPSRSMQIKLNKYVEPEKGGKKAKRSRRRAREAKIPEIHGKLWNVNELDKFNARQKNMEGNKIVYSRKDMEALRFVNVAEQRRL
        PVHVS IGNG+ +GIVCP+RSMQ+++ K  EP+KGGKK  +SRR+ARE K+ E+H  +WNVNE+DK + RQ+N EGNKI+YSRKDMEALRFVNVAEQ+RL
Subjt:  PVHVSEIGNGEEMGIVCPSRSMQIKLNKYVEPEKGGKKAKRSRRRAREAKIPEIHGKLWNVNELDKFNARQKNMEGNKIVYSRKDMEALRFVNVAEQRRL

Query:  WKAICKELMPVVAREYSSLTSSNSQMKIGSTSDSKQHLEKRDEASSIIREGCSESLDGEIEDMEGDSEIKNFVSSDPSCSLSVSEDGNTILDECSGDSDD
        WKAICKEL+PVVAREYSSLT     +K GSTSD +Q L KR+EASSIIREGCSESLDGEIEDMEGD+EI NFV S+PSCSLS              DSDD
Subjt:  WKAICKELMPVVAREYSSLTSSNSQMKIGSTSDSKQHLEKRDEASSIIREGCSESLDGEIEDMEGDSEIKNFVSSDPSCSLSVSEDGNTILDECSGDSDD

Query:  DKYYHSIQRPAFLVEGEPDFDSGPPEDGLEYLRRVRWEASRIPNVTVAKVDTSNLKKEQSVYMPVIPAIADCPQHLLPSKQWEDAFLADFSQLRQALSQS
        DKYYHSIQRPAFLVEGEP+FDSGPPEDGLEYLRRVRWEAS IPNVTVAKVD SN KKEQSVYMPVIPAIA CP+HLLPSK+WE+AFLADFS+LRQALS S
Subjt:  DKYYHSIQRPAFLVEGEPDFDSGPPEDGLEYLRRVRWEASRIPNVTVAKVDTSNLKKEQSVYMPVIPAIADCPQHLLPSKQWEDAFLADFSQLRQALSQS

Query:  EG-FIQSDFILHEKIDLISPNLIDQPSVLPANNIDSQQPEKPIANTIAKESNCMDHPSVSAISKMSSVFRVSSLRKRINLVETHTTLSRADCAWLFALSA
        E   ++SDFILHEKID + PNLI QPSVLPA++ D  QPE+   +T AKE +C D+PS+SAISKM+ +FRVSSLRKRIN  ET TTLSRADC WLFALSA
Subjt:  EG-FIQSDFILHEKIDLISPNLIDQPSVLPANNIDSQQPEKPIANTIAKESNCMDHPSVSAISKMSSVFRVSSLRKRINLVETHTTLSRADCAWLFALSA

Query:  AIDTPLDADTCAAFRSLLRKCASLRAEKTELDDEVIMLNILAAISGRYFGQSEN
        A+DTPLD DTCAAFRSLLRKCASLRAEKTE+DDEVIMLNIL+ ISGRYFGQSEN
Subjt:  AIDTPLDADTCAAFRSLLRKCASLRAEKTELDDEVIMLNILAAISGRYFGQSEN

A0A5A7TRY3 Mis18-binding protein 1-like isoform X10.069.34Show/hide
Query:  MADEISSAGTGDGFNRRFSAAENPQFPLQPIDSASEISTDENKFPSIASNQKQDFEVMINGKNSASGFSANVFHCAAEENPEITETSVEKMVVCDSA---
        MADEI+S    DGFN +F ++ENPQ P +P+DSA  IS D + FP I SN+  D EV IN   SAS           +ENPE   +SV+KMV+CDSA   
Subjt:  MADEISSAGTGDGFNRRFSAAENPQFPLQPIDSASEISTDENKFPSIASNQKQDFEVMINGKNSASGFSANVFHCAAEENPEITETSVEKMVVCDSA---

Query:  SENGGSITSLV-DEVRNLDSGLEIEKESSKVDDVRHFETFDAVEDGNQEVAVDEVEEKDFARSMPSLDRNQDFAKKELVQEVQLSAAIEADGKEAFARSE
        SENGGS+ SLV  +++NLD  +E+ KES KVD V  FET D  ED  QEVAVDEV+ KDFARS+ S D NQD AK+ELVQE QL+A      KEAFAR+E
Subjt:  SENGGSITSLV-DEVRNLDSGLEIEKESSKVDDVRHFETFDAVEDGNQEVAVDEVEEKDFARSMPSLDRNQDFAKKELVQEVQLSAAIEADGKEAFARSE

Query:  ELLRKEAASASFLEMKKKLLLEELEAMLV------LEEKGKNPPNSEGIVDNCSMIPIV-KGKIADQQNVSENMNVLRRSNLSLRNSLKIEVIDETALVE
        +LL+KE  S S LEMKKKLLLE+++AMLV      L+E G NPP+S GIVD C    ++ + KIADQQN SE MNVLRRS+LSLRNSLKIEVIDETALVE
Subjt:  ELLRKEAASASFLEMKKKLLLEELEAMLV------LEEKGKNPPNSEGIVDNCSMIPIV-KGKIADQQNVSENMNVLRRSNLSLRNSLKIEVIDETALVE

Query:  PVHVSEIGNGEEMGIVCPSRSMQIKLNKYVEPEKGGKKAKRSRRRAREAKIPEIHGKLWNVNELDKFNARQKNMEGNKIVYSRKDMEALRFVNVAEQRRL
        PVHVS IGNG+ +GIVCP+RSMQ+++ K  EP+KGGKKAK+SRR+ARE K+ E+H  +WNVNE+DK + RQ+N EGNKI+YSRKDMEALRFVNVAEQ+RL
Subjt:  PVHVSEIGNGEEMGIVCPSRSMQIKLNKYVEPEKGGKKAKRSRRRAREAKIPEIHGKLWNVNELDKFNARQKNMEGNKIVYSRKDMEALRFVNVAEQRRL

Query:  WKAICKELMPVVAREYSSLTSSNSQMKIGSTSDSKQHLEKRDEASSIIREGCSESLDGEIEDMEGDSEIKNFVSSDPSCSLSVSEDGNTILDECSGDSDD
        WKAICKEL+PVVAREYSSLT     +K GSTSD +Q L KR+EASSIIREGCSESLDGEIEDMEGD+EI NFV S+PSCSLS              DSDD
Subjt:  WKAICKELMPVVAREYSSLTSSNSQMKIGSTSDSKQHLEKRDEASSIIREGCSESLDGEIEDMEGDSEIKNFVSSDPSCSLSVSEDGNTILDECSGDSDD

Query:  DKYYHSIQRPAFLVEGEPDFDSGPPEDGLEYLRRVRWEASRIPNVTVAKVDTSNLKKEQSVYMPVIPAIADCPQHLLPSKQWEDAFLADFSQLRQALSQS
        DKYYHSIQRPAFLVEGEP+FDSGPPEDGLEYLRRVRWEAS IPNVTVAKVD SN KKEQSVYMPVIPAIA CP+HLLPSK+WE+AFLADFS+LRQALS S
Subjt:  DKYYHSIQRPAFLVEGEPDFDSGPPEDGLEYLRRVRWEASRIPNVTVAKVDTSNLKKEQSVYMPVIPAIADCPQHLLPSKQWEDAFLADFSQLRQALSQS

Query:  EG-FIQSDFILHEKIDLISPNLIDQPSVLPANNIDSQQPEKPIANTIAKESNCMDHPSVSAISKMSSVFRVSSLRKRINLVETHTTLSRADCAWLFALSA
        E   ++SDFILHEKID + PNLI QPSVLPA++ D  QPE+   +T AKE +C D+PS+SAISKM+ +FRVSSLRKRIN  ET TTLSRADC WLFALSA
Subjt:  EG-FIQSDFILHEKIDLISPNLIDQPSVLPANNIDSQQPEKPIANTIAKESNCMDHPSVSAISKMSSVFRVSSLRKRINLVETHTTLSRADCAWLFALSA

Query:  AIDTPLDADTCAAFRSLLRKCASLRAEKTELDDEVIMLNILAAISGR
        A+DTPLD DTCAAFRSLLRKCASLRAEKTE+DDEVIMLNIL+ ISGR
Subjt:  AIDTPLDADTCAAFRSLLRKCASLRAEKTELDDEVIMLNILAAISGR

A0A6J1CTR6 uncharacterized protein LOC111014690 isoform X20.099.85Show/hide
Query:  MADEISSAGTGDGFNRRFSAAENPQFPLQPIDSASEISTDENKFPSIASNQKQDFEVMINGKNSASGFSANVFHCAAEENPEITETSVEKMVVCDSASEN
        MADEISSAGTGDGFNRRFSAAENPQFPLQPIDSASEISTDENKFPSIASNQKQDFEVMINGKNSASGFSANVFHCAAEENPEITETSVEKMVVCDSASEN
Subjt:  MADEISSAGTGDGFNRRFSAAENPQFPLQPIDSASEISTDENKFPSIASNQKQDFEVMINGKNSASGFSANVFHCAAEENPEITETSVEKMVVCDSASEN

Query:  GGSITSLVDEVRNLDSGLEIEKESSKVDDVRHFETFDAVEDGNQEVAVDEVEEKDFARSMPSLDRNQDFAKKELVQEVQLSAAIEADGKEAFARSEELLR
        GGSITSLVDEVRNLDSGLEIEKESSKVDDVRHFETFDAVEDGNQEVAVDEVEEKDFARSMPSLDRNQDFAKKELVQEVQLSAAIEADGKEAFARSEELLR
Subjt:  GGSITSLVDEVRNLDSGLEIEKESSKVDDVRHFETFDAVEDGNQEVAVDEVEEKDFARSMPSLDRNQDFAKKELVQEVQLSAAIEADGKEAFARSEELLR

Query:  KEAASASFLEMKKKLLLEELEAMLVLEEKGKNPPNSEGIVDNCSMIPIVKGKIADQQNVSENMNVLRRSNLSLRNSLKIEVIDETALVEPVHVSEIGNGE
        KEAASASFLEMKKKLLLEELEAMLVLEEKGKNPPNSEGIVDNCSMIPIVKGKIADQQNVSENMNVLRRSNLSLRNSLKIEVIDETALVEPVHVSEIGNGE
Subjt:  KEAASASFLEMKKKLLLEELEAMLVLEEKGKNPPNSEGIVDNCSMIPIVKGKIADQQNVSENMNVLRRSNLSLRNSLKIEVIDETALVEPVHVSEIGNGE

Query:  EMGIVCPSRSMQIKLNKYVEPEKGGKKAKRSRRRAREAKIPEIHGKLWNVNELDKFNARQKNMEGNKIVYSRKDMEALRFVNVAEQRRLWKAICKELMPV
        EMGIVCPSRSMQIKLNKYVEPEKGGKKAKRSRRRAREAKIPEIHGKLWNVNELDKFNARQKNMEGNKIVYSRKDMEALRFVNVAEQRRLWKAICKELMPV
Subjt:  EMGIVCPSRSMQIKLNKYVEPEKGGKKAKRSRRRAREAKIPEIHGKLWNVNELDKFNARQKNMEGNKIVYSRKDMEALRFVNVAEQRRLWKAICKELMPV

Query:  VAREYSSLTSSNSQMKIGSTSDSKQHLEKRDEASSIIREGCSESLDGEIEDMEGDSEIKNFVSSDPSCSLSVSEDGNTILDECSGDSDDDKYYHSIQRPA
        VAREYSSLTSSNSQMKIGSTSDSKQHLEKRDEASSIIREGCSESLDGEIEDMEGDSEIKNFVSSDPSCSLSVSEDGNTILDECSGDSDDDKYYHSIQRPA
Subjt:  VAREYSSLTSSNSQMKIGSTSDSKQHLEKRDEASSIIREGCSESLDGEIEDMEGDSEIKNFVSSDPSCSLSVSEDGNTILDECSGDSDDDKYYHSIQRPA

Query:  FLVEGEPDFDSGPPEDGLEYLRRVRWEASRIPNVTVAKVDTSNLKKEQSVYMPVIPAIADCPQHLLPSKQWEDAFLADFSQLRQALSQSEGFIQSDFILH
        FLVEGEPDFDSGPPEDGLEYLRRVRWEASRIPNVTVAKVDTSNLKKEQSVYMPVIPAIADCPQHLLPSKQWEDAFLADFSQLRQALSQSEGFIQSDFILH
Subjt:  FLVEGEPDFDSGPPEDGLEYLRRVRWEASRIPNVTVAKVDTSNLKKEQSVYMPVIPAIADCPQHLLPSKQWEDAFLADFSQLRQALSQSEGFIQSDFILH

Query:  EKIDLISPNLIDQPSVLPANNIDSQQPEKPIANTIAKESNCMDHPSVSAISKMSSVFRVSSLRKRINLV
        EKIDLISPNLIDQPSVLPANNIDSQQPEKPIANTIAKESNCMDHPSVSAISKMSSVFRVSSLRKRINL+
Subjt:  EKIDLISPNLIDQPSVLPANNIDSQQPEKPIANTIAKESNCMDHPSVSAISKMSSVFRVSSLRKRINLV

A0A6J1CUG9 uncharacterized protein LOC111014690 isoform X10.0100Show/hide
Query:  MADEISSAGTGDGFNRRFSAAENPQFPLQPIDSASEISTDENKFPSIASNQKQDFEVMINGKNSASGFSANVFHCAAEENPEITETSVEKMVVCDSASEN
        MADEISSAGTGDGFNRRFSAAENPQFPLQPIDSASEISTDENKFPSIASNQKQDFEVMINGKNSASGFSANVFHCAAEENPEITETSVEKMVVCDSASEN
Subjt:  MADEISSAGTGDGFNRRFSAAENPQFPLQPIDSASEISTDENKFPSIASNQKQDFEVMINGKNSASGFSANVFHCAAEENPEITETSVEKMVVCDSASEN

Query:  GGSITSLVDEVRNLDSGLEIEKESSKVDDVRHFETFDAVEDGNQEVAVDEVEEKDFARSMPSLDRNQDFAKKELVQEVQLSAAIEADGKEAFARSEELLR
        GGSITSLVDEVRNLDSGLEIEKESSKVDDVRHFETFDAVEDGNQEVAVDEVEEKDFARSMPSLDRNQDFAKKELVQEVQLSAAIEADGKEAFARSEELLR
Subjt:  GGSITSLVDEVRNLDSGLEIEKESSKVDDVRHFETFDAVEDGNQEVAVDEVEEKDFARSMPSLDRNQDFAKKELVQEVQLSAAIEADGKEAFARSEELLR

Query:  KEAASASFLEMKKKLLLEELEAMLVLEEKGKNPPNSEGIVDNCSMIPIVKGKIADQQNVSENMNVLRRSNLSLRNSLKIEVIDETALVEPVHVSEIGNGE
        KEAASASFLEMKKKLLLEELEAMLVLEEKGKNPPNSEGIVDNCSMIPIVKGKIADQQNVSENMNVLRRSNLSLRNSLKIEVIDETALVEPVHVSEIGNGE
Subjt:  KEAASASFLEMKKKLLLEELEAMLVLEEKGKNPPNSEGIVDNCSMIPIVKGKIADQQNVSENMNVLRRSNLSLRNSLKIEVIDETALVEPVHVSEIGNGE

Query:  EMGIVCPSRSMQIKLNKYVEPEKGGKKAKRSRRRAREAKIPEIHGKLWNVNELDKFNARQKNMEGNKIVYSRKDMEALRFVNVAEQRRLWKAICKELMPV
        EMGIVCPSRSMQIKLNKYVEPEKGGKKAKRSRRRAREAKIPEIHGKLWNVNELDKFNARQKNMEGNKIVYSRKDMEALRFVNVAEQRRLWKAICKELMPV
Subjt:  EMGIVCPSRSMQIKLNKYVEPEKGGKKAKRSRRRAREAKIPEIHGKLWNVNELDKFNARQKNMEGNKIVYSRKDMEALRFVNVAEQRRLWKAICKELMPV

Query:  VAREYSSLTSSNSQMKIGSTSDSKQHLEKRDEASSIIREGCSESLDGEIEDMEGDSEIKNFVSSDPSCSLSVSEDGNTILDECSGDSDDDKYYHSIQRPA
        VAREYSSLTSSNSQMKIGSTSDSKQHLEKRDEASSIIREGCSESLDGEIEDMEGDSEIKNFVSSDPSCSLSVSEDGNTILDECSGDSDDDKYYHSIQRPA
Subjt:  VAREYSSLTSSNSQMKIGSTSDSKQHLEKRDEASSIIREGCSESLDGEIEDMEGDSEIKNFVSSDPSCSLSVSEDGNTILDECSGDSDDDKYYHSIQRPA

Query:  FLVEGEPDFDSGPPEDGLEYLRRVRWEASRIPNVTVAKVDTSNLKKEQSVYMPVIPAIADCPQHLLPSKQWEDAFLADFSQLRQALSQSEGFIQSDFILH
        FLVEGEPDFDSGPPEDGLEYLRRVRWEASRIPNVTVAKVDTSNLKKEQSVYMPVIPAIADCPQHLLPSKQWEDAFLADFSQLRQALSQSEGFIQSDFILH
Subjt:  FLVEGEPDFDSGPPEDGLEYLRRVRWEASRIPNVTVAKVDTSNLKKEQSVYMPVIPAIADCPQHLLPSKQWEDAFLADFSQLRQALSQSEGFIQSDFILH

Query:  EKIDLISPNLIDQPSVLPANNIDSQQPEKPIANTIAKESNCMDHPSVSAISKMSSVFRVSSLRKRINLVETHTTLSRADCAWLFALSAAIDTPLDADTCA
        EKIDLISPNLIDQPSVLPANNIDSQQPEKPIANTIAKESNCMDHPSVSAISKMSSVFRVSSLRKRINLVETHTTLSRADCAWLFALSAAIDTPLDADTCA
Subjt:  EKIDLISPNLIDQPSVLPANNIDSQQPEKPIANTIAKESNCMDHPSVSAISKMSSVFRVSSLRKRINLVETHTTLSRADCAWLFALSAAIDTPLDADTCA

Query:  AFRSLLRKCASLRAEKTELDDEVIMLNILAAISGRYFGQSEN
        AFRSLLRKCASLRAEKTELDDEVIMLNILAAISGRYFGQSEN
Subjt:  AFRSLLRKCASLRAEKTELDDEVIMLNILAAISGRYFGQSEN

SwissProt top hitse value%identityAlignment
O14893 Gem-associated protein 21.5e-1329.5Show/hide
Query:  PAFLVEGEPDFD-SGPPEDGLEYLRRVRWEASRIPNVTVAKVDTSNLKKEQSVYMPVIPAIADCPQHLLPSKQWEDAFLADFSQLRQALSQSEGF-----
        P  L EG   FD S PP    EYLRRV+ EA++ P+V VA++D   LK++QSV +  +      P+   P+ QW+   +A FS +RQ +++         
Subjt:  PAFLVEGEPDFD-SGPPEDGLEYLRRVRWEASRIPNVTVAKVDTSNLKKEQSVYMPVIPAIADCPQHLLPSKQWEDAFLADFSQLRQALSQSEGF-----

Query:  IQSDFILHEKID-------LISPNLIDQPSVLPANN----IDSQQPEKPIANTIAKESNCMDHPSVSAISKMSSVFRVSSLRKRINLVETHTTLSRADCA
        + S+  + +  D        +   L    +V PA N    ID  Q   P              P +S +S+M+     S L    N              
Subjt:  IQSDFILHEKID-------LISPNLIDQPSVLPANN----IDSQQPEKPIANTIAKESNCMDHPSVSAISKMSSVFRVSSLRKRINLVETHTTLSRADCA

Query:  WLFALSAAIDTPLDADTCAAFRSLLRKCASLRAEKTELDDE-VIMLNILAAISGRYFGQSE
        WL+AL A ++ PL  +  +  R L R+C+ +R      DDE V  LN+L  +  RYF Q +
Subjt:  WLFALSAAIDTPLDADTCAAFRSLLRKCASLRAEKTELDDE-VIMLNILAAISGRYFGQSE

O42260 Gem-associated protein 22.5e-1629.72Show/hide
Query:  DFD-SGPPEDGLEYLRRVRWEASRIPNVTVAKVDTSNLKKEQSVYMPVIPAIADCPQHLLPSKQWEDAFLADFSQLRQALSQSEGFIQSDFILHEKIDLI
        D+D S PP    EYLRRV+ EA+R P+V +A++D   L+K+Q+V +  +      P    PS +W+   +A FS +RQ+L +  G  +S  +     ++ 
Subjt:  DFD-SGPPEDGLEYLRRVRWEASRIPNVTVAKVDTSNLKKEQSVYMPVIPAIADCPQHLLPSKQWEDAFLADFSQLRQALSQSEGFIQSDFILHEKIDLI

Query:  SPNLIDQPS-------------VLPANNIDSQQPEKPIANTIAKESNCMDHPSVSAISKMSSVFRVSSLRKRINLVETHTTLSRADCAWLFALSAAIDTP
         P+  D+ S             +  A N +SQ P                 P +S +S+MS     S L   +N  E           WL+AL A ++ P
Subjt:  SPNLIDQPS-------------VLPANNIDSQQPEKPIANTIAKESNCMDHPSVSAISKMSSVFRVSSLRKRINLVETHTTLSRADCAWLFALSAAIDTP

Query:  LDADTCAAFRSLLRKCASLRA-EKTELDDEVIMLNILAAISGRYFGQSE
        L  +  +  R L R+C+ +RA  + + DD V  LN+   + GRYF Q +
Subjt:  LDADTCAAFRSLLRKCASLRA-EKTELDDEVIMLNILAAISGRYFGQSE

Q54KN2 Gem-associated protein 22.3e-1423.05Show/hide
Query:  QRPAFLVEGEPDFDSGPPEDGLEYLRRVRWEASRIPNVTVAKVDTSNLKK--EQSVYMPVIPAIADCPQHLLPSKQWEDAFLADFSQLRQAL--------
        Q  AF V  E + D   P  G EYL+RV+W ++R P+V VA +D S +K     + Y  + P+I  C + LLP+  WE  FL DFS+ RQ L        
Subjt:  QRPAFLVEGEPDFDSGPPEDGLEYLRRVRWEASRIPNVTVAKVDTSNLKK--EQSVYMPVIPAIADCPQHLLPSKQWEDAFLADFSQLRQAL--------

Query:  ------------------------------------------------SQSEGFIQSDFILHEKIDLISPNLID-----------------QPSVLPANN
                                                        + +      DF  +++ D    N  D                 +        
Subjt:  ------------------------------------------------SQSEGFIQSDFILHEKIDLISPNLID-----------------QPSVLPANN

Query:  IDSQQPEKPIANTIAKESNCMDHPSVSAISKMSSVFRVSSLRKRINLVETHTTLSRADCAWLFALSAAIDTPLDADTCAAFRSLLRKCASLRAEKTELDD
         + ++ E+ + +   K+    + P++  + ++  V  V+ +   I  +E     ++    WL+ L + ++ P+D DTC+  RS +R+ +  R++ T L+D
Subjt:  IDSQQPEKPIANTIAKESNCMDHPSVSAISKMSSVFRVSSLRKRINLVETHTTLSRADCAWLFALSAAIDTPLDADTCAAFRSLLRKCASLRAEKTELDD

Query:  -EVIMLNILAAISGRYFGQSE
          +  +NIL  I  +YF Q E
Subjt:  -EVIMLNILAAISGRYFGQSE

Q9CQQ4 Gem-associated protein 22.2e-1228.29Show/hide
Query:  PAFLVEGEPDFD-SGPPEDGLEYLRRVRWEASRIPNVTVAKVDTSNLKKEQSVYMPVIPAIADCPQHLLPSKQWEDAFLADFSQLRQALSQSEGFIQSDF
        P  L EG   FD S PP    EYLRRV+ EA++ P+V VA++D   LK++QSV +  +      P+   P+ QW+   +A FS +RQ++ +     +S  
Subjt:  PAFLVEGEPDFD-SGPPEDGLEYLRRVRWEASRIPNVTVAKVDTSNLKKEQSVYMPVIPAIADCPQHLLPSKQWEDAFLADFSQLRQALSQSEGFIQSDF

Query:  ILHEKIDLISPNLIDQPSVLPANNIDSQQPEKPIANTIAKESNCMDH------PSVSAISKMSSVFRVSSLRKRINLVETHTTLSRADCAWLFALSAAID
        +     ++  P   D+        +  +   +       +ES  +D+      P +S +S+M+     S L    N              W +AL A ++
Subjt:  ILHEKIDLISPNLIDQPSVLPANNIDSQQPEKPIANTIAKESNCMDH------PSVSAISKMSSVFRVSSLRKRINLVETHTTLSRADCAWLFALSAAID

Query:  TPLDADTCAAFRSLLRKCASLRAEKTELDDE-VIMLNILAAISGRYFGQSE
         PL  +  +  R L R+C+ +R      DDE V  LN+L  +  RYF Q +
Subjt:  TPLDADTCAAFRSLLRKCASLRAEKTELDDE-VIMLNILAAISGRYFGQSE

Q9QZP1 Gem-associated protein 21.1e-1127.89Show/hide
Query:  PAFLVEGEPDFD-SGPPEDGLEYLRRVRWEASRIPNVTVAKVDTSNLKKEQSVYMPVIPAIADCPQHLLPSKQWEDAFLADFSQLRQALSQSEGFIQSDF
        P  L EG   FD S PP    EYLRRV+ EA++ P+V VA++D   LK++QSV +  +      P+   P+ QW+   +  FS +RQ++ +     +S  
Subjt:  PAFLVEGEPDFD-SGPPEDGLEYLRRVRWEASRIPNVTVAKVDTSNLKKEQSVYMPVIPAIADCPQHLLPSKQWEDAFLADFSQLRQALSQSEGFIQSDF

Query:  ILHEKIDLISPNLIDQPSVLPANNIDSQQPEKPIANTIAKESNCMDH------PSVSAISKMSSVFRVSSLRKRINLVETHTTLSRADCAWLFALSAAID
        +     ++  P   D+        +  +   +        ES  +D+      P +S +S+M+     S L    N              W +AL A ++
Subjt:  ILHEKIDLISPNLIDQPSVLPANNIDSQQPEKPIANTIAKESNCMDH------PSVSAISKMSSVFRVSSLRKRINLVETHTTLSRADCAWLFALSAAID

Query:  TPLDADTCAAFRSLLRKCASLRAEKTELDDE-VIMLNILAAISGRYFGQSE
         PL  +  +  R L R+C+ +R      DDE V  LN+L  +  RYF Q +
Subjt:  TPLDADTCAAFRSLLRKCASLRAEKTELDDE-VIMLNILAAISGRYFGQSE

Arabidopsis top hitse value%identityAlignment
AT1G54380.1 spliceosome protein-related1.5e-6132.6Show/hide
Query:  LDRNQDFAKKELVQEVQLSA------AIEADGKEAFARSEELLRKEAASASFLEMKKKLLLE-ELEAMLVLEEKGKNPPNSEGIVDNCSMIPIVKGKIAD
        L++  DF KK+L  E+   A      AI  +       + +L  +E   +SF +  ++  ++  LE  L  EEK +    ++ +++      + + ++  
Subjt:  LDRNQDFAKKELVQEVQLSA------AIEADGKEAFARSEELLRKEAASASFLEMKKKLLLE-ELEAMLVLEEKGKNPPNSEGIVDNCSMIPIVKGKIAD

Query:  QQNVSENMNVLRRSNLSLRNSL-KIEVIDETALVEPVHVSEIGNGEEMGIVCPSRSMQIKLNKYVEPEKGGKKAKRSRRRAREAKIPEIHGKLWNVNELD
            S +++ L +   ++ N + KIE++D TALV+ VH                            P++ G  A+  +   R+ K      K+ +   + 
Subjt:  QQNVSENMNVLRRSNLSLRNSL-KIEVIDETALVEPVHVSEIGNGEEMGIVCPSRSMQIKLNKYVEPEKGGKKAKRSRRRAREAKIPEIHGKLWNVNELD

Query:  KFNARQKNMEGNKIVYSRKDMEALRFVNVAEQRRLWKAICKELMPVVAREYSSLTSSNSQMKIGSTSDSKQHLEKRDEASSIIREGCSESLDGEIEDMEG
        + +  + N +  + +Y+RK +E++RF ++  Q+ LW  +   ++P V  EY SL    +       S     +  R E+      G  E+L         
Subjt:  KFNARQKNMEGNKIVYSRKDMEALRFVNVAEQRRLWKAICKELMPVVAREYSSLTSSNSQMKIGSTSDSKQHLEKRDEASSIIREGCSESLDGEIEDMEG

Query:  DSEIKNFVSSDPSCSLSVSEDGNTILDECSGDSDDDKYYHSIQRPAFLVEGEPDFDSGPPEDGLEYLRRVRWEASRIPNVTVAKVDTSN-LKKEQSVYMP
                          +E+G    ++ + D+DD   Y+SI RPAF V+GEPDF +GPPEDGLEYLRRVRWEA  IPNV VAK+D S  +KKEQSVYMP
Subjt:  DSEIKNFVSSDPSCSLSVSEDGNTILDECSGDSDDDKYYHSIQRPAFLVEGEPDFDSGPPEDGLEYLRRVRWEASRIPNVTVAKVDTSN-LKKEQSVYMP

Query:  VIPAIADCPQHLLPSKQWEDAFLADFSQLRQALSQSEGFIQSDFILHEKIDLISPNLIDQPSVLPANNIDSQQPEKPIANTIAKESNCMDHPSVSAISKM
        +IP I  CP++LLP K+WED+ L DF  LRQ L+QS    + + I  + ++ +   + ++       ++ +++ E                  V+ I  M
Subjt:  VIPAIADCPQHLLPSKQWEDAFLADFSQLRQALSQSEGFIQSDFILHEKIDLISPNLIDQPSVLPANNIDSQQPEKPIANTIAKESNCMDHPSVSAISKM

Query:  SSVFRVSSLRKRINLVETHTTLSRADCAWLFALSAAIDTPLDADTCAAFRSLLRKCASLRAEKT-ELDDE--VIMLNILAAISGRYFGQ
         SV RVS L+KRI LVE  + L  +DC W+ AL A+++TPLDADTCA  R LLRKCAS+RAE + E+ DE  + M N+L  I+GRYFGQ
Subjt:  SSVFRVSSLRKRINLVETHTTLSRADCAWLFALSAAIDTPLDADTCAAFRSLLRKCASLRAEKT-ELDDE--VIMLNILAAISGRYFGQ

AT2G42510.1 FUNCTIONS IN: molecular_function unknown2.2e-2366.67Show/hide
Query:  SVSEDGNTILDECSGDSDDDKYYHSIQRPAFLVEGEPDFDSGPPEDGLEYLRRVRWEASRIPNVTVAKVDTSNLK-KEQSVYMPVIP
        S+ ED   + +E + D+DD   Y+SI RPAF V+GEPDFDSGPPEDG+EYLRRVRWEA RIPNV VAKV  S  + KEQSVYMP IP
Subjt:  SVSEDGNTILDECSGDSDDDKYYHSIQRPAFLVEGEPDFDSGPPEDGLEYLRRVRWEASRIPNVTVAKVDTSNLK-KEQSVYMPVIP

AT2G42510.2 FUNCTIONS IN: molecular_function unknown7.7e-2926.16Show/hide
Query:  SASEISTDENKFPSIASNQKQD-FEVMINGKNSASGFSANVFHCAAEENPEI----------TETSVEKM---------VVCDSASENGGSI--TSLVDE
        S  E + D  K  S+   +  D  E MI+ K       ++ F    E++ EI          +ET++E+          VV  S  ENG +I   + V E
Subjt:  SASEISTDENKFPSIASNQKQD-FEVMINGKNSASGFSANVFHCAAEENPEI----------TETSVEKM---------VVCDSASENGGSI--TSLVDE

Query:  VRNLDSGLEIEKESSKVDDVRHFETFDAVEDGNQEVAVDEVEEKDFARSMPSLDRNQDFAKKELVQEVQLSAAIE-ADGKEAFARSEELLRKEAAS----
             S +   +     DDV   +  D VE     V++ +++E     S+ S  + QD   + +V+  ++    E  DG      ++EL + +  S    
Subjt:  VRNLDSGLEIEKESSKVDDVRHFETFDAVEDGNQEVAVDEVEEKDFARSMPSLDRNQDFAKKELVQEVQLSAAIE-ADGKEAFARSEELLRKEAAS----

Query:  -----------------ASFLEMKKKLLL----EELEAMLVLEEKGKNPPNSEGIVDNCSMIPIVKGKIADQQNVSENMNVLRRSNLSLRNSLKIEVIDE
                          S  +MK+ L      ++ +  ++ E + KN  ++     +   I        DQ+   E + V  + N  +  S+ I+++D+
Subjt:  -----------------ASFLEMKKKLLL----EELEAMLVLEEKGKNPPNSEGIVDNCSMIPIVKGKIADQQNVSENMNVLRRSNLSLRNSLKIEVIDE

Query:  TALVEPVHVSEIGNGEEMGIVCPSRSMQI-KLNKYVEPEKGGKKAKRSRRRAREAKIPEIHGKLWNVNELDKFNARQKNMEGNKIVYSRKDMEALRFVNV
        TAL + V   + G       V         + +K V  EK   K   S    R A          +    D  N+ + N +  +I+YSR  ME++R+ ++
Subjt:  TALVEPVHVSEIGNGEEMGIVCPSRSMQI-KLNKYVEPEKGGKKAKRSRRRAREAKIPEIHGKLWNVNELDKFNARQKNMEGNKIVYSRKDMEALRFVNV

Query:  AEQRRLWKAICKELMPVVAREYSSLTSSNSQMKIGSTSDSKQHLEKRDEASSIIREGCSESLDGEIEDMEGDSEIKNFVSSDPSCSLSVSEDGNTILDEC
        A Q++LW  +   L+P +  EY                                 EG   ++    +D                           + +E 
Subjt:  AEQRRLWKAICKELMPVVAREYSSLTSSNSQMKIGSTSDSKQHLEKRDEASSIIREGCSESLDGEIEDMEGDSEIKNFVSSDPSCSLSVSEDGNTILDEC

Query:  SGDSDDDKYYHSIQRPAFLVEGEPDFDSGPPEDGLEYLRRVRWEASRIPNVTVAKVDTSNLK-KEQSVYMPVIPAIADCPQHLLPSKQWEDAFLADFSQL
        + D+DD   Y+SI RPAF V+GEPDFDSGPPEDG+EYLRRVRWEA RIPNV VAKV  S  + KEQSVYMP IP          P+       L  F   
Subjt:  SGDSDDDKYYHSIQRPAFLVEGEPDFDSGPPEDGLEYLRRVRWEASRIPNVTVAKVDTSNLK-KEQSVYMPVIPAIADCPQHLLPSKQWEDAFLADFSQL

Query:  RQALSQSEG--FIQSDFILHEKIDLISPNLIDQPSVLPANNIDSQQPEKPIANTIAKESNCMDHPSVSAISKMSSVFRVSSLRKRINLVETHTTLSRADC
         + L+ S G  F Q  ++                SV    + D++              N +    + A              K     E  + L  +DC
Subjt:  RQALSQSEG--FIQSDFILHEKIDLISPNLIDQPSVLPANNIDSQQPEKPIANTIAKESNCMDHPSVSAISKMSSVFRVSSLRKRINLVETHTTLSRADC

Query:  AWLFALSAAIDTPLDADTCAAFRSLLRKCASLRA
         W+ AL A++DTP DADT A  R+L+RKCASLRA
Subjt:  AWLFALSAAIDTPLDADTCAAFRSLLRKCASLRA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGATGAGATAAGTTCTGCTGGCACTGGCGATGGGTTTAATCGAAGATTTTCGGCAGCTGAGAACCCCCAGTTTCCTCTGCAACCCATTGATTCTGCCTCTGAGAT
CTCTACCGACGAGAATAAATTCCCTTCGATCGCCTCGAATCAGAAGCAGGACTTTGAAGTAATGATAAACGGTAAAAATTCCGCTTCTGGGTTCTCCGCCAATGTTTTCC
ATTGCGCTGCAGAAGAGAACCCAGAAATTACAGAAACTTCTGTCGAAAAGATGGTCGTGTGCGATTCGGCTTCTGAAAACGGAGGAAGCATTACAAGCCTGGTGGACGAG
GTTAGGAATCTTGATTCGGGGCTTGAAATTGAAAAAGAATCTTCCAAGGTGGACGATGTCCGCCACTTTGAAACGTTCGATGCCGTGGAAGATGGTAATCAAGAAGTCGC
GGTGGATGAAGTTGAAGAGAAAGATTTTGCAAGAAGCATGCCGAGTCTTGATAGAAATCAAGATTTTGCAAAGAAAGAACTTGTTCAAGAAGTTCAGTTGTCTGCTGCTA
TTGAAGCCGACGGCAAAGAAGCCTTTGCGCGATCGGAAGAGCTGTTGAGGAAAGAAGCTGCTTCTGCGAGCTTTCTGGAAATGAAAAAGAAATTACTATTGGAGGAACTC
GAAGCCATGTTGGTTCTTGAAGAAAAGGGAAAGAACCCCCCTAACTCAGAAGGGATCGTGGATAATTGCAGCATGATTCCTATTGTTAAGGGGAAGATTGCTGATCAGCA
AAACGTTTCCGAGAACATGAATGTTCTCAGACGAAGTAATTTGTCACTCAGAAATTCATTGAAGATTGAAGTAATAGACGAGACTGCATTAGTTGAACCGGTTCATGTTT
CTGAAATTGGAAATGGAGAAGAGATGGGTATTGTTTGTCCATCTAGGTCAATGCAGATCAAATTGAACAAGTATGTTGAACCTGAAAAAGGAGGTAAAAAGGCTAAACGA
TCAAGGAGGAGGGCAAGGGAAGCGAAGATTCCTGAGATACATGGGAAGCTGTGGAATGTCAACGAGCTTGATAAATTCAATGCTCGTCAAAAAAATATGGAAGGAAACAA
GATAGTGTATTCAAGGAAGGATATGGAAGCACTGAGGTTTGTGAATGTTGCAGAACAGAGGAGATTGTGGAAAGCTATATGCAAGGAACTTATGCCCGTTGTGGCAAGGG
AATACAGTAGCTTAACAAGCTCGAATTCCCAGATGAAGATAGGCTCCACCTCTGATTCTAAACAACATTTGGAGAAGAGAGACGAAGCCTCTTCAATTATAAGGGAGGGA
TGTTCAGAAAGCTTGGACGGTGAGATAGAGGACATGGAAGGTGATAGTGAAATTAAAAATTTTGTATCCTCGGATCCCTCTTGCAGTCTTAGTGTCAGTGAGGATGGCAA
TACAATTTTGGATGAGTGCAGTGGAGATAGTGATGATGATAAATATTACCACAGTATTCAGCGACCTGCCTTTCTGGTGGAGGGAGAACCTGATTTTGATTCAGGACCTC
CAGAAGATGGATTAGAGTATCTGAGACGTGTTAGGTGGGAAGCTTCCCGTATTCCAAACGTGACGGTGGCAAAGGTTGATACAAGTAATTTGAAGAAAGAGCAAAGTGTC
TATATGCCAGTGATTCCTGCAATTGCCGACTGCCCTCAGCATTTACTGCCTTCAAAACAGTGGGAGGATGCATTTCTTGCTGATTTTTCTCAGCTGCGACAGGCTCTGTC
TCAATCTGAGGGATTTATACAGTCTGATTTCATCCTCCATGAAAAGATCGATTTGATAAGCCCGAACTTGATTGATCAGCCGAGTGTCTTGCCTGCCAACAACATCGACT
CCCAGCAACCTGAGAAACCAATTGCTAACACCATAGCAAAGGAAAGCAATTGCATGGACCATCCATCTGTATCGGCAATCTCGAAGATGAGCTCTGTGTTTCGTGTTTCG
TCGTTGAGGAAACGTATAAACTTGGTAGAAACACACACAACACTCTCGAGGGCTGATTGTGCTTGGCTGTTTGCTTTAAGTGCAGCAATTGATACTCCTCTGGATGCAGA
TACTTGTGCTGCTTTCAGGAGCCTGCTTCGGAAATGTGCTAGCTTGCGGGCCGAGAAGACCGAGCTCGATGACGAGGTGATAATGCTCAACATTCTTGCCGCCATTTCTG
GAAGGTACTTTGGACAGTCGGAAAATTGA
mRNA sequenceShow/hide mRNA sequence
CCCCATATTTTTGGTTCACTCACTCGGCGGTGACTCTCTAAAACCCTCCTTGGTTTCTTCGAGTAATGGCCGATGAGATAAGTTCTGCTGGCACTGGCGATGGGTTTAAT
CGAAGATTTTCGGCAGCTGAGAACCCCCAGTTTCCTCTGCAACCCATTGATTCTGCCTCTGAGATCTCTACCGACGAGAATAAATTCCCTTCGATCGCCTCGAATCAGAA
GCAGGACTTTGAAGTAATGATAAACGGTAAAAATTCCGCTTCTGGGTTCTCCGCCAATGTTTTCCATTGCGCTGCAGAAGAGAACCCAGAAATTACAGAAACTTCTGTCG
AAAAGATGGTCGTGTGCGATTCGGCTTCTGAAAACGGAGGAAGCATTACAAGCCTGGTGGACGAGGTTAGGAATCTTGATTCGGGGCTTGAAATTGAAAAAGAATCTTCC
AAGGTGGACGATGTCCGCCACTTTGAAACGTTCGATGCCGTGGAAGATGGTAATCAAGAAGTCGCGGTGGATGAAGTTGAAGAGAAAGATTTTGCAAGAAGCATGCCGAG
TCTTGATAGAAATCAAGATTTTGCAAAGAAAGAACTTGTTCAAGAAGTTCAGTTGTCTGCTGCTATTGAAGCCGACGGCAAAGAAGCCTTTGCGCGATCGGAAGAGCTGT
TGAGGAAAGAAGCTGCTTCTGCGAGCTTTCTGGAAATGAAAAAGAAATTACTATTGGAGGAACTCGAAGCCATGTTGGTTCTTGAAGAAAAGGGAAAGAACCCCCCTAAC
TCAGAAGGGATCGTGGATAATTGCAGCATGATTCCTATTGTTAAGGGGAAGATTGCTGATCAGCAAAACGTTTCCGAGAACATGAATGTTCTCAGACGAAGTAATTTGTC
ACTCAGAAATTCATTGAAGATTGAAGTAATAGACGAGACTGCATTAGTTGAACCGGTTCATGTTTCTGAAATTGGAAATGGAGAAGAGATGGGTATTGTTTGTCCATCTA
GGTCAATGCAGATCAAATTGAACAAGTATGTTGAACCTGAAAAAGGAGGTAAAAAGGCTAAACGATCAAGGAGGAGGGCAAGGGAAGCGAAGATTCCTGAGATACATGGG
AAGCTGTGGAATGTCAACGAGCTTGATAAATTCAATGCTCGTCAAAAAAATATGGAAGGAAACAAGATAGTGTATTCAAGGAAGGATATGGAAGCACTGAGGTTTGTGAA
TGTTGCAGAACAGAGGAGATTGTGGAAAGCTATATGCAAGGAACTTATGCCCGTTGTGGCAAGGGAATACAGTAGCTTAACAAGCTCGAATTCCCAGATGAAGATAGGCT
CCACCTCTGATTCTAAACAACATTTGGAGAAGAGAGACGAAGCCTCTTCAATTATAAGGGAGGGATGTTCAGAAAGCTTGGACGGTGAGATAGAGGACATGGAAGGTGAT
AGTGAAATTAAAAATTTTGTATCCTCGGATCCCTCTTGCAGTCTTAGTGTCAGTGAGGATGGCAATACAATTTTGGATGAGTGCAGTGGAGATAGTGATGATGATAAATA
TTACCACAGTATTCAGCGACCTGCCTTTCTGGTGGAGGGAGAACCTGATTTTGATTCAGGACCTCCAGAAGATGGATTAGAGTATCTGAGACGTGTTAGGTGGGAAGCTT
CCCGTATTCCAAACGTGACGGTGGCAAAGGTTGATACAAGTAATTTGAAGAAAGAGCAAAGTGTCTATATGCCAGTGATTCCTGCAATTGCCGACTGCCCTCAGCATTTA
CTGCCTTCAAAACAGTGGGAGGATGCATTTCTTGCTGATTTTTCTCAGCTGCGACAGGCTCTGTCTCAATCTGAGGGATTTATACAGTCTGATTTCATCCTCCATGAAAA
GATCGATTTGATAAGCCCGAACTTGATTGATCAGCCGAGTGTCTTGCCTGCCAACAACATCGACTCCCAGCAACCTGAGAAACCAATTGCTAACACCATAGCAAAGGAAA
GCAATTGCATGGACCATCCATCTGTATCGGCAATCTCGAAGATGAGCTCTGTGTTTCGTGTTTCGTCGTTGAGGAAACGTATAAACTTGGTAGAAACACACACAACACTC
TCGAGGGCTGATTGTGCTTGGCTGTTTGCTTTAAGTGCAGCAATTGATACTCCTCTGGATGCAGATACTTGTGCTGCTTTCAGGAGCCTGCTTCGGAAATGTGCTAGCTT
GCGGGCCGAGAAGACCGAGCTCGATGACGAGGTGATAATGCTCAACATTCTTGCCGCCATTTCTGGAAGGTACTTTGGACAGTCGGAAAATTGAAGCTATGGTGGGATGA
CTAATGCCTCATCTTTTTCTCCAGAGTTAAAATAGTCTTTGCTAGGAGGATGGCGATCTTCACTTCTGAATATGAAACCTTGTTTGGAGACTGGTTGACAACATAACTTG
TACATTATTTACTTTTTTTTTGTGAAAAATCAAAGTGTGTTAATGATTTTTTTGGATTACGTAATCAATCTATATTTGATGATTAAGTGGAAGGCTTAGCTTGTTAAATA
AGGGATTTTTTG
Protein sequenceShow/hide protein sequence
MADEISSAGTGDGFNRRFSAAENPQFPLQPIDSASEISTDENKFPSIASNQKQDFEVMINGKNSASGFSANVFHCAAEENPEITETSVEKMVVCDSASENGGSITSLVDE
VRNLDSGLEIEKESSKVDDVRHFETFDAVEDGNQEVAVDEVEEKDFARSMPSLDRNQDFAKKELVQEVQLSAAIEADGKEAFARSEELLRKEAASASFLEMKKKLLLEEL
EAMLVLEEKGKNPPNSEGIVDNCSMIPIVKGKIADQQNVSENMNVLRRSNLSLRNSLKIEVIDETALVEPVHVSEIGNGEEMGIVCPSRSMQIKLNKYVEPEKGGKKAKR
SRRRAREAKIPEIHGKLWNVNELDKFNARQKNMEGNKIVYSRKDMEALRFVNVAEQRRLWKAICKELMPVVAREYSSLTSSNSQMKIGSTSDSKQHLEKRDEASSIIREG
CSESLDGEIEDMEGDSEIKNFVSSDPSCSLSVSEDGNTILDECSGDSDDDKYYHSIQRPAFLVEGEPDFDSGPPEDGLEYLRRVRWEASRIPNVTVAKVDTSNLKKEQSV
YMPVIPAIADCPQHLLPSKQWEDAFLADFSQLRQALSQSEGFIQSDFILHEKIDLISPNLIDQPSVLPANNIDSQQPEKPIANTIAKESNCMDHPSVSAISKMSSVFRVS
SLRKRINLVETHTTLSRADCAWLFALSAAIDTPLDADTCAAFRSLLRKCASLRAEKTELDDEVIMLNILAAISGRYFGQSEN