CuGenDBv2

Sgr022429 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr022429
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: SWIM-type domain-containing protein
Genome location: tig00154217:254362..267138
RNA-Seq Expression: Sgr022429
Synteny: Sgr022429
Gene Ontology terms:
  GO:0005515 - protein binding (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
  IPR000270 - PB1 domain
  IPR004332 - Transposase, MuDR, plant
  IPR006564 - Zinc finger, PMZ-type
  IPR007527 - Zinc finger, SWIM-type
  IPR018289 - MULE transposase domain
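
The genome location in this record is written as contig:start..end. As a minimal sketch of how such a string could be split into its parts, the snippet below parses it and reports the span length. The names `Location` and `parse_location` are hypothetical helpers introduced here, and the length calculation assumes 1-based, inclusive coordinates, which this record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval (hypothetical helper type, not part of CuGenDB)."""
    contig: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive at both ends.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'contig:start..end' string into a Location."""
    match = re.fullmatch(r"\s*([^:\s]+):(\d+)\.\.(\d+)\s*", text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    contig = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    # Normalise so that start <= end, in case a record lists them reversed.
    if start > end:
        start, end = end, start
    return Location(contig, start, end)


loc = parse_location("tig00154217:254362..267138")
print(loc.contig, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the gene model spans 12,777 bp on contig tig00154217.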


Homology
GenBank top hits | e-value | % identity
XP_008458637.1 PREDICTED: uncharacterized protein LOC103497981 isoform X1 [Cucumis melo] | 0.0e+00 | 92.74
Query:  GGEFETGRDGMLLYHGGDAHAIDVDDKMKFNEFKLEVAEMFNCNVDTMSIKYFLPGNRKTLITVSNDKDLKRMIKFHGDSVTVDIYVIMEDVVAPDVSNF
        GGEFETGRDGML YHGGDAHAIDVDDKMKFNEFK+E+AEMFN +VDTMSIKYFLPGNRKTLIT+SNDKDLKRM+KFHGDS TVDI+VIME+V+AP++SN 
Subjt:  GGEFETGRDGMLLYHGGDAHAIDVDDKMKFNEFKLEVAEMFNCNVDTMSIKYFLPGNRKTLITVSNDKDLKRMIKFHGDSVTVDIYVIMEDVVAPDVSNF

Query:  PASRSSRTTLSETVVPVVGTPLTIIHGIGDDNTQPDIPLDGALDVVDDTNPIVTHIDIAGDITPILPLLGPNDEKHGKGAQQWQNTITGVGQRFSSVHEF
        PASRSSRTTLSETVVPV GTPLT++HGI DDN + DIPLDGALDVVDDTNP+V HIDIAGDITPILPLLG +DEK+GKG QQWQNTITGVGQRFSSVHEF
Subjt:  PASRSSRTTLSETVVPVVGTPLTIIHGIGDDNTQPDIPLDGALDVVDDTNPIVTHIDIAGDITPILPLLGPNDEKHGKGAQQWQNTITGVGQRFSSVHEF

Query:  RESLRKYAIAHQFAFRYKKNDSHRVTVKCKAEGCPWRIHASRLSTTQLICIKKMNPTHTCEGAVTTTGHQATRSWVASIIKEKLKVFPNYKPKDIVNDIK
        RESLRKYAIAHQFAFRYKKNDSHRVTVKCKAEGCPWRIHASRLSTTQLICIKKMNPTHTCEGAVTTTGHQATRSWVASI+KEKLKVFPNYKPKDIV+DIK
Subjt:  RESLRKYAIAHQFAFRYKKNDSHRVTVKCKAEGCPWRIHASRLSTTQLICIKKMNPTHTCEGAVTTTGHQATRSWVASIIKEKLKVFPNYKPKDIVNDIK

Query:  QEYGIQLNYFQAWRGKEIAKEQLQGSYKEAYNQLPFLCEKIMETNPGSLATCDTKEDSSFHRLFVSFHASLSGFQQGCRPLIFLDSIPLKSKYQGTLLAA
        QEYGIQLNYFQAWRGKEIAKEQLQGSYKEAYNQLPFLC KIMETNPGSLATCDTKEDS+FHRLFVSFHASLSGFQQGCRPLIFLDSIPLKSKYQGTLLAA
Subjt:  QEYGIQLNYFQAWRGKEIAKEQLQGSYKEAYNQLPFLCEKIMETNPGSLATCDTKEDSSFHRLFVSFHASLSGFQQGCRPLIFLDSIPLKSKYQGTLLAA

Query:  TAADGDDGFFPVAFSVVDAESDDNWGWFLLQLKSALSTTCPITFVADRQKGLTVSIAGIFKDSFHGYCLRYLTEQLIRDLKGQFSHEVKRLIVEDFYAAA
        TAADGDDGFFPVAFSVVD ESDDNW WFLLQLKSALST+CPITFVADRQKGLTVSIA IFK SFHGYCLRYLTEQLIRDLKGQFSHEVKRLIVEDFYAAA
Subjt:  TAADGDDGFFPVAFSVVDAESDDNWGWFLLQLKSALSTTCPITFVADRQKGLTVSIAGIFKDSFHGYCLRYLTEQLIRDLKGQFSHEVKRLIVEDFYAAA

Query:  YAPKPENFQRCVESIKSISLEAYNWILQSEPQNWANAFFEGARYNHMTSNFGELFYSWVSDAHELPITQMVDAIRVKIMELIYTRRADSDQWLTRLTPSM
        YAPKPENFQRCVESIKSISLEAYNWILQSEPQNWANAFFEGARYNHMTSNFGE+FYSWVS+AHELPITQMVD IRVKIMELIYTRRADSDQWLTRLTPSM
Subjt:  YAPKPENFQRCVESIKSISLEAYNWILQSEPQNWANAFFEGARYNHMTSNFGELFYSWVSDAHELPITQMVDAIRVKIMELIYTRRADSDQWLTRLTPSM

Query:  EEKLEKEGHKVHSLQVLLSAGSTFEVRGDSIEVVDVDHWDCSCKGWQLTGLPCSHAIAVLGCLGRIPYDFCSRYLSTESYRLTYSESVHPVPHVDAPIQK
        EEKLEKEGHK H+L VL+SAGSTFEVRGDSIEVVDVDHWDC+CKGWQLTGLPCSHAIAVLGCLGR P+DFCSRY +TESYRLTYS+SVHPVP VD PI K
Subjt:  EEKLEKEGHKVHSLQVLLSAGSTFEVRGDSIEVVDVDHWDCSCKGWQLTGLPCSHAIAVLGCLGRIPYDFCSRYLSTESYRLTYSESVHPVPHVDAPIQK

Query:  GSLQASVTVTPPPTRRPPGRPTSKRYGSPEVMKRQLQCSRCKGLGHNKSTCKEFLQSV
         SLQASVTVTPPPTRRPPGRPTSKRYGSPEVMKRQLQCSRCKGLGHNKSTCK+ LQSV
Subjt:  GSLQASVTVTPPPTRRPPGRPTSKRYGSPEVMKRQLQCSRCKGLGHNKSTCKEFLQSV

XP_022159433.1 uncharacterized protein LOC111025860 isoform X1 [Momordica charantia] | 0.0e+00 | 94.06
Query:  GGEFETGRDGMLLYHGGDAHAIDVDDKMKFNEFKLEVAEMFNCNVDTMSIKYFLPGNRKTLITVSNDKDLKRMIKFHGDSVTVDIYVIMEDVVAPDVSNF
        GGEFETGRDGML YHGGDAHAIDVDDKMKFNEFK+EVAEMFNCNVDTMSIKYFLPGNRKTLIT+SNDKDLKRMIKFHGDSVTVDIY+ ME+VVA D+SN 
Subjt:  GGEFETGRDGMLLYHGGDAHAIDVDDKMKFNEFKLEVAEMFNCNVDTMSIKYFLPGNRKTLITVSNDKDLKRMIKFHGDSVTVDIYVIMEDVVAPDVSNF

Query:  PASRSSRTTLSETVVPVVGTPLTIIHGIGDDNTQPDIPLDGALDVVDDTNPIVTHIDIAGDITPILPLLGPNDEKHGKGAQQWQNTITGVGQRFSSVHEF
        PASRSSRTTLSETVVPV GTPLTIIHGIGDDNT  DIPLDGALDVVDDTN +V HIDI GDITPILPLLG NDEKHGKGAQQWQNTITGVGQRFSSVHEF
Subjt:  PASRSSRTTLSETVVPVVGTPLTIIHGIGDDNTQPDIPLDGALDVVDDTNPIVTHIDIAGDITPILPLLGPNDEKHGKGAQQWQNTITGVGQRFSSVHEF

Query:  RESLRKYAIAHQFAFRYKKNDSHRVTVKCKAEGCPWRIHASRLSTTQLICIKKMNPTHTCEGAVTTTGHQATRSWVASIIKEKLKVFPNYKPKDIVNDIK
        RESLRKYAIAHQFAF+YKKNDSHRVTVKCKAEGCPWRIHASRLSTTQLICIKKMN THTCEGAV TTG+QATRSWVASIIKEKLKV+PNYKPKDIVNDIK
Subjt:  RESLRKYAIAHQFAFRYKKNDSHRVTVKCKAEGCPWRIHASRLSTTQLICIKKMNPTHTCEGAVTTTGHQATRSWVASIIKEKLKVFPNYKPKDIVNDIK

Query:  QEYGIQLNYFQAWRGKEIAKEQLQGSYKEAYNQLPFLCEKIMETNPGSLATCDTKEDSSFHRLFVSFHASLSGFQQGCRPLIFLDSIPLKSKYQGTLLAA
        QEYGIQLNYFQAWRGKEIAKEQLQGSYKEAYNQLP LCEKIMETNPGSLATCDTKEDSSFHRLFVSFHASL GFQQGCRPLIFLDSIPLKSKYQGTLLAA
Subjt:  QEYGIQLNYFQAWRGKEIAKEQLQGSYKEAYNQLPFLCEKIMETNPGSLATCDTKEDSSFHRLFVSFHASLSGFQQGCRPLIFLDSIPLKSKYQGTLLAA

Query:  TAADGDDGFFPVAFSVVDAESDDNWGWFLLQLKSALSTTCPITFVADRQKGLTVSIAGIFKDSFHGYCLRYLTEQLIRDLKGQFSHEVKRLIVEDFYAAA
        TAADGDDGFFP+AFSVVD ES+DNW WFLLQLKSALST+CPITFVADRQKGLTVSIAGIFK SFHG CLRYLTEQLIRDLKGQFSHEVKRLIVEDFY AA
Subjt:  TAADGDDGFFPVAFSVVDAESDDNWGWFLLQLKSALSTTCPITFVADRQKGLTVSIAGIFKDSFHGYCLRYLTEQLIRDLKGQFSHEVKRLIVEDFYAAA

Query:  YAPKPENFQRCVESIKSISLEAYNWILQSEPQNWANAFFEGARYNHMTSNFGELFYSWVSDAHELPITQMVDAIRVKIMELIYTRRADSDQWLTRLTPSM
        YAPKPENFQRC+ESIKSISLEAYNWILQSEPQNWANAFFEGARYNHMTSNFGELFYSWVSDAHELPITQMVDAIRVKIMELIY RRA+SDQWLTRLTPSM
Subjt:  YAPKPENFQRCVESIKSISLEAYNWILQSEPQNWANAFFEGARYNHMTSNFGELFYSWVSDAHELPITQMVDAIRVKIMELIYTRRADSDQWLTRLTPSM

Query:  EEKLEKEGHKVHSLQVLLSAGSTFEVRGDSIEVVDVDHWDCSCKGWQLTGLPCSHAIAVLGCLGRIPYDFCSRYLSTESYRLTYSESVHPVPHVDAPIQK
        EEKLEKEGHKVHSLQVLLSAGSTFEVRGDSIEVVDVDHWDC+CKGWQL GLPCSHAIAVLGCLGR PY+FCSRY +TESYRLTYSESVHPVPHVD P+ K
Subjt:  EEKLEKEGHKVHSLQVLLSAGSTFEVRGDSIEVVDVDHWDCSCKGWQLTGLPCSHAIAVLGCLGRIPYDFCSRYLSTESYRLTYSESVHPVPHVDAPIQK

Query:  GSLQASVTVTPPPTRRPPGRPTSKRYGSPEVMKRQLQCSRCKGLGHNKSTCKEFLQSV
        GSLQASVTVTPPPTRRPPGRPTSKRYGSPEVMKRQLQCSRCKGLGHNKSTCKEF QSV
Subjt:  GSLQASVTVTPPPTRRPPGRPTSKRYGSPEVMKRQLQCSRCKGLGHNKSTCKEFLQSV

XP_022159434.1 uncharacterized protein LOC111025860 isoform X2 [Momordica charantia] | 0.0e+00 | 94.06
Query:  GGEFETGRDGMLLYHGGDAHAIDVDDKMKFNEFKLEVAEMFNCNVDTMSIKYFLPGNRKTLITVSNDKDLKRMIKFHGDSVTVDIYVIMEDVVAPDVSNF
        GGEFETGRDGML YHGGDAHAIDVDDKMKFNEFK+EVAEMFNCNVDTMSIKYFLPGNRKTLIT+SNDKDLKRMIKFHGDSVTVDIY+ ME+VVA D+SN 
Subjt:  GGEFETGRDGMLLYHGGDAHAIDVDDKMKFNEFKLEVAEMFNCNVDTMSIKYFLPGNRKTLITVSNDKDLKRMIKFHGDSVTVDIYVIMEDVVAPDVSNF

Query:  PASRSSRTTLSETVVPVVGTPLTIIHGIGDDNTQPDIPLDGALDVVDDTNPIVTHIDIAGDITPILPLLGPNDEKHGKGAQQWQNTITGVGQRFSSVHEF
        PASRSSRTTLSETVVPV GTPLTIIHGIGDDNT  DIPLDGALDVVDDTN +V HIDI GDITPILPLLG NDEKHGKGAQQWQNTITGVGQRFSSVHEF
Subjt:  PASRSSRTTLSETVVPVVGTPLTIIHGIGDDNTQPDIPLDGALDVVDDTNPIVTHIDIAGDITPILPLLGPNDEKHGKGAQQWQNTITGVGQRFSSVHEF

Query:  RESLRKYAIAHQFAFRYKKNDSHRVTVKCKAEGCPWRIHASRLSTTQLICIKKMNPTHTCEGAVTTTGHQATRSWVASIIKEKLKVFPNYKPKDIVNDIK
        RESLRKYAIAHQFAF+YKKNDSHRVTVKCKAEGCPWRIHASRLSTTQLICIKKMN THTCEGAV TTG+QATRSWVASIIKEKLKV+PNYKPKDIVNDIK
Subjt:  RESLRKYAIAHQFAFRYKKNDSHRVTVKCKAEGCPWRIHASRLSTTQLICIKKMNPTHTCEGAVTTTGHQATRSWVASIIKEKLKVFPNYKPKDIVNDIK

Query:  QEYGIQLNYFQAWRGKEIAKEQLQGSYKEAYNQLPFLCEKIMETNPGSLATCDTKEDSSFHRLFVSFHASLSGFQQGCRPLIFLDSIPLKSKYQGTLLAA
        QEYGIQLNYFQAWRGKEIAKEQLQGSYKEAYNQLP LCEKIMETNPGSLATCDTKEDSSFHRLFVSFHASL GFQQGCRPLIFLDSIPLKSKYQGTLLAA
Subjt:  QEYGIQLNYFQAWRGKEIAKEQLQGSYKEAYNQLPFLCEKIMETNPGSLATCDTKEDSSFHRLFVSFHASLSGFQQGCRPLIFLDSIPLKSKYQGTLLAA

Query:  TAADGDDGFFPVAFSVVDAESDDNWGWFLLQLKSALSTTCPITFVADRQKGLTVSIAGIFKDSFHGYCLRYLTEQLIRDLKGQFSHEVKRLIVEDFYAAA
        TAADGDDGFFP+AFSVVD ES+DNW WFLLQLKSALST+CPITFVADRQKGLTVSIAGIFK SFHG CLRYLTEQLIRDLKGQFSHEVKRLIVEDFY AA
Subjt:  TAADGDDGFFPVAFSVVDAESDDNWGWFLLQLKSALSTTCPITFVADRQKGLTVSIAGIFKDSFHGYCLRYLTEQLIRDLKGQFSHEVKRLIVEDFYAAA

Query:  YAPKPENFQRCVESIKSISLEAYNWILQSEPQNWANAFFEGARYNHMTSNFGELFYSWVSDAHELPITQMVDAIRVKIMELIYTRRADSDQWLTRLTPSM
        YAPKPENFQRC+ESIKSISLEAYNWILQSEPQNWANAFFEGARYNHMTSNFGELFYSWVSDAHELPITQMVDAIRVKIMELIY RRA+SDQWLTRLTPSM
Subjt:  YAPKPENFQRCVESIKSISLEAYNWILQSEPQNWANAFFEGARYNHMTSNFGELFYSWVSDAHELPITQMVDAIRVKIMELIYTRRADSDQWLTRLTPSM

Query:  EEKLEKEGHKVHSLQVLLSAGSTFEVRGDSIEVVDVDHWDCSCKGWQLTGLPCSHAIAVLGCLGRIPYDFCSRYLSTESYRLTYSESVHPVPHVDAPIQK
        EEKLEKEGHKVHSLQVLLSAGSTFEVRGDSIEVVDVDHWDC+CKGWQL GLPCSHAIAVLGCLGR PY+FCSRY +TESYRLTYSESVHPVPHVD P+ K
Subjt:  EEKLEKEGHKVHSLQVLLSAGSTFEVRGDSIEVVDVDHWDCSCKGWQLTGLPCSHAIAVLGCLGRIPYDFCSRYLSTESYRLTYSESVHPVPHVDAPIQK

Query:  GSLQASVTVTPPPTRRPPGRPTSKRYGSPEVMKRQLQCSRCKGLGHNKSTCKEFLQSV
        GSLQASVTVTPPPTRRPPGRPTSKRYGSPEVMKRQLQCSRCKGLGHNKSTCKEF QSV
Subjt:  GSLQASVTVTPPPTRRPPGRPTSKRYGSPEVMKRQLQCSRCKGLGHNKSTCKEFLQSV

XP_022159590.1 uncharacterized protein LOC111025961 [Momordica charantia] | 0.0e+00 | 94.72
Query:  GGEFETGRDGMLLYHGGDAHAIDVDDKMKFNEFKLEVAEMFNCNVDTMSIKYFLPGNRKTLITVSNDKDLKRMIKFHGDSVTVDIYVIMEDVVAPDVSNF
        GGEFET RDG L YHGGDAHAID+DDKMKFNEFK+EVAEMFNCN+DTMSIKYFLPGNRKTLITVSNDKDLKRMIKFHGDSV+VDIY+ ME+VVA  VSN 
Subjt:  GGEFETGRDGMLLYHGGDAHAIDVDDKMKFNEFKLEVAEMFNCNVDTMSIKYFLPGNRKTLITVSNDKDLKRMIKFHGDSVTVDIYVIMEDVVAPDVSNF

Query:  PASRSSRTTLSETVVPVVGTPLTIIHGIGDDNTQPDIPLDGALDVVDDTNPIVTHIDIAGDITPILPLLGPNDEKHGKGAQQWQNTITGVGQRFSSVHEF
        PASRSSRTTLSETVVPV GTPLTIIHGIGDDNT  DI LDG LDVVDDTNPIV HIDI GDITPILPLLG NDEKHGKGAQQWQNTITGVGQRFSSVHEF
Subjt:  PASRSSRTTLSETVVPVVGTPLTIIHGIGDDNTQPDIPLDGALDVVDDTNPIVTHIDIAGDITPILPLLGPNDEKHGKGAQQWQNTITGVGQRFSSVHEF

Query:  RESLRKYAIAHQFAFRYKKNDSHRVTVKCKAEGCPWRIHASRLSTTQLICIKKMNPTHTCEGAVTTTGHQATRSWVASIIKEKLKVFPNYKPKDIVNDIK
        RESLRKYAIAHQFAFRYKKNDSHRVTVKCKAEGCPWRIHASRLSTTQLICIKKMNPTH CEGAVTTTGHQATRSWVASIIKEKLKVFPNYKPKDIVNDIK
Subjt:  RESLRKYAIAHQFAFRYKKNDSHRVTVKCKAEGCPWRIHASRLSTTQLICIKKMNPTHTCEGAVTTTGHQATRSWVASIIKEKLKVFPNYKPKDIVNDIK

Query:  QEYGIQLNYFQAWRGKEIAKEQLQGSYKEAYNQLPFLCEKIMETNPGSLATCDTKEDSSFHRLFVSFHASLSGFQQGCRPLIFLDSIPLKSKYQGTLLAA
        QEYGIQLNYFQAWRGKEIAKEQLQGSYKEAY+QLP LCEKIMETNPGSLATCDTKEDSSFHRLFVSFHASL GFQQGCRPLIFLDSIPLKSKYQGTLLAA
Subjt:  QEYGIQLNYFQAWRGKEIAKEQLQGSYKEAYNQLPFLCEKIMETNPGSLATCDTKEDSSFHRLFVSFHASLSGFQQGCRPLIFLDSIPLKSKYQGTLLAA

Query:  TAADGDDGFFPVAFSVVDAESDDNWGWFLLQLKSALSTTCPITFVADRQKGLTVSIAGIFKDSFHGYCLRYLTEQLIRDLKGQFSHEVKRLIVEDFYAAA
        TAADGDDGFFPVAFSVVD ES+DNW WFLLQLKSALST+CPITFVADRQKGLTVSIAGIFK SFHGYCLRYLTEQLIRDLKGQFSHEVKRLIVEDFY AA
Subjt:  TAADGDDGFFPVAFSVVDAESDDNWGWFLLQLKSALSTTCPITFVADRQKGLTVSIAGIFKDSFHGYCLRYLTEQLIRDLKGQFSHEVKRLIVEDFYAAA

Query:  YAPKPENFQRCVESIKSISLEAYNWILQSEPQNWANAFFEGARYNHMTSNFGELFYSWVSDAHELPITQMVDAIRVKIMELIYTRRADSDQWLTRLTPSM
        YAPKPENFQRCVESIKSISLEAYNWILQSEPQNWANAFFEGARYNHMTSNFGELFYSWVSDAHELPITQMVDAIRVKIMELIY RRA+SDQWLTRLTPSM
Subjt:  YAPKPENFQRCVESIKSISLEAYNWILQSEPQNWANAFFEGARYNHMTSNFGELFYSWVSDAHELPITQMVDAIRVKIMELIYTRRADSDQWLTRLTPSM

Query:  EEKLEKEGHKVHSLQVLLSAGSTFEVRGDSIEVVDVDHWDCSCKGWQLTGLPCSHAIAVLGCLGRIPYDFCSRYLSTESYRLTYSESVHPVPHVDAPIQK
        EEKLEKEGHKVHSLQVLLSAGSTFEVRGDSIEVVDVDHWDC+CKGWQLTGLPCSHAIAVLGCLGR PY+FCSRY +TESYRLTYSESVHP+PHVD PIQK
Subjt:  EEKLEKEGHKVHSLQVLLSAGSTFEVRGDSIEVVDVDHWDCSCKGWQLTGLPCSHAIAVLGCLGRIPYDFCSRYLSTESYRLTYSESVHPVPHVDAPIQK

Query:  GSLQASVTVTPPPTRRPPGRPTSKRYGSPEVMKRQLQCSRCKGLGHNKSTCKEFLQSV
        GSLQASVTVTPPPTRRPPGRPTSKRYGSPEVMKRQLQCSRCKGLGHNKSTCKEFLQSV
Subjt:  GSLQASVTVTPPPTRRPPGRPTSKRYGSPEVMKRQLQCSRCKGLGHNKSTCKEFLQSV

XP_038901698.1 uncharacterized protein LOC120088456 isoform X1 [Benincasa hispida] | 0.0e+00 | 93.14
Query:  GGEFETGRDGMLLYHGGDAHAIDVDDKMKFNEFKLEVAEMFNCNVDTMSIKYFLPGNRKTLITVSNDKDLKRMIKFHGDSVTVDIYVIMEDVVAPDVSNF
        GGEFETGRDGML YHGGDAHAIDVDDKMKFNEFK+E+AEMFNC+VDT+SIKYFLPGNRKTLIT+SNDKDLKRM+KFHGDS TVDI+VIME+V+AP++SN 
Subjt:  GGEFETGRDGMLLYHGGDAHAIDVDDKMKFNEFKLEVAEMFNCNVDTMSIKYFLPGNRKTLITVSNDKDLKRMIKFHGDSVTVDIYVIMEDVVAPDVSNF

Query:  PASRSSRTTLSETVVPVVGTPLTIIHGIGDDNTQPDIPLDGALDVVDDTNPIVTHIDIAGDITPILPLLGPNDEKHGKGAQQWQNTITGVGQRFSSVHEF
        PASRSSRTTLSETVVPV GTPLT++HG+GDDN + DIPLDGALDVVDDTNP+VTHIDIAGDITPILPLLG +D+K+GKG QQWQNTITGVGQRFSSVHEF
Subjt:  PASRSSRTTLSETVVPVVGTPLTIIHGIGDDNTQPDIPLDGALDVVDDTNPIVTHIDIAGDITPILPLLGPNDEKHGKGAQQWQNTITGVGQRFSSVHEF

Query:  RESLRKYAIAHQFAFRYKKNDSHRVTVKCKAEGCPWRIHASRLSTTQLICIKKMNPTHTCEGAVTTTGHQATRSWVASIIKEKLKVFPNYKPKDIVNDIK
        RESLRKYAIAHQFAFRYKKNDSHRVTVKCKAEGCPWRIHASRLSTTQLICIKKMNPTHTCEGAVTTTGHQATRSWVASI+KEKLKVFPNYKPKDIV+DIK
Subjt:  RESLRKYAIAHQFAFRYKKNDSHRVTVKCKAEGCPWRIHASRLSTTQLICIKKMNPTHTCEGAVTTTGHQATRSWVASIIKEKLKVFPNYKPKDIVNDIK

Query:  QEYGIQLNYFQAWRGKEIAKEQLQGSYKEAYNQLPFLCEKIMETNPGSLATCDTKEDSSFHRLFVSFHASLSGFQQGCRPLIFLDSIPLKSKYQGTLLAA
        QEYGIQLNYFQAWRGKEIAKEQLQGSYKEAYNQLPFLCEKIMETNP SLATCDTKEDSSFHRLFVSF ASLSGFQQGCRPLIFLDSI LKSKYQGTLLAA
Subjt:  QEYGIQLNYFQAWRGKEIAKEQLQGSYKEAYNQLPFLCEKIMETNPGSLATCDTKEDSSFHRLFVSFHASLSGFQQGCRPLIFLDSIPLKSKYQGTLLAA

Query:  TAADGDDGFFPVAFSVVDAESDDNWGWFLLQLKSALSTTCPITFVADRQKGLTVSIAGIFKDSFHGYCLRYLTEQLIRDLKGQFSHEVKRLIVEDFYAAA
        TAADGDDGFFPVAFSVVD ESDDNWGWFLLQLKSALST+CPITFVADRQKGLTVSIA IFK SFHGYCLRYLTEQLIRDLKGQFSHEVKRLIVEDFYAAA
Subjt:  TAADGDDGFFPVAFSVVDAESDDNWGWFLLQLKSALSTTCPITFVADRQKGLTVSIAGIFKDSFHGYCLRYLTEQLIRDLKGQFSHEVKRLIVEDFYAAA

Query:  YAPKPENFQRCVESIKSISLEAYNWILQSEPQNWANAFFEGARYNHMTSNFGELFYSWVSDAHELPITQMVDAIRVKIMELIYTRRADSDQWLTRLTPSM
        YAPKPENFQRCVESIKSISLEAYNWILQSEPQNWANAFFEGARYNHMTSNFGE+FYSWVS+AHELPITQMVD IRVKIMELIYTRRA SDQWLTRLTPSM
Subjt:  YAPKPENFQRCVESIKSISLEAYNWILQSEPQNWANAFFEGARYNHMTSNFGELFYSWVSDAHELPITQMVDAIRVKIMELIYTRRADSDQWLTRLTPSM

Query:  EEKLEKEGHKVHSLQVLLSAGSTFEVRGDSIEVVDVDHWDCSCKGWQLTGLPCSHAIAVLGCLGRIPYDFCSRYLSTESYRLTYSESVHPVPHVDAPIQK
        EEKLEKEGHKVH+LQVL+SAGSTFEVRGDSIEVVD+DHWDC+CKGWQLTGLPCSHAIAVLGCLGR PYDFCSRY +TESYRLTYSESVHPVP VD PI K
Subjt:  EEKLEKEGHKVHSLQVLLSAGSTFEVRGDSIEVVDVDHWDCSCKGWQLTGLPCSHAIAVLGCLGRIPYDFCSRYLSTESYRLTYSESVHPVPHVDAPIQK

Query:  GSLQASVTVTPPPTRRPPGRPTSKRYGSPEVMKRQLQCSRCKGLGHNKSTCKEFLQSV
        GSLQASVTVTPPPTRRPPGRPTSKRYGSPEVMKRQLQCSRCKGLGHNKSTCK+ LQSV
Subjt:  GSLQASVTVTPPPTRRPPGRPTSKRYGSPEVMKRQLQCSRCKGLGHNKSTCKEFLQSV

TrEMBL top hits | e-value | % identity
A0A1S3C8B5 uncharacterized protein LOC103497981 isoform X1 | 0.0e+00 | 92.74
Query:  GGEFETGRDGMLLYHGGDAHAIDVDDKMKFNEFKLEVAEMFNCNVDTMSIKYFLPGNRKTLITVSNDKDLKRMIKFHGDSVTVDIYVIMEDVVAPDVSNF
        GGEFETGRDGML YHGGDAHAIDVDDKMKFNEFK+E+AEMFN +VDTMSIKYFLPGNRKTLIT+SNDKDLKRM+KFHGDS TVDI+VIME+V+AP++SN 
Subjt:  GGEFETGRDGMLLYHGGDAHAIDVDDKMKFNEFKLEVAEMFNCNVDTMSIKYFLPGNRKTLITVSNDKDLKRMIKFHGDSVTVDIYVIMEDVVAPDVSNF

Query:  PASRSSRTTLSETVVPVVGTPLTIIHGIGDDNTQPDIPLDGALDVVDDTNPIVTHIDIAGDITPILPLLGPNDEKHGKGAQQWQNTITGVGQRFSSVHEF
        PASRSSRTTLSETVVPV GTPLT++HGI DDN + DIPLDGALDVVDDTNP+V HIDIAGDITPILPLLG +DEK+GKG QQWQNTITGVGQRFSSVHEF
Subjt:  PASRSSRTTLSETVVPVVGTPLTIIHGIGDDNTQPDIPLDGALDVVDDTNPIVTHIDIAGDITPILPLLGPNDEKHGKGAQQWQNTITGVGQRFSSVHEF

Query:  RESLRKYAIAHQFAFRYKKNDSHRVTVKCKAEGCPWRIHASRLSTTQLICIKKMNPTHTCEGAVTTTGHQATRSWVASIIKEKLKVFPNYKPKDIVNDIK
        RESLRKYAIAHQFAFRYKKNDSHRVTVKCKAEGCPWRIHASRLSTTQLICIKKMNPTHTCEGAVTTTGHQATRSWVASI+KEKLKVFPNYKPKDIV+DIK
Subjt:  RESLRKYAIAHQFAFRYKKNDSHRVTVKCKAEGCPWRIHASRLSTTQLICIKKMNPTHTCEGAVTTTGHQATRSWVASIIKEKLKVFPNYKPKDIVNDIK

Query:  QEYGIQLNYFQAWRGKEIAKEQLQGSYKEAYNQLPFLCEKIMETNPGSLATCDTKEDSSFHRLFVSFHASLSGFQQGCRPLIFLDSIPLKSKYQGTLLAA
        QEYGIQLNYFQAWRGKEIAKEQLQGSYKEAYNQLPFLC KIMETNPGSLATCDTKEDS+FHRLFVSFHASLSGFQQGCRPLIFLDSIPLKSKYQGTLLAA
Subjt:  QEYGIQLNYFQAWRGKEIAKEQLQGSYKEAYNQLPFLCEKIMETNPGSLATCDTKEDSSFHRLFVSFHASLSGFQQGCRPLIFLDSIPLKSKYQGTLLAA

Query:  TAADGDDGFFPVAFSVVDAESDDNWGWFLLQLKSALSTTCPITFVADRQKGLTVSIAGIFKDSFHGYCLRYLTEQLIRDLKGQFSHEVKRLIVEDFYAAA
        TAADGDDGFFPVAFSVVD ESDDNW WFLLQLKSALST+CPITFVADRQKGLTVSIA IFK SFHGYCLRYLTEQLIRDLKGQFSHEVKRLIVEDFYAAA
Subjt:  TAADGDDGFFPVAFSVVDAESDDNWGWFLLQLKSALSTTCPITFVADRQKGLTVSIAGIFKDSFHGYCLRYLTEQLIRDLKGQFSHEVKRLIVEDFYAAA

Query:  YAPKPENFQRCVESIKSISLEAYNWILQSEPQNWANAFFEGARYNHMTSNFGELFYSWVSDAHELPITQMVDAIRVKIMELIYTRRADSDQWLTRLTPSM
        YAPKPENFQRCVESIKSISLEAYNWILQSEPQNWANAFFEGARYNHMTSNFGE+FYSWVS+AHELPITQMVD IRVKIMELIYTRRADSDQWLTRLTPSM
Subjt:  YAPKPENFQRCVESIKSISLEAYNWILQSEPQNWANAFFEGARYNHMTSNFGELFYSWVSDAHELPITQMVDAIRVKIMELIYTRRADSDQWLTRLTPSM

Query:  EEKLEKEGHKVHSLQVLLSAGSTFEVRGDSIEVVDVDHWDCSCKGWQLTGLPCSHAIAVLGCLGRIPYDFCSRYLSTESYRLTYSESVHPVPHVDAPIQK
        EEKLEKEGHK H+L VL+SAGSTFEVRGDSIEVVDVDHWDC+CKGWQLTGLPCSHAIAVLGCLGR P+DFCSRY +TESYRLTYS+SVHPVP VD PI K
Subjt:  EEKLEKEGHKVHSLQVLLSAGSTFEVRGDSIEVVDVDHWDCSCKGWQLTGLPCSHAIAVLGCLGRIPYDFCSRYLSTESYRLTYSESVHPVPHVDAPIQK

Query:  GSLQASVTVTPPPTRRPPGRPTSKRYGSPEVMKRQLQCSRCKGLGHNKSTCKEFLQSV
         SLQASVTVTPPPTRRPPGRPTSKRYGSPEVMKRQLQCSRCKGLGHNKSTCK+ LQSV
Subjt:  GSLQASVTVTPPPTRRPPGRPTSKRYGSPEVMKRQLQCSRCKGLGHNKSTCKEFLQSV

A0A5D3BC93 MuDR family transposase isoform 2 | 0.0e+00 | 92.74
Query:  GGEFETGRDGMLLYHGGDAHAIDVDDKMKFNEFKLEVAEMFNCNVDTMSIKYFLPGNRKTLITVSNDKDLKRMIKFHGDSVTVDIYVIMEDVVAPDVSNF
        GGEFETGRDGML YHGGDAHAIDVDDKMKFNEFK+E+AEMFN +VDTMSIKYFLPGNRKTLIT+SNDKDLKRM+KFHGDS TVDI+VIME+V+AP++SN 
Subjt:  GGEFETGRDGMLLYHGGDAHAIDVDDKMKFNEFKLEVAEMFNCNVDTMSIKYFLPGNRKTLITVSNDKDLKRMIKFHGDSVTVDIYVIMEDVVAPDVSNF

Query:  PASRSSRTTLSETVVPVVGTPLTIIHGIGDDNTQPDIPLDGALDVVDDTNPIVTHIDIAGDITPILPLLGPNDEKHGKGAQQWQNTITGVGQRFSSVHEF
        PASRSSRTTLSETVVPV GTPLT++HGI DDN + DIPLDGALDVVDDTNP+V HIDIAGDITPILPLLG +DEK+GKG QQWQNTITGVGQRFSSVHEF
Subjt:  PASRSSRTTLSETVVPVVGTPLTIIHGIGDDNTQPDIPLDGALDVVDDTNPIVTHIDIAGDITPILPLLGPNDEKHGKGAQQWQNTITGVGQRFSSVHEF

Query:  RESLRKYAIAHQFAFRYKKNDSHRVTVKCKAEGCPWRIHASRLSTTQLICIKKMNPTHTCEGAVTTTGHQATRSWVASIIKEKLKVFPNYKPKDIVNDIK
        RESLRKYAIAHQFAFRYKKNDSHRVTVKCKAEGCPWRIHASRLSTTQLICIKKMNPTHTCEGAVTTTGHQATRSWVASI+KEKLKVFPNYKPKDIV+DIK
Subjt:  RESLRKYAIAHQFAFRYKKNDSHRVTVKCKAEGCPWRIHASRLSTTQLICIKKMNPTHTCEGAVTTTGHQATRSWVASIIKEKLKVFPNYKPKDIVNDIK

Query:  QEYGIQLNYFQAWRGKEIAKEQLQGSYKEAYNQLPFLCEKIMETNPGSLATCDTKEDSSFHRLFVSFHASLSGFQQGCRPLIFLDSIPLKSKYQGTLLAA
        QEYGIQLNYFQAWRGKEIAKEQLQGSYKEAYNQLPFLC KIMETNPGSLATCDTKEDS+FHRLFVSFHASLSGFQQGCRPLIFLDSIPLKSKYQGTLLAA
Subjt:  QEYGIQLNYFQAWRGKEIAKEQLQGSYKEAYNQLPFLCEKIMETNPGSLATCDTKEDSSFHRLFVSFHASLSGFQQGCRPLIFLDSIPLKSKYQGTLLAA

Query:  TAADGDDGFFPVAFSVVDAESDDNWGWFLLQLKSALSTTCPITFVADRQKGLTVSIAGIFKDSFHGYCLRYLTEQLIRDLKGQFSHEVKRLIVEDFYAAA
        TAADGDDGFFPVAFSVVD ESDDNW WFLLQLKSALST+CPITFVADRQKGLTVSIA IFK SFHGYCLRYLTEQLIRDLKGQFSHEVKRLIVEDFYAAA
Subjt:  TAADGDDGFFPVAFSVVDAESDDNWGWFLLQLKSALSTTCPITFVADRQKGLTVSIAGIFKDSFHGYCLRYLTEQLIRDLKGQFSHEVKRLIVEDFYAAA

Query:  YAPKPENFQRCVESIKSISLEAYNWILQSEPQNWANAFFEGARYNHMTSNFGELFYSWVSDAHELPITQMVDAIRVKIMELIYTRRADSDQWLTRLTPSM
        YAPKPENFQRCVESIKSISLEAYNWILQSEPQNWANAFFEGARYNHMTSNFGE+FYSWVS+AHELPITQMVD IRVKIMELIYTRRADSDQWLTRLTPSM
Subjt:  YAPKPENFQRCVESIKSISLEAYNWILQSEPQNWANAFFEGARYNHMTSNFGELFYSWVSDAHELPITQMVDAIRVKIMELIYTRRADSDQWLTRLTPSM

Query:  EEKLEKEGHKVHSLQVLLSAGSTFEVRGDSIEVVDVDHWDCSCKGWQLTGLPCSHAIAVLGCLGRIPYDFCSRYLSTESYRLTYSESVHPVPHVDAPIQK
        EEKLEKEGHK H+L VL+SAGSTFEVRGDSIEVVDVDHWDC+CKGWQLTGLPCSHAIAVLGCLGR P+DFCSRY +TESYRLTYS+SVHPVP VD PI K
Subjt:  EEKLEKEGHKVHSLQVLLSAGSTFEVRGDSIEVVDVDHWDCSCKGWQLTGLPCSHAIAVLGCLGRIPYDFCSRYLSTESYRLTYSESVHPVPHVDAPIQK

Query:  GSLQASVTVTPPPTRRPPGRPTSKRYGSPEVMKRQLQCSRCKGLGHNKSTCKEFLQSV
         SLQASVTVTPPPTRRPPGRPTSKRYGSPEVMKRQLQCSRCKGLGHNKSTCK+ LQSV
Subjt:  GSLQASVTVTPPPTRRPPGRPTSKRYGSPEVMKRQLQCSRCKGLGHNKSTCKEFLQSV

A0A6J1DYS8 uncharacterized protein LOC111025860 isoform X1 | 0.0e+00 | 94.06
Query:  GGEFETGRDGMLLYHGGDAHAIDVDDKMKFNEFKLEVAEMFNCNVDTMSIKYFLPGNRKTLITVSNDKDLKRMIKFHGDSVTVDIYVIMEDVVAPDVSNF
        GGEFETGRDGML YHGGDAHAIDVDDKMKFNEFK+EVAEMFNCNVDTMSIKYFLPGNRKTLIT+SNDKDLKRMIKFHGDSVTVDIY+ ME+VVA D+SN 
Subjt:  GGEFETGRDGMLLYHGGDAHAIDVDDKMKFNEFKLEVAEMFNCNVDTMSIKYFLPGNRKTLITVSNDKDLKRMIKFHGDSVTVDIYVIMEDVVAPDVSNF

Query:  PASRSSRTTLSETVVPVVGTPLTIIHGIGDDNTQPDIPLDGALDVVDDTNPIVTHIDIAGDITPILPLLGPNDEKHGKGAQQWQNTITGVGQRFSSVHEF
        PASRSSRTTLSETVVPV GTPLTIIHGIGDDNT  DIPLDGALDVVDDTN +V HIDI GDITPILPLLG NDEKHGKGAQQWQNTITGVGQRFSSVHEF
Subjt:  PASRSSRTTLSETVVPVVGTPLTIIHGIGDDNTQPDIPLDGALDVVDDTNPIVTHIDIAGDITPILPLLGPNDEKHGKGAQQWQNTITGVGQRFSSVHEF

Query:  RESLRKYAIAHQFAFRYKKNDSHRVTVKCKAEGCPWRIHASRLSTTQLICIKKMNPTHTCEGAVTTTGHQATRSWVASIIKEKLKVFPNYKPKDIVNDIK
        RESLRKYAIAHQFAF+YKKNDSHRVTVKCKAEGCPWRIHASRLSTTQLICIKKMN THTCEGAV TTG+QATRSWVASIIKEKLKV+PNYKPKDIVNDIK
Subjt:  RESLRKYAIAHQFAFRYKKNDSHRVTVKCKAEGCPWRIHASRLSTTQLICIKKMNPTHTCEGAVTTTGHQATRSWVASIIKEKLKVFPNYKPKDIVNDIK

Query:  QEYGIQLNYFQAWRGKEIAKEQLQGSYKEAYNQLPFLCEKIMETNPGSLATCDTKEDSSFHRLFVSFHASLSGFQQGCRPLIFLDSIPLKSKYQGTLLAA
        QEYGIQLNYFQAWRGKEIAKEQLQGSYKEAYNQLP LCEKIMETNPGSLATCDTKEDSSFHRLFVSFHASL GFQQGCRPLIFLDSIPLKSKYQGTLLAA
Subjt:  QEYGIQLNYFQAWRGKEIAKEQLQGSYKEAYNQLPFLCEKIMETNPGSLATCDTKEDSSFHRLFVSFHASLSGFQQGCRPLIFLDSIPLKSKYQGTLLAA

Query:  TAADGDDGFFPVAFSVVDAESDDNWGWFLLQLKSALSTTCPITFVADRQKGLTVSIAGIFKDSFHGYCLRYLTEQLIRDLKGQFSHEVKRLIVEDFYAAA
        TAADGDDGFFP+AFSVVD ES+DNW WFLLQLKSALST+CPITFVADRQKGLTVSIAGIFK SFHG CLRYLTEQLIRDLKGQFSHEVKRLIVEDFY AA
Subjt:  TAADGDDGFFPVAFSVVDAESDDNWGWFLLQLKSALSTTCPITFVADRQKGLTVSIAGIFKDSFHGYCLRYLTEQLIRDLKGQFSHEVKRLIVEDFYAAA

Query:  YAPKPENFQRCVESIKSISLEAYNWILQSEPQNWANAFFEGARYNHMTSNFGELFYSWVSDAHELPITQMVDAIRVKIMELIYTRRADSDQWLTRLTPSM
        YAPKPENFQRC+ESIKSISLEAYNWILQSEPQNWANAFFEGARYNHMTSNFGELFYSWVSDAHELPITQMVDAIRVKIMELIY RRA+SDQWLTRLTPSM
Subjt:  YAPKPENFQRCVESIKSISLEAYNWILQSEPQNWANAFFEGARYNHMTSNFGELFYSWVSDAHELPITQMVDAIRVKIMELIYTRRADSDQWLTRLTPSM

Query:  EEKLEKEGHKVHSLQVLLSAGSTFEVRGDSIEVVDVDHWDCSCKGWQLTGLPCSHAIAVLGCLGRIPYDFCSRYLSTESYRLTYSESVHPVPHVDAPIQK
        EEKLEKEGHKVHSLQVLLSAGSTFEVRGDSIEVVDVDHWDC+CKGWQL GLPCSHAIAVLGCLGR PY+FCSRY +TESYRLTYSESVHPVPHVD P+ K
Subjt:  EEKLEKEGHKVHSLQVLLSAGSTFEVRGDSIEVVDVDHWDCSCKGWQLTGLPCSHAIAVLGCLGRIPYDFCSRYLSTESYRLTYSESVHPVPHVDAPIQK

Query:  GSLQASVTVTPPPTRRPPGRPTSKRYGSPEVMKRQLQCSRCKGLGHNKSTCKEFLQSV
        GSLQASVTVTPPPTRRPPGRPTSKRYGSPEVMKRQLQCSRCKGLGHNKSTCKEF QSV
Subjt:  GSLQASVTVTPPPTRRPPGRPTSKRYGSPEVMKRQLQCSRCKGLGHNKSTCKEFLQSV

A0A6J1DZ89 uncharacterized protein LOC111025961 | 0.0e+00 | 94.72
Query:  GGEFETGRDGMLLYHGGDAHAIDVDDKMKFNEFKLEVAEMFNCNVDTMSIKYFLPGNRKTLITVSNDKDLKRMIKFHGDSVTVDIYVIMEDVVAPDVSNF
        GGEFET RDG L YHGGDAHAID+DDKMKFNEFK+EVAEMFNCN+DTMSIKYFLPGNRKTLITVSNDKDLKRMIKFHGDSV+VDIY+ ME+VVA  VSN 
Subjt:  GGEFETGRDGMLLYHGGDAHAIDVDDKMKFNEFKLEVAEMFNCNVDTMSIKYFLPGNRKTLITVSNDKDLKRMIKFHGDSVTVDIYVIMEDVVAPDVSNF

Query:  PASRSSRTTLSETVVPVVGTPLTIIHGIGDDNTQPDIPLDGALDVVDDTNPIVTHIDIAGDITPILPLLGPNDEKHGKGAQQWQNTITGVGQRFSSVHEF
        PASRSSRTTLSETVVPV GTPLTIIHGIGDDNT  DI LDG LDVVDDTNPIV HIDI GDITPILPLLG NDEKHGKGAQQWQNTITGVGQRFSSVHEF
Subjt:  PASRSSRTTLSETVVPVVGTPLTIIHGIGDDNTQPDIPLDGALDVVDDTNPIVTHIDIAGDITPILPLLGPNDEKHGKGAQQWQNTITGVGQRFSSVHEF

Query:  RESLRKYAIAHQFAFRYKKNDSHRVTVKCKAEGCPWRIHASRLSTTQLICIKKMNPTHTCEGAVTTTGHQATRSWVASIIKEKLKVFPNYKPKDIVNDIK
        RESLRKYAIAHQFAFRYKKNDSHRVTVKCKAEGCPWRIHASRLSTTQLICIKKMNPTH CEGAVTTTGHQATRSWVASIIKEKLKVFPNYKPKDIVNDIK
Subjt:  RESLRKYAIAHQFAFRYKKNDSHRVTVKCKAEGCPWRIHASRLSTTQLICIKKMNPTHTCEGAVTTTGHQATRSWVASIIKEKLKVFPNYKPKDIVNDIK

Query:  QEYGIQLNYFQAWRGKEIAKEQLQGSYKEAYNQLPFLCEKIMETNPGSLATCDTKEDSSFHRLFVSFHASLSGFQQGCRPLIFLDSIPLKSKYQGTLLAA
        QEYGIQLNYFQAWRGKEIAKEQLQGSYKEAY+QLP LCEKIMETNPGSLATCDTKEDSSFHRLFVSFHASL GFQQGCRPLIFLDSIPLKSKYQGTLLAA
Subjt:  QEYGIQLNYFQAWRGKEIAKEQLQGSYKEAYNQLPFLCEKIMETNPGSLATCDTKEDSSFHRLFVSFHASLSGFQQGCRPLIFLDSIPLKSKYQGTLLAA

Query:  TAADGDDGFFPVAFSVVDAESDDNWGWFLLQLKSALSTTCPITFVADRQKGLTVSIAGIFKDSFHGYCLRYLTEQLIRDLKGQFSHEVKRLIVEDFYAAA
        TAADGDDGFFPVAFSVVD ES+DNW WFLLQLKSALST+CPITFVADRQKGLTVSIAGIFK SFHGYCLRYLTEQLIRDLKGQFSHEVKRLIVEDFY AA
Subjt:  TAADGDDGFFPVAFSVVDAESDDNWGWFLLQLKSALSTTCPITFVADRQKGLTVSIAGIFKDSFHGYCLRYLTEQLIRDLKGQFSHEVKRLIVEDFYAAA

Query:  YAPKPENFQRCVESIKSISLEAYNWILQSEPQNWANAFFEGARYNHMTSNFGELFYSWVSDAHELPITQMVDAIRVKIMELIYTRRADSDQWLTRLTPSM
        YAPKPENFQRCVESIKSISLEAYNWILQSEPQNWANAFFEGARYNHMTSNFGELFYSWVSDAHELPITQMVDAIRVKIMELIY RRA+SDQWLTRLTPSM
Subjt:  YAPKPENFQRCVESIKSISLEAYNWILQSEPQNWANAFFEGARYNHMTSNFGELFYSWVSDAHELPITQMVDAIRVKIMELIYTRRADSDQWLTRLTPSM

Query:  EEKLEKEGHKVHSLQVLLSAGSTFEVRGDSIEVVDVDHWDCSCKGWQLTGLPCSHAIAVLGCLGRIPYDFCSRYLSTESYRLTYSESVHPVPHVDAPIQK
        EEKLEKEGHKVHSLQVLLSAGSTFEVRGDSIEVVDVDHWDC+CKGWQLTGLPCSHAIAVLGCLGR PY+FCSRY +TESYRLTYSESVHP+PHVD PIQK
Subjt:  EEKLEKEGHKVHSLQVLLSAGSTFEVRGDSIEVVDVDHWDCSCKGWQLTGLPCSHAIAVLGCLGRIPYDFCSRYLSTESYRLTYSESVHPVPHVDAPIQK

Query:  GSLQASVTVTPPPTRRPPGRPTSKRYGSPEVMKRQLQCSRCKGLGHNKSTCKEFLQSV
        GSLQASVTVTPPPTRRPPGRPTSKRYGSPEVMKRQLQCSRCKGLGHNKSTCKEFLQSV
Subjt:  GSLQASVTVTPPPTRRPPGRPTSKRYGSPEVMKRQLQCSRCKGLGHNKSTCKEFLQSV

A0A6J1E2D7 uncharacterized protein LOC111025860 isoform X2 | 0.0e+00 | 94.06
Query:  GGEFETGRDGMLLYHGGDAHAIDVDDKMKFNEFKLEVAEMFNCNVDTMSIKYFLPGNRKTLITVSNDKDLKRMIKFHGDSVTVDIYVIMEDVVAPDVSNF
        GGEFETGRDGML YHGGDAHAIDVDDKMKFNEFK+EVAEMFNCNVDTMSIKYFLPGNRKTLIT+SNDKDLKRMIKFHGDSVTVDIY+ ME+VVA D+SN 
Subjt:  GGEFETGRDGMLLYHGGDAHAIDVDDKMKFNEFKLEVAEMFNCNVDTMSIKYFLPGNRKTLITVSNDKDLKRMIKFHGDSVTVDIYVIMEDVVAPDVSNF

Query:  PASRSSRTTLSETVVPVVGTPLTIIHGIGDDNTQPDIPLDGALDVVDDTNPIVTHIDIAGDITPILPLLGPNDEKHGKGAQQWQNTITGVGQRFSSVHEF
        PASRSSRTTLSETVVPV GTPLTIIHGIGDDNT  DIPLDGALDVVDDTN +V HIDI GDITPILPLLG NDEKHGKGAQQWQNTITGVGQRFSSVHEF
Subjt:  PASRSSRTTLSETVVPVVGTPLTIIHGIGDDNTQPDIPLDGALDVVDDTNPIVTHIDIAGDITPILPLLGPNDEKHGKGAQQWQNTITGVGQRFSSVHEF

Query:  RESLRKYAIAHQFAFRYKKNDSHRVTVKCKAEGCPWRIHASRLSTTQLICIKKMNPTHTCEGAVTTTGHQATRSWVASIIKEKLKVFPNYKPKDIVNDIK
        RESLRKYAIAHQFAF+YKKNDSHRVTVKCKAEGCPWRIHASRLSTTQLICIKKMN THTCEGAV TTG+QATRSWVASIIKEKLKV+PNYKPKDIVNDIK
Subjt:  RESLRKYAIAHQFAFRYKKNDSHRVTVKCKAEGCPWRIHASRLSTTQLICIKKMNPTHTCEGAVTTTGHQATRSWVASIIKEKLKVFPNYKPKDIVNDIK

Query:  QEYGIQLNYFQAWRGKEIAKEQLQGSYKEAYNQLPFLCEKIMETNPGSLATCDTKEDSSFHRLFVSFHASLSGFQQGCRPLIFLDSIPLKSKYQGTLLAA
        QEYGIQLNYFQAWRGKEIAKEQLQGSYKEAYNQLP LCEKIMETNPGSLATCDTKEDSSFHRLFVSFHASL GFQQGCRPLIFLDSIPLKSKYQGTLLAA
Subjt:  QEYGIQLNYFQAWRGKEIAKEQLQGSYKEAYNQLPFLCEKIMETNPGSLATCDTKEDSSFHRLFVSFHASLSGFQQGCRPLIFLDSIPLKSKYQGTLLAA

Query:  TAADGDDGFFPVAFSVVDAESDDNWGWFLLQLKSALSTTCPITFVADRQKGLTVSIAGIFKDSFHGYCLRYLTEQLIRDLKGQFSHEVKRLIVEDFYAAA
        TAADGDDGFFP+AFSVVD ES+DNW WFLLQLKSALST+CPITFVADRQKGLTVSIAGIFK SFHG CLRYLTEQLIRDLKGQFSHEVKRLIVEDFY AA
Subjt:  TAADGDDGFFPVAFSVVDAESDDNWGWFLLQLKSALSTTCPITFVADRQKGLTVSIAGIFKDSFHGYCLRYLTEQLIRDLKGQFSHEVKRLIVEDFYAAA

Query:  YAPKPENFQRCVESIKSISLEAYNWILQSEPQNWANAFFEGARYNHMTSNFGELFYSWVSDAHELPITQMVDAIRVKIMELIYTRRADSDQWLTRLTPSM
        YAPKPENFQRC+ESIKSISLEAYNWILQSEPQNWANAFFEGARYNHMTSNFGELFYSWVSDAHELPITQMVDAIRVKIMELIY RRA+SDQWLTRLTPSM
Subjt:  YAPKPENFQRCVESIKSISLEAYNWILQSEPQNWANAFFEGARYNHMTSNFGELFYSWVSDAHELPITQMVDAIRVKIMELIYTRRADSDQWLTRLTPSM

Query:  EEKLEKEGHKVHSLQVLLSAGSTFEVRGDSIEVVDVDHWDCSCKGWQLTGLPCSHAIAVLGCLGRIPYDFCSRYLSTESYRLTYSESVHPVPHVDAPIQK
        EEKLEKEGHKVHSLQVLLSAGSTFEVRGDSIEVVDVDHWDC+CKGWQL GLPCSHAIAVLGCLGR PY+FCSRY +TESYRLTYSESVHPVPHVD P+ K
Subjt:  EEKLEKEGHKVHSLQVLLSAGSTFEVRGDSIEVVDVDHWDCSCKGWQLTGLPCSHAIAVLGCLGRIPYDFCSRYLSTESYRLTYSESVHPVPHVDAPIQK

Query:  GSLQASVTVTPPPTRRPPGRPTSKRYGSPEVMKRQLQCSRCKGLGHNKSTCKEFLQSV
        GSLQASVTVTPPPTRRPPGRPTSKRYGSPEVMKRQLQCSRCKGLGHNKSTCKEF QSV
Subjt:  GSLQASVTVTPPPTRRPPGRPTSKRYGSPEVMKRQLQCSRCKGLGHNKSTCKEFLQSV

SwissProt top hits | e-value | % identity
Q6NQJ7 Protein FAR1-RELATED SEQUENCE 4 | 1.3e-06 | 20.56
Query:  KIMETNPGSLATCDTKEDSSFHRLFVSFHASLSGFQ--QGCRPLIFLDSIPLKSKYQGTLLAATAADGDDGFFPVAFSVVDAESDDNWGWFLLQLKSALS
        ++ E NP      D  ED   H L   F     G +  +    ++  ++    SKY+  L+     +       +   ++  ++   + W +     A+ 
Subjt:  KIMETNPGSLATCDTKEDSSFHRLFVSFHASLSGFQ--QGCRPLIFLDSIPLKSKYQGTLLAATAADGDDGFFPVAFSVVDAESDDNWGWFLLQLKSALS

Query:  TTCPITFVADRQKGLTVSIAGIFKDSFHGYCLRYLTEQLIRDLKGQFSHEVKRLIVEDFYAAAYA--PKPENFQRCVESIKSISLEAYNWI--LQSEPQN
           P   + D+   +  +IA +  ++ H YCL ++ +QL R+L   +    +   ++  +   Y    + E  +R ++ I    L    W+  L  E + 
Subjt:  TTCPITFVADRQKGLTVSIAGIFKDSFHGYCLRYLTEQLIRDLKGQFSHEVKRLIVEDFYAAAYA--PKPENFQRCVESIKSISLEAYNWI--LQSEPQN

Query:  WANAFFEGARYNHMTSNFGELFYSWVSDAHELPITQMVDAIR--VKIMELIYTR--RADSDQW-----LTRLTPSMEEKLEKEGHKV---HSLQVLLSA-
        WA  F  G  +  ++        + + D +  P T + + +     ++E  Y    +AD D W     L   +P  ++ L    H++     L+VL +A 
Subjt:  WANAFFEGARYNHMTSNFGELFYSWVSDAHELPITQMVDAIR--VKIMELIYTR--RADSDQW-----LTRLTPSMEEKLEKEGHKV---HSLQVLLSA-

Query:  ---------GSTFEVRGDSIEVVDVDHWD-------CSCKGWQLTGLPCSHAIAVLGCLG
                 G+T+ V+    E   +  WD       CSC+ ++  G  C HAI VL   G
Subjt:  ---------GSTFEVRGDSIEVVDVDHWD-------CSCKGWQLTGLPCSHAIAVLGCLG

Arabidopsis top hits | e-value | % identity
AT1G49920.1 MuDR family transposase | 7.7e-34 | 23.06
Query:  DEKHGKGAQQWQNTITG---------VGQRFSSVHEFRESLRKYAIAHQFAFRYKKNDSHRVTVKCKAEGCPWRIHASRLSTTQLICIKKMNPTHTCEGA
        D   G G+      ++G         VG  F  + E ++++   +I  +     ++ +     V+C+   C W I ASR     L  I + +  H C   
Subjt:  DEKHGKGAQQWQNTITG---------VGQRFSSVHEFRESLRKYAIAHQFAFRYKKNDSHRVTVKCKAEGCPWRIHASRLSTTQLICIKKMNPTHTCEGA

Query:  VTTTGHQATRSWVASIIKEKLKVFPNYKPKDIVNDIKQEYGIQLNYFQAWRGKEIAKE-------QLQGSYKEAYNQLPFLCEKIMETNPGSL------A
             +      +   I+  ++V P     ++    ++++G  L+       + + ++       +  G + +++  +P L   ++ ++ G L      +
Subjt:  VTTTGHQATRSWVASIIKEKLKVFPNYKPKDIVNDIKQEYGIQLNYFQAWRGKEIAKE-------QLQGSYKEAYNQLPFLCEKIMETNPGSL------A

Query:  TCDTKEDSSFHRLFVSFHASLSGFQQGCRPLIFLDSIPLKSKYQGTLLAATAADGDDGFFPVAFSVVDAESDDNWGWFLLQLKSALSTTCPITFVADRQK
             E +SF  LF +F  S+ GFQ  CRPLI +D+  L  KY+  L+ A+A D  + +FP+AF+V    S D+W WFL +++  ++    I  ++    
Subjt:  TCDTKEDSSFHRLFVSFHASLSGFQQGCRPLIFLDSIPLKSKYQGTLLAATAADGDDGFFPVAFSVVDAESDDNWGWFLLQLKSALSTTCPITFVADRQK

Query:  GLTVSI---AGIFKD--SFHGYCLRYLTEQLIRDLKGQFSHEVKRLIVEDFYAAAYAPKPENFQRCVESIKSISLEAYNWILQSEPQNWANAFFEGARYN
         +   I      +K+  ++H +CL +L  +L     G F + +  L+ E    A  + + E F   ++ IK  + EA+ W+ Q  P  WA A  +G RY 
Subjt:  GLTVSI---AGIFKD--SFHGYCLRYLTEQLIRDLKGQFSHEVKRLIVEDFYAAAYAPKPENFQRCVESIKSISLEAYNWILQSEPQNWANAFFEGARYN

Query:  HMTSNFGELF------------------YSWVSDAHELPI------TQMVDAIRVKIMELIYTRRADSDQWLTRLTPSMEEKLEKEGHKVHSLQVLLSAG
         M  +   LF                  +  + DA            +  D     +ME +     DSD W+  +TP     LE++ +     QV ++  
Subjt:  HMTSNFGELF------------------YSWVSDAHELPI------TQMVDAIRVKIMELIYTRRADSDQWLTRLTPSMEEKLEKEGHKVHSLQVLLSAG

Query:  STFEVRGDSIE----VVDVDHWDCSCKGWQLTGLPCSHAIAVLGCLGRIPYDFCSRYLSTESYRLTYSESVHPVPHVDAPIQKGSLQASVTVTPPPTRRP
            + G S +    +V ++   C+C  +Q    PC HA+AV   L   P  +     + E Y  TYS    PVP + A  +   +    T+ PP    P
Subjt:  STFEVRGDSIE----VVDVDHWDCSCKGWQLTGLPCSHAIAVLGCLGRIPYDFCSRYLSTESYRLTYSESVHPVPHVDAPIQKGSLQASVTVTPPPTRRP

Query:  PGRPTSK
        P + + K
Subjt:  PGRPTSK

AT1G64260.1 MuDR family transposase | 8.5e-33 | 22.54
Query:  PDVSNFPASRSSRTTLSETVVPVVGTPLTIIHGIGDDNTQPDIPLDGALDVVDDTNPIVTHIDIAGDITP-ILPLLGPNDEKHGKGAQQWQNTITGVGQR
        P V+N      ++ T    VVPV  T  + +      + +  I     +D    +  I+  +  +G + P +LP L  +D+               +G  
Subjt:  PDVSNFPASRSSRTTLSETVVPVVGTPLTIIHGIGDDNTQPDIPLDGALDVVDDTNPIVTHIDIAGDITP-ILPLLGPNDEKHGKGAQQWQNTITGVGQR

Query:  FSSVHEFRESLRKYAIAHQFAFRYKKNDSHRVTVKCKAEGCPWRIHASRLSTTQLICIKKMNPTHTCEGAVTTTGHQATRSWVASIIKEKLKVFPNYKPK
        F    E ++++  + I  +     ++ +    T +C    C W + A+R+    L+ I K    HTC        +     + A  I+  +++ P     
Subjt:  FSSVHEFRESLRKYAIAHQFAFRYKKNDSHRVTVKCKAEGCPWRIHASRLSTTQLICIKKMNPTHTCEGAVTTTGHQATRSWVASIIKEKLKVFPNYKPK

Query:  DIVNDIKQEYGIQLNYFQAWRGKEIAKEQLQGSYKEAYNQLPFLCEKIMETNPGSLATCD-----TKEDSSFHRLFVSFHASLSGFQQGCRPLIFLDSIP
        ++    K++ G +L   +   GK    +++ G   +++  +P L      +N G L           + +SF  +F SF  S+ GFQ  CRPLI +D+  
Subjt:  DIVNDIKQEYGIQLNYFQAWRGKEIAKEQLQGSYKEAYNQLPFLCEKIMETNPGSLATCD-----TKEDSSFHRLFVSFHASLSGFQQGCRPLIFLDSIP

Query:  LKSKYQGTLLAATAADGDDGFFPVAFSVVDAESDDNWGWFLLQLKSALSTTCPITFVADRQKGLTVSI---AGIFKD--SFHGYCLRYLTEQLIRDLKGQ
        L  KYQ  L+ A+  D  + FFP+AF+V    S D+W WF  +++  ++    +  ++   + +   +     ++++  + H +CL +L  Q +    G 
Subjt:  LKSKYQGTLLAATAADGDDGFFPVAFSVVDAESDDNWGWFLLQLKSALSTTCPITFVADRQKGLTVSI---AGIFKD--SFHGYCLRYLTEQLIRDLKGQ

Query:  FSHEVKRLIVEDFYAAAYAPKPENFQRCVESIKSISLEAYNWILQSEPQNWANAFFEGARYNHMTSNFGELF-----YSWVSDAHELPITQMVDAIRVKI
        F       +VE    A    + E F   +  IK  + EA+ W+ Q     WA A   G RY  +  +   LF     + + + A    +  M D +R   
Subjt:  FSHEVKRLIVEDFYAAAYAPKPENFQRCVESIKSISLEAYNWILQSEPQNWANAFFEGARYNHMTSNFGELF-----YSWVSDAHELPITQMVDAIRVKI

Query:  MELIYTRRADSDQWLTRLTPSMEEKLEKEGHKVHSLQVLLSAGSTFEVRGDSIE---VVDVDHWDCSCKGWQLTGLPCSHAIAVLGCLGRIPYDFCSRYL
         + + +  +  ++ +    P M++  E     +  +   L   S F+V   S +   +V ++   C+C+ +Q    PC HA+AV   L   P  +     
Subjt:  MELIYTRRADSDQWLTRLTPSMEEKLEKEGHKVHSLQVLLSAGSTFEVRGDSIE---VVDVDHWDCSCKGWQLTGLPCSHAIAVLGCLGRIPYDFCSRYL

Query:  STESYRLTYSESVHPVPHVDA
        + E Y  TY+ +  PVP V A
Subjt:  STESYRLTYSESVHPVPHVDA

AT1G76320.1 FAR1-related sequence 4 | e value: 9.4e-08 | % identity: 20.56
Query:  KIMETNPGSLATCDTKEDSSFHRLFVSFHASLSGFQ--QGCRPLIFLDSIPLKSKYQGTLLAATAADGDDGFFPVAFSVVDAESDDNWGWFLLQLKSALS
        ++ E NP      D  ED   H L   F     G +  +    ++  ++    SKY+  L+     +       +   ++  ++   + W +     A+ 
Subjt:  KIMETNPGSLATCDTKEDSSFHRLFVSFHASLSGFQ--QGCRPLIFLDSIPLKSKYQGTLLAATAADGDDGFFPVAFSVVDAESDDNWGWFLLQLKSALS

Query:  TTCPITFVADRQKGLTVSIAGIFKDSFHGYCLRYLTEQLIRDLKGQFSHEVKRLIVEDFYAAAYA--PKPENFQRCVESIKSISLEAYNWI--LQSEPQN
           P   + D+   +  +IA +  ++ H YCL ++ +QL R+L   +    +   ++  +   Y    + E  +R ++ I    L    W+  L  E + 
Subjt:  TTCPITFVADRQKGLTVSIAGIFKDSFHGYCLRYLTEQLIRDLKGQFSHEVKRLIVEDFYAAAYA--PKPENFQRCVESIKSISLEAYNWI--LQSEPQN

Query:  WANAFFEGARYNHMTSNFGELFYSWVSDAHELPITQMVDAIR--VKIMELIYTR--RADSDQW-----LTRLTPSMEEKLEKEGHKV---HSLQVLLSA-
        WA  F  G  +  ++        + + D +  P T + + +     ++E  Y    +AD D W     L   +P  ++ L    H++     L+VL +A 
Subjt:  WANAFFEGARYNHMTSNFGELFYSWVSDAHELPITQMVDAIR--VKIMELIYTR--RADSDQW-----LTRLTPSMEEKLEKEGHKV---HSLQVLLSA-

Query:  ---------GSTFEVRGDSIEVVDVDHWD-------CSCKGWQLTGLPCSHAIAVLGCLG
                 G+T+ V+    E   +  WD       CSC+ ++  G  C HAI VL   G
Subjt:  ---------GSTFEVRGDSIEVVDVDHWD-------CSCKGWQLTGLPCSHAIAVLGCLG

AT1G76320.2 FAR1-related sequence 4 | e value: 9.4e-08 | % identity: 20.56
Query:  KIMETNPGSLATCDTKEDSSFHRLFVSFHASLSGFQ--QGCRPLIFLDSIPLKSKYQGTLLAATAADGDDGFFPVAFSVVDAESDDNWGWFLLQLKSALS
        ++ E NP      D  ED   H L   F     G +  +    ++  ++    SKY+  L+     +       +   ++  ++   + W +     A+ 
Subjt:  KIMETNPGSLATCDTKEDSSFHRLFVSFHASLSGFQ--QGCRPLIFLDSIPLKSKYQGTLLAATAADGDDGFFPVAFSVVDAESDDNWGWFLLQLKSALS

Query:  TTCPITFVADRQKGLTVSIAGIFKDSFHGYCLRYLTEQLIRDLKGQFSHEVKRLIVEDFYAAAYA--PKPENFQRCVESIKSISLEAYNWI--LQSEPQN
           P   + D+   +  +IA +  ++ H YCL ++ +QL R+L   +    +   ++  +   Y    + E  +R ++ I    L    W+  L  E + 
Subjt:  TTCPITFVADRQKGLTVSIAGIFKDSFHGYCLRYLTEQLIRDLKGQFSHEVKRLIVEDFYAAAYA--PKPENFQRCVESIKSISLEAYNWI--LQSEPQN

Query:  WANAFFEGARYNHMTSNFGELFYSWVSDAHELPITQMVDAIR--VKIMELIYTR--RADSDQW-----LTRLTPSMEEKLEKEGHKV---HSLQVLLSA-
        WA  F  G  +  ++        + + D +  P T + + +     ++E  Y    +AD D W     L   +P  ++ L    H++     L+VL +A 
Subjt:  WANAFFEGARYNHMTSNFGELFYSWVSDAHELPITQMVDAIR--VKIMELIYTR--RADSDQW-----LTRLTPSMEEKLEKEGHKV---HSLQVLLSA-

Query:  ---------GSTFEVRGDSIEVVDVDHWD-------CSCKGWQLTGLPCSHAIAVLGCLG
                 G+T+ V+    E   +  WD       CSC+ ++  G  C HAI VL   G
Subjt:  ---------GSTFEVRGDSIEVVDVDHWD-------CSCKGWQLTGLPCSHAIAVLGCLG


Sequences
CDS sequence
ATGGATCAATCTCAATATGGTGCTCCTTCTGATGCTCCTGCCCCTTCTGATGGTGCTATTCATTCTGGTGTTTTGGTTCCAACTCCTTCTCTACGACGCAGCACCAGATC
AATTCAGCGCCCTTCTTATTTACGTGACTACCATTGTAACTTATTGCTTAATGCTGGTTTATCTTCCTCCTCTACTAACAAGCCAAAATATTCCTTGGATCGTTATGTTT
CTTACTCCAAGTTGTCTCATTCCCATCAAATTTTCCCCCTGAATGATAATAATACTCATAGGAAGAGGAGATCCGGCGCCGGTACGAGGGCTGAGGCACGGCGTCGCTGG
TGGTGCTGGTGGTGCTGGTGGTGGGTGGCACTGCCAGTTTCGTATCGCATTCCATTACCACTGCCTGGGGGAGGAACGCTGCCGCTTACGCAAACCCATCCGCCACCGTT
CCTTCTTTTCCACACTAATTTTGCTGTGGGGGCTCAATACAAACTGGGAATTCGGTCAAACACTCTCTCCTCCTCGGATTCTGATCTCAATTACGGCGGTGAATTTGAGA
CTGGTAGAGATGGTATGCTTTTATACCATGGAGGAGATGCCCATGCTATTGACGTGGATGATAAAATGAAGTTTAACGAGTTCAAGCTGGAAGTAGCAGAAATGTTTAAT
TGTAACGTGGACACTATGTCGATCAAATACTTCCTACCTGGCAACAGGAAGACTCTCATTACTGTCTCCAATGACAAGGATCTAAAGCGCATGATAAAGTTTCATGGAGA
TTCTGTCACTGTTGATATTTATGTTATCATGGAAGATGTTGTGGCTCCAGACGTCTCAAATTTTCCTGCCAGTAGGTCAAGCAGAACGACTTTGTCAGAAACAGTGGTAC
CTGTTGTTGGTACCCCTCTTACTATTATCCATGGTATTGGGGATGATAATACCCAGCCTGATATCCCACTCGATGGTGCGCTTGATGTTGTGGATGACACAAACCCTATA
GTTACTCACATTGATATAGCAGGTGACATTACACCAATTCTTCCTCTTCTTGGCCCAAATGATGAGAAGCATGGCAAAGGTGCACAGCAGTGGCAGAATACCATTACTGG
TGTCGGTCAAAGATTCAGCAGCGTTCATGAGTTTCGTGAATCACTCCGTAAATATGCCATTGCACATCAATTTGCATTCAGGTACAAGAAAAATGATAGTCATCGGGTGA
CTGTTAAATGCAAGGCTGAAGGTTGCCCATGGAGGATTCATGCATCAAGATTGTCGACCACTCAATTAATATGTATTAAGAAGATGAATCCCACACATACATGTGAAGGG
GCAGTTACGACTACAGGCCACCAGGCTACAAGGAGTTGGGTGGCCAGTATTATTAAGGAGAAATTAAAAGTTTTCCCAAATTACAAGCCAAAGGATATTGTCAATGACAT
CAAACAGGAATATGGAATACAATTAAACTACTTCCAGGCCTGGCGTGGGAAAGAAATAGCAAAGGAGCAGCTTCAGGGTTCATATAAAGAAGCATATAATCAGTTACCTT
TTTTGTGTGAGAAAATAATGGAGACTAATCCAGGTAGTCTTGCCACCTGCGACACCAAAGAAGACTCGAGTTTTCACCGTCTCTTTGTCTCATTCCATGCCTCGTTATCT
GGTTTCCAACAGGGTTGCCGTCCTCTTATTTTCCTTGACAGCATTCCTTTAAAGTCAAAATATCAAGGAACATTATTGGCTGCTACAGCTGCTGATGGAGATGATGGTTT
TTTTCCTGTTGCTTTTTCTGTAGTAGACGCAGAAAGTGATGATAACTGGGGCTGGTTTCTTTTACAATTAAAATCAGCGCTGTCAACAACTTGTCCTATAACATTTGTGG
CAGATAGACAGAAAGGTTTAACTGTTTCCATTGCTGGTATATTCAAGGACTCGTTTCATGGCTACTGCCTAAGATACTTGACTGAACAACTTATTAGAGACTTGAAAGGG
CAATTTTCTCATGAGGTGAAGCGGCTTATAGTTGAGGACTTCTATGCTGCTGCTTATGCACCTAAACCGGAAAATTTTCAGAGGTGTGTTGAAAGCATTAAAAGCATTTC
ACTAGAGGCTTACAATTGGATCCTACAAAGTGAACCTCAGAATTGGGCAAACGCATTCTTTGAGGGTGCTAGGTATAACCACATGACATCAAACTTTGGAGAGCTCTTCT
ACAGTTGGGTATCAGATGCACATGAATTGCCCATCACACAGATGGTTGATGCGATTAGGGTTAAGATAATGGAGTTGATCTATACACGGCGAGCAGATTCTGACCAGTGG
TTGACAAGGCTAACCCCATCCATGGAGGAAAAGTTGGAAAAGGAAGGCCACAAGGTTCATAGCCTTCAAGTGCTATTATCTGCGGGTAGCACATTTGAAGTTCGAGGTGA
CTCCATCGAAGTTGTTGATGTTGATCACTGGGATTGTTCTTGTAAAGGGTGGCAGCTTACTGGATTACCATGCAGTCATGCAATTGCAGTCCTTGGTTGTCTTGGCCGAA
TCCCTTACGATTTTTGCTCCCGATATCTATCGACTGAAAGCTACAGATTAACATATTCGGAGTCAGTACATCCTGTTCCCCATGTTGACGCGCCCATACAGAAAGGTTCT
CTACAGGCCTCCGTAACTGTAACTCCTCCTCCTACGCGTCGTCCACCTGGCCGACCTACATCAAAGCGATATGGATCCCCAGAGGTGATGAAACGTCAACTTCAATGCAG
CAGATGCAAGGGGCTCGGACACAACAAGTCAACCTGCAAAGAGTTCCTCCAGAGTGTTTGA
mRNA sequence
ATGGATCAATCTCAATATGGTGCTCCTTCTGATGCTCCTGCCCCTTCTGATGGTGCTATTCATTCTGGTGTTTTGGTTCCAACTCCTTCTCTACGACGCAGCACCAGATC
AATTCAGCGCCCTTCTTATTTACGTGACTACCATTGTAACTTATTGCTTAATGCTGGTTTATCTTCCTCCTCTACTAACAAGCCAAAATATTCCTTGGATCGTTATGTTT
CTTACTCCAAGTTGTCTCATTCCCATCAAATTTTCCCCCTGAATGATAATAATACTCATAGGAAGAGGAGATCCGGCGCCGGTACGAGGGCTGAGGCACGGCGTCGCTGG
TGGTGCTGGTGGTGCTGGTGGTGGGTGGCACTGCCAGTTTCGTATCGCATTCCATTACCACTGCCTGGGGGAGGAACGCTGCCGCTTACGCAAACCCATCCGCCACCGTT
CCTTCTTTTCCACACTAATTTTGCTGTGGGGGCTCAATACAAACTGGGAATTCGGTCAAACACTCTCTCCTCCTCGGATTCTGATCTCAATTACGGCGGTGAATTTGAGA
CTGGTAGAGATGGTATGCTTTTATACCATGGAGGAGATGCCCATGCTATTGACGTGGATGATAAAATGAAGTTTAACGAGTTCAAGCTGGAAGTAGCAGAAATGTTTAAT
TGTAACGTGGACACTATGTCGATCAAATACTTCCTACCTGGCAACAGGAAGACTCTCATTACTGTCTCCAATGACAAGGATCTAAAGCGCATGATAAAGTTTCATGGAGA
TTCTGTCACTGTTGATATTTATGTTATCATGGAAGATGTTGTGGCTCCAGACGTCTCAAATTTTCCTGCCAGTAGGTCAAGCAGAACGACTTTGTCAGAAACAGTGGTAC
CTGTTGTTGGTACCCCTCTTACTATTATCCATGGTATTGGGGATGATAATACCCAGCCTGATATCCCACTCGATGGTGCGCTTGATGTTGTGGATGACACAAACCCTATA
GTTACTCACATTGATATAGCAGGTGACATTACACCAATTCTTCCTCTTCTTGGCCCAAATGATGAGAAGCATGGCAAAGGTGCACAGCAGTGGCAGAATACCATTACTGG
TGTCGGTCAAAGATTCAGCAGCGTTCATGAGTTTCGTGAATCACTCCGTAAATATGCCATTGCACATCAATTTGCATTCAGGTACAAGAAAAATGATAGTCATCGGGTGA
CTGTTAAATGCAAGGCTGAAGGTTGCCCATGGAGGATTCATGCATCAAGATTGTCGACCACTCAATTAATATGTATTAAGAAGATGAATCCCACACATACATGTGAAGGG
GCAGTTACGACTACAGGCCACCAGGCTACAAGGAGTTGGGTGGCCAGTATTATTAAGGAGAAATTAAAAGTTTTCCCAAATTACAAGCCAAAGGATATTGTCAATGACAT
CAAACAGGAATATGGAATACAATTAAACTACTTCCAGGCCTGGCGTGGGAAAGAAATAGCAAAGGAGCAGCTTCAGGGTTCATATAAAGAAGCATATAATCAGTTACCTT
TTTTGTGTGAGAAAATAATGGAGACTAATCCAGGTAGTCTTGCCACCTGCGACACCAAAGAAGACTCGAGTTTTCACCGTCTCTTTGTCTCATTCCATGCCTCGTTATCT
GGTTTCCAACAGGGTTGCCGTCCTCTTATTTTCCTTGACAGCATTCCTTTAAAGTCAAAATATCAAGGAACATTATTGGCTGCTACAGCTGCTGATGGAGATGATGGTTT
TTTTCCTGTTGCTTTTTCTGTAGTAGACGCAGAAAGTGATGATAACTGGGGCTGGTTTCTTTTACAATTAAAATCAGCGCTGTCAACAACTTGTCCTATAACATTTGTGG
CAGATAGACAGAAAGGTTTAACTGTTTCCATTGCTGGTATATTCAAGGACTCGTTTCATGGCTACTGCCTAAGATACTTGACTGAACAACTTATTAGAGACTTGAAAGGG
CAATTTTCTCATGAGGTGAAGCGGCTTATAGTTGAGGACTTCTATGCTGCTGCTTATGCACCTAAACCGGAAAATTTTCAGAGGTGTGTTGAAAGCATTAAAAGCATTTC
ACTAGAGGCTTACAATTGGATCCTACAAAGTGAACCTCAGAATTGGGCAAACGCATTCTTTGAGGGTGCTAGGTATAACCACATGACATCAAACTTTGGAGAGCTCTTCT
ACAGTTGGGTATCAGATGCACATGAATTGCCCATCACACAGATGGTTGATGCGATTAGGGTTAAGATAATGGAGTTGATCTATACACGGCGAGCAGATTCTGACCAGTGG
TTGACAAGGCTAACCCCATCCATGGAGGAAAAGTTGGAAAAGGAAGGCCACAAGGTTCATAGCCTTCAAGTGCTATTATCTGCGGGTAGCACATTTGAAGTTCGAGGTGA
CTCCATCGAAGTTGTTGATGTTGATCACTGGGATTGTTCTTGTAAAGGGTGGCAGCTTACTGGATTACCATGCAGTCATGCAATTGCAGTCCTTGGTTGTCTTGGCCGAA
TCCCTTACGATTTTTGCTCCCGATATCTATCGACTGAAAGCTACAGATTAACATATTCGGAGTCAGTACATCCTGTTCCCCATGTTGACGCGCCCATACAGAAAGGTTCT
CTACAGGCCTCCGTAACTGTAACTCCTCCTCCTACGCGTCGTCCACCTGGCCGACCTACATCAAAGCGATATGGATCCCCAGAGGTGATGAAACGTCAACTTCAATGCAG
CAGATGCAAGGGGCTCGGACACAACAAGTCAACCTGCAAAGAGTTCCTCCAGAGTGTTTGA
Protein sequence
MDQSQYGAPSDAPAPSDGAIHSGVLVPTPSLRRSTRSIQRPSYLRDYHCNLLLNAGLSSSSTNKPKYSLDRYVSYSKLSHSHQIFPLNDNNTHRKRRSGAGTRAEARRRW
WCWWCWWWVALPVSYRIPLPLPGGGTLPLTQTHPPPFLLFHTNFAVGAQYKLGIRSNTLSSSDSDLNYGGEFETGRDGMLLYHGGDAHAIDVDDKMKFNEFKLEVAEMFN
CNVDTMSIKYFLPGNRKTLITVSNDKDLKRMIKFHGDSVTVDIYVIMEDVVAPDVSNFPASRSSRTTLSETVVPVVGTPLTIIHGIGDDNTQPDIPLDGALDVVDDTNPI
VTHIDIAGDITPILPLLGPNDEKHGKGAQQWQNTITGVGQRFSSVHEFRESLRKYAIAHQFAFRYKKNDSHRVTVKCKAEGCPWRIHASRLSTTQLICIKKMNPTHTCEG
AVTTTGHQATRSWVASIIKEKLKVFPNYKPKDIVNDIKQEYGIQLNYFQAWRGKEIAKEQLQGSYKEAYNQLPFLCEKIMETNPGSLATCDTKEDSSFHRLFVSFHASLS
GFQQGCRPLIFLDSIPLKSKYQGTLLAATAADGDDGFFPVAFSVVDAESDDNWGWFLLQLKSALSTTCPITFVADRQKGLTVSIAGIFKDSFHGYCLRYLTEQLIRDLKG
QFSHEVKRLIVEDFYAAAYAPKPENFQRCVESIKSISLEAYNWILQSEPQNWANAFFEGARYNHMTSNFGELFYSWVSDAHELPITQMVDAIRVKIMELIYTRRADSDQW
LTRLTPSMEEKLEKEGHKVHSLQVLLSAGSTFEVRGDSIEVVDVDHWDCSCKGWQLTGLPCSHAIAVLGCLGRIPYDFCSRYLSTESYRLTYSESVHPVPHVDAPIQKGS
LQASVTVTPPPTRRPPGRPTSKRYGSPEVMKRQLQCSRCKGLGHNKSTCKEFLQSV