; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc05g22210 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc05g22210
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr5:15942733..15956658
RNA-Seq ExpressionMoc05g22210
SyntenyMoc05g22210
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_017233063.1 PREDICTED: uncharacterized protein LOC108207110 [Daucus carota subsp. sativus]1.3e-14843.32Show/hide
Query:  DPEIERTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSMADIP--PRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAATAFQNFDSG
        DPEIERT  + RK QR        K ++ +++ +  V +      DIP  PR                         + D++D A+R+YAA  F+  +SG
Subjt:  DPEIERTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSMADIP--PRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAATAFQNFDSG

Query:  IVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPGITDDALRLTLFPFSLKDQARAWLNAFPPGSITTWGSLVEKFLTKFFP
        I+ P      FELKP+MFQMLQTIG F G   EDPH HL+ F++I+++F+  G+ +DALRL LFP+S++D+AR WLN+ P GS+TTW  L EKFL+K+FP
Subjt:  IVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPGITDDALRLTLFPFSLKDQARAWLNAFPPGSITTWGSLVEKFLTKFFP

Query:  PTRHADIREEIISFRQYDREPVHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAA
        P  +A +R EI SF+Q D E +++AWERFKEL+RKCP+HG+  CIQ+E F+ GL+  TKM+++ +ANGA   K++N+  +IL  +A+ N  W S R++  
Subjt:  PTRHADIREEIISFRQYDREPVHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAA

Query:  PKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNPYSN
         K    AG+  +D  TSM+ ++ +M   LK +++G              +    + + Q  ++ C FC E H YD+CP NP SVFY+  GN     PYSN
Subjt:  PKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNPYSN

Query:  TYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPPQQQYNQRTQTPVQNNNSNLENMMKEYM-------ARTDAVIQSQAASMRNFETQ
        TYN  WR HPNFSW  QG +SG + G  + N   Y P   Q  P                 +++LENM+KEY+       ++T+A++QSQAAS+RN E Q
Subjt:  TYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPPQQQYNQRTQTPVQNNNSNLENMMKEYM-------ARTDAVIQSQAASMRNFETQ

Query:  LGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSG---------LAYDGPTMPTTDVQIPSTKPIVKIPENPTTPEKENIRKGNEDTPSVPPQVQP
        +GQLANEL+NRP G+ P  TE PK  G E CKA+TL+SG           YD    P+ + +IP  K            E EN      D  S P   + 
Subjt:  LGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSG---------LAYDGPTMPTTDVQIPSTKPIVKIPENPTTPEKENIRKGNEDTPSVPPQVQP

Query:  S-FTPTPPFPQRLVKKNNDSQFRKFLDILKQLHINIPLVDALEQMPNYAKFLKDIVSRKKKIGEHELVAMTKCSSEAVGSPLPMKCNDPGSFTIPCSIGG
        S   P PPFPQR  K+  + QF+KFLD+LKQLHINIPLV+ALEQMPNY KF+KDI+++K+++GE E VA+TK  S  +   LP K  DPGSFTIPC+IG 
Subjt:  S-FTPTPPFPQRLVKKNNDSQFRKFLDILKQLHINIPLVDALEQMPNYAKFLKDIVSRKKKIGEHELVAMTKCSSEAVGSPLPMKCNDPGSFTIPCSIGG

Query:  KNLG
           G
Subjt:  KNLG

XP_022142953.1 uncharacterized protein LOC111012947 [Momordica charantia]1.4e-17150.14Show/hide
Query:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEK-QKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAA
        MST S+LLP DPEIE+T ++ R+EQRL+KQ  K QKE+E E                              +    A N   N I +AD RD AMR+YAA
Subjt:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEK-QKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAA

Query:  TAFQNFDSGIVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPGITDDALRLTLFPFSLKDQARAWLNAFPPGSITTWGSLV
           ++ +S + N  PA A FE KPMM QML  IG F G EHEDP  HLKSFI++AN FRLPGI+DDALRLTLFPFSL  QA AWLNAFP  +I T   +V
Subjt:  TAFQNFDSGIVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPGITDDALRLTLFPFSLKDQARAWLNAFPPGSITTWGSLV

Query:  EKFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNEL
        +KFL K+FPPTR+AD+REEIISFRQ + E V+ AWERFK+LIR CPN G+PAC+QIEHFFR  D PT MMLN AANG FT K+FNEIV+IL+ L+ HN+ 
Subjt:  EKFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNEL

Query:  WCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGN
        WCS++ R   K+ DPA VLALD  TSMQK++ T+ Q LK M    KN  A  + P  ++   P+PV Q+ +  C                         N
Subjt:  WCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGN

Query:  NRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVP---PTQQYIPPPQQQYNQRTQ--TPVQNNNSNLENMMKEY-----------MAR
         + FNPYSN YNPGW+ HPNFSW GQG SSG   GQ+QQ KQ Y P   P     PP  QQYNQ+     P Q N SN+E +MKE+           M R
Subjt:  NRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVP---PTQQYIPPPQQQYNQRTQ--TPVQNNNSNLENMMKEY-----------MAR

Query:  TDAVIQSQA----------ASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTKPIVKIPENPTTPEK
        TDA I+              ++RN E QLGQLANE++ RPQGS P  TE P+R              +    P+    D Q+        +P+    PE 
Subjt:  TDAVIQSQA----------ASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTKPIVKIPENPTTPEK

Query:  ENIRKGNEDTPSVPPQVQPSFTPTPPFPQRLVKKNNDSQFRKFLDILKQLHINIPLVDALEQMPNYAKFLKDIVSRKKKIGEHELVAMTKCSSEAVGSPL
                 + SV PQV    +P PPFPQRLV+KN D+ FRKFLDILKQLHINIP V+ALEQMP YAKFLKDI++RKKK+GE+E VA+T+CSS    S  
Subjt:  ENIRKGNEDTPSVPPQVQPSFTPTPPFPQRLVKKNNDSQFRKFLDILKQLHINIPLVDALEQMPNYAKFLKDIVSRKKKIGEHELVAMTKCSSEAVGSPL

Query:  PMKCNDPGSFTIPCSIGGKNLG
        P K  DPGSFTI C IGGK++G
Subjt:  PMKCNDPGSFTIPCSIGGKNLG

XP_022158408.1 uncharacterized protein LOC111024897, partial [Momordica charantia]2.5e-29889.49Show/hide
Query:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAAT
        MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLE QKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFN+IQMADNRDVAMREYAAT
Subjt:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAAT

Query:  AFQNFDSGIVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPGITDDALRLTLFPFSLKDQARAWLNAFPPGSITTWGSLVE
        AFQNFDSGIVNPIPAH NFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPGITDDA  LTLFPFSLKDQAR  LNAFP GSITTWGSLVE
Subjt:  AFQNFDSGIVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPGITDDALRLTLFPFSLKDQARAWLNAFPPGSITTWGSLVE

Query:  KFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELW
        KFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKC NHGLPAC QIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELW
Subjt:  KFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELW

Query:  CSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNN
        CSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLAT IQPVQSDYCT APVCQVNDLIC                           
Subjt:  CSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNN

Query:  RNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPPQQQYNQRTQT-PVQNNNSNLENMMKEYMARTDAVIQSQAASMRNFE
                     WRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQ+IPPPQQQYNQRTQT P+QNNNSNLENMMKEYMARTDAVIQSQAASMRNF 
Subjt:  RNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPPQQQYNQRTQT-PVQNNNSNLENMMKEYMARTDAVIQSQAASMRNFE

Query:  TQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTKPIVKIPENPTTPEKENIRKGNEDTPSVPPQ
        TQLG LANELKNRPQGSFPGHTELP+REGKEQCKAVTLRSGL YDGPTMPTTDVQIPSTKP VKIPENPTTPEKENIRKGN+DT SVPPQ
Subjt:  TQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTKPIVKIPENPTTPEKENIRKGNEDTPSVPPQ

XP_022159235.1 uncharacterized protein LOC111025653 [Momordica charantia]5.3e-18754.49Show/hide
Query:  ARNDEFNHIQMADNRDVAMREYAATAFQNFDSGIVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPGITDDALRLTLFPFS
        A N   N I +AD RD AMR+YAA   ++ +S ++N  PA A FE KPMM QML  IG FGG EHEDP  HLKSFI++AN FRLPGI+DDALRLTLFPFS
Subjt:  ARNDEFNHIQMADNRDVAMREYAATAFQNFDSGIVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPGITDDALRLTLFPFS

Query:  LKDQARAWLNAFPPGSITTWGSLVEKFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAAN
        +  QA AWLNAFP  +ITTW  +V+KFL K+FPPTR+AD+REEIISFRQ + E V+ AWERFK+LI  CPN G+PAC+QIEHFFRG D  TKMMLN AAN
Subjt:  LKDQARAWLNAFPPGSITTWGSLVEKFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAAN

Query:  GAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSF
        G FT K+FNEIV+IL+ L+ HN  WCS++SR   K+ DPAGVLALD  TSMQK++ T+ Q LK M    KN          ++   P+PV Q+ +  C +
Subjt:  GAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSF

Query:  CSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVP---PTQQYIPPPQQQYNQRTQ--TPVQNNN
        C + H  +NCP NP+S++YVG  N + FNPYSNTYNPGW+ HPNFSW GQG S+    G +QQ K+ Y P   P     PP   QYNQ+     P Q N 
Subjt:  CSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVP---PTQQYIPPPQQQYNQRTQ--TPVQNNN

Query:  SNLENMMKEYMARTDAVIQS---------------------QAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMP
        SN+E +MKE + + DA ++                         ++R  E QLGQL NE++ RPQGS P  TE P+R GKE C ++  RSGL Y+GP MP
Subjt:  SNLENMMKEYMARTDAVIQS---------------------QAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMP

Query:  TTDVQIPSTKPIVKIPENPTTPEKENIRKGNEDTPSVPPQVQPSFTPTPPFPQRLVKKNNDSQFRKFLDILKQLHINIPLVDALEQMPNYAKFLKDIVSR
              PS        E  T    + I +     P V PQV  S  P PPFPQRLV+KN D+ FRKFLDILKQLHINIP V+ALEQMP YAKF+KDI++R
Subjt:  TTDVQIPSTKPIVKIPENPTTPEKENIRKGNEDTPSVPPQVQPSFTPTPPFPQRLVKKNNDSQFRKFLDILKQLHINIPLVDALEQMPNYAKFLKDIVSR

Query:  KKKIGEHELVAMTKCSSEAVGSPLPMKCNDPGSFTIPCSIGGKNLG
        KKK+GE+E VA+T+CSS    S +P K  DPGSFTIPC IGGK++G
Subjt:  KKKIGEHELVAMTKCSSEAVGSPLPMKCNDPGSFTIPCSIGGKNLG

XP_022159345.1 uncharacterized protein LOC111025764 [Momordica charantia]3.3e-17395.06Show/hide
Query:  MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQ
        MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIA+SMQKE VTMNQRLKEM LG+KNPLATPIQPVQSDYCTPAPVCQ
Subjt:  MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQ

Query:  VNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPPQQQYNQRTQT-PV
        VNDLICSFCSENHIYD CPHNPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSW GQGGS GFNQGQSQQNKQPYVPPTQQYIPPPQQ+YNQRTQT PV
Subjt:  VNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPPQQQYNQRTQT-PV

Query:  QNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTKPIVKIPE
        QNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFP HTELPKREGKEQCKAVTLRSGLAYD PTMPT DVQIPST P VKIPE
Subjt:  QNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTKPIVKIPE

Query:  NPTTPEKENIRKGNEDTPSVPPQV
        NPTTPEK NIRKGNEDTPSVPPQ+
Subjt:  NPTTPEKENIRKGNEDTPSVPPQV

TrEMBL top hitse value%identityAlignment
A0A6J1CPJ3 uncharacterized protein LOC1110129471.5e-17150.14Show/hide
Query:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEK-QKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAA
        MST S+LLP DPEIE+T ++ R+EQRL+KQ  K QKE+E E                              +    A N   N I +AD RD AMR+YAA
Subjt:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEK-QKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAA

Query:  TAFQNFDSGIVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPGITDDALRLTLFPFSLKDQARAWLNAFPPGSITTWGSLV
           ++ +S + N  PA A FE KPMM QML  IG F G EHEDP  HLKSFI++AN FRLPGI+DDALRLTLFPFSL  QA AWLNAFP  +I T   +V
Subjt:  TAFQNFDSGIVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPGITDDALRLTLFPFSLKDQARAWLNAFPPGSITTWGSLV

Query:  EKFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNEL
        +KFL K+FPPTR+AD+REEIISFRQ + E V+ AWERFK+LIR CPN G+PAC+QIEHFFR  D PT MMLN AANG FT K+FNEIV+IL+ L+ HN+ 
Subjt:  EKFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNEL

Query:  WCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGN
        WCS++ R   K+ DPA VLALD  TSMQK++ T+ Q LK M    KN  A  + P  ++   P+PV Q+ +  C                         N
Subjt:  WCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGN

Query:  NRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVP---PTQQYIPPPQQQYNQRTQ--TPVQNNNSNLENMMKEY-----------MAR
         + FNPYSN YNPGW+ HPNFSW GQG SSG   GQ+QQ KQ Y P   P     PP  QQYNQ+     P Q N SN+E +MKE+           M R
Subjt:  NRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVP---PTQQYIPPPQQQYNQRTQ--TPVQNNNSNLENMMKEY-----------MAR

Query:  TDAVIQSQA----------ASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTKPIVKIPENPTTPEK
        TDA I+              ++RN E QLGQLANE++ RPQGS P  TE P+R              +    P+    D Q+        +P+    PE 
Subjt:  TDAVIQSQA----------ASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTKPIVKIPENPTTPEK

Query:  ENIRKGNEDTPSVPPQVQPSFTPTPPFPQRLVKKNNDSQFRKFLDILKQLHINIPLVDALEQMPNYAKFLKDIVSRKKKIGEHELVAMTKCSSEAVGSPL
                 + SV PQV    +P PPFPQRLV+KN D+ FRKFLDILKQLHINIP V+ALEQMP YAKFLKDI++RKKK+GE+E VA+T+CSS    S  
Subjt:  ENIRKGNEDTPSVPPQVQPSFTPTPPFPQRLVKKNNDSQFRKFLDILKQLHINIPLVDALEQMPNYAKFLKDIVSRKKKIGEHELVAMTKCSSEAVGSPL

Query:  PMKCNDPGSFTIPCSIGGKNLG
        P K  DPGSFTI C IGGK++G
Subjt:  PMKCNDPGSFTIPCSIGGKNLG

A0A6J1DSZ5 uncharacterized protein LOC1110241071.8e-13258.65Show/hide
Query:  ARNDEFNHIQMADNRDVAMREYAATAFQNFDSGIVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPGITDDALRLTLFPFS
        A N   N I +AD +D AMR+YAAT  ++ +S ++NP+PA A FE KPMM QML  I  FGG EHEDP  HLKSFI++AN  RLPGI+DDALRLTLFPFS
Subjt:  ARNDEFNHIQMADNRDVAMREYAATAFQNFDSGIVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPGITDDALRLTLFPFS

Query:  LKDQARAWLNAFPPGSITTWGSLVEKFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAAN
        L  QA AWLNAFP G+ITTW  +V+KFL K+FPPTR+AD+REEIISFRQ + E V+ AWE FK+LIR CPN G+PAC+QIEHFFRG D PTKMMLN AAN
Subjt:  LKDQARAWLNAFPPGSITTWGSLVEKFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAAN

Query:  GAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSF
        G FT K+FNEIV+IL+ L+ HN+ W S+RSR   K+ DPAGVLALD  TSMQK++ T+ Q LK M    KN  A    P  ++   P+PV Q+ +  C +
Subjt:  GAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSF

Query:  CSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVP---PTQQYIPPPQQQYNQRTQ--TPVQNNN
        C + H  +NCP NP+S++YVG  N + FNPYSNTY+PGW+ HPNFSW GQG SS  + GQ+QQ KQ Y P   P     PP   QYNQ+     PVQ N 
Subjt:  CSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVP---PTQQYIPPPQQQYNQRTQ--TPVQNNN

Query:  SNLENMMKEYMARTDA
        SN+E +MKE++ + DA
Subjt:  SNLENMMKEYMARTDA

A0A6J1DW02 uncharacterized protein LOC1110248971.2e-29889.49Show/hide
Query:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAAT
        MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLE QKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFN+IQMADNRDVAMREYAAT
Subjt:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAAT

Query:  AFQNFDSGIVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPGITDDALRLTLFPFSLKDQARAWLNAFPPGSITTWGSLVE
        AFQNFDSGIVNPIPAH NFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPGITDDA  LTLFPFSLKDQAR  LNAFP GSITTWGSLVE
Subjt:  AFQNFDSGIVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPGITDDALRLTLFPFSLKDQARAWLNAFPPGSITTWGSLVE

Query:  KFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELW
        KFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKC NHGLPAC QIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELW
Subjt:  KFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELW

Query:  CSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNN
        CSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLAT IQPVQSDYCT APVCQVNDLIC                           
Subjt:  CSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNN

Query:  RNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPPQQQYNQRTQT-PVQNNNSNLENMMKEYMARTDAVIQSQAASMRNFE
                     WRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQ+IPPPQQQYNQRTQT P+QNNNSNLENMMKEYMARTDAVIQSQAASMRNF 
Subjt:  RNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPPQQQYNQRTQT-PVQNNNSNLENMMKEYMARTDAVIQSQAASMRNFE

Query:  TQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTKPIVKIPENPTTPEKENIRKGNEDTPSVPPQ
        TQLG LANELKNRPQGSFPGHTELP+REGKEQCKAVTLRSGL YDGPTMPTTDVQIPSTKP VKIPENPTTPEKENIRKGN+DT SVPPQ
Subjt:  TQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTKPIVKIPENPTTPEKENIRKGNEDTPSVPPQ

A0A6J1DY39 uncharacterized protein LOC1110256532.6e-18754.49Show/hide
Query:  ARNDEFNHIQMADNRDVAMREYAATAFQNFDSGIVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPGITDDALRLTLFPFS
        A N   N I +AD RD AMR+YAA   ++ +S ++N  PA A FE KPMM QML  IG FGG EHEDP  HLKSFI++AN FRLPGI+DDALRLTLFPFS
Subjt:  ARNDEFNHIQMADNRDVAMREYAATAFQNFDSGIVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPGITDDALRLTLFPFS

Query:  LKDQARAWLNAFPPGSITTWGSLVEKFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAAN
        +  QA AWLNAFP  +ITTW  +V+KFL K+FPPTR+AD+REEIISFRQ + E V+ AWERFK+LI  CPN G+PAC+QIEHFFRG D  TKMMLN AAN
Subjt:  LKDQARAWLNAFPPGSITTWGSLVEKFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAAN

Query:  GAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSF
        G FT K+FNEIV+IL+ L+ HN  WCS++SR   K+ DPAGVLALD  TSMQK++ T+ Q LK M    KN          ++   P+PV Q+ +  C +
Subjt:  GAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSF

Query:  CSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVP---PTQQYIPPPQQQYNQRTQ--TPVQNNN
        C + H  +NCP NP+S++YVG  N + FNPYSNTYNPGW+ HPNFSW GQG S+    G +QQ K+ Y P   P     PP   QYNQ+     P Q N 
Subjt:  CSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVP---PTQQYIPPPQQQYNQRTQ--TPVQNNN

Query:  SNLENMMKEYMARTDAVIQS---------------------QAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMP
        SN+E +MKE + + DA ++                         ++R  E QLGQL NE++ RPQGS P  TE P+R GKE C ++  RSGL Y+GP MP
Subjt:  SNLENMMKEYMARTDAVIQS---------------------QAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMP

Query:  TTDVQIPSTKPIVKIPENPTTPEKENIRKGNEDTPSVPPQVQPSFTPTPPFPQRLVKKNNDSQFRKFLDILKQLHINIPLVDALEQMPNYAKFLKDIVSR
              PS        E  T    + I +     P V PQV  S  P PPFPQRLV+KN D+ FRKFLDILKQLHINIP V+ALEQMP YAKF+KDI++R
Subjt:  TTDVQIPSTKPIVKIPENPTTPEKENIRKGNEDTPSVPPQVQPSFTPTPPFPQRLVKKNNDSQFRKFLDILKQLHINIPLVDALEQMPNYAKFLKDIVSR

Query:  KKKIGEHELVAMTKCSSEAVGSPLPMKCNDPGSFTIPCSIGGKNLG
        KKK+GE+E VA+T+CSS    S +P K  DPGSFTIPC IGGK++G
Subjt:  KKKIGEHELVAMTKCSSEAVGSPLPMKCNDPGSFTIPCSIGGKNLG

A0A6J1DYG0 uncharacterized protein LOC1110257641.6e-17395.06Show/hide
Query:  MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQ
        MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIA+SMQKE VTMNQRLKEM LG+KNPLATPIQPVQSDYCTPAPVCQ
Subjt:  MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQ

Query:  VNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPPQQQYNQRTQT-PV
        VNDLICSFCSENHIYD CPHNPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSW GQGGS GFNQGQSQQNKQPYVPPTQQYIPPPQQ+YNQRTQT PV
Subjt:  VNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPPQQQYNQRTQT-PV

Query:  QNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTKPIVKIPE
        QNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFP HTELPKREGKEQCKAVTLRSGLAYD PTMPT DVQIPST P VKIPE
Subjt:  QNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTKPIVKIPE

Query:  NPTTPEKENIRKGNEDTPSVPPQV
        NPTTPEK NIRKGNEDTPSVPPQ+
Subjt:  NPTTPEKENIRKGNEDTPSVPPQV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGAGGGTGGGGCCCCTTGTTCAAGTCTCAAAGTCAGCATTTAAGGGAACACTCATCTACTCCCCTAAAGTCGAGGAGGAGTGAATTTCATCTTGTGAAGTTA
TGTTCCCAGCTCCCCACTCGGTCTCGTCCCCAAAATGGTAGGAATGTTGAGCCGACGACTCGGGCCACTCTCACCCATACAGATCAAAGGACGAGTCCTCACGGG
CAGGAGTCCATAACTCACTCAGGATTGAGAATGAGTTGTCTGGTCATCCTAAGAAATAGCAACCTATTAGTTAATGATGTTACATCTAAAGATTGCCTATTTCGT
GGTCCGGTTACCCTAGTCTCTGCAGCAACCAAAAGCGGTAGTGAAAGAGTAGAACTCAAATCCCAAGAAAAGTCAGGAATTGCGTCTGGTGCATTTTCCCAACAT
AGTGTGTTTTCCATGTTTTGTATCAAAAATAAGGACCATTTGGGAGTGCAAATTAATCAAAAGAAGCGAAAAGACGGAAAAACTTACCATGGAGGCGCCAGGCGC
CTGGGAAGCCTGCAGAAAACAGTGTTTCTTCCAACTTTGCCCTTAATGAAACGCGTCTTCCAATGCGTTTTGGTGGTTCCAACCAATGCATACGGTACGTTAAAG
TGCATGAGCACAAGATCTTTTCTTCTACCCCTTGACCCTGAGATTGAGCGGACCCTTCGAAAAACTAGAAAGGAGCAAAGACTTCGAAAACAACTAGAAAAGCAA
AAAGAGAGAGAAGGTGAAATCAGTCCTGAAAGTGAGGTAGAGAGTACAAGCACATCGATGGCAGATATTCCACCTCGTGATCCGGTTGATCCACCTGCTGTTAAC
GGTAATATGAGGGATCATGCAAGAAATGATGAATTCAACCATATCCAGATGGCGGACAACAGAGACGTGGCAATGCGAGAATATGCCGCCACGGCTTTTCAGAAC
TTTGATTCAGGGATAGTCAACCCTATTCCAGCCCACGCAAACTTTGAGCTTAAACCAATGATGTTCCAAATGTTGCAGACAATTGGACATTTTGGAGGGCAGGAA
CATGAGGATCCACATGATCATCTGAAATCATTCATTCAAATTGCAAATGCATTTCGATTACCTGGTATAACAGATGATGCTCTGAGACTAACACTTTTTCCATTT
TCTTTGAAGGACCAAGCTAGAGCATGGCTCAATGCATTTCCACCGGGATCCATCACCACATGGGGGTCGTTAGTGGAGAAGTTCTTAACAAAATTCTTTCCACCT
ACTCGCCACGCTGATATCAGAGAGGAGATCATCTCCTTTAGACAGTATGATCGTGAACCTGTTCACGAGGCGTGGGAGAGATTTAAAGAACTAATCAGGAAATGT
CCGAACCATGGCTTGCCGGCATGCATCCAGATAGAACATTTCTTTAGAGGCTTAGATCATCCTACTAAGATGATGCTAAACAATGCTGCCAACGGAGCCTTTACA
AAGAAGACATTCAACGAGATAGTTGACATCCTAAATGACTTAGCTTCACATAACGAACTATGGTGTTCGCAAAGATCTAGGGCAGCACCAAAGAAGCAAGATCCA
GCTGGAGTTTTGGCTCTGGACATTGCGACCTCGATGCAAAAAGAGATGGTTACAATGAACCAGAGGCTGAAAGAGATGGCATTGGGAATAAAAAATCCATTAGCC
ACGCCGATACAACCTGTGCAGTCGGATTATTGCACTCCTGCCCCTGTTTGCCAAGTCAACGATCTCATTTGTTCATTTTGCAGTGAAAACCATATTTATGATAAT
TGTCCACATAACCCTGCTTCTGTTTTTTATGTAGGACATGGGAACAATAGGAACTTTAACCCATATTCGAACACCTACAACCCAGGTTGGAGGCACCACCCTAAT
TTCTCATGGGGAGGACAAGGAGGTTCGAGCGGTTTTAATCAAGGGCAGAGCCAACAGAACAAGCAGCCCTATGTTCCACCTACACAGCAATACATCCCACCACCG
CAACAGCAGTATAATCAGAGAACACAGACTCCAGTTCAAAATAACAACTCAAATCTTGAGAATATGATGAAGGAGTACATGGCCCGAACCGACGCAGTGATACAA
TCCCAAGCGGCATCAATGAGGAATTTCGAGACTCAATTGGGACAGCTCGCCAATGAATTGAAGAATAGACCACAAGGTTCTTTTCCAGGCCATACTGAATTACCA
AAACGAGAAGGGAAAGAACAGTGCAAAGCTGTCACCCTCAGGAGTGGACTGGCATATGATGGACCAACAATGCCAACAACAGATGTACAGATTCCGTCCACTAAA
CCAATTGTAAAGATACCAGAAAATCCAACAACACCAGAAAAAGAAAATATTAGAAAAGGTAATGAGGACACCCCGAGTGTTCCTCCACAGGTACAGCCTTCTTTT
ACCCCTACTCCCCCGTTTCCACAAAGATTAGTTAAGAAAAATAATGATTCCCAGTTTAGAAAATTTTTAGATATTTTAAAACAACTGCACATAAATATACCATTA
GTAGATGCTCTAGAACAGATGCCAAATTATGCTAAGTTTTTGAAAGATATAGTTTCTAGGAAGAAAAAAATAGGAGAGCATGAACTGGTAGCCATGACAAAATGT
AGTAGTGAAGCTGTAGGCAGCCCGCTACCCATGAAATGTAATGATCCTGGCAGTTTTACCATTCCATGTTCCATAGGAGGAAAAAACTTAGGAGACTTTGAAGAG
TGCTCTGCTATAAATAGCTTGAATCCTATTATGTTTGATGAGTTTTATGACTTATTAGTTACAGAGATTGAAGAAGAGCTTGATAAGATAGCAGAAGGACCAGAA
GATGTGGCTAGTCCTATTGAAAAAATACAAAAAGAAGAATGCAAGTCGTTACTTCCGTCCATAGTGGAACCACCCACGTTGGAGCAGAAGCCATTGCCGTCGCAT
TTGAAATATGCGTATCTAGGGGATAACGACACTTTACCAGTTCGAGAAGTCGTGCAACATATCTACAACTTAAGGGCTTCATTGGATTTTGCGGTTTTACCTTCA
TGGCCTCCAGCGCTAGCTGCTATCCTTGGTCATCCATCTCCCAGCACTGACACTGATCCTAGTCCACAACCTCCGACTTCATGA
mRNA sequenceShow/hide mRNA sequence
ATGAGAGGGTGGGGCCCCTTGTTCAAGTCTCAAAGTCAGCATTTAAGGGAACACTCATCTACTCCCCTAAAGTCGAGGAGGAGTGAATTTCATCTTGTGAAGTTA
TGTTCCCAGCTCCCCACTCGGTCTCGTCCCCAAAATGGTAGGAATGTTGAGCCGACGACTCGGGCCACTCTCACCCATACAGATCAAAGGACGAGTCCTCACGGG
CAGGAGTCCATAACTCACTCAGGATTGAGAATGAGTTGTCTGGTCATCCTAAGAAATAGCAACCTATTAGTTAATGATGTTACATCTAAAGATTGCCTATTTCGT
GGTCCGGTTACCCTAGTCTCTGCAGCAACCAAAAGCGGTAGTGAAAGAGTAGAACTCAAATCCCAAGAAAAGTCAGGAATTGCGTCTGGTGCATTTTCCCAACAT
AGTGTGTTTTCCATGTTTTGTATCAAAAATAAGGACCATTTGGGAGTGCAAATTAATCAAAAGAAGCGAAAAGACGGAAAAACTTACCATGGAGGCGCCAGGCGC
CTGGGAAGCCTGCAGAAAACAGTGTTTCTTCCAACTTTGCCCTTAATGAAACGCGTCTTCCAATGCGTTTTGGTGGTTCCAACCAATGCATACGGTACGTTAAAG
TGCATGAGCACAAGATCTTTTCTTCTACCCCTTGACCCTGAGATTGAGCGGACCCTTCGAAAAACTAGAAAGGAGCAAAGACTTCGAAAACAACTAGAAAAGCAA
AAAGAGAGAGAAGGTGAAATCAGTCCTGAAAGTGAGGTAGAGAGTACAAGCACATCGATGGCAGATATTCCACCTCGTGATCCGGTTGATCCACCTGCTGTTAAC
GGTAATATGAGGGATCATGCAAGAAATGATGAATTCAACCATATCCAGATGGCGGACAACAGAGACGTGGCAATGCGAGAATATGCCGCCACGGCTTTTCAGAAC
TTTGATTCAGGGATAGTCAACCCTATTCCAGCCCACGCAAACTTTGAGCTTAAACCAATGATGTTCCAAATGTTGCAGACAATTGGACATTTTGGAGGGCAGGAA
CATGAGGATCCACATGATCATCTGAAATCATTCATTCAAATTGCAAATGCATTTCGATTACCTGGTATAACAGATGATGCTCTGAGACTAACACTTTTTCCATTT
TCTTTGAAGGACCAAGCTAGAGCATGGCTCAATGCATTTCCACCGGGATCCATCACCACATGGGGGTCGTTAGTGGAGAAGTTCTTAACAAAATTCTTTCCACCT
ACTCGCCACGCTGATATCAGAGAGGAGATCATCTCCTTTAGACAGTATGATCGTGAACCTGTTCACGAGGCGTGGGAGAGATTTAAAGAACTAATCAGGAAATGT
CCGAACCATGGCTTGCCGGCATGCATCCAGATAGAACATTTCTTTAGAGGCTTAGATCATCCTACTAAGATGATGCTAAACAATGCTGCCAACGGAGCCTTTACA
AAGAAGACATTCAACGAGATAGTTGACATCCTAAATGACTTAGCTTCACATAACGAACTATGGTGTTCGCAAAGATCTAGGGCAGCACCAAAGAAGCAAGATCCA
GCTGGAGTTTTGGCTCTGGACATTGCGACCTCGATGCAAAAAGAGATGGTTACAATGAACCAGAGGCTGAAAGAGATGGCATTGGGAATAAAAAATCCATTAGCC
ACGCCGATACAACCTGTGCAGTCGGATTATTGCACTCCTGCCCCTGTTTGCCAAGTCAACGATCTCATTTGTTCATTTTGCAGTGAAAACCATATTTATGATAAT
TGTCCACATAACCCTGCTTCTGTTTTTTATGTAGGACATGGGAACAATAGGAACTTTAACCCATATTCGAACACCTACAACCCAGGTTGGAGGCACCACCCTAAT
TTCTCATGGGGAGGACAAGGAGGTTCGAGCGGTTTTAATCAAGGGCAGAGCCAACAGAACAAGCAGCCCTATGTTCCACCTACACAGCAATACATCCCACCACCG
CAACAGCAGTATAATCAGAGAACACAGACTCCAGTTCAAAATAACAACTCAAATCTTGAGAATATGATGAAGGAGTACATGGCCCGAACCGACGCAGTGATACAA
TCCCAAGCGGCATCAATGAGGAATTTCGAGACTCAATTGGGACAGCTCGCCAATGAATTGAAGAATAGACCACAAGGTTCTTTTCCAGGCCATACTGAATTACCA
AAACGAGAAGGGAAAGAACAGTGCAAAGCTGTCACCCTCAGGAGTGGACTGGCATATGATGGACCAACAATGCCAACAACAGATGTACAGATTCCGTCCACTAAA
CCAATTGTAAAGATACCAGAAAATCCAACAACACCAGAAAAAGAAAATATTAGAAAAGGTAATGAGGACACCCCGAGTGTTCCTCCACAGGTACAGCCTTCTTTT
ACCCCTACTCCCCCGTTTCCACAAAGATTAGTTAAGAAAAATAATGATTCCCAGTTTAGAAAATTTTTAGATATTTTAAAACAACTGCACATAAATATACCATTA
GTAGATGCTCTAGAACAGATGCCAAATTATGCTAAGTTTTTGAAAGATATAGTTTCTAGGAAGAAAAAAATAGGAGAGCATGAACTGGTAGCCATGACAAAATGT
AGTAGTGAAGCTGTAGGCAGCCCGCTACCCATGAAATGTAATGATCCTGGCAGTTTTACCATTCCATGTTCCATAGGAGGAAAAAACTTAGGAGACTTTGAAGAG
TGCTCTGCTATAAATAGCTTGAATCCTATTATGTTTGATGAGTTTTATGACTTATTAGTTACAGAGATTGAAGAAGAGCTTGATAAGATAGCAGAAGGACCAGAA
GATGTGGCTAGTCCTATTGAAAAAATACAAAAAGAAGAATGCAAGTCGTTACTTCCGTCCATAGTGGAACCACCCACGTTGGAGCAGAAGCCATTGCCGTCGCAT
TTGAAATATGCGTATCTAGGGGATAACGACACTTTACCAGTTCGAGAAGTCGTGCAACATATCTACAACTTAAGGGCTTCATTGGATTTTGCGGTTTTACCTTCA
TGGCCTCCAGCGCTAGCTGCTATCCTTGGTCATCCATCTCCCAGCACTGACACTGATCCTAGTCCACAACCTCCGACTTCATGA
Protein sequenceShow/hide protein sequence
MRGWGPLFKSQSQHLREHSSTPLKSRRSEFHLVKLCSQLPTRSRPQNGRNVEPTTRATLTHTDQRTSPHGQESITHSGLRMSCLVILRNSNLLVNDVTSKDCLFR
GPVTLVSAATKSGSERVELKSQEKSGIASGAFSQHSVFSMFCIKNKDHLGVQINQKKRKDGKTYHGGARRLGSLQKTVFLPTLPLMKRVFQCVLVVPTNAYGTLK
CMSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAATAFQN
FDSGIVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPGITDDALRLTLFPFSLKDQARAWLNAFPPGSITTWGSLVEKFLTKFFPP
TRHADIREEIISFRQYDREPVHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDP
AGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNPGWRHHPN
FSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPPQQQYNQRTQTPVQNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELP
KREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTKPIVKIPENPTTPEKENIRKGNEDTPSVPPQVQPSFTPTPPFPQRLVKKNNDSQFRKFLDILKQLHINIPL
VDALEQMPNYAKFLKDIVSRKKKIGEHELVAMTKCSSEAVGSPLPMKCNDPGSFTIPCSIGGKNLGDFEECSAINSLNPIMFDEFYDLLVTEIEEELDKIAEGPE
DVASPIEKIQKEECKSLLPSIVEPPTLEQKPLPSHLKYAYLGDNDTLPVREVVQHIYNLRASLDFAVLPSWPPALAAILGHPSPSTDTDPSPQPPTS