; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc08g29450 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc08g29450
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr8:21146486..21153871
RNA-Seq ExpressionMoc08g29450
SyntenyMoc08g29450
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_017233063.1 PREDICTED: uncharacterized protein LOC108207110 [Daucus carota subsp. sativus]3.5e-14743.68Show/hide
Query:  DPEIERTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSMADIP--PRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMQEYAATAFQNFDSG
        DPEIERT  + RK QR        K ++ +++ +  V +      DIP  PR                         + D++D A+++YAA  F+  +SG
Subjt:  DPEIERTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSMADIP--PRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMQEYAATAFQNFDSG

Query:  IVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPDITDDALRLTLFPFSLKDQARAWLNAFPPGSITTWGSLVEKFLTKFFP
        I+ P      FELKP+MFQMLQTIG F G   EDPH HL+ F++I+++F+   + +DALRL LFP+S++D+AR WLN+ P GS+TTW  L EKFL+K+FP
Subjt:  IVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPDITDDALRLTLFPFSLKDQARAWLNAFPPGSITTWGSLVEKFLTKFFP

Query:  PTRHADIREEIISFRQYDREPVHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAA
        P  +A +R EI SF+Q D E +++AWERFKEL+RKCP+HG+  CIQ+E F+ GL+  TKM+++ +ANGA   K++N+  +IL  +A+ N  W S R++  
Subjt:  PTRHADIREEIISFRQYDREPVHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAA

Query:  PKKQDPAGVLALDIATSMKKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNPYSN
         K    AG+  +D  TSMK ++ +M   LK +++G              +    + + Q  ++ C FC E H YD+CP NP SVFY+  GN     PYSN
Subjt:  PKKQDPAGVLALDIATSMKKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNPYSN

Query:  TYNPGWRHHPNFSWGGQGCSSGFNQGQSQQNKQPYVPPTQQYIPPPQQQYNQRTQTPPVHNNNSNLENMMKEYM-------ARTDAVIQSQAASMRNFET
        TYN  WR HPNFSW  QG +SG + G  + N   Y P   Q  P                  +++LENM+KEY+       ++T+A++QSQAAS+RN E 
Subjt:  TYNPGWRHHPNFSWGGQGCSSGFNQGQSQQNKQPYVPPTQQYIPPPQQQYNQRTQTPPVHNNNSNLENMMKEYM-------ARTDAVIQSQAASMRNFET

Query:  QLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRKGNEDTPSVPPQVQPS-FTPTPP
        Q+GQLANEL+NRP G+ P  TE PK  G E CKA+TL+SG    G T+  TD +   +       E P   E EN      D  S P   + S   P PP
Subjt:  QLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRKGNEDTPSVPPQVQPS-FTPTPP

Query:  FPQRLVKKNNDSQFRKFLDILKQLHINIPLVDALEQMPNYAKFLKDIVSRKKKIGEHELVAMTKCSSEAVGSSLPMKCNDPGSFTIPCSIGGKNLG
        FPQR  K+  + QF+KFLD+LKQLHINIPLV+ALEQMPNY KF+KDI+++K+++GE E VA+TK  S  +   LP K  DPGSFTIPC+IG    G
Subjt:  FPQRLVKKNNDSQFRKFLDILKQLHINIPLVDALEQMPNYAKFLKDIVSRKKKIGEHELVAMTKCSSEAVGSSLPMKCNDPGSFTIPCSIGGKNLG

XP_022142953.1 uncharacterized protein LOC111012947 [Momordica charantia]1.9e-16949.58Show/hide
Query:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEK-QKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMQEYAA
        MST S+LLP DPEIE+T ++ R+EQRL+KQ  K QKE+E E                              +    A N   N I +AD RD AM++YAA
Subjt:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEK-QKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMQEYAA

Query:  TAFQNFDSGIVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPDITDDALRLTLFPFSLKDQARAWLNAFPPGSITTWGSLV
           ++ +S + N  PA A FE KPMM QML  IG F G EHEDP  HLKSFI++AN FRLP I+DDALRLTLFPFSL  QA AWLNAFP  +I T   +V
Subjt:  TAFQNFDSGIVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPDITDDALRLTLFPFSLKDQARAWLNAFPPGSITTWGSLV

Query:  EKFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNEL
        +KFL K+FPPTR+AD+REEIISFRQ + E V+ AWERFK+LIR CPN G+PAC+QIEHFFR  D PT MMLN AANG FT K+FNEIV+IL+ L+ HN+ 
Subjt:  EKFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNEL

Query:  WCSQRSRAAPKKQDPAGVLALDIATSMKKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGN
        WCS++ R   K+ DPA VLALD  TSM+K++ T+ Q LK M    KN  A  + P  ++   P+PV Q+ +  C                         N
Subjt:  WCSQRSRAAPKKQDPAGVLALDIATSMKKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGN

Query:  NRNFNPYSNTYNPGWRHHPNFSWGGQGCSSGFNQGQSQQNKQPYVP---PTQQYIPPPQQQYN-QRTQTPPVHNNNSNLENMMKEY-----------MAR
         + FNPYSN YNPGW+ HPNFSW GQG SSG   GQ+QQ KQ Y P   P     PP  QQYN Q+    P   N SN+E +MKE+           M R
Subjt:  NRNFNPYSNTYNPGWRHHPNFSWGGQGCSSGFNQGQSQQNKQPYVP---PTQQYIPPPQQQYN-QRTQTPPVHNNNSNLENMMKEY-----------MAR

Query:  TDAVIQSQA----------ASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEK
        TDA I+              ++RN E QLGQLANE++ RPQGS P  TE P+R              +    P+    D Q+        +P+    PE 
Subjt:  TDAVIQSQA----------ASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEK

Query:  ENIRKGNEDTPSVPPQVQPSFTPTPPFPQRLVKKNNDSQFRKFLDILKQLHINIPLVDALEQMPNYAKFLKDIVSRKKKIGEHELVAMTKCSSEAVGSSL
                 + SV PQV    +P PPFPQRLV+KN D+ FRKFLDILKQLHINIP V+ALEQMP YAKFLKDI++RKKK+GE+E VA+T+CSS    S  
Subjt:  ENIRKGNEDTPSVPPQVQPSFTPTPPFPQRLVKKNNDSQFRKFLDILKQLHINIPLVDALEQMPNYAKFLKDIVSRKKKIGEHELVAMTKCSSEAVGSSL

Query:  PMKCNDPGSFTIPCSIGGKNLG
        P K  DPGSFTI C IGGK++G
Subjt:  PMKCNDPGSFTIPCSIGGKNLG

XP_022158408.1 uncharacterized protein LOC111024897, partial [Momordica charantia]5.3e-29788.64Show/hide
Query:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMQEYAAT
        MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLE QKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFN+IQMADNRDVAM+EYAAT
Subjt:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMQEYAAT

Query:  AFQNFDSGIVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPDITDDALRLTLFPFSLKDQARAWLNAFPPGSITTWGSLVE
        AFQNFDSGIVNPIPAH NFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLP ITDDA  LTLFPFSLKDQAR  LNAFP GSITTWGSLVE
Subjt:  AFQNFDSGIVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPDITDDALRLTLFPFSLKDQARAWLNAFPPGSITTWGSLVE

Query:  KFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELW
        KFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKC NHGLPAC QIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELW
Subjt:  KFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELW

Query:  CSQRSRAAPKKQDPAGVLALDIATSMKKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNN
        CSQRSRAAPKKQDPAGVLALDIATSM+KEMVTMNQRLKEMALGIKNPLAT IQPVQSDYCT APVCQVNDLIC                           
Subjt:  CSQRSRAAPKKQDPAGVLALDIATSMKKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNN

Query:  RNFNPYSNTYNPGWRHHPNFSWGGQGCSSGFNQGQSQQNKQPYVPPTQQYIPPPQQQYNQRTQTPPVHNNNSNLENMMKEYMARTDAVIQSQAASMRNFE
                     WRHHPNFSWGGQG SSGFNQGQSQQNKQPYVPPTQQ+IPPPQQQYNQRTQTPP+ NNNSNLENMMKEYMARTDAVIQSQAASMRNF 
Subjt:  RNFNPYSNTYNPGWRHHPNFSWGGQGCSSGFNQGQSQQNKQPYVPPTQQYIPPPQQQYNQRTQTPPVHNNNSNLENMMKEYMARTDAVIQSQAASMRNFE

Query:  TQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRKGNEDTPSVPPQ
        TQLG LANELKNRPQGSFPGHTELP+REGKEQCKAVTLRSGL YDGPTMPTTDVQIPST+P VKIPENPTTPEKENIRKGN+DT SVPPQ
Subjt:  TQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRKGNEDTPSVPPQ

XP_022159235.1 uncharacterized protein LOC111025653 [Momordica charantia]4.2e-18554.01Show/hide
Query:  ARNDEFNHIQMADNRDVAMQEYAATAFQNFDSGIVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPDITDDALRLTLFPFS
        A N   N I +AD RD AM++YAA   ++ +S ++N  PA A FE KPMM QML  IG FGG EHEDP  HLKSFI++AN FRLP I+DDALRLTLFPFS
Subjt:  ARNDEFNHIQMADNRDVAMQEYAATAFQNFDSGIVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPDITDDALRLTLFPFS

Query:  LKDQARAWLNAFPPGSITTWGSLVEKFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAAN
        +  QA AWLNAFP  +ITTW  +V+KFL K+FPPTR+AD+REEIISFRQ + E V+ AWERFK+LI  CPN G+PAC+QIEHFFRG D  TKMMLN AAN
Subjt:  LKDQARAWLNAFPPGSITTWGSLVEKFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAAN

Query:  GAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMKKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSF
        G FT K+FNEIV+IL+ L+ HN  WCS++SR   K+ DPAGVLALD  TSM+K++ T+ Q LK M    KN          ++   P+PV Q+ +  C +
Subjt:  GAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMKKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSF

Query:  CSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGCSSGFNQGQSQQNKQPYVP---PTQQYIPPPQQQYN-QRTQTPPVHNNN
        C + H  +NCP NP+S++YVG  N + FNPYSNTYNPGW+ HPNFSW GQG S+    G +QQ K+ Y P   P     PP   QYN Q+    P   N 
Subjt:  CSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGCSSGFNQGQSQQNKQPYVP---PTQQYIPPPQQQYN-QRTQTPPVHNNN

Query:  SNLENMMKEYMARTDAVIQS---------------------QAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMP
        SN+E +MKE + + DA ++                         ++R  E QLGQL NE++ RPQGS P  TE P+R GKE C ++  RSGL Y+GP MP
Subjt:  SNLENMMKEYMARTDAVIQS---------------------QAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMP

Query:  TTDVQIPSTEPIVKIPENPTTPEKENIRKGNEDTPSVP--PQVQPSFTPTPPFPQRLVKKNNDSQFRKFLDILKQLHINIPLVDALEQMPNYAKFLKDIV
              PS E      +    P+K       E   SVP  PQV  S  P PPFPQRLV+KN D+ FRKFLDILKQLHINIP V+ALEQMP YAKF+KDI+
Subjt:  TTDVQIPSTEPIVKIPENPTTPEKENIRKGNEDTPSVP--PQVQPSFTPTPPFPQRLVKKNNDSQFRKFLDILKQLHINIPLVDALEQMPNYAKFLKDIV

Query:  SRKKKIGEHELVAMTKCSSEAVGSSLPMKCNDPGSFTIPCSIGGKNLG
        +RKKK+GE+E VA+T+CSS    S +P K  DPGSFTIPC IGGK++G
Subjt:  SRKKKIGEHELVAMTKCSSEAVGSSLPMKCNDPGSFTIPCSIGGKNLG

XP_022159345.1 uncharacterized protein LOC111025764 [Momordica charantia]7.5e-17494.44Show/hide
Query:  MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMKKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQ
        MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIA+SM+KE VTMNQRLKEM LG+KNPLATPIQPVQSDYCTPAPVCQ
Subjt:  MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMKKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQ

Query:  VNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGCSSGFNQGQSQQNKQPYVPPTQQYIPPPQQQYNQRTQTPPV
        VNDLICSFCSENHIYD CPHNPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSW GQG S GFNQGQSQQNKQPYVPPTQQYIPPPQQ+YNQRTQTPPV
Subjt:  VNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGCSSGFNQGQSQQNKQPYVPPTQQYIPPPQQQYNQRTQTPPV

Query:  HNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPE
         NNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFP HTELPKREGKEQCKAVTLRSGLAYD PTMPT DVQIPST P VKIPE
Subjt:  HNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPE

Query:  NPTTPEKENIRKGNEDTPSVPPQV
        NPTTPEK NIRKGNEDTPSVPPQ+
Subjt:  NPTTPEKENIRKGNEDTPSVPPQV

TrEMBL top hitse value%identityAlignment
A0A6J1CPJ3 uncharacterized protein LOC1110129472.1e-16949.58Show/hide
Query:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEK-QKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMQEYAA
        MST S+LLP DPEIE+T ++ R+EQRL+KQ  K QKE+E E                              +    A N   N I +AD RD AM++YAA
Subjt:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEK-QKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMQEYAA

Query:  TAFQNFDSGIVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPDITDDALRLTLFPFSLKDQARAWLNAFPPGSITTWGSLV
           ++ +S + N  PA A FE KPMM QML  IG F G EHEDP  HLKSFI++AN FRLP I+DDALRLTLFPFSL  QA AWLNAFP  +I T   +V
Subjt:  TAFQNFDSGIVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPDITDDALRLTLFPFSLKDQARAWLNAFPPGSITTWGSLV

Query:  EKFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNEL
        +KFL K+FPPTR+AD+REEIISFRQ + E V+ AWERFK+LIR CPN G+PAC+QIEHFFR  D PT MMLN AANG FT K+FNEIV+IL+ L+ HN+ 
Subjt:  EKFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNEL

Query:  WCSQRSRAAPKKQDPAGVLALDIATSMKKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGN
        WCS++ R   K+ DPA VLALD  TSM+K++ T+ Q LK M    KN  A  + P  ++   P+PV Q+ +  C                         N
Subjt:  WCSQRSRAAPKKQDPAGVLALDIATSMKKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGN

Query:  NRNFNPYSNTYNPGWRHHPNFSWGGQGCSSGFNQGQSQQNKQPYVP---PTQQYIPPPQQQYN-QRTQTPPVHNNNSNLENMMKEY-----------MAR
         + FNPYSN YNPGW+ HPNFSW GQG SSG   GQ+QQ KQ Y P   P     PP  QQYN Q+    P   N SN+E +MKE+           M R
Subjt:  NRNFNPYSNTYNPGWRHHPNFSWGGQGCSSGFNQGQSQQNKQPYVP---PTQQYIPPPQQQYN-QRTQTPPVHNNNSNLENMMKEY-----------MAR

Query:  TDAVIQSQA----------ASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEK
        TDA I+              ++RN E QLGQLANE++ RPQGS P  TE P+R              +    P+    D Q+        +P+    PE 
Subjt:  TDAVIQSQA----------ASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEK

Query:  ENIRKGNEDTPSVPPQVQPSFTPTPPFPQRLVKKNNDSQFRKFLDILKQLHINIPLVDALEQMPNYAKFLKDIVSRKKKIGEHELVAMTKCSSEAVGSSL
                 + SV PQV    +P PPFPQRLV+KN D+ FRKFLDILKQLHINIP V+ALEQMP YAKFLKDI++RKKK+GE+E VA+T+CSS    S  
Subjt:  ENIRKGNEDTPSVPPQVQPSFTPTPPFPQRLVKKNNDSQFRKFLDILKQLHINIPLVDALEQMPNYAKFLKDIVSRKKKIGEHELVAMTKCSSEAVGSSL

Query:  PMKCNDPGSFTIPCSIGGKNLG
        P K  DPGSFTI C IGGK++G
Subjt:  PMKCNDPGSFTIPCSIGGKNLG

A0A6J1DSZ5 uncharacterized protein LOC1110241071.9e-13057.69Show/hide
Query:  ARNDEFNHIQMADNRDVAMQEYAATAFQNFDSGIVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPDITDDALRLTLFPFS
        A N   N I +AD +D AM++YAAT  ++ +S ++NP+PA A FE KPMM QML  I  FGG EHEDP  HLKSFI++AN  RLP I+DDALRLTLFPFS
Subjt:  ARNDEFNHIQMADNRDVAMQEYAATAFQNFDSGIVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPDITDDALRLTLFPFS

Query:  LKDQARAWLNAFPPGSITTWGSLVEKFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAAN
        L  QA AWLNAFP G+ITTW  +V+KFL K+FPPTR+AD+REEIISFRQ + E V+ AWE FK+LIR CPN G+PAC+QIEHFFRG D PTKMMLN AAN
Subjt:  LKDQARAWLNAFPPGSITTWGSLVEKFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAAN

Query:  GAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMKKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSF
        G FT K+FNEIV+IL+ L+ HN+ W S+RSR   K+ DPAGVLALD  TSM+K++ T+ Q LK M    KN  A    P  ++   P+PV Q+ +  C +
Subjt:  GAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMKKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSF

Query:  CSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGCSSGFNQGQSQQNKQPYVP---PTQQYIPPPQQQYN-QRTQTPPVHNNN
        C + H  +NCP NP+S++YVG  N + FNPYSNTY+PGW+ HPNFSW GQG SS  + GQ+QQ KQ Y P   P     PP   QYN Q+    PV  N 
Subjt:  CSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGCSSGFNQGQSQQNKQPYVP---PTQQYIPPPQQQYN-QRTQTPPVHNNN

Query:  SNLENMMKEYMARTDA
        SN+E +MKE++ + DA
Subjt:  SNLENMMKEYMARTDA

A0A6J1DW02 uncharacterized protein LOC1110248972.6e-29788.64Show/hide
Query:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMQEYAAT
        MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLE QKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFN+IQMADNRDVAM+EYAAT
Subjt:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMQEYAAT

Query:  AFQNFDSGIVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPDITDDALRLTLFPFSLKDQARAWLNAFPPGSITTWGSLVE
        AFQNFDSGIVNPIPAH NFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLP ITDDA  LTLFPFSLKDQAR  LNAFP GSITTWGSLVE
Subjt:  AFQNFDSGIVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPDITDDALRLTLFPFSLKDQARAWLNAFPPGSITTWGSLVE

Query:  KFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELW
        KFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKC NHGLPAC QIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELW
Subjt:  KFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELW

Query:  CSQRSRAAPKKQDPAGVLALDIATSMKKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNN
        CSQRSRAAPKKQDPAGVLALDIATSM+KEMVTMNQRLKEMALGIKNPLAT IQPVQSDYCT APVCQVNDLIC                           
Subjt:  CSQRSRAAPKKQDPAGVLALDIATSMKKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNN

Query:  RNFNPYSNTYNPGWRHHPNFSWGGQGCSSGFNQGQSQQNKQPYVPPTQQYIPPPQQQYNQRTQTPPVHNNNSNLENMMKEYMARTDAVIQSQAASMRNFE
                     WRHHPNFSWGGQG SSGFNQGQSQQNKQPYVPPTQQ+IPPPQQQYNQRTQTPP+ NNNSNLENMMKEYMARTDAVIQSQAASMRNF 
Subjt:  RNFNPYSNTYNPGWRHHPNFSWGGQGCSSGFNQGQSQQNKQPYVPPTQQYIPPPQQQYNQRTQTPPVHNNNSNLENMMKEYMARTDAVIQSQAASMRNFE

Query:  TQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRKGNEDTPSVPPQ
        TQLG LANELKNRPQGSFPGHTELP+REGKEQCKAVTLRSGL YDGPTMPTTDVQIPST+P VKIPENPTTPEKENIRKGN+DT SVPPQ
Subjt:  TQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRKGNEDTPSVPPQ

A0A6J1DY39 uncharacterized protein LOC1110256532.0e-18554.01Show/hide
Query:  ARNDEFNHIQMADNRDVAMQEYAATAFQNFDSGIVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPDITDDALRLTLFPFS
        A N   N I +AD RD AM++YAA   ++ +S ++N  PA A FE KPMM QML  IG FGG EHEDP  HLKSFI++AN FRLP I+DDALRLTLFPFS
Subjt:  ARNDEFNHIQMADNRDVAMQEYAATAFQNFDSGIVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPDITDDALRLTLFPFS

Query:  LKDQARAWLNAFPPGSITTWGSLVEKFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAAN
        +  QA AWLNAFP  +ITTW  +V+KFL K+FPPTR+AD+REEIISFRQ + E V+ AWERFK+LI  CPN G+PAC+QIEHFFRG D  TKMMLN AAN
Subjt:  LKDQARAWLNAFPPGSITTWGSLVEKFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAAN

Query:  GAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMKKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSF
        G FT K+FNEIV+IL+ L+ HN  WCS++SR   K+ DPAGVLALD  TSM+K++ T+ Q LK M    KN          ++   P+PV Q+ +  C +
Subjt:  GAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMKKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSF

Query:  CSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGCSSGFNQGQSQQNKQPYVP---PTQQYIPPPQQQYN-QRTQTPPVHNNN
        C + H  +NCP NP+S++YVG  N + FNPYSNTYNPGW+ HPNFSW GQG S+    G +QQ K+ Y P   P     PP   QYN Q+    P   N 
Subjt:  CSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGCSSGFNQGQSQQNKQPYVP---PTQQYIPPPQQQYN-QRTQTPPVHNNN

Query:  SNLENMMKEYMARTDAVIQS---------------------QAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMP
        SN+E +MKE + + DA ++                         ++R  E QLGQL NE++ RPQGS P  TE P+R GKE C ++  RSGL Y+GP MP
Subjt:  SNLENMMKEYMARTDAVIQS---------------------QAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMP

Query:  TTDVQIPSTEPIVKIPENPTTPEKENIRKGNEDTPSVP--PQVQPSFTPTPPFPQRLVKKNNDSQFRKFLDILKQLHINIPLVDALEQMPNYAKFLKDIV
              PS E      +    P+K       E   SVP  PQV  S  P PPFPQRLV+KN D+ FRKFLDILKQLHINIP V+ALEQMP YAKF+KDI+
Subjt:  TTDVQIPSTEPIVKIPENPTTPEKENIRKGNEDTPSVP--PQVQPSFTPTPPFPQRLVKKNNDSQFRKFLDILKQLHINIPLVDALEQMPNYAKFLKDIV

Query:  SRKKKIGEHELVAMTKCSSEAVGSSLPMKCNDPGSFTIPCSIGGKNLG
        +RKKK+GE+E VA+T+CSS    S +P K  DPGSFTIPC IGGK++G
Subjt:  SRKKKIGEHELVAMTKCSSEAVGSSLPMKCNDPGSFTIPCSIGGKNLG

A0A6J1DYG0 uncharacterized protein LOC1110257643.6e-17494.44Show/hide
Query:  MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMKKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQ
        MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIA+SM+KE VTMNQRLKEM LG+KNPLATPIQPVQSDYCTPAPVCQ
Subjt:  MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMKKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQ

Query:  VNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGCSSGFNQGQSQQNKQPYVPPTQQYIPPPQQQYNQRTQTPPV
        VNDLICSFCSENHIYD CPHNPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSW GQG S GFNQGQSQQNKQPYVPPTQQYIPPPQQ+YNQRTQTPPV
Subjt:  VNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGCSSGFNQGQSQQNKQPYVPPTQQYIPPPQQQYNQRTQTPPV

Query:  HNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPE
         NNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFP HTELPKREGKEQCKAVTLRSGLAYD PTMPT DVQIPST P VKIPE
Subjt:  HNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPE

Query:  NPTTPEKENIRKGNEDTPSVPPQV
        NPTTPEK NIRKGNEDTPSVPPQ+
Subjt:  NPTTPEKENIRKGNEDTPSVPPQV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAGGCGCCAGGCGCCTGGGAAGCCTGCAGAAAAATAGGTTTTCTTCCAACTTTGCCCTTAATGAAACGCGTCTTCCAATGCGTTTTGGTGGTTCCAACCGATGCAT
ACGTGTAGAAGAAGTGTTCCACTATCAGTTTGAGCACGATTTGGGTAAGTTAAAGTGCATGAGCACAAGATCTTTTCTTCTACCCCTTGACCCTGAGATTGAGCGGACCC
TTCGAAAAACTCGAAAGGAGCAAAGACTTCGAAAACAACTAGAAAAGCAAAAAGAGAGAGAAGGTGAAATCAGTCCTGAAAGTGAGGTAGAGAGTACGAGCACATCAATG
GCAGATATTCCACCTCGTGATCCGGTTGATCCACCTGCTGTTAACGGTAATATGAGGGATCATGCAAGAAATGATGAATTCAACCATATCCAGATGGCGGACAACAGAGA
CGTGGCAATGCAAGAATATGCCGCCACGGCTTTTCAGAACTTTGATTCAGGGATAGTCAACCCTATTCCAGCCCACGCAAACTTTGAGCTTAAACCAATGATGTTCCAAA
TGTTGCAGACAATTGGACATTTTGGAGGGCAGGAACATGAGGATCCACATGATCATCTGAAATCATTCATTCAAATTGCAAATGCATTTCGATTACCTGATATAACAGAC
GATGCTCTGAGACTAACACTTTTTCCATTTTCTTTGAAGGACCAAGCTAGAGCATGGCTCAATGCATTTCCACCGGGATCCATCACCACATGGGGGTCGTTAGTGGAGAA
GTTCTTAACAAAATTCTTCCCACCTACTCGCCACGCTGATATCAGAGAGGAGATCATCTCCTTTAGACAGTATGATCGTGAACCTGTTCACGAGGCGTGGGAGAGATTTA
AAGAACTAATCAGGAAATGTCCGAACCATGGCTTACCGGCATGCATCCAGATAGAACATTTCTTTAGAGGTTTAGATCATCCTACTAAGATGATGCTAAACAATGCTGCC
AACGGAGCCTTTACAAAAAAGACATTCAACGAGATAGTTGACATCCTAAATGACTTAGCTTCACACAACGAACTATGGTGTTCGCAAAGATCTAGGGCAGCACCAAAGAA
GCAAGATCCAGCTGGAGTTTTGGCTCTGGACATTGCGACCTCGATGAAAAAAGAGATGGTTACAATGAACCAGAGGCTGAAAGAGATGGCATTGGGAATAAAAAATCCAT
TAGCCACGCCGATACAACCTGTGCAGTCGGATTATTGCACTCCTGCCCCTGTTTGCCAAGTCAACGATCTCATTTGTTCATTTTGCAGTGAAAACCATATCTATGATAAT
TGTCCACATAACCCTGCTTCCGTTTTTTATGTAGGACATGGGAACAATAGGAACTTTAACCCATATTCGAACACCTACAACCCAGGTTGGAGGCACCACCCTAATTTCTC
ATGGGGAGGTCAAGGATGTTCGAGCGGTTTTAATCAAGGGCAGAGCCAACAGAACAAGCAGCCCTATGTTCCACCTACACAACAATACATCCCACCACCGCAACAGCAGT
ACAATCAGAGAACACAGACTCCACCAGTTCACAATAACAACTCAAATCTTGAGAATATGATGAAGGAGTACATGGCCCGAACCGACGCAGTGATACAATCCCAAGCGGCA
TCAATGAGGAATTTCGAGACCCAATTGGGACAGCTCGCCAATGAATTGAAGAATAGACCACAAGGTTCTTTTCCAGGCCATACTGAATTACCAAAACGAGAAGGGAAAGA
ACAGTGCAAAGCTGTCACCCTCAGGAGTGGACTGGCATATGATGGACCAACAATGCCAACAACAGATGTACAGATTCCGTCCACTGAACCAATTGTAAAGATACCAGAAA
ATCCAACAACACCAGAAAAAGAAAATATTAGAAAAGGTAATGAGGACACCCCGAGTGTTCCTCCACAGGTACAGCCTTCTTTTACCCCTACTCCCCCGTTTCCACAAAGA
TTAGTTAAGAAAAATAATGATTCCCAGTTTAGAAAATTTTTAGATATTTTAAAACAACTGCATATAAATATACCATTAGTAGATGCTCTAGAACAGATGCCAAATTATGC
TAAGTTTTTGAAAGATATAGTTTCTAGGAAGAAAAAAATAGGAGAGCATGAACTGGTAGCCATGACAAAATGTAGTAGTGAAGCTGTAGGCAGCTCGCTACCCATGAAAT
GTAATGATCCTGGCAGTTTTACCATTCCATGTTCCATAGGAGGAAAAAACTTAGGAGACTTTGAAGAGTGCTCTGCTATAAATAGCTTGAATCCTGTTATGTTTGATGAG
TTTTATGACTTATTAGTTACAGAGATTGAAGAAGAGCTTGATAAGATAGCAGAAGGACCAGAAGATGTGGCTAATCCTATTGAAAAAATACAAAAAGAAGAATGCAAGTC
GTTACTTCCGTCCATAGTGGAACCACCCACGTTGGAGCAGAAGCCATTGCCGTCGCATTTGAAATATGCGTATCTAGGGGATAACGACACTTTACCAGCGCCTGGCGCCT
GCCCTGTTTTTCCAGCATTTCGAAAACTCTCAGGTTCGAGAAGTCGTGCAACATATCTACAACTTACGGACTTCATTGGAATTTGCGGTTTTACCTTCATGGCCTCCAGC
GCTAGCTGCTATCCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGAGGCGCCAGGCGCCTGGGAAGCCTGCAGAAAAATAGGTTTTCTTCCAACTTTGCCCTTAATGAAACGCGTCTTCCAATGCGTTTTGGTGGTTCCAACCGATGCAT
ACGTGTAGAAGAAGTGTTCCACTATCAGTTTGAGCACGATTTGGGTAAGTTAAAGTGCATGAGCACAAGATCTTTTCTTCTACCCCTTGACCCTGAGATTGAGCGGACCC
TTCGAAAAACTCGAAAGGAGCAAAGACTTCGAAAACAACTAGAAAAGCAAAAAGAGAGAGAAGGTGAAATCAGTCCTGAAAGTGAGGTAGAGAGTACGAGCACATCAATG
GCAGATATTCCACCTCGTGATCCGGTTGATCCACCTGCTGTTAACGGTAATATGAGGGATCATGCAAGAAATGATGAATTCAACCATATCCAGATGGCGGACAACAGAGA
CGTGGCAATGCAAGAATATGCCGCCACGGCTTTTCAGAACTTTGATTCAGGGATAGTCAACCCTATTCCAGCCCACGCAAACTTTGAGCTTAAACCAATGATGTTCCAAA
TGTTGCAGACAATTGGACATTTTGGAGGGCAGGAACATGAGGATCCACATGATCATCTGAAATCATTCATTCAAATTGCAAATGCATTTCGATTACCTGATATAACAGAC
GATGCTCTGAGACTAACACTTTTTCCATTTTCTTTGAAGGACCAAGCTAGAGCATGGCTCAATGCATTTCCACCGGGATCCATCACCACATGGGGGTCGTTAGTGGAGAA
GTTCTTAACAAAATTCTTCCCACCTACTCGCCACGCTGATATCAGAGAGGAGATCATCTCCTTTAGACAGTATGATCGTGAACCTGTTCACGAGGCGTGGGAGAGATTTA
AAGAACTAATCAGGAAATGTCCGAACCATGGCTTACCGGCATGCATCCAGATAGAACATTTCTTTAGAGGTTTAGATCATCCTACTAAGATGATGCTAAACAATGCTGCC
AACGGAGCCTTTACAAAAAAGACATTCAACGAGATAGTTGACATCCTAAATGACTTAGCTTCACACAACGAACTATGGTGTTCGCAAAGATCTAGGGCAGCACCAAAGAA
GCAAGATCCAGCTGGAGTTTTGGCTCTGGACATTGCGACCTCGATGAAAAAAGAGATGGTTACAATGAACCAGAGGCTGAAAGAGATGGCATTGGGAATAAAAAATCCAT
TAGCCACGCCGATACAACCTGTGCAGTCGGATTATTGCACTCCTGCCCCTGTTTGCCAAGTCAACGATCTCATTTGTTCATTTTGCAGTGAAAACCATATCTATGATAAT
TGTCCACATAACCCTGCTTCCGTTTTTTATGTAGGACATGGGAACAATAGGAACTTTAACCCATATTCGAACACCTACAACCCAGGTTGGAGGCACCACCCTAATTTCTC
ATGGGGAGGTCAAGGATGTTCGAGCGGTTTTAATCAAGGGCAGAGCCAACAGAACAAGCAGCCCTATGTTCCACCTACACAACAATACATCCCACCACCGCAACAGCAGT
ACAATCAGAGAACACAGACTCCACCAGTTCACAATAACAACTCAAATCTTGAGAATATGATGAAGGAGTACATGGCCCGAACCGACGCAGTGATACAATCCCAAGCGGCA
TCAATGAGGAATTTCGAGACCCAATTGGGACAGCTCGCCAATGAATTGAAGAATAGACCACAAGGTTCTTTTCCAGGCCATACTGAATTACCAAAACGAGAAGGGAAAGA
ACAGTGCAAAGCTGTCACCCTCAGGAGTGGACTGGCATATGATGGACCAACAATGCCAACAACAGATGTACAGATTCCGTCCACTGAACCAATTGTAAAGATACCAGAAA
ATCCAACAACACCAGAAAAAGAAAATATTAGAAAAGGTAATGAGGACACCCCGAGTGTTCCTCCACAGGTACAGCCTTCTTTTACCCCTACTCCCCCGTTTCCACAAAGA
TTAGTTAAGAAAAATAATGATTCCCAGTTTAGAAAATTTTTAGATATTTTAAAACAACTGCATATAAATATACCATTAGTAGATGCTCTAGAACAGATGCCAAATTATGC
TAAGTTTTTGAAAGATATAGTTTCTAGGAAGAAAAAAATAGGAGAGCATGAACTGGTAGCCATGACAAAATGTAGTAGTGAAGCTGTAGGCAGCTCGCTACCCATGAAAT
GTAATGATCCTGGCAGTTTTACCATTCCATGTTCCATAGGAGGAAAAAACTTAGGAGACTTTGAAGAGTGCTCTGCTATAAATAGCTTGAATCCTGTTATGTTTGATGAG
TTTTATGACTTATTAGTTACAGAGATTGAAGAAGAGCTTGATAAGATAGCAGAAGGACCAGAAGATGTGGCTAATCCTATTGAAAAAATACAAAAAGAAGAATGCAAGTC
GTTACTTCCGTCCATAGTGGAACCACCCACGTTGGAGCAGAAGCCATTGCCGTCGCATTTGAAATATGCGTATCTAGGGGATAACGACACTTTACCAGCGCCTGGCGCCT
GCCCTGTTTTTCCAGCATTTCGAAAACTCTCAGGTTCGAGAAGTCGTGCAACATATCTACAACTTACGGACTTCATTGGAATTTGCGGTTTTACCTTCATGGCCTCCAGC
GCTAGCTGCTATCCTTAG
Protein sequenceShow/hide protein sequence
MGGARRLGSLQKNRFSSNFALNETRLPMRFGGSNRCIRVEEVFHYQFEHDLGKLKCMSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSM
ADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMQEYAATAFQNFDSGIVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPDITD
DALRLTLFPFSLKDQARAWLNAFPPGSITTWGSLVEKFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAA
NGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMKKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDN
CPHNPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGCSSGFNQGQSQQNKQPYVPPTQQYIPPPQQQYNQRTQTPPVHNNNSNLENMMKEYMARTDAVIQSQAA
SMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRKGNEDTPSVPPQVQPSFTPTPPFPQR
LVKKNNDSQFRKFLDILKQLHINIPLVDALEQMPNYAKFLKDIVSRKKKIGEHELVAMTKCSSEAVGSSLPMKCNDPGSFTIPCSIGGKNLGDFEECSAINSLNPVMFDE
FYDLLVTEIEEELDKIAEGPEDVANPIEKIQKEECKSLLPSIVEPPTLEQKPLPSHLKYAYLGDNDTLPAPGACPVFPAFRKLSGSRSRATYLQLTDFIGICGFTFMASS
ASCYP