CuGenDBv2

Moc11g17780 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc11g17780
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Reverse transcriptase
Genome location: chr11:13610232..13629383
RNA-Seq Expression: Moc11g17780
Synteny: Moc11g17780
Gene Ontology terms:
    GO:0003676 - nucleic acid binding (molecular function)
    GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
    IPR001878 - Zinc finger, CCHC-type
    IPR005162 - Retrotransposon gag domain


Homology
GenBank top hits (e-value, % identity, alignment)
XP_017233063.1 PREDICTED: uncharacterized protein LOC108207110 [Daucus carota subsp. sativus] | e-value: 1.5e-132 | identity: 42.58%
Query:  DPEIERTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSVADIP--PRDPVDPPAVNGNMRDHA--RNDEFNHIQMADNRDVAMREYAATAFQNFD
        DPEIERT  + RK QR        K ++ +++ +  V +      DIP  PR        +  +R +A  R +E N   +  N      E     FQ   
Subjt:  DPEIERTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSVADIP--PRDPVDPPAVNGNMRDHA--RNDEFNHIQMADNRDVAMREYAATAFQNFD

Query:  -----SGIEHEDPHDHLKSFIQIANAFRLPGITDDALRLTLFPFSLKDQARAWLNAFPPGSITTWGSLVEKFLTKFFPPTRHADIREEIISFRQYDREPV
             SG+  EDPH HL+ F++I+++F+  G+ +DALRL LFP+S++D+AR WLN+ P GS+TTW  L EKFL+K+FPP  +A +R EI SF+Q D E +
Subjt:  -----SGIEHEDPHDHLKSFIQIANAFRLPGITDDALRLTLFPFSLKDQARAWLNAFPPGSITTWGSLVEKFLTKFFPPTRHADIREEIISFRQYDREPV

Query:  HEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEM
        ++AWERFKEL+RKCP+HG+  CIQ+E F+ GL+  TKM+++ +ANGA   K++N+  +IL  +A+ N  W S R++   K    AG+  +D  TSM+ ++
Subjt:  HEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEM

Query:  VTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNPYLNTYNPGWRHHPNFSWGGQGGSSG
         +M   LK +++G              +    + + Q  ++ C FC E H YD+CP NP SVFY+  GN     PY NTYN  WR HPNFSW  QG +SG
Subjt:  VTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNPYLNTYNPGWRHHPNFSWGGQGGSSG

Query:  FNQGQSQQNKQPYVPPTQQYIPPPQQQYNQRTQTPPVQNNNSNLENMMKEYM-------ARTDAMIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTE
         + G  + N   Y P   Q  P                  +++LENM+KEY+       ++T+A++QSQAAS+RN E Q+GQLANEL+NRP G+ P  TE
Subjt:  FNQGQSQQNKQPYVPPTQQYIPPPQQQYNQRTQTPPVQNNNSNLENMMKEYM-------ARTDAMIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTE

Query:  LPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRKGNEDTPSVPPQVQPS-FTPTPPFPQRLVKKNNDSQFRKFLDILK
         PK  G E CKA+TL+SG    G T+  TD +   +       E P   E EN      D  S P   + S   P PPFPQR  K+  + QF+KFLD+LK
Subjt:  LPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRKGNEDTPSVPPQVQPS-FTPTPPFPQRLVKKNNDSQFRKFLDILK

Query:  QLHINIPLVDALEQMPNYAKFLKDIVSRKKKIGEHELVAMTKCSSEAVGSPLPMKCNDPGSFTIPCSIGGKNLG
        QLHINIPLV+ALEQMPNY KF+KDI+++K+++GE E VA+TK  S  +   LP K  DPGSFTIPC+IG    G
Subjt:  QLHINIPLVDALEQMPNYAKFLKDIVSRKKKIGEHELVAMTKCSSEAVGSPLPMKCNDPGSFTIPCSIGGKNLG

XP_022142953.1 uncharacterized protein LOC111012947 [Momordica charantia] | e-value: 1.1e-156 | identity: 47.78%
Query:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEK-QKEREGEISPESEVESTSTSVADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAA
        MST S+LLP DPEIE+T ++ R+EQRL+KQ  K QKE+E E                              +    A N   N I +AD RD AMR+YAA
Subjt:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEK-QKEREGEISPESEVESTSTSVADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAA

Query:  TAFQNFDS-----------------------------GIEHEDPHDHLKSFIQIANAFRLPGITDDALRLTLFPFSLKDQARAWLNAFPPGSITTWGSLV
           ++ +S                             G+EHEDP  HLKSFI++AN FRLPGI+DDALRLTLFPFSL  QA AWLNAFP  +I T   +V
Subjt:  TAFQNFDS-----------------------------GIEHEDPHDHLKSFIQIANAFRLPGITDDALRLTLFPFSLKDQARAWLNAFPPGSITTWGSLV

Query:  EKFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNEL
        +KFL K+FPPTR+AD+REEIISFRQ + E V+ AWERFK+LIR CPN G+PAC+QIEHFFR  D PT MMLN AANG FT K+FNEIV+IL+ L+ HN+ 
Subjt:  EKFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNEL

Query:  WCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGN
        WCS++ R   K+ DPA VLALD  TSMQK++ T+ Q LK M    KN  A  + P  ++   P+PV Q+ +  C                         N
Subjt:  WCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGN

Query:  NRNFNPYLNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVP---PTQQYIPPPQQQYN-QRTQTPPVQNNNSNLENMMKEY-----------MAR
         + FNPY N YNPGW+ HPNFSW GQG SSG   GQ+QQ KQ Y P   P     PP  QQYN Q+    P Q N SN+E +MKE+           M R
Subjt:  NRNFNPYLNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVP---PTQQYIPPPQQQYN-QRTQTPPVQNNNSNLENMMKEY-----------MAR

Query:  TDAMIQSQA----------ASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEK
        TDA I+              ++RN E QLGQLANE++ RPQGS P  TE P+R              +    P+    D Q+        +P+    PE 
Subjt:  TDAMIQSQA----------ASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEK

Query:  ENIRKGNEDTPSVPPQVQPSFTPTPPFPQRLVKKNNDSQFRKFLDILKQLHINIPLVDALEQMPNYAKFLKDIVSRKKKIGEHELVAMTKCSSEAVGSPL
                 + SV PQV    +P PPFPQRLV+KN D+ FRKFLDILKQLHINIP V+ALEQMP YAKFLKDI++RKKK+GE+E VA+T+CSS    S  
Subjt:  ENIRKGNEDTPSVPPQVQPSFTPTPPFPQRLVKKNNDSQFRKFLDILKQLHINIPLVDALEQMPNYAKFLKDIVSRKKKIGEHELVAMTKCSSEAVGSPL

Query:  PMKCNDPGSFTIPCSIGGKNLG
        P K  DPGSFTI C IGGK++G
Subjt:  PMKCNDPGSFTIPCSIGGKNLG

XP_022158408.1 uncharacterized protein LOC111024897, partial [Momordica charantia] | e-value: 7.0e-276 | identity: 84.41%
Query:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSVADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAAT
        MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLE QKEREGEISPESEVESTSTS+ADIPPRDPVDPPAVNGNMRDHARNDEFN+IQMADNRDVAMREYAAT
Subjt:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSVADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAAT

Query:  AFQNFDSGI-----------------------------EHEDPHDHLKSFIQIANAFRLPGITDDALRLTLFPFSLKDQARAWLNAFPPGSITTWGSLVE
        AFQNFDSGI                             EHEDPHDHLKSFIQIANAFRLPGITDDA  LTLFPFSLKDQAR  LNAFP GSITTWGSLVE
Subjt:  AFQNFDSGI-----------------------------EHEDPHDHLKSFIQIANAFRLPGITDDALRLTLFPFSLKDQARAWLNAFPPGSITTWGSLVE

Query:  KFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELW
        KFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKC NHGLPAC QIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELW
Subjt:  KFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELW

Query:  CSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNN
        CSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLAT IQPVQSDYCT APVCQVNDLIC                           
Subjt:  CSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNN

Query:  RNFNPYLNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPPQQQYNQRTQTPPVQNNNSNLENMMKEYMARTDAMIQSQAASMRNFE
                     WRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQ+IPPPQQQYNQRTQTPP+QNNNSNLENMMKEYMARTDA+IQSQAASMRNF 
Subjt:  RNFNPYLNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPPQQQYNQRTQTPPVQNNNSNLENMMKEYMARTDAMIQSQAASMRNFE

Query:  TQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRKGNEDTPSVPPQ
        TQLG LANELKNRPQGSFPGHTELP+REGKEQCKAVTLRSGL YDGPTMPTTDVQIPST+P VKIPENPTTPEKENIRKGN+DT SVPPQ
Subjt:  TQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRKGNEDTPSVPPQ

XP_022159235.1 uncharacterized protein LOC111025653 [Momordica charantia] | e-value: 1.0e-170 | identity: 51.85%
Query:  ARNDEFNHIQMADNRDVAMREYAATAFQNFDS-----------------------------GIEHEDPHDHLKSFIQIANAFRLPGITDDALRLTLFPFS
        A N   N I +AD RD AMR+YAA   ++ +S                             G+EHEDP  HLKSFI++AN FRLPGI+DDALRLTLFPFS
Subjt:  ARNDEFNHIQMADNRDVAMREYAATAFQNFDS-----------------------------GIEHEDPHDHLKSFIQIANAFRLPGITDDALRLTLFPFS

Query:  LKDQARAWLNAFPPGSITTWGSLVEKFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAAN
        +  QA AWLNAFP  +ITTW  +V+KFL K+FPPTR+AD+REEIISFRQ + E V+ AWERFK+LI  CPN G+PAC+QIEHFFRG D  TKMMLN AAN
Subjt:  LKDQARAWLNAFPPGSITTWGSLVEKFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAAN

Query:  GAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSF
        G FT K+FNEIV+IL+ L+ HN  WCS++SR   K+ DPAGVLALD  TSMQK++ T+ Q LK M    KN          ++   P+PV Q+ +  C +
Subjt:  GAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSF

Query:  CSENHIYDNCPHNPASVFYVGHGNNRNFNPYLNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVP---PTQQYIPPPQQQYN-QRTQTPPVQNNN
        C + H  +NCP NP+S++YVG  N + FNPY NTYNPGW+ HPNFSW GQG S+    G +QQ K+ Y P   P     PP   QYN Q+    P Q N 
Subjt:  CSENHIYDNCPHNPASVFYVGHGNNRNFNPYLNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVP---PTQQYIPPPQQQYN-QRTQTPPVQNNN

Query:  SNLENMMKEYMARTDAMIQS---------------------QAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMP
        SN+E +MKE + + DA ++                         ++R  E QLGQL NE++ RPQGS P  TE P+R GKE C ++  RSGL Y+GP MP
Subjt:  SNLENMMKEYMARTDAMIQS---------------------QAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMP

Query:  TTDVQIPSTEPIVKIPENPTTPEKENIRKGNEDTPSVP--PQVQPSFTPTPPFPQRLVKKNNDSQFRKFLDILKQLHINIPLVDALEQMPNYAKFLKDIV
              PS E      +    P+K       E   SVP  PQV  S  P PPFPQRLV+KN D+ FRKFLDILKQLHINIP V+ALEQMP YAKF+KDI+
Subjt:  TTDVQIPSTEPIVKIPENPTTPEKENIRKGNEDTPSVP--PQVQPSFTPTPPFPQRLVKKNNDSQFRKFLDILKQLHINIPLVDALEQMPNYAKFLKDIV

Query:  SRKKKIGEHELVAMTKCSSEAVGSPLPMKCNDPGSFTIPCSIGGKNLG
        +RKKK+GE+E VA+T+CSS    S +P K  DPGSFTIPC IGGK++G
Subjt:  SRKKKIGEHELVAMTKCSSEAVGSPLPMKCNDPGSFTIPCSIGGKNLG

XP_022159345.1 uncharacterized protein LOC111025764 [Momordica charantia] | e-value: 3.5e-174 | identity: 94.75%
Query:  MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQ
        MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIA+SMQKE VTMNQRLKEM LG+KNPLATPIQPVQSDYCTPAPVCQ
Subjt:  MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQ

Query:  VNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNPYLNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPPQQQYNQRTQTPPV
        VNDLICSFCSENHIYD CPHNPASVFYVGHGNNRNFNPY NTYNPGWRHHPNFSW GQGGS GFNQGQSQQNKQPYVPPTQQYIPPPQQ+YNQRTQTPPV
Subjt:  VNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNPYLNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPPQQQYNQRTQTPPV

Query:  QNNNSNLENMMKEYMARTDAMIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPE
        QNNNSNLENMMKEYMARTDA+IQSQAASMRNFETQLGQLANELKNRPQGSFP HTELPKREGKEQCKAVTLRSGLAYD PTMPT DVQIPST P VKIPE
Subjt:  QNNNSNLENMMKEYMARTDAMIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPE

Query:  NPTTPEKENIRKGNEDTPSVPPQV
        NPTTPEK NIRKGNEDTPSVPPQ+
Subjt:  NPTTPEKENIRKGNEDTPSVPPQV

TrEMBL top hits (e-value, % identity, alignment)
A0A6J0ZX64 LOW QUALITY PROTEIN: uncharacterized protein LOC110412945 | e-value: 6.9e-120 | identity: 38.96%
Query:  LLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSVADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAATAFQNFD
        L+P DP+IERT R+ R+E      L +    +   +  + +       A+   RD V  P V G +    R    N    A+N ++              
Subjt:  LLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSVADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAATAFQNFD

Query:  SGIEHEDPHDHLKSFIQIANAFRLPGITDDALRLTLFPFSLKDQARAWLNAFPPGSITTWGSLVEKFLTKFFPPTRHADIREEIISFRQYDREPVHEAWE
        SG+  +DP+ HL +F++I + F+  G+TDDA+RL LFPFSL+D+A++WLN+ P GSITTW  L +KFL KFFPP + A +R +I SF Q+D E ++EAWE
Subjt:  SGIEHEDPHDHLKSFIQIANAFRLPGITDDALRLTLFPFSLKDQARAWLNAFPPGSITTWGSLVEKFLTKFFPPTRHADIREEIISFRQYDREPVHEAWE

Query:  RFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQ
        RFKEL+R+CP+HG+P  +Q++ F+ GL    K +++ AA GA   K   +  ++L ++AS+N  W S+RS +    +   G   +D   ++  ++  +++
Subjt:  RFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQ

Query:  RLKEMAL-GIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNPYLNTYNPGWRHHPNFSWGGQGGSSGFNQG
        +L  + +  ++N L                      ++C  C ++H YD CP+N  SV +VG+ N +  NPY NTYNPGWR+HPNFSW    G S     
Subjt:  RLKEMAL-GIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNPYLNTYNPGWRHHPNFSWGGQGGSSGFNQG

Query:  QSQQNKQPYVPPTQQYIPPPQQQYNQRTQTPPVQNNNSNLENMMKEYMARTDAMIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCK
            N +P +PP  Q    PQ           +    S LE ++ +Y+++TDA+IQSQ AS+RN ETQ+GQLAN + NRPQGS P  T++   +GKEQC+
Subjt:  QSQQNKQPYVPPTQQYIPPPQQQYNQRTQTPPVQNNNSNLENMMKEYMARTDAMIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCK

Query:  AVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRKGNEDTPSVPPQVQPSFTPTPPFPQRLVKKNNDSQFRKFLDILKQLHINIPLVDAL
        A+TLRSG   +G      + +I   +            +K++ +  N+ T  V         P PPFPQRL K+  + QF+KFL++ K+LHINIP  +AL
Subjt:  AVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRKGNEDTPSVPPQVQPSFTPTPPFPQRLVKKNNDSQFRKFLDILKQLHINIPLVDAL

Query:  EQMPNYAKFLKDIVSRKKKIGEHELVAMTKCSSEAVGSPLPMKCNDPGSFTIPCSIG
        EQMP+Y KFLKDI+S+K+K+GE E V +T+  S  + + LP K  DPGSFTIPC+IG
Subjt:  EQMPNYAKFLKDIVSRKKKIGEHELVAMTKCSSEAVGSPLPMKCNDPGSFTIPCSIG

A0A6J1CPJ3 uncharacterized protein LOC111012947 | e-value: 1.2e-156 | identity: 47.78%
Query:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEK-QKEREGEISPESEVESTSTSVADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAA
        MST S+LLP DPEIE+T ++ R+EQRL+KQ  K QKE+E E                              +    A N   N I +AD RD AMR+YAA
Subjt:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEK-QKEREGEISPESEVESTSTSVADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAA

Query:  TAFQNFDS-----------------------------GIEHEDPHDHLKSFIQIANAFRLPGITDDALRLTLFPFSLKDQARAWLNAFPPGSITTWGSLV
           ++ +S                             G+EHEDP  HLKSFI++AN FRLPGI+DDALRLTLFPFSL  QA AWLNAFP  +I T   +V
Subjt:  TAFQNFDS-----------------------------GIEHEDPHDHLKSFIQIANAFRLPGITDDALRLTLFPFSLKDQARAWLNAFPPGSITTWGSLV

Query:  EKFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNEL
        +KFL K+FPPTR+AD+REEIISFRQ + E V+ AWERFK+LIR CPN G+PAC+QIEHFFR  D PT MMLN AANG FT K+FNEIV+IL+ L+ HN+ 
Subjt:  EKFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNEL

Query:  WCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGN
        WCS++ R   K+ DPA VLALD  TSMQK++ T+ Q LK M    KN  A  + P  ++   P+PV Q+ +  C                         N
Subjt:  WCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGN

Query:  NRNFNPYLNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVP---PTQQYIPPPQQQYN-QRTQTPPVQNNNSNLENMMKEY-----------MAR
         + FNPY N YNPGW+ HPNFSW GQG SSG   GQ+QQ KQ Y P   P     PP  QQYN Q+    P Q N SN+E +MKE+           M R
Subjt:  NRNFNPYLNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVP---PTQQYIPPPQQQYN-QRTQTPPVQNNNSNLENMMKEY-----------MAR

Query:  TDAMIQSQA----------ASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEK
        TDA I+              ++RN E QLGQLANE++ RPQGS P  TE P+R              +    P+    D Q+        +P+    PE 
Subjt:  TDAMIQSQA----------ASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEK

Query:  ENIRKGNEDTPSVPPQVQPSFTPTPPFPQRLVKKNNDSQFRKFLDILKQLHINIPLVDALEQMPNYAKFLKDIVSRKKKIGEHELVAMTKCSSEAVGSPL
                 + SV PQV    +P PPFPQRLV+KN D+ FRKFLDILKQLHINIP V+ALEQMP YAKFLKDI++RKKK+GE+E VA+T+CSS    S  
Subjt:  ENIRKGNEDTPSVPPQVQPSFTPTPPFPQRLVKKNNDSQFRKFLDILKQLHINIPLVDALEQMPNYAKFLKDIVSRKKKIGEHELVAMTKCSSEAVGSPL

Query:  PMKCNDPGSFTIPCSIGGKNLG
        P K  DPGSFTI C IGGK++G
Subjt:  PMKCNDPGSFTIPCSIGGKNLG

A0A6J1DW02 uncharacterized protein LOC111024897 | e-value: 3.4e-276 | identity: 84.41%
Query:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSVADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAAT
        MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLE QKEREGEISPESEVESTSTS+ADIPPRDPVDPPAVNGNMRDHARNDEFN+IQMADNRDVAMREYAAT
Subjt:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSVADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAAT

Query:  AFQNFDSGI-----------------------------EHEDPHDHLKSFIQIANAFRLPGITDDALRLTLFPFSLKDQARAWLNAFPPGSITTWGSLVE
        AFQNFDSGI                             EHEDPHDHLKSFIQIANAFRLPGITDDA  LTLFPFSLKDQAR  LNAFP GSITTWGSLVE
Subjt:  AFQNFDSGI-----------------------------EHEDPHDHLKSFIQIANAFRLPGITDDALRLTLFPFSLKDQARAWLNAFPPGSITTWGSLVE

Query:  KFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELW
        KFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKC NHGLPAC QIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELW
Subjt:  KFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELW

Query:  CSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNN
        CSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLAT IQPVQSDYCT APVCQVNDLIC                           
Subjt:  CSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNN

Query:  RNFNPYLNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPPQQQYNQRTQTPPVQNNNSNLENMMKEYMARTDAMIQSQAASMRNFE
                     WRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQ+IPPPQQQYNQRTQTPP+QNNNSNLENMMKEYMARTDA+IQSQAASMRNF 
Subjt:  RNFNPYLNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPPQQQYNQRTQTPPVQNNNSNLENMMKEYMARTDAMIQSQAASMRNFE

Query:  TQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRKGNEDTPSVPPQ
        TQLG LANELKNRPQGSFPGHTELP+REGKEQCKAVTLRSGL YDGPTMPTTDVQIPST+P VKIPENPTTPEKENIRKGN+DT SVPPQ
Subjt:  TQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRKGNEDTPSVPPQ

A0A6J1DY39 uncharacterized protein LOC111025653 | e-value: 5.1e-171 | identity: 51.85%
Query:  ARNDEFNHIQMADNRDVAMREYAATAFQNFDS-----------------------------GIEHEDPHDHLKSFIQIANAFRLPGITDDALRLTLFPFS
        A N   N I +AD RD AMR+YAA   ++ +S                             G+EHEDP  HLKSFI++AN FRLPGI+DDALRLTLFPFS
Subjt:  ARNDEFNHIQMADNRDVAMREYAATAFQNFDS-----------------------------GIEHEDPHDHLKSFIQIANAFRLPGITDDALRLTLFPFS

Query:  LKDQARAWLNAFPPGSITTWGSLVEKFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAAN
        +  QA AWLNAFP  +ITTW  +V+KFL K+FPPTR+AD+REEIISFRQ + E V+ AWERFK+LI  CPN G+PAC+QIEHFFRG D  TKMMLN AAN
Subjt:  LKDQARAWLNAFPPGSITTWGSLVEKFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAAN

Query:  GAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSF
        G FT K+FNEIV+IL+ L+ HN  WCS++SR   K+ DPAGVLALD  TSMQK++ T+ Q LK M    KN          ++   P+PV Q+ +  C +
Subjt:  GAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSF

Query:  CSENHIYDNCPHNPASVFYVGHGNNRNFNPYLNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVP---PTQQYIPPPQQQYN-QRTQTPPVQNNN
        C + H  +NCP NP+S++YVG  N + FNPY NTYNPGW+ HPNFSW GQG S+    G +QQ K+ Y P   P     PP   QYN Q+    P Q N 
Subjt:  CSENHIYDNCPHNPASVFYVGHGNNRNFNPYLNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVP---PTQQYIPPPQQQYN-QRTQTPPVQNNN

Query:  SNLENMMKEYMARTDAMIQS---------------------QAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMP
        SN+E +MKE + + DA ++                         ++R  E QLGQL NE++ RPQGS P  TE P+R GKE C ++  RSGL Y+GP MP
Subjt:  SNLENMMKEYMARTDAMIQS---------------------QAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMP

Query:  TTDVQIPSTEPIVKIPENPTTPEKENIRKGNEDTPSVP--PQVQPSFTPTPPFPQRLVKKNNDSQFRKFLDILKQLHINIPLVDALEQMPNYAKFLKDIV
              PS E      +    P+K       E   SVP  PQV  S  P PPFPQRLV+KN D+ FRKFLDILKQLHINIP V+ALEQMP YAKF+KDI+
Subjt:  TTDVQIPSTEPIVKIPENPTTPEKENIRKGNEDTPSVP--PQVQPSFTPTPPFPQRLVKKNNDSQFRKFLDILKQLHINIPLVDALEQMPNYAKFLKDIV

Query:  SRKKKIGEHELVAMTKCSSEAVGSPLPMKCNDPGSFTIPCSIGGKNLG
        +RKKK+GE+E VA+T+CSS    S +P K  DPGSFTIPC IGGK++G
Subjt:  SRKKKIGEHELVAMTKCSSEAVGSPLPMKCNDPGSFTIPCSIGGKNLG

A0A6J1DYG0 uncharacterized protein LOC111025764 | e-value: 1.7e-174 | identity: 94.75%
Query:  MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQ
        MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIA+SMQKE VTMNQRLKEM LG+KNPLATPIQPVQSDYCTPAPVCQ
Subjt:  MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQ

Query:  VNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNPYLNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPPQQQYNQRTQTPPV
        VNDLICSFCSENHIYD CPHNPASVFYVGHGNNRNFNPY NTYNPGWRHHPNFSW GQGGS GFNQGQSQQNKQPYVPPTQQYIPPPQQ+YNQRTQTPPV
Subjt:  VNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNPYLNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPPQQQYNQRTQTPPV

Query:  QNNNSNLENMMKEYMARTDAMIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPE
        QNNNSNLENMMKEYMARTDA+IQSQAASMRNFETQLGQLANELKNRPQGSFP HTELPKREGKEQCKAVTLRSGLAYD PTMPT DVQIPST P VKIPE
Subjt:  QNNNSNLENMMKEYMARTDAMIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPE

Query:  NPTTPEKENIRKGNEDTPSVPPQV
        NPTTPEK NIRKGNEDTPSVPPQ+
Subjt:  NPTTPEKENIRKGNEDTPSVPPQV

SwissProt top hits (e-value, % identity, alignment)
No hits found
Arabidopsis top hits (e-value, % identity, alignment)
AT4G00980.1 zinc knuckle (CCHC-type) family protein | e-value: 8.9e-11 | identity: 31%
Query:  IWDTLESRYGGDDSGRKKYVVGKWLQFQMADDKPIMDQVHEYENLVANVLSEGMKICEILQENVLLKKFPPSWSDYRNPLKHKKKYLTLQELISHMRTEE
        +WD L+  Y  D+S  K+  V K+++F+M +++PI++QV  +  +  +++S GM + E    + ++ KFPPSW  +   L  +++YL +  L+  ++ EE
Subjt:  IWDTLESRYGGDDSGRKKYVVGKWLQFQMADDKPIMDQVHEYENLVANVLSEGMKICEILQENVLLKKFPPSWSDYRNPLKHKKKYLTLQELISHMRTEE


Sequences
CDS sequence
ATGCTCGGCCTCGTGTCGCCCTGGGCACGGCCTTCCTACGGAAGGTGTTTACGTGGTTCAATATCGAGGTTCGTCTTGCAAGAGGATTATCCTCAAGCTCCTGTG
CCTAACGCCACTGTGGCGGTGCGCAACATCTATGACAGGTGGATAAAGGCCAATGACAAGGCCAAGGTCTACATCTTGGCGAGCATATCTGATGTGCTTGCCAAG
AAGTACGAGAACACAGTCACCACTAAGGAGATCATGGACTCGCTGCAGAGCATGTTTGATGTTAAACCGATTCATTGGATTAGTCTCGATCAAGCTAATTCTAGA
GATGAGGACGTTACTATTGACCGATTCTACAAAGGGCTTAATGAGAAAATTGCTGAGATTGCCAAATGCGAGGAAATTCAACCATGTGATAGTTTGGAAGAGATG
GTGCACATTACCATTGAGGATCTAGGAGTGACCATCGTAGATTCTAAGTCCGATCTAAGAGTGATTCATCTTCTAACCTTACTCTTGGTTGGAATCATTCGAGCA
TTGGTCGACATCCTTTGGCCGGCTCTAGAGGATGGACTACCACAAGTACCGCAAGATCCTAATACGGTGATTCTTCAAGCAATTCAAGATATGTATCTCCATGCT
ATCAAAATTGAAGATCAATTGAGGAAAGAAAAGCTGCTGCATTCCAAATGGCATGAATCCATGCAACCAAAAGAGAAGTTTGGTGCTGTCAAACGAGTGGAGATC
GAGAGCTCCAATACTAAAAAGAATCAAGCTCCAAAGGAGGTAGGGGAGAAGACTAATTGTCTAAAATGTTGGAAGTGCAAAGGGTTTGGGCACATGAACAAAGAA
TGTGTCAATAAAGAAGTCAGGGTGATAAGAAATGGGGTCATTGATTCAAATGATGCTTGTGAGAAATATGATGCAAAATTTGAGGAAGAGTCTAAAGCACATGTT
GACGAATTTATTGAAAAAGATCGCAAAGATTCTGATTTTGATGAAAGTGAGATGAAAGAAAAGAGAGAAAGAGAAGAGTTAAGAGAAAGGAATAGAAAGGAGAAA
GAAGAGGAAAAGAGCATGATCCAATCCATTTGTCCATATAAGCCAAATCTTATAGCATTGTTGCATAACGACCCTTACTATCAAGCTCACATATTTTATCCTAGT
TTTTCTACTTTCCTAAGATTTTTGCAGGAAGTTCCAACTTTTCAATTCAAGTGTGGTCGAATACAAGTAACAAAGGAAGCTATAAAGGAGCCAACGAGAGTTATC
CTATCATCTTATCATGATCTTGAGATTGTCTTACAAGGTAACCATGGACATAAGAAACCTAAGGTGAAAGGCATTGTGGAAAAGTACTTGTCCAAGAATTTGCAT
AAAGAATTCCATGGTCAAGAAAATAGAATGTTTGATGTTAAACCGATTCATTGGATTAGTCTCGATGAAGCTAATTCTGGAGACCATTTGGGAGTGCAAATTAAT
CAAAAGAAGCGAAAAGACGGAAAAACCTACCTAGGAGGCGCCAGGCGCCTGGGAAGCCTGCAGAAAACAGTTTTTCTTCCAACTTTGCCCTTAATGAAACGCGTC
TTCCAATGCGTTTTGGTGGTTCCAACCGATGCATACGGTAAGTTAAAGTGCATGAGCACAAGATCTTTTCTTCTACCCCTTGACCCTGAGATTGAGCGGACCCTT
CGAAAAACTCGAAAGGAGCAAAGACTTCGAAAACAACTAGAAAAGCAAAAAGAGAGAGAAGGTGAAATCAGTCCTGAAAGTGAGGTAGAGAGTACGAGCACATCA
GTGGCAGATATTCCACCTCGTGATCCGGTTGATCCACCTGCTGTTAACGGTAATATGAGGGATCATGCGAGAAATGATGAATTCAACCATATCCAGATGGCGGAC
AACAGAGACGTGGCAATGCGAGAATATGCCGCCACGGCTTTTCAGAACTTTGATTCAGGGATAGAACATGAGGATCCACATGATCATCTGAAATCATTCATTCAA
ATTGCAAATGCATTTCGATTACCTGGTATAACAGACGATGCTCTGAGGCTAACACTTTTTCCATTTTCTTTGAAGGACCAAGCTAGAGCATGGCTCAATGCATTT
CCACCGGGATCCATCACCACATGGGGGTCGTTAGTGGAGAAGTTCTTAACAAAATTCTTCCCACCTACTCGCCACGCTGATATCAGAGAGGAGATCATCTCCTTT
AGACAGTATGATCGTGAACCTGTTCACGAGGCGTGGGAGAGATTTAAAGAACTAATCAGGAAATGTCCGAACCATGGCTTGCCGGCATGCATCCAGATAGAACAT
TTCTTTAGAGGTTTAGATCATCCTACTAAGATGATGCTAAACAATGCTGCCAACGGAGCCTTTACAAAGAAGACATTCAACGAGATAGTTGACATCCTAAATGAC
TTAGCTTCACACAACGAACTATGGTGTTCGCAAAGATCTAGGGCAGCACCAAAGAAGCAAGATCCAGCTGGAGTTTTGGCTCTGGACATTGCGACCTCGATGCAA
AAAGAGATGGTTACAATGAACCAGAGGCTGAAAGAGATGGCATTGGGAATAAAAAATCCATTAGCCACGCCGATACAACCTGTGCAGTCGGATTATTGCACTCCT
GCCCCTGTTTGCCAAGTCAACGATCTCATTTGTTCATTTTGCAGTGAAAACCATATTTATGACAATTGTCCACATAACCCTGCTTCTGTTTTTTATGTAGGACAT
GGGAACAATAGGAACTTTAACCCATATTTGAACACCTACAACCCAGGTTGGAGGCACCACCCTAATTTCTCATGGGGAGGACAAGGAGGTTCGAGCGGTTTTAAT
CAAGGGCAGAGCCAGCAGAATAAGCAGCCCTATGTTCCACCTACACAACAATACATCCCACCACCGCAACAGCAGTACAATCAGAGAACACAGACTCCACCAGTT
CAAAATAACAACTCAAATCTTGAGAATATGATGAAGGAGTACATGGCCCGAACCGACGCAATGATACAATCCCAAGCGGCATCAATGAGGAATTTCGAGACCCAA
TTGGGACAGCTCGCCAATGAATTGAAGAATAGACCACAAGGTTCTTTTCCAGGCCATACTGAATTACCAAAACGAGAAGGGAAAGAACAGTGCAAAGCTGTCACC
CTCAGGAGTGGACTGGCATATGATGGACCAACAATGCCAACAACAGATGTACAGATTCCGTCCACTGAACCAATTGTAAAGATACCAGAAAATCCAACAACACCA
GAAAAAGAAAATATTAGAAAAGGTAATGAGGACACCCCGAGTGTTCCTCCACAGGTACAGCCTTCTTTTACCCCTACTCCCCCGTTTCCACAAAGATTAGTTAAG
AAAAATAATGATTCCCAGTTTAGAAAATTTTTAGATATTTTAAAACAATTGCATATAAATATACCATTAGTAGATGCTCTAGAACAGATGCCAAATTATGCTAAG
TTTTTGAAAGATATAGTTTCTAGGAAGAAAAAAATAGGAGAGCATGAACTGGTAGCCATGACAAAATGTAGTAGTGAAGCTGTAGGCAGCCCGCTACCCATGAAA
TGTAATGATCCTGGCAGTTTTACCATTCCATGTTCCATAGGAGGAAAAAACTTAGGAGACTTTGAAGAGTGCTCTGCTATAAATAGCTTGAATCCTGTTATGTTT
GATGAGTTTTATGACTTATTAGTTACAGAGATTGAAGAAGAGCTTGATAAGATAGCAGAAGGACCAGAAGATGTGGCTAATCCTATTGAAAAAATACAAAAAGAA
GAATGCAAGTCGTTACTTCCGTCCATAGTGGAACCACCCACGTTGGAGCAGAAGCCATTGCCGTCGCATTTGAAATATGCGTATCTAGGGGATAACGACACTTTA
CCAGTTCGAGAAGTCGTGCAACATATCTACAACTTACGGACTTCATTGGAATTTGCGGTTTTACCTTCATGGCCTCCAGCGCTAGCTGCTATCCTTAGGGCTAGT
AGCCGGTCTTTCAGATGTAAAAGTCTTAGTGTAAGGAGGGAGTGTGCAGATTCCTTAGGGGACCATTTGGGAGTGCAAATTAATCAAAAGAAGCGAAAAGACGGA
AAAACCTACCTAGGAGGCGCCAGGCGCCTGGGAAGCCTGCAGAAAACAGTTTTTCTTCCAACTTTGCCCTTAATGAAACGCGTCTTCCAATGCGTTTTGGTGGTT
CCAACCGATGCATACGTTATTTGGGACACATTGGAGTCAAGATATGGTGGAGATGATTCCGGGAGGAAAAAGTATGTGGTTGGCAAATGGCTGCAGTTTCAAATG
GCAGATGACAAACCAATCATGGACCAAGTTCACGAATACGAGAACTTGGTGGCTAATGTTCTATCCGAGGGTATGAAGATATGCGAAATACTCCAGGAAAATGTA
CTGCTCAAGAAATTTCCCCCTTCCTGGAGTGATTATCGCAATCCTCTAAAACACAAGAAAAAATATTTAACATTACAAGAATTGATCAGTCATATGCGCACAGAA
GAAGCCAATAGACAAAAAGATATGCTCTCTTCTCAGTTTGTCAATTCAGTTAATGCTAACTTAATTGAATTTTCTGTTGCAAATAAATATAGGTTCAAAGGCAAA
GGAAAACTGGTTGCCAAGGAAAGAATCCAAAAGAAGAAGGGATTTCAATTCAAGACTTCCAGTGGAAGAATTTAA
mRNA sequence
ATGCTCGGCCTCGTGTCGCCCTGGGCACGGCCTTCCTACGGAAGGTGTTTACGTGGTTCAATATCGAGGTTCGTCTTGCAAGAGGATTATCCTCAAGCTCCTGTG
CCTAACGCCACTGTGGCGGTGCGCAACATCTATGACAGGTGGATAAAGGCCAATGACAAGGCCAAGGTCTACATCTTGGCGAGCATATCTGATGTGCTTGCCAAG
AAGTACGAGAACACAGTCACCACTAAGGAGATCATGGACTCGCTGCAGAGCATGTTTGATGTTAAACCGATTCATTGGATTAGTCTCGATCAAGCTAATTCTAGA
GATGAGGACGTTACTATTGACCGATTCTACAAAGGGCTTAATGAGAAAATTGCTGAGATTGCCAAATGCGAGGAAATTCAACCATGTGATAGTTTGGAAGAGATG
GTGCACATTACCATTGAGGATCTAGGAGTGACCATCGTAGATTCTAAGTCCGATCTAAGAGTGATTCATCTTCTAACCTTACTCTTGGTTGGAATCATTCGAGCA
TTGGTCGACATCCTTTGGCCGGCTCTAGAGGATGGACTACCACAAGTACCGCAAGATCCTAATACGGTGATTCTTCAAGCAATTCAAGATATGTATCTCCATGCT
ATCAAAATTGAAGATCAATTGAGGAAAGAAAAGCTGCTGCATTCCAAATGGCATGAATCCATGCAACCAAAAGAGAAGTTTGGTGCTGTCAAACGAGTGGAGATC
GAGAGCTCCAATACTAAAAAGAATCAAGCTCCAAAGGAGGTAGGGGAGAAGACTAATTGTCTAAAATGTTGGAAGTGCAAAGGGTTTGGGCACATGAACAAAGAA
TGTGTCAATAAAGAAGTCAGGGTGATAAGAAATGGGGTCATTGATTCAAATGATGCTTGTGAGAAATATGATGCAAAATTTGAGGAAGAGTCTAAAGCACATGTT
GACGAATTTATTGAAAAAGATCGCAAAGATTCTGATTTTGATGAAAGTGAGATGAAAGAAAAGAGAGAAAGAGAAGAGTTAAGAGAAAGGAATAGAAAGGAGAAA
GAAGAGGAAAAGAGCATGATCCAATCCATTTGTCCATATAAGCCAAATCTTATAGCATTGTTGCATAACGACCCTTACTATCAAGCTCACATATTTTATCCTAGT
TTTTCTACTTTCCTAAGATTTTTGCAGGAAGTTCCAACTTTTCAATTCAAGTGTGGTCGAATACAAGTAACAAAGGAAGCTATAAAGGAGCCAACGAGAGTTATC
CTATCATCTTATCATGATCTTGAGATTGTCTTACAAGGTAACCATGGACATAAGAAACCTAAGGTGAAAGGCATTGTGGAAAAGTACTTGTCCAAGAATTTGCAT
AAAGAATTCCATGGTCAAGAAAATAGAATGTTTGATGTTAAACCGATTCATTGGATTAGTCTCGATGAAGCTAATTCTGGAGACCATTTGGGAGTGCAAATTAAT
CAAAAGAAGCGAAAAGACGGAAAAACCTACCTAGGAGGCGCCAGGCGCCTGGGAAGCCTGCAGAAAACAGTTTTTCTTCCAACTTTGCCCTTAATGAAACGCGTC
TTCCAATGCGTTTTGGTGGTTCCAACCGATGCATACGGTAAGTTAAAGTGCATGAGCACAAGATCTTTTCTTCTACCCCTTGACCCTGAGATTGAGCGGACCCTT
CGAAAAACTCGAAAGGAGCAAAGACTTCGAAAACAACTAGAAAAGCAAAAAGAGAGAGAAGGTGAAATCAGTCCTGAAAGTGAGGTAGAGAGTACGAGCACATCA
GTGGCAGATATTCCACCTCGTGATCCGGTTGATCCACCTGCTGTTAACGGTAATATGAGGGATCATGCGAGAAATGATGAATTCAACCATATCCAGATGGCGGAC
AACAGAGACGTGGCAATGCGAGAATATGCCGCCACGGCTTTTCAGAACTTTGATTCAGGGATAGAACATGAGGATCCACATGATCATCTGAAATCATTCATTCAA
ATTGCAAATGCATTTCGATTACCTGGTATAACAGACGATGCTCTGAGGCTAACACTTTTTCCATTTTCTTTGAAGGACCAAGCTAGAGCATGGCTCAATGCATTT
CCACCGGGATCCATCACCACATGGGGGTCGTTAGTGGAGAAGTTCTTAACAAAATTCTTCCCACCTACTCGCCACGCTGATATCAGAGAGGAGATCATCTCCTTT
AGACAGTATGATCGTGAACCTGTTCACGAGGCGTGGGAGAGATTTAAAGAACTAATCAGGAAATGTCCGAACCATGGCTTGCCGGCATGCATCCAGATAGAACAT
TTCTTTAGAGGTTTAGATCATCCTACTAAGATGATGCTAAACAATGCTGCCAACGGAGCCTTTACAAAGAAGACATTCAACGAGATAGTTGACATCCTAAATGAC
TTAGCTTCACACAACGAACTATGGTGTTCGCAAAGATCTAGGGCAGCACCAAAGAAGCAAGATCCAGCTGGAGTTTTGGCTCTGGACATTGCGACCTCGATGCAA
AAAGAGATGGTTACAATGAACCAGAGGCTGAAAGAGATGGCATTGGGAATAAAAAATCCATTAGCCACGCCGATACAACCTGTGCAGTCGGATTATTGCACTCCT
GCCCCTGTTTGCCAAGTCAACGATCTCATTTGTTCATTTTGCAGTGAAAACCATATTTATGACAATTGTCCACATAACCCTGCTTCTGTTTTTTATGTAGGACAT
GGGAACAATAGGAACTTTAACCCATATTTGAACACCTACAACCCAGGTTGGAGGCACCACCCTAATTTCTCATGGGGAGGACAAGGAGGTTCGAGCGGTTTTAAT
CAAGGGCAGAGCCAGCAGAATAAGCAGCCCTATGTTCCACCTACACAACAATACATCCCACCACCGCAACAGCAGTACAATCAGAGAACACAGACTCCACCAGTT
CAAAATAACAACTCAAATCTTGAGAATATGATGAAGGAGTACATGGCCCGAACCGACGCAATGATACAATCCCAAGCGGCATCAATGAGGAATTTCGAGACCCAA
TTGGGACAGCTCGCCAATGAATTGAAGAATAGACCACAAGGTTCTTTTCCAGGCCATACTGAATTACCAAAACGAGAAGGGAAAGAACAGTGCAAAGCTGTCACC
CTCAGGAGTGGACTGGCATATGATGGACCAACAATGCCAACAACAGATGTACAGATTCCGTCCACTGAACCAATTGTAAAGATACCAGAAAATCCAACAACACCA
GAAAAAGAAAATATTAGAAAAGGTAATGAGGACACCCCGAGTGTTCCTCCACAGGTACAGCCTTCTTTTACCCCTACTCCCCCGTTTCCACAAAGATTAGTTAAG
AAAAATAATGATTCCCAGTTTAGAAAATTTTTAGATATTTTAAAACAATTGCATATAAATATACCATTAGTAGATGCTCTAGAACAGATGCCAAATTATGCTAAG
TTTTTGAAAGATATAGTTTCTAGGAAGAAAAAAATAGGAGAGCATGAACTGGTAGCCATGACAAAATGTAGTAGTGAAGCTGTAGGCAGCCCGCTACCCATGAAA
TGTAATGATCCTGGCAGTTTTACCATTCCATGTTCCATAGGAGGAAAAAACTTAGGAGACTTTGAAGAGTGCTCTGCTATAAATAGCTTGAATCCTGTTATGTTT
GATGAGTTTTATGACTTATTAGTTACAGAGATTGAAGAAGAGCTTGATAAGATAGCAGAAGGACCAGAAGATGTGGCTAATCCTATTGAAAAAATACAAAAAGAA
GAATGCAAGTCGTTACTTCCGTCCATAGTGGAACCACCCACGTTGGAGCAGAAGCCATTGCCGTCGCATTTGAAATATGCGTATCTAGGGGATAACGACACTTTA
CCAGTTCGAGAAGTCGTGCAACATATCTACAACTTACGGACTTCATTGGAATTTGCGGTTTTACCTTCATGGCCTCCAGCGCTAGCTGCTATCCTTAGGGCTAGT
AGCCGGTCTTTCAGATGTAAAAGTCTTAGTGTAAGGAGGGAGTGTGCAGATTCCTTAGGGGACCATTTGGGAGTGCAAATTAATCAAAAGAAGCGAAAAGACGGA
AAAACCTACCTAGGAGGCGCCAGGCGCCTGGGAAGCCTGCAGAAAACAGTTTTTCTTCCAACTTTGCCCTTAATGAAACGCGTCTTCCAATGCGTTTTGGTGGTT
CCAACCGATGCATACGTTATTTGGGACACATTGGAGTCAAGATATGGTGGAGATGATTCCGGGAGGAAAAAGTATGTGGTTGGCAAATGGCTGCAGTTTCAAATG
GCAGATGACAAACCAATCATGGACCAAGTTCACGAATACGAGAACTTGGTGGCTAATGTTCTATCCGAGGGTATGAAGATATGCGAAATACTCCAGGAAAATGTA
CTGCTCAAGAAATTTCCCCCTTCCTGGAGTGATTATCGCAATCCTCTAAAACACAAGAAAAAATATTTAACATTACAAGAATTGATCAGTCATATGCGCACAGAA
GAAGCCAATAGACAAAAAGATATGCTCTCTTCTCAGTTTGTCAATTCAGTTAATGCTAACTTAATTGAATTTTCTGTTGCAAATAAATATAGGTTCAAAGGCAAA
GGAAAACTGGTTGCCAAGGAAAGAATCCAAAAGAAGAAGGGATTTCAATTCAAGACTTCCAGTGGAAGAATTTAA
Protein sequence
MLGLVSPWARPSYGRCLRGSISRFVLQEDYPQAPVPNATVAVRNIYDRWIKANDKAKVYILASISDVLAKKYENTVTTKEIMDSLQSMFDVKPIHWISLDQANSR
DEDVTIDRFYKGLNEKIAEIAKCEEIQPCDSLEEMVHITIEDLGVTIVDSKSDLRVIHLLTLLLVGIIRALVDILWPALEDGLPQVPQDPNTVILQAIQDMYLHA
IKIEDQLRKEKLLHSKWHESMQPKEKFGAVKRVEIESSNTKKNQAPKEVGEKTNCLKCWKCKGFGHMNKECVNKEVRVIRNGVIDSNDACEKYDAKFEEESKAHV
DEFIEKDRKDSDFDESEMKEKREREELRERNRKEKEEEKSMIQSICPYKPNLIALLHNDPYYQAHIFYPSFSTFLRFLQEVPTFQFKCGRIQVTKEAIKEPTRVI
LSSYHDLEIVLQGNHGHKKPKVKGIVEKYLSKNLHKEFHGQENRMFDVKPIHWISLDEANSGDHLGVQINQKKRKDGKTYLGGARRLGSLQKTVFLPTLPLMKRV
FQCVLVVPTDAYGKLKCMSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSVADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMAD
NRDVAMREYAATAFQNFDSGIEHEDPHDHLKSFIQIANAFRLPGITDDALRLTLFPFSLKDQARAWLNAFPPGSITTWGSLVEKFLTKFFPPTRHADIREEIISF
RQYDREPVHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQ
KEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNPYLNTYNPGWRHHPNFSWGGQGGSSGFN
QGQSQQNKQPYVPPTQQYIPPPQQQYNQRTQTPPVQNNNSNLENMMKEYMARTDAMIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVT
LRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRKGNEDTPSVPPQVQPSFTPTPPFPQRLVKKNNDSQFRKFLDILKQLHINIPLVDALEQMPNYAK
FLKDIVSRKKKIGEHELVAMTKCSSEAVGSPLPMKCNDPGSFTIPCSIGGKNLGDFEECSAINSLNPVMFDEFYDLLVTEIEEELDKIAEGPEDVANPIEKIQKE
ECKSLLPSIVEPPTLEQKPLPSHLKYAYLGDNDTLPVREVVQHIYNLRTSLEFAVLPSWPPALAAILRASSRSFRCKSLSVRRECADSLGDHLGVQINQKKRKDG
KTYLGGARRLGSLQKTVFLPTLPLMKRVFQCVLVVPTDAYVIWDTLESRYGGDDSGRKKYVVGKWLQFQMADDKPIMDQVHEYENLVANVLSEGMKICEILQENV
LLKKFPPSWSDYRNPLKHKKKYLTLQELISHMRTEEANRQKDMLSSQFVNSVNANLIEFSVANKYRFKGKGKLVAKERIQKKKGFQFKTSSGRI