; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc06g20600 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc06g20600
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionReverse transcriptase
Genome locationchr6:16051231..16063451
RNA-Seq ExpressionMoc06g20600
SyntenyMoc06g20600
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022142953.1 uncharacterized protein LOC111012947 [Momordica charantia]1.8e-23352.65Show/hide
Query:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEK-QKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAA
        MST S+LLP DPEIE+T ++ R+EQRL+KQ  K QKE+E E                              +    A N   N I +AD RD AMR+YAA
Subjt:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEK-QKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAA

Query:  TAFQNFDSGIVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPGIIDDALRLTLFPFSLKDQARAWLNAFPPGSITTWGSLV
           ++ +S + N  PA A FE KPMM QML  IG F G EHEDP  HLKSFI++AN FRLPGI DDALRLTLFPFSL  QA AWLNAFP  +I T   +V
Subjt:  TAFQNFDSGIVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPGIIDDALRLTLFPFSLKDQARAWLNAFPPGSITTWGSLV

Query:  EKFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNEL
        +KFL K+FPPTR+AD+REEIISFRQ + E V+ AWERFK+LIR CPN G+PAC+QIEHFFR  D PT MMLN AANG FT K+FNEIV+IL+ L+ HN+ 
Subjt:  EKFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNEL

Query:  WCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGN
        WCS++ R   K+ DPA VLALD  TSMQK++ T+ Q LK M    KN  A  + P  ++   P+PV Q+ +  C                         N
Subjt:  WCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGN

Query:  NRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVP---PTQQYIPPPQQQYN-QRTQTPPVHNNNSNLENMMKEY-----------MAR
         + FNPYSN YNPGW+ HPNFSW GQG SSG   GQ+QQ KQ Y P   P     PP  QQYN Q+    P   N SN+E +MKE+           M R
Subjt:  NRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVP---PTQQYIPPPQQQYN-QRTQTPPVHNNNSNLENMMKEY-----------MAR

Query:  TDAVIQSQA----------ASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEK
        TDA I+              ++RN E QLGQLANE++ RPQGS P  TE P+R              +    P+    D Q+        +P+    PE 
Subjt:  TDAVIQSQA----------ASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEK

Query:  ENIRKGNEDTPSVPPQVQPSFTPTPSFPQRLVKKNNDSQFRKFLDILKQLHINIPLVDALEQMPNYAKFLKDIVSRKKKIGEHELVAMIKCSSEAVGSPL
                 + SV PQV    +P P FPQRLV+KN D+ FRKFLDILKQLHINIP V+ALEQMP YAKFLKDI++RKKK+GE+E VA+ +CSS    S  
Subjt:  ENIRKGNEDTPSVPPQVQPSFTPTPSFPQRLVKKNNDSQFRKFLDILKQLHINIPLVDALEQMPNYAKFLKDIVSRKKKIGEHELVAMIKCSSEAVGSPL

Query:  PMKCNDPGSFTIPCSIGGKNLGRALCDLGASINLMPLSVFKSLGIGEARPTTVTLQLADRSIIRPEGKIKDVLVQVDKFIFPADFIILDCEADLDVPIIL
        P K  DPGSFTI C IGGK++GRALCDLGA INLMPLS+FK L IG+A PTTVTL LADRSI +PEGKI+DVLV+VDKFIFPADFIILDCEAD DVPIIL
Subjt:  PMKCNDPGSFTIPCSIGGKNLGRALCDLGASINLMPLSVFKSLGIGEARPTTVTLQLADRSIIRPEGKIKDVLVQVDKFIFPADFIILDCEADLDVPIIL

Query:  GRPFLATGDTVFNVRKGEITMRVNNEEVKFNVLDAMKLPGDFEECSAINSLNPIMFDEFYDLLVTEIEEELDKIAEGPEDVANPIEKIQKEECKSLLPSI
        GRPFLATG+T+ +V+KGE+TMRV++++V FN+LDAMK P D EEC  I+    I   E  DLL  EIE EL+   E  ++    I  ++KE+ KS+ P  
Subjt:  GRPFLATGDTVFNVRKGEITMRVNNEEVKFNVLDAMKLPGDFEECSAINSLNPIMFDEFYDLLVTEIEEELDKIAEGPEDVANPIEKIQKEECKSLLPSI

Query:  VEPP
        +EPP
Subjt:  VEPP

XP_022158357.1 uncharacterized protein LOC111024860 [Momordica charantia]6.2e-18194.85Show/hide
Query:  MASSSQHTSLPTSSTQAPNATSIPFPPLENFQHHMGSDSRLAAVRGGNPLQTFECPPSQAPTQHPIVSPHGYVNFQQLPTFNIPQNSEFRAENPQQLPPM
        MASSSQHTSLPTSST APNATSIPFPPLENFQHHM SDSRLAAVRGGNPLQTFECPPSQAPTQHPI+SPHGYVNFQQLPT NIPQNSEFRAENPQQLPPM
Subjt:  MASSSQHTSLPTSSTQAPNATSIPFPPLENFQHHMGSDSRLAAVRGGNPLQTFECPPSQAPTQHPIVSPHGYVNFQQLPTFNIPQNSEFRAENPQQLPPM

Query:  INPGMYQPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMLSPRVSPPPQLPFLERGPQAPQLNPNISSTFNMGQLKP
        INPGMYQPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHW+PP NYGM+SP+V PPPQLPFLERGPQAPQL+PNI ST+NMGQLKP
Subjt:  INPGMYQPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMLSPRVSPPPQLPFLERGPQAPQLNPNISSTFNMGQLKP

Query:  LEPPRMPTPTNMPMDAGDEHGGEQEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTPKDSSMENRDEEQFYSSPLIITPEDGNDDFLLVSRGH
        LEPPRMPTPTNMPMDAGDEHGGEQEK HSHRLEPGVSIGQKRKGKEVM DPEIEEDGSSRRLTPKDS+MENRDEEQFYSS  IITPEDGNDDFLLVSRGH
Subjt:  LEPPRMPTPTNMPMDAGDEHGGEQEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTPKDSSMENRDEEQFYSSPLIITPEDGNDDFLLVSRGH

Query:  CSNMPETEDTENEVVRTDTQEPSPLDTPTE
        CSNMPETEDTENEVVRTDTQEPSPLDTPTE
Subjt:  CSNMPETEDTENEVVRTDTQEPSPLDTPTE

XP_022158408.1 uncharacterized protein LOC111024897, partial [Momordica charantia]9.5e-29989.15Show/hide
Query:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAAT
        MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLE QKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFN+IQMADNRDVAMREYAAT
Subjt:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAAT

Query:  AFQNFDSGIVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPGIIDDALRLTLFPFSLKDQARAWLNAFPPGSITTWGSLVE
        AFQNFDSGIVNPIPAH NFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPGI DDA  LTLFPFSLKDQAR  LNAFP GSITTWGSLVE
Subjt:  AFQNFDSGIVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPGIIDDALRLTLFPFSLKDQARAWLNAFPPGSITTWGSLVE

Query:  KFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELW
        KFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKC NHGLPAC QIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELW
Subjt:  KFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELW

Query:  CSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNN
        CSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLAT IQPVQSDYCT APVCQVNDLIC                           
Subjt:  CSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNN

Query:  RNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPPQQQYNQRTQTPPVHNNNSNLENMMKEYMARTDAVIQSQAASMRNFE
                     WRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQ+IPPPQQQYNQRTQTPP+ NNNSNLENMMKEYMARTDAVIQSQAASMRNF 
Subjt:  RNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPPQQQYNQRTQTPPVHNNNSNLENMMKEYMARTDAVIQSQAASMRNFE

Query:  TQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRKGNEDTPSVPPQ
        TQLG LANELKNRPQGSFPGHTELP+REGKEQCKAVTLRSGL YDGPTMPTTDVQIPST+P VKIPENPTTPEKENIRKGN+DT SVPPQ
Subjt:  TQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRKGNEDTPSVPPQ

XP_022159235.1 uncharacterized protein LOC111025653 [Momordica charantia]9.0e-24956.14Show/hide
Query:  ARNDEFNHIQMADNRDVAMREYAATAFQNFDSGIVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPGIIDDALRLTLFPFS
        A N   N I +AD RD AMR+YAA   ++ +S ++N  PA A FE KPMM QML  IG FGG EHEDP  HLKSFI++AN FRLPGI DDALRLTLFPFS
Subjt:  ARNDEFNHIQMADNRDVAMREYAATAFQNFDSGIVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPGIIDDALRLTLFPFS

Query:  LKDQARAWLNAFPPGSITTWGSLVEKFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAAN
        +  QA AWLNAFP  +ITTW  +V+KFL K+FPPTR+AD+REEIISFRQ + E V+ AWERFK+LI  CPN G+PAC+QIEHFFRG D  TKMMLN AAN
Subjt:  LKDQARAWLNAFPPGSITTWGSLVEKFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAAN

Query:  GAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSF
        G FT K+FNEIV+IL+ L+ HN  WCS++SR   K+ DPAGVLALD  TSMQK++ T+ Q LK M    KN          ++   P+PV Q+ +  C +
Subjt:  GAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSF

Query:  CSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVP---PTQQYIPPPQQQYN-QRTQTPPVHNNN
        C + H  +NCP NP+S++YVG  N + FNPYSNTYNPGW+ HPNFSW GQG S+    G +QQ K+ Y P   P     PP   QYN Q+    P   N 
Subjt:  CSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVP---PTQQYIPPPQQQYN-QRTQTPPVHNNN

Query:  SNLENMMKEYMARTDAVIQS---------------------QAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMP
        SN+E +MKE + + DA ++                         ++R  E QLGQL NE++ RPQGS P  TE P+R GKE C ++  RSGL Y+GP MP
Subjt:  SNLENMMKEYMARTDAVIQS---------------------QAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMP

Query:  TTDVQIPSTEPIVKIPENPTTPEKENIRKGNEDTPSVP--PQVQPSFTPTPSFPQRLVKKNNDSQFRKFLDILKQLHINIPLVDALEQMPNYAKFLKDIV
              PS E      +    P+K       E   SVP  PQV  S  P P FPQRLV+KN D+ FRKFLDILKQLHINIP V+ALEQMP YAKF+KDI+
Subjt:  TTDVQIPSTEPIVKIPENPTTPEKENIRKGNEDTPSVP--PQVQPSFTPTPSFPQRLVKKNNDSQFRKFLDILKQLHINIPLVDALEQMPNYAKFLKDIV

Query:  SRKKKIGEHELVAMIKCSSEAVGSPLPMKCNDPGSFTIPCSIGGKNLGRALCDLGASINLMPLSVFKSLGIGEARPTTVTLQLADRSIIRPEGKIKDVLV
        +RKKK+GE+E VA+ +CSS    S +P K  DPGSFTIPC IGGK++GRALCDLGASINLMPLS+FK   IG+A PTTVTLQLADRSI +PEGKI+DVLV
Subjt:  SRKKKIGEHELVAMIKCSSEAVGSPLPMKCNDPGSFTIPCSIGGKNLGRALCDLGASINLMPLSVFKSLGIGEARPTTVTLQLADRSIIRPEGKIKDVLV

Query:  QVDKFIFPADFIILDCEADLDVPIILGRPFLATGDTVFNVRKGEITMRVNNEEVKFNVLDAMKLPGDFEECSAINSLNPIMFDEFYDLLVTEIEEELDKI
        +VDKFIFP DFIILDCEAD DVPIILGRPFLATG+T+ +V+KGE+TMRV++++V FN+LDAMK   D EEC+ I+    I   E  DLL  EIE EL+  
Subjt:  QVDKFIFPADFIILDCEADLDVPIILGRPFLATGDTVFNVRKGEITMRVNNEEVKFNVLDAMKLPGDFEECSAINSLNPIMFDEFYDLLVTEIEEELDKI

Query:  AEGPEDVANPIEKIQKEECKSLLPSIVEPP
         E  ++       ++KE+ KS+ P  +EPP
Subjt:  AEGPEDVANPIEKIQKEECKSLLPSIVEPP

XP_030505184.1 uncharacterized protein LOC115720166 [Cannabis sativa]7.3e-18247.02Show/hide
Query:  IQMADNRDVAMREYAATAFQNFDSGIVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPGIIDDALRLTLFPFSLKDQARAW
        I + D+R  A+REYAA  F   + GIV P      FELKP+MFQMLQT+G F     EDPH HL+SF++++++F++ G+ ++  RL LFPFSL+D+AR+W
Subjt:  IQMADNRDVAMREYAATAFQNFDSGIVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPGIIDDALRLTLFPFSLKDQARAW

Query:  LNAFPPGSITTWGSLVEKFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAANGAFTKKTF
        LN   P S+T W    EKFL K+FPPTR+A  R EI+SF Q + E   +AWERFKEL+RKCP+HG+P CIQ+E F+ GL+  ++M+L+ +ANGA   K++
Subjt:  LNAFPPGSITTWGSLVEKFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAANGAFTKKTF

Query:  NEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYD
        NE  +IL  +AS+N  W + R   AP  +  AGVL +D  T++  +M +M   LK +++G             S    PA   Q +D+ C FC E H ++
Subjt:  NEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYD

Query:  NCPHNPASVFYVGHGN-NRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPPQQQYNQRTQTPPVHNNNSNLENMMKEYM
         CP NP SV Y+G+ N NRN   +SN+YN  W++HPN SWG +     F+QG     +Q Y P   Q +  PQ   N +          S+LE++M++YM
Subjt:  NCPHNPASVFYVGHGN-NRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPPQQQYNQRTQTPPVHNNNSNLENMMKEYM

Query:  ARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRK---
        A+ DAVIQSQAA +RN E QLG LANELK RPQGS P  TE P+R+GKEQCK++ LRSG               P++  I +     T  E  + R+   
Subjt:  ARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRK---

Query:  --GNEDTPSVPPQVQPSFTPTPSFPQRLVKKNNDSQFRKFLDILKQLHINIPLVDALEQMPNYAKFLKDIVSRKKKIGEHELVAMIKCSSEAVGSPLPMK
          G +        V  S  P   FPQR  K+  D QF+KFLD+LKQLHINIPLV+ALEQMPNY KFLKDI+++K+++GE E   + +     + + +P K
Subjt:  --GNEDTPSVPPQVQPSFTPTPSFPQRLVKKNNDSQFRKFLDILKQLHINIPLVDALEQMPNYAKFLKDIVSRKKKIGEHELVAMIKCSSEAVGSPLPMK

Query:  CNDPGSFTIPCSIGGKNLGRALCDLGASINLMPLSVFKSLGIGEARPTTVTLQLADRSIIRPEGKIKDVLVQVDKFIFPADFIILDCEADLDVPIILGRP
          DPGSFTIP SIGG++                      LGIGEARPTTVTLQLADRS+  P+GKI+DVLVQVDKFIFPADFIILD E D +VPIIL RP
Subjt:  CNDPGSFTIPCSIGGKNLGRALCDLGASINLMPLSVFKSLGIGEARPTTVTLQLADRSIIRPEGKIKDVLVQVDKFIFPADFIILDCEADLDVPIILGRP

Query:  FLATGDTVFNVRKGEITMRVNNEEVKFNVLDAMKLPGDFEECSAINSLNPIMFDE
        FLATG T+ +V KGE+TMR  +E+  F V   ++ P    EC AI  ++P M +E
Subjt:  FLATGDTVFNVRKGEITMRVNNEEVKFNVLDAMKLPGDFEECSAINSLNPIMFDE

TrEMBL top hitse value%identityAlignment
A0A6J1CPJ3 uncharacterized protein LOC1110129472.0e-23352.65Show/hide
Query:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEK-QKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAA
        MST S+LLP DPEIE+T ++ R+EQRL+KQ  K QKE+E E                              +    A N   N I +AD RD AMR+YAA
Subjt:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEK-QKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAA

Query:  TAFQNFDSGIVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPGIIDDALRLTLFPFSLKDQARAWLNAFPPGSITTWGSLV
           ++ +S + N  PA A FE KPMM QML  IG F G EHEDP  HLKSFI++AN FRLPGI DDALRLTLFPFSL  QA AWLNAFP  +I T   +V
Subjt:  TAFQNFDSGIVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPGIIDDALRLTLFPFSLKDQARAWLNAFPPGSITTWGSLV

Query:  EKFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNEL
        +KFL K+FPPTR+AD+REEIISFRQ + E V+ AWERFK+LIR CPN G+PAC+QIEHFFR  D PT MMLN AANG FT K+FNEIV+IL+ L+ HN+ 
Subjt:  EKFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNEL

Query:  WCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGN
        WCS++ R   K+ DPA VLALD  TSMQK++ T+ Q LK M    KN  A  + P  ++   P+PV Q+ +  C                         N
Subjt:  WCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGN

Query:  NRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVP---PTQQYIPPPQQQYN-QRTQTPPVHNNNSNLENMMKEY-----------MAR
         + FNPYSN YNPGW+ HPNFSW GQG SSG   GQ+QQ KQ Y P   P     PP  QQYN Q+    P   N SN+E +MKE+           M R
Subjt:  NRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVP---PTQQYIPPPQQQYN-QRTQTPPVHNNNSNLENMMKEY-----------MAR

Query:  TDAVIQSQA----------ASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEK
        TDA I+              ++RN E QLGQLANE++ RPQGS P  TE P+R              +    P+    D Q+        +P+    PE 
Subjt:  TDAVIQSQA----------ASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEK

Query:  ENIRKGNEDTPSVPPQVQPSFTPTPSFPQRLVKKNNDSQFRKFLDILKQLHINIPLVDALEQMPNYAKFLKDIVSRKKKIGEHELVAMIKCSSEAVGSPL
                 + SV PQV    +P P FPQRLV+KN D+ FRKFLDILKQLHINIP V+ALEQMP YAKFLKDI++RKKK+GE+E VA+ +CSS    S  
Subjt:  ENIRKGNEDTPSVPPQVQPSFTPTPSFPQRLVKKNNDSQFRKFLDILKQLHINIPLVDALEQMPNYAKFLKDIVSRKKKIGEHELVAMIKCSSEAVGSPL

Query:  PMKCNDPGSFTIPCSIGGKNLGRALCDLGASINLMPLSVFKSLGIGEARPTTVTLQLADRSIIRPEGKIKDVLVQVDKFIFPADFIILDCEADLDVPIIL
        P K  DPGSFTI C IGGK++GRALCDLGA INLMPLS+FK L IG+A PTTVTL LADRSI +PEGKI+DVLV+VDKFIFPADFIILDCEAD DVPIIL
Subjt:  PMKCNDPGSFTIPCSIGGKNLGRALCDLGASINLMPLSVFKSLGIGEARPTTVTLQLADRSIIRPEGKIKDVLVQVDKFIFPADFIILDCEADLDVPIIL

Query:  GRPFLATGDTVFNVRKGEITMRVNNEEVKFNVLDAMKLPGDFEECSAINSLNPIMFDEFYDLLVTEIEEELDKIAEGPEDVANPIEKIQKEECKSLLPSI
        GRPFLATG+T+ +V+KGE+TMRV++++V FN+LDAMK P D EEC  I+    I   E  DLL  EIE EL+   E  ++    I  ++KE+ KS+ P  
Subjt:  GRPFLATGDTVFNVRKGEITMRVNNEEVKFNVLDAMKLPGDFEECSAINSLNPIMFDEFYDLLVTEIEEELDKIAEGPEDVANPIEKIQKEECKSLLPSI

Query:  VEPP
        +EPP
Subjt:  VEPP

A0A6J1DW02 uncharacterized protein LOC1110248974.6e-29989.15Show/hide
Query:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAAT
        MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLE QKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFN+IQMADNRDVAMREYAAT
Subjt:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAAT

Query:  AFQNFDSGIVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPGIIDDALRLTLFPFSLKDQARAWLNAFPPGSITTWGSLVE
        AFQNFDSGIVNPIPAH NFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPGI DDA  LTLFPFSLKDQAR  LNAFP GSITTWGSLVE
Subjt:  AFQNFDSGIVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPGIIDDALRLTLFPFSLKDQARAWLNAFPPGSITTWGSLVE

Query:  KFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELW
        KFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKC NHGLPAC QIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELW
Subjt:  KFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELW

Query:  CSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNN
        CSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLAT IQPVQSDYCT APVCQVNDLIC                           
Subjt:  CSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNN

Query:  RNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPPQQQYNQRTQTPPVHNNNSNLENMMKEYMARTDAVIQSQAASMRNFE
                     WRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQ+IPPPQQQYNQRTQTPP+ NNNSNLENMMKEYMARTDAVIQSQAASMRNF 
Subjt:  RNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPPQQQYNQRTQTPPVHNNNSNLENMMKEYMARTDAVIQSQAASMRNFE

Query:  TQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRKGNEDTPSVPPQ
        TQLG LANELKNRPQGSFPGHTELP+REGKEQCKAVTLRSGL YDGPTMPTTDVQIPST+P VKIPENPTTPEKENIRKGN+DT SVPPQ
Subjt:  TQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRKGNEDTPSVPPQ

A0A6J1DX11 uncharacterized protein LOC1110248603.0e-18194.85Show/hide
Query:  MASSSQHTSLPTSSTQAPNATSIPFPPLENFQHHMGSDSRLAAVRGGNPLQTFECPPSQAPTQHPIVSPHGYVNFQQLPTFNIPQNSEFRAENPQQLPPM
        MASSSQHTSLPTSST APNATSIPFPPLENFQHHM SDSRLAAVRGGNPLQTFECPPSQAPTQHPI+SPHGYVNFQQLPT NIPQNSEFRAENPQQLPPM
Subjt:  MASSSQHTSLPTSSTQAPNATSIPFPPLENFQHHMGSDSRLAAVRGGNPLQTFECPPSQAPTQHPIVSPHGYVNFQQLPTFNIPQNSEFRAENPQQLPPM

Query:  INPGMYQPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMLSPRVSPPPQLPFLERGPQAPQLNPNISSTFNMGQLKP
        INPGMYQPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHW+PP NYGM+SP+V PPPQLPFLERGPQAPQL+PNI ST+NMGQLKP
Subjt:  INPGMYQPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMLSPRVSPPPQLPFLERGPQAPQLNPNISSTFNMGQLKP

Query:  LEPPRMPTPTNMPMDAGDEHGGEQEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTPKDSSMENRDEEQFYSSPLIITPEDGNDDFLLVSRGH
        LEPPRMPTPTNMPMDAGDEHGGEQEK HSHRLEPGVSIGQKRKGKEVM DPEIEEDGSSRRLTPKDS+MENRDEEQFYSS  IITPEDGNDDFLLVSRGH
Subjt:  LEPPRMPTPTNMPMDAGDEHGGEQEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTPKDSSMENRDEEQFYSSPLIITPEDGNDDFLLVSRGH

Query:  CSNMPETEDTENEVVRTDTQEPSPLDTPTE
        CSNMPETEDTENEVVRTDTQEPSPLDTPTE
Subjt:  CSNMPETEDTENEVVRTDTQEPSPLDTPTE

A0A6J1DY39 uncharacterized protein LOC1110256534.4e-24956.14Show/hide
Query:  ARNDEFNHIQMADNRDVAMREYAATAFQNFDSGIVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPGIIDDALRLTLFPFS
        A N   N I +AD RD AMR+YAA   ++ +S ++N  PA A FE KPMM QML  IG FGG EHEDP  HLKSFI++AN FRLPGI DDALRLTLFPFS
Subjt:  ARNDEFNHIQMADNRDVAMREYAATAFQNFDSGIVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPGIIDDALRLTLFPFS

Query:  LKDQARAWLNAFPPGSITTWGSLVEKFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAAN
        +  QA AWLNAFP  +ITTW  +V+KFL K+FPPTR+AD+REEIISFRQ + E V+ AWERFK+LI  CPN G+PAC+QIEHFFRG D  TKMMLN AAN
Subjt:  LKDQARAWLNAFPPGSITTWGSLVEKFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAAN

Query:  GAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSF
        G FT K+FNEIV+IL+ L+ HN  WCS++SR   K+ DPAGVLALD  TSMQK++ T+ Q LK M    KN          ++   P+PV Q+ +  C +
Subjt:  GAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSF

Query:  CSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVP---PTQQYIPPPQQQYN-QRTQTPPVHNNN
        C + H  +NCP NP+S++YVG  N + FNPYSNTYNPGW+ HPNFSW GQG S+    G +QQ K+ Y P   P     PP   QYN Q+    P   N 
Subjt:  CSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVP---PTQQYIPPPQQQYN-QRTQTPPVHNNN

Query:  SNLENMMKEYMARTDAVIQS---------------------QAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMP
        SN+E +MKE + + DA ++                         ++R  E QLGQL NE++ RPQGS P  TE P+R GKE C ++  RSGL Y+GP MP
Subjt:  SNLENMMKEYMARTDAVIQS---------------------QAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMP

Query:  TTDVQIPSTEPIVKIPENPTTPEKENIRKGNEDTPSVP--PQVQPSFTPTPSFPQRLVKKNNDSQFRKFLDILKQLHINIPLVDALEQMPNYAKFLKDIV
              PS E      +    P+K       E   SVP  PQV  S  P P FPQRLV+KN D+ FRKFLDILKQLHINIP V+ALEQMP YAKF+KDI+
Subjt:  TTDVQIPSTEPIVKIPENPTTPEKENIRKGNEDTPSVP--PQVQPSFTPTPSFPQRLVKKNNDSQFRKFLDILKQLHINIPLVDALEQMPNYAKFLKDIV

Query:  SRKKKIGEHELVAMIKCSSEAVGSPLPMKCNDPGSFTIPCSIGGKNLGRALCDLGASINLMPLSVFKSLGIGEARPTTVTLQLADRSIIRPEGKIKDVLV
        +RKKK+GE+E VA+ +CSS    S +P K  DPGSFTIPC IGGK++GRALCDLGASINLMPLS+FK   IG+A PTTVTLQLADRSI +PEGKI+DVLV
Subjt:  SRKKKIGEHELVAMIKCSSEAVGSPLPMKCNDPGSFTIPCSIGGKNLGRALCDLGASINLMPLSVFKSLGIGEARPTTVTLQLADRSIIRPEGKIKDVLV

Query:  QVDKFIFPADFIILDCEADLDVPIILGRPFLATGDTVFNVRKGEITMRVNNEEVKFNVLDAMKLPGDFEECSAINSLNPIMFDEFYDLLVTEIEEELDKI
        +VDKFIFP DFIILDCEAD DVPIILGRPFLATG+T+ +V+KGE+TMRV++++V FN+LDAMK   D EEC+ I+    I   E  DLL  EIE EL+  
Subjt:  QVDKFIFPADFIILDCEADLDVPIILGRPFLATGDTVFNVRKGEITMRVNNEEVKFNVLDAMKLPGDFEECSAINSLNPIMFDEFYDLLVTEIEEELDKI

Query:  AEGPEDVANPIEKIQKEECKSLLPSIVEPP
         E  ++       ++KE+ KS+ P  +EPP
Subjt:  AEGPEDVANPIEKIQKEECKSLLPSIVEPP

A0A6J1EQ90 uncharacterized protein LOC1114364115.3e-17844.34Show/hide
Query:  LDPEIERTL-RKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAATAFQNFDSG
        LDPEIERT  R+ +K++++ +Q  +Q E   +++ E E                      N  M  +      N I +AD+R+ A+R YA  A +  +  
Subjt:  LDPEIERTL-RKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAATAFQNFDSG

Query:  IVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQI-------ANAFRLPGIIDDALRLTLFPFSLKDQARAWLNAFPPGSITTWGSLVEK
        I+ P      FELKP+MFQMLQTIG F G   EDPH HLKSF+ +       +++FR  G+  D +RL+LFP+ L+D A++WLN   PG+I +W SL E 
Subjt:  IVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQI-------ANAFRLPGIIDDALRLTLFPFSLKDQARAWLNAFPPGSITTWGSLVEK

Query:  FLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWC
        FL K+FPPTR+A  + EI++F+Q++ E + EA ERFKE++RKCP+HGLP CIQ+E F+ GL+  TK +++ +ANGA   KT+NE  +IL  +AS+N  W 
Subjt:  FLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWC

Query:  SQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGH----
          RS    K +   GVL +D  +S+  ++ ++   L+ +ALG  + +  P+        T A + Q     C +C E H +D CP NPAS+FYVG+    
Subjt:  SQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGH----

Query:  GNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPPQQQYNQRTQ--TPPVHNNNSNLENMMKEYMARTDAVIQSQAAS
        GN +N NP+SNTYNPGWR+HPNFSW GQ   S +NQ    +   P     Q  +    QQ N + +  T   + + +++E+++KEYMA+ DAVIQSQ AS
Subjt:  GNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPPQQQYNQRTQ--TPPVHNNNSNLENMMKEYMARTDAVIQSQAAS

Query:  MRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRKGNEDTPSVPPQVQPSF
        +RN E Q+G      KN  QG                            D  +  T D Q  + E  V+   +    E E   K    T +   Q   ++
Subjt:  MRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRKGNEDTPSVPPQVQPSF

Query:  TPTPSFPQRLVKKNNDSQFRKFLDILKQLHINIPLVDALEQMPNYAKFLKDIVSRKKKIGEHELVAMIKCSSEAVGSPLPMKCNDPGSFTIPCSIGGKNL
        TP+P FPQR+ +K  ++ F KF+DILK++HINIPLV+AL+QMPNY KFLKD++  ++K  E ++V++ +  S  + + +P+K  DPGSFTIP SIGGK L
Subjt:  TPTPSFPQRLVKKNNDSQFRKFLDILKQLHINIPLVDALEQMPNYAKFLKDIVSRKKKIGEHELVAMIKCSSEAVGSPLPMKCNDPGSFTIPCSIGGKNL

Query:  GRALCDLGASINLMPLSVFKSLGIGEARPTTVTLQLADRSIIRPEGKIKDVLVQVDKFIFPADFIILDCEADLDVPIILGRPFLATGDTVFNVRKGEITM
        GRALCDLGA+INLMPLS++K LGIGEARPTTVTLQLADRSI  PEGKI+D+L+QVDKFIF ADFIILD E D DVPIILGRPFL  G T+ +V KG IT+
Subjt:  GRALCDLGASINLMPLSVFKSLGIGEARPTTVTLQLADRSIIRPEGKIKDVLVQVDKFIFPADFIILDCEADLDVPIILGRPFLATGDTVFNVRKGEITM

Query:  RVNNEEVKFNVLDAMKLPGDFEECSAINSLNPIMFDEFYDLLVTEIEE
        R++ ++V+FN+ D+MK P   EECSA+  L      E +D   +E EE
Subjt:  RVNNEEVKFNVLDAMKLPGDFEECSAINSLNPIMFDEFYDLLVTEIEE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGCGGCAAGGGTTTTGTTCATTGGTATCCAACTCAACTTACTATTGTCGCCGACGTCGCCCACTACCGCCAATGCTGCCACCGTCGCCCACTGCCACTGTCG
TCAGTGCCCACTGCCGTTGCCGCCGTCGCCGATGCTGCTGTTCGTGGGTTTCCGTCGTCGATGCTGTTGTTCGTGGTTTCTGTCTCAACATCTGCCTTGGTTCGA
TTTAGCCGGTTCGACCGGTTCGGTTTAAATAGTCCGGTTCAACTGGTTCGATTCACTTGGTTTGATCCATCCGGTAACCTAAGATTATTTGCTACCTCTTCATCA
TCTACATCTTTGCATGATGTTACCATAGCAGATGACACCACTTCTCCTGTTTTTGGCTCTAGCACAGATCTTACAACGAAGAAGAATATTAGTAAAGGGCGTGAA
TCTGATAGCCTTTACACGTTTAATACAGAAATATCTACAGCCATTGTTTGTACTCGAGTGCCAACTCCTTTCGAAGAACATTACCGTTTGGGTCATCCATCCCTC
TCCGTGTTAAAGAGTCTCTGTCCTCAATTTCATAGTTTGCCTTCTTTAGACTTTAAGAGAGTTGTGTTTTTGGAGGAAAGAGCACGTGACACTCCCCCTTGCTTG
AAAGAGAAGGGCGAAGCTCTCAATCCCGATAAAAACTCTCCGTTAGGTAAGTTAAAGTGCATGAGCACAAGATCTTTTCTTCTACCCCTTGACCCTGAGATTGAG
CGGACCCTTCGAAAAACTCGAAAGGAGCAAAGACTTCGAAAACAACTAGAAAAGCAAAAAGAGAGAGAAGGTGAAATCAGTCCTGAAAGTGAGGTAGAGAGTACG
AGCACATCAATGGCAGATATTCCACCTCGTGATCCGGTTGATCCACCTGCTGTTAACGGTAATATGAGGGATCATGCAAGAAATGATGAATTCAACCATATCCAG
ATGGCGGACAACAGAGACGTGGCAATGCGAGAATATGCCGCCACGGCTTTTCAGAACTTTGATTCAGGGATAGTCAACCCTATTCCAGCCCACGCAAACTTTGAG
CTTAAACCAATGATGTTCCAAATGTTGCAGACAATTGGACATTTTGGAGGGCAGGAACATGAGGATCCACATGATCATCTGAAATCATTCATTCAAATTGCAAAT
GCATTTCGATTACCTGGTATAATAGACGATGCTCTAAGGCTAACACTTTTTCCATTTTCTTTGAAGGACCAAGCTAGAGCATGGCTCAATGCATTTCCACCGGGA
TCTATCACCACATGGGGGTCGTTAGTGGAGAAGTTCTTAACAAAATTCTTCCCACCTACTCGCCACGCTGATATCAGAGAGGAGATCATCTCCTTTAGACAGTAT
GATCGTGAACCTGTTCACGAGGCGTGGGAGAGATTTAAAGAACTAATCAGGAAATGTCCGAACCATGGCTTGCCGGCATGCATCCAGATAGAACATTTCTTTAGA
GGCTTAGATCATCCTACTAAGATGATGCTAAACAATGCTGCCAACGGAGCCTTTACAAAGAAGACATTCAACGAGATAGTTGACATCCTAAATGACTTAGCTTCA
CATAACGAACTATGGTGTTCGCAAAGATCTAGGGCAGCACCAAAGAAGCAAGATCCAGCTGGAGTTTTGGCTCTGGACATTGCGACCTCGATGCAAAAAGAGATG
GTTACAATGAACCAGAGGCTGAAAGAGATGGCATTGGGAATAAAAAATCCATTAGCCACGCCGATACAACCTGTGCAGTCGGATTATTGCACTCCTGCCCCTGTT
TGCCAAGTCAACGATCTCATTTGTTCATTTTGCAGTGAAAACCATATTTATGATAATTGTCCACATAACCCTGCTTCTGTTTTTTATGTAGGACATGGGAACAAT
AGGAACTTTAACCCATATTCGAACACCTACAACCCAGGTTGGAGGCACCACCCTAATTTCTCATGGGGAGGACAAGGAGGTTCGAGCGGTTTTAATCAAGGGCAG
AGCCAACAGAACAAGCAGCCCTATGTTCCACCTACACAGCAATACATCCCACCGCCGCAACAGCAGTACAATCAGAGAACACAGACTCCACCAGTTCACAATAAC
AACTCAAATCTTGAGAATATGATGAAGGAGTACATGGCCCGAACCGACGCAGTGATACAATCCCAAGCGGCATCAATGAGGAATTTCGAGACCCAATTGGGACAA
CTTGCCAATGAATTGAAGAATAGACCACAAGGTTCTTTTCCAGGCCATACTGAATTACCAAAACGAGAAGGGAAAGAACAGTGCAAAGCTGTCACCCTCAGGAGT
GGACTGGCATATGATGGACCAACAATGCCAACAACAGATGTACAGATTCCGTCCACCGAACCAATTGTAAAGATACCAGAAAATCCAACAACACCAGAAAAAGAA
AATATTAGAAAAGGTAATGAGGACACCCCGAGTGTTCCTCCACAGGTACAGCCTTCTTTTACTCCTACTCCCTCGTTTCCACAAAGATTAGTTAAGAAAAATAAT
GATTCCCAGTTTAGAAAATTTTTAGATATTTTAAAACAACTGCATATAAATATACCATTAGTAGATGCTCTAGAACAGATGCCAAATTATGCTAAGTTTTTGAAA
GATATAGTTTCTAGGAAGAAAAAAATAGGAGAGCATGAACTGGTAGCCATGATAAAATGCAGTAGTGAAGCTGTAGGCAGCCCGCTACCCATGAAATGTAATGAT
CCTGGCAGTTTTACCATTCCATGTTCCATAGGAGGAAAAAACTTAGGTAGGGCATTATGTGATCTAGGCGCTAGCATTAATCTTATGCCATTATCAGTTTTCAAA
TCTTTAGGCATAGGAGAAGCTAGACCTACTACGGTTACCCTCCAACTAGCAGATAGATCTATCATAAGACCTGAGGGAAAGATAAAAGATGTTTTAGTCCAGGTG
GACAAATTCATTTTCCCTGCGGACTTTATCATTCTAGATTGCGAGGCAGACTTAGACGTTCCCATTATTTTGGGAAGACCATTTCTAGCCACTGGGGATACAGTT
TTTAATGTGAGGAAGGGAGAAATTACAATGAGGGTAAATAATGAAGAAGTTAAATTTAACGTTCTAGATGCCATGAAATTACCAGGAGACTTTGAAGAGTGCTCT
GCTATAAATAGCTTGAATCCTATTATGTTTGATGAGTTTTATGACTTATTAGTTACAGAGATTGAAGAAGAGCTTGATAAGATAGCAGAAGGACCAGAAGATGTG
GCTAATCCTATTGAAAAAATACAAAAAGAAGAATGCAAGTCGTTACTTCCGTCCATAGTGGAACCACCCACGTTGGAGCAGAAGCCATTGCCGTCGCATTTGAAA
TATGCGTATCTAGGGGATAACGACACTTTACCACTACCAAAAAGAAGAAGGAATTTCCAGCATTTGCTTACATCTATGGCTTCTTCTTCTCAACATACTTCACTT
CCTACCTCTTCCACTCAAGCTCCAAATGCCACAAGCATTCCTTTTCCACCATTGGAGAACTTCCAACACCACATGGGTTCAGATTCTAGGCTAGCTGCTGTTAGG
GGAGGAAACCCGCTTCAAACATTTGAATGTCCTCCATCCCAAGCTCCTACACAACATCCTATAGTGAGCCCACATGGTTATGTTAATTTTCAGCAACTACCCACC
TTTAATATACCTCAAAACAGTGAGTTTAGGGCTGAAAATCCTCAACAACTTCCTCCAATGATCAATCCGGGTATGTACCAACCGTTTATGTTTAACCCGGTTCCT
TCCTATCATTTTCCCTTGTCTCAAATGCAAATTCCAGCTAGTGTTCATCCTTATGGAATGCCAAACCCATCTACTCTATTTCCTAGCCTTCCACCTTATTATGGA
ATGGGTCATTGGGTACCTCCACATAATTATGGGATGCTTTCACCTAGAGTTTCCCCACCACCTCAACTTCCATTCCTAGAAAGAGGACCTCAAGCACCCCAATTG
AACCCTAACATATCGAGTACCTTCAACATGGGACAACTAAAACCCTTAGAGCCTCCTAGGATGCCAACCCCAACCAATATGCCAATGGATGCAGGAGATGAGCAT
GGAGGAGAGCAAGAGAAGAGCCATAGCCATAGGCTAGAGCCCGGGGTTTCGATAGGGCAAAAGAGGAAGGGCAAGGAGGTGATGATAGACCCAGAAATTGAGGAA
GATGGGAGTAGTAGGCGTCTGACGCCTAAGGATTCATCTATGGAGAACAGGGACGAGGAGCAGTTTTATTCATCTCCATTGATTATTACACCGGAAGATGGTAAT
GATGATTTTCTACTGGTTTCACGAGGACATTGTTCAAATATGCCAGAAACAGAGGATACCGAGAACGAAGTGGTGAGAACAGATACTCAGGAACCTTCCCCATTG
GATACACCTACAGAATGA
mRNA sequenceShow/hide mRNA sequence
ATGCGCGGCAAGGGTTTTGTTCATTGGTATCCAACTCAACTTACTATTGTCGCCGACGTCGCCCACTACCGCCAATGCTGCCACCGTCGCCCACTGCCACTGTCG
TCAGTGCCCACTGCCGTTGCCGCCGTCGCCGATGCTGCTGTTCGTGGGTTTCCGTCGTCGATGCTGTTGTTCGTGGTTTCTGTCTCAACATCTGCCTTGGTTCGA
TTTAGCCGGTTCGACCGGTTCGGTTTAAATAGTCCGGTTCAACTGGTTCGATTCACTTGGTTTGATCCATCCGGTAACCTAAGATTATTTGCTACCTCTTCATCA
TCTACATCTTTGCATGATGTTACCATAGCAGATGACACCACTTCTCCTGTTTTTGGCTCTAGCACAGATCTTACAACGAAGAAGAATATTAGTAAAGGGCGTGAA
TCTGATAGCCTTTACACGTTTAATACAGAAATATCTACAGCCATTGTTTGTACTCGAGTGCCAACTCCTTTCGAAGAACATTACCGTTTGGGTCATCCATCCCTC
TCCGTGTTAAAGAGTCTCTGTCCTCAATTTCATAGTTTGCCTTCTTTAGACTTTAAGAGAGTTGTGTTTTTGGAGGAAAGAGCACGTGACACTCCCCCTTGCTTG
AAAGAGAAGGGCGAAGCTCTCAATCCCGATAAAAACTCTCCGTTAGGTAAGTTAAAGTGCATGAGCACAAGATCTTTTCTTCTACCCCTTGACCCTGAGATTGAG
CGGACCCTTCGAAAAACTCGAAAGGAGCAAAGACTTCGAAAACAACTAGAAAAGCAAAAAGAGAGAGAAGGTGAAATCAGTCCTGAAAGTGAGGTAGAGAGTACG
AGCACATCAATGGCAGATATTCCACCTCGTGATCCGGTTGATCCACCTGCTGTTAACGGTAATATGAGGGATCATGCAAGAAATGATGAATTCAACCATATCCAG
ATGGCGGACAACAGAGACGTGGCAATGCGAGAATATGCCGCCACGGCTTTTCAGAACTTTGATTCAGGGATAGTCAACCCTATTCCAGCCCACGCAAACTTTGAG
CTTAAACCAATGATGTTCCAAATGTTGCAGACAATTGGACATTTTGGAGGGCAGGAACATGAGGATCCACATGATCATCTGAAATCATTCATTCAAATTGCAAAT
GCATTTCGATTACCTGGTATAATAGACGATGCTCTAAGGCTAACACTTTTTCCATTTTCTTTGAAGGACCAAGCTAGAGCATGGCTCAATGCATTTCCACCGGGA
TCTATCACCACATGGGGGTCGTTAGTGGAGAAGTTCTTAACAAAATTCTTCCCACCTACTCGCCACGCTGATATCAGAGAGGAGATCATCTCCTTTAGACAGTAT
GATCGTGAACCTGTTCACGAGGCGTGGGAGAGATTTAAAGAACTAATCAGGAAATGTCCGAACCATGGCTTGCCGGCATGCATCCAGATAGAACATTTCTTTAGA
GGCTTAGATCATCCTACTAAGATGATGCTAAACAATGCTGCCAACGGAGCCTTTACAAAGAAGACATTCAACGAGATAGTTGACATCCTAAATGACTTAGCTTCA
CATAACGAACTATGGTGTTCGCAAAGATCTAGGGCAGCACCAAAGAAGCAAGATCCAGCTGGAGTTTTGGCTCTGGACATTGCGACCTCGATGCAAAAAGAGATG
GTTACAATGAACCAGAGGCTGAAAGAGATGGCATTGGGAATAAAAAATCCATTAGCCACGCCGATACAACCTGTGCAGTCGGATTATTGCACTCCTGCCCCTGTT
TGCCAAGTCAACGATCTCATTTGTTCATTTTGCAGTGAAAACCATATTTATGATAATTGTCCACATAACCCTGCTTCTGTTTTTTATGTAGGACATGGGAACAAT
AGGAACTTTAACCCATATTCGAACACCTACAACCCAGGTTGGAGGCACCACCCTAATTTCTCATGGGGAGGACAAGGAGGTTCGAGCGGTTTTAATCAAGGGCAG
AGCCAACAGAACAAGCAGCCCTATGTTCCACCTACACAGCAATACATCCCACCGCCGCAACAGCAGTACAATCAGAGAACACAGACTCCACCAGTTCACAATAAC
AACTCAAATCTTGAGAATATGATGAAGGAGTACATGGCCCGAACCGACGCAGTGATACAATCCCAAGCGGCATCAATGAGGAATTTCGAGACCCAATTGGGACAA
CTTGCCAATGAATTGAAGAATAGACCACAAGGTTCTTTTCCAGGCCATACTGAATTACCAAAACGAGAAGGGAAAGAACAGTGCAAAGCTGTCACCCTCAGGAGT
GGACTGGCATATGATGGACCAACAATGCCAACAACAGATGTACAGATTCCGTCCACCGAACCAATTGTAAAGATACCAGAAAATCCAACAACACCAGAAAAAGAA
AATATTAGAAAAGGTAATGAGGACACCCCGAGTGTTCCTCCACAGGTACAGCCTTCTTTTACTCCTACTCCCTCGTTTCCACAAAGATTAGTTAAGAAAAATAAT
GATTCCCAGTTTAGAAAATTTTTAGATATTTTAAAACAACTGCATATAAATATACCATTAGTAGATGCTCTAGAACAGATGCCAAATTATGCTAAGTTTTTGAAA
GATATAGTTTCTAGGAAGAAAAAAATAGGAGAGCATGAACTGGTAGCCATGATAAAATGCAGTAGTGAAGCTGTAGGCAGCCCGCTACCCATGAAATGTAATGAT
CCTGGCAGTTTTACCATTCCATGTTCCATAGGAGGAAAAAACTTAGGTAGGGCATTATGTGATCTAGGCGCTAGCATTAATCTTATGCCATTATCAGTTTTCAAA
TCTTTAGGCATAGGAGAAGCTAGACCTACTACGGTTACCCTCCAACTAGCAGATAGATCTATCATAAGACCTGAGGGAAAGATAAAAGATGTTTTAGTCCAGGTG
GACAAATTCATTTTCCCTGCGGACTTTATCATTCTAGATTGCGAGGCAGACTTAGACGTTCCCATTATTTTGGGAAGACCATTTCTAGCCACTGGGGATACAGTT
TTTAATGTGAGGAAGGGAGAAATTACAATGAGGGTAAATAATGAAGAAGTTAAATTTAACGTTCTAGATGCCATGAAATTACCAGGAGACTTTGAAGAGTGCTCT
GCTATAAATAGCTTGAATCCTATTATGTTTGATGAGTTTTATGACTTATTAGTTACAGAGATTGAAGAAGAGCTTGATAAGATAGCAGAAGGACCAGAAGATGTG
GCTAATCCTATTGAAAAAATACAAAAAGAAGAATGCAAGTCGTTACTTCCGTCCATAGTGGAACCACCCACGTTGGAGCAGAAGCCATTGCCGTCGCATTTGAAA
TATGCGTATCTAGGGGATAACGACACTTTACCACTACCAAAAAGAAGAAGGAATTTCCAGCATTTGCTTACATCTATGGCTTCTTCTTCTCAACATACTTCACTT
CCTACCTCTTCCACTCAAGCTCCAAATGCCACAAGCATTCCTTTTCCACCATTGGAGAACTTCCAACACCACATGGGTTCAGATTCTAGGCTAGCTGCTGTTAGG
GGAGGAAACCCGCTTCAAACATTTGAATGTCCTCCATCCCAAGCTCCTACACAACATCCTATAGTGAGCCCACATGGTTATGTTAATTTTCAGCAACTACCCACC
TTTAATATACCTCAAAACAGTGAGTTTAGGGCTGAAAATCCTCAACAACTTCCTCCAATGATCAATCCGGGTATGTACCAACCGTTTATGTTTAACCCGGTTCCT
TCCTATCATTTTCCCTTGTCTCAAATGCAAATTCCAGCTAGTGTTCATCCTTATGGAATGCCAAACCCATCTACTCTATTTCCTAGCCTTCCACCTTATTATGGA
ATGGGTCATTGGGTACCTCCACATAATTATGGGATGCTTTCACCTAGAGTTTCCCCACCACCTCAACTTCCATTCCTAGAAAGAGGACCTCAAGCACCCCAATTG
AACCCTAACATATCGAGTACCTTCAACATGGGACAACTAAAACCCTTAGAGCCTCCTAGGATGCCAACCCCAACCAATATGCCAATGGATGCAGGAGATGAGCAT
GGAGGAGAGCAAGAGAAGAGCCATAGCCATAGGCTAGAGCCCGGGGTTTCGATAGGGCAAAAGAGGAAGGGCAAGGAGGTGATGATAGACCCAGAAATTGAGGAA
GATGGGAGTAGTAGGCGTCTGACGCCTAAGGATTCATCTATGGAGAACAGGGACGAGGAGCAGTTTTATTCATCTCCATTGATTATTACACCGGAAGATGGTAAT
GATGATTTTCTACTGGTTTCACGAGGACATTGTTCAAATATGCCAGAAACAGAGGATACCGAGAACGAAGTGGTGAGAACAGATACTCAGGAACCTTCCCCATTG
GATACACCTACAGAATGA
Protein sequenceShow/hide protein sequence
MRGKGFVHWYPTQLTIVADVAHYRQCCHRRPLPLSSVPTAVAAVADAAVRGFPSSMLLFVVSVSTSALVRFSRFDRFGLNSPVQLVRFTWFDPSGNLRLFATSSS
STSLHDVTIADDTTSPVFGSSTDLTTKKNISKGRESDSLYTFNTEISTAIVCTRVPTPFEEHYRLGHPSLSVLKSLCPQFHSLPSLDFKRVVFLEERARDTPPCL
KEKGEALNPDKNSPLGKLKCMSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQ
MADNRDVAMREYAATAFQNFDSGIVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPGIIDDALRLTLFPFSLKDQARAWLNAFPPG
SITTWGSLVEKFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLAS
HNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNN
RNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPPQQQYNQRTQTPPVHNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQ
LANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRKGNEDTPSVPPQVQPSFTPTPSFPQRLVKKNN
DSQFRKFLDILKQLHINIPLVDALEQMPNYAKFLKDIVSRKKKIGEHELVAMIKCSSEAVGSPLPMKCNDPGSFTIPCSIGGKNLGRALCDLGASINLMPLSVFK
SLGIGEARPTTVTLQLADRSIIRPEGKIKDVLVQVDKFIFPADFIILDCEADLDVPIILGRPFLATGDTVFNVRKGEITMRVNNEEVKFNVLDAMKLPGDFEECS
AINSLNPIMFDEFYDLLVTEIEEELDKIAEGPEDVANPIEKIQKEECKSLLPSIVEPPTLEQKPLPSHLKYAYLGDNDTLPLPKRRRNFQHLLTSMASSSQHTSL
PTSSTQAPNATSIPFPPLENFQHHMGSDSRLAAVRGGNPLQTFECPPSQAPTQHPIVSPHGYVNFQQLPTFNIPQNSEFRAENPQQLPPMINPGMYQPFMFNPVP
SYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMLSPRVSPPPQLPFLERGPQAPQLNPNISSTFNMGQLKPLEPPRMPTPTNMPMDAGDEH
GGEQEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTPKDSSMENRDEEQFYSSPLIITPEDGNDDFLLVSRGHCSNMPETEDTENEVVRTDTQEPSPL
DTPTE