; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc03g18080 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc03g18080
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionReverse transcriptase
Genome locationchr3:11992909..12004231
RNA-Seq ExpressionMoc03g18080
SyntenyMoc03g18080
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR001969 - Aspartic peptidase, active site
IPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022156326.1 uncharacterized protein LOC111023247 [Momordica charantia]7.9e-21194.4Show/hide
Query:  GVGGVQAPPPQHLHTPQSEARFNKDFKHYGPPTFDGESERATAAEEWIRELEALYAYLGCEDQFKVKGAIFMLRGETLNWWDSVAAAEDHANVPIPWARF
        GVGGVQAPPPQHLHTPQSEARF KDFK YGPPTFDGESERATA EEWIRELEALYAYLGCEDQFKVKGA+FMLRGE LNWWDSVAAAED+ANVPIPWARF
Subjt:  GVGGVQAPPPQHLHTPQSEARFNKDFKHYGPPTFDGESERATAAEEWIRELEALYAYLGCEDQFKVKGAIFMLRGETLNWWDSVAAAEDHANVPIPWARF

Query:  KNLLYDYYYPETVKDMKEAEFLHLVHGTLSVAQYESKFTELSRFALELIPTEALKIKRFVKGLRKGIRGPVDLQRPTTYAEAVRGALVMDKDVSNKASPL
        KNLLYDYYYPETVKDMKEAEFLHLV GTLSVAQYE KFTELSRFALELIPTEALKIKRFVKGLRKGIRGPVDLQRPTTYAEAVRGALVMDKDVSNKASPL
Subjt:  KNLLYDYYYPETVKDMKEAEFLHLVHGTLSVAQYESKFTELSRFALELIPTEALKIKRFVKGLRKGIRGPVDLQRPTTYAEAVRGALVMDKDVSNKASPL

Query:  QEVGSSSCVKRKFPSTYADPVLRAPQRQAQHQGMPPVCPTCQKRHTGQCWTGSKGCFKCGREGHFARECPMSAANTQRLGQRIPPLVATQGNNQSAPVFT
         EVGSSS VKRKFPSTYAD VLRAPQRQAQHQGMPPVCPTCQKRHTGQCWTGSKGCF+CGREGHFARECPMSAANTQRLGQRIPP V+TQGNNQ A VF 
Subjt:  QEVGSSSCVKRKFPSTYADPVLRAPQRQAQHQGMPPVCPTCQKRHTGQCWTGSKGCFKCGREGHFARECPMSAANTQRLGQRIPPLVATQGNNQSAPVFT

Query:  LTPKEAADAETVVTGTVLDHDVPAYVLFDSGSSHTFISSAFVRQATLELEPLGFLLSVSTPPGSILIASQKVRAGELSFDNQTLRARLIQLDM
        LT KEAADAETVVTGTVL HDVPAYVLFDSGSSHTFISS FVRQATLELEPLGFLLSVSTP GSILIASQKVRA ELSFDNQTLRARLIQLDM
Subjt:  LTPKEAADAETVVTGTVLDHDVPAYVLFDSGSSHTFISSAFVRQATLELEPLGFLLSVSTPPGSILIASQKVRAGELSFDNQTLRARLIQLDM

XP_022158408.1 uncharacterized protein LOC111024897, partial [Momordica charantia]2.3e-29587.97Show/hide
Query:  LGRRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEISSESEVESTSTSMADIPPRDPVDQPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAAT
        +  RSFLLPLDPEIERTLRKTRKEQRLRKQLE QKEREGEIS ESEVESTSTSMADIPPRDPVD PAVNGNMRDHARNDEFN+IQMADNRDVAMREYAAT
Subjt:  LGRRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEISSESEVESTSTSMADIPPRDPVDQPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAAT

Query:  AFQNFDSGIVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPGITDDALRLTLFPFSLKDQARAWLNAFPPGSITTWGSLVE
        AFQNFDSGIVNPIPAH NFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPGITDDA  LTLFPFSLKDQAR  LNAFP GSITTWGSLVE
Subjt:  AFQNFDSGIVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPGITDDALRLTLFPFSLKDQARAWLNAFPPGSITTWGSLVE

Query:  KFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELW
        KFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKC NHGLPAC QIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELW
Subjt:  KFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELW

Query:  CLQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCNENHIYDNCPHNPASVFYVGHGNN
        C QRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLAT IQPVQSDYCT APVCQVNDLIC                           
Subjt:  CLQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCNENHIYDNCPHNPASVFYVGHGNN

Query:  RNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPPQRQYNQRTQTPPVHNNNSNLENMMKEYMARIDAVIQSQTASMRNFE
                     WRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQ+IPPPQ+QYNQRTQTPP+ NNNSNLENMMKEYMAR DAVIQSQ ASMRNF 
Subjt:  RNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPPQRQYNQRTQTPPVHNNNSNLENMMKEYMARIDAVIQSQTASMRNFE

Query:  TQLGHLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRKGNEDTPSVPPQ
        TQLGHLANELKNRPQGSFPGHTELP+REGKEQCKAVTLRSGL YDGPTMPTTDVQIPST+P VKIPENPTTPEKENIRKGN+DT SVPPQ
Subjt:  TQLGHLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRKGNEDTPSVPPQ

XP_022159077.1 uncharacterized protein LOC111025517 [Momordica charantia]6.8e-17877.26Show/hide
Query:  YLGCEDQFKVKGAIFMLRGETLNWWDSVAAAEDHANVPIPWARFKNLLYDYYYPETVKDMKEAEFLHLVHGTLSVAQYESKFTELSRFALELIPTEALKI
        YL CE+QFKVKG +FMLRGE LNWWDSVA AEDHANVPI WARFK+LLYDYYYP+T+KDMKEAEFLH  HGTL+VAQYE KFTELS FA ELIPTEA+KI
Subjt:  YLGCEDQFKVKGAIFMLRGETLNWWDSVAAAEDHANVPIPWARFKNLLYDYYYPETVKDMKEAEFLHLVHGTLSVAQYESKFTELSRFALELIPTEALKI

Query:  KRFVKGLRKGIRGPVDLQRPTTYAEAVRGALVMDKDVSNKASPLQEVGSSSCVKRKFPSTYADPVLRAPQRQAQHQGMPPVCPTCQKRHTGQCWTGSKGC
        KRFVKGLRKGIRGPVDLQRP TYAEAVRG L+MD DVSN   PL EVGSSS VKRK    YAD   RAPQR AQ QG+PPVCP+CQKR  GQCWTG++GC
Subjt:  KRFVKGLRKGIRGPVDLQRPTTYAEAVRGALVMDKDVSNKASPLQEVGSSSCVKRKFPSTYADPVLRAPQRQAQHQGMPPVCPTCQKRHTGQCWTGSKGC

Query:  FKCGREGHFARECPMSAANTQRLGQRIPPLVATQGNNQSAPVFTLTPKEAADAETVVTGTVLDHDVPAYVLFDSGSSHTFISSAFVRQATLELEPLGFLL
        F+CGREGHFAREC M+AANTQRLGQR  P V+TQG                       GT L H+VPAYVLFD GSSHTFIS+AFVRQATLELEPLGFLL
Subjt:  FKCGREGHFARECPMSAANTQRLGQRIPPLVATQGNNQSAPVFTLTPKEAADAETVVTGTVLDHDVPAYVLFDSGSSHTFISSAFVRQATLELEPLGFLL

Query:  SVSTPPGSILIASQKVRAGELSFDNQTLRARLIQLDMQDFDVIVGMDWLATNQANINCSRREVSFKLPSGRNFTFKGVTGRVPRTVSALKARGLLQNGAW
        SVSTP GS+LIASQ VRAGELSFDNQTL ARLIQLDM+DFDVI+GMDWLATNQANINCS+REVSF+LPSGR+FTFKGV+G VPR VSALKAR LL NGAW
Subjt:  SVSTPPGSILIASQKVRAGELSFDNQTLRARLIQLDMQDFDVIVGMDWLATNQANINCSRREVSFKLPSGRNFTFKGVTGRVPRTVSALKARGLLQNGAW

Query:  GYLANVVDV
         YLA+VVD+
Subjt:  GYLANVVDV

XP_022159235.1 uncharacterized protein LOC111025653 [Momordica charantia]5.7e-18554.01Show/hide
Query:  ARNDEFNHIQMADNRDVAMREYAATAFQNFDSGIVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPGITDDALRLTLFPFS
        A N   N I +AD RD AMR+YAA   ++ +S ++N  PA A FE KPMM QML  IG FGG EHEDP  HLKSFI++AN FRLPGI+DDALRLTLFPFS
Subjt:  ARNDEFNHIQMADNRDVAMREYAATAFQNFDSGIVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPGITDDALRLTLFPFS

Query:  LKDQARAWLNAFPPGSITTWGSLVEKFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAAN
        +  QA AWLNAFP  +ITTW  +V+KFL K+FPPTR+AD+REEIISFRQ + E V+ AWERFK+LI  CPN G+PAC+QIEHFFRG D  TKMMLN AAN
Subjt:  LKDQARAWLNAFPPGSITTWGSLVEKFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAAN

Query:  GAFTKKTFNEIVDILNDLASHNELWCLQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSF
        G FT K+FNEIV+IL+ L+ HN  WC ++SR   K+ DPAGVLALD  TSMQK++ T+ Q LK M    KN          ++   P+PV Q+ +  C +
Subjt:  GAFTKKTFNEIVDILNDLASHNELWCLQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSF

Query:  CNENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVP---PTQQYIPPPQRQYN-QRTQTPPVHNNN
        C + H  +NCP NP+S++YVG  N + FNPYSNTYNPGW+ HPNFSW GQG S+    G +QQ K+ Y P   P     PP   QYN Q+    P   N 
Subjt:  CNENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVP---PTQQYIPPPQRQYN-QRTQTPPVHNNN

Query:  SNLENMMKEYMARIDAVIQS---------------------QTASMRNFETQLGHLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMP
        SN+E +MKE + + DA ++                         ++R  E QLG L NE++ RPQGS P  TE P+R GKE C ++  RSGL Y+GP MP
Subjt:  SNLENMMKEYMARIDAVIQS---------------------QTASMRNFETQLGHLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMP

Query:  TTDVQIPSTEPIVKIPENPTTPEKENIRKGNEDTPSVP--PQVQPSFTPTPPFPQRLVKKNNDSQFRKFLDILKQLHINIPLVDALEQMPNYAKFLKDIV
              PS E      +    P+K       E   SVP  PQV  S  P PPFPQRLV+KN D+ FRKFLDILKQLHINIP V+ALEQMP YAKF+KDI+
Subjt:  TTDVQIPSTEPIVKIPENPTTPEKENIRKGNEDTPSVP--PQVQPSFTPTPPFPQRLVKKNNDSQFRKFLDILKQLHINIPLVDALEQMPNYAKFLKDIV

Query:  SRKKKIGEHELVAMTKCSSEAVGSSLPMKCNDPGSFTIPCSIGGNNLG
        +RKKK+GE+E VA+T+CSS    S +P K  DPGSFTIPC IGG ++G
Subjt:  SRKKKIGEHELVAMTKCSSEAVGSSLPMKCNDPGSFTIPCSIGGNNLG

XP_022159345.1 uncharacterized protein LOC111025764 [Momordica charantia]5.5e-17293.21Show/hide
Query:  MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCLQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQ
        MMLNNAANGAFTKKTFNEIVDILNDLASHNELWC QRSRAAPKKQDPAGVLALDIA+SMQKE VTMNQRLKEM LG+KNPLATPIQPVQSDYCTPAPVCQ
Subjt:  MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCLQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQ

Query:  VNDLICSFCNENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPPQRQYNQRTQTPPV
        VNDLICSFC+ENHIYD CPHNPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSW GQGGS GFNQGQSQQNKQPYVPPTQQYIPPPQ++YNQRTQTPPV
Subjt:  VNDLICSFCNENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPPQRQYNQRTQTPPV

Query:  HNNNSNLENMMKEYMARIDAVIQSQTASMRNFETQLGHLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPE
         NNNSNLENMMKEYMAR DAVIQSQ ASMRNFETQLG LANELKNRPQGSFP HTELPKREGKEQCKAVTLRSGLAYD PTMPT DVQIPST P VKIPE
Subjt:  HNNNSNLENMMKEYMARIDAVIQSQTASMRNFETQLGHLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPE

Query:  NPTTPEKENIRKGNEDTPSVPPQV
        NPTTPEK NIRKGNEDTPSVPPQ+
Subjt:  NPTTPEKENIRKGNEDTPSVPPQV

XP_022159345.1 uncharacterized protein LOC111025764 [Momordica charantia]2.2e-1141.79Show/hide
Query:  YDSLVTEIEEELDKIAEGPEDVANPIEKIQ-KEECKSLL---PSIVEPPTL--EQKPLPSHLKYAYLGDNDTVPVIITFNLSPTNEYSLL---QILEKHK
        +++ + ++  EL    +G       + K + KE+CK++        + PT+      +PS      + +N T P     N+   NE +     QILEKHK
Subjt:  YDSLVTEIEEELDKIAEGPEDVANPIEKIQ-KEECKSLL---PSIVEPPTL--EQKPLPSHLKYAYLGDNDTVPVIITFNLSPTNEYSLL---QILEKHK

Query:  KAIGWTIADIRGISPTFCMHKILLEEDAKNFIES
        KAIGWTIADIRGISP FCMHKILLEED KN IES
Subjt:  KAIGWTIADIRGISPTFCMHKILLEEDAKNFIES

TrEMBL top hitse value%identityAlignment
A0A6J1DUM2 uncharacterized protein LOC1110232473.8e-21194.4Show/hide
Query:  GVGGVQAPPPQHLHTPQSEARFNKDFKHYGPPTFDGESERATAAEEWIRELEALYAYLGCEDQFKVKGAIFMLRGETLNWWDSVAAAEDHANVPIPWARF
        GVGGVQAPPPQHLHTPQSEARF KDFK YGPPTFDGESERATA EEWIRELEALYAYLGCEDQFKVKGA+FMLRGE LNWWDSVAAAED+ANVPIPWARF
Subjt:  GVGGVQAPPPQHLHTPQSEARFNKDFKHYGPPTFDGESERATAAEEWIRELEALYAYLGCEDQFKVKGAIFMLRGETLNWWDSVAAAEDHANVPIPWARF

Query:  KNLLYDYYYPETVKDMKEAEFLHLVHGTLSVAQYESKFTELSRFALELIPTEALKIKRFVKGLRKGIRGPVDLQRPTTYAEAVRGALVMDKDVSNKASPL
        KNLLYDYYYPETVKDMKEAEFLHLV GTLSVAQYE KFTELSRFALELIPTEALKIKRFVKGLRKGIRGPVDLQRPTTYAEAVRGALVMDKDVSNKASPL
Subjt:  KNLLYDYYYPETVKDMKEAEFLHLVHGTLSVAQYESKFTELSRFALELIPTEALKIKRFVKGLRKGIRGPVDLQRPTTYAEAVRGALVMDKDVSNKASPL

Query:  QEVGSSSCVKRKFPSTYADPVLRAPQRQAQHQGMPPVCPTCQKRHTGQCWTGSKGCFKCGREGHFARECPMSAANTQRLGQRIPPLVATQGNNQSAPVFT
         EVGSSS VKRKFPSTYAD VLRAPQRQAQHQGMPPVCPTCQKRHTGQCWTGSKGCF+CGREGHFARECPMSAANTQRLGQRIPP V+TQGNNQ A VF 
Subjt:  QEVGSSSCVKRKFPSTYADPVLRAPQRQAQHQGMPPVCPTCQKRHTGQCWTGSKGCFKCGREGHFARECPMSAANTQRLGQRIPPLVATQGNNQSAPVFT

Query:  LTPKEAADAETVVTGTVLDHDVPAYVLFDSGSSHTFISSAFVRQATLELEPLGFLLSVSTPPGSILIASQKVRAGELSFDNQTLRARLIQLDM
        LT KEAADAETVVTGTVL HDVPAYVLFDSGSSHTFISS FVRQATLELEPLGFLLSVSTP GSILIASQKVRA ELSFDNQTLRARLIQLDM
Subjt:  LTPKEAADAETVVTGTVLDHDVPAYVLFDSGSSHTFISSAFVRQATLELEPLGFLLSVSTPPGSILIASQKVRAGELSFDNQTLRARLIQLDM

A0A6J1DW02 uncharacterized protein LOC1110248971.1e-29587.97Show/hide
Query:  LGRRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEISSESEVESTSTSMADIPPRDPVDQPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAAT
        +  RSFLLPLDPEIERTLRKTRKEQRLRKQLE QKEREGEIS ESEVESTSTSMADIPPRDPVD PAVNGNMRDHARNDEFN+IQMADNRDVAMREYAAT
Subjt:  LGRRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEISSESEVESTSTSMADIPPRDPVDQPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAAT

Query:  AFQNFDSGIVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPGITDDALRLTLFPFSLKDQARAWLNAFPPGSITTWGSLVE
        AFQNFDSGIVNPIPAH NFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPGITDDA  LTLFPFSLKDQAR  LNAFP GSITTWGSLVE
Subjt:  AFQNFDSGIVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPGITDDALRLTLFPFSLKDQARAWLNAFPPGSITTWGSLVE

Query:  KFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELW
        KFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKC NHGLPAC QIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELW
Subjt:  KFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELW

Query:  CLQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCNENHIYDNCPHNPASVFYVGHGNN
        C QRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLAT IQPVQSDYCT APVCQVNDLIC                           
Subjt:  CLQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCNENHIYDNCPHNPASVFYVGHGNN

Query:  RNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPPQRQYNQRTQTPPVHNNNSNLENMMKEYMARIDAVIQSQTASMRNFE
                     WRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQ+IPPPQ+QYNQRTQTPP+ NNNSNLENMMKEYMAR DAVIQSQ ASMRNF 
Subjt:  RNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPPQRQYNQRTQTPPVHNNNSNLENMMKEYMARIDAVIQSQTASMRNFE

Query:  TQLGHLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRKGNEDTPSVPPQ
        TQLGHLANELKNRPQGSFPGHTELP+REGKEQCKAVTLRSGL YDGPTMPTTDVQIPST+P VKIPENPTTPEKENIRKGN+DT SVPPQ
Subjt:  TQLGHLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRKGNEDTPSVPPQ

A0A6J1DY39 uncharacterized protein LOC1110256532.8e-18554.01Show/hide
Query:  ARNDEFNHIQMADNRDVAMREYAATAFQNFDSGIVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPGITDDALRLTLFPFS
        A N   N I +AD RD AMR+YAA   ++ +S ++N  PA A FE KPMM QML  IG FGG EHEDP  HLKSFI++AN FRLPGI+DDALRLTLFPFS
Subjt:  ARNDEFNHIQMADNRDVAMREYAATAFQNFDSGIVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPGITDDALRLTLFPFS

Query:  LKDQARAWLNAFPPGSITTWGSLVEKFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAAN
        +  QA AWLNAFP  +ITTW  +V+KFL K+FPPTR+AD+REEIISFRQ + E V+ AWERFK+LI  CPN G+PAC+QIEHFFRG D  TKMMLN AAN
Subjt:  LKDQARAWLNAFPPGSITTWGSLVEKFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAAN

Query:  GAFTKKTFNEIVDILNDLASHNELWCLQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSF
        G FT K+FNEIV+IL+ L+ HN  WC ++SR   K+ DPAGVLALD  TSMQK++ T+ Q LK M    KN          ++   P+PV Q+ +  C +
Subjt:  GAFTKKTFNEIVDILNDLASHNELWCLQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSF

Query:  CNENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVP---PTQQYIPPPQRQYN-QRTQTPPVHNNN
        C + H  +NCP NP+S++YVG  N + FNPYSNTYNPGW+ HPNFSW GQG S+    G +QQ K+ Y P   P     PP   QYN Q+    P   N 
Subjt:  CNENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVP---PTQQYIPPPQRQYN-QRTQTPPVHNNN

Query:  SNLENMMKEYMARIDAVIQS---------------------QTASMRNFETQLGHLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMP
        SN+E +MKE + + DA ++                         ++R  E QLG L NE++ RPQGS P  TE P+R GKE C ++  RSGL Y+GP MP
Subjt:  SNLENMMKEYMARIDAVIQS---------------------QTASMRNFETQLGHLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMP

Query:  TTDVQIPSTEPIVKIPENPTTPEKENIRKGNEDTPSVP--PQVQPSFTPTPPFPQRLVKKNNDSQFRKFLDILKQLHINIPLVDALEQMPNYAKFLKDIV
              PS E      +    P+K       E   SVP  PQV  S  P PPFPQRLV+KN D+ FRKFLDILKQLHINIP V+ALEQMP YAKF+KDI+
Subjt:  TTDVQIPSTEPIVKIPENPTTPEKENIRKGNEDTPSVP--PQVQPSFTPTPPFPQRLVKKNNDSQFRKFLDILKQLHINIPLVDALEQMPNYAKFLKDIV

Query:  SRKKKIGEHELVAMTKCSSEAVGSSLPMKCNDPGSFTIPCSIGGNNLG
        +RKKK+GE+E VA+T+CSS    S +P K  DPGSFTIPC IGG ++G
Subjt:  SRKKKIGEHELVAMTKCSSEAVGSSLPMKCNDPGSFTIPCSIGGNNLG

A0A6J1DYG0 uncharacterized protein LOC1110257642.7e-17293.21Show/hide
Query:  MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCLQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQ
        MMLNNAANGAFTKKTFNEIVDILNDLASHNELWC QRSRAAPKKQDPAGVLALDIA+SMQKE VTMNQRLKEM LG+KNPLATPIQPVQSDYCTPAPVCQ
Subjt:  MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCLQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQ

Query:  VNDLICSFCNENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPPQRQYNQRTQTPPV
        VNDLICSFC+ENHIYD CPHNPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSW GQGGS GFNQGQSQQNKQPYVPPTQQYIPPPQ++YNQRTQTPPV
Subjt:  VNDLICSFCNENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPPQRQYNQRTQTPPV

Query:  HNNNSNLENMMKEYMARIDAVIQSQTASMRNFETQLGHLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPE
         NNNSNLENMMKEYMAR DAVIQSQ ASMRNFETQLG LANELKNRPQGSFP HTELPKREGKEQCKAVTLRSGLAYD PTMPT DVQIPST P VKIPE
Subjt:  HNNNSNLENMMKEYMARIDAVIQSQTASMRNFETQLGHLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPE

Query:  NPTTPEKENIRKGNEDTPSVPPQV
        NPTTPEK NIRKGNEDTPSVPPQ+
Subjt:  NPTTPEKENIRKGNEDTPSVPPQV

A0A6J1DYG0 uncharacterized protein LOC1110257641.1e-1141.79Show/hide
Query:  YDSLVTEIEEELDKIAEGPEDVANPIEKIQ-KEECKSLL---PSIVEPPTL--EQKPLPSHLKYAYLGDNDTVPVIITFNLSPTNEYSLL---QILEKHK
        +++ + ++  EL    +G       + K + KE+CK++        + PT+      +PS      + +N T P     N+   NE +     QILEKHK
Subjt:  YDSLVTEIEEELDKIAEGPEDVANPIEKIQ-KEECKSLL---PSIVEPPTL--EQKPLPSHLKYAYLGDNDTVPVIITFNLSPTNEYSLL---QILEKHK

Query:  KAIGWTIADIRGISPTFCMHKILLEEDAKNFIES
        KAIGWTIADIRGISP FCMHKILLEED KN IES
Subjt:  KAIGWTIADIRGISPTFCMHKILLEEDAKNFIES

A0A6J1DYU5 uncharacterized protein LOC1110255173.3e-17877.26Show/hide
Query:  YLGCEDQFKVKGAIFMLRGETLNWWDSVAAAEDHANVPIPWARFKNLLYDYYYPETVKDMKEAEFLHLVHGTLSVAQYESKFTELSRFALELIPTEALKI
        YL CE+QFKVKG +FMLRGE LNWWDSVA AEDHANVPI WARFK+LLYDYYYP+T+KDMKEAEFLH  HGTL+VAQYE KFTELS FA ELIPTEA+KI
Subjt:  YLGCEDQFKVKGAIFMLRGETLNWWDSVAAAEDHANVPIPWARFKNLLYDYYYPETVKDMKEAEFLHLVHGTLSVAQYESKFTELSRFALELIPTEALKI

Query:  KRFVKGLRKGIRGPVDLQRPTTYAEAVRGALVMDKDVSNKASPLQEVGSSSCVKRKFPSTYADPVLRAPQRQAQHQGMPPVCPTCQKRHTGQCWTGSKGC
        KRFVKGLRKGIRGPVDLQRP TYAEAVRG L+MD DVSN   PL EVGSSS VKRK    YAD   RAPQR AQ QG+PPVCP+CQKR  GQCWTG++GC
Subjt:  KRFVKGLRKGIRGPVDLQRPTTYAEAVRGALVMDKDVSNKASPLQEVGSSSCVKRKFPSTYADPVLRAPQRQAQHQGMPPVCPTCQKRHTGQCWTGSKGC

Query:  FKCGREGHFARECPMSAANTQRLGQRIPPLVATQGNNQSAPVFTLTPKEAADAETVVTGTVLDHDVPAYVLFDSGSSHTFISSAFVRQATLELEPLGFLL
        F+CGREGHFAREC M+AANTQRLGQR  P V+TQG                       GT L H+VPAYVLFD GSSHTFIS+AFVRQATLELEPLGFLL
Subjt:  FKCGREGHFARECPMSAANTQRLGQRIPPLVATQGNNQSAPVFTLTPKEAADAETVVTGTVLDHDVPAYVLFDSGSSHTFISSAFVRQATLELEPLGFLL

Query:  SVSTPPGSILIASQKVRAGELSFDNQTLRARLIQLDMQDFDVIVGMDWLATNQANINCSRREVSFKLPSGRNFTFKGVTGRVPRTVSALKARGLLQNGAW
        SVSTP GS+LIASQ VRAGELSFDNQTL ARLIQLDM+DFDVI+GMDWLATNQANINCS+REVSF+LPSGR+FTFKGV+G VPR VSALKAR LL NGAW
Subjt:  SVSTPPGSILIASQKVRAGELSFDNQTLRARLIQLDMQDFDVIVGMDWLATNQANINCSRREVSFKLPSGRNFTFKGVTGRVPRTVSALKARGLLQNGAW

Query:  GYLANVVDV
         YLA+VVD+
Subjt:  GYLANVVDV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTGTTGGAGTTGCTTATCGCGTGTTTAGAGTTAAGGCTAGCCAGACTGAGTTAGGAACTAGCATACTTAGGAGTAGTGATCTCGGTCGACCTCCTCGTCAT
CACCAGGTGAGAACGGAGCCGATCCACCGCCCCCCTCCTGCTGGGAATCAGGCAGGAGTAGTCCCTCCATTTCCTCCAGCAGCAGCTCGAGAGCGGGCAGATCCT
CCAATTCCCCCAGCAGTTCCTCAGGTGAACCCCCAATTGGCATTGCTTGTAGAGGCCTTGCAAGCAGTGATCAGTAACGCCGCAGGGGTGGGCGGGGTCCAAGCT
CCGCCACCCCAACACCTTCATACACCCCAGAGCGAGGCTCGTTTCAACAAGGATTTCAAGCACTACGGACCCCCAACCTTTGACGGTGAGAGTGAAAGAGCGACA
GCAGCGGAAGAGTGGATTAGAGAGTTGGAAGCCCTTTACGCGTATCTTGGTTGCGAAGACCAATTCAAGGTGAAGGGCGCGATTTTTATGTTGAGGGGCGAGACC
CTGAACTGGTGGGACTCAGTAGCAGCGGCAGAGGATCATGCGAATGTACCAATTCCGTGGGCAAGGTTCAAGAACTTGTTGTACGACTACTACTATCCGGAGACT
GTGAAAGACATGAAGGAGGCAGAATTCCTGCATCTAGTCCATGGAACTTTATCAGTGGCACAGTATGAAAGCAAGTTCACGGAACTCTCCCGTTTCGCTCTAGAG
TTGATTCCCACTGAGGCATTAAAGATCAAGAGGTTTGTTAAGGGCTTGCGCAAGGGAATCAGAGGCCCGGTGGACCTCCAGCGACCCACCACCTATGCTGAAGCG
GTTAGGGGCGCCTTGGTTATGGATAAGGATGTTTCCAACAAGGCCTCACCCCTGCAAGAGGTCGGATCATCTTCCTGTGTGAAAAGGAAGTTCCCTTCGACTTAT
GCCGACCCGGTATTGAGAGCACCCCAGCGCCAGGCTCAACACCAGGGCATGCCGCCAGTATGCCCCACCTGCCAAAAAAGACATACGGGGCAGTGCTGGACGGGA
AGTAAGGGTTGTTTCAAGTGTGGAAGAGAGGGGCATTTTGCAAGGGAATGTCCCATGTCGGCCGCAAATACACAGAGGTTGGGTCAGAGGATTCCACCACTAGTT
GCGACGCAGGGAAATAACCAAAGTGCTCCTGTCTTCACACTTACGCCCAAGGAAGCGGCGGATGCCGAAACGGTGGTCACAGGTACTGTTTTAGACCATGATGTG
CCTGCGTATGTATTGTTTGATTCGGGGTCGAGCCACACCTTCATCTCTTCTGCGTTTGTTCGTCAGGCAACCCTCGAATTAGAGCCGTTAGGGTTTTTGTTGTCG
GTTTCTACACCACCAGGGTCGATTTTGATCGCTAGCCAAAAGGTGAGGGCAGGTGAGTTGTCTTTTGATAATCAGACTCTAAGGGCAAGACTGATCCAGCTGGAC
ATGCAAGATTTTGACGTTATTGTGGGCATGGATTGGCTAGCTACCAACCAAGCCAACATTAATTGCTCGAGAAGAGAAGTCTCCTTCAAACTACCTTCGGGTCGG
AACTTTACTTTTAAAGGGGTTACGGGTAGAGTCCCAAGGACAGTATCAGCGTTGAAGGCAAGAGGCCTGTTGCAGAATGGAGCTTGGGGATATTTGGCCAACGTC
GTCGACGTTATGTTAGATGAGTTGGACTATTTTGAGGTAGAGTTAGCGGTAGAAGATGTTTCGGCAGTGCTGGCTCAACTCTTAGTCAAACCCACCCTAAGACAA
CGAATCATCGCTGCACAAAAGGGAGACTCTAGTCTGAGCAAGGGTGTCGGGATGCTGGGCCGAGGAGATTTTTCCCTTTCAGAAGATAAGGCCCTGCTCTATCAG
GGGAGACTGCTAGGGGCTGATGTGAAGTTGGTTATGATAAGGAACGTTGGATTCTTGGGGAAAGTTAATGGGACCATTTGGGAGTGCAAATTAATCAAAAGAAGC
GAAAAGACGAAAAAACACCTCAGGAGGCGCCAGGCGCCTGGGAAGCCTGCAGAAAACAGTTTTTCTTCCAACTTTGCCCTTAATGAAACGCGTCTTCCAATGCGT
TTTGGTGGTTCCAACCGATGCATACGTGTAGAAGAAGTGTTCCACTATCAGTTTGAGCACGATTTGGGTAGAAGATCTTTTCTTCTACCCCTTGACCCTGAGATT
GAGCGGACCCTTCGAAAAACTAGAAAGGAGCAAAGACTTCGAAAACAACTAGAAAAGCAAAAAGAGAGAGAAGGTGAAATCAGTTCTGAAAGTGAGGTAGAGAGT
ACAAGCACATCAATGGCAGATATTCCACCTCGTGATCCGGTTGATCAACCTGCTGTTAACGGTAATATGAGGGATCATGCAAGAAATGATGAATTCAACCATATC
CAGATGGCGGACAACAGAGACGTGGCAATGCGAGAATATGCCGCCACGGCTTTTCAGAACTTTGATTCAGGGATAGTCAACCCTATTCCAGCCCACGCAAACTTT
GAGCTTAAACCAATGATGTTCCAAATGTTGCAGACAATTGGACATTTTGGAGGGCAGGAACATGAGGATCCACATGATCATCTGAAATCATTCATTCAAATTGCA
AATGCATTTCGATTACCTGGTATAACAGACGATGCTCTGAGACTAACACTTTTTCCATTTTCTTTGAAGGACCAAGCTAGAGCATGGCTCAATGCATTTCCACCG
GGATCCATCACCACATGGGGGTCGTTAGTGGAGAAGTTCTTAACAAAATTCTTCCCACCTACTCGCCACGCTGATATCAGAGAGGAGATCATCTCCTTTAGACAG
TATGATCGTGAACCTGTTCACGAGGCGTGGGAGAGATTTAAAGAACTAATCAGGAAATGTCCGAACCATGGCTTGCCGGCATGCATCCAGATAGAACATTTCTTT
AGAGGTTTAGATCATCCTACTAAGATGATGCTAAACAATGCTGCCAACGGAGCCTTTACAAAGAAGACATTCAACGAGATAGTTGACATCCTAAATGACTTAGCT
TCACACAACGAACTATGGTGTTTGCAAAGATCTAGGGCAGCACCAAAGAAGCAAGATCCAGCTGGAGTTTTGGCTCTGGACATTGCGACCTCGATGCAAAAAGAG
ATGGTTACAATGAACCAGAGGCTGAAAGAGATGGCATTGGGAATAAAAAATCCATTAGCCACGCCGATACAACCTGTGCAGTCGGATTATTGCACTCCTGCCCCT
GTTTGCCAAGTCAACGATCTCATTTGTTCATTTTGCAATGAAAACCATATTTATGATAATTGTCCACATAACCCTGCTTCTGTTTTTTATGTAGGACATGGGAAC
AATAGGAACTTTAACCCATATTCGAACACCTACAACCCAGGTTGGAGGCACCACCCTAATTTCTCATGGGGAGGACAAGGAGGTTCGAGCGGTTTTAATCAAGGG
CAGAGCCAACAGAACAAGCAGCCCTATGTTCCACCTACACAGCAATACATCCCACCACCGCAACGGCAGTACAATCAGAGAACACAGACTCCACCAGTTCACAAT
AACAACTCAAATCTTGAGAATATGATGAAGGAGTACATGGCCCGAATCGACGCAGTGATACAATCCCAAACGGCATCAATGAGGAATTTTGAGACCCAATTGGGA
CACCTTGCCAATGAATTGAAGAATAGACCACAAGGTTCTTTTCCAGGCCATACTGAATTACCAAAACGAGAAGGGAAAGAACAGTGCAAAGCTGTCACCCTCAGG
AGTGGACTGGCATATGATGGACCAACAATGCCAACAACAGATGTACAGATTCCGTCCACTGAACCAATTGTAAAGATACCAGAAAATCCAACAACACCAGAAAAA
GAAAATATTAGAAAAGGTAATGAGGACACCCCGAGTGTTCCTCCACAGGTACAGCCTTCTTTTACCCCTACTCCCCCGTTTCCACAAAGATTAGTTAAGAAAAAT
AATGATTCCCAGTTTAGAAAATTTTTAGATATTTTAAAACAACTGCATATAAATATACCATTAGTAGATGCTCTAGAACAGATGCCAAATTATGCTAAGTTTTTG
AAAGATATAGTTTCTAGGAAGAAAAAAATAGGAGAGCATGAACTGGTAGCCATGACAAAATGTAGTAGTGAAGCTGTAGGCAGCTCGCTACCCATGAAATGTAAT
GATCCTGGCAGTTTTACCATTCCATGTTCCATAGGAGGAAACAACTTAGGAGACTTTGAAGAGTGCTCTGCTATAAATAGCTTGAATCCTATTATGTTTGATGAG
TTTTATGACTCGTTAGTTACAGAGATTGAAGAAGAGCTTGATAAGATAGCAGAAGGACCAGAAGATGTGGCTAATCCTATTGAAAAAATACAAAAAGAAGAATGC
AAGTCGTTACTTCCGTCCATAGTGGAACCACCCACGTTGGAGCAGAAGCCATTGCCGTCGCATTTGAAATATGCATATCTAGGGGATAACGACACTGTACCGGTT
ATTATAACTTTCAATTTATCACCTACTAATGAATATTCTTTATTGCAGATTTTGGAGAAGCACAAAAAGGCCATTGGATGGACAATAGCAGATATCCGAGGGATA
AGCCCGACCTTTTGCATGCACAAAATCTTATTGGAAGAAGATGCTAAGAACTTTATCGAGAGTCAAAGGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGTTGTTGGAGTTGCTTATCGCGTGTTTAGAGTTAAGGCTAGCCAGACTGAGTTAGGAACTAGCATACTTAGGAGTAGTGATCTCGGTCGACCTCCTCGTCAT
CACCAGGTGAGAACGGAGCCGATCCACCGCCCCCCTCCTGCTGGGAATCAGGCAGGAGTAGTCCCTCCATTTCCTCCAGCAGCAGCTCGAGAGCGGGCAGATCCT
CCAATTCCCCCAGCAGTTCCTCAGGTGAACCCCCAATTGGCATTGCTTGTAGAGGCCTTGCAAGCAGTGATCAGTAACGCCGCAGGGGTGGGCGGGGTCCAAGCT
CCGCCACCCCAACACCTTCATACACCCCAGAGCGAGGCTCGTTTCAACAAGGATTTCAAGCACTACGGACCCCCAACCTTTGACGGTGAGAGTGAAAGAGCGACA
GCAGCGGAAGAGTGGATTAGAGAGTTGGAAGCCCTTTACGCGTATCTTGGTTGCGAAGACCAATTCAAGGTGAAGGGCGCGATTTTTATGTTGAGGGGCGAGACC
CTGAACTGGTGGGACTCAGTAGCAGCGGCAGAGGATCATGCGAATGTACCAATTCCGTGGGCAAGGTTCAAGAACTTGTTGTACGACTACTACTATCCGGAGACT
GTGAAAGACATGAAGGAGGCAGAATTCCTGCATCTAGTCCATGGAACTTTATCAGTGGCACAGTATGAAAGCAAGTTCACGGAACTCTCCCGTTTCGCTCTAGAG
TTGATTCCCACTGAGGCATTAAAGATCAAGAGGTTTGTTAAGGGCTTGCGCAAGGGAATCAGAGGCCCGGTGGACCTCCAGCGACCCACCACCTATGCTGAAGCG
GTTAGGGGCGCCTTGGTTATGGATAAGGATGTTTCCAACAAGGCCTCACCCCTGCAAGAGGTCGGATCATCTTCCTGTGTGAAAAGGAAGTTCCCTTCGACTTAT
GCCGACCCGGTATTGAGAGCACCCCAGCGCCAGGCTCAACACCAGGGCATGCCGCCAGTATGCCCCACCTGCCAAAAAAGACATACGGGGCAGTGCTGGACGGGA
AGTAAGGGTTGTTTCAAGTGTGGAAGAGAGGGGCATTTTGCAAGGGAATGTCCCATGTCGGCCGCAAATACACAGAGGTTGGGTCAGAGGATTCCACCACTAGTT
GCGACGCAGGGAAATAACCAAAGTGCTCCTGTCTTCACACTTACGCCCAAGGAAGCGGCGGATGCCGAAACGGTGGTCACAGGTACTGTTTTAGACCATGATGTG
CCTGCGTATGTATTGTTTGATTCGGGGTCGAGCCACACCTTCATCTCTTCTGCGTTTGTTCGTCAGGCAACCCTCGAATTAGAGCCGTTAGGGTTTTTGTTGTCG
GTTTCTACACCACCAGGGTCGATTTTGATCGCTAGCCAAAAGGTGAGGGCAGGTGAGTTGTCTTTTGATAATCAGACTCTAAGGGCAAGACTGATCCAGCTGGAC
ATGCAAGATTTTGACGTTATTGTGGGCATGGATTGGCTAGCTACCAACCAAGCCAACATTAATTGCTCGAGAAGAGAAGTCTCCTTCAAACTACCTTCGGGTCGG
AACTTTACTTTTAAAGGGGTTACGGGTAGAGTCCCAAGGACAGTATCAGCGTTGAAGGCAAGAGGCCTGTTGCAGAATGGAGCTTGGGGATATTTGGCCAACGTC
GTCGACGTTATGTTAGATGAGTTGGACTATTTTGAGGTAGAGTTAGCGGTAGAAGATGTTTCGGCAGTGCTGGCTCAACTCTTAGTCAAACCCACCCTAAGACAA
CGAATCATCGCTGCACAAAAGGGAGACTCTAGTCTGAGCAAGGGTGTCGGGATGCTGGGCCGAGGAGATTTTTCCCTTTCAGAAGATAAGGCCCTGCTCTATCAG
GGGAGACTGCTAGGGGCTGATGTGAAGTTGGTTATGATAAGGAACGTTGGATTCTTGGGGAAAGTTAATGGGACCATTTGGGAGTGCAAATTAATCAAAAGAAGC
GAAAAGACGAAAAAACACCTCAGGAGGCGCCAGGCGCCTGGGAAGCCTGCAGAAAACAGTTTTTCTTCCAACTTTGCCCTTAATGAAACGCGTCTTCCAATGCGT
TTTGGTGGTTCCAACCGATGCATACGTGTAGAAGAAGTGTTCCACTATCAGTTTGAGCACGATTTGGGTAGAAGATCTTTTCTTCTACCCCTTGACCCTGAGATT
GAGCGGACCCTTCGAAAAACTAGAAAGGAGCAAAGACTTCGAAAACAACTAGAAAAGCAAAAAGAGAGAGAAGGTGAAATCAGTTCTGAAAGTGAGGTAGAGAGT
ACAAGCACATCAATGGCAGATATTCCACCTCGTGATCCGGTTGATCAACCTGCTGTTAACGGTAATATGAGGGATCATGCAAGAAATGATGAATTCAACCATATC
CAGATGGCGGACAACAGAGACGTGGCAATGCGAGAATATGCCGCCACGGCTTTTCAGAACTTTGATTCAGGGATAGTCAACCCTATTCCAGCCCACGCAAACTTT
GAGCTTAAACCAATGATGTTCCAAATGTTGCAGACAATTGGACATTTTGGAGGGCAGGAACATGAGGATCCACATGATCATCTGAAATCATTCATTCAAATTGCA
AATGCATTTCGATTACCTGGTATAACAGACGATGCTCTGAGACTAACACTTTTTCCATTTTCTTTGAAGGACCAAGCTAGAGCATGGCTCAATGCATTTCCACCG
GGATCCATCACCACATGGGGGTCGTTAGTGGAGAAGTTCTTAACAAAATTCTTCCCACCTACTCGCCACGCTGATATCAGAGAGGAGATCATCTCCTTTAGACAG
TATGATCGTGAACCTGTTCACGAGGCGTGGGAGAGATTTAAAGAACTAATCAGGAAATGTCCGAACCATGGCTTGCCGGCATGCATCCAGATAGAACATTTCTTT
AGAGGTTTAGATCATCCTACTAAGATGATGCTAAACAATGCTGCCAACGGAGCCTTTACAAAGAAGACATTCAACGAGATAGTTGACATCCTAAATGACTTAGCT
TCACACAACGAACTATGGTGTTTGCAAAGATCTAGGGCAGCACCAAAGAAGCAAGATCCAGCTGGAGTTTTGGCTCTGGACATTGCGACCTCGATGCAAAAAGAG
ATGGTTACAATGAACCAGAGGCTGAAAGAGATGGCATTGGGAATAAAAAATCCATTAGCCACGCCGATACAACCTGTGCAGTCGGATTATTGCACTCCTGCCCCT
GTTTGCCAAGTCAACGATCTCATTTGTTCATTTTGCAATGAAAACCATATTTATGATAATTGTCCACATAACCCTGCTTCTGTTTTTTATGTAGGACATGGGAAC
AATAGGAACTTTAACCCATATTCGAACACCTACAACCCAGGTTGGAGGCACCACCCTAATTTCTCATGGGGAGGACAAGGAGGTTCGAGCGGTTTTAATCAAGGG
CAGAGCCAACAGAACAAGCAGCCCTATGTTCCACCTACACAGCAATACATCCCACCACCGCAACGGCAGTACAATCAGAGAACACAGACTCCACCAGTTCACAAT
AACAACTCAAATCTTGAGAATATGATGAAGGAGTACATGGCCCGAATCGACGCAGTGATACAATCCCAAACGGCATCAATGAGGAATTTTGAGACCCAATTGGGA
CACCTTGCCAATGAATTGAAGAATAGACCACAAGGTTCTTTTCCAGGCCATACTGAATTACCAAAACGAGAAGGGAAAGAACAGTGCAAAGCTGTCACCCTCAGG
AGTGGACTGGCATATGATGGACCAACAATGCCAACAACAGATGTACAGATTCCGTCCACTGAACCAATTGTAAAGATACCAGAAAATCCAACAACACCAGAAAAA
GAAAATATTAGAAAAGGTAATGAGGACACCCCGAGTGTTCCTCCACAGGTACAGCCTTCTTTTACCCCTACTCCCCCGTTTCCACAAAGATTAGTTAAGAAAAAT
AATGATTCCCAGTTTAGAAAATTTTTAGATATTTTAAAACAACTGCATATAAATATACCATTAGTAGATGCTCTAGAACAGATGCCAAATTATGCTAAGTTTTTG
AAAGATATAGTTTCTAGGAAGAAAAAAATAGGAGAGCATGAACTGGTAGCCATGACAAAATGTAGTAGTGAAGCTGTAGGCAGCTCGCTACCCATGAAATGTAAT
GATCCTGGCAGTTTTACCATTCCATGTTCCATAGGAGGAAACAACTTAGGAGACTTTGAAGAGTGCTCTGCTATAAATAGCTTGAATCCTATTATGTTTGATGAG
TTTTATGACTCGTTAGTTACAGAGATTGAAGAAGAGCTTGATAAGATAGCAGAAGGACCAGAAGATGTGGCTAATCCTATTGAAAAAATACAAAAAGAAGAATGC
AAGTCGTTACTTCCGTCCATAGTGGAACCACCCACGTTGGAGCAGAAGCCATTGCCGTCGCATTTGAAATATGCATATCTAGGGGATAACGACACTGTACCGGTT
ATTATAACTTTCAATTTATCACCTACTAATGAATATTCTTTATTGCAGATTTTGGAGAAGCACAAAAAGGCCATTGGATGGACAATAGCAGATATCCGAGGGATA
AGCCCGACCTTTTGCATGCACAAAATCTTATTGGAAGAAGATGCTAAGAACTTTATCGAGAGTCAAAGGTAA
Protein sequenceShow/hide protein sequence
MVVGVAYRVFRVKASQTELGTSILRSSDLGRPPRHHQVRTEPIHRPPPAGNQAGVVPPFPPAAARERADPPIPPAVPQVNPQLALLVEALQAVISNAAGVGGVQA
PPPQHLHTPQSEARFNKDFKHYGPPTFDGESERATAAEEWIRELEALYAYLGCEDQFKVKGAIFMLRGETLNWWDSVAAAEDHANVPIPWARFKNLLYDYYYPET
VKDMKEAEFLHLVHGTLSVAQYESKFTELSRFALELIPTEALKIKRFVKGLRKGIRGPVDLQRPTTYAEAVRGALVMDKDVSNKASPLQEVGSSSCVKRKFPSTY
ADPVLRAPQRQAQHQGMPPVCPTCQKRHTGQCWTGSKGCFKCGREGHFARECPMSAANTQRLGQRIPPLVATQGNNQSAPVFTLTPKEAADAETVVTGTVLDHDV
PAYVLFDSGSSHTFISSAFVRQATLELEPLGFLLSVSTPPGSILIASQKVRAGELSFDNQTLRARLIQLDMQDFDVIVGMDWLATNQANINCSRREVSFKLPSGR
NFTFKGVTGRVPRTVSALKARGLLQNGAWGYLANVVDVMLDELDYFEVELAVEDVSAVLAQLLVKPTLRQRIIAAQKGDSSLSKGVGMLGRGDFSLSEDKALLYQ
GRLLGADVKLVMIRNVGFLGKVNGTIWECKLIKRSEKTKKHLRRRQAPGKPAENSFSSNFALNETRLPMRFGGSNRCIRVEEVFHYQFEHDLGRRSFLLPLDPEI
ERTLRKTRKEQRLRKQLEKQKEREGEISSESEVESTSTSMADIPPRDPVDQPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAATAFQNFDSGIVNPIPAHANF
ELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPGITDDALRLTLFPFSLKDQARAWLNAFPPGSITTWGSLVEKFLTKFFPPTRHADIREEIISFRQ
YDREPVHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCLQRSRAAPKKQDPAGVLALDIATSMQKE
MVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCNENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQG
QSQQNKQPYVPPTQQYIPPPQRQYNQRTQTPPVHNNNSNLENMMKEYMARIDAVIQSQTASMRNFETQLGHLANELKNRPQGSFPGHTELPKREGKEQCKAVTLR
SGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRKGNEDTPSVPPQVQPSFTPTPPFPQRLVKKNNDSQFRKFLDILKQLHINIPLVDALEQMPNYAKFL
KDIVSRKKKIGEHELVAMTKCSSEAVGSSLPMKCNDPGSFTIPCSIGGNNLGDFEECSAINSLNPIMFDEFYDSLVTEIEEELDKIAEGPEDVANPIEKIQKEEC
KSLLPSIVEPPTLEQKPLPSHLKYAYLGDNDTVPVIITFNLSPTNEYSLLQILEKHKKAIGWTIADIRGISPTFCMHKILLEEDAKNFIESQR