; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc08g27140 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc08g27140
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionReverse transcriptase
Genome locationchr8:19621867..19627913
RNA-Seq ExpressionMoc08g27140
SyntenyMoc08g27140
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022152366.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111020110 [Momordica charantia]7.5e-22594.23Show/hide
Query:  HTSLPTSSTQAPNATSIPFPPLENFQHHMVSDSRLAAVRGGNPLQTFECPPSQAPTQHPIMSPHGYVNFQQLPTLNIPQNSEFRAENPQQLPPMINPGMY
        HTSLPTSSTQAPNATSIPFPPLENFQHHMVSDSRLAAVRGGNPLQTFECPPSQAPTQHPIMSPHGYVNFQQLPT NIPQ+SEFRAENPQQLPPMINPGMY
Subjt:  HTSLPTSSTQAPNATSIPFPPLENFQHHMVSDSRLAAVRGGNPLQTFECPPSQAPTQHPIMSPHGYVNFQQLPTLNIPQNSEFRAENPQQLPPMINPGMY

Query:  QPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWIPPQNYGMISPKVPPPPQLPFLERGPQAPQLDPNILSTYNMGQLKPLEPPRM
        QPFMFN VPSYHFPLSQMQIPASVHPYGMPNPSTLFPSL PYYGMGHW+PP NYGMISP+VPPPPQL FLERGPQAPQL+PNILST+NMGQLKPLEPPRM
Subjt:  QPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWIPPQNYGMISPKVPPPPQLPFLERGPQAPQLDPNILSTYNMGQLKPLEPPRM

Query:  PTPTNMPMDAGDEHGGEQEKCHSHRLEPGVSIGQKRKGKEVMTDPEIEEDGSSRRLTPKDSTMENRDEEQFYSSSFIITPEDGNDDFLLVSRGHCSNMPE
        PTPTNMPMD GDEHGGEQEK HSHRLEPGV IGQKRKGKEVM DPEIEEDGSSRRLTPKDS+MENRDEEQFYSS  IIT EDGNDDFLLVSRGHCSNMPE
Subjt:  PTPTNMPMDAGDEHGGEQEKCHSHRLEPGVSIGQKRKGKEVMTDPEIEEDGSSRRLTPKDSTMENRDEEQFYSSSFIITPEDGNDDFLLVSRGHCSNMPE

Query:  TEDTENEVVRTDTQEPSPLDTPTEGAFCSHQELPTGATGSTPLATDEYVTPMATLPGVRDAHTIPSNTVNPLGCDTGRSKVGENSTQEPTSEEDPEDTQT
        TEDTENEVVRTDTQEPSPLDTPTEGAFCSHQELPTGATGS PLATDEYVT MATLPGVRDAHTIPSNTVNPLGCDTGRS+VGENSTQEPTSEED ED QT
Subjt:  TEDTENEVVRTDTQEPSPLDTPTEGAFCSHQELPTGATGSTPLATDEYVTPMATLPGVRDAHTIPSNTVNPLGCDTGRSKVGENSTQEPTSEEDPEDTQT

Query:  IREMFQYKRREKKSSK
        IREMFQYKRREK SSK
Subjt:  IREMFQYKRREKKSSK

XP_022152366.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111020110 [Momordica charantia]1.9e-0283.87Show/hide
Query:  MSNYAKLLKDIVSRKKKIREHQLVAMTKCSS
        M NYAK LKDIVSRKKKI EH+LVAMTKCS+
Subjt:  MSNYAKLLKDIVSRKKKIREHQLVAMTKCSS

XP_022152366.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111020110 [Momordica charantia]1.3e-21652.76Show/hide
Query:  ARNDEFNHIQMADNRDVAMREYATTAFQNFDSGIVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPGITDDALRLTLFPFS
        A N   N I +AD RD AMR+YA    ++ +S + N  PA A FE KPMM QML  IG F G EHEDP  HLKSFI++AN FRLPGI+DDALRLTLFPFS
Subjt:  ARNDEFNHIQMADNRDVAMREYATTAFQNFDSGIVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPGITDDALRLTLFPFS

Query:  LKDHARAWLNAFPPGSITTWGSLVEKFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKCPNNGLPACIQIEHFFRSLDHPTKMMLNNAAN
        L   A AWLNAFP  +I T   +V+KFL K+FPPTR+AD+REEIISFRQ + E V+ AWERFK+LIR CPN G+PAC+QIEHFFR  D PT MMLN AAN
Subjt:  LKDHARAWLNAFPPGSITTWGSLVEKFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKCPNNGLPACIQIEHFFRSLDHPTKMMLNNAAN

Query:  GAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTLVPVCQVNDLIYSF
        G FT K+FNEIV+IL+ L+ HN+ WCS++ R   K+ DPA VLALD  TSMQK++ T+ Q LK M    KN  A  + P  +                  
Subjt:  GAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTLVPVCQVNDLIYSF

Query:  CSENHIYDNCPHNPTSVFYVGHG----NNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVP---PTQQYISPPQQQYN-QRTQTPPV
                    NP+ V+ +       N + FNPYSN YNPGW+ HPNFSW GQG SSG   GQ+QQ KQ Y P   P      P  QQYN Q+    P 
Subjt:  CSENHIYDNCPHNPTSVFYVGHG----NNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVP---PTQQYISPPQQQYN-QRTQTPPV

Query:  QNNNSNLENMMKEY-----------MARTDAVIQSQA----------ASMRNFETQLGQLANELMNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDG
        Q N SN+E +MKE+           M RTDA I+              ++RN E QLGQLANE+  RPQGS P  TE P+R              +    
Subjt:  QNNNSNLENMMKEY-----------MARTDAVIQSQA----------ASMRNFETQLGQLANELMNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDG

Query:  PTMPTTDVQIPSTEPIVKIPENPTTPEKENTRKGNEDTPSVPPQVQPSFTPTPPFPQRLVKKNNDSQFRKFLDILKQLHINIPLVDALEQMSNYAKLLKD
        P+    D Q+        +P+    PE          + SV PQV    +P PPFPQRLV+KN D+ FRKFLDILKQLHINIP V+ALEQM  YAK LKD
Subjt:  PTMPTTDVQIPSTEPIVKIPENPTTPEKENTRKGNEDTPSVPPQVQPSFTPTPPFPQRLVKKNNDSQFRKFLDILKQLHINIPLVDALEQMSNYAKLLKD

Query:  IVSRKKKIREHQLVAMTKCSSEAVGSQLPMKCNDLGSFTIPCSIGGKNLGRALCDLGASINLMPLSVFKSLGIGEARPTTVTLQLADRS----EGKIEDV
        I++RKKK+ E++ VA+T+CSS    S+ P K  D GSFTI C IGGK++GRALCDLGA INLMPLS+FK L IG+A PTTVTL LADRS    EGKIEDV
Subjt:  IVSRKKKIREHQLVAMTKCSSEAVGSQLPMKCNDLGSFTIPCSIGGKNLGRALCDLGASINLMPLSVFKSLGIGEARPTTVTLQLADRS----EGKIEDV

Query:  LVQVDKFIFPANFIILDCEADLDVPIILGRPFLATGDTVFNVRKGEITMRVNNEEVKFNVLDAMKLPGDFEECSAITNFNPVMFDEFYDLLVTEIEEELD
        LV+VDKFIFPA+FIILDCEAD DVPIILGRPFLATG+T+ +V+KGE+TMRV++++V FN+LDAMK P D EEC  I+    +   E  DLL  EIE EL+
Subjt:  LVQVDKFIFPANFIILDCEADLDVPIILGRPFLATGDTVFNVRKGEITMRVNNEEVKFNVLDAMKLPGDFEECSAITNFNPVMFDEFYDLLVTEIEEELD

Query:  KIAEGPEDVTNPVEKIQKEECKSLLPSIVEPP
           E  ++    +  ++KE+ KS+ P  +EPP
Subjt:  KIAEGPEDVTNPVEKIQKEECKSLLPSIVEPP

XP_022158357.1 uncharacterized protein LOC111024860 [Momordica charantia]2.0e-24198.82Show/hide
Query:  MASSSQHTSLPTSSTQAPNATSIPFPPLENFQHHMVSDSRLAAVRGGNPLQTFECPPSQAPTQHPIMSPHGYVNFQQLPTLNIPQNSEFRAENPQQLPPM
        MASSSQHTSLPTSST APNATSIPFPPLENFQHHM SDSRLAAVRGGNPLQTFECPPSQAPTQHPIMSPHGYVNFQQLPTLNIPQNSEFRAENPQQLPPM
Subjt:  MASSSQHTSLPTSSTQAPNATSIPFPPLENFQHHMVSDSRLAAVRGGNPLQTFECPPSQAPTQHPIMSPHGYVNFQQLPTLNIPQNSEFRAENPQQLPPM

Query:  INPGMYQPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWIPPQNYGMISPKVPPPPQLPFLERGPQAPQLDPNILSTYNMGQLKP
        INPGMYQPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWIPPQNYGMISPKVPPPPQLPFLERGPQAPQLDPNILSTYNMGQLKP
Subjt:  INPGMYQPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWIPPQNYGMISPKVPPPPQLPFLERGPQAPQLDPNILSTYNMGQLKP

Query:  LEPPRMPTPTNMPMDAGDEHGGEQEKCHSHRLEPGVSIGQKRKGKEVMTDPEIEEDGSSRRLTPKDSTMENRDEEQFYSSSFIITPEDGNDDFLLVSRGH
        LEPPRMPTPTNMPMDAGDEHGGEQEKCHSHRLEPGVSIGQKRKGKEVMTDPEIEEDGSSRRLTPKDSTMENRDEEQFYSSSFIITPEDGNDDFLLVSRGH
Subjt:  LEPPRMPTPTNMPMDAGDEHGGEQEKCHSHRLEPGVSIGQKRKGKEVMTDPEIEEDGSSRRLTPKDSTMENRDEEQFYSSSFIITPEDGNDDFLLVSRGH

Query:  CSNMPETEDTENEVVRTDTQEPSPLDTPTEGAFCSHQELPTGATGSTPLATDEYVTPMATLPGVRDAHTIPSNTVNPLGCDTGRSKVGENSTQEPTSEED
        CSNMPETEDTENEVVRTDTQEPSPLDTPTEGAFCSHQELPTGATGSTPLATDEYVTPMATLPGVRDAHTIPSNTVNPLGCDTGRSKVGENSTQEPTSEED
Subjt:  CSNMPETEDTENEVVRTDTQEPSPLDTPTEGAFCSHQELPTGATGSTPLATDEYVTPMATLPGVRDAHTIPSNTVNPLGCDTGRSKVGENSTQEPTSEED

Query:  PEDTQTIREMFQYKRREKKSSK
        PEDTQTIR+MFQYKRREKK  K
Subjt:  PEDTQTIREMFQYKRREKKSSK

XP_022158408.1 uncharacterized protein LOC111024897, partial [Momordica charantia]4.4e-26586.59Show/hide
Query:  MANIPPRDPVDPPTVNSNMRDHARNDEFNHIQMADNRDVAMREYATTAFQNFDSGIVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQI
        MA+IPPRDPVDPP VN NMRDHARNDEFN+IQMADNRDVAMREYA TAFQNFDSGIVNPIPAH NFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQI
Subjt:  MANIPPRDPVDPPTVNSNMRDHARNDEFNHIQMADNRDVAMREYATTAFQNFDSGIVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQI

Query:  ANAFRLPGITDDALRLTLFPFSLKDHARAWLNAFPPGSITTWGSLVEKFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKCPNNGLPACI
        ANAFRLPGITDDA  LTLFPFSLKD AR  LNAFP GSITTWGSLVEKFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKC N+GLPAC 
Subjt:  ANAFRLPGITDDALRLTLFPFSLKDHARAWLNAFPPGSITTWGSLVEKFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKCPNNGLPACI

Query:  QIEHFFRSLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQ
        QIEHFFR LDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLAT IQ
Subjt:  QIEHFFRSLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQ

Query:  PVQSDYCTLVPVCQVNDLIYSFCSENHIYDNCPHNPTSVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYISP
        PVQSDYCT  PVCQVNDLI            C                            WRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQ+I P
Subjt:  PVQSDYCTLVPVCQVNDLIYSFCSENHIYDNCPHNPTSVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYISP

Query:  PQQQYNQRTQTPPVQNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELMNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTD
        PQQQYNQRTQTPP+QNNNSNLENMMKEYMARTDAVIQSQAASMRNF TQLG LANEL NRPQGSFPGHTELP+REGKEQCKAVTLRSGL YDGPTMPTTD
Subjt:  PQQQYNQRTQTPPVQNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELMNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTD

Query:  VQIPSTEPIVKIPENPTTPEKENTRKGNEDTPSVPPQ
        VQIPST+P VKIPENPTTPEKEN RKGN+DT SVPPQ
Subjt:  VQIPSTEPIVKIPENPTTPEKENTRKGNEDTPSVPPQ

XP_022159235.1 uncharacterized protein LOC111025653 [Momordica charantia]7.8e-23854.82Show/hide
Query:  ARNDEFNHIQMADNRDVAMREYATTAFQNFDSGIVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPGITDDALRLTLFPFS
        A N   N I +AD RD AMR+YA    ++ +S ++N  PA A FE KPMM QML  IG FGG EHEDP  HLKSFI++AN FRLPGI+DDALRLTLFPFS
Subjt:  ARNDEFNHIQMADNRDVAMREYATTAFQNFDSGIVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPGITDDALRLTLFPFS

Query:  LKDHARAWLNAFPPGSITTWGSLVEKFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKCPNNGLPACIQIEHFFRSLDHPTKMMLNNAAN
        +   A AWLNAFP  +ITTW  +V+KFL K+FPPTR+AD+REEIISFRQ + E V+ AWERFK+LI  CPN G+PAC+QIEHFFR  D  TKMMLN AAN
Subjt:  LKDHARAWLNAFPPGSITTWGSLVEKFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKCPNNGLPACIQIEHFFRSLDHPTKMMLNNAAN

Query:  GAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTLVPVCQVNDLIYSF
        G FT K+FNEIV+IL+ L+ HN  WCS++SR   K+ DPAGVLALD  TSMQK++ T+ Q LK M     N  A       +      PV Q+ +    +
Subjt:  GAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTLVPVCQVNDLIYSF

Query:  CSENHIYDNCPHNPTSVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVP---PTQQYISPPQQQYN-QRTQTPPVQNNN
        C + H  +NCP NP+S++YVG  N + FNPYSNTYNPGW+ HPNFSW GQG S+    G +QQ K+ Y P   P      P   QYN Q+    P Q N 
Subjt:  CSENHIYDNCPHNPTSVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVP---PTQQYISPPQQQYN-QRTQTPPVQNNN

Query:  SNLENMMKEYMARTDAVIQS---------------------QAASMRNFETQLGQLANELMNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMP
        SN+E +MKE + + DA ++                         ++R  E QLGQL NE+  RPQGS P  TE P+R GKE C ++  RSGL Y+GP MP
Subjt:  SNLENMMKEYMARTDAVIQS---------------------QAASMRNFETQLGQLANELMNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMP

Query:  TTDVQIPSTEPIVKIPENPTTPEKENTRKGNEDTPSVP--PQVQPSFTPTPPFPQRLVKKNNDSQFRKFLDILKQLHINIPLVDALEQMSNYAKLLKDIV
              PS E      +    P+K       E   SVP  PQV  S  P PPFPQRLV+KN D+ FRKFLDILKQLHINIP V+ALEQM  YAK +KDI+
Subjt:  TTDVQIPSTEPIVKIPENPTTPEKENTRKGNEDTPSVP--PQVQPSFTPTPPFPQRLVKKNNDSQFRKFLDILKQLHINIPLVDALEQMSNYAKLLKDIV

Query:  SRKKKIREHQLVAMTKCSSEAVGSQLPMKCNDLGSFTIPCSIGGKNLGRALCDLGASINLMPLSVFKSLGIGEARPTTVTLQLADRS----EGKIEDVLV
        +RKKK+ E++ VA+T+CSS    S++P K  D GSFTIPC IGGK++GRALCDLGASINLMPLS+FK   IG+A PTTVTLQLADRS    EGKIEDVLV
Subjt:  SRKKKIREHQLVAMTKCSSEAVGSQLPMKCNDLGSFTIPCSIGGKNLGRALCDLGASINLMPLSVFKSLGIGEARPTTVTLQLADRS----EGKIEDVLV

Query:  QVDKFIFPANFIILDCEADLDVPIILGRPFLATGDTVFNVRKGEITMRVNNEEVKFNVLDAMKLPGDFEECSAITNFNPVMFDEFYDLLVTEIEEELDKI
        +VDKFIFP +FIILDCEAD DVPIILGRPFLATG+T+ +V+KGE+TMRV++++V FN+LDAMK   D EEC+ I+    +   E  DLL  EIE EL+  
Subjt:  QVDKFIFPANFIILDCEADLDVPIILGRPFLATGDTVFNVRKGEITMRVNNEEVKFNVLDAMKLPGDFEECSAITNFNPVMFDEFYDLLVTEIEEELDKI

Query:  AEGPEDVTNPVEKIQKEECKSLLPSIVEPP
         E  ++       ++KE+ KS+ P  +EPP
Subjt:  AEGPEDVTNPVEKIQKEECKSLLPSIVEPP

TrEMBL top hitse value%identityAlignment
A0A6J1DHH9 LOW QUALITY PROTEIN: uncharacterized protein LOC1110201103.7e-22594.23Show/hide
Query:  HTSLPTSSTQAPNATSIPFPPLENFQHHMVSDSRLAAVRGGNPLQTFECPPSQAPTQHPIMSPHGYVNFQQLPTLNIPQNSEFRAENPQQLPPMINPGMY
        HTSLPTSSTQAPNATSIPFPPLENFQHHMVSDSRLAAVRGGNPLQTFECPPSQAPTQHPIMSPHGYVNFQQLPT NIPQ+SEFRAENPQQLPPMINPGMY
Subjt:  HTSLPTSSTQAPNATSIPFPPLENFQHHMVSDSRLAAVRGGNPLQTFECPPSQAPTQHPIMSPHGYVNFQQLPTLNIPQNSEFRAENPQQLPPMINPGMY

Query:  QPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWIPPQNYGMISPKVPPPPQLPFLERGPQAPQLDPNILSTYNMGQLKPLEPPRM
        QPFMFN VPSYHFPLSQMQIPASVHPYGMPNPSTLFPSL PYYGMGHW+PP NYGMISP+VPPPPQL FLERGPQAPQL+PNILST+NMGQLKPLEPPRM
Subjt:  QPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWIPPQNYGMISPKVPPPPQLPFLERGPQAPQLDPNILSTYNMGQLKPLEPPRM

Query:  PTPTNMPMDAGDEHGGEQEKCHSHRLEPGVSIGQKRKGKEVMTDPEIEEDGSSRRLTPKDSTMENRDEEQFYSSSFIITPEDGNDDFLLVSRGHCSNMPE
        PTPTNMPMD GDEHGGEQEK HSHRLEPGV IGQKRKGKEVM DPEIEEDGSSRRLTPKDS+MENRDEEQFYSS  IIT EDGNDDFLLVSRGHCSNMPE
Subjt:  PTPTNMPMDAGDEHGGEQEKCHSHRLEPGVSIGQKRKGKEVMTDPEIEEDGSSRRLTPKDSTMENRDEEQFYSSSFIITPEDGNDDFLLVSRGHCSNMPE

Query:  TEDTENEVVRTDTQEPSPLDTPTEGAFCSHQELPTGATGSTPLATDEYVTPMATLPGVRDAHTIPSNTVNPLGCDTGRSKVGENSTQEPTSEEDPEDTQT
        TEDTENEVVRTDTQEPSPLDTPTEGAFCSHQELPTGATGS PLATDEYVT MATLPGVRDAHTIPSNTVNPLGCDTGRS+VGENSTQEPTSEED ED QT
Subjt:  TEDTENEVVRTDTQEPSPLDTPTEGAFCSHQELPTGATGSTPLATDEYVTPMATLPGVRDAHTIPSNTVNPLGCDTGRSKVGENSTQEPTSEEDPEDTQT

Query:  IREMFQYKRREKKSSK
        IREMFQYKRREK SSK
Subjt:  IREMFQYKRREKKSSK

A0A6J1DHH9 LOW QUALITY PROTEIN: uncharacterized protein LOC1110201109.3e-0383.87Show/hide
Query:  MSNYAKLLKDIVSRKKKIREHQLVAMTKCSS
        M NYAK LKDIVSRKKKI EH+LVAMTKCS+
Subjt:  MSNYAKLLKDIVSRKKKIREHQLVAMTKCSS

A0A6J1DHH9 LOW QUALITY PROTEIN: uncharacterized protein LOC1110201101.4e-21652.76Show/hide
Query:  ARNDEFNHIQMADNRDVAMREYATTAFQNFDSGIVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPGITDDALRLTLFPFS
        A N   N I +AD RD AMR+YA    ++ +S + N  PA A FE KPMM QML  IG F G EHEDP  HLKSFI++AN FRLPGI+DDALRLTLFPFS
Subjt:  ARNDEFNHIQMADNRDVAMREYATTAFQNFDSGIVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPGITDDALRLTLFPFS

Query:  LKDHARAWLNAFPPGSITTWGSLVEKFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKCPNNGLPACIQIEHFFRSLDHPTKMMLNNAAN
        L   A AWLNAFP  +I T   +V+KFL K+FPPTR+AD+REEIISFRQ + E V+ AWERFK+LIR CPN G+PAC+QIEHFFR  D PT MMLN AAN
Subjt:  LKDHARAWLNAFPPGSITTWGSLVEKFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKCPNNGLPACIQIEHFFRSLDHPTKMMLNNAAN

Query:  GAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTLVPVCQVNDLIYSF
        G FT K+FNEIV+IL+ L+ HN+ WCS++ R   K+ DPA VLALD  TSMQK++ T+ Q LK M    KN  A  + P  +                  
Subjt:  GAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTLVPVCQVNDLIYSF

Query:  CSENHIYDNCPHNPTSVFYVGHG----NNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVP---PTQQYISPPQQQYN-QRTQTPPV
                    NP+ V+ +       N + FNPYSN YNPGW+ HPNFSW GQG SSG   GQ+QQ KQ Y P   P      P  QQYN Q+    P 
Subjt:  CSENHIYDNCPHNPTSVFYVGHG----NNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVP---PTQQYISPPQQQYN-QRTQTPPV

Query:  QNNNSNLENMMKEY-----------MARTDAVIQSQA----------ASMRNFETQLGQLANELMNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDG
        Q N SN+E +MKE+           M RTDA I+              ++RN E QLGQLANE+  RPQGS P  TE P+R              +    
Subjt:  QNNNSNLENMMKEY-----------MARTDAVIQSQA----------ASMRNFETQLGQLANELMNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDG

Query:  PTMPTTDVQIPSTEPIVKIPENPTTPEKENTRKGNEDTPSVPPQVQPSFTPTPPFPQRLVKKNNDSQFRKFLDILKQLHINIPLVDALEQMSNYAKLLKD
        P+    D Q+        +P+    PE          + SV PQV    +P PPFPQRLV+KN D+ FRKFLDILKQLHINIP V+ALEQM  YAK LKD
Subjt:  PTMPTTDVQIPSTEPIVKIPENPTTPEKENTRKGNEDTPSVPPQVQPSFTPTPPFPQRLVKKNNDSQFRKFLDILKQLHINIPLVDALEQMSNYAKLLKD

Query:  IVSRKKKIREHQLVAMTKCSSEAVGSQLPMKCNDLGSFTIPCSIGGKNLGRALCDLGASINLMPLSVFKSLGIGEARPTTVTLQLADRS----EGKIEDV
        I++RKKK+ E++ VA+T+CSS    S+ P K  D GSFTI C IGGK++GRALCDLGA INLMPLS+FK L IG+A PTTVTL LADRS    EGKIEDV
Subjt:  IVSRKKKIREHQLVAMTKCSSEAVGSQLPMKCNDLGSFTIPCSIGGKNLGRALCDLGASINLMPLSVFKSLGIGEARPTTVTLQLADRS----EGKIEDV

Query:  LVQVDKFIFPANFIILDCEADLDVPIILGRPFLATGDTVFNVRKGEITMRVNNEEVKFNVLDAMKLPGDFEECSAITNFNPVMFDEFYDLLVTEIEEELD
        LV+VDKFIFPA+FIILDCEAD DVPIILGRPFLATG+T+ +V+KGE+TMRV++++V FN+LDAMK P D EEC  I+    +   E  DLL  EIE EL+
Subjt:  LVQVDKFIFPANFIILDCEADLDVPIILGRPFLATGDTVFNVRKGEITMRVNNEEVKFNVLDAMKLPGDFEECSAITNFNPVMFDEFYDLLVTEIEEELD

Query:  KIAEGPEDVTNPVEKIQKEECKSLLPSIVEPP
           E  ++    +  ++KE+ KS+ P  +EPP
Subjt:  KIAEGPEDVTNPVEKIQKEECKSLLPSIVEPP

A0A6J1DW02 uncharacterized protein LOC1110248972.1e-26586.59Show/hide
Query:  MANIPPRDPVDPPTVNSNMRDHARNDEFNHIQMADNRDVAMREYATTAFQNFDSGIVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQI
        MA+IPPRDPVDPP VN NMRDHARNDEFN+IQMADNRDVAMREYA TAFQNFDSGIVNPIPAH NFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQI
Subjt:  MANIPPRDPVDPPTVNSNMRDHARNDEFNHIQMADNRDVAMREYATTAFQNFDSGIVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQI

Query:  ANAFRLPGITDDALRLTLFPFSLKDHARAWLNAFPPGSITTWGSLVEKFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKCPNNGLPACI
        ANAFRLPGITDDA  LTLFPFSLKD AR  LNAFP GSITTWGSLVEKFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKC N+GLPAC 
Subjt:  ANAFRLPGITDDALRLTLFPFSLKDHARAWLNAFPPGSITTWGSLVEKFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKCPNNGLPACI

Query:  QIEHFFRSLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQ
        QIEHFFR LDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLAT IQ
Subjt:  QIEHFFRSLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQ

Query:  PVQSDYCTLVPVCQVNDLIYSFCSENHIYDNCPHNPTSVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYISP
        PVQSDYCT  PVCQVNDLI            C                            WRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQ+I P
Subjt:  PVQSDYCTLVPVCQVNDLIYSFCSENHIYDNCPHNPTSVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYISP

Query:  PQQQYNQRTQTPPVQNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELMNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTD
        PQQQYNQRTQTPP+QNNNSNLENMMKEYMARTDAVIQSQAASMRNF TQLG LANEL NRPQGSFPGHTELP+REGKEQCKAVTLRSGL YDGPTMPTTD
Subjt:  PQQQYNQRTQTPPVQNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELMNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTD

Query:  VQIPSTEPIVKIPENPTTPEKENTRKGNEDTPSVPPQ
        VQIPST+P VKIPENPTTPEKEN RKGN+DT SVPPQ
Subjt:  VQIPSTEPIVKIPENPTTPEKENTRKGNEDTPSVPPQ

A0A6J1DX11 uncharacterized protein LOC1110248609.6e-24298.82Show/hide
Query:  MASSSQHTSLPTSSTQAPNATSIPFPPLENFQHHMVSDSRLAAVRGGNPLQTFECPPSQAPTQHPIMSPHGYVNFQQLPTLNIPQNSEFRAENPQQLPPM
        MASSSQHTSLPTSST APNATSIPFPPLENFQHHM SDSRLAAVRGGNPLQTFECPPSQAPTQHPIMSPHGYVNFQQLPTLNIPQNSEFRAENPQQLPPM
Subjt:  MASSSQHTSLPTSSTQAPNATSIPFPPLENFQHHMVSDSRLAAVRGGNPLQTFECPPSQAPTQHPIMSPHGYVNFQQLPTLNIPQNSEFRAENPQQLPPM

Query:  INPGMYQPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWIPPQNYGMISPKVPPPPQLPFLERGPQAPQLDPNILSTYNMGQLKP
        INPGMYQPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWIPPQNYGMISPKVPPPPQLPFLERGPQAPQLDPNILSTYNMGQLKP
Subjt:  INPGMYQPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWIPPQNYGMISPKVPPPPQLPFLERGPQAPQLDPNILSTYNMGQLKP

Query:  LEPPRMPTPTNMPMDAGDEHGGEQEKCHSHRLEPGVSIGQKRKGKEVMTDPEIEEDGSSRRLTPKDSTMENRDEEQFYSSSFIITPEDGNDDFLLVSRGH
        LEPPRMPTPTNMPMDAGDEHGGEQEKCHSHRLEPGVSIGQKRKGKEVMTDPEIEEDGSSRRLTPKDSTMENRDEEQFYSSSFIITPEDGNDDFLLVSRGH
Subjt:  LEPPRMPTPTNMPMDAGDEHGGEQEKCHSHRLEPGVSIGQKRKGKEVMTDPEIEEDGSSRRLTPKDSTMENRDEEQFYSSSFIITPEDGNDDFLLVSRGH

Query:  CSNMPETEDTENEVVRTDTQEPSPLDTPTEGAFCSHQELPTGATGSTPLATDEYVTPMATLPGVRDAHTIPSNTVNPLGCDTGRSKVGENSTQEPTSEED
        CSNMPETEDTENEVVRTDTQEPSPLDTPTEGAFCSHQELPTGATGSTPLATDEYVTPMATLPGVRDAHTIPSNTVNPLGCDTGRSKVGENSTQEPTSEED
Subjt:  CSNMPETEDTENEVVRTDTQEPSPLDTPTEGAFCSHQELPTGATGSTPLATDEYVTPMATLPGVRDAHTIPSNTVNPLGCDTGRSKVGENSTQEPTSEED

Query:  PEDTQTIREMFQYKRREKKSSK
        PEDTQTIR+MFQYKRREKK  K
Subjt:  PEDTQTIREMFQYKRREKKSSK

A0A6J1DY39 uncharacterized protein LOC1110256533.8e-23854.82Show/hide
Query:  ARNDEFNHIQMADNRDVAMREYATTAFQNFDSGIVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPGITDDALRLTLFPFS
        A N   N I +AD RD AMR+YA    ++ +S ++N  PA A FE KPMM QML  IG FGG EHEDP  HLKSFI++AN FRLPGI+DDALRLTLFPFS
Subjt:  ARNDEFNHIQMADNRDVAMREYATTAFQNFDSGIVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPGITDDALRLTLFPFS

Query:  LKDHARAWLNAFPPGSITTWGSLVEKFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKCPNNGLPACIQIEHFFRSLDHPTKMMLNNAAN
        +   A AWLNAFP  +ITTW  +V+KFL K+FPPTR+AD+REEIISFRQ + E V+ AWERFK+LI  CPN G+PAC+QIEHFFR  D  TKMMLN AAN
Subjt:  LKDHARAWLNAFPPGSITTWGSLVEKFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKCPNNGLPACIQIEHFFRSLDHPTKMMLNNAAN

Query:  GAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTLVPVCQVNDLIYSF
        G FT K+FNEIV+IL+ L+ HN  WCS++SR   K+ DPAGVLALD  TSMQK++ T+ Q LK M     N  A       +      PV Q+ +    +
Subjt:  GAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTLVPVCQVNDLIYSF

Query:  CSENHIYDNCPHNPTSVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVP---PTQQYISPPQQQYN-QRTQTPPVQNNN
        C + H  +NCP NP+S++YVG  N + FNPYSNTYNPGW+ HPNFSW GQG S+    G +QQ K+ Y P   P      P   QYN Q+    P Q N 
Subjt:  CSENHIYDNCPHNPTSVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVP---PTQQYISPPQQQYN-QRTQTPPVQNNN

Query:  SNLENMMKEYMARTDAVIQS---------------------QAASMRNFETQLGQLANELMNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMP
        SN+E +MKE + + DA ++                         ++R  E QLGQL NE+  RPQGS P  TE P+R GKE C ++  RSGL Y+GP MP
Subjt:  SNLENMMKEYMARTDAVIQS---------------------QAASMRNFETQLGQLANELMNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMP

Query:  TTDVQIPSTEPIVKIPENPTTPEKENTRKGNEDTPSVP--PQVQPSFTPTPPFPQRLVKKNNDSQFRKFLDILKQLHINIPLVDALEQMSNYAKLLKDIV
              PS E      +    P+K       E   SVP  PQV  S  P PPFPQRLV+KN D+ FRKFLDILKQLHINIP V+ALEQM  YAK +KDI+
Subjt:  TTDVQIPSTEPIVKIPENPTTPEKENTRKGNEDTPSVP--PQVQPSFTPTPPFPQRLVKKNNDSQFRKFLDILKQLHINIPLVDALEQMSNYAKLLKDIV

Query:  SRKKKIREHQLVAMTKCSSEAVGSQLPMKCNDLGSFTIPCSIGGKNLGRALCDLGASINLMPLSVFKSLGIGEARPTTVTLQLADRS----EGKIEDVLV
        +RKKK+ E++ VA+T+CSS    S++P K  D GSFTIPC IGGK++GRALCDLGASINLMPLS+FK   IG+A PTTVTLQLADRS    EGKIEDVLV
Subjt:  SRKKKIREHQLVAMTKCSSEAVGSQLPMKCNDLGSFTIPCSIGGKNLGRALCDLGASINLMPLSVFKSLGIGEARPTTVTLQLADRS----EGKIEDVLV

Query:  QVDKFIFPANFIILDCEADLDVPIILGRPFLATGDTVFNVRKGEITMRVNNEEVKFNVLDAMKLPGDFEECSAITNFNPVMFDEFYDLLVTEIEEELDKI
        +VDKFIFP +FIILDCEAD DVPIILGRPFLATG+T+ +V+KGE+TMRV++++V FN+LDAMK   D EEC+ I+    +   E  DLL  EIE EL+  
Subjt:  QVDKFIFPANFIILDCEADLDVPIILGRPFLATGDTVFNVRKGEITMRVNNEEVKFNVLDAMKLPGDFEECSAITNFNPVMFDEFYDLLVTEIEEELDKI

Query:  AEGPEDVTNPVEKIQKEECKSLLPSIVEPP
         E  ++       ++KE+ KS+ P  +EPP
Subjt:  AEGPEDVTNPVEKIQKEECKSLLPSIVEPP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAAATATTCCACCTCGTGATCCGGTTGATCCACCTACTGTTAACAGTAATATGAGGGATCATGCAAGAAATGATGAATTCAACCATATCCAGATGGCGGACAACAG
AGACGTGGCAATGCGAGAATATGCCACCACGGCTTTTCAGAACTTTGATTCAGGGATAGTCAACCCTATTCCAGCCCACGCAAACTTTGAGCTTAAACCAATGATGTTCC
AAATGTTGCAGACAATTGGACATTTTGGAGGGCAGGAACATGAGGATCCACATGATCATCTGAAATCATTCATTCAAATTGCAAATGCATTTCGATTACCTGGTATAACA
GACGATGCTCTTAGACTAACACTTTTTCCATTTTCTTTGAAGGACCATGCTAGAGCATGGCTTAATGCATTTCCACCGGGATCCATCACCACATGGGGGTCGTTAGTGGA
GAAGTTCTTAACAAAATTCTTCCCACCTACTCGCCACGCTGATATCAGGGAGGAGATCATCTCCTTTAGACAGTATGATCGTGAACCTGTTCACGAGGCGTGGGAAAGAT
TTAAAGAACTAATCAGGAAATGTCCGAACAATGGCTTGCCGGCATGCATCCAGATAGAACATTTCTTTAGAAGTTTAGATCATCCTACTAAGATGATGCTAAACAATGCT
GCCAACGGAGCCTTTACAAAGAAGACATTCAACGAGATAGTCGACATCCTAAATGACTTAGCTTCACACAACGAACTATGGTGTTCGCAAAGATCTAGGGCAGCACCAAA
GAAGCAAGATCCAGCTGGAGTTTTGGCTCTGGACATTGCGACCTCGATGCAAAAAGAGATGGTTACAATGAACCAGAGGCTAAAAGAGATGGCATTGGGAATAAAAAATC
CATTAGCCACGCCGATACAACCTGTGCAGTCGGATTATTGCACTCTTGTCCCTGTTTGCCAAGTCAACGATCTCATTTATTCATTTTGCAGTGAAAACCATATCTATGAT
AATTGTCCACATAACCCTACTTCCGTTTTTTATGTAGGACATGGGAACAATAGGAACTTTAACCCATATTCGAACACCTACAACCCAGGTTGGAGGCACCACCCTAATTT
CTCATGGGGAGGTCAAGGAGGTTCGAGCGGTTTTAATCAAGGGCAAAGCCAGCAGAACAAGCAGCCCTATGTTCCACCTACACAGCAATACATCTCGCCGCCGCAACAAC
AGTACAATCAGAGAACACAGACTCCACCAGTTCAAAATAACAACTCAAATCTTGAGAATATGATGAAGGAGTACATGGCCCGAACCGACGCAGTGATACAATCCCAAGCG
GCATCAATGAGGAATTTCGAGACCCAATTGGGACAGCTCGCCAATGAATTGATGAATAGACCACAAGGTTCTTTTCCAGGCCATACTGAATTACCAAAACGAGAAGGGAA
AGAACAGTGCAAAGCTGTCACCCTTAGAAGTGGACTGGCATATGATGGACCAACAATGCCAACAACAGATGTACAGATTCCGTCCACTGAACCAATTGTAAAGATACCAG
AAAATCCAACAACACCAGAAAAAGAAAATACTAGAAAAGGTAATGAGGACACCCCGAGTGTTCCTCCACAGGTACAACCTTCTTTTACCCCTACTCCCCCGTTTCCACAA
AGATTAGTTAAGAAAAATAATGATTCCCAGTTTAGAAAATTTTTAGATATTTTAAAACAATTGCATATAAATATACCATTAGTGGATGCTCTAGAACAGATGTCAAATTA
TGCTAAGCTTTTGAAAGATATAGTTTCTAGGAAGAAAAAGATAAGAGAGCATCAACTGGTAGCCATGACAAAATGTAGTAGTGAAGCTGTAGGCAGCCAGCTACCCATGA
AGTGTAACGATCTTGGCAGTTTTACCATTCCATGTTCCATAGGAGGAAAAAACTTAGGCAGGGCATTATGTGATCTAGGCGCTAGCATTAATCTTATGCCATTATCAGTT
TTCAAATCTTTAGGCATAGGAGAAGCTAGACCTACTACGGTTACCCTCCAACTAGCAGATAGATCTGAGGGAAAGATAGAAGATGTTTTAGTCCAGGTGGACAAATTCAT
TTTCCCTGCGAACTTTATCATTCTAGATTGCGAGGCGGACTTAGACGTTCCCATTATTTTGGGAAGACCATTTCTAGCCACTGGGGATACAGTTTTTAATGTGAGGAAGG
GAGAAATTACAATGAGGGTAAATAATGAAGAAGTTAAATTTAACGTTCTAGACGCCATGAAATTACCAGGAGACTTTGAAGAATGCTCTGCTATAACTAACTTTAATCCT
GTTATGTTTGATGAGTTTTATGACTTGTTAGTTACAGAGATTGAAGAAGAGCTTGATAAGATAGCAGAAGGACCGGAAGATGTGACTAATCCTGTTGAAAAAATACAAAA
AGAAGAATGCAAGTCGTTACTTCCGTCCATAGTGGAACCACCCACGCTGGAGCAGAAGCCATTACCGTCGCATTTGAAATATGCGTATCTAGGGGATAACGACACTTTAC
CAGTTATTATAGCTTCCAATTTATCACCTACTAATGAATATTCTTTATTGCAGATTTTGGAGAAGCACAAAAAGGCCATTGGATGGACGATAACAGATATCCGAGGGATA
AGCCCGACCTTTTGCATGCACAAAATCTTATTGGAAGAAGATGCTAAGAACTTTATCGAGAATTTTCAGCCACCACCAAAAAGAATAAGAAATTTCCAGCATTTGCTTAC
ATCTATGGCTTCTTCTTCTCAACATACTTCACTTCCTACCTCTTCTACTCAAGCTCCAAATGCCACAAGCATTCCCTTTCCACCATTGGAGAACTTCCAACACCACATGG
TTTCAGATTCTAGGCTAGCTGCTGTTAGGGGAGGAAACCCGCTTCAAACATTTGAATGTCCTCCATCCCAAGCTCCTACACAACATCCTATAATGAGCCCACATGGTTAT
GTTAATTTTCAGCAACTACCCACCCTTAATATACCTCAAAACAGTGAGTTTAGGGCTGAAAATCCTCAACAACTTCCTCCAATGATCAATCCGGGTATGTACCAACCGTT
TATGTTTAACCCGGTTCCTTCCTATCATTTTCCCTTGTCCCAAATGCAAATTCCAGCTAGTGTTCATCCTTATGGAATGCCAAACCCATCTACTCTATTTCCTAGCCTTC
CACCTTATTATGGAATGGGTCATTGGATACCTCCACAAAATTATGGGATGATTTCACCTAAGGTTCCCCCACCACCTCAACTTCCATTCCTAGAAAGAGGACCTCAAGCA
CCACAATTGGACCCTAACATATTGAGTACCTACAACATGGGACAACTAAAACCCTTAGAGCCTCCTAGGATGCCAACCCCAACCAATATGCCAATGGATGCAGGAGATGA
GCATGGAGGAGAGCAAGAGAAGTGCCATAGCCACAGGCTAGAACCCGGGGTTTCGATAGGGCAAAAGAGGAAGGGCAAGGAGGTGATGACAGACCCAGAAATTGAGGAAG
ATGGGAGTAGTAGACGTCTGACGCCTAAGGATTCGACTATGGAGAACAGGGACGAGGAGCAGTTTTATTCATCTTCATTTATTATTACACCGGAAGATGGTAATGATGAT
TTTCTACTGGTTTCACGAGGCCATTGTTCAAATATGCCGGAAACAGAGGATACCGAGAACGAAGTGGTGAGAACAGATACTCAGGAACCTTCCCCATTGGATACACCTAC
AGAAGGAGCGTTTTGTAGTCACCAAGAGTTGCCTACTGGGGCAACTGGATCTACTCCGTTGGCAACGGATGAGTATGTCACACCGATGGCCACTCTACCAGGGGTAAGGG
ATGCTCACACTATTCCTTCTAATACAGTTAACCCGCTTGGGTGTGATACAGGTAGATCCAAGGTTGGGGAGAATAGTACTCAAGAACCTACCAGCGAAGAAGATCCTGAG
GACACGCAGACGATAAGGGAAATGTTCCAATACAAGAGGCGGGAGAAAAAGAGTTCAAAACGTCGTGCAGTTCAGGCTAAGAAGCCGACAGTGCCCATGAATGAACCTAA
AACGAGAGCTGCAAAAGCTAAAGCAGCTGAAGCTAAGAAGAAGGTAGTGGCACCTGGGCCAGTTGATACAATCAAACTGGACTTGTCTGAGGGAGAGGAGGTCGAGACGA
AATGGAACGCGGCAAAATTTAGCCACTCGAACTTCATTAATGAAATCCTAGATGCTGTGGGCAATAAGATGTTAGTGACTCCGACTCTAGCACAGCTTGATGAGGCTCTA
GAATGTGTTGGGAAGCCCTCTGCCACTTGGGATTTGACTACTCATGGCAAGGATAGGGCACTGCTGGTTTATGCCATGCTAAAGGGCGTAGATGTGAATTATGGAGAATT
GATCAATACCAGTATCCATGAGTGTGCCCACCGGACACGTGGTAAGCTTTATCACCCACGTTTCGTCACTTCTTTATGCTTGCGACAAGGTGTACAGCTCCCTGAGGATC
AAATTAAGAGAGATGCCCCAATTGTGGAAGAGAAGAATATTCGGCGTATTATCGCCCATGCGTTACAAAGAAGGGAAGGTACTGGGACGTCTCCTACATCGAAGATCCGT
CGTCTTCGAGAGGAGAACCAACAGCTGCGAGATCAAGTTCGAGAAGTCGTGCAACATATCTACAACTTGAGGGCTTCATTGGATTTTGCGGTTTTACCTTCATGGCCTCC
AGCGCTAGCTGCTATCCTTGGTCATCCATCTTCCAGTACCGACACTGATCCTAGTCCACAACCTCCGACTTCATGA
mRNA sequenceShow/hide mRNA sequence
ATGGCAAATATTCCACCTCGTGATCCGGTTGATCCACCTACTGTTAACAGTAATATGAGGGATCATGCAAGAAATGATGAATTCAACCATATCCAGATGGCGGACAACAG
AGACGTGGCAATGCGAGAATATGCCACCACGGCTTTTCAGAACTTTGATTCAGGGATAGTCAACCCTATTCCAGCCCACGCAAACTTTGAGCTTAAACCAATGATGTTCC
AAATGTTGCAGACAATTGGACATTTTGGAGGGCAGGAACATGAGGATCCACATGATCATCTGAAATCATTCATTCAAATTGCAAATGCATTTCGATTACCTGGTATAACA
GACGATGCTCTTAGACTAACACTTTTTCCATTTTCTTTGAAGGACCATGCTAGAGCATGGCTTAATGCATTTCCACCGGGATCCATCACCACATGGGGGTCGTTAGTGGA
GAAGTTCTTAACAAAATTCTTCCCACCTACTCGCCACGCTGATATCAGGGAGGAGATCATCTCCTTTAGACAGTATGATCGTGAACCTGTTCACGAGGCGTGGGAAAGAT
TTAAAGAACTAATCAGGAAATGTCCGAACAATGGCTTGCCGGCATGCATCCAGATAGAACATTTCTTTAGAAGTTTAGATCATCCTACTAAGATGATGCTAAACAATGCT
GCCAACGGAGCCTTTACAAAGAAGACATTCAACGAGATAGTCGACATCCTAAATGACTTAGCTTCACACAACGAACTATGGTGTTCGCAAAGATCTAGGGCAGCACCAAA
GAAGCAAGATCCAGCTGGAGTTTTGGCTCTGGACATTGCGACCTCGATGCAAAAAGAGATGGTTACAATGAACCAGAGGCTAAAAGAGATGGCATTGGGAATAAAAAATC
CATTAGCCACGCCGATACAACCTGTGCAGTCGGATTATTGCACTCTTGTCCCTGTTTGCCAAGTCAACGATCTCATTTATTCATTTTGCAGTGAAAACCATATCTATGAT
AATTGTCCACATAACCCTACTTCCGTTTTTTATGTAGGACATGGGAACAATAGGAACTTTAACCCATATTCGAACACCTACAACCCAGGTTGGAGGCACCACCCTAATTT
CTCATGGGGAGGTCAAGGAGGTTCGAGCGGTTTTAATCAAGGGCAAAGCCAGCAGAACAAGCAGCCCTATGTTCCACCTACACAGCAATACATCTCGCCGCCGCAACAAC
AGTACAATCAGAGAACACAGACTCCACCAGTTCAAAATAACAACTCAAATCTTGAGAATATGATGAAGGAGTACATGGCCCGAACCGACGCAGTGATACAATCCCAAGCG
GCATCAATGAGGAATTTCGAGACCCAATTGGGACAGCTCGCCAATGAATTGATGAATAGACCACAAGGTTCTTTTCCAGGCCATACTGAATTACCAAAACGAGAAGGGAA
AGAACAGTGCAAAGCTGTCACCCTTAGAAGTGGACTGGCATATGATGGACCAACAATGCCAACAACAGATGTACAGATTCCGTCCACTGAACCAATTGTAAAGATACCAG
AAAATCCAACAACACCAGAAAAAGAAAATACTAGAAAAGGTAATGAGGACACCCCGAGTGTTCCTCCACAGGTACAACCTTCTTTTACCCCTACTCCCCCGTTTCCACAA
AGATTAGTTAAGAAAAATAATGATTCCCAGTTTAGAAAATTTTTAGATATTTTAAAACAATTGCATATAAATATACCATTAGTGGATGCTCTAGAACAGATGTCAAATTA
TGCTAAGCTTTTGAAAGATATAGTTTCTAGGAAGAAAAAGATAAGAGAGCATCAACTGGTAGCCATGACAAAATGTAGTAGTGAAGCTGTAGGCAGCCAGCTACCCATGA
AGTGTAACGATCTTGGCAGTTTTACCATTCCATGTTCCATAGGAGGAAAAAACTTAGGCAGGGCATTATGTGATCTAGGCGCTAGCATTAATCTTATGCCATTATCAGTT
TTCAAATCTTTAGGCATAGGAGAAGCTAGACCTACTACGGTTACCCTCCAACTAGCAGATAGATCTGAGGGAAAGATAGAAGATGTTTTAGTCCAGGTGGACAAATTCAT
TTTCCCTGCGAACTTTATCATTCTAGATTGCGAGGCGGACTTAGACGTTCCCATTATTTTGGGAAGACCATTTCTAGCCACTGGGGATACAGTTTTTAATGTGAGGAAGG
GAGAAATTACAATGAGGGTAAATAATGAAGAAGTTAAATTTAACGTTCTAGACGCCATGAAATTACCAGGAGACTTTGAAGAATGCTCTGCTATAACTAACTTTAATCCT
GTTATGTTTGATGAGTTTTATGACTTGTTAGTTACAGAGATTGAAGAAGAGCTTGATAAGATAGCAGAAGGACCGGAAGATGTGACTAATCCTGTTGAAAAAATACAAAA
AGAAGAATGCAAGTCGTTACTTCCGTCCATAGTGGAACCACCCACGCTGGAGCAGAAGCCATTACCGTCGCATTTGAAATATGCGTATCTAGGGGATAACGACACTTTAC
CAGTTATTATAGCTTCCAATTTATCACCTACTAATGAATATTCTTTATTGCAGATTTTGGAGAAGCACAAAAAGGCCATTGGATGGACGATAACAGATATCCGAGGGATA
AGCCCGACCTTTTGCATGCACAAAATCTTATTGGAAGAAGATGCTAAGAACTTTATCGAGAATTTTCAGCCACCACCAAAAAGAATAAGAAATTTCCAGCATTTGCTTAC
ATCTATGGCTTCTTCTTCTCAACATACTTCACTTCCTACCTCTTCTACTCAAGCTCCAAATGCCACAAGCATTCCCTTTCCACCATTGGAGAACTTCCAACACCACATGG
TTTCAGATTCTAGGCTAGCTGCTGTTAGGGGAGGAAACCCGCTTCAAACATTTGAATGTCCTCCATCCCAAGCTCCTACACAACATCCTATAATGAGCCCACATGGTTAT
GTTAATTTTCAGCAACTACCCACCCTTAATATACCTCAAAACAGTGAGTTTAGGGCTGAAAATCCTCAACAACTTCCTCCAATGATCAATCCGGGTATGTACCAACCGTT
TATGTTTAACCCGGTTCCTTCCTATCATTTTCCCTTGTCCCAAATGCAAATTCCAGCTAGTGTTCATCCTTATGGAATGCCAAACCCATCTACTCTATTTCCTAGCCTTC
CACCTTATTATGGAATGGGTCATTGGATACCTCCACAAAATTATGGGATGATTTCACCTAAGGTTCCCCCACCACCTCAACTTCCATTCCTAGAAAGAGGACCTCAAGCA
CCACAATTGGACCCTAACATATTGAGTACCTACAACATGGGACAACTAAAACCCTTAGAGCCTCCTAGGATGCCAACCCCAACCAATATGCCAATGGATGCAGGAGATGA
GCATGGAGGAGAGCAAGAGAAGTGCCATAGCCACAGGCTAGAACCCGGGGTTTCGATAGGGCAAAAGAGGAAGGGCAAGGAGGTGATGACAGACCCAGAAATTGAGGAAG
ATGGGAGTAGTAGACGTCTGACGCCTAAGGATTCGACTATGGAGAACAGGGACGAGGAGCAGTTTTATTCATCTTCATTTATTATTACACCGGAAGATGGTAATGATGAT
TTTCTACTGGTTTCACGAGGCCATTGTTCAAATATGCCGGAAACAGAGGATACCGAGAACGAAGTGGTGAGAACAGATACTCAGGAACCTTCCCCATTGGATACACCTAC
AGAAGGAGCGTTTTGTAGTCACCAAGAGTTGCCTACTGGGGCAACTGGATCTACTCCGTTGGCAACGGATGAGTATGTCACACCGATGGCCACTCTACCAGGGGTAAGGG
ATGCTCACACTATTCCTTCTAATACAGTTAACCCGCTTGGGTGTGATACAGGTAGATCCAAGGTTGGGGAGAATAGTACTCAAGAACCTACCAGCGAAGAAGATCCTGAG
GACACGCAGACGATAAGGGAAATGTTCCAATACAAGAGGCGGGAGAAAAAGAGTTCAAAACGTCGTGCAGTTCAGGCTAAGAAGCCGACAGTGCCCATGAATGAACCTAA
AACGAGAGCTGCAAAAGCTAAAGCAGCTGAAGCTAAGAAGAAGGTAGTGGCACCTGGGCCAGTTGATACAATCAAACTGGACTTGTCTGAGGGAGAGGAGGTCGAGACGA
AATGGAACGCGGCAAAATTTAGCCACTCGAACTTCATTAATGAAATCCTAGATGCTGTGGGCAATAAGATGTTAGTGACTCCGACTCTAGCACAGCTTGATGAGGCTCTA
GAATGTGTTGGGAAGCCCTCTGCCACTTGGGATTTGACTACTCATGGCAAGGATAGGGCACTGCTGGTTTATGCCATGCTAAAGGGCGTAGATGTGAATTATGGAGAATT
GATCAATACCAGTATCCATGAGTGTGCCCACCGGACACGTGGTAAGCTTTATCACCCACGTTTCGTCACTTCTTTATGCTTGCGACAAGGTGTACAGCTCCCTGAGGATC
AAATTAAGAGAGATGCCCCAATTGTGGAAGAGAAGAATATTCGGCGTATTATCGCCCATGCGTTACAAAGAAGGGAAGGTACTGGGACGTCTCCTACATCGAAGATCCGT
CGTCTTCGAGAGGAGAACCAACAGCTGCGAGATCAAGTTCGAGAAGTCGTGCAACATATCTACAACTTGAGGGCTTCATTGGATTTTGCGGTTTTACCTTCATGGCCTCC
AGCGCTAGCTGCTATCCTTGGTCATCCATCTTCCAGTACCGACACTGATCCTAGTCCACAACCTCCGACTTCATGA
Protein sequenceShow/hide protein sequence
MANIPPRDPVDPPTVNSNMRDHARNDEFNHIQMADNRDVAMREYATTAFQNFDSGIVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPGIT
DDALRLTLFPFSLKDHARAWLNAFPPGSITTWGSLVEKFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKCPNNGLPACIQIEHFFRSLDHPTKMMLNNA
ANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTLVPVCQVNDLIYSFCSENHIYD
NCPHNPTSVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYISPPQQQYNQRTQTPPVQNNNSNLENMMKEYMARTDAVIQSQA
ASMRNFETQLGQLANELMNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENTRKGNEDTPSVPPQVQPSFTPTPPFPQ
RLVKKNNDSQFRKFLDILKQLHINIPLVDALEQMSNYAKLLKDIVSRKKKIREHQLVAMTKCSSEAVGSQLPMKCNDLGSFTIPCSIGGKNLGRALCDLGASINLMPLSV
FKSLGIGEARPTTVTLQLADRSEGKIEDVLVQVDKFIFPANFIILDCEADLDVPIILGRPFLATGDTVFNVRKGEITMRVNNEEVKFNVLDAMKLPGDFEECSAITNFNP
VMFDEFYDLLVTEIEEELDKIAEGPEDVTNPVEKIQKEECKSLLPSIVEPPTLEQKPLPSHLKYAYLGDNDTLPVIIASNLSPTNEYSLLQILEKHKKAIGWTITDIRGI
SPTFCMHKILLEEDAKNFIENFQPPPKRIRNFQHLLTSMASSSQHTSLPTSSTQAPNATSIPFPPLENFQHHMVSDSRLAAVRGGNPLQTFECPPSQAPTQHPIMSPHGY
VNFQQLPTLNIPQNSEFRAENPQQLPPMINPGMYQPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWIPPQNYGMISPKVPPPPQLPFLERGPQA
PQLDPNILSTYNMGQLKPLEPPRMPTPTNMPMDAGDEHGGEQEKCHSHRLEPGVSIGQKRKGKEVMTDPEIEEDGSSRRLTPKDSTMENRDEEQFYSSSFIITPEDGNDD
FLLVSRGHCSNMPETEDTENEVVRTDTQEPSPLDTPTEGAFCSHQELPTGATGSTPLATDEYVTPMATLPGVRDAHTIPSNTVNPLGCDTGRSKVGENSTQEPTSEEDPE
DTQTIREMFQYKRREKKSSKRRAVQAKKPTVPMNEPKTRAAKAKAAEAKKKVVAPGPVDTIKLDLSEGEEVETKWNAAKFSHSNFINEILDAVGNKMLVTPTLAQLDEAL
ECVGKPSATWDLTTHGKDRALLVYAMLKGVDVNYGELINTSIHECAHRTRGKLYHPRFVTSLCLRQGVQLPEDQIKRDAPIVEEKNIRRIIAHALQRREGTGTSPTSKIR
RLREENQQLRDQVREVVQHIYNLRASLDFAVLPSWPPALAAILGHPSSSTDTDPSPQPPTS