; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0012191 (gene) of Snake gourd v1 genome

Gene IDTan0012191
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionReverse transcriptase
Genome locationLG02:83448090..83450194
RNA-Seq ExpressionTan0012191
SyntenyTan0012191
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_017233063.1 PREDICTED: uncharacterized protein LOC108207110 [Daucus carota subsp. sativus]3.1e-16150.61Show/hide
Query:  MNEEHDPELQPLDPEISRTFHCRRREQGGRAEVEGMVEPLLRIDNNGGTRVVE------DERDQIIQEYAVP---VLIPGIMPPEIEAQHFELKPVMFQM
        M++ H P +   DPEI RTF+ RR+ Q    + +  ++  +   NNG   +V       D++D+ I++YA P    L  GI+ P I+A  FELKPVMFQM
Subjt:  MNEEHDPELQPLDPEISRTFHCRRREQGGRAEVEGMVEPLLRIDNNGGTRVVE------DERDQIIQEYAVP---VLIPGIMPPEIEAQHFELKPVMFQM

Query:  LQHTGQFGGTTMEDPHLHLQSFMGVCKSFVIRGVSEDTLRLTLFPYSLRGKALEWLNTIPPGAITSWNEL-------------------------QQDDE
        LQ  GQF G   EDPHLHL+ FM +  SF  +GV ED LRL LFPYS+R +A  WLN++P G++T+WN+L                         QQDDE
Subjt:  LQHTGQFGGTTMEDPHLHLQSFMGVCKSFVIRGVSEDTLRLTLFPYSLRGKALEWLNTIPPGAITSWNEL-------------------------QQDDE

Query:  SLYETWERFKELLRKCTNHGIPYDIQMQTFYNGLNTHTRMAVDASAGGALLSKTYNDAYEILEKIANNNYQWPTSRLNPTRQVAGVQDA-SAANLEAQIA
        SLY+ WERFKELLRKC +HGI + IQM+TFYNGLN  T+M VDASA GALLSK+YN AYEILE IA  NYQW +SR    ++VAG+ D  S  +++AQ+A
Subjt:  SLYETWERFKELLRKCTNHGIPYDIQMQTFYNGLNTHTRMAVDASAGGALLSKTYNDAYEILEKIANNNYQWPTSRLNPTRQVAGVQDA-SAANLEAQIA

Query:  TLTNTLKSMNMGS--------GNQFGPPMSVSCVYCGEGHAFDVCPSNPASVNYVGYNKNFNPYSNTYNPGWRYHPNFAWNNQGI--GSSNQQMRPPVPL
        ++ + LK+++MG+         +Q     +VSCV+CGE H +D CPSNP SV Y+G      PYSNTYN  WR HPNF+W+NQG   G+S   ++   P 
Subjt:  TLTNTLKSMNMGS--------GNQFGPPMSVSCVYCGEGHAFDVCPSNPASVNYVGYNKNFNPYSNTYNPGWRYHPNFAWNNQGI--GSSNQQMRPPVPL

Query:  GFQQQGYPQQILQNPKAQPQESTSSLETLLKEHMAK-------TDAIVQSQAATIRNLENQMGQLASALNNRTQGTLPSDTEDPRGKGKETCKAIQLRSG
        GF QQ             PQ  ++SLE +LKE++ K       T+A+VQSQAA++RNLENQ+GQLA+ L NR  GTLPSDTE P+G G E CKA+ L+SG
Subjt:  GFQQQGYPQQILQNPKAQPQESTSSLETLLKEHMAK-------TDAIVQSQAATIRNLENQMGQLASALNNRTQGTLPSDTEDPRGKGKETCKAIQLRSG

Query:  KELEEPKSKALKVKEST--SIQENVEDNENQENDQNNAAALPQSQLAEEVNSRPPPPFPQQFQKKEQDKHFDKFLDILKGLYINIPLVEALKQMQNYAKF
        K L    + A K  +S   S  E + D++  END+    + P+S  +++ + +P PPFPQ+FQK++Q+  F KFLD+LK L+INIPLVEAL+QM NY KF
Subjt:  KELEEPKSKALKVKEST--SIQENVEDNENQENDQNNAAALPQSQLAEEVNSRPPPPFPQQFQKKEQDKHFDKFLDILKGLYINIPLVEALKQMQNYAKF

Query:  MKDILTKKRRLGEFEIVALTKDCSSILQSKLPPKLKDPGTFTIPFSIGETYNGRAL
        MKDILTKKRRLGEFE VALTK+CSS LQ KLP K+KDPG+FTIP +IG++Y G AL
Subjt:  MKDILTKKRRLGEFEIVALTKDCSSILQSKLPPKLKDPGTFTIPFSIGETYNGRAL

XP_021279280.1 LOW QUALITY PROTEIN: uncharacterized protein LOC110412945 [Herrania umbratica]4.2e-15847.05Show/hide
Query:  MNEEHDPELQPLDPEISRTFHCRRREQGGRAEVEGMVEPLLRIDNNGGTRVVE--DERDQIIQEYAVPV---LIPGIMPPEIEAQHFELKPVMFQMLQHT
        M   ++  L P DP+I RTF   RRE     +V  + + +   +NN G   +    E ++ +++Y VP+   L   I  P I A +FE+KP   QM+Q +
Subjt:  MNEEHDPELQPLDPEISRTFHCRRREQGGRAEVEGMVEPLLRIDNNGGTRVVE--DERDQIIQEYAVPV---LIPGIMPPEIEAQHFELKPVMFQMLQHT

Query:  GQFGGTTMEDPHLHLQSFMGVCKSFVIRGVSEDTLRLTLFPYSLRGKALEWLNTIPPGAITSWNELQQ-------------------------DDESLYE
         QF G   +DP+ HL +F+ +C +F   GV++D +RL LFP+SLR KA  WLN++P G+IT+W +L Q                         D ESLYE
Subjt:  GQFGGTTMEDPHLHLQSFMGVCKSFVIRGVSEDTLRLTLFPYSLRGKALEWLNTIPPGAITSWNELQQ-------------------------DDESLYE

Query:  TWERFKELLRKCTNHGIPYDIQMQTFYNGLNTHTRMAVDASAGGALLSKTYNDAYEILEKIANNNYQWPTSRLNPTRQVAGVQDASAANLEAQIATLTNT
         WERFKELLR+C +HGIP  +Q+QTFYNGL    +  +DA+AGGAL+SK   DAY +LE++A+NNYQWP+ R    + V   +  +   L  Q+A L+  
Subjt:  TWERFKELLRKCTNHGIPYDIQMQTFYNGLNTHTRMAVDASAGGALLSKTYNDAYEILEKIANNNYQWPTSRLNPTRQVAGVQDASAANLEAQIATLTNT

Query:  LKSMNMGSGNQFGPPMSVSCVYCGEGHAFDVCPSNPASVNYVG-YNK-NFNPYSNTYNPGWRYHPNFAWNNQGIGSSNQQMRPPVPLGFQQQGYPQQILQ
        L ++ + +         V C  CG+ H++D CP N  SV +VG +N+   NPYSNTYNPGWR HPNF+W+N   G SN   +P +P GFQQQ  P     
Subjt:  LKSMNMGSGNQFGPPMSVSCVYCGEGHAFDVCPSNPASVNYVG-YNK-NFNPYSNTYNPGWRYHPNFAWNNQGIGSSNQQMRPPVPLGFQQQGYPQQILQ

Query:  NPKAQPQESTSSLETLLKEHMAKTDAIVQSQAATIRNLENQMGQLASALNNRTQGTLPSDTEDPRGKGKETCKAIQLRSGKELEEPKSKALKVKESTSIQ
            Q  E  S LE LL ++++KTDAI+QSQ A++RNLE Q+GQLA+++NNR QG+LPSDT+    KGKE C+AI LRSGKE+E    KA++ +     +
Subjt:  NPKAQPQESTSSLETLLKEHMAKTDAIVQSQAATIRNLENQMGQLASALNNRTQGTLPSDTEDPRGKGKETCKAIQLRSGKELEEPKSKALKVKESTSIQ

Query:  ENVEDN--ENQENDQNNAAALPQSQLAEEVNSRPPPPFPQQFQKKEQDKHFDKFLDILKGLYINIPLVEALKQMQNYAKFMKDILTKKRRLGEFEIVALT
        E + +N  E Q+ D + A     SQ+       PPPPFPQ+ QK++ +K F KFL++ K L+INIP  EAL+QM +Y KF+KDIL+KKR+LGEFE V LT
Subjt:  ENVEDN--ENQENDQNNAAALPQSQLAEEVNSRPPPPFPQQFQKKEQDKHFDKFLDILKGLYINIPLVEALKQMQNYAKFMKDILTKKRRLGEFEIVALT

Query:  KDCSSILQSKLPPKLKDPGTFTIPFSIGETYNGRALCDLGVSINLMPMSVFKKLGIGEARPTIVSLQLADRSVVYPEG
        ++CS+ILQ+KLPPKLKDPG+FTIP +IG  +  +AL DLG SINLMP S+F+KLG+GE +PT V+LQLADRS VYP G
Subjt:  KDCSSILQSKLPPKLKDPGTFTIPFSIGETYNGRALCDLGVSINLMPMSVFKKLGIGEARPTIVSLQLADRSVVYPEG

XP_022929949.1 uncharacterized protein LOC111436411 [Cucurbita moschata]1.9e-15046.21Show/hide
Query:  LDPEISRTFHCRRR-------------EQGGRAEVE----GMVEPLLRIDNNGGTRVVEDERDQIIQEYAVPV---LIPGIMPPEIEAQHFELKPVMFQM
        LDPEI RTF  R +             E G +   E     M+    RI  N     + D+R++ I+ YA P    L P I+ PEI+   FELKPVMFQM
Subjt:  LDPEISRTFHCRRR-------------EQGGRAEVE----GMVEPLLRIDNNGGTRVVEDERDQIIQEYAVPV---LIPGIMPPEIEAQHFELKPVMFQM

Query:  LQHTGQFGGTTMEDPHLHLQSFMGV-------CKSFVIRGVSEDTLRLTLFPYSLRGKALEWLNTIPPGAITSWNEL-----------------------
        LQ  GQF G  +EDPHLHL+SF+GV         SF  +GV +D +RL+LFPY LR  A  WLNT+ PG I SWN L                       
Subjt:  LQHTGQFGGTTMEDPHLHLQSFMGV-------CKSFVIRGVSEDTLRLTLFPYSLRGKALEWLNTIPPGAITSWNEL-----------------------

Query:  --QQDDESLYETWERFKELLRKCTNHGIPYDIQMQTFYNGLNTHTRMAVDASAGGALLSKTYNDAYEILEKIANNNYQWPTSRLNPTRQVAGVQDASA-A
          Q +DE+L E  ERFKE+LRKC +HG+P+ IQM+TFYNGLN  T+  VDASA GA+LSKTYN+AYEILE+IA+NN QW   R NP R+  GV +  A +
Subjt:  --QQDDESLYETWERFKELLRKCTNHGIPYDIQMQTFYNGLNTHTRMAVDASAGGALLSKTYNDAYEILEKIANNNYQWPTSRLNPTRQVAGVQDASA-A

Query:  NLEAQIATLTNTLKSMNMGSGNQFGPPM----------SVSCVYCGEGHAFDVCPSNPASVNYVGY-----NKNFNPYSNTYNPGWRYHPNFAWNNQGIG
        ++ AQ+A++TN L+++ +G  +    P+          + SCVYCGE H FD CPSNPAS+ YVG      N   NP+SNTYNPGWR HPNF+W  Q + 
Subjt:  NLEAQIATLTNTLKSMNMGSGNQFGPPM----------SVSCVYCGEGHAFDVCPSNPASVNYVGY-----NKNFNPYSNTYNPGWRYHPNFAWNNQGIG

Query:  SSNQQMRPPV--PLGFQQQGY----PQQILQNPKAQPQ---ESTSSLETLLKEHMAKTDAIVQSQAATIRNLENQMGQLASALNNRTQGTLPS-DTEDPR
          NQQM P    P GF+ Q       QQ+    K   Q    S +S+E+L+KE+MAK DA++QSQ A++RNLE Q+G       N  QG   S +T D +
Subjt:  SSNQQMRPPV--PLGFQQQGY----PQQILQNPKAQPQ---ESTSSLETLLKEHMAKTDAIVQSQAATIRNLENQMGQLASALNNRTQGTLPS-DTEDPR

Query:  GKGKETCKAIQLRSGKELEEPKSKALKVKESTSIQENVEDNENQENDQNNAAALPQSQLAEEVNSRPPPPFPQQFQKKEQDKHFDKFLDILKGLYINIPL
         + +E   A+Q    K+  E       V+E   +Q      +                  E     P PPFPQ+ ++K+++ HF+KF+DILK ++INIPL
Subjt:  GKGKETCKAIQLRSGKELEEPKSKALKVKESTSIQENVEDNENQENDQNNAAALPQSQLAEEVNSRPPPPFPQQFQKKEQDKHFDKFLDILKGLYINIPL

Query:  VEALKQMQNYAKFMKDILTKKRRLGEFEIVALTKDCSSILQSKLPPKLKDPGTFTIPFSIGETYNGRALCDLGVSINLMPMSVFKKLGIGEARPTIVSLQ
        VEALKQM NY KF+KD+L  +R+  EF++V+L ++CS+IL++K+P K KDPG+FTIP SIG    GRALCDLG +INLMP+S++KKLGIGEARPT V+LQ
Subjt:  VEALKQMQNYAKFMKDILTKKRRLGEFEIVALTKDCSSILQSKLPPKLKDPGTFTIPFSIGETYNGRALCDLGVSINLMPMSVFKKLGIGEARPTIVSLQ

Query:  LADRSVVYPEGR
        LADRS+ YPEG+
Subjt:  LADRSVVYPEGR

XP_024021730.1 uncharacterized protein LOC112091702 [Morus notabilis]1.9e-14754Show/hide
Query:  MFQMLQHTGQFGGTTMEDPHLHLQSFMGVCKSFVIRGVSEDTLRLTLFPYSLRGKALEWLNTIPPGAITSWNEL-------------------------Q
        MFQMLQ  GQF G   EDPHLHL  FM V  +F I GV+ED LRL LFPYSLR +A  WLN++PP ++ +W++L                         Q
Subjt:  MFQMLQHTGQFGGTTMEDPHLHLQSFMGVCKSFVIRGVSEDTLRLTLFPYSLRGKALEWLNTIPPGAITSWNEL-------------------------Q

Query:  QDDESLYETWERFKELLRKCTNHGIPYDIQMQTFYNGLNTHTRMAVDASAGGALLSKTYNDAYEILEKIANNNYQWPTSRLNPTRQVAGVQDASAAN-LE
         D ESLYE WERFKELL KC  HGIP+ IQ++TFYNGLN  TR  VDA+A G +L+K+YN+AYEILE++ N+NYQWPT RLN  R+ +G+ +  A     
Subjt:  QDDESLYETWERFKELLRKCTNHGIPYDIQMQTFYNGLNTHTRMAVDASAGGALLSKTYNDAYEILEKIANNNYQWPTSRLNPTRQVAGVQDASAAN-LE

Query:  AQIATLTNTLKSMNMGSGNQFGPPMSVSCVYCGEGHAFDVCPSNPASVNYV-GYNKNFNPYSNTYNPGWRYHPNFAWNNQGIGSS--NQQMRPPVPLGFQ
        AQ+++LTN +K+++M +G         SCVYCGEGH ++ CPSNPAS  YV  +N++ NPYSN ++ G   HPN AW+NQG GSS  N   RP  P G+ 
Subjt:  AQIATLTNTLKSMNMGSGNQFGPPMSVSCVYCGEGHAFDVCPSNPASVNYV-GYNKNFNPYSNTYNPGWRYHPNFAWNNQGIGSS--NQQMRPPVPLGFQ

Query:  QQGYPQQILQNPKAQPQESTSSLETLLKEHMAKTDAIVQSQAATIRNLENQMGQLASALNNRTQGTLPSDTEDPRGKGKETCKAIQLRSGKELEEPKSKA
         Q   QQ +Q       + +SSLE+LLKE +A+++A +QS  AT+RNLENQM QLA++LNNR QG+LPS+TEDPR +GKE CKAI LRSGKEL +P+++ 
Subjt:  QQGYPQQILQNPKAQPQESTSSLETLLKEHMAKTDAIVQSQAATIRNLENQMGQLASALNNRTQGTLPSDTEDPRGKGKETCKAIQLRSGKELEEPKSKA

Query:  LKVK-ESTSIQENVEDNENQE---------NDQNNAAALPQ-SQLAEEVNS---RPPPPFPQQFQKKEQDKHFDKFLDILKGLYINIPLVEALKQMQNYA
          V+ E TSIQE  +  + QE         +D  NAA+ PQ S +A+  NS   RPPPPFP++FQ+++QD  F KFLD+LK L+INIPLVEAL++M NY 
Subjt:  LKVK-ESTSIQENVEDNENQE---------NDQNNAAALPQ-SQLAEEVNS---RPPPPFPQQFQKKEQDKHFDKFLDILKGLYINIPLVEALKQMQNYA

Query:  KFMKDILTKKRRLGEFEIVALTKDCSSILQSKLPPKLKDPGTFTIPFSIG
        KF+K+++TKK R GEFE+VALT++CS+ILQ+KLPPKLKDPG+FTIP SIG
Subjt:  KFMKDILTKKRRLGEFEIVALTKDCSSILQSKLPPKLKDPGTFTIPFSIG

XP_030505184.1 uncharacterized protein LOC115720166 [Cannabis sativa]6.3e-15449.3Show/hide
Query:  VVEDERDQIIQEYAVPV---LIPGIMPPEIEAQHFELKPVMFQMLQHTGQFGGTTMEDPHLHLQSFMGVCKSFVIRGVSEDTLRLTLFPYSLRGKALEWL
        ++ D+R + I+EYA P+   L PGI+ PEI+A  FELKPVMFQMLQ  GQF     EDPHLHL+SF+ +  SF I+GVSE+  RL LFP+SLR +A  WL
Subjt:  VVEDERDQIIQEYAVPV---LIPGIMPPEIEAQHFELKPVMFQMLQHTGQFGGTTMEDPHLHLQSFMGVCKSFVIRGVSEDTLRLTLFPYSLRGKALEWL

Query:  NTIPPGAITSWNEL-------------------------QQDDESLYETWERFKELLRKCTNHGIPYDIQMQTFYNGLNTHTRMAVDASAGGALLSKTYN
        NT+ P ++T+WN+                          Q +DES  + WERFKELLRKC +HGIP+ IQM+TFYNGLN  ++M +DASA GA+LSK+YN
Subjt:  NTIPPGAITSWNEL-------------------------QQDDESLYETWERFKELLRKCTNHGIPYDIQMQTFYNGLNTHTRMAVDASAGGALLSKTYN

Query:  DAYEILEKIANNNYQWPTSRLNPTRQVAGVQDASAAN-LEAQIATLTNTLKSMNMGSGNQFGPPMS-----VSCVYCGEGHAFDVCPSNPASVNYVG---
        +A+EILE IA+NNYQW  +R   +R+VAGV +  A   L  Q+A++TN LK++++G+     P  +     VSCV+C EGHAF+ CPSNP SV Y+G   
Subjt:  DAYEILEKIANNNYQWPTSRLNPTRQVAGVQDASAAN-LEAQIATLTNTLKSMNMGSGNQFGPPMS-----VSCVYCGEGHAFDVCPSNPASVNYVG---

Query:  YNKNFNPYSNTYNPGWRYHPNFAWNNQGIGSSNQQMRPPVPLGFQQQGYPQQILQNPKAQPQESTSSLETLLKEHMAKTDAIVQSQAATIRNLENQMGQL
        +N+N   +SN+YN  W+ HPN +W ++       Q R   P GF QQ      L++P+       SSLE+L++++MAK DA++QSQAA +RNLE Q+G L
Subjt:  YNKNFNPYSNTYNPGWRYHPNFAWNNQGIGSSNQQMRPPVPLGFQQQGYPQQILQNPKAQPQESTSSLETLLKEHMAKTDAIVQSQAATIRNLENQMGQL

Query:  ASALNNRTQGTLPSDTEDPRGKGKETCKAIQLRSGKELEEPKSKALKVKESTSIQ--ENVEDNENQE---NDQNNAAALPQSQLAEEV----NSRPPPPF
        A+ L  R QG+LPSDTE+PR  GKE CK+I LRSGK L+  + +     E TSIQ  E +     QE     + + A   QS   +      +S+PP PF
Subjt:  ASALNNRTQGTLPSDTEDPRGKGKETCKAIQLRSGKELEEPKSKALKVKESTSIQ--ENVEDNENQE---NDQNNAAALPQSQLAEEV----NSRPPPPF

Query:  PQQFQKKEQDKHFDKFLDILKGLYINIPLVEALKQMQNYAKFMKDILTKKRRLGEFEIVALTKDCSSILQSKLPPKLKDPGTFTIPFSIGETYNGRALCD
        PQ+FQK++QD  F KFLD+LK L+INIPLVEAL+QM NY KF+KDILTKKRRLGEFE   LT+   ++L++K+PPKLKDPG+FTIP SIG          
Subjt:  PQQFQKKEQDKHFDKFLDILKGLYINIPLVEALKQMQNYAKFMKDILTKKRRLGEFEIVALTKDCSSILQSKLPPKLKDPGTFTIPFSIGETYNGRALCD

Query:  LGVSINLMPMSVFKKLGIGEARPTIVSLQLADRSVVYPEGR
                     + LGIGEARPT V+LQLADRS+ +P+G+
Subjt:  LGVSINLMPMSVFKKLGIGEARPTIVSLQLADRSVVYPEGR

TrEMBL top hitse value%identityAlignment
A0A6A2WLX1 Reverse transcriptase8.3e-12843.47Show/hide
Query:  LIPGIMPPEIEAQHFELKPVMFQMLQHTGQFGGTTMEDPHLHLQSFMGVCKSFVIRGVSEDTLRLTLFPYSLRGKALEWLNTIPPGAITSWNEL------
        L PGI+ PEI+A HFE+KPVMF ML   GQFGG   ED   H+++F+ VC SF   GV ED L+L LFPYSLR +A  WL+ +P G++ SW +L      
Subjt:  LIPGIMPPEIEAQHFELKPVMFQMLQHTGQFGGTTMEDPHLHLQSFMGVCKSFVIRGVSEDTLRLTLFPYSLRGKALEWLNTIPPGAITSWNEL------

Query:  -------------------QQDDESLYETWERFKELLRKCTNHGIPYDIQMQTFYNGLNTHTRMAVDASAGGALLSKTYNDAYEILEKIANNNYQWPTSR
                           Q DDES+YE W+R+K LL+KC+ HG     Q+  FYNG+N  TRM +DASA G LL K+  +A+ IL++IANN+YQ+P+SR
Subjt:  -------------------QQDDESLYETWERFKELLRKCTNHGIPYDIQMQTFYNGLNTHTRMAVDASAGGALLSKTYNDAYEILEKIANNNYQWPTSR

Query:  LNPTRQVAGVQDASAA-NLEAQIATLTNTLKSMNMGSGNQFGPPMSVSCVYCGEGHAFDVCPSNPASVNYVG-YNK-NFNPYSNTYNPGWRYHPNFAWNN
        L   R+  G  +  A  ++ AQ++ +TN LK++   +  +     S++C+ C   H    CP+N  S+N+VG YN+ + NPYSNTYN GWR HPNF+W N
Subjt:  LNPTRQVAGVQDASAA-NLEAQIATLTNTLKSMNMGSGNQFGPPMSVSCVYCGEGHAFDVCPSNPASVNYVG-YNK-NFNPYSNTYNPGWRYHPNFAWNN

Query:  QGIGSSNQ---QMRPPVPLGFQ---------QQGYPQQILQNPKAQPQESTSSLETLLKEH----------MAKTDAIVQSQAATIRNLENQMGQLASAL
        QG  ++NQ   Q     P G+Q         ++      + + +A  QE  S+ +T+L++H          +    A++QS ++++R LE Q+GQ+A+AL
Subjt:  QGIGSSNQ---QMRPPVPLGFQ---------QQGYPQQILQNPKAQPQESTSSLETLLKEH----------MAKTDAIVQSQAATIRNLENQMGQLASAL

Query:  NNRTQGTLPSDTEDPRGKGKETCKAIQLRSGKELEEPKSKALKVKESTSIQENVEDNENQENDQNNAAALPQSQLAEEVNSRPPPPFPQQFQKKEQDKHF
          R QG LPSDTE  +G GKE C  + LRSG ++                +++ E++  +  D +  A + ++ +     +RPPPPFPQ+ +K   +  F
Subjt:  NNRTQGTLPSDTEDPRGKGKETCKAIQLRSGKELEEPKSKALKVKESTSIQENVEDNENQENDQNNAAALPQSQLAEEVNSRPPPPFPQQFQKKEQDKHF

Query:  DKFLDILKGLYINIPLVEALKQMQNYAKFMKDILTKKRRLGEFEIVALTKDCSSILQSKLPPKLKDPGTFTIPFSIGETYNGRALCDLGVSINLMPMSVF
         KF+DIL  L+INIPL+EA++QM  YAKFMKDI TKKR++    +   T+ CSS   SKL PK  DPG+F IP SIG  + G+ALCDLG S+NL+P S+F
Subjt:  DKFLDILKGLYINIPLVEALKQMQNYAKFMKDILTKKRRLGEFEIVALTKDCSSILQSKLPPKLKDPGTFTIPFSIGETYNGRALCDLGVSINLMPMSVF

Query:  KKLGIGEARPTIVSLQLADRSVVYPEGR
         KLGIG+ARPT V LQLAD+S V  EGR
Subjt:  KKLGIGEARPTIVSLQLADRSVVYPEGR

A0A6J0ZX64 LOW QUALITY PROTEIN: uncharacterized protein LOC1104129452.0e-15847.05Show/hide
Query:  MNEEHDPELQPLDPEISRTFHCRRREQGGRAEVEGMVEPLLRIDNNGGTRVVE--DERDQIIQEYAVPV---LIPGIMPPEIEAQHFELKPVMFQMLQHT
        M   ++  L P DP+I RTF   RRE     +V  + + +   +NN G   +    E ++ +++Y VP+   L   I  P I A +FE+KP   QM+Q +
Subjt:  MNEEHDPELQPLDPEISRTFHCRRREQGGRAEVEGMVEPLLRIDNNGGTRVVE--DERDQIIQEYAVPV---LIPGIMPPEIEAQHFELKPVMFQMLQHT

Query:  GQFGGTTMEDPHLHLQSFMGVCKSFVIRGVSEDTLRLTLFPYSLRGKALEWLNTIPPGAITSWNELQQ-------------------------DDESLYE
         QF G   +DP+ HL +F+ +C +F   GV++D +RL LFP+SLR KA  WLN++P G+IT+W +L Q                         D ESLYE
Subjt:  GQFGGTTMEDPHLHLQSFMGVCKSFVIRGVSEDTLRLTLFPYSLRGKALEWLNTIPPGAITSWNELQQ-------------------------DDESLYE

Query:  TWERFKELLRKCTNHGIPYDIQMQTFYNGLNTHTRMAVDASAGGALLSKTYNDAYEILEKIANNNYQWPTSRLNPTRQVAGVQDASAANLEAQIATLTNT
         WERFKELLR+C +HGIP  +Q+QTFYNGL    +  +DA+AGGAL+SK   DAY +LE++A+NNYQWP+ R    + V   +  +   L  Q+A L+  
Subjt:  TWERFKELLRKCTNHGIPYDIQMQTFYNGLNTHTRMAVDASAGGALLSKTYNDAYEILEKIANNNYQWPTSRLNPTRQVAGVQDASAANLEAQIATLTNT

Query:  LKSMNMGSGNQFGPPMSVSCVYCGEGHAFDVCPSNPASVNYVG-YNK-NFNPYSNTYNPGWRYHPNFAWNNQGIGSSNQQMRPPVPLGFQQQGYPQQILQ
        L ++ + +         V C  CG+ H++D CP N  SV +VG +N+   NPYSNTYNPGWR HPNF+W+N   G SN   +P +P GFQQQ  P     
Subjt:  LKSMNMGSGNQFGPPMSVSCVYCGEGHAFDVCPSNPASVNYVG-YNK-NFNPYSNTYNPGWRYHPNFAWNNQGIGSSNQQMRPPVPLGFQQQGYPQQILQ

Query:  NPKAQPQESTSSLETLLKEHMAKTDAIVQSQAATIRNLENQMGQLASALNNRTQGTLPSDTEDPRGKGKETCKAIQLRSGKELEEPKSKALKVKESTSIQ
            Q  E  S LE LL ++++KTDAI+QSQ A++RNLE Q+GQLA+++NNR QG+LPSDT+    KGKE C+AI LRSGKE+E    KA++ +     +
Subjt:  NPKAQPQESTSSLETLLKEHMAKTDAIVQSQAATIRNLENQMGQLASALNNRTQGTLPSDTEDPRGKGKETCKAIQLRSGKELEEPKSKALKVKESTSIQ

Query:  ENVEDN--ENQENDQNNAAALPQSQLAEEVNSRPPPPFPQQFQKKEQDKHFDKFLDILKGLYINIPLVEALKQMQNYAKFMKDILTKKRRLGEFEIVALT
        E + +N  E Q+ D + A     SQ+       PPPPFPQ+ QK++ +K F KFL++ K L+INIP  EAL+QM +Y KF+KDIL+KKR+LGEFE V LT
Subjt:  ENVEDN--ENQENDQNNAAALPQSQLAEEVNSRPPPPFPQQFQKKEQDKHFDKFLDILKGLYINIPLVEALKQMQNYAKFMKDILTKKRRLGEFEIVALT

Query:  KDCSSILQSKLPPKLKDPGTFTIPFSIGETYNGRALCDLGVSINLMPMSVFKKLGIGEARPTIVSLQLADRSVVYPEG
        ++CS+ILQ+KLPPKLKDPG+FTIP +IG  +  +AL DLG SINLMP S+F+KLG+GE +PT V+LQLADRS VYP G
Subjt:  KDCSSILQSKLPPKLKDPGTFTIPFSIGETYNGRALCDLGVSINLMPMSVFKKLGIGEARPTIVSLQLADRSVVYPEG

A0A6J1DY39 uncharacterized protein LOC1110256536.4e-12841.16Show/hide
Query:  VEDERDQIIQEYAVPVL------IPGIMPPEIEAQHFELKPVMFQMLQHTGQFGGTTMEDPHLHLQSFMGVCKSFVIRGVSEDTLRLTLFPYSLRGKALE
        V D+RD+ +++YA  +L      +    P + +   FE KP+M QML + GQFGG   EDP  HL+SF+ V  +F + G+S+D LRLTLFP+S+ G+A  
Subjt:  VEDERDQIIQEYAVPVL------IPGIMPPEIEAQHFELKPVMFQMLQHTGQFGGTTMEDPHLHLQSFMGVCKSFVIRGVSEDTLRLTLFPYSLRGKALE

Query:  WLNTIPPGAITSWNEL-------------------------QQDDESLYETWERFKELLRKCTNHGIPYDIQMQTFYNGLNTHTRMAVDASAGGALLSKT
        WLN  P   IT+W+++                         Q+++E++   WERFK+L+  C N GIP  +Q++ F+ G +  T+M ++ +A G   SK+
Subjt:  WLNTIPPGAITSWNEL-------------------------QQDDESLYETWERFKELLRKCTNHGIPYDIQMQTFYNGLNTHTRMAVDASAGGALLSKT

Query:  YNDAYEILEKIANNNYQW--PTSRLNPTR-QVAGVQDA-SAANLEAQIATLTNTLKSMNMGSGNQFGPPMSV-----------SCVYCGEGHAFDVCPSN
        +N+  EIL++++ +NYQW    SR    R   AGV    +  +++ QI T+T  LK+M   +        +            +C YCG+ H  + CPSN
Subjt:  YNDAYEILEKIANNNYQW--PTSRLNPTR-QVAGVQDA-SAANLEAQIATLTNTLKSMNMGSGNQFGPPMSV-----------SCVYCGEGHAFDVCPSN

Query:  PASVNYVGY--NKNFNPYSNTYNPGWRYHPNFAWNNQGIGSS---NQQMR--------------PPVPLGF-QQQGY---PQQILQNPKAQPQESTSSLE
        P+S+ YVG    + FNPYSNTYNPGW+ HPNF+W+ QG  ++   NQQ +              PP P  + QQ+ Y    QQ L N +   +E  +  +
Subjt:  PASVNYVGY--NKNFNPYSNTYNPGWRYHPNFAWNNQGIGSS---NQQMR--------------PPVPLGF-QQQGY---PQQILQNPKAQPQESTSSLE

Query:  TLLKEHMAKTDAIVQSQA----------ATIRNLENQMGQLASALNNRTQGTLPSDTEDPRGKGKETCKAIQLRSGKELEEPKSKALKVKESTSIQENVE
          +KE M +TD  ++              T+R LE Q+GQL + +  R QG+LPS TE+PR  GKE C +I  RSG + E P     ++ + +S   + E
Subjt:  TLLKEHMAKTDAIVQSQA----------ATIRNLENQMGQLASALNNRTQGTLPSDTEDPRGKGKETCKAIQLRSGKELEEPKSKALKVKESTSIQENVE

Query:  DNENQENDQNNAAALPQSQLAEEVNSRPPPPFPQQFQKKEQDKHFDKFLDILKGLYINIPLVEALKQMQNYAKFMKDILTKKRRLGEFEIVALTKDCSSI
         +     D+    A+      +  NSRPPPPFPQ+  +K QD +F KFLDILK L+INIP VEAL+QM  YAKF+KDI+T+K++LGE+E VALT+  S++
Subjt:  DNENQENDQNNAAALPQSQLAEEVNSRPPPPFPQQFQKKEQDKHFDKFLDILKGLYINIPLVEALKQMQNYAKFMKDILTKKRRLGEFEIVALTKDCSSI

Query:  LQSKLPPKLKDPGTFTIPFSIGETYNGRALCDLGVSINLMPMSVFKKLGIGEARPTIVSLQLADRSVVYPEGR
         +SK+PPKLKDPG+FTIP  IG    GRALCDLG SINLMP+S+FKK  IG+A PT V+LQLADRS+  PEG+
Subjt:  LQSKLPPKLKDPGTFTIPFSIGETYNGRALCDLGVSINLMPMSVFKKLGIGEARPTIVSLQLADRSVVYPEGR

A0A6J1EQ90 uncharacterized protein LOC1114364119.1e-15146.21Show/hide
Query:  LDPEISRTFHCRRR-------------EQGGRAEVE----GMVEPLLRIDNNGGTRVVEDERDQIIQEYAVPV---LIPGIMPPEIEAQHFELKPVMFQM
        LDPEI RTF  R +             E G +   E     M+    RI  N     + D+R++ I+ YA P    L P I+ PEI+   FELKPVMFQM
Subjt:  LDPEISRTFHCRRR-------------EQGGRAEVE----GMVEPLLRIDNNGGTRVVEDERDQIIQEYAVPV---LIPGIMPPEIEAQHFELKPVMFQM

Query:  LQHTGQFGGTTMEDPHLHLQSFMGV-------CKSFVIRGVSEDTLRLTLFPYSLRGKALEWLNTIPPGAITSWNEL-----------------------
        LQ  GQF G  +EDPHLHL+SF+GV         SF  +GV +D +RL+LFPY LR  A  WLNT+ PG I SWN L                       
Subjt:  LQHTGQFGGTTMEDPHLHLQSFMGV-------CKSFVIRGVSEDTLRLTLFPYSLRGKALEWLNTIPPGAITSWNEL-----------------------

Query:  --QQDDESLYETWERFKELLRKCTNHGIPYDIQMQTFYNGLNTHTRMAVDASAGGALLSKTYNDAYEILEKIANNNYQWPTSRLNPTRQVAGVQDASA-A
          Q +DE+L E  ERFKE+LRKC +HG+P+ IQM+TFYNGLN  T+  VDASA GA+LSKTYN+AYEILE+IA+NN QW   R NP R+  GV +  A +
Subjt:  --QQDDESLYETWERFKELLRKCTNHGIPYDIQMQTFYNGLNTHTRMAVDASAGGALLSKTYNDAYEILEKIANNNYQWPTSRLNPTRQVAGVQDASA-A

Query:  NLEAQIATLTNTLKSMNMGSGNQFGPPM----------SVSCVYCGEGHAFDVCPSNPASVNYVGY-----NKNFNPYSNTYNPGWRYHPNFAWNNQGIG
        ++ AQ+A++TN L+++ +G  +    P+          + SCVYCGE H FD CPSNPAS+ YVG      N   NP+SNTYNPGWR HPNF+W  Q + 
Subjt:  NLEAQIATLTNTLKSMNMGSGNQFGPPM----------SVSCVYCGEGHAFDVCPSNPASVNYVGY-----NKNFNPYSNTYNPGWRYHPNFAWNNQGIG

Query:  SSNQQMRPPV--PLGFQQQGY----PQQILQNPKAQPQ---ESTSSLETLLKEHMAKTDAIVQSQAATIRNLENQMGQLASALNNRTQGTLPS-DTEDPR
          NQQM P    P GF+ Q       QQ+    K   Q    S +S+E+L+KE+MAK DA++QSQ A++RNLE Q+G       N  QG   S +T D +
Subjt:  SSNQQMRPPV--PLGFQQQGY----PQQILQNPKAQPQ---ESTSSLETLLKEHMAKTDAIVQSQAATIRNLENQMGQLASALNNRTQGTLPS-DTEDPR

Query:  GKGKETCKAIQLRSGKELEEPKSKALKVKESTSIQENVEDNENQENDQNNAAALPQSQLAEEVNSRPPPPFPQQFQKKEQDKHFDKFLDILKGLYINIPL
         + +E   A+Q    K+  E       V+E   +Q      +                  E     P PPFPQ+ ++K+++ HF+KF+DILK ++INIPL
Subjt:  GKGKETCKAIQLRSGKELEEPKSKALKVKESTSIQENVEDNENQENDQNNAAALPQSQLAEEVNSRPPPPFPQQFQKKEQDKHFDKFLDILKGLYINIPL

Query:  VEALKQMQNYAKFMKDILTKKRRLGEFEIVALTKDCSSILQSKLPPKLKDPGTFTIPFSIGETYNGRALCDLGVSINLMPMSVFKKLGIGEARPTIVSLQ
        VEALKQM NY KF+KD+L  +R+  EF++V+L ++CS+IL++K+P K KDPG+FTIP SIG    GRALCDLG +INLMP+S++KKLGIGEARPT V+LQ
Subjt:  VEALKQMQNYAKFMKDILTKKRRLGEFEIVALTKDCSSILQSKLPPKLKDPGTFTIPFSIGETYNGRALCDLGVSINLMPMSVFKKLGIGEARPTIVSLQ

Query:  LADRSVVYPEGR
        LADRS+ YPEG+
Subjt:  LADRSVVYPEGR

A0A6P8DD93 uncharacterized protein LOC1162064535.8e-12140.96Show/hide
Query:  MNEEHDPELQPLDPEISRTFHCRRREQGGRAEVE--GMVEPLLRIDNNGGTRVVEDERDQIIQEYAVPVLI-PGIMPPEIEAQHFELKPVMFQMLQHTGQ
        M      EL PLDPEI RT H  RRE   R E++   M +  +     G  R + D        YAVP ++   I  P I A +FELKP + QM+Q + Q
Subjt:  MNEEHDPELQPLDPEISRTFHCRRREQGGRAEVE--GMVEPLLRIDNNGGTRVVEDERDQIIQEYAVPVLI-PGIMPPEIEAQHFELKPVMFQMLQHTGQ

Query:  FGGTTMEDPHLHLQSFMGVCKSFVIRGVSEDTLRLTLFPYSLRGKALEWLNTIPPGAITSWNELQQ-------------------------DDESLYETW
        FGG   E P  H+  F+  C +  +  V++D +RL LFP+SLR KA  W N++P  +IT+W +L                           + ESLYE W
Subjt:  FGGTTMEDPHLHLQSFMGVCKSFVIRGVSEDTLRLTLFPYSLRGKALEWLNTIPPGAITSWNELQQ-------------------------DDESLYETW

Query:  ERFKELLRKCTNHGIPYDIQMQTFYNGLNTHTRMAVDASAGGALLSKTYNDAYEILEKIANNNYQWPTSRLNPTRQVAGVQDA-SAANLEAQIATLTNTL
        ERFKE +RKC +HG+P ++ ++ FY  L+   R  VDA+AGGAL+ K Y++A  ++E++A++ + W   R     +VA V D  + ANL  QI+ LT  +
Subjt:  ERFKELLRKCTNHGIPYDIQMQTFYNGLNTHTRMAVDASAGGALLSKTYNDAYEILEKIANNNYQWPTSRLNPTRQVAGVQDA-SAANLEAQIATLTNTL

Query:  KSMNMGSGNQFGPPMSVSCVYCGEGHAFDVCPS-NPAS------VNYVG--YNKNFNPYSNTYNPGWRYHPNFAWNNQGIGSSNQQMRPPVPLGFQQQGY
          +   S + F       C  C   H+   C S NP++      VN+V      N  PYSNTYNPGWR HPNF+W N+     N  ++PP   GFQ+QG 
Subjt:  KSMNMGSGNQFGPPMSVSCVYCGEGHAFDVCPS-NPAS------VNYVG--YNKNFNPYSNTYNPGWRYHPNFAWNNQGIGSSNQQMRPPVPLGFQQQGY

Query:  PQQILQNPKAQPQESTSSLETLLKEHMAKTDAIVQSQAATIRNLENQMGQLASALNNRTQGTLPSDTED-PRGKGKETCKAIQLRSGKELEEPKSKALKV
         Q       A PQ+S S +E L+  +M KTD ++Q+Q ATIRNLE Q+ Q++  L+NR  G+LPS+TE+ P+G       AI LRSGKELE    KA   
Subjt:  PQQILQNPKAQPQESTSSLETLLKEHMAKTDAIVQSQAATIRNLENQMGQLASALNNRTQGTLPSDTED-PRGKGKETCKAIQLRSGKELEEPKSKALKV

Query:  KESTSIQENVEDNENQENDQNNAAALPQSQLAEEVNSRPPPPFPQQFQKKEQDKHFDKFLDILKGLYINIPLVEALKQMQNYAKFMKDILTKKRRLGEFE
        +ES       +D   Q+ ++    +L            PP PFP++ ++++ D  F KFLD+ K L INIP  EAL+QM +YA+FMKD+LTKKR+    E
Subjt:  KESTSIQENVEDNENQENDQNNAAALPQSQLAEEVNSRPPPPFPQQFQKKEQDKHFDKFLDILKGLYINIPLVEALKQMQNYAKFMKDILTKKRRLGEFE

Query:  IVALTKDCSSILQS---KLPPKLKDPGTFTIPFSIGETYNGRALCDLGVSINLMPMSVFKKLGIGEARPTIVSLQLADRSVVYPEG
         V LT +CS ILQ     LP K +D G+FT+P +IG  +    L D G SINLMP+S+F+KLG+GE + T V+LQLADRS+ YP+G
Subjt:  IVALTKDCSSILQS---KLPPKLKDPGTFTIPFSIGETYNGRALCDLGVSINLMPMSVFKKLGIGEARPTIVSLQLADRSVVYPEG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACGAGGAGCACGATCCTGAATTGCAACCTCTTGATCCTGAGATCTCACGTACATTTCACTGTCGAAGAAGGGAACAGGGAGGACGAGCAGAAGTTGAAGGTATGGT
TGAACCACTGCTGAGAATTGATAATAATGGTGGTACTAGAGTAGTGGAAGATGAGAGAGATCAGATTATTCAGGAATATGCAGTGCCAGTTCTGATTCCAGGGATTATGC
CACCAGAAATAGAGGCACAACATTTTGAATTGAAGCCGGTCATGTTTCAGATGCTCCAGCACACAGGCCAGTTTGGGGGTACGACGATGGAGGATCCTCACTTACACCTC
CAGTCATTCATGGGAGTCTGCAAATCTTTTGTAATCAGAGGAGTCAGTGAGGATACTCTCAGGTTGACGCTCTTTCCTTACTCACTAAGAGGAAAAGCGTTAGAATGGCT
GAATACCATACCACCTGGAGCTATCACGAGTTGGAATGAGCTGCAACAAGATGATGAGTCTCTGTATGAAACTTGGGAGAGGTTCAAAGAGTTGCTTCGTAAATGTACGA
ACCATGGTATTCCCTACGACATCCAGATGCAGACTTTCTATAATGGGTTGAATACCCATACGAGGATGGCAGTGGATGCCTCAGCTGGAGGAGCATTGTTGTCGAAGACA
TACAATGATGCTTATGAGATTCTGGAGAAGATTGCAAATAATAACTATCAGTGGCCTACCTCACGCCTTAATCCGACTCGCCAGGTAGCAGGGGTGCAAGATGCAAGTGC
AGCGAATCTGGAAGCACAAATTGCCACCCTTACCAATACACTGAAATCAATGAACATGGGTTCAGGGAATCAGTTTGGGCCACCAATGTCAGTATCGTGTGTTTATTGTG
GGGAGGGCCACGCTTTTGATGTTTGCCCATCCAATCCGGCTTCAGTAAACTACGTGGGGTACAATAAGAATTTCAACCCATATTCGAATACATACAACCCGGGTTGGAGG
TATCATCCAAACTTCGCATGGAATAATCAAGGAATTGGGTCTTCAAATCAACAAATGAGACCTCCAGTACCCCTAGGATTTCAACAACAAGGATATCCTCAGCAAATTTT
GCAAAATCCTAAGGCACAACCTCAAGAGTCTACATCCTCACTTGAGACTCTGTTGAAAGAGCACATGGCTAAAACTGATGCCATAGTGCAGAGCCAAGCTGCAACAATTA
GAAACCTAGAGAATCAAATGGGTCAACTAGCCTCAGCCTTAAACAATAGAACCCAAGGTACCTTGCCTAGTGATACTGAGGATCCAAGAGGCAAAGGTAAGGAAACATGT
AAAGCAATCCAACTTCGAAGTGGGAAGGAGTTGGAAGAGCCGAAAAGCAAGGCTCTTAAGGTTAAAGAGTCCACTTCAATCCAAGAGAATGTTGAAGACAATGAAAATCA
AGAGAATGATCAAAACAACGCTGCAGCATTGCCACAATCACAGCTAGCTGAGGAAGTGAATTCGAGACCACCACCACCTTTTCCTCAACAATTTCAAAAGAAGGAGCAAG
ACAAACACTTTGATAAATTTCTTGATATTTTAAAGGGTCTGTACATTAATATTCCACTTGTTGAAGCCTTAAAGCAAATGCAAAACTATGCCAAGTTCATGAAAGATATT
TTGACCAAAAAACGAAGACTTGGGGAATTTGAAATAGTGGCTCTTACAAAAGATTGTAGCTCAATCCTTCAAAGTAAGCTTCCTCCTAAATTAAAGGATCCTGGTACTTT
CACTATCCCTTTCTCTATTGGTGAAACCTATAATGGAAGGGCTTTATGTGATTTGGGAGTAAGCATAAACTTGATGCCTATGTCTGTGTTTAAAAAGCTTGGTATTGGAG
AGGCAAGACCCACCATAGTCTCCTTACAGTTGGCAGATCGTTCAGTGGTTTATCCAGAGGGAAGATAG
mRNA sequenceShow/hide mRNA sequence
ATGAACGAGGAGCACGATCCTGAATTGCAACCTCTTGATCCTGAGATCTCACGTACATTTCACTGTCGAAGAAGGGAACAGGGAGGACGAGCAGAAGTTGAAGGTATGGT
TGAACCACTGCTGAGAATTGATAATAATGGTGGTACTAGAGTAGTGGAAGATGAGAGAGATCAGATTATTCAGGAATATGCAGTGCCAGTTCTGATTCCAGGGATTATGC
CACCAGAAATAGAGGCACAACATTTTGAATTGAAGCCGGTCATGTTTCAGATGCTCCAGCACACAGGCCAGTTTGGGGGTACGACGATGGAGGATCCTCACTTACACCTC
CAGTCATTCATGGGAGTCTGCAAATCTTTTGTAATCAGAGGAGTCAGTGAGGATACTCTCAGGTTGACGCTCTTTCCTTACTCACTAAGAGGAAAAGCGTTAGAATGGCT
GAATACCATACCACCTGGAGCTATCACGAGTTGGAATGAGCTGCAACAAGATGATGAGTCTCTGTATGAAACTTGGGAGAGGTTCAAAGAGTTGCTTCGTAAATGTACGA
ACCATGGTATTCCCTACGACATCCAGATGCAGACTTTCTATAATGGGTTGAATACCCATACGAGGATGGCAGTGGATGCCTCAGCTGGAGGAGCATTGTTGTCGAAGACA
TACAATGATGCTTATGAGATTCTGGAGAAGATTGCAAATAATAACTATCAGTGGCCTACCTCACGCCTTAATCCGACTCGCCAGGTAGCAGGGGTGCAAGATGCAAGTGC
AGCGAATCTGGAAGCACAAATTGCCACCCTTACCAATACACTGAAATCAATGAACATGGGTTCAGGGAATCAGTTTGGGCCACCAATGTCAGTATCGTGTGTTTATTGTG
GGGAGGGCCACGCTTTTGATGTTTGCCCATCCAATCCGGCTTCAGTAAACTACGTGGGGTACAATAAGAATTTCAACCCATATTCGAATACATACAACCCGGGTTGGAGG
TATCATCCAAACTTCGCATGGAATAATCAAGGAATTGGGTCTTCAAATCAACAAATGAGACCTCCAGTACCCCTAGGATTTCAACAACAAGGATATCCTCAGCAAATTTT
GCAAAATCCTAAGGCACAACCTCAAGAGTCTACATCCTCACTTGAGACTCTGTTGAAAGAGCACATGGCTAAAACTGATGCCATAGTGCAGAGCCAAGCTGCAACAATTA
GAAACCTAGAGAATCAAATGGGTCAACTAGCCTCAGCCTTAAACAATAGAACCCAAGGTACCTTGCCTAGTGATACTGAGGATCCAAGAGGCAAAGGTAAGGAAACATGT
AAAGCAATCCAACTTCGAAGTGGGAAGGAGTTGGAAGAGCCGAAAAGCAAGGCTCTTAAGGTTAAAGAGTCCACTTCAATCCAAGAGAATGTTGAAGACAATGAAAATCA
AGAGAATGATCAAAACAACGCTGCAGCATTGCCACAATCACAGCTAGCTGAGGAAGTGAATTCGAGACCACCACCACCTTTTCCTCAACAATTTCAAAAGAAGGAGCAAG
ACAAACACTTTGATAAATTTCTTGATATTTTAAAGGGTCTGTACATTAATATTCCACTTGTTGAAGCCTTAAAGCAAATGCAAAACTATGCCAAGTTCATGAAAGATATT
TTGACCAAAAAACGAAGACTTGGGGAATTTGAAATAGTGGCTCTTACAAAAGATTGTAGCTCAATCCTTCAAAGTAAGCTTCCTCCTAAATTAAAGGATCCTGGTACTTT
CACTATCCCTTTCTCTATTGGTGAAACCTATAATGGAAGGGCTTTATGTGATTTGGGAGTAAGCATAAACTTGATGCCTATGTCTGTGTTTAAAAAGCTTGGTATTGGAG
AGGCAAGACCCACCATAGTCTCCTTACAGTTGGCAGATCGTTCAGTGGTTTATCCAGAGGGAAGATAG
Protein sequenceShow/hide protein sequence
MNEEHDPELQPLDPEISRTFHCRRREQGGRAEVEGMVEPLLRIDNNGGTRVVEDERDQIIQEYAVPVLIPGIMPPEIEAQHFELKPVMFQMLQHTGQFGGTTMEDPHLHL
QSFMGVCKSFVIRGVSEDTLRLTLFPYSLRGKALEWLNTIPPGAITSWNELQQDDESLYETWERFKELLRKCTNHGIPYDIQMQTFYNGLNTHTRMAVDASAGGALLSKT
YNDAYEILEKIANNNYQWPTSRLNPTRQVAGVQDASAANLEAQIATLTNTLKSMNMGSGNQFGPPMSVSCVYCGEGHAFDVCPSNPASVNYVGYNKNFNPYSNTYNPGWR
YHPNFAWNNQGIGSSNQQMRPPVPLGFQQQGYPQQILQNPKAQPQESTSSLETLLKEHMAKTDAIVQSQAATIRNLENQMGQLASALNNRTQGTLPSDTEDPRGKGKETC
KAIQLRSGKELEEPKSKALKVKESTSIQENVEDNENQENDQNNAAALPQSQLAEEVNSRPPPPFPQQFQKKEQDKHFDKFLDILKGLYINIPLVEALKQMQNYAKFMKDI
LTKKRRLGEFEIVALTKDCSSILQSKLPPKLKDPGTFTIPFSIGETYNGRALCDLGVSINLMPMSVFKKLGIGEARPTIVSLQLADRSVVYPEGR