; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0019778 (gene) of Snake gourd v1 genome

Gene IDTan0019778
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionUnknown protein
Genome locationLG06:59431326..59435666
RNA-Seq ExpressionTan0019778
SyntenyTan0019778
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7025965.1 hypothetical protein SDJN02_12463, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0076.05Show/hide
Query:  MASKRSSIVHQPQALQFGFLHLPRKKPKKLPQPDELASKDGDKVSFYVAKDLRIKRVFSPNLENRSSVPSGGQTSDKEGSITANGTCPNEDSRVGKSSEA
        MASKRSSIVHQPQ LQ GFLHLPRKKPK LPQ DELASK GDK+S+Y AKDLRIKRVFSPNL+NRSSV S GQ SDK+G ITANG CPN DS VGKSS  
Subjt:  MASKRSSIVHQPQALQFGFLHLPRKKPKKLPQPDELASKDGDKVSFYVAKDLRIKRVFSPNLENRSSVPSGGQTSDKEGSITANGTCPNEDSRVGKSSEA

Query:  EELRNENFCNSNGYVKLSDDVPRCNGRSEELVHSTPPDAEVLVGGFVAASSNGCPQSSNGSVLGDACAKADCRIGSVTRTGSVLKPCSKRKLFKAPGSIA
         E+RNENFCNS+GYV+L D+  RCNG+  E VHSTPPDAEVL GG VAASSNGCP+SS+GSVLGD CAK DCRI SVTRTGSVLKPCSKRKLFKAPGSIA
Subjt:  EELRNENFCNSNGYVKLSDDVPRCNGRSEELVHSTPPDAEVLVGGFVAASSNGCPQSSNGSVLGDACAKADCRIGSVTRTGSVLKPCSKRKLFKAPGSIA

Query:  YKRLLPFLLDTDNYTLQGDSCSKRENNLEKKANIESNLCNPAKEPSFIDSDTRVKNAVLASGMTCKTLKLNLPPPDNGDAKIVQNGSVSKNDPTLVKEDS
        YKRLLPFLLD DNY LQGD CSKRE NLEKK NIESNLCN A E SFIDSDT V NAVLA G++  T+KLN  PPDNGDAK  QNGS S+NDPTLVKE+S
Subjt:  YKRLLPFLLDTDNYTLQGDSCSKRENNLEKKANIESNLCNPAKEPSFIDSDTRVKNAVLASGMTCKTLKLNLPPPDNGDAKIVQNGSVSKNDPTLVKEDS

Query:  GLKKDNAVCVSSLDKRLIENGGPTKHQIEDRSSNEQSKTLGMERLDGENSFISSNLSSEVDNLKSYVSEKLCNNGSEDIKRENHFNEELKMSSLDSNIAC
        GLK+D  V VSSLDK+L ENGGP++HQIEDR SNEQSKT  +ERLDG + FISSNLSSEVDN KS+VSEKLC+N SED K ENH  EE+K+SSLDSNI C
Subjt:  GLKKDNAVCVSSLDKRLIENGGPTKHQIEDRSSNEQSKTLGMERLDGENSFISSNLSSEVDNLKSYVSEKLCNNGSEDIKRENHFNEELKMSSLDSNIAC

Query:  SLVKEERKDEKVECTRGTDQNLGGSTVGENDCNISTESDKKYGSCVRNKVVRNPLVQLNSKYSQVSVSYRRMLPFLEDLFKDNPEICASRNINCLSLEKE
        +LVKEERK+EKV CTRGTDQNLG STVGENDCN +TESDKKYG CVRNKVVRNPLVQL SKYSQVSVS+R                 AS NI+    EKE
Subjt:  SLVKEERKDEKVECTRGTDQNLGGSTVGENDCNISTESDKKYGSCVRNKVVRNPLVQLNSKYSQVSVSYRRMLPFLEDLFKDNPEICASRNINCLSLEKE

Query:  LPTINLQTPSSNSHNSQDKSEGIVSCNIPCDGDLDTLSMPGLNIMNETVCETERVLMPTGVDDKLLSSPKLQMHQLHSEQEMLDKCKLKMDPQLYGPILL
        LPT+NLQ+PSSNSHNS D+SEG+ SCN+PCDG LDT SMPG N MNE VCETE+VL+  G++D+LLSSPKLQMH LHSEQEMLD   LK+DPQL+     
Subjt:  LPTINLQTPSSNSHNSQDKSEGIVSCNIPCDGDLDTLSMPGLNIMNETVCETERVLMPTGVDDKLLSSPKLQMHQLHSEQEMLDKCKLKMDPQLYGPILL

Query:  NDQAVSSLYPAASYEPLIGEGSRMISEQLPITSEGCTSLTDNVSDGARRSERNSLEPNFSCVEAGVLPESRINVRRGILKRNPRGCRGICNCLNCSSFRL
         DQAV S     SY+PL G+GSRM+S+Q PITSEGCT+LTDNVSD A+ SERNSLEPN  CVE G +P  RINV +GI K+NPRGCRGICNCLNCSSFRL
Subjt:  NDQAVSSLYPAASYEPLIGEGSRMISEQLPITSEGCTSLTDNVSDGARRSERNSLEPNFSCVEAGVLPESRINVRRGILKRNPRGCRGICNCLNCSSFRL

Query:  HAERAFEFSRNQLQDAEEVASDLMKELSSLRDVLEKCSDGAKGDAGYHSNKVKEACRKASEAELVAKDRLLQMNYELDIHSRITCSQRPNVRFSCQVEKR
        HAERAFEFSRNQLQDAE VASDLMKELS +RDVLEKCSDGA GDAGY+SNKVKEACRKASEAELVAKDRLLQMN +LDI SRI C QRPNVRFS +++KR
Subjt:  HAERAFEFSRNQLQDAEEVASDLMKELSSLRDVLEKCSDGAKGDAGYHSNKVKEACRKASEAELVAKDRLLQMNYELDIHSRITCSQRPNVRFSCQVEKR

Query:  EIEDGK
        +IE GK
Subjt:  EIEDGK

XP_022964022.1 uncharacterized protein LOC111464172 isoform X1 [Cucurbita moschata]0.0e+0076.8Show/hide
Query:  MASKRSSIVHQPQALQFGFLHLPRKKPKKLPQPDELASKDGDKVSFYVAKDLRIKRVFSPNLENRSSVPSGGQTSDKEGSITANGTCPNEDSRVGKSSEA
        MASKRSSIVHQPQ LQ GFLHLPRKKPK LP  DELASK GD + +Y AKDLRIKRVFSPN +NRSS+PS GQ SD+E  IT NGTCPN DS VGKSS  
Subjt:  MASKRSSIVHQPQALQFGFLHLPRKKPKKLPQPDELASKDGDKVSFYVAKDLRIKRVFSPNLENRSSVPSGGQTSDKEGSITANGTCPNEDSRVGKSSEA

Query:  EELRNENFCNSNGYVKLSDDVPRCNGRSEELVHSTPPDAEVLVGGFVAASSNGCPQSSNGSVLGDACAKADCRIGSVTRTGSVLKPCSKRKLFKAPGSIA
         E+RNENFCNS+GYV+L D+  RC+G+  E VHSTPPDAEV+ GG VAASSNGCP+SSNGSVLGD CAKADCRI +VTRTGSVLKPCSKRKLFKAPGSIA
Subjt:  EELRNENFCNSNGYVKLSDDVPRCNGRSEELVHSTPPDAEVLVGGFVAASSNGCPQSSNGSVLGDACAKADCRIGSVTRTGSVLKPCSKRKLFKAPGSIA

Query:  YKRLLPFLLDTDNYTLQGDSCSKRENNLEKKANIESNLCNPAKEPSFIDSDTRVKNAVLASGMTCKTLKLNLPPPDNGDAKIVQNGSVSKNDPTLVKEDS
        YKRLLPFL D DNY L+GD CSKRE  LEKK NIESNLCN A E SFIDSDT V NAVLA G++C T+KLN  PP NGDAK  QNGS S+NDPTLVKE+S
Subjt:  YKRLLPFLLDTDNYTLQGDSCSKRENNLEKKANIESNLCNPAKEPSFIDSDTRVKNAVLASGMTCKTLKLNLPPPDNGDAKIVQNGSVSKNDPTLVKEDS

Query:  GLKKDNAVCVSSLDKRLIENGGPTKHQIEDRSSNEQSKTLGMERLDGENSFISSNLSSEVDNLKSYVSEKLCNNGSEDIKRENHFNEELKMSSLDSNIAC
        GLK+D  V VSSLDK+L ENGGP++HQIEDR SNEQSKT  +ERLDG + FISSNLSSEVDN KS+VSEKLCNN SEDIK ENH   E+++SSLDSNIAC
Subjt:  GLKKDNAVCVSSLDKRLIENGGPTKHQIEDRSSNEQSKTLGMERLDGENSFISSNLSSEVDNLKSYVSEKLCNNGSEDIKRENHFNEELKMSSLDSNIAC

Query:  SLVKEERKDEKVECTRGTDQNLGGSTVGENDCNISTESDKKYGSCVRNKVVRNPLVQLNSKYSQVSVSYRRMLPFLEDLFKDNPEICASRNINCLSLEKE
        +LVKEERK+EKV CTRGTDQNLG STVGEND N +TESDKKYG CVRNKVVRNPLVQL SKYSQVSVS+RRMLPFL DLFKDNPE CAS NI+    EKE
Subjt:  SLVKEERKDEKVECTRGTDQNLGGSTVGENDCNISTESDKKYGSCVRNKVVRNPLVQLNSKYSQVSVSYRRMLPFLEDLFKDNPEICASRNINCLSLEKE

Query:  LPTINLQTPSSNSHNSQDKSEGIVSCNIPCDGDLDTLSMPGLNIMNETVCETERVLMPTGVDDKLLSSPKLQMHQLHSEQEMLDKCKLKMDPQLYGPILL
        LPT+NLQ+PSSNSHNS D+SEG+ SCN+PCDG LDT SMPG N MNE VCETE+VL+  G++D+LLSSPKLQMH LHSEQEMLD C LK+DPQL+     
Subjt:  LPTINLQTPSSNSHNSQDKSEGIVSCNIPCDGDLDTLSMPGLNIMNETVCETERVLMPTGVDDKLLSSPKLQMHQLHSEQEMLDKCKLKMDPQLYGPILL

Query:  NDQAVSSLYPAASYEPLIGEGSRMISEQLPITSEGCTSLTDNVSDGARRSERNSLEPNFSCVEAGVLPESRINVRRGILKRNPRGCRGICNCLNCSSFRL
         DQAV S     SY+PL G+GSRM+S+Q PITSE CT+LTDNVSD A+ SERNSLEPN  CVE G +P  RINV +GI K+NPRGCRGICNCLNCSSFRL
Subjt:  NDQAVSSLYPAASYEPLIGEGSRMISEQLPITSEGCTSLTDNVSDGARRSERNSLEPNFSCVEAGVLPESRINVRRGILKRNPRGCRGICNCLNCSSFRL

Query:  HAERAFEFSRNQLQDAEEVASDLMKELSSLRDVLEKCSDGAKGDAGYHSNKVKEACRKASEAELVAKDRLLQMNYELDIHSRITCSQRPNVRFSCQVEKR
        HAERAFEFSRNQLQDAE VASDLMKELS +RDVLEKCSDGA GDAGY+SNKVKEACRKASEAELVAKDRLLQMN +LDI SRI C QRPNVRFS +++KR
Subjt:  HAERAFEFSRNQLQDAEEVASDLMKELSSLRDVLEKCSDGAKGDAGYHSNKVKEACRKASEAELVAKDRLLQMNYELDIHSRITCSQRPNVRFSCQVEKR

Query:  EIEDGK
        +IE GK
Subjt:  EIEDGK

XP_023000090.1 uncharacterized protein LOC111494390 isoform X1 [Cucurbita maxima]0.0e+0077.79Show/hide
Query:  MASKRSSIVHQPQALQFGFLHLPRKKPKKLPQPDELASKDGDKVSFYVAKDLRIKRVFSPNLENRSSVPSGGQTSDKEGSITANGTCPNEDSRVGKSSEA
        MASKRSSIVHQPQ+LQ GFLHLPRKKPK+LP  DELAS  GDK+S+Y AKDLRIKRVFSPNL+NRSSVPS GQ SD+E  ITANGTCPN DS VGK S  
Subjt:  MASKRSSIVHQPQALQFGFLHLPRKKPKKLPQPDELASKDGDKVSFYVAKDLRIKRVFSPNLENRSSVPSGGQTSDKEGSITANGTCPNEDSRVGKSSEA

Query:  EELRNENFCNSNGYVKLSDDVPRCNGRSEELVHSTPPDAEVLVGGFVAASSNGCPQSSNGSVLGDACAKADCRIGSVTRTGSVLKPCSKRKLFKAPGSIA
         E+RNENFCNS+GYV+L D+  RCNG++ ELVHSTPPDAEVL GG VAASSNGCP+SS+GSVLGD CAKADCRI SVTRTGSVLKPCSKRKLFKAPGSIA
Subjt:  EELRNENFCNSNGYVKLSDDVPRCNGRSEELVHSTPPDAEVLVGGFVAASSNGCPQSSNGSVLGDACAKADCRIGSVTRTGSVLKPCSKRKLFKAPGSIA

Query:  YKRLLPFLLDTDNYTLQGDSCSKRENNLEKKANIESNLCNPAKEPSFIDSDTRVKNAVLASGMTCKTLKLNLPPPDNGDAKIVQNGSVSKNDPTLVKEDS
        YKRLLPFLLD DNY LQGD CSKRENNLEKK NIESN CN A E SF+DSDT VK A+LA G++C T+KLNL PPDNGD K   NGS S+NDPTLVKE+S
Subjt:  YKRLLPFLLDTDNYTLQGDSCSKRENNLEKKANIESNLCNPAKEPSFIDSDTRVKNAVLASGMTCKTLKLNLPPPDNGDAKIVQNGSVSKNDPTLVKEDS

Query:  GLKKDNAVCVSSLDKRLIENGGPTKHQIEDRSSNEQSKTLGMERLDGENSFISSNLSSEVDNLKSYVSEKLCNNGSEDIKRENHFNEELKMSSLDSNIAC
        GLK+D  V VSSLDKRL ENG            ++QSKT G+ERLDG + F SSNLSSEVDN KS+VSEKLCNN S DIK ENH  EE+K+SSLDS+IAC
Subjt:  GLKKDNAVCVSSLDKRLIENGGPTKHQIEDRSSNEQSKTLGMERLDGENSFISSNLSSEVDNLKSYVSEKLCNNGSEDIKRENHFNEELKMSSLDSNIAC

Query:  SLVKEERKDEKVECTRGTDQNLGGSTVGENDCNISTESDKKYGSCVRNKVVRNPLVQLNSKYSQVSVSYRRMLPFLEDLFKDNPEICASRNINCLSLEKE
        +LVKEERK+EKV CTRGTDQNLG STVGENDCNI+TESDKKYG CVRNKV+RNPLVQL SKYSQV VSYRRMLPFLEDLFKDNPE CAS NI+    EKE
Subjt:  SLVKEERKDEKVECTRGTDQNLGGSTVGENDCNISTESDKKYGSCVRNKVVRNPLVQLNSKYSQVSVSYRRMLPFLEDLFKDNPEICASRNINCLSLEKE

Query:  LPTINLQTPSSNSHNSQDKSEGIVSCNIPCDGDLDTLSMPGLNIMNETVCETERVLMPTGVDDKLLSSPKLQMHQLHSEQEMLDKCKLKMDPQLYGPILL
        LPT+NLQ+PSSNSHNS+DKSE + SCN+PC+G+LDT SMPGLN MNE VCETE+VL+  G+ D+LLSSPKLQMH  HSEQEMLDKC LK+DPQL+     
Subjt:  LPTINLQTPSSNSHNSQDKSEGIVSCNIPCDGDLDTLSMPGLNIMNETVCETERVLMPTGVDDKLLSSPKLQMHQLHSEQEMLDKCKLKMDPQLYGPILL

Query:  NDQAVSSLYPAASYEPLIGEGSRMISEQLPITSEGCTSLTDNVSDGARRSERNSLEPNFSCVEAGVLPESRINVRRGILKRNPRGCRGICNCLNCSSFRL
         DQAV SLY AASY+PL GEGSRM+S+Q PITSEGCT+LTDNVSD A+ SERNSLEPN  CVE  VLPESRINV +GI K+NPRGCRGICNCLNCSSFRL
Subjt:  NDQAVSSLYPAASYEPLIGEGSRMISEQLPITSEGCTSLTDNVSDGARRSERNSLEPNFSCVEAGVLPESRINVRRGILKRNPRGCRGICNCLNCSSFRL

Query:  HAERAFEFSRNQLQDAEEVASDLMKELSSLRDVLEKCSDGAKGDAGYHSNKVKEACRKASEAELVAKDRLLQMNYELDIHSRITCSQRPNVRFSCQVEKR
        HAERAFEFSRNQLQDAE VASDLMKELS +RDVLEKCS+GA GDAGY+SNKVKEACRKASEAELVAKDRL QMN +LDIHSRI C QRPNVRFS +++KR
Subjt:  HAERAFEFSRNQLQDAEEVASDLMKELSSLRDVLEKCSDGAKGDAGYHSNKVKEACRKASEAELVAKDRLLQMNYELDIHSRITCSQRPNVRFSCQVEKR

Query:  EIEDGK
        EIE GK
Subjt:  EIEDGK

XP_023000091.1 uncharacterized protein LOC111494390 isoform X2 [Cucurbita maxima]0.0e+0074.94Show/hide
Query:  MASKRSSIVHQPQALQFGFLHLPRKKPKKLPQPDELASKDGDKVSFYVAKDLRIKRVFSPNLENRSSVPSGGQTSDKEGSITANGTCPNEDSRVGKSSEA
        MASKRSSIVHQPQ+LQ GFLHLPRKKPK+LP  DELAS  GDK+S+Y AKDLRIKRVFSPNL+NRSSVPS GQ SD+E  ITANGTCPN DS VGK S  
Subjt:  MASKRSSIVHQPQALQFGFLHLPRKKPKKLPQPDELASKDGDKVSFYVAKDLRIKRVFSPNLENRSSVPSGGQTSDKEGSITANGTCPNEDSRVGKSSEA

Query:  EELRNENFCNSNGYVKLSDDVPRCNGRSEELVHSTPPDAEVLVGGFVAASSNGCPQSSNGSVLGDACAKADCRIGSVTRTGSVLKPCSKRKLFKAPGSIA
         E+RNENFCNS+GYV+L D+  RCNG++ ELVHSTPPDAEVL GG VAASSNGCP+SS+GSVLGD CAKADCRI SVTRTGSVLKPCSKRKLFKAPGSIA
Subjt:  EELRNENFCNSNGYVKLSDDVPRCNGRSEELVHSTPPDAEVLVGGFVAASSNGCPQSSNGSVLGDACAKADCRIGSVTRTGSVLKPCSKRKLFKAPGSIA

Query:  YKRLLPFLLDTDNYTLQGDSCSKRENNLEKKANIESNLCNPAKEPSFIDSDTRVKNAVLASGMTCKTLKLNLPPPDNGDAKIVQNGSVSKNDPTLVKEDS
        YKRLLPFLLD DNY LQGD CSKRENNLEKK NIESN CN A E SF+DSDT VK A+LA G++C T+KLNL PPDNGD K   NGS S+NDPTLVKE+S
Subjt:  YKRLLPFLLDTDNYTLQGDSCSKRENNLEKKANIESNLCNPAKEPSFIDSDTRVKNAVLASGMTCKTLKLNLPPPDNGDAKIVQNGSVSKNDPTLVKEDS

Query:  GLKKDNAVCVSSLDKRLIENGGPTKHQIEDRSSNEQSKTLGMERLDGENSFISSNLSSEVDNLKSYVSEKLCNNGSEDIKRENHFNEELKMSSLDSNIAC
        GLK+D  V VSSLDKRL ENG            ++QSKT G+ERLDG + F SSNLSSEVDN KS+VSEKLCNN S DIK ENH  EE+K+SSLDS+IAC
Subjt:  GLKKDNAVCVSSLDKRLIENGGPTKHQIEDRSSNEQSKTLGMERLDGENSFISSNLSSEVDNLKSYVSEKLCNNGSEDIKRENHFNEELKMSSLDSNIAC

Query:  SLVKEERKDEKVECTRGTDQNLGGSTVGENDCNISTESDKKYGSCVRNKVVRNPLVQLNSKYSQVSVSYRRMLPFLEDLFKDNPEICASRNINCLSLEKE
        +LVKEERK+EKV CTRGTDQNLG STVGENDCNI+TESDKKYG CVRNKV+RNPLVQL SKYSQV VSYRRMLPFLEDLFKDNPE CAS NI+    EKE
Subjt:  SLVKEERKDEKVECTRGTDQNLGGSTVGENDCNISTESDKKYGSCVRNKVVRNPLVQLNSKYSQVSVSYRRMLPFLEDLFKDNPEICASRNINCLSLEKE

Query:  LPTINLQTPSSNSHNSQDKSEGIVSCNIPCDGDLDTLSMPGLNIMNETVCETERVLMPTGVDDKLLSSPKLQMHQLHSEQEMLDKCKLKMDPQLYGPILL
        LPT+NLQ+PSSNSHNS+DKSE + SCN+PC+G+LDT SMPGLN MN                                E EMLDKC LK+DPQL+     
Subjt:  LPTINLQTPSSNSHNSQDKSEGIVSCNIPCDGDLDTLSMPGLNIMNETVCETERVLMPTGVDDKLLSSPKLQMHQLHSEQEMLDKCKLKMDPQLYGPILL

Query:  NDQAVSSLYPAASYEPLIGEGSRMISEQLPITSEGCTSLTDNVSDGARRSERNSLEPNFSCVEAGVLPESRINVRRGILKRNPRGCRGICNCLNCSSFRL
         DQAV SLY AASY+PL GEGSRM+S+Q PITSEGCT+LTDNVSD A+ SERNSLEPN  CVE  VLPESRINV +GI K+NPRGCRGICNCLNCSSFRL
Subjt:  NDQAVSSLYPAASYEPLIGEGSRMISEQLPITSEGCTSLTDNVSDGARRSERNSLEPNFSCVEAGVLPESRINVRRGILKRNPRGCRGICNCLNCSSFRL

Query:  HAERAFEFSRNQLQDAEEVASDLMKELSSLRDVLEKCSDGAKGDAGYHSNKVKEACRKASEAELVAKDRLLQMNYELDIHSRITCSQRPNVRFSCQVEKR
        HAERAFEFSRNQLQDAE VASDLMKELS +RDVLEKCS+GA GDAGY+SNKVKEACRKASEAELVAKDRL QMN +LDIHSRI C QRPNVRFS +++KR
Subjt:  HAERAFEFSRNQLQDAEEVASDLMKELSSLRDVLEKCSDGAKGDAGYHSNKVKEACRKASEAELVAKDRLLQMNYELDIHSRITCSQRPNVRFSCQVEKR

Query:  EIEDGK
        EIE GK
Subjt:  EIEDGK

XP_023514501.1 uncharacterized protein LOC111778760 isoform X1 [Cucurbita pepo subsp. pepo]0.0e+0078.41Show/hide
Query:  MASKRSSIVHQPQALQFGFLHLPRKKPKKLPQPDELASKDGDKVSFYVAKDLRIKRVFSPNLENRSSVPSGGQTSDKEGSITANGTCPNEDSRVGKSSEA
        MASKRSSIVHQPQ LQ GFLHLPRKKPK LPQ DELASK GD +S+Y AKDLRIKRVFSPNL+NRSS PS GQ SDK+G ITANGTCPN DS VGK S  
Subjt:  MASKRSSIVHQPQALQFGFLHLPRKKPKKLPQPDELASKDGDKVSFYVAKDLRIKRVFSPNLENRSSVPSGGQTSDKEGSITANGTCPNEDSRVGKSSEA

Query:  EELRNENFCNSNGYVKLSDDVPRCNGRSEELVHSTPPDAEVLVGGFVAASSNGCPQSSNGSVLGDACAKADCRIGSVTRTGSVLKPCSKRKLFKAPGSIA
         E+RNENF NSNG V+L D+  RC+G+S ELVHSTPPDAEVL GG VAASSNGCP+SS+GSVLGD CAKADCRI SVTRTGSVLKPCSKRKLFKAPGSIA
Subjt:  EELRNENFCNSNGYVKLSDDVPRCNGRSEELVHSTPPDAEVLVGGFVAASSNGCPQSSNGSVLGDACAKADCRIGSVTRTGSVLKPCSKRKLFKAPGSIA

Query:  YKRLLPFLLDTDNYTLQGDSCSKRENNLEKKANIESNLCNPAKEPSFIDSDTRVKNAVLASGMTCKTLKLNLPPPDNGDAKIVQNGSVSKNDPTLVKEDS
        YKRLLPFLLD DNY LQGD CSKRE NL KK NIESNLCN A E SFIDSDT V NAVLA G++C T+KLN  PPDNGDAK  QNGS S+NDPTLVKE+S
Subjt:  YKRLLPFLLDTDNYTLQGDSCSKRENNLEKKANIESNLCNPAKEPSFIDSDTRVKNAVLASGMTCKTLKLNLPPPDNGDAKIVQNGSVSKNDPTLVKEDS

Query:  GLKKDNAVCVSSLDKRLIENGGPTKHQIEDRSSNEQSKTLGMERLDGENSFISSNLSSEVDNLKSYVSEKLCNNGSEDIKRENHFNEELKMSSLDSNIAC
        GLK+D  V VSSLDK+L ENGGP+++QIEDR SNEQSKT  +ERLDG + FISSNLSSEVDN KS+VSEKLCNN SEDIK ENH  EE+K+SSLDS+IAC
Subjt:  GLKKDNAVCVSSLDKRLIENGGPTKHQIEDRSSNEQSKTLGMERLDGENSFISSNLSSEVDNLKSYVSEKLCNNGSEDIKRENHFNEELKMSSLDSNIAC

Query:  SLVKEERKDEKVECTRGTDQNLGGSTVGENDCNISTESDKKYGSCVRNKVVRNPLVQLNSKYSQVSVSYRRMLPFLEDLFKDNPEICASRNINCLSLEKE
        +LVKEERK+EKV CTRGTDQ LG STVGENDCNI+TESDKKYG CVRNKVVRNPLVQL SKY+QVSVSYRRMLPFLEDLFKDNPE CAS NI+    EKE
Subjt:  SLVKEERKDEKVECTRGTDQNLGGSTVGENDCNISTESDKKYGSCVRNKVVRNPLVQLNSKYSQVSVSYRRMLPFLEDLFKDNPEICASRNINCLSLEKE

Query:  LPTINLQTPSSNSHNSQDKSEGIVSCNIPCDGDLDTLSMPGLNIMNETVCETERVLMPTGVDDKLLSSPKLQMHQLHSEQEMLDKCKLKMDPQLYGPILL
        LPT+NLQ+PSSNSHNS+D+SEG+ SCN+PCDG LDT SMPG N MNE VCETE+VL+  G++D+LLSSPKLQMH LHSEQEMLD C LK+DPQLY     
Subjt:  LPTINLQTPSSNSHNSQDKSEGIVSCNIPCDGDLDTLSMPGLNIMNETVCETERVLMPTGVDDKLLSSPKLQMHQLHSEQEMLDKCKLKMDPQLYGPILL

Query:  NDQAVSSLYPAASYEPLIGEGSRMISEQLPITSEGCTSLTDNVSDGARRSERNSLEPNFSCVEAGVLPESRINVRRGILKRNPRGCRGICNCLNCSSFRL
         DQAV S     SY+ L G+GSRM+S+Q PITSEGCT+LTDNVSD A+ SERNSLEPN  CVEA VLP  RINV +GI K+NPRGCRGICNCLNCSSFRL
Subjt:  NDQAVSSLYPAASYEPLIGEGSRMISEQLPITSEGCTSLTDNVSDGARRSERNSLEPNFSCVEAGVLPESRINVRRGILKRNPRGCRGICNCLNCSSFRL

Query:  HAERAFEFSRNQLQDAEEVASDLMKELSSLRDVLEKCSDGAKGDAGYHSNKVKEACRKASEAELVAKDRLLQMNYELDIHSRITCSQRPNVRFSCQVEKR
        HAERAFEFSRNQLQDA+ VASDLMKELS +RDVLEKCSDGA GDAGYHSNKVKEACRKASEAELVAK+RLLQMN +LDI SRI C QRPNVRFS +++KR
Subjt:  HAERAFEFSRNQLQDAEEVASDLMKELSSLRDVLEKCSDGAKGDAGYHSNKVKEACRKASEAELVAKDRLLQMNYELDIHSRITCSQRPNVRFSCQVEKR

Query:  EIEDGK
        +IE GK
Subjt:  EIEDGK

TrEMBL top hitse value%identityAlignment
A0A6J1HGP9 uncharacterized protein LOC111464172 isoform X10.0e+0076.8Show/hide
Query:  MASKRSSIVHQPQALQFGFLHLPRKKPKKLPQPDELASKDGDKVSFYVAKDLRIKRVFSPNLENRSSVPSGGQTSDKEGSITANGTCPNEDSRVGKSSEA
        MASKRSSIVHQPQ LQ GFLHLPRKKPK LP  DELASK GD + +Y AKDLRIKRVFSPN +NRSS+PS GQ SD+E  IT NGTCPN DS VGKSS  
Subjt:  MASKRSSIVHQPQALQFGFLHLPRKKPKKLPQPDELASKDGDKVSFYVAKDLRIKRVFSPNLENRSSVPSGGQTSDKEGSITANGTCPNEDSRVGKSSEA

Query:  EELRNENFCNSNGYVKLSDDVPRCNGRSEELVHSTPPDAEVLVGGFVAASSNGCPQSSNGSVLGDACAKADCRIGSVTRTGSVLKPCSKRKLFKAPGSIA
         E+RNENFCNS+GYV+L D+  RC+G+  E VHSTPPDAEV+ GG VAASSNGCP+SSNGSVLGD CAKADCRI +VTRTGSVLKPCSKRKLFKAPGSIA
Subjt:  EELRNENFCNSNGYVKLSDDVPRCNGRSEELVHSTPPDAEVLVGGFVAASSNGCPQSSNGSVLGDACAKADCRIGSVTRTGSVLKPCSKRKLFKAPGSIA

Query:  YKRLLPFLLDTDNYTLQGDSCSKRENNLEKKANIESNLCNPAKEPSFIDSDTRVKNAVLASGMTCKTLKLNLPPPDNGDAKIVQNGSVSKNDPTLVKEDS
        YKRLLPFL D DNY L+GD CSKRE  LEKK NIESNLCN A E SFIDSDT V NAVLA G++C T+KLN  PP NGDAK  QNGS S+NDPTLVKE+S
Subjt:  YKRLLPFLLDTDNYTLQGDSCSKRENNLEKKANIESNLCNPAKEPSFIDSDTRVKNAVLASGMTCKTLKLNLPPPDNGDAKIVQNGSVSKNDPTLVKEDS

Query:  GLKKDNAVCVSSLDKRLIENGGPTKHQIEDRSSNEQSKTLGMERLDGENSFISSNLSSEVDNLKSYVSEKLCNNGSEDIKRENHFNEELKMSSLDSNIAC
        GLK+D  V VSSLDK+L ENGGP++HQIEDR SNEQSKT  +ERLDG + FISSNLSSEVDN KS+VSEKLCNN SEDIK ENH   E+++SSLDSNIAC
Subjt:  GLKKDNAVCVSSLDKRLIENGGPTKHQIEDRSSNEQSKTLGMERLDGENSFISSNLSSEVDNLKSYVSEKLCNNGSEDIKRENHFNEELKMSSLDSNIAC

Query:  SLVKEERKDEKVECTRGTDQNLGGSTVGENDCNISTESDKKYGSCVRNKVVRNPLVQLNSKYSQVSVSYRRMLPFLEDLFKDNPEICASRNINCLSLEKE
        +LVKEERK+EKV CTRGTDQNLG STVGEND N +TESDKKYG CVRNKVVRNPLVQL SKYSQVSVS+RRMLPFL DLFKDNPE CAS NI+    EKE
Subjt:  SLVKEERKDEKVECTRGTDQNLGGSTVGENDCNISTESDKKYGSCVRNKVVRNPLVQLNSKYSQVSVSYRRMLPFLEDLFKDNPEICASRNINCLSLEKE

Query:  LPTINLQTPSSNSHNSQDKSEGIVSCNIPCDGDLDTLSMPGLNIMNETVCETERVLMPTGVDDKLLSSPKLQMHQLHSEQEMLDKCKLKMDPQLYGPILL
        LPT+NLQ+PSSNSHNS D+SEG+ SCN+PCDG LDT SMPG N MNE VCETE+VL+  G++D+LLSSPKLQMH LHSEQEMLD C LK+DPQL+     
Subjt:  LPTINLQTPSSNSHNSQDKSEGIVSCNIPCDGDLDTLSMPGLNIMNETVCETERVLMPTGVDDKLLSSPKLQMHQLHSEQEMLDKCKLKMDPQLYGPILL

Query:  NDQAVSSLYPAASYEPLIGEGSRMISEQLPITSEGCTSLTDNVSDGARRSERNSLEPNFSCVEAGVLPESRINVRRGILKRNPRGCRGICNCLNCSSFRL
         DQAV S     SY+PL G+GSRM+S+Q PITSE CT+LTDNVSD A+ SERNSLEPN  CVE G +P  RINV +GI K+NPRGCRGICNCLNCSSFRL
Subjt:  NDQAVSSLYPAASYEPLIGEGSRMISEQLPITSEGCTSLTDNVSDGARRSERNSLEPNFSCVEAGVLPESRINVRRGILKRNPRGCRGICNCLNCSSFRL

Query:  HAERAFEFSRNQLQDAEEVASDLMKELSSLRDVLEKCSDGAKGDAGYHSNKVKEACRKASEAELVAKDRLLQMNYELDIHSRITCSQRPNVRFSCQVEKR
        HAERAFEFSRNQLQDAE VASDLMKELS +RDVLEKCSDGA GDAGY+SNKVKEACRKASEAELVAKDRLLQMN +LDI SRI C QRPNVRFS +++KR
Subjt:  HAERAFEFSRNQLQDAEEVASDLMKELSSLRDVLEKCSDGAKGDAGYHSNKVKEACRKASEAELVAKDRLLQMNYELDIHSRITCSQRPNVRFSCQVEKR

Query:  EIEDGK
        +IE GK
Subjt:  EIEDGK

A0A6J1HLX0 uncharacterized protein LOC111464172 isoform X20.0e+0073.82Show/hide
Query:  MASKRSSIVHQPQALQFGFLHLPRKKPKKLPQPDELASKDGDKVSFYVAKDLRIKRVFSPNLENRSSVPSGGQTSDKEGSITANGTCPNEDSRVGKSSEA
        MASKRSSIVHQPQ LQ GFLHLPRKKPK LP  DELASK GD + +Y AKDLRIKRVFSPN +NRSS+PS GQ SD+E  IT NGTCPN DS VGKSS  
Subjt:  MASKRSSIVHQPQALQFGFLHLPRKKPKKLPQPDELASKDGDKVSFYVAKDLRIKRVFSPNLENRSSVPSGGQTSDKEGSITANGTCPNEDSRVGKSSEA

Query:  EELRNENFCNSNGYVKLSDDVPRCNGRSEELVHSTPPDAEVLVGGFVAASSNGCPQSSNGSVLGDACAKADCRIGSVTRTGSVLKPCSKRKLFKAPGSIA
         E+RNENFCNS+GYV+L D+  RC+G+  E VHSTPPDAEV+ GG VAASSNGCP+SSNGSVLGD CAKADCRI +VTRTGSVLKPCSKRKLFKAPGSIA
Subjt:  EELRNENFCNSNGYVKLSDDVPRCNGRSEELVHSTPPDAEVLVGGFVAASSNGCPQSSNGSVLGDACAKADCRIGSVTRTGSVLKPCSKRKLFKAPGSIA

Query:  YKRLLPFLLDTDNYTLQGDSCSKRENNLEKKANIESNLCNPAKEPSFIDSDTRVKNAVLASGMTCKTLKLNLPPPDNGDAKIVQNGSVSKNDPTLVKEDS
        YKRLLPFL D DNY L+GD CSKRE  LEKK NIESNLCN A E SFIDSDT V NAVLA G++C T+KLN  PP NGDAK  QNGS S+NDPTLVKE+S
Subjt:  YKRLLPFLLDTDNYTLQGDSCSKRENNLEKKANIESNLCNPAKEPSFIDSDTRVKNAVLASGMTCKTLKLNLPPPDNGDAKIVQNGSVSKNDPTLVKEDS

Query:  GLKKDNAVCVSSLDKRLIENGGPTKHQIEDRSSNEQSKTLGMERLDGENSFISSNLSSEVDNLKSYVSEKLCNNGSEDIKRENHFNEELKMSSLDSNIAC
        GLK+D  V VSSLDK+L ENGGP++HQIEDR SNEQSKT  +ERLDG + FISSNLSSEVDN KS+VSEKLCNN SEDIK ENH   E+++SSLDSNIAC
Subjt:  GLKKDNAVCVSSLDKRLIENGGPTKHQIEDRSSNEQSKTLGMERLDGENSFISSNLSSEVDNLKSYVSEKLCNNGSEDIKRENHFNEELKMSSLDSNIAC

Query:  SLVKEERKDEKVECTRGTDQNLGGSTVGENDCNISTESDKKYGSCVRNKVVRNPLVQLNSKYSQVSVSYRRMLPFLEDLFKDNPEICASRNINCLSLEKE
        +LVKEERK+EKV CTRGTDQNLG STVGEND N +TESDKKYG CVRNKVVRNPLVQL SKYSQVSVS+RRMLPFL DLFKDNPE CAS NI+    EKE
Subjt:  SLVKEERKDEKVECTRGTDQNLGGSTVGENDCNISTESDKKYGSCVRNKVVRNPLVQLNSKYSQVSVSYRRMLPFLEDLFKDNPEICASRNINCLSLEKE

Query:  LPTINLQTPSSNSHNSQDKSEGIVSCNIPCDGDLDTLSMPGLNIMNETVCETERVLMPTGVDDKLLSSPKLQMHQLHSEQEMLDKCKLKMDPQLYGPILL
        LPT+NLQ+PSSNSHNS D+SEG+ SCN+PCDG LDT SMPG N MN                                E EMLD C LK+DPQL+     
Subjt:  LPTINLQTPSSNSHNSQDKSEGIVSCNIPCDGDLDTLSMPGLNIMNETVCETERVLMPTGVDDKLLSSPKLQMHQLHSEQEMLDKCKLKMDPQLYGPILL

Query:  NDQAVSSLYPAASYEPLIGEGSRMISEQLPITSEGCTSLTDNVSDGARRSERNSLEPNFSCVEAGVLPESRINVRRGILKRNPRGCRGICNCLNCSSFRL
         DQAV S     SY+PL G+GSRM+S+Q PITSE CT+LTDNVSD A+ SERNSLEPN  CVE G +P  RINV +GI K+NPRGCRGICNCLNCSSFRL
Subjt:  NDQAVSSLYPAASYEPLIGEGSRMISEQLPITSEGCTSLTDNVSDGARRSERNSLEPNFSCVEAGVLPESRINVRRGILKRNPRGCRGICNCLNCSSFRL

Query:  HAERAFEFSRNQLQDAEEVASDLMKELSSLRDVLEKCSDGAKGDAGYHSNKVKEACRKASEAELVAKDRLLQMNYELDIHSRITCSQRPNVRFSCQVEKR
        HAERAFEFSRNQLQDAE VASDLMKELS +RDVLEKCSDGA GDAGY+SNKVKEACRKASEAELVAKDRLLQMN +LDI SRI C QRPNVRFS +++KR
Subjt:  HAERAFEFSRNQLQDAEEVASDLMKELSSLRDVLEKCSDGAKGDAGYHSNKVKEACRKASEAELVAKDRLLQMNYELDIHSRITCSQRPNVRFSCQVEKR

Query:  EIEDGK
        +IE GK
Subjt:  EIEDGK

A0A6J1KEX1 uncharacterized protein LOC111494390 isoform X20.0e+0074.94Show/hide
Query:  MASKRSSIVHQPQALQFGFLHLPRKKPKKLPQPDELASKDGDKVSFYVAKDLRIKRVFSPNLENRSSVPSGGQTSDKEGSITANGTCPNEDSRVGKSSEA
        MASKRSSIVHQPQ+LQ GFLHLPRKKPK+LP  DELAS  GDK+S+Y AKDLRIKRVFSPNL+NRSSVPS GQ SD+E  ITANGTCPN DS VGK S  
Subjt:  MASKRSSIVHQPQALQFGFLHLPRKKPKKLPQPDELASKDGDKVSFYVAKDLRIKRVFSPNLENRSSVPSGGQTSDKEGSITANGTCPNEDSRVGKSSEA

Query:  EELRNENFCNSNGYVKLSDDVPRCNGRSEELVHSTPPDAEVLVGGFVAASSNGCPQSSNGSVLGDACAKADCRIGSVTRTGSVLKPCSKRKLFKAPGSIA
         E+RNENFCNS+GYV+L D+  RCNG++ ELVHSTPPDAEVL GG VAASSNGCP+SS+GSVLGD CAKADCRI SVTRTGSVLKPCSKRKLFKAPGSIA
Subjt:  EELRNENFCNSNGYVKLSDDVPRCNGRSEELVHSTPPDAEVLVGGFVAASSNGCPQSSNGSVLGDACAKADCRIGSVTRTGSVLKPCSKRKLFKAPGSIA

Query:  YKRLLPFLLDTDNYTLQGDSCSKRENNLEKKANIESNLCNPAKEPSFIDSDTRVKNAVLASGMTCKTLKLNLPPPDNGDAKIVQNGSVSKNDPTLVKEDS
        YKRLLPFLLD DNY LQGD CSKRENNLEKK NIESN CN A E SF+DSDT VK A+LA G++C T+KLNL PPDNGD K   NGS S+NDPTLVKE+S
Subjt:  YKRLLPFLLDTDNYTLQGDSCSKRENNLEKKANIESNLCNPAKEPSFIDSDTRVKNAVLASGMTCKTLKLNLPPPDNGDAKIVQNGSVSKNDPTLVKEDS

Query:  GLKKDNAVCVSSLDKRLIENGGPTKHQIEDRSSNEQSKTLGMERLDGENSFISSNLSSEVDNLKSYVSEKLCNNGSEDIKRENHFNEELKMSSLDSNIAC
        GLK+D  V VSSLDKRL ENG            ++QSKT G+ERLDG + F SSNLSSEVDN KS+VSEKLCNN S DIK ENH  EE+K+SSLDS+IAC
Subjt:  GLKKDNAVCVSSLDKRLIENGGPTKHQIEDRSSNEQSKTLGMERLDGENSFISSNLSSEVDNLKSYVSEKLCNNGSEDIKRENHFNEELKMSSLDSNIAC

Query:  SLVKEERKDEKVECTRGTDQNLGGSTVGENDCNISTESDKKYGSCVRNKVVRNPLVQLNSKYSQVSVSYRRMLPFLEDLFKDNPEICASRNINCLSLEKE
        +LVKEERK+EKV CTRGTDQNLG STVGENDCNI+TESDKKYG CVRNKV+RNPLVQL SKYSQV VSYRRMLPFLEDLFKDNPE CAS NI+    EKE
Subjt:  SLVKEERKDEKVECTRGTDQNLGGSTVGENDCNISTESDKKYGSCVRNKVVRNPLVQLNSKYSQVSVSYRRMLPFLEDLFKDNPEICASRNINCLSLEKE

Query:  LPTINLQTPSSNSHNSQDKSEGIVSCNIPCDGDLDTLSMPGLNIMNETVCETERVLMPTGVDDKLLSSPKLQMHQLHSEQEMLDKCKLKMDPQLYGPILL
        LPT+NLQ+PSSNSHNS+DKSE + SCN+PC+G+LDT SMPGLN MN                                E EMLDKC LK+DPQL+     
Subjt:  LPTINLQTPSSNSHNSQDKSEGIVSCNIPCDGDLDTLSMPGLNIMNETVCETERVLMPTGVDDKLLSSPKLQMHQLHSEQEMLDKCKLKMDPQLYGPILL

Query:  NDQAVSSLYPAASYEPLIGEGSRMISEQLPITSEGCTSLTDNVSDGARRSERNSLEPNFSCVEAGVLPESRINVRRGILKRNPRGCRGICNCLNCSSFRL
         DQAV SLY AASY+PL GEGSRM+S+Q PITSEGCT+LTDNVSD A+ SERNSLEPN  CVE  VLPESRINV +GI K+NPRGCRGICNCLNCSSFRL
Subjt:  NDQAVSSLYPAASYEPLIGEGSRMISEQLPITSEGCTSLTDNVSDGARRSERNSLEPNFSCVEAGVLPESRINVRRGILKRNPRGCRGICNCLNCSSFRL

Query:  HAERAFEFSRNQLQDAEEVASDLMKELSSLRDVLEKCSDGAKGDAGYHSNKVKEACRKASEAELVAKDRLLQMNYELDIHSRITCSQRPNVRFSCQVEKR
        HAERAFEFSRNQLQDAE VASDLMKELS +RDVLEKCS+GA GDAGY+SNKVKEACRKASEAELVAKDRL QMN +LDIHSRI C QRPNVRFS +++KR
Subjt:  HAERAFEFSRNQLQDAEEVASDLMKELSSLRDVLEKCSDGAKGDAGYHSNKVKEACRKASEAELVAKDRLLQMNYELDIHSRITCSQRPNVRFSCQVEKR

Query:  EIEDGK
        EIE GK
Subjt:  EIEDGK

A0A6J1KHD4 uncharacterized protein LOC111494390 isoform X10.0e+0077.79Show/hide
Query:  MASKRSSIVHQPQALQFGFLHLPRKKPKKLPQPDELASKDGDKVSFYVAKDLRIKRVFSPNLENRSSVPSGGQTSDKEGSITANGTCPNEDSRVGKSSEA
        MASKRSSIVHQPQ+LQ GFLHLPRKKPK+LP  DELAS  GDK+S+Y AKDLRIKRVFSPNL+NRSSVPS GQ SD+E  ITANGTCPN DS VGK S  
Subjt:  MASKRSSIVHQPQALQFGFLHLPRKKPKKLPQPDELASKDGDKVSFYVAKDLRIKRVFSPNLENRSSVPSGGQTSDKEGSITANGTCPNEDSRVGKSSEA

Query:  EELRNENFCNSNGYVKLSDDVPRCNGRSEELVHSTPPDAEVLVGGFVAASSNGCPQSSNGSVLGDACAKADCRIGSVTRTGSVLKPCSKRKLFKAPGSIA
         E+RNENFCNS+GYV+L D+  RCNG++ ELVHSTPPDAEVL GG VAASSNGCP+SS+GSVLGD CAKADCRI SVTRTGSVLKPCSKRKLFKAPGSIA
Subjt:  EELRNENFCNSNGYVKLSDDVPRCNGRSEELVHSTPPDAEVLVGGFVAASSNGCPQSSNGSVLGDACAKADCRIGSVTRTGSVLKPCSKRKLFKAPGSIA

Query:  YKRLLPFLLDTDNYTLQGDSCSKRENNLEKKANIESNLCNPAKEPSFIDSDTRVKNAVLASGMTCKTLKLNLPPPDNGDAKIVQNGSVSKNDPTLVKEDS
        YKRLLPFLLD DNY LQGD CSKRENNLEKK NIESN CN A E SF+DSDT VK A+LA G++C T+KLNL PPDNGD K   NGS S+NDPTLVKE+S
Subjt:  YKRLLPFLLDTDNYTLQGDSCSKRENNLEKKANIESNLCNPAKEPSFIDSDTRVKNAVLASGMTCKTLKLNLPPPDNGDAKIVQNGSVSKNDPTLVKEDS

Query:  GLKKDNAVCVSSLDKRLIENGGPTKHQIEDRSSNEQSKTLGMERLDGENSFISSNLSSEVDNLKSYVSEKLCNNGSEDIKRENHFNEELKMSSLDSNIAC
        GLK+D  V VSSLDKRL ENG            ++QSKT G+ERLDG + F SSNLSSEVDN KS+VSEKLCNN S DIK ENH  EE+K+SSLDS+IAC
Subjt:  GLKKDNAVCVSSLDKRLIENGGPTKHQIEDRSSNEQSKTLGMERLDGENSFISSNLSSEVDNLKSYVSEKLCNNGSEDIKRENHFNEELKMSSLDSNIAC

Query:  SLVKEERKDEKVECTRGTDQNLGGSTVGENDCNISTESDKKYGSCVRNKVVRNPLVQLNSKYSQVSVSYRRMLPFLEDLFKDNPEICASRNINCLSLEKE
        +LVKEERK+EKV CTRGTDQNLG STVGENDCNI+TESDKKYG CVRNKV+RNPLVQL SKYSQV VSYRRMLPFLEDLFKDNPE CAS NI+    EKE
Subjt:  SLVKEERKDEKVECTRGTDQNLGGSTVGENDCNISTESDKKYGSCVRNKVVRNPLVQLNSKYSQVSVSYRRMLPFLEDLFKDNPEICASRNINCLSLEKE

Query:  LPTINLQTPSSNSHNSQDKSEGIVSCNIPCDGDLDTLSMPGLNIMNETVCETERVLMPTGVDDKLLSSPKLQMHQLHSEQEMLDKCKLKMDPQLYGPILL
        LPT+NLQ+PSSNSHNS+DKSE + SCN+PC+G+LDT SMPGLN MNE VCETE+VL+  G+ D+LLSSPKLQMH  HSEQEMLDKC LK+DPQL+     
Subjt:  LPTINLQTPSSNSHNSQDKSEGIVSCNIPCDGDLDTLSMPGLNIMNETVCETERVLMPTGVDDKLLSSPKLQMHQLHSEQEMLDKCKLKMDPQLYGPILL

Query:  NDQAVSSLYPAASYEPLIGEGSRMISEQLPITSEGCTSLTDNVSDGARRSERNSLEPNFSCVEAGVLPESRINVRRGILKRNPRGCRGICNCLNCSSFRL
         DQAV SLY AASY+PL GEGSRM+S+Q PITSEGCT+LTDNVSD A+ SERNSLEPN  CVE  VLPESRINV +GI K+NPRGCRGICNCLNCSSFRL
Subjt:  NDQAVSSLYPAASYEPLIGEGSRMISEQLPITSEGCTSLTDNVSDGARRSERNSLEPNFSCVEAGVLPESRINVRRGILKRNPRGCRGICNCLNCSSFRL

Query:  HAERAFEFSRNQLQDAEEVASDLMKELSSLRDVLEKCSDGAKGDAGYHSNKVKEACRKASEAELVAKDRLLQMNYELDIHSRITCSQRPNVRFSCQVEKR
        HAERAFEFSRNQLQDAE VASDLMKELS +RDVLEKCS+GA GDAGY+SNKVKEACRKASEAELVAKDRL QMN +LDIHSRI C QRPNVRFS +++KR
Subjt:  HAERAFEFSRNQLQDAEEVASDLMKELSSLRDVLEKCSDGAKGDAGYHSNKVKEACRKASEAELVAKDRLLQMNYELDIHSRITCSQRPNVRFSCQVEKR

Query:  EIEDGK
        EIE GK
Subjt:  EIEDGK

A0A6J1L546 uncharacterized protein LOC1114992190.0e+0073.81Show/hide
Query:  MESKRGAMASKRSSIVHQPQALQFGFLHLPRKKPKKLP--QPDELASKDGDKVSFYVAKDLRIKRVFSPNLENRSSVPSGGQTSDKEGSITANGTCPNED
        MESK  AMASKRSSIV+QP+ALQ GFLHLPRKKPK LP  Q +ELASKDGD VS +VAKDLR+KRVFSPNLENRSSV SG   SDKEG +TANGTC NED
Subjt:  MESKRGAMASKRSSIVHQPQALQFGFLHLPRKKPKKLP--QPDELASKDGDKVSFYVAKDLRIKRVFSPNLENRSSVPSGGQTSDKEGSITANGTCPNED

Query:  SRVGKSSEAEELRNENFCNSNGYVKLSDDVPRCNGRSEELVHSTPPDAEVLVGGFVAASSNGCPQSSNGSVLGDACAKADCRIGSVTRTGSVLKPCSKRK
        S VGK SE  E+RNENFCNSN Y +  D+V +CNG S E +HSTPPD E L GGFVAASS+GCP+SSNG V+GD CAKADCRI SVTRTGSVLKPCSKRK
Subjt:  SRVGKSSEAEELRNENFCNSNGYVKLSDDVPRCNGRSEELVHSTPPDAEVLVGGFVAASSNGCPQSSNGSVLGDACAKADCRIGSVTRTGSVLKPCSKRK

Query:  LFKAPGSIAYKRLLPFLLDTDNYTLQGDSCSKRENNLEKKANIESNLCNPAKEPSFIDSDTRVKNAVLASGMTCKTLKLNLPPPDNGDAKIVQNGSVSKN
        LFKAPGSIAYKR+LPFLLD+DN+TL  D   KRENNLEKK NIESNLCNPA   SF+DSDT VKNAV ASG  CKT+KL+LPPPDNGD K  QNGS   +
Subjt:  LFKAPGSIAYKRLLPFLLDTDNYTLQGDSCSKRENNLEKKANIESNLCNPAKEPSFIDSDTRVKNAVLASGMTCKTLKLNLPPPDNGDAKIVQNGSVSKN

Query:  DPTLVKEDSGLKKDNAVCVSSLDKRLIENGGPTKHQIEDRSSNEQSKTLGMERLDGENSFISSNLSSEVDNLKSYVSEKLCNNGSEDIKRENHFNEELKM
        DPTLV+E S LKKDN VC S +D+R      PTK+ IEDRSS EQSKT GMERLDG N  I     SE +N KS+VSEKLCNN SED+ RE+HFNEELKM
Subjt:  DPTLVKEDSGLKKDNAVCVSSLDKRLIENGGPTKHQIEDRSSNEQSKTLGMERLDGENSFISSNLSSEVDNLKSYVSEKLCNNGSEDIKRENHFNEELKM

Query:  SSLDSNIACSLVKEERKDEKVECTRGTDQNLGGSTVGENDCNISTESDKKYGSCVRNKVVRNPLVQLNSKYSQVSVSYRRMLPFLEDLFKDNPEICASRN
        S LDSNI C+ VKEER+DEKV CTRG D+ LG STVGEN CNI+TESDKKYG+ VRNK+VRNPL QL   YSQ SVSYRRMLPFLEDLFKDNP+ CAS N
Subjt:  SSLDSNIACSLVKEERKDEKVECTRGTDQNLGGSTVGENDCNISTESDKKYGSCVRNKVVRNPLVQLNSKYSQVSVSYRRMLPFLEDLFKDNPEICASRN

Query:  INCLSLEKELPTINLQTPSSNSHNSQDKSEGIVSCNIPCDGDLDTLSMPGLNIMNETVCETERVLMPTGVDDKLLSSP-KLQMHQLHSEQEMLDKCKLKM
        INC   EKELPT+NL  PSSNSHNSQDKSE +VSCN+PCDG+ D LSMP  N +N+ VCE + VLMP GV+D LLS P       LHS+ EML+KCKLKM
Subjt:  INCLSLEKELPTINLQTPSSNSHNSQDKSEGIVSCNIPCDGDLDTLSMPGLNIMNETVCETERVLMPTGVDDKLLSSP-KLQMHQLHSEQEMLDKCKLKM

Query:  DPQLYGPILLNDQAVSSLYPAASYEPLIGEGSRMISEQLPITSEGCTSLTDNVSDGARRSERNSLEPNFSCVEAGVLPESRINVRRGILKRNPRGCRGIC
        DPQ      LNDQAVSS Y A SYEPL GEGSRM S+Q P TSE CT+LT+ VSDG + +ERNSL+P    VEA +LPE+ IN+R+GILKRN RGCRGIC
Subjt:  DPQLYGPILLNDQAVSSLYPAASYEPLIGEGSRMISEQLPITSEGCTSLTDNVSDGARRSERNSLEPNFSCVEAGVLPESRINVRRGILKRNPRGCRGIC

Query:  NCLNCSSFRLHAERAFEFSRNQLQDAEEVASDLMKELSSLRDVLEKCSDGAK-GDAGYHSNKVKEACRKASEAELVAKDRLLQMNYELDIHSRITCSQRP
        NCLNCSSFRLHAERAFEFSRNQLQDAEEVASDLMKEL  LR VLEK +D  K GDAGYHSNKVKEACRKASEAEL+AKDRLLQMNYEL IH RITCSQRP
Subjt:  NCLNCSSFRLHAERAFEFSRNQLQDAEEVASDLMKELSSLRDVLEKCSDGAK-GDAGYHSNKVKEACRKASEAELVAKDRLLQMNYELDIHSRITCSQRP

Query:  NVRFSCQVEKREIEDGK
        NVRFS +VE+ EIEDGK
Subjt:  NVRFSCQVEKREIEDGK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G23740.1 unknown protein7.4e-3332.45Show/hide
Query:  SVSYRRMLPFLEDLFKDNPEICASRNINCLSLEKELPTINLQTPSSNSHNSQDKSEGIVSCNIPCDGDLDTLSMPGLNIMNETVCETERVLMPTGVDDKL
        SV+YRRMLP+L+D+ +DNP             +K    + + +P  NS +  + ++ +V+ N+  +    +         NE     ERV +     D  
Subjt:  SVSYRRMLPFLEDLFKDNPEICASRNINCLSLEKELPTINLQTPSSNSHNSQDKSEGIVSCNIPCDGDLDTLSMPGLNIMNETVCETERVLMPTGVDDKL

Query:  LSSPKLQMHQLHSEQEMLDKCKLKMDPQLYGPILLNDQAVSSLYPAASYEPLIGEGSRMISEQLPITSEGCTSLTDNVSDGARRSERNSLEPNFSCVEAG
                H +   +  L   ++ +   L G    ++   S+L+     + L+GE +   +E                   A   E +S +     V+  
Subjt:  LSSPKLQMHQLHSEQEMLDKCKLKMDPQLYGPILLNDQAVSSLYPAASYEPLIGEGSRMISEQLPITSEGCTSLTDNVSDGARRSERNSLEPNFSCVEAG

Query:  VL--PESRINVRRGILKRNPRGCRGICNCLNCSSFRLHAERAFEFSRNQLQDAEEVASDLMKELSSLRDVLEKCSDGAKGDAGYHSNKVKEACRKASEAE
        V+    S  +  +GILKR+ RGCRGIC+CLNCSSFRLHAERAFEFSRNQLQD E +  DL+ E+S LRD+LEK +     +   + ++  EA ++A EA 
Subjt:  VL--PESRINVRRGILKRNPRGCRGICNCLNCSSFRLHAERAFEFSRNQLQDAEEVASDLMKELSSLRDVLEKCSDGAKGDAGYHSNKVKEACRKASEAE

Query:  LVAKDRLLQMNYELDIHSRITCSQRPNVRFSCQVEKREI
         +AK RL QMN +L IH RI   QR  V+F+  + ++ I
Subjt:  LVAKDRLLQMNYELDIHSRITCSQRPNVRFSCQVEKREI

AT3G23740.1 unknown protein2.9e-0523.48Show/hide
Query:  KDLRIKRVFSPNLENRSSVPSGGQTSDKEGSITANGTCPNEDSRVGKSSEAEELRNENFCNSNGYVKLSDDVPRCNGRSEELVHSTPPDAEVLVGGFVAA
        KD+R++RVFSP                    I+ +  C  +D      ++ +   +E  C+ +            N  S++   +TPPD+E+L       
Subjt:  KDLRIKRVFSPNLENRSSVPSGGQTSDKEGSITANGTCPNEDSRVGKSSEAEELRNENFCNSNGYVKLSDDVPRCNGRSEELVHSTPPDAEVLVGGFVAA

Query:  SSNGCPQSSNGSVLGDACAKADCRIGSVTRTGSVLKPCSKRKLFKAPGSIAYKRLLPFLLD-TDNYTLQGDSCSKR-ENNLEKKA----------NIESN
              +  NGSV+     K+D  +       SVL PCS+ K+FK  G  +YKRLLP+L+  +D+ T     CSK    N+ K            +   +
Subjt:  SSNGCPQSSNGSVLGDACAKADCRIGSVTRTGSVLKPCSKRKLFKAPGSIAYKRLLPFLLD-TDNYTLQGDSCSKR-ENNLEKKA----------NIESN

Query:  LCNPAKEPSFIDSDTRVKNAVLASGMTCKT---------------LKLNLPPPDNGDAKIVQN---GSVSKNDPTLVKEDSGLKKDNAVCVSSLDKRL--
         C        + + T  KNA  + G   KT               ++ + P P     +++ +    S S N+ T     S + +++       ++ L  
Subjt:  LCNPAKEPSFIDSDTRVKNAVLASGMTCKT---------------LKLNLPPPDNGDAKIVQN---GSVSKNDPTLVKEDSGLKKDNAVCVSSLDKRL--

Query:  ------IENGGP-------TKHQIEDRSSNEQSKTLGMERLDGENSFISSNLS----SEVDNLKSYVSEKLCNNGSEDIKRENHFNEELKMSSLDS
              +E   P        KH I D  +N  S+      L G  S    N S    + VDNL   V E+  N    +I       EEL+  S D+
Subjt:  ------IENGGP-------TKHQIEDRSSNEQSKTLGMERLDGENSFISSNLS----SEVDNLKSYVSEKLCNNGSEDIKRENHFNEELKMSSLDS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAATCAAAGCGTGGTGCTATGGCTTCCAAACGCTCCTCCATTGTTCATCAACCTCAAGCTCTGCAATTTGGATTCCTCCACTTACCCCGCAAAAAACCTAAGAAGCT
GCCGCAGCCGGATGAACTTGCTTCGAAAGATGGGGATAAGGTTTCGTTTTATGTTGCGAAGGATCTGCGTATCAAGCGAGTTTTCTCTCCCAATTTAGAGAATCGCTCCT
CGGTGCCATCGGGAGGGCAGACTAGCGACAAAGAAGGGTCGATAACCGCGAATGGGACTTGTCCGAATGAAGATAGTCGAGTCGGTAAAAGCTCCGAGGCTGAAGAGTTA
CGGAATGAGAATTTTTGCAATTCTAATGGGTACGTTAAATTAAGCGATGACGTTCCGAGATGTAATGGAAGGAGTGAGGAACTGGTGCATTCTACACCCCCTGATGCGGA
GGTTCTGGTTGGGGGTTTTGTGGCGGCTTCTTCAAACGGATGTCCTCAATCAAGTAATGGAAGCGTTTTAGGGGATGCTTGTGCGAAAGCTGACTGTAGAATTGGCTCTG
TTACTAGAACCGGATCGGTCCTCAAACCATGTTCTAAACGGAAGCTATTCAAAGCACCTGGCTCCATTGCCTACAAAAGATTGCTACCCTTTTTGCTGGATACGGATAAT
TACACCCTACAAGGTGATTCGTGCTCAAAACGTGAAAACAATTTGGAGAAGAAGGCAAATATTGAATCTAATCTGTGTAATCCAGCCAAGGAGCCATCTTTTATTGATTC
AGATACTAGAGTAAAGAATGCAGTGTTGGCCTCCGGCATGACATGTAAGACTTTGAAGCTAAATTTACCACCTCCTGATAATGGTGATGCTAAAATTGTTCAGAATGGCA
GTGTTTCGAAGAATGACCCGACCTTAGTCAAGGAGGATTCTGGTTTGAAAAAAGATAATGCAGTTTGTGTCTCTTCTCTTGACAAAAGGCTGATAGAGAATGGAGGGCCC
ACAAAGCATCAGATAGAGGATCGTTCTTCTAATGAACAATCCAAAACCTTGGGAATGGAGAGGCTTGATGGCGAAAATTCCTTTATATCTTCAAATCTTTCATCAGAAGT
AGACAATCTTAAGTCCTATGTTTCGGAGAAGTTGTGTAACAATGGTTCTGAGGATATCAAAAGAGAAAATCATTTTAACGAAGAACTCAAGATGTCATCATTGGATTCTA
ACATTGCTTGTAGCCTAGTGAAGGAAGAAAGGAAGGATGAAAAAGTTGAATGTACTCGAGGCACAGATCAAAATCTTGGTGGTTCTACTGTTGGTGAAAATGATTGCAAC
ATTTCTACAGAGAGTGACAAGAAGTATGGCTCCTGTGTTAGAAACAAAGTGGTTCGCAATCCACTTGTACAACTGAATTCGAAATACAGCCAAGTTTCAGTTAGCTATCG
AAGGATGCTTCCATTCCTTGAGGATCTTTTTAAAGATAATCCAGAGATCTGTGCCTCAAGAAACATCAACTGTCTGAGCCTAGAGAAAGAATTGCCAACTATAAATTTGC
AAACACCGAGTTCAAATTCTCACAATTCCCAGGATAAATCAGAAGGCATAGTATCGTGCAACATTCCATGCGATGGAGATTTAGATACTCTCTCAATGCCTGGGTTGAAT
ATCATGAATGAAACGGTTTGTGAAACAGAAAGAGTTTTAATGCCTACTGGAGTCGATGATAAACTTCTATCATCACCAAAATTACAGATGCATCAGCTGCATTCTGAACA
GGAGATGTTGGATAAATGTAAGTTGAAAATGGACCCTCAATTGTATGGTCCAATACTTTTAAATGATCAAGCAGTCTCATCATTGTATCCTGCTGCTAGTTATGAGCCTC
TAATTGGAGAAGGATCTAGAATGATCTCTGAACAATTACCAATTACTTCAGAAGGCTGTACAAGTTTGACAGATAATGTTTCTGATGGTGCTAGACGCTCTGAGAGAAAC
AGCTTAGAACCTAATTTTTCATGTGTAGAAGCAGGTGTTCTGCCAGAAAGTCGAATTAATGTTAGAAGGGGAATTCTTAAGCGAAACCCACGAGGATGCAGAGGAATCTG
CAATTGTTTGAACTGTTCCTCTTTTCGTCTTCATGCTGAAAGGGCATTTGAATTTTCCAGAAATCAACTGCAAGATGCTGAAGAAGTTGCTTCAGATTTGATGAAAGAAT
TGTCATCTCTCCGTGATGTGCTAGAAAAATGTTCTGATGGTGCAAAAGGGGATGCTGGATATCATTCAAATAAGGTGAAAGAAGCTTGTAGGAAAGCATCTGAAGCAGAG
TTAGTTGCAAAAGACCGCCTTCTACAAATGAACTACGAACTTGACATTCACAGCAGAATAACGTGCTCGCAACGACCAAATGTCAGATTTTCTTGTCAAGTTGAGAAAAG
AGAGATTGAAGATGGCAAATAG
mRNA sequenceShow/hide mRNA sequence
GACTTCTTCCCGCTCTCACTTTTTCCCTCTTTCCCTCTCCAATTGACTATCATCGCCATCGATTCTAGGTTTTCTCAGTACAATCCTCCATGGAATCAAAGCGTGGTGCT
ATGGCTTCCAAACGCTCCTCCATTGTTCATCAACCTCAAGCTCTGCAATTTGGATTCCTCCACTTACCCCGCAAAAAACCTAAGAAGCTGCCGCAGCCGGATGAACTTGC
TTCGAAAGATGGGGATAAGGTTTCGTTTTATGTTGCGAAGGATCTGCGTATCAAGCGAGTTTTCTCTCCCAATTTAGAGAATCGCTCCTCGGTGCCATCGGGAGGGCAGA
CTAGCGACAAAGAAGGGTCGATAACCGCGAATGGGACTTGTCCGAATGAAGATAGTCGAGTCGGTAAAAGCTCCGAGGCTGAAGAGTTACGGAATGAGAATTTTTGCAAT
TCTAATGGGTACGTTAAATTAAGCGATGACGTTCCGAGATGTAATGGAAGGAGTGAGGAACTGGTGCATTCTACACCCCCTGATGCGGAGGTTCTGGTTGGGGGTTTTGT
GGCGGCTTCTTCAAACGGATGTCCTCAATCAAGTAATGGAAGCGTTTTAGGGGATGCTTGTGCGAAAGCTGACTGTAGAATTGGCTCTGTTACTAGAACCGGATCGGTCC
TCAAACCATGTTCTAAACGGAAGCTATTCAAAGCACCTGGCTCCATTGCCTACAAAAGATTGCTACCCTTTTTGCTGGATACGGATAATTACACCCTACAAGGTGATTCG
TGCTCAAAACGTGAAAACAATTTGGAGAAGAAGGCAAATATTGAATCTAATCTGTGTAATCCAGCCAAGGAGCCATCTTTTATTGATTCAGATACTAGAGTAAAGAATGC
AGTGTTGGCCTCCGGCATGACATGTAAGACTTTGAAGCTAAATTTACCACCTCCTGATAATGGTGATGCTAAAATTGTTCAGAATGGCAGTGTTTCGAAGAATGACCCGA
CCTTAGTCAAGGAGGATTCTGGTTTGAAAAAAGATAATGCAGTTTGTGTCTCTTCTCTTGACAAAAGGCTGATAGAGAATGGAGGGCCCACAAAGCATCAGATAGAGGAT
CGTTCTTCTAATGAACAATCCAAAACCTTGGGAATGGAGAGGCTTGATGGCGAAAATTCCTTTATATCTTCAAATCTTTCATCAGAAGTAGACAATCTTAAGTCCTATGT
TTCGGAGAAGTTGTGTAACAATGGTTCTGAGGATATCAAAAGAGAAAATCATTTTAACGAAGAACTCAAGATGTCATCATTGGATTCTAACATTGCTTGTAGCCTAGTGA
AGGAAGAAAGGAAGGATGAAAAAGTTGAATGTACTCGAGGCACAGATCAAAATCTTGGTGGTTCTACTGTTGGTGAAAATGATTGCAACATTTCTACAGAGAGTGACAAG
AAGTATGGCTCCTGTGTTAGAAACAAAGTGGTTCGCAATCCACTTGTACAACTGAATTCGAAATACAGCCAAGTTTCAGTTAGCTATCGAAGGATGCTTCCATTCCTTGA
GGATCTTTTTAAAGATAATCCAGAGATCTGTGCCTCAAGAAACATCAACTGTCTGAGCCTAGAGAAAGAATTGCCAACTATAAATTTGCAAACACCGAGTTCAAATTCTC
ACAATTCCCAGGATAAATCAGAAGGCATAGTATCGTGCAACATTCCATGCGATGGAGATTTAGATACTCTCTCAATGCCTGGGTTGAATATCATGAATGAAACGGTTTGT
GAAACAGAAAGAGTTTTAATGCCTACTGGAGTCGATGATAAACTTCTATCATCACCAAAATTACAGATGCATCAGCTGCATTCTGAACAGGAGATGTTGGATAAATGTAA
GTTGAAAATGGACCCTCAATTGTATGGTCCAATACTTTTAAATGATCAAGCAGTCTCATCATTGTATCCTGCTGCTAGTTATGAGCCTCTAATTGGAGAAGGATCTAGAA
TGATCTCTGAACAATTACCAATTACTTCAGAAGGCTGTACAAGTTTGACAGATAATGTTTCTGATGGTGCTAGACGCTCTGAGAGAAACAGCTTAGAACCTAATTTTTCA
TGTGTAGAAGCAGGTGTTCTGCCAGAAAGTCGAATTAATGTTAGAAGGGGAATTCTTAAGCGAAACCCACGAGGATGCAGAGGAATCTGCAATTGTTTGAACTGTTCCTC
TTTTCGTCTTCATGCTGAAAGGGCATTTGAATTTTCCAGAAATCAACTGCAAGATGCTGAAGAAGTTGCTTCAGATTTGATGAAAGAATTGTCATCTCTCCGTGATGTGC
TAGAAAAATGTTCTGATGGTGCAAAAGGGGATGCTGGATATCATTCAAATAAGGTGAAAGAAGCTTGTAGGAAAGCATCTGAAGCAGAGTTAGTTGCAAAAGACCGCCTT
CTACAAATGAACTACGAACTTGACATTCACAGCAGAATAACGTGCTCGCAACGACCAAATGTCAGATTTTCTTGTCAAGTTGAGAAAAGAGAGATTGAAGATGGCAAATA
GAGTAAAATGATGAAGACTACGAATCTGATTACTGTGATAAAAGTTGCAGCCTTCACTCTTCCCGCTTATGTGAGGCTGGCATTGAATGGATGAATTTGGATGGTCTTTC
ATTCCATTTGGGATGCCCGGTTGATTTGTAAAAATGCATTTGCTTCACTCAGTGAAAGGATTGCCTCTTGACGATGAAGGAAAAGGTTTCATTCTGATTGAACTCTAGTA
TGTGGAAAAATGTCATAATTTCTTAGACTCCTTAAATTGAAGGCGTCAATACTTTTTCTTTTTCTTTTTTCTTTTAAAAAAGAGAGTAGCATAATAGTAATGGAGTGGAT
TCTGTATCATGTAGATTATTGGTGGCTTGGTAAGAAAGGGT
Protein sequenceShow/hide protein sequence
MESKRGAMASKRSSIVHQPQALQFGFLHLPRKKPKKLPQPDELASKDGDKVSFYVAKDLRIKRVFSPNLENRSSVPSGGQTSDKEGSITANGTCPNEDSRVGKSSEAEEL
RNENFCNSNGYVKLSDDVPRCNGRSEELVHSTPPDAEVLVGGFVAASSNGCPQSSNGSVLGDACAKADCRIGSVTRTGSVLKPCSKRKLFKAPGSIAYKRLLPFLLDTDN
YTLQGDSCSKRENNLEKKANIESNLCNPAKEPSFIDSDTRVKNAVLASGMTCKTLKLNLPPPDNGDAKIVQNGSVSKNDPTLVKEDSGLKKDNAVCVSSLDKRLIENGGP
TKHQIEDRSSNEQSKTLGMERLDGENSFISSNLSSEVDNLKSYVSEKLCNNGSEDIKRENHFNEELKMSSLDSNIACSLVKEERKDEKVECTRGTDQNLGGSTVGENDCN
ISTESDKKYGSCVRNKVVRNPLVQLNSKYSQVSVSYRRMLPFLEDLFKDNPEICASRNINCLSLEKELPTINLQTPSSNSHNSQDKSEGIVSCNIPCDGDLDTLSMPGLN
IMNETVCETERVLMPTGVDDKLLSSPKLQMHQLHSEQEMLDKCKLKMDPQLYGPILLNDQAVSSLYPAASYEPLIGEGSRMISEQLPITSEGCTSLTDNVSDGARRSERN
SLEPNFSCVEAGVLPESRINVRRGILKRNPRGCRGICNCLNCSSFRLHAERAFEFSRNQLQDAEEVASDLMKELSSLRDVLEKCSDGAKGDAGYHSNKVKEACRKASEAE
LVAKDRLLQMNYELDIHSRITCSQRPNVRFSCQVEKREIEDGK