; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0004121 (gene) of Snake gourd v1 genome

Gene IDTan0004121
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionprotein PHLOEM PROTEIN 2-LIKE A9-like
Genome locationLG05:80404439..80410269
RNA-Seq ExpressionTan0004121
SyntenyTan0004121
Gene Ontology termsNA
InterPro domainsIPR025886 - Phloem protein 2-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7012603.1 Protein PHLOEM PROTEIN 2-LIKE A9 [Cucurbita argyrosperma subsp. argyrosperma]4.7e-5760.77Show/hide
Query:  SDPHYRAASKPLKK---MEDKLIIYPRALDITWGSDPRYWKLPSPDMKEEDSYAELKQVCWLEVTGSTQENLEIGKWYKVGFNVSLRPDAFGWDECNVFI
        S+PHYRA +K +KK    + ++ IYPR L+ITWG+D RYW     +  EEDS+AELKQV WLEVTGST E +E+GKW+KV FNV+LRP+AFGWDE N++I
Subjt:  SDPHYRAASKPLKK---MEDKLIIYPRALDITWGSDPRYWKLPSPDMKEEDSYAELKQVCWLEVTGSTQENLEIGKWYKVGFNVSLRPDAFGWDECNVFI

Query:  MAKIGKRGKFSFKKLSLNDKVAGERFDIPNEELIIIPVTDSFNQSSDDLKLYFGMYDVWSKRWKGGLRIHHAFVQECEPPK
        MAKIGK+G+F FKKL+L+ K   +RF+IP  EL I        +S++D  LYFGMY+VWS RWKGGLRIHHA V+ECE PK
Subjt:  MAKIGKRGKFSFKKLSLNDKVAGERFDIPNEELIIIPVTDSFNQSSDDLKLYFGMYDVWSKRWKGGLRIHHAFVQECEPPK

XP_022954489.1 protein PHLOEM PROTEIN 2-LIKE A9-like isoform X1 [Cucurbita moschata]1.9e-5861.54Show/hide
Query:  SDPHYRAASKPLKK---MEDKLIIYPRALDITWGSDPRYWKLPSPDMKEEDSYAELKQVCWLEVTGSTQENLEIGKWYKVGFNVSLRPDAFGWDECNVFI
        S+PHYRA +K +KK    + ++ IYPR L+ITWG+D RYW     D + EDS+AELKQV WLEVTGST E +E+GKW+KV FNV+LRP+AFGWDE N++I
Subjt:  SDPHYRAASKPLKK---MEDKLIIYPRALDITWGSDPRYWKLPSPDMKEEDSYAELKQVCWLEVTGSTQENLEIGKWYKVGFNVSLRPDAFGWDECNVFI

Query:  MAKIGKRGKFSFKKLSLNDKVAGERFDIPNEELIIIPVTDSFNQSSDDLKLYFGMYDVWSKRWKGGLRIHHAFVQECEPPKK
        MAKIGK+G+FSFKKL+L +K   ERF+IP  EL I        +S++D  LYFGMY+VWS RWKGGLRIHHA V+ECE PK+
Subjt:  MAKIGKRGKFSFKKLSLNDKVAGERFDIPNEELIIIPVTDSFNQSSDDLKLYFGMYDVWSKRWKGGLRIHHAFVQECEPPKK

XP_022954490.1 protein PHLOEM PROTEIN 2-LIKE A9-like isoform X2 [Cucurbita moschata]5.1e-5962.78Show/hide
Query:  SDPHYRAASKPLKK-MEDKLIIYPRALDITWGSDPRYWKLPSPDMKEEDSYAELKQVCWLEVTGSTQENLEIGKWYKVGFNVSLRPDAFGWDECNVFIMA
        S+PHYRA +K +KK    K+ IYPR L+ITWG+D RYW     D + EDS+AELKQV WLEVTGST E +E+GKW+KV FNV+LRP+AFGWDE N++IMA
Subjt:  SDPHYRAASKPLKK-MEDKLIIYPRALDITWGSDPRYWKLPSPDMKEEDSYAELKQVCWLEVTGSTQENLEIGKWYKVGFNVSLRPDAFGWDECNVFIMA

Query:  KIGKRGKFSFKKLSLNDKVAGERFDIPNEELIIIPVTDSFNQSSDDLKLYFGMYDVWSKRWKGGLRIHHAFVQECEPPKK
        KIGK+G+FSFKKL+L +K   ERF+IP  EL I        +S++D  LYFGMY+VWS RWKGGLRIHHA V+ECE PK+
Subjt:  KIGKRGKFSFKKLSLNDKVAGERFDIPNEELIIIPVTDSFNQSSDDLKLYFGMYDVWSKRWKGGLRIHHAFVQECEPPKK

XP_022994899.1 protein PHLOEM PROTEIN 2-LIKE A9-like [Cucurbita maxima]4.1e-6165.56Show/hide
Query:  SDPHYRAASKPLK-KMEDKLIIYPRALDITWGSDPRYWKLPSPDMKEEDSYAELKQVCWLEVTGSTQENLEIGKWYKVGFNVSLRPDAFGWDECNVFIMA
        S+PH+RA S  +K + ++K  IYPR LDITWG+D RYW L   DM+ EDS+AELKQV WLEVTGST +NL+IG W+KV FNV+LRP+AFGWDECNV+IMA
Subjt:  SDPHYRAASKPLK-KMEDKLIIYPRALDITWGSDPRYWKLPSPDMKEEDSYAELKQVCWLEVTGSTQENLEIGKWYKVGFNVSLRPDAFGWDECNVFIMA

Query:  KIGKRGKFSFKKLSLNDKVAGERFDIPNEELIIIPVTDSFNQSSDDLKLYFGMYDVWSKRWKGGLRIHHAFVQECEPPKK
        KIGK+G +SFKKL+LN K    RF+IP+ EL  I V    N S +DLKLYFGMY+VWS RWKGGLRI+HA V+ECE PK+
Subjt:  KIGKRGKFSFKKLSLNDKVAGERFDIPNEELIIIPVTDSFNQSSDDLKLYFGMYDVWSKRWKGGLRIHHAFVQECEPPKK

XP_038896030.1 protein PHLOEM PROTEIN 2-LIKE A9-like [Benincasa hispida]7.3e-5861.14Show/hide
Query:  DPHYRAASKPLKKMED-KLIIYPRALDITWGSDPRYWKLPSPDMKEEDSYAELKQVCWLEVTGSTQENLEIGKWYKVGFNVSLRPDAFGWDECNVFIMAK
        DPH++A S P+K +E+ K  IYPRAL+ITWG D R+W LP  D + EDS+AELKQVCWLEVTGST++++ + + YKVGF +SLRPDAFGWD+C+V+IMAK
Subjt:  DPHYRAASKPLKKMED-KLIIYPRALDITWGSDPRYWKLPSPDMKEEDSYAELKQVCWLEVTGSTQENLEIGKWYKVGFNVSLRPDAFGWDECNVFIMAK

Query:  IGKRGKFSFKKLSLNDKVAGERFDIPNEELIIIPVTDSFNQSSDDLKLYFGMYDVWSKRWKGGLRIHHAFVQECE
        IG+RG +SFKK+SL     G+R  IP+++ +II V  +F   +DDLKLYFG+Y+VW+ RWKGGLRIHHAFV+  E
Subjt:  IGKRGKFSFKKLSLNDKVAGERFDIPNEELIIIPVTDSFNQSSDDLKLYFGMYDVWSKRWKGGLRIHHAFVQECE

TrEMBL top hitse value%identityAlignment
A0A0A0M0X5 Uncharacterized protein4.2e-5156.08Show/hide
Query:  RRAWKPFIIPSDPHYRAASKPLKKMEDKLIIYPRALDITWGSDPRYWKLP----SPDMKEEDSYAELKQVCWLEVTGSTQENLEIGKWYKVGFNVSLRPD
        RR  +P   PSDPH+RA +      ED   IYPRAL+ITWGSD RYW +P    + D ++ED +AELKQVCWLEVTGST  +L   K YKV F VSL PD
Subjt:  RRAWKPFIIPSDPHYRAASKPLKKMEDKLIIYPRALDITWGSDPRYWKLP----SPDMKEEDSYAELKQVCWLEVTGSTQENLEIGKWYKVGFNVSLRPD

Query:  AFGWDECNVFIMAKIGKRGKFSFKKLSL---NDKVAGERFDIPNEELIIIPVTDSFNQSSDDLKLYFGMYDVWSKRWKGGLRIHHAFVQ
        AFGWD+C+V+IMAKIGK+G F F+K++L         E   IP  EL +   T   N ++DDLKLYFG+YDVW+ RWKGGLRIH+A V+
Subjt:  AFGWDECNVFIMAKIGKRGKFSFKKLSL---NDKVAGERFDIPNEELIIIPVTDSFNQSSDDLKLYFGMYDVWSKRWKGGLRIHHAFVQ

A0A6J1CNK5 protein PHLOEM PROTEIN 2-LIKE A9-like isoform X11.3e-4959.2Show/hide
Query:  PHYRAASKPLKKME-DKLIIYPRALDITWGSDPRYWKLPSPDM--KEEDSYAELKQVCWLEVTGSTQENLEIGKWYKVGFNVSLRPDAFGWDECNVFIMA
        PHY+  S    K E +K  +YPR L+ITWG D RYW+LP   +   +EDS AELKQV WLEVTGST +++ IGK YKVGF VSLRPDAFGW+   V+IMA
Subjt:  PHYRAASKPLKKME-DKLIIYPRALDITWGSDPRYWKLPSPDM--KEEDSYAELKQVCWLEVTGSTQENLEIGKWYKVGFNVSLRPDAFGWDECNVFIMA

Query:  KIGKRGKFSFKKLSLNDKVAGE--RFDIPNEELIIIPVTDSFNQSSDDLKLYFGMYDVWSKRWKGGLRIHHAFV
        KIGKRG  SFKK++   K  G    FDIP ++L+I          SDD KLYFGMY+VWSK WKGGL+IHHAFV
Subjt:  KIGKRGKFSFKKLSLNDKVAGE--RFDIPNEELIIIPVTDSFNQSSDDLKLYFGMYDVWSKRWKGGLRIHHAFV

A0A6J1GSJ8 protein PHLOEM PROTEIN 2-LIKE A9-like isoform X22.5e-5962.78Show/hide
Query:  SDPHYRAASKPLKK-MEDKLIIYPRALDITWGSDPRYWKLPSPDMKEEDSYAELKQVCWLEVTGSTQENLEIGKWYKVGFNVSLRPDAFGWDECNVFIMA
        S+PHYRA +K +KK    K+ IYPR L+ITWG+D RYW     D + EDS+AELKQV WLEVTGST E +E+GKW+KV FNV+LRP+AFGWDE N++IMA
Subjt:  SDPHYRAASKPLKK-MEDKLIIYPRALDITWGSDPRYWKLPSPDMKEEDSYAELKQVCWLEVTGSTQENLEIGKWYKVGFNVSLRPDAFGWDECNVFIMA

Query:  KIGKRGKFSFKKLSLNDKVAGERFDIPNEELIIIPVTDSFNQSSDDLKLYFGMYDVWSKRWKGGLRIHHAFVQECEPPKK
        KIGK+G+FSFKKL+L +K   ERF+IP  EL I        +S++D  LYFGMY+VWS RWKGGLRIHHA V+ECE PK+
Subjt:  KIGKRGKFSFKKLSLNDKVAGERFDIPNEELIIIPVTDSFNQSSDDLKLYFGMYDVWSKRWKGGLRIHHAFVQECEPPKK

A0A6J1GT48 protein PHLOEM PROTEIN 2-LIKE A9-like isoform X19.3e-5961.54Show/hide
Query:  SDPHYRAASKPLKK---MEDKLIIYPRALDITWGSDPRYWKLPSPDMKEEDSYAELKQVCWLEVTGSTQENLEIGKWYKVGFNVSLRPDAFGWDECNVFI
        S+PHYRA +K +KK    + ++ IYPR L+ITWG+D RYW     D + EDS+AELKQV WLEVTGST E +E+GKW+KV FNV+LRP+AFGWDE N++I
Subjt:  SDPHYRAASKPLKK---MEDKLIIYPRALDITWGSDPRYWKLPSPDMKEEDSYAELKQVCWLEVTGSTQENLEIGKWYKVGFNVSLRPDAFGWDECNVFI

Query:  MAKIGKRGKFSFKKLSLNDKVAGERFDIPNEELIIIPVTDSFNQSSDDLKLYFGMYDVWSKRWKGGLRIHHAFVQECEPPKK
        MAKIGK+G+FSFKKL+L +K   ERF+IP  EL I        +S++D  LYFGMY+VWS RWKGGLRIHHA V+ECE PK+
Subjt:  MAKIGKRGKFSFKKLSLNDKVAGERFDIPNEELIIIPVTDSFNQSSDDLKLYFGMYDVWSKRWKGGLRIHHAFVQECEPPKK

A0A6J1K456 protein PHLOEM PROTEIN 2-LIKE A9-like2.0e-6165.56Show/hide
Query:  SDPHYRAASKPLK-KMEDKLIIYPRALDITWGSDPRYWKLPSPDMKEEDSYAELKQVCWLEVTGSTQENLEIGKWYKVGFNVSLRPDAFGWDECNVFIMA
        S+PH+RA S  +K + ++K  IYPR LDITWG+D RYW L   DM+ EDS+AELKQV WLEVTGST +NL+IG W+KV FNV+LRP+AFGWDECNV+IMA
Subjt:  SDPHYRAASKPLK-KMEDKLIIYPRALDITWGSDPRYWKLPSPDMKEEDSYAELKQVCWLEVTGSTQENLEIGKWYKVGFNVSLRPDAFGWDECNVFIMA

Query:  KIGKRGKFSFKKLSLNDKVAGERFDIPNEELIIIPVTDSFNQSSDDLKLYFGMYDVWSKRWKGGLRIHHAFVQECEPPKK
        KIGK+G +SFKKL+LN K    RF+IP+ EL  I V    N S +DLKLYFGMY+VWS RWKGGLRI+HA V+ECE PK+
Subjt:  KIGKRGKFSFKKLSLNDKVAGERFDIPNEELIIIPVTDSFNQSSDDLKLYFGMYDVWSKRWKGGLRIHHAFVQECEPPKK

SwissProt top hitse value%identityAlignment
P0DSP5 Lectin4.8e-0429.73Show/hide
Query:  IIYPRALDITWGSDPRYWKLPSPDM-KEEDSYAELKQVCWLEVTGSTQ-ENLEIGKWYKVGFNVSLRPDAFGWD
        +++PRA  +TW  D RYW     D    +   A+L +V W +   +    +L+   WY V   V +   A GW+
Subjt:  IIYPRALDITWGSDPRYWKLPSPDM-KEEDSYAELKQVCWLEVTGSTQ-ENLEIGKWYKVGFNVSLRPDAFGWD

Q3E6P4 F-box protein At2g022408.2e-0440.43Show/hide
Query:  LIIYPRALDITWGSDPRYWKLPSPDMKEEDSYAELKQVCWLEVTGST
        +++  + L ITWGS P YW+  S      +  AEL  VCW E+ G T
Subjt:  LIIYPRALDITWGSDPRYWKLPSPDMKEEDSYAELKQVCWLEVTGST

Q9FHE8 Protein PHLOEM PROTEIN 2-LIKE A66.3e-0427.7Show/hide
Query:  RALDITWGSDPRYW---KLPSPDMKEEDSYAELKQVCWLEVTGS-TQENLEIGKWYKVGFNVSLRPDAFGWDE---CNVFIMAKIGKRGKFSFKKLSLND
        R LDIT    P+ W    +       E   A L +V WL++ G+ T ENL  G  Y+  F V L  +A GW++     + ++   G   +   +  +LND
Subjt:  RALDITWGSDPRYW---KLPSPDMKEEDSYAELKQVCWLEVTGS-TQENLEIGKWYKVGFNVSLRPDAFGWDE---CNVFIMAKIGKRGKFSFKKLSLND

Query:  KVAGERFDIPNEELIIIPVTDSFNQSSDDLKLYFGMYDVWSKRWKGGL
         +     DI     ++ P T           + F MY    K  K GL
Subjt:  KVAGERFDIPNEELIIIPVTDSFNQSSDDLKLYFGMYDVWSKRWKGGL

Q9SA16 Protein PHLOEM PROTEIN 2-LIKE A93.8e-3344.63Show/hide
Query:  HYRAASKPLKKMEDKL-IIYPRALDITWGSDPRYWKLPSPDMKEEDSYAELKQVCWLEVTGSTQENLEIGKWYKVGFNVSLRPDAFGWDECNVFIMAKIG
        H++A SK  +    K  I  P  L+  WG D RYW +P    KE    AELK V WLEVTGS  + +E GK Y++GF +S +PDA GWD+  VF+ AKIG
Subjt:  HYRAASKPLKKMEDKL-IIYPRALDITWGSDPRYWKLPSPDMKEEDSYAELKQVCWLEVTGSTQENLEIGKWYKVGFNVSLRPDAFGWDECNVFIMAKIG

Query:  KRGKFSFKKL-SLND-----KVAGERFDIPNEELIIIPVTDSFNQSSDDLKLYFGMYDVWSKRWKGGLRIHHAFVQE
        K+GK  +K++ S++      K   E  +IP+E   +  +  S    + D KL FG+Y+VW+ RWK GL IH AFVQE
Subjt:  KRGKFSFKKL-SLND-----KVAGERFDIPNEELIIIPVTDSFNQSSDDLKLYFGMYDVWSKRWKGGLRIHHAFVQE

Arabidopsis top hitse value%identityAlignment
AT1G10155.1 phloem protein 2-A102.1e-3142.31Show/hide
Query:  HYRAASKPLKKMEDKLIIY-PRALDITWGSDPRYWKLPSPDMKEEDSYAELKQVCWLEVTGSTQENLEIGKWYKVGFNVSLRPDAFGWDECNVFIMAKIG
        HY A S   + +  K  ++ P  L+  WG D RYW +P+    E+ + AELK+V WLEVTGS  + +E GK Y++GF +S   DA GWD+  VF+ AKIG
Subjt:  HYRAASKPLKKMEDKLIIY-PRALDITWGSDPRYWKLPSPDMKEEDSYAELKQVCWLEVTGSTQENLEIGKWYKVGFNVSLRPDAFGWDECNVFIMAKIG

Query:  KRGKFSFKKL-SLN---DKVAGER--FDIPNE-----ELIIIPVTDSFNQSSDDLKLYFGMYDVWSKRWKGGLRIHHAFVQE
        K+G+  +K++ S+N   DK+ G     +IP+E     E+ + P   + NQ   D KL FG+Y+VW+ +WK GL I+ AFV+E
Subjt:  KRGKFSFKKL-SLN---DKVAGER--FDIPNE-----ELIIIPVTDSFNQSSDDLKLYFGMYDVWSKRWKGGLRIHHAFVQE

AT1G31200.1 phloem protein 2-A92.7e-3444.63Show/hide
Query:  HYRAASKPLKKMEDKL-IIYPRALDITWGSDPRYWKLPSPDMKEEDSYAELKQVCWLEVTGSTQENLEIGKWYKVGFNVSLRPDAFGWDECNVFIMAKIG
        H++A SK  +    K  I  P  L+  WG D RYW +P    KE    AELK V WLEVTGS  + +E GK Y++GF +S +PDA GWD+  VF+ AKIG
Subjt:  HYRAASKPLKKMEDKL-IIYPRALDITWGSDPRYWKLPSPDMKEEDSYAELKQVCWLEVTGSTQENLEIGKWYKVGFNVSLRPDAFGWDECNVFIMAKIG

Query:  KRGKFSFKKL-SLND-----KVAGERFDIPNEELIIIPVTDSFNQSSDDLKLYFGMYDVWSKRWKGGLRIHHAFVQE
        K+GK  +K++ S++      K   E  +IP+E   +  +  S    + D KL FG+Y+VW+ RWK GL IH AFVQE
Subjt:  KRGKFSFKKL-SLND-----KVAGERFDIPNEELIIIPVTDSFNQSSDDLKLYFGMYDVWSKRWKGGLRIHHAFVQE

AT2G02240.1 F-box family protein5.9e-0540.43Show/hide
Query:  LIIYPRALDITWGSDPRYWKLPSPDMKEEDSYAELKQVCWLEVTGST
        +++  + L ITWGS P YW+  S      +  AEL  VCW E+ G T
Subjt:  LIIYPRALDITWGSDPRYWKLPSPDMKEEDSYAELKQVCWLEVTGST

AT5G45080.1 phloem protein 2-A64.5e-0527.7Show/hide
Query:  RALDITWGSDPRYW---KLPSPDMKEEDSYAELKQVCWLEVTGS-TQENLEIGKWYKVGFNVSLRPDAFGWDE---CNVFIMAKIGKRGKFSFKKLSLND
        R LDIT    P+ W    +       E   A L +V WL++ G+ T ENL  G  Y+  F V L  +A GW++     + ++   G   +   +  +LND
Subjt:  RALDITWGSDPRYW---KLPSPDMKEEDSYAELKQVCWLEVTGS-TQENLEIGKWYKVGFNVSLRPDAFGWDE---CNVFIMAKIGKRGKFSFKKLSLND

Query:  KVAGERFDIPNEELIIIPVTDSFNQSSDDLKLYFGMYDVWSKRWKGGL
         +     DI     ++ P T           + F MY    K  K GL
Subjt:  KVAGERFDIPNEELIIIPVTDSFNQSSDDLKLYFGMYDVWSKRWKGGL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATAAGGTTTAGAAGAGCTTGGAAACCCTTCATAATTCCTTCGGACCCTCATTATCGGGCTGCTTCAAAACCATTGAAAAAGATGGAGGATAAGCTCATAATTTATCC
AAGGGCACTAGATATTACCTGGGGTAGCGACCCGCGTTATTGGAAGTTACCAAGTCCCGACATGAAGGAGGAAGACAGTTATGCAGAGCTGAAACAAGTATGCTGGCTGG
AAGTAACTGGTTCAACACAGGAAAATCTTGAGATTGGGAAATGGTACAAAGTGGGTTTCAATGTATCATTAAGACCAGATGCATTTGGATGGGATGAGTGCAACGTTTTC
ATAATGGCTAAGATCGGAAAAAGGGGTAAGTTCTCTTTCAAGAAGCTGAGTCTTAATGACAAAGTTGCAGGTGAAAGATTTGATATTCCCAATGAGGAATTGATAATCAT
CCCAGTGACTGATTCCTTTAACCAATCCTCTGATGATCTCAAGCTCTACTTTGGTATGTATGATGTTTGGAGCAAGAGATGGAAGGGAGGTTTGAGGATTCATCATGCAT
TCGTTCAGGAGTGTGAACCACCCAAAAAGTAA
mRNA sequenceShow/hide mRNA sequence
AAAAAATAAATAAATAAAACCTTTGGCTCTACCCTAAATTACTATCAACAAGGCTCAACAAGAAAAAAAAAAAAAAAAAAAAAAAAAACCTTTTTATCATTTAGAAGAAG
GATATAAATAACTTCATCAAAGGGAGCCAAAGGTCGAAAGGTAATCGAGTCAAATGATAAGGTTTAGAAGAGCTTGGAAACCCTTCATAATTCCTTCGGACCCTCATTAT
CGGGCTGCTTCAAAACCATTGAAAAAGATGGAGGATAAGCTCATAATTTATCCAAGGGCACTAGATATTACCTGGGGTAGCGACCCGCGTTATTGGAAGTTACCAAGTCC
CGACATGAAGGAGGAAGACAGTTATGCAGAGCTGAAACAAGTATGCTGGCTGGAAGTAACTGGTTCAACACAGGAAAATCTTGAGATTGGGAAATGGTACAAAGTGGGTT
TCAATGTATCATTAAGACCAGATGCATTTGGATGGGATGAGTGCAACGTTTTCATAATGGCTAAGATCGGAAAAAGGGGTAAGTTCTCTTTCAAGAAGCTGAGTCTTAAT
GACAAAGTTGCAGGTGAAAGATTTGATATTCCCAATGAGGAATTGATAATCATCCCAGTGACTGATTCCTTTAACCAATCCTCTGATGATCTCAAGCTCTACTTTGGTAT
GTATGATGTTTGGAGCAAGAGATGGAAGGGAGGTTTGAGGATTCATCATGCATTCGTTCAGGAGTGTGAACCACCCAAAAAGTAATTAATGTGATTTGACCATGTTCATC
ACAATTTCAATAGACTCATCCTGTGGTTTCAAAATTTAAAAAAAAAAATAAGAATAATGTTATTATATGTATGTAATGATTATAATTAAAGAGGTACATCACATGAACAT
CTTTTATTTGGAAACTAAAGTTCAACATACTTCAAGGATTGCATTATGCACGAAAAATATATATTCTATTTAATTTGTTGGTATGCTATATATATGTTTGTTTTGGTTGT
TCTCTCTGATTTATCTCTCACCGTTTGAGTGTTGAGTGTTGATAGAATTACATTTACTATAACTCATATGTTTAAGCATTTGAGTTGATTGGTGATATTTAACATGCTAT
ATTAAAGCAGGAAATCTTGTGTTTAAATTCTAGTAATTAATGTCAATTTTTCTTCTAATTAATATTGATTTTCATTCAAGTGAGGAGGAGTTTGTTGGAGTGTTGATATG
ATTAAATTTACCATAACCTGCTTTGATAAAAGTTGGGATAAAAGGCAAGCAGTTAACAAAGTGTGACATAACGACACTCCTTTTCAAATGCAACACCCTTTTTCACGCAG
TAAAAAGAATACTTCAATTTCTCAAGGGCATAATCAACTATGGGGTCATTTTTCAAAAATACTACTCGCTTTTACGGAGTTTTCATACAGAAAATATATTCTCATGCGCT
TTAATGATTGATGATTAAGATACTGAAAATAATGGAAATATGATCAATTGTAACGTGACAATGCGTGTAAAATTATCATGAGAGCAGTATGAGGCGATATTGCTTATTTA
TGCTCTCATACGCGAAAATGTTATTGCTCTAGTGCTCTCGATCTCGCTCAAATGAGTGGTCGAAAATCTAGAGAAAAAAAGAGGCAAGAAAACAAAATATGTTAGAGAAG
AGCTAGAGTGAGAATTAGAAAGCGATTTCATTGTCTCTTTCTTTTCTCCATATATAATTTCAAACCACTCATTCCTATCATAAGCAAGAGCAAGAACTTGAGCGAGAGAG
CTAGAGTAAGATCGAGTATGGTGTCGTCACGATAATTTGGACGGAATGCAAAATATACAAAAAGGGTCATATTTTGGTCTTCAAACATGTTTGAAAAGTAGCTCTTGTGT
CTATAGTAATAGTTTTTGGATTAAGCTTTGAAGTTGAATCTCCTATTGCCTTTTACTAGATCAGATCTCTTTCAAGAGCCATCCTTTGATGATTTTATCAGGATTGGGTT
TACCTCCATGCTTGGCTCTGCTCGCGCTATTCTTTGGTATAATCTTTTTGGAACTACTCTATGGAACATCTGGTTGAAGCGCAACGAAAGGATCTTCAATGATAAATCAC
GTGGAGTACAAACTTTGTGGGAAAATATATTGGTCAATGCTGCTGGTTGGAGTACAAAATCCAAGCTCTTTTGTAATTATGACATGGCGACTATTGCTTTAAATTGGAGA
GCTTTTCTCTAGCTCCCCTTCTTCTTGGTTTGTTTGATGTACTTCAGACTCCTTTTTTTAATATATTTTGGAGCTTGGGGATAGATGATGAGGGTGTTAATTTTGTGCCA
ACATAGTTGGGATGATTGGGTGCACCTACTGACTCCTTGTCTCTAGTATCTTCTTCTAAAAAAAAGCTTTGAAGTTGAATGTGTAACACCCCTAAAATTTCAACGTCAAT
GGTACCGACATTTCCCATTAGCTAACACCTGATAAACAATAAGTCTAAATTCCTAAATGAAAAAAGCTAACGAGAAAAGTTCAATCAGCTA
Protein sequenceShow/hide protein sequence
MIRFRRAWKPFIIPSDPHYRAASKPLKKMEDKLIIYPRALDITWGSDPRYWKLPSPDMKEEDSYAELKQVCWLEVTGSTQENLEIGKWYKVGFNVSLRPDAFGWDECNVF
IMAKIGKRGKFSFKKLSLNDKVAGERFDIPNEELIIIPVTDSFNQSSDDLKLYFGMYDVWSKRWKGGLRIHHAFVQECEPPKK