; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0018578 (gene) of Snake gourd v1 genome

Gene IDTan0018578
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionLEA_2 domain-containing protein
Genome locationLG08:75666208..75666930
RNA-Seq ExpressionTan0018578
SyntenyTan0018578
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6607382.1 hypothetical protein SDJN03_00724, partial [Cucurbita argyrosperma subsp. sororia]6.3e-8369.75Show/hide
Query:  NNDDVAIQLNRVPSQDKGARRVAFSNSLPKHHTTSAHHIKFC-PRLFACCAWICLGVFGIILTLLILGVIFMSFLQSGLPEITVRMLDLSNFEIKNSTNQ
        +N+ VAIQL+RVPS DKGARRVAFS+SLPKH +TS    KF   RLFA CAWICL VFGI +TLLILGVIF+SFLQSGLPEITV+MLDLS  +I+NSTNQ
Subjt:  NNDDVAIQLNRVPSQDKGARRVAFSNSLPKHHTTSAHHIKFC-PRLFACCAWICLGVFGIILTLLILGVIFMSFLQSGLPEITVRMLDLSNFEIKNSTNQ

Query:  NDALLNAKVAMSIEIKNKNDKMELSYSDIAVNLVSEDVKLGKNVIAGFSHNPGNTTLLNVTMNVVGDSTDRENALQLEDDRKRVQMDVQITMEATVGFHL
        N A+LN KV M+I+IKNKN+K+ELSYSD+ + LVSE+++LG+NVI  FS  PGNTT LNVT+NV  DS DR++   LEDDRK+ Q+ V+ITM  +VGFHL
Subjt:  NDALLNAKVAMSIEIKNKNDKMELSYSDIAVNLVSEDVKLGKNVIAGFSHNPGNTTLLNVTMNVVGDSTDRENALQLEDDRKRVQMDVQITMEATVGFHL

Query:  GIFNLNKVPIHVACDFQQFLLLYRTKEPPCNIRMVPPR
        GIF LNKVPIHV C+FQQ+LLLYR KEPPC+I M P R
Subjt:  GIFNLNKVPIHVACDFQQFLLLYRTKEPPCNIRMVPPR

XP_008457557.1 PREDICTED: uncharacterized protein LOC103497223 [Cucumis melo]1.5e-8973.03Show/hide
Query:  NDDVAIQLNRVPSQDKGARRVAFSNSLPKHHT-----TSAHHIKFCPRLFACCAWICLGVFGIILTLLILGVIFMSFLQSGLPEITVRMLDLSNFEIKNS
        N+  A +L+R+PSQ+KG+RRVAFS+SLPKH        S  H K CPRLFACCAWIC+G+FGI++ +LILGVIF+SFLQSGLPEITVRML+LSNFEIKNS
Subjt:  NDDVAIQLNRVPSQDKGARRVAFSNSLPKHHT-----TSAHHIKFCPRLFACCAWICLGVFGIILTLLILGVIFMSFLQSGLPEITVRMLDLSNFEIKNS

Query:  TNQND--ALLNAKVAMSIEIKNKNDKMELSYSDIAVNLVSEDVKLGKNVIAGFSHNPGNTTLLNVTMNVVGDSTDRENALQLEDDRKRVQMDVQITMEAT
        TNQND  ALLNAK+ MSIE++NKN+K+ELSYS I VNLVSEDVKLG++VI  FSH+PGNTT LNVTMNV   STD++N  QLEDDRK+VQMDVQ+ MEA 
Subjt:  TNQND--ALLNAKVAMSIEIKNKNDKMELSYSDIAVNLVSEDVKLGKNVIAGFSHNPGNTTLLNVTMNVVGDSTDRENALQLEDDRKRVQMDVQITMEAT

Query:  VGFHLGIFNLNKVPIHVACDFQQFLLLYRTKEPPCNIRMVP
        VGFH+GIFNL  VPIHVACDFQQ LL+YR  EPPCNIRM P
Subjt:  VGFHLGIFNLNKVPIHVACDFQQFLLLYRTKEPPCNIRMVP

XP_022949169.1 uncharacterized protein LOC111452600 [Cucurbita moschata]1.2e-8168.91Show/hide
Query:  NNDDVAIQLNRVPSQDKGARRVAFSNSLPKHHTTSAHHIKFC-PRLFACCAWICLGVFGIILTLLILGVIFMSFLQSGLPEITVRMLDLSNFEIKNSTNQ
        +N+ VAIQL+RVPS DKGARRVAFS+SLPKH +TS    KF    L A CAWICL VFGI +TLLILGVIF+SFLQSGLPEITV+MLDLS  +I+NSTNQ
Subjt:  NNDDVAIQLNRVPSQDKGARRVAFSNSLPKHHTTSAHHIKFC-PRLFACCAWICLGVFGIILTLLILGVIFMSFLQSGLPEITVRMLDLSNFEIKNSTNQ

Query:  NDALLNAKVAMSIEIKNKNDKMELSYSDIAVNLVSEDVKLGKNVIAGFSHNPGNTTLLNVTMNVVGDSTDRENALQLEDDRKRVQMDVQITMEATVGFHL
        N A+LN KV M+I+IKNKN+K+ELSYSD+ + LVSE+++LG+NVI  FS  PGNTT LNVT+NV  DS DR++   LEDDRK+ Q+ V+ITM  +VGFHL
Subjt:  NDALLNAKVAMSIEIKNKNDKMELSYSDIAVNLVSEDVKLGKNVIAGFSHNPGNTTLLNVTMNVVGDSTDRENALQLEDDRKRVQMDVQITMEATVGFHL

Query:  GIFNLNKVPIHVACDFQQFLLLYRTKEPPCNIRMVPPR
        GIF LNKVPIHV C+FQQ+LLLYR KEPPC+I M P R
Subjt:  GIFNLNKVPIHVACDFQQFLLLYRTKEPPCNIRMVPPR

XP_022998792.1 uncharacterized protein LOC111493353 [Cucurbita maxima]5.0e-8067.65Show/hide
Query:  NNDDVAIQLNRVPSQDKGARRVAFSNSLPKHHTTSAHHIKFC-PRLFACCAWICLGVFGIILTLLILGVIFMSFLQSGLPEITVRMLDLSNFEIKNSTNQ
        +N+ VAIQL+RVPS DKGARRVAFS+SLPKH + S    KF    LFA CAWICL VFGI +TLLILGVIF+SFLQS LPEITV+MLDLS  +I+NSTNQ
Subjt:  NNDDVAIQLNRVPSQDKGARRVAFSNSLPKHHTTSAHHIKFC-PRLFACCAWICLGVFGIILTLLILGVIFMSFLQSGLPEITVRMLDLSNFEIKNSTNQ

Query:  NDALLNAKVAMSIEIKNKNDKMELSYSDIAVNLVSEDVKLGKNVIAGFSHNPGNTTLLNVTMNVVGDSTDRENALQLEDDRKRVQMDVQITMEATVGFHL
        N A+LN KV M+I+I+NKN+K+ELSYSD+ + LVSE+++LG+NVI  FS  PGNTT LNVT+ V  DS DR++   LEDDRK+ Q+ V+ITM  +VGFHL
Subjt:  NDALLNAKVAMSIEIKNKNDKMELSYSDIAVNLVSEDVKLGKNVIAGFSHNPGNTTLLNVTMNVVGDSTDRENALQLEDDRKRVQMDVQITMEATVGFHL

Query:  GIFNLNKVPIHVACDFQQFLLLYRTKEPPCNIRMVPPR
        GIF LNKVPIHV C+FQQ+LLLYR KEPPC+I M P R
Subjt:  GIFNLNKVPIHVACDFQQFLLLYRTKEPPCNIRMVPPR

XP_038895624.1 uncharacterized protein LOC120083816 [Benincasa hispida]4.8e-9172.38Show/hide
Query:  NDDVAIQLNRVPSQDKGARRVAFSNSLPKHHTTSA-----HHIKFCPRLFACCAWICLGVFGIILTLLILGVIFMSFLQSGLPEITVRMLDLSNFEIKNS
        N+  + QL+R+PSQ++G+RRVAFS SLP+H   S      H  K CPRLFACCAWICL VFGI+L +LILGVIFMSFLQSGLP+ITV+ML+LS FE  NS
Subjt:  NDDVAIQLNRVPSQDKGARRVAFSNSLPKHHTTSA-----HHIKFCPRLFACCAWICLGVFGIILTLLILGVIFMSFLQSGLPEITVRMLDLSNFEIKNS

Query:  TNQNDALLNAKVAMSIEIKNKNDKMELSYSDIAVNLVSEDVKLGKNVIAGFSHNPGNTTLLNVTMNVVGDSTDRENALQLEDDRKRVQMDVQITMEATVG
        TNQN+ LLNAKV +SIE++NKNDK+ELSYS+I VNL S+DVKLG++VI GF+H PGNTT  NVTMNVVG STD++N  QLEDDRKRVQM+VQ+TME+TVG
Subjt:  TNQNDALLNAKVAMSIEIKNKNDKMELSYSDIAVNLVSEDVKLGKNVIAGFSHNPGNTTLLNVTMNVVGDSTDRENALQLEDDRKRVQMDVQITMEATVG

Query:  FHLGIFNLNKVPIHVACDFQQFLLLYRTKEPPCNIRMVP
        FH+GIFNLN VPIHVACDF+QFLLLYR  EPPCNIRM P
Subjt:  FHLGIFNLNKVPIHVACDFQQFLLLYRTKEPPCNIRMVP

TrEMBL top hitse value%identityAlignment
A0A1S3C5S1 uncharacterized protein LOC1034972237.4e-9073.03Show/hide
Query:  NDDVAIQLNRVPSQDKGARRVAFSNSLPKHHT-----TSAHHIKFCPRLFACCAWICLGVFGIILTLLILGVIFMSFLQSGLPEITVRMLDLSNFEIKNS
        N+  A +L+R+PSQ+KG+RRVAFS+SLPKH        S  H K CPRLFACCAWIC+G+FGI++ +LILGVIF+SFLQSGLPEITVRML+LSNFEIKNS
Subjt:  NDDVAIQLNRVPSQDKGARRVAFSNSLPKHHT-----TSAHHIKFCPRLFACCAWICLGVFGIILTLLILGVIFMSFLQSGLPEITVRMLDLSNFEIKNS

Query:  TNQND--ALLNAKVAMSIEIKNKNDKMELSYSDIAVNLVSEDVKLGKNVIAGFSHNPGNTTLLNVTMNVVGDSTDRENALQLEDDRKRVQMDVQITMEAT
        TNQND  ALLNAK+ MSIE++NKN+K+ELSYS I VNLVSEDVKLG++VI  FSH+PGNTT LNVTMNV   STD++N  QLEDDRK+VQMDVQ+ MEA 
Subjt:  TNQND--ALLNAKVAMSIEIKNKNDKMELSYSDIAVNLVSEDVKLGKNVIAGFSHNPGNTTLLNVTMNVVGDSTDRENALQLEDDRKRVQMDVQITMEAT

Query:  VGFHLGIFNLNKVPIHVACDFQQFLLLYRTKEPPCNIRMVP
        VGFH+GIFNL  VPIHVACDFQQ LL+YR  EPPCNIRM P
Subjt:  VGFHLGIFNLNKVPIHVACDFQQFLLLYRTKEPPCNIRMVP

A0A5A7V2C7 Putative transmembrane protein7.4e-9073.03Show/hide
Query:  NDDVAIQLNRVPSQDKGARRVAFSNSLPKHHT-----TSAHHIKFCPRLFACCAWICLGVFGIILTLLILGVIFMSFLQSGLPEITVRMLDLSNFEIKNS
        N+  A +L+R+PSQ+KG+RRVAFS+SLPKH        S  H K CPRLFACCAWIC+G+FGI++ +LILGVIF+SFLQSGLPEITVRML+LSNFEIKNS
Subjt:  NDDVAIQLNRVPSQDKGARRVAFSNSLPKHHT-----TSAHHIKFCPRLFACCAWICLGVFGIILTLLILGVIFMSFLQSGLPEITVRMLDLSNFEIKNS

Query:  TNQND--ALLNAKVAMSIEIKNKNDKMELSYSDIAVNLVSEDVKLGKNVIAGFSHNPGNTTLLNVTMNVVGDSTDRENALQLEDDRKRVQMDVQITMEAT
        TNQND  ALLNAK+ MSIE++NKN+K+ELSYS I VNLVSEDVKLG++VI  FSH+PGNTT LNVTMNV   STD++N  QLEDDRK+VQMDVQ+ MEA 
Subjt:  TNQND--ALLNAKVAMSIEIKNKNDKMELSYSDIAVNLVSEDVKLGKNVIAGFSHNPGNTTLLNVTMNVVGDSTDRENALQLEDDRKRVQMDVQITMEAT

Query:  VGFHLGIFNLNKVPIHVACDFQQFLLLYRTKEPPCNIRMVP
        VGFH+GIFNL  VPIHVACDFQQ LL+YR  EPPCNIRM P
Subjt:  VGFHLGIFNLNKVPIHVACDFQQFLLLYRTKEPPCNIRMVP

A0A6J1E0W1 uncharacterized protein LOC1110249231.1e-6962.07Show/hide
Query:  LNRVPSQDKGARRVAFSNSLPKHHTTSAHHIKFCPRLFACCAWICLGVFGIILTLLILGVIFMSFLQSGLPEITVRMLDLSNFEIKNSTNQ--NDALLNA
        +N   ++DKG RRV FS SLP H  TS    K   RLFA C  IC+G FGI+L LLI+ VIFMSFLQSGLPEI+++ L LS FEI +STNQ  N+A+L+A
Subjt:  LNRVPSQDKGARRVAFSNSLPKHHTTSAHHIKFCPRLFACCAWICLGVFGIILTLLILGVIFMSFLQSGLPEITVRMLDLSNFEIKNSTNQ--NDALLNA

Query:  KVAMSIEIKNKNDKMELSYSDIAVNLVSEDVKLGKNVIAGFSHNPGNTTLLNVTMNVVGDSTDRENALQLEDDRKRVQMDVQITMEATVGFHLGIFNLNK
        +V +S+ ++NKNDK+ELSY DI VN+ S+DVKLGK+VI GFSH PGNTT LNVT NVVGD  DRENAL++++++KRV+M  Q+ MEA +GFH GIF++ K
Subjt:  KVAMSIEIKNKNDKMELSYSDIAVNLVSEDVKLGKNVIAGFSHNPGNTTLLNVTMNVVGDSTDRENALQLEDDRKRVQMDVQITMEATVGFHLGIFNLNK

Query:  VPIHVAC-DFQQFLLLYRTKEPPCNIRMVPPR
        VPIHV C D QQFLL+ R KE  CNIRM P R
Subjt:  VPIHVAC-DFQQFLLLYRTKEPPCNIRMVPPR

A0A6J1GC15 uncharacterized protein LOC1114526005.7e-8268.91Show/hide
Query:  NNDDVAIQLNRVPSQDKGARRVAFSNSLPKHHTTSAHHIKFC-PRLFACCAWICLGVFGIILTLLILGVIFMSFLQSGLPEITVRMLDLSNFEIKNSTNQ
        +N+ VAIQL+RVPS DKGARRVAFS+SLPKH +TS    KF    L A CAWICL VFGI +TLLILGVIF+SFLQSGLPEITV+MLDLS  +I+NSTNQ
Subjt:  NNDDVAIQLNRVPSQDKGARRVAFSNSLPKHHTTSAHHIKFC-PRLFACCAWICLGVFGIILTLLILGVIFMSFLQSGLPEITVRMLDLSNFEIKNSTNQ

Query:  NDALLNAKVAMSIEIKNKNDKMELSYSDIAVNLVSEDVKLGKNVIAGFSHNPGNTTLLNVTMNVVGDSTDRENALQLEDDRKRVQMDVQITMEATVGFHL
        N A+LN KV M+I+IKNKN+K+ELSYSD+ + LVSE+++LG+NVI  FS  PGNTT LNVT+NV  DS DR++   LEDDRK+ Q+ V+ITM  +VGFHL
Subjt:  NDALLNAKVAMSIEIKNKNDKMELSYSDIAVNLVSEDVKLGKNVIAGFSHNPGNTTLLNVTMNVVGDSTDRENALQLEDDRKRVQMDVQITMEATVGFHL

Query:  GIFNLNKVPIHVACDFQQFLLLYRTKEPPCNIRMVPPR
        GIF LNKVPIHV C+FQQ+LLLYR KEPPC+I M P R
Subjt:  GIFNLNKVPIHVACDFQQFLLLYRTKEPPCNIRMVPPR

A0A6J1K8Y2 uncharacterized protein LOC1114933532.4e-8067.65Show/hide
Query:  NNDDVAIQLNRVPSQDKGARRVAFSNSLPKHHTTSAHHIKFC-PRLFACCAWICLGVFGIILTLLILGVIFMSFLQSGLPEITVRMLDLSNFEIKNSTNQ
        +N+ VAIQL+RVPS DKGARRVAFS+SLPKH + S    KF    LFA CAWICL VFGI +TLLILGVIF+SFLQS LPEITV+MLDLS  +I+NSTNQ
Subjt:  NNDDVAIQLNRVPSQDKGARRVAFSNSLPKHHTTSAHHIKFC-PRLFACCAWICLGVFGIILTLLILGVIFMSFLQSGLPEITVRMLDLSNFEIKNSTNQ

Query:  NDALLNAKVAMSIEIKNKNDKMELSYSDIAVNLVSEDVKLGKNVIAGFSHNPGNTTLLNVTMNVVGDSTDRENALQLEDDRKRVQMDVQITMEATVGFHL
        N A+LN KV M+I+I+NKN+K+ELSYSD+ + LVSE+++LG+NVI  FS  PGNTT LNVT+ V  DS DR++   LEDDRK+ Q+ V+ITM  +VGFHL
Subjt:  NDALLNAKVAMSIEIKNKNDKMELSYSDIAVNLVSEDVKLGKNVIAGFSHNPGNTTLLNVTMNVVGDSTDRENALQLEDDRKRVQMDVQITMEATVGFHL

Query:  GIFNLNKVPIHVACDFQQFLLLYRTKEPPCNIRMVPPR
        GIF LNKVPIHV C+FQQ+LLLYR KEPPC+I M P R
Subjt:  GIFNLNKVPIHVACDFQQFLLLYRTKEPPCNIRMVPPR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G17620.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family7.0e-0828.65Show/hide
Query:  CCAWICLGVFGIILTLLIL----GVIFMSFLQSGLPEITVRMLDLSNFEIKNSTNQNDALLNAKVAMSIEIKNKNDKMELSY--SDIAVNLVS----EDV
        CC   C  +F IIL LLI+     V+++ + +   P  TV  L +S     ++       L   +++S+  +N N  +   Y  +DI +   S    +DV
Subjt:  CCAWICLGVFGIILTLLIL----GVIFMSFLQSGLPEITVRMLDLSNFEIKNSTNQNDALLNAKVAMSIEIKNKNDKMELSY--SDIAVNLVS----EDV

Query:  KLGKNVIAGFSHNPGNTTLLNVTMNVVGDSTDRENALQLEDDRK-RVQMDVQITMEATVGFHLGIFNLNKVPIHVACD
         +GK  IA FSH   NTT L  T+    D  D  +A +L+ D K +  + ++I + + V   +G     K  I V C+
Subjt:  KLGKNVIAGFSHNPGNTTLLNVTMNVVGDSTDRENALQLEDDRK-RVQMDVQITMEATVGFHLGIFNLNKVPIHVACD

AT2G30505.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family4.6e-1527.42Show/hide
Query:  CCAWICLGVFGIILTLLILGVIFMSFLQSGLPEITVRMLDLSNFEIKNSTNQNDALLNAKVAMSIEIKNKNDKMELSYSDIAVNLVSEDVKLGKNVIAGF
        CCA  C+ V  +++ +L++G+   S ++S LP++ V  L  S  +I  S+   D L+NA +   +++ N NDK  L YS +  ++ SE++ LGK  ++GF
Subjt:  CCAWICLGVFGIILTLLILGVIFMSFLQSGLPEITVRMLDLSNFEIKNSTNQNDALLNAKVAMSIEIKNKNDKMELSYSDIAVNLVSEDVKLGKNVIAGF

Query:  SHNPGNTTLLNVTMNVVGDSTDRENALQLEDDRKRVQMDVQITMEATVGFHLGIFNLNKVPIHVACDFQQFLLLYRTKEPPCNIRM
          +PGN T L +   +        +A  L +  K ++  V + +   +      F ++ +PI +AC+  +   +    +P C++R+
Subjt:  SHNPGNTTLLNVTMNVVGDSTDRENALQLEDDRKRVQMDVQITMEATVGFHLGIFNLNKVPIHVACDFQQFLLLYRTKEPPCNIRM

AT4G01110.1 unknown protein3.4e-1026.4Show/hide
Query:  RLFACCAWICLGVFGIILTLLILGVIFMSFLQSGLPEITVRMLDLSNFEIKNSTNQNDAL--LNAKVAMSIEIKNKNDKMELSYS--DIAVNLVSED--V
        R+F CC  +C+ V  +IL L++   +F  +    LP + +    +SNF         D L  L A+    ++ +N N K+   Y   D+AV++  +D   
Subjt:  RLFACCAWICLGVFGIILTLLILGVIFMSFLQSGLPEITVRMLDLSNFEIKNSTNQNDAL--LNAKVAMSIEIKNKNDKMELSYS--DIAVNLVSED--V

Query:  KLGKNVIAGFSHNPGNTTLLNVTMNVVGDSTDRENALQLEDDRKRVQMDVQITMEATVGFHLGIFNLNKVPIHVACDFQQFLLLYRTKEPPCNIRMV
         LG   + GF   PGN T++ V + V     D     +L  D K  ++ V++  +  VG  +G   +  V + ++C   +   L  +K   C I+M+
Subjt:  KLGKNVIAGFSHNPGNTTLLNVTMNVVGDSTDRENALQLEDDRKRVQMDVQITMEATVGFHLGIFNLNKVPIHVACDFQQFLLLYRTKEPPCNIRMV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATAATAATAATGATGATGTTGCTATTCAACTCAATCGAGTTCCAAGTCAAGACAAAGGAGCGCGTCGGGTTGCCTTCTCCAATTCCCTTCCTAAACACCATACGAC
ATCCGCCCATCACATCAAATTTTGTCCCCGATTGTTTGCTTGTTGCGCATGGATATGTCTTGGGGTGTTCGGAATTATTCTCACCCTGCTCATCCTTGGCGTAATATTTA
TGTCCTTCCTTCAATCAGGATTGCCAGAAATCACCGTAAGAATGTTGGACTTGTCCAATTTTGAGATTAAAAACTCCACAAATCAGAATGATGCTCTCCTGAATGCAAAA
GTAGCGATGTCAATCGAGATAAAGAACAAGAATGACAAAATGGAGTTGAGTTATAGCGATATTGCGGTGAATCTGGTGTCAGAGGACGTGAAATTGGGCAAGAACGTGAT
TGCTGGTTTCTCTCATAATCCTGGAAATACCACATTGTTAAATGTAACGATGAATGTGGTTGGAGATTCCACAGATAGAGAGAATGCATTGCAACTAGAAGATGACAGAA
AAAGGGTGCAAATGGATGTGCAGATCACAATGGAAGCTACAGTTGGTTTTCATCTTGGGATATTCAACCTGAACAAGGTGCCAATCCATGTAGCCTGTGATTTTCAACAA
TTTCTTCTTCTATATCGCACAAAGGAGCCCCCATGTAATATTAGAATGGTTCCCCCCAGGTAA
mRNA sequenceShow/hide mRNA sequence
ATGAATAATAATAATGATGATGTTGCTATTCAACTCAATCGAGTTCCAAGTCAAGACAAAGGAGCGCGTCGGGTTGCCTTCTCCAATTCCCTTCCTAAACACCATACGAC
ATCCGCCCATCACATCAAATTTTGTCCCCGATTGTTTGCTTGTTGCGCATGGATATGTCTTGGGGTGTTCGGAATTATTCTCACCCTGCTCATCCTTGGCGTAATATTTA
TGTCCTTCCTTCAATCAGGATTGCCAGAAATCACCGTAAGAATGTTGGACTTGTCCAATTTTGAGATTAAAAACTCCACAAATCAGAATGATGCTCTCCTGAATGCAAAA
GTAGCGATGTCAATCGAGATAAAGAACAAGAATGACAAAATGGAGTTGAGTTATAGCGATATTGCGGTGAATCTGGTGTCAGAGGACGTGAAATTGGGCAAGAACGTGAT
TGCTGGTTTCTCTCATAATCCTGGAAATACCACATTGTTAAATGTAACGATGAATGTGGTTGGAGATTCCACAGATAGAGAGAATGCATTGCAACTAGAAGATGACAGAA
AAAGGGTGCAAATGGATGTGCAGATCACAATGGAAGCTACAGTTGGTTTTCATCTTGGGATATTCAACCTGAACAAGGTGCCAATCCATGTAGCCTGTGATTTTCAACAA
TTTCTTCTTCTATATCGCACAAAGGAGCCCCCATGTAATATTAGAATGGTTCCCCCCAGGTAA
Protein sequenceShow/hide protein sequence
MNNNNDDVAIQLNRVPSQDKGARRVAFSNSLPKHHTTSAHHIKFCPRLFACCAWICLGVFGIILTLLILGVIFMSFLQSGLPEITVRMLDLSNFEIKNSTNQNDALLNAK
VAMSIEIKNKNDKMELSYSDIAVNLVSEDVKLGKNVIAGFSHNPGNTTLLNVTMNVVGDSTDRENALQLEDDRKRVQMDVQITMEATVGFHLGIFNLNKVPIHVACDFQQ
FLLLYRTKEPPCNIRMVPPR