CuGenDBv2

Tan0003086 (gene) of Snake gourd v1 genome

Gene ID: Tan0003086
Organism: Trichosanthes anguina (Snake gourd v1)
Description: protein HAPLESS 2 isoform X1
Genome location: LG01:105456124..105459247
RNA-Seq Expression: Tan0003086
Synteny: Tan0003086
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA
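
The genome location field follows a scaffold:start..end pattern. A minimal sketch of splitting it into its parts, assuming the coordinates are 1-based and inclusive (the record does not state the convention); `Location` and `parse_location` are hypothetical names used only for illustration:

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    scaffold: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: 1-based, inclusive coordinates (not stated in the record).
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'scaffold:start..end' string such as the Genome location field."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    scaffold, start, end = match.groups()
    return Location(scaffold, int(start), int(end))


loc = parse_location("LG01:105456124..105459247")
print(loc.scaffold, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption, `loc.span` gives the length of the gene region on LG01 in base pairs.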


Homology
GenBank top hits (e value, % identity, alignment)
KAG6608041.1 Protein HAPLESS 2, partial [Cucurbita argyrosperma subsp. sororia]; e value: 2.7e-89; % identity: 76.73
Query:  MLNLSDVMGNVASSLASGFFSVIGKIFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILVYFVLLLLYLLHKIGIFGCIGRGFCRMIWT
        MLNLSD MGNVASSLASG F  + K+FGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVIL YFVLLLLYL HKIGIFGCIGRG CRMIWT
Subjt:  MLNLSDVMGNVASSLASGFFSVIGKIFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILVYFVLLLLYLLHKIGIFGCIGRGFCRMIWT

Query:  CLASYFHAWEYCCTFMCIKLASVKRTRRRRH-RRDLEEEFAS-EEGKHRCETSSDSSNVPKHVESRSSKRVSCRWTRNHRGSQMRKALRPKSHGIRVRSG
        CLASY HAWEYCCTFMCIKLASVKRTRRRRH RRDLEEE  S EE KHR  +SSDSSN  + +ESR SKRVS +  R+HRGSQ  K LRP+SHGIRVRSG
Subjt:  CLASYFHAWEYCCTFMCIKLASVKRTRRRRH-RRDLEEEFAS-EEGKHRCETSSDSSNVPKHVESRSSKRVSCRWTRNHRGSQMRKALRPKSHGIRVRSG

Query:  RVLVYGKHRRKATEVGSHLNEIHSSGIHGSSKFVHKERKYRRGRQ
        RVLVY K                    HGSSK V KERKYRRGRQ
Subjt:  RVLVYGKHRRKATEVGSHLNEIHSSGIHGSSKFVHKERKYRRGRQ

XP_004142706.1 protein HAPLESS 2 isoform X1 [Cucumis sativus]; e value: 8.8e-101; % identity: 82.77
Query:  MGNVASSLASGFFSVIGKIFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILVYFVLLLLYLLHKIGIFGCIGRGFCRMIWTCLASYFH
        MGNVASSLAS  FS IGKIFGSPLDFLSG+SCSSVCGSTWDFICYIENFCVANLLKMGMV IL YFVLLLLYLLHKIGIF CIGRG CRMIWTCLASYF+
Subjt:  MGNVASSLASGFFSVIGKIFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILVYFVLLLLYLLHKIGIFGCIGRGFCRMIWTCLASYFH

Query:  AWEYCCTFMCIKLASVKRTRRRR-HRRDLEEEFASEEGKHRCETSSDSSNVPKHVESRSSKRVSCRWTRNHRGSQMRKALRPKSHGIRVRSGRVLVYGKH
        AWEYCC FMCIKLASVKRTRRR   RRD+EEEF  EEGK R E++SDS+NV +HVES+SS+RVS RW RNHR SQ RK+LRPK HG+RVRSGRVLVYGKH
Subjt:  AWEYCCTFMCIKLASVKRTRRRR-HRRDLEEEFASEEGKHRCETSSDSSNVPKHVESRSSKRVSCRWTRNHRGSQMRKALRPKSHGIRVRSGRVLVYGKH

Query:  RRKATEVGSHLNEIHSSGIHGSSKFVHKERKYRRGRQR
        RRK+ EVG+HLNEI S G++GSSK+VHKERKYRRGR R
Subjt:  RRKATEVGSHLNEIHSSGIHGSSKFVHKERKYRRGRQR

XP_008444189.1 PREDICTED: protein HAPLESS 2 isoform X1 [Cucumis melo]; e value: 3.0e-101; % identity: 83.19
Query:  MGNVASSLASGFFSVIGKIFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILVYFVLLLLYLLHKIGIFGCIGRGFCRMIWTCLASYFH
        MGNVASSLAS  FS IGKIFGSPLDFLSG+SCSSVCGSTWDFICYIENFCVANL+KMGMV IL YFVLLLLYLLHKIGIFGCIGRG CRMIWTCLASYF+
Subjt:  MGNVASSLASGFFSVIGKIFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILVYFVLLLLYLLHKIGIFGCIGRGFCRMIWTCLASYFH

Query:  AWEYCCTFMCIKLASVKRTRRRR-HRRDLEEEFASEEGKHRCETSSDSSNVPKHVESRSSKRVSCRWTRNHRGSQMRKALRPKSHGIRVRSGRVLVYGKH
        AWEYCC+FMCIKLASVKRTRRR   RRDLEEEF SEEGK + E++SDSSNV +HVESRSS+  S RW RNH+ SQ RK+LRPK HG+RVRSGRVLVYGKH
Subjt:  AWEYCCTFMCIKLASVKRTRRRR-HRRDLEEEFASEEGKHRCETSSDSSNVPKHVESRSSKRVSCRWTRNHRGSQMRKALRPKSHGIRVRSGRVLVYGKH

Query:  RRKATEVGSHLNEIHSSGIHGSSKFVHKERKYRRGRQR
        RRK+ EVG+H NEI S G++GSSKFVHKERKYRRGRQR
Subjt:  RRKATEVGSHLNEIHSSGIHGSSKFVHKERKYRRGRQR

XP_022154011.1 uncharacterized protein LOC111021352 [Momordica charantia]; e value: 2.6e-100; % identity: 79.18
Query:  MLNLSDVMGNVASSLASGFFSVIGKIFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILVYFVLLLLYLLHKIGIFGCIGRGFCRMIWT
        M NL D+MGNVASS+ASGFFS +GK+F SPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLK+GMV+IL  FV+LLLYLLHKIGIFGCI RG CRM WT
Subjt:  MLNLSDVMGNVASSLASGFFSVIGKIFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILVYFVLLLLYLLHKIGIFGCIGRGFCRMIWT

Query:  CLASYFHAWEYCCTFMCIKLASVKRTRRRRH-RRDLEEEFASEEGKHRCETSSDSSNVPKHVESRSSKRVSCRWTRNHRGSQMRKALRPKSHGIRVRSGR
        C+ASYF+AW+YCCTFMCIKL SVKRTRRRRH RRDLEEEF SE GKHR  +SSDSS+VP+ +E RSS+R S RW  NHRGSQMRKALRPKS GIRVRSGR
Subjt:  CLASYFHAWEYCCTFMCIKLASVKRTRRRRH-RRDLEEEFASEEGKHRCETSSDSSNVPKHVESRSSKRVSCRWTRNHRGSQMRKALRPKSHGIRVRSGR

Query:  VLVYGKHRRKATEVGSHLNEIHSSGIHGSSKFVHKERKYRRGRQR
         LVYGKHRRK++EV + L EIHS G HGSSKFVH+E +Y+RGRQ+
Subjt:  VLVYGKHRRKATEVGSHLNEIHSSGIHGSSKFVHKERKYRRGRQR

XP_038897053.1 uncharacterized protein LOC120085227 isoform X1 [Benincasa hispida]; e value: 6.1e-102; % identity: 82.35
Query:  MGNVASSLASGFFSVIGKIFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILVYFVLLLLYLLHKIGIFGCIGRGFCRMIWTCLASYFH
        MGNVASSL SG FS IGK+FGSPLDFLSG+SCSSVCGSTWDFICYIENFCV+NLLKMGMV+IL YFVLL L LLHKIGIFGCIGRG C+MIWTCLASYF+
Subjt:  MGNVASSLASGFFSVIGKIFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILVYFVLLLLYLLHKIGIFGCIGRGFCRMIWTCLASYFH

Query:  AWEYCCTFMCIKLASVKRTRRRR-HRRDLEEEFASEEGKHRCETSSDSSNVPKHVESRSSKRVSCRWTRNHRGSQMRKALRPKSHGIRVRSGRVLVYGKH
        AWEYCCTFMCIKLASVKRTRRR   RRDLEEEF SEEGK +  ++SDSSNVP+HVESRSS+R   RW RNHR S+MRK+LRP+ HGIRVRSGRVLVYGKH
Subjt:  AWEYCCTFMCIKLASVKRTRRRR-HRRDLEEEFASEEGKHRCETSSDSSNVPKHVESRSSKRVSCRWTRNHRGSQMRKALRPKSHGIRVRSGRVLVYGKH

Query:  RRKATEVGSHLNEIHSSGIHGSSKFVHKERKYRRGRQR
        RRK+ EVG+HLNEIHS G++GSSKFVHKERKYRR  QR
Subjt:  RRKATEVGSHLNEIHSSGIHGSSKFVHKERKYRRGRQR

TrEMBL top hits (e value, % identity, alignment)
A0A0A0L1V0 Uncharacterized protein; e value: 4.3e-101; % identity: 82.77
Query:  MGNVASSLASGFFSVIGKIFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILVYFVLLLLYLLHKIGIFGCIGRGFCRMIWTCLASYFH
        MGNVASSLAS  FS IGKIFGSPLDFLSG+SCSSVCGSTWDFICYIENFCVANLLKMGMV IL YFVLLLLYLLHKIGIF CIGRG CRMIWTCLASYF+
Subjt:  MGNVASSLASGFFSVIGKIFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILVYFVLLLLYLLHKIGIFGCIGRGFCRMIWTCLASYFH

Query:  AWEYCCTFMCIKLASVKRTRRRR-HRRDLEEEFASEEGKHRCETSSDSSNVPKHVESRSSKRVSCRWTRNHRGSQMRKALRPKSHGIRVRSGRVLVYGKH
        AWEYCC FMCIKLASVKRTRRR   RRD+EEEF  EEGK R E++SDS+NV +HVES+SS+RVS RW RNHR SQ RK+LRPK HG+RVRSGRVLVYGKH
Subjt:  AWEYCCTFMCIKLASVKRTRRRR-HRRDLEEEFASEEGKHRCETSSDSSNVPKHVESRSSKRVSCRWTRNHRGSQMRKALRPKSHGIRVRSGRVLVYGKH

Query:  RRKATEVGSHLNEIHSSGIHGSSKFVHKERKYRRGRQR
        RRK+ EVG+HLNEI S G++GSSK+VHKERKYRRGR R
Subjt:  RRKATEVGSHLNEIHSSGIHGSSKFVHKERKYRRGRQR

A0A1S4DVV8 protein HAPLESS 2 isoform X1; e value: 1.5e-101; % identity: 83.19
Query:  MGNVASSLASGFFSVIGKIFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILVYFVLLLLYLLHKIGIFGCIGRGFCRMIWTCLASYFH
        MGNVASSLAS  FS IGKIFGSPLDFLSG+SCSSVCGSTWDFICYIENFCVANL+KMGMV IL YFVLLLLYLLHKIGIFGCIGRG CRMIWTCLASYF+
Subjt:  MGNVASSLASGFFSVIGKIFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILVYFVLLLLYLLHKIGIFGCIGRGFCRMIWTCLASYFH

Query:  AWEYCCTFMCIKLASVKRTRRRR-HRRDLEEEFASEEGKHRCETSSDSSNVPKHVESRSSKRVSCRWTRNHRGSQMRKALRPKSHGIRVRSGRVLVYGKH
        AWEYCC+FMCIKLASVKRTRRR   RRDLEEEF SEEGK + E++SDSSNV +HVESRSS+  S RW RNH+ SQ RK+LRPK HG+RVRSGRVLVYGKH
Subjt:  AWEYCCTFMCIKLASVKRTRRRR-HRRDLEEEFASEEGKHRCETSSDSSNVPKHVESRSSKRVSCRWTRNHRGSQMRKALRPKSHGIRVRSGRVLVYGKH

Query:  RRKATEVGSHLNEIHSSGIHGSSKFVHKERKYRRGRQR
        RRK+ EVG+H NEI S G++GSSKFVHKERKYRRGRQR
Subjt:  RRKATEVGSHLNEIHSSGIHGSSKFVHKERKYRRGRQR

A0A6J1DMF3 uncharacterized protein LOC111021352; e value: 1.2e-100; % identity: 79.18
Query:  MLNLSDVMGNVASSLASGFFSVIGKIFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILVYFVLLLLYLLHKIGIFGCIGRGFCRMIWT
        M NL D+MGNVASS+ASGFFS +GK+F SPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLK+GMV+IL  FV+LLLYLLHKIGIFGCI RG CRM WT
Subjt:  MLNLSDVMGNVASSLASGFFSVIGKIFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILVYFVLLLLYLLHKIGIFGCIGRGFCRMIWT

Query:  CLASYFHAWEYCCTFMCIKLASVKRTRRRRH-RRDLEEEFASEEGKHRCETSSDSSNVPKHVESRSSKRVSCRWTRNHRGSQMRKALRPKSHGIRVRSGR
        C+ASYF+AW+YCCTFMCIKL SVKRTRRRRH RRDLEEEF SE GKHR  +SSDSS+VP+ +E RSS+R S RW  NHRGSQMRKALRPKS GIRVRSGR
Subjt:  CLASYFHAWEYCCTFMCIKLASVKRTRRRRH-RRDLEEEFASEEGKHRCETSSDSSNVPKHVESRSSKRVSCRWTRNHRGSQMRKALRPKSHGIRVRSGR

Query:  VLVYGKHRRKATEVGSHLNEIHSSGIHGSSKFVHKERKYRRGRQR
         LVYGKHRRK++EV + L EIHS G HGSSKFVH+E +Y+RGRQ+
Subjt:  VLVYGKHRRKATEVGSHLNEIHSSGIHGSSKFVHKERKYRRGRQR

A0A6J1FIX0 uncharacterized protein LOC111445893; e value: 2.4e-88; % identity: 75.82
Query:  MLNLSDVMGNVASSLASGFFSVIGKIFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILVYFVLLLLYLLHKIGIFGCIGRGFCRMIWT
        MLNLSD MGNVASSLASG F  + K+FGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVIL YFVLLLLYL HKIGIFGCIGRG CRMIWT
Subjt:  MLNLSDVMGNVASSLASGFFSVIGKIFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILVYFVLLLLYLLHKIGIFGCIGRGFCRMIWT

Query:  CLASYFHAWEYCCTFMCIKLASVKRTRRRRHRRDLEEEFASEE-GKHRCETSSDSSNVPKHVESRSSKRVSCRWTRNHRGSQMRKALRPKSHGIRVRSGR
        CLASY HAWEYCCTFMCIKLASVKRTRRR  RRDLEEE  SEE  K+R  +SSDSSN  K +ESR SKRVS +  R+HRGSQ  K LRP SHGIRVRSGR
Subjt:  CLASYFHAWEYCCTFMCIKLASVKRTRRRRHRRDLEEEFASEE-GKHRCETSSDSSNVPKHVESRSSKRVSCRWTRNHRGSQMRKALRPKSHGIRVRSGR

Query:  VLVYGKHRRKATEVGSHLNEIHSSGIHGSSKFVHKERKYRRGRQ
        VLVY K                    HGSSK V KER YRRGRQ
Subjt:  VLVYGKHRRKATEVGSHLNEIHSSGIHGSSKFVHKERKYRRGRQ

A0A6J1IVE5 uncharacterized protein LOC111480312; e value: 1.9e-88; % identity: 75.1
Query:  MLNLSDVMGNVASSLASGFFSVIGKIFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILVYFVLLLLYLLHKIGIFGCIGRGFCRMIWT
        MLNLSD MGNVASSLASG F  + K+FGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVIL YFVLLLLYL HKIGIFGCIGRGFCRMIWT
Subjt:  MLNLSDVMGNVASSLASGFFSVIGKIFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILVYFVLLLLYLLHKIGIFGCIGRGFCRMIWT

Query:  CLASYFHAWEYCCTFMCIKLASVKRTRRRRHRRDLEEEFAS-EEGKHRCETSSDSSNVPKHVESRSSKRVSCRWTRNHRGSQMRKALRPKSHGIRVRSGR
        CLASY HAWEYCCTFMC+KLASVKRTRR   RRDLEEE  S EE KHR  +S DSSN  + +ESR S+RVS +  R+HRGSQ  K LRP SHGIRVRSGR
Subjt:  CLASYFHAWEYCCTFMCIKLASVKRTRRRRHRRDLEEEFAS-EEGKHRCETSSDSSNVPKHVESRSSKRVSCRWTRNHRGSQMRKALRPKSHGIRVRSGR

Query:  VLVYGKHRRKATEVGSHLNEIHSSGIHGSSKFVHKERKYRRGRQR
        VLVY K                    HGSSKFV K+RKYRRGRQR
Subjt:  VLVYGKHRRKATEVGSHLNEIHSSGIHGSSKFVHKERKYRRGRQR

SwissProt top hits (e value, % identity, alignment)
F4JP36 Protein HAPLESS 2; e value: 6.1e-04; % identity: 36.49
Query:  GFFSVI----GKIFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILVYFVLLLLYLLHKIGIF
        GFF  I     KI    +DF++G +C + C S +DF C+I+  C++ ++  G+++ L     LLL+LLH+ G+F
Subjt:  GFFSVI----GKIFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILVYFVLLLLYLLHKIGIF
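
The % identity column can be related to the alignment rows: in BLAST-style pairwise output the middle row carries a letter at identical positions, '+' at conservative substitutions, and a space at mismatches or gaps. A minimal sketch, assuming identity is computed as identical positions divided by the gapped alignment length; `percent_identity` is a hypothetical helper. It uses only the query row and the middle row, because the Subjt rows in this record repeat the query:

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent identity of one BLAST-style alignment block.

    Letters in the midline mark identical positions; '+' (conservative
    substitution) and spaces (mismatch or gap) are not counted. The
    alignment length is taken from the gapped query row.
    """
    identical = sum(ch.isalpha() for ch in midline)
    return 100.0 * identical / len(query)


# Query row and middle row copied from the F4JP36 alignment above.
query = "GFFSVI----GKIFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILVYFVLLLLYLLHKIGIF"
midline = "GFF  I     KI    +DF++G +C + C S +DF C+I+  C++ ++  G+++ L     LLL+LLH+ G+F"
print(round(percent_identity(query, midline), 2))
```

The printed value can be compared with the % identity reported for this hit. Alignments that span several blocks would need their rows concatenated before calling the function.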

Arabidopsis top hits (e value, % identity, alignment)
AT1G21722.1 unknown protein; e value: 1.0e-38; % identity: 42.13
Query:  MGNVASSLASGFFSVIGKIFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILVYFVLLLLYLLHKIGIFGCIGRGFCRMIWTCLASYFH
        MGNV  S  +GF   IG  FGSPLDFLSGKSCSSVC S WDFICY+ENFCVANL K  +++IL YF L  +Y+L+K+G + CI  GF +++W  ++ +F+
Subjt:  MGNVASSLASGFFSVIGKIFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILVYFVLLLLYLLHKIGIFGCIGRGFCRMIWTCLASYFH

Query:  AWEYCCTFMCIKLASVKRTRRRRHRRDLEEEFASEEGKHRCETSSDSSNVPKHVESRSSKRVSCRWTRNHRGSQMRKALRPKSHGIRVRSGRVLVYGKHR
           YCC+F C  L   KR RRRRH R +EE++  ++     +   D  +   H     SKR  CR     +  ++RK+LRP++H +RV         K  
Subjt:  AWEYCCTFMCIKLASVKRTRRRRHRRDLEEEFASEEGKHRCETSSDSSNVPKHVESRSSKRVSCRWTRNHRGSQMRKALRPKSHGIRVRSGRVLVYGKHR

Query:  RKATEVGSHL---NEIHSSGIHGSSKFVHKERKYR
        R  + +  H    + IH   +   SKF  K  K R
Subjt:  RKATEVGSHL---NEIHSSGIHGSSKFVHKERKYR

AT1G78922.1 unknown protein; e value: 2.1e-15; % identity: 28.99
Query:  MGNVASSLASGFFSVIGKIFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILVYFVLLLLYLLHKIGIFGCIGRGFCRMIWTCLASYFH
        MGN     ++     IG IF +PL    G+SC  VC   WD  C+IE+FC+ ++ K+ ++  L + +L+ + LL K+GI  C+ +  C+M     A+Y+ 
Subjt:  MGNVASSLASGFFSVIGKIFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILVYFVLLLLYLLHKIGIFGCIGRGFCRMIWTCLASYFH

Query:  AWEYCCTFMCIKLASVKRTRRRRHRRDLEEEFASEEGKHRCETSSDSSNVPKHVES----RSSKRVSCRWTRNHRGSQM--RKALRPKSHGIRVRSGRVL
              + +C  L ++ R  RRR R D   +  +    +  +  S SS+ P  +++    +  +R+  + + +H GS    R+ +R  S  + VR G   
Subjt:  AWEYCCTFMCIKLASVKRTRRRRHRRDLEEEFASEEGKHRCETSSDSSNVPKHVES----RSSKRVSCRWTRNHRGSQM--RKALRPKSHGIRVRSGRVL

Query:  VYGKHRR
          GK RR
Subjt:  VYGKHRR

AT4G11720.1 hapless 2; e value: 4.3e-05; % identity: 36.49
Query:  GFFSVI----GKIFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILVYFVLLLLYLLHKIGIF
        GFF  I     KI    +DF++G +C + C S +DF C+I+  C++ ++  G+++ L     LLL+LLH+ G+F
Subjt:  GFFSVI----GKIFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILVYFVLLLLYLLHKIGIF


Sequences
CDS sequence
ATGCTGAATCTATCCGACGTAATGGGTAATGTAGCCAGTTCATTGGCCTCCGGTTTCTTTTCGGTCATTGGCAAAATCTTCGGATCCCCACTTGATTTTCTCTCTGGAAA
ATCCTGCAGTTCAGTTTGTGGATCAACATGGGATTTCATATGCTACATAGAAAATTTCTGCGTTGCCAATTTGCTAAAGATGGGCATGGTCGTGATCCTTGTATATTTTG
TTCTTTTACTCCTGTATTTACTGCATAAAATTGGGATCTTTGGATGCATCGGTCGGGGGTTCTGCAGAATGATATGGACTTGTTTAGCTTCCTATTTCCATGCATGGGAG
TACTGCTGCACTTTCATGTGTATCAAGCTTGCCAGTGTCAAACGAACAAGAAGACGGCGCCATAGAAGAGACCTGGAAGAAGAGTTTGCAAGTGAAGAAGGAAAACATCG
GTGCGAGACATCGAGTGATTCGAGCAATGTCCCCAAACACGTCGAGTCGAGAAGTAGCAAACGAGTGTCCTGCAGATGGACGAGGAACCACAGAGGTTCTCAAATGAGAA
AGGCATTGAGGCCGAAGAGTCATGGAATTCGAGTAAGGAGTGGTAGAGTGTTGGTCTATGGTAAGCATAGAAGAAAAGCCACTGAGGTTGGGAGTCATTTGAATGAGATC
CATAGCTCTGGAATACATGGATCATCCAAGTTTGTGCATAAAGAAAGAAAGTATAGAAGAGGAAGGCAAAGATGA
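
The CDS can be checked against the protein sequence of the same record by translating it. A minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene; `translate` is a hypothetical helper, and only the first and last codons of the CDS are used here to keep the example short:

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "")
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons and last five codons of the CDS shown above.
print(translate("ATGCTGAATCTATCCGACGTAATG"))
print(translate("GGAAGGCAAAGATGA"))
```

Passing the full CDS, joined into a single string, to `translate` should give a sequence that can be compared with the protein sequence listed in this record.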
mRNA sequence
CTTGATTTGATCATCAACGAGCGCTTTGGCCGACGGTTATCTTTCCAAAATCCGGCGACAGTGGAAATGAGACACCCTCCATTCTTCTTCCATCTGCTGTTCTTCACTTC
GTGGTGTTCTTTAAGCATTCTAATCGATGCAATTGTTTATGGAGTATTTGCATTTTCAAATTTTCATTTGATTCTACAAATTCGATTTAACCAACAAGCTTCTTTGATGG
TTATGTGACCGCAAATCTCATCTGCAAGTGAAAATACTAATGATTGACACTCCATGCCGTCGTTAATTAAACCCTAACCCCTTTATACGATCGTTTCATCTAGTTTAGAT
GGTATTGTTTTCATTTCTTTTCCCATCCAGGCCTTGATTTTTCACTTGTTCATCTTTTTTTGCAGGAGGGTTTTCTTTTTGTTGTGTGTTATTAGTTCTTAGTGAATGCT
GAATCTATCCGACGTAATGGGTAATGTAGCCAGTTCATTGGCCTCCGGTTTCTTTTCGGTCATTGGCAAAATCTTCGGATCCCCACTTGATTTTCTCTCTGGAAAATCCT
GCAGTTCAGTTTGTGGATCAACATGGGATTTCATATGCTACATAGAAAATTTCTGCGTTGCCAATTTGCTAAAGATGGGCATGGTCGTGATCCTTGTATATTTTGTTCTT
TTACTCCTGTATTTACTGCATAAAATTGGGATCTTTGGATGCATCGGTCGGGGGTTCTGCAGAATGATATGGACTTGTTTAGCTTCCTATTTCCATGCATGGGAGTACTG
CTGCACTTTCATGTGTATCAAGCTTGCCAGTGTCAAACGAACAAGAAGACGGCGCCATAGAAGAGACCTGGAAGAAGAGTTTGCAAGTGAAGAAGGAAAACATCGGTGCG
AGACATCGAGTGATTCGAGCAATGTCCCCAAACACGTCGAGTCGAGAAGTAGCAAACGAGTGTCCTGCAGATGGACGAGGAACCACAGAGGTTCTCAAATGAGAAAGGCA
TTGAGGCCGAAGAGTCATGGAATTCGAGTAAGGAGTGGTAGAGTGTTGGTCTATGGTAAGCATAGAAGAAAAGCCACTGAGGTTGGGAGTCATTTGAATGAGATCCATAG
CTCTGGAATACATGGATCATCCAAGTTTGTGCATAAAGAAAGAAAGTATAGAAGAGGAAGGCAAAGATGATAGCAGTTGGCTGGGCTGATAACTGTATATTTTTGCTAGA
AATGTATGAATCTGTTTGTTTTTTTTGAAGGTTTTTGCTTTCTTTGTTTAAGGTTAAATTACAAGTTTAGCTCATAAACATTTAAGTGGGTGTCAAATAGACTTCTGATC
TGCAAAAAGTATCTAATGTCTAATAGGTATACAAATTTTAGAAAGTGTTTAATAAGTTTCTAAGGGTATGTTTGGTAGACAATCTGGATTTTGTTTTATGTTTTCCAGTG
AATATGTGTTTGACAGATGTATTTTCAAATACTGTGATAATAGTAGAAATTATGAAAACAACATTTTTATGTCTTT
Protein sequence
MLNLSDVMGNVASSLASGFFSVIGKIFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILVYFVLLLLYLLHKIGIFGCIGRGFCRMIWTCLASYFHAWE
YCCTFMCIKLASVKRTRRRRHRRDLEEEFASEEGKHRCETSSDSSNVPKHVESRSSKRVSCRWTRNHRGSQMRKALRPKSHGIRVRSGRVLVYGKHRRKATEVGSHLNEI
HSSGIHGSSKFVHKERKYRRGRQR