; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0006469 (gene) of Snake gourd v1 genome

Gene IDTan0006469
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionProtein of unknown function (DUF3511)
Genome locationLG02:75262755..75271392
RNA-Seq ExpressionTan0006469
SyntenyTan0006469
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
InterPro domainsIPR021899 - Protein of unknown function DUF3511


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0064730.1 putative transmembrane protein [Cucumis melo var. makuwa]4.8e-8278.32Show/hide
Query:  MVRSFPTDENPLIHPAITLSLLVAGMAAAIAMITTLCGVRSRRSSPPSEASSPTAAANNKQDENASPTATQEEPVT--NPSAGDSENDGDQTEHPMKELP
        MVRSFPTD+NPLIHPAI+L LL AGMAAAIAMITTLCGVRSRR S PSEASSPTA  NNK +EN SPT T     T  NPS  ++EN+G+     +KELP
Subjt:  MVRSFPTDENPLIHPAITLSLLVAGMAAAIAMITTLCGVRSRRSSPPSEASSPTAAANNKQDENASPTATQEEPVT--NPSAGDSENDGDQTEHPMKELP

Query:  LPPKMQQVANSSPPSQISKSASERKLNHMRSMSIKVPRSLSVVRNHLDEGRQRRKEKAKGEESIWTKTIILGEKCKVSEEEDGVIFEGKGKKISAYHPRG
        LPPKMQQV +SSPP+QI KSASERKLNHMRSMSIKVPRSLSVVRNHLDEGRQRRKEK KGE+SIWTKTIILGEKCKVS+EEDG+I+EGKGKKI+AYHPR 
Subjt:  LPPKMQQVANSSPPSQISKSASERKLNHMRSMSIKVPRSLSVVRNHLDEGRQRRKEKAKGEESIWTKTIILGEKCKVSEEEDGVIFEGKGKKISAYHPRG

Query:  PSSMSI--SRQGSAIDAEALPNPEPK
        PSSMSI  SRQ SA +AEALPN E K
Subjt:  PSSMSI--SRQGSAIDAEALPNPEPK

KAG6585751.1 hypothetical protein SDJN03_18484, partial [Cucurbita argyrosperma subsp. sororia]7.5e-5970.65Show/hide
Query:  LVAGMAAAIAMITTLCGV--RSRRSSPPSEASSPTAAANNKQDENASPTATQEEPVTNPSAGDSENDGDQTEHPMKELPLPPKMQQVANSSPPSQISKSA
        + AGM AAIAMIT LCGV  RSRRSSPPSEASS T  A+ KQD N S T T     T P AG+S NDG +TE  MKELPLPPKMQQVA+ SPPS+I+KSA
Subjt:  LVAGMAAAIAMITTLCGV--RSRRSSPPSEASSPTAAANNKQDENASPTATQEEPVTNPSAGDSENDGDQTEHPMKELPLPPKMQQVANSSPPSQISKSA

Query:  SERKLNHMRSMSIKVPRSLSVVRNHLDEGRQRRKEKAKGEESIWTKTIILGEKCKVSEEEDGVIFEGKGKKISAYHPRGPSSM--SISRQGSAIDAEALP
        SER+L HM S+++KVPRS SV R  L+EG  RRKEK   EE IW KTIILGEKC+VS+EEDGVI+EGKGK+ISAYHPR PSSM  SISRQ SAIDAEALP
Subjt:  SERKLNHMRSMSIKVPRSLSVVRNHLDEGRQRRKEKAKGEESIWTKTIILGEKCKVSEEEDGVIFEGKGKKISAYHPRGPSSM--SISRQGSAIDAEALP

Query:  N
        +
Subjt:  N

KAG6598610.1 hypothetical protein SDJN03_08388, partial [Cucurbita argyrosperma subsp. sororia]2.1e-7774.11Show/hide
Query:  MVRSFPTDENPLIHPAITLSLLVAGMAAAIAMITTLCGVRSRRSSPPSEASSPTAAANNKQDENASPTATQEEPVTNPSAGDSENDGDQTEHPMKELPLP
        MVRSFPTDENPLIHPAI+LSLL AGMAAAIAMITTLCG RSR++   SEASSPT AA+NK DENASPT        NPS G++ NDGD  E  +KELPLP
Subjt:  MVRSFPTDENPLIHPAITLSLLVAGMAAAIAMITTLCGVRSRRSSPPSEASSPTAAANNKQDENASPTATQEEPVTNPSAGDSENDGDQTEHPMKELPLP

Query:  PKMQQVANSSPPSQISKSASERKLNHMRSMSIKVPRSLSVVRNHLDEGRQRRKEKAKGEESIWTKTIILGEKCKVSEEEDGVIFEGKGKKISAYHPRGPS
        PKMQQ+ ++SPP+QISKSASERKL HM+SM++ VPRSLSV R++LD+GR ++K+K KGE+SIWTKTIILGEKCKV++EEDG+I+EGKGKKISAYHPR PS
Subjt:  PKMQQVANSSPPSQISKSASERKLNHMRSMSIKVPRSLSVVRNHLDEGRQRRKEKAKGEESIWTKTIILGEKCKVSEEEDGVIFEGKGKKISAYHPRGPS

Query:  SM--SISRQGSAIDAEALPNPEPK
        SM  S+SRQGSAIDAEALP+PE K
Subjt:  SM--SISRQGSAIDAEALPNPEPK

KGN47811.2 hypothetical protein Csa_018925 [Cucumis sativus]3.1e-8178.12Show/hide
Query:  MVRSFPTDENPLIHPAITLSLLVAGMAAAIAMITTLCGVRSRRSSPPSEASSPTAAANNKQDENASPTATQEEPVTNPSAGDSENDGDQTEHPMKELPLP
        MVRSFPTD+NPLIHPAI L LL AGMAAAI+MITTLCGVRSRR S PSE SS  AAA+NK DEN SPT T      NPS G++EN+G+     MKELPLP
Subjt:  MVRSFPTDENPLIHPAITLSLLVAGMAAAIAMITTLCGVRSRRSSPPSEASSPTAAANNKQDENASPTATQEEPVTNPSAGDSENDGDQTEHPMKELPLP

Query:  PKMQQVANSSPPSQISKSASERKLNHMRSMSIKVPRSLSVVRNHLDEGRQRRKEKAKGEESIWTKTIILGEKCKVSEEEDGVIFEGKGKKISAYHPRGPS
        PKMQQVA++SPPSQISKSASERKLNHMRSMSIKVPRSLSVVRNHLDEGRQRRKE+ KGE+SIWTKTIILGEKCKVS+EEDG+I+EGKGKKI+AYHPR PS
Subjt:  PKMQQVANSSPPSQISKSASERKLNHMRSMSIKVPRSLSVVRNHLDEGRQRRKEKAKGEESIWTKTIILGEKCKVSEEEDGVIFEGKGKKISAYHPRGPS

Query:  SMSI--SRQGSAIDAEALPNPEPK
        SMSI  SRQ SA++AEAL + E K
Subjt:  SMSI--SRQGSAIDAEALPNPEPK

XP_023002698.1 uncharacterized protein LOC111496482 [Cucurbita maxima]1.6e-6971.04Show/hide
Query:  VMVRSFPTDENPLIHPAITLSLLVAGMAAAIAMITTLCGVRSRRSSPPSEASSPTAAANNKQDENASPTATQEEPVTNPSAGDSENDGDQTEHPMKELPL
        VMV SFPTDEN LIHPAI+L+L+ AGM  AIAMIT LCGVRSRRS  PSEASS T  A+ KQD N S T T       P+AGDS +DG +TE  MKELPL
Subjt:  VMVRSFPTDENPLIHPAITLSLLVAGMAAAIAMITTLCGVRSRRSSPPSEASSPTAAANNKQDENASPTATQEEPVTNPSAGDSENDGDQTEHPMKELPL

Query:  PPKMQQVANSSPPSQISKSASERKLNHMRSMSIKVPRSLSVVRNHLDEGRQRRKEKAKGEESIWTKTIILGEKCKVSEEEDGVIFEGKGKKISAYHPRGP
        PPKMQQVA+ SPPSQI+KSASER+L HM  M+++VPRS SV R  L+EG +RRKEK K EE IW KTIILGEKC+VS+EEDGVI+EGKGK+ISAYHPR P
Subjt:  PPKMQQVANSSPPSQISKSASERKLNHMRSMSIKVPRSLSVVRNHLDEGRQRRKEKAKGEESIWTKTIILGEKCKVSEEEDGVIFEGKGKKISAYHPRGP

Query:  SSM--SISRQGSAIDAEALPN
        SSM  SISRQ SAIDAEALP+
Subjt:  SSM--SISRQGSAIDAEALPN

TrEMBL top hitse value%identityAlignment
A0A0A0KD46 Uncharacterized protein4.7e-6777.2Show/hide
Query:  MITTLCGVRSRRSSPPSEASSPTAAANNKQDENASPTATQEEPVTNPSAGDSENDGDQTEHPMKELPLPPKMQQVANSSPPSQISKSASERKLNHMRSMS
        MITTLCGVRSRR S PSE SS  AAA+NK DEN SPT T      NPS G++EN+G+     MKELPLPPKMQQVA++SPPSQISKSASERKLNHMRSMS
Subjt:  MITTLCGVRSRRSSPPSEASSPTAAANNKQDENASPTATQEEPVTNPSAGDSENDGDQTEHPMKELPLPPKMQQVANSSPPSQISKSASERKLNHMRSMS

Query:  IKVPRSLSVVRNHLDEGRQRRKEKAKGEESIWTKTIILGEKCKVSEEEDGVIFEGKGKKISAYHPRGPSSMSI--SRQGSAIDAEALPNPEPK
        IKVPRSLSVVRNHLDEGRQRRKE+ KGE+SIWTKTIILGEKCKVS+EEDG+I+EGKGKKI+AYHPR PSSMSI  SRQ SA++AEAL + E K
Subjt:  IKVPRSLSVVRNHLDEGRQRRKEKAKGEESIWTKTIILGEKCKVSEEEDGVIFEGKGKKISAYHPRGPSSMSI--SRQGSAIDAEALPNPEPK

A0A5A7VC31 Putative transmembrane protein2.3e-8278.32Show/hide
Query:  MVRSFPTDENPLIHPAITLSLLVAGMAAAIAMITTLCGVRSRRSSPPSEASSPTAAANNKQDENASPTATQEEPVT--NPSAGDSENDGDQTEHPMKELP
        MVRSFPTD+NPLIHPAI+L LL AGMAAAIAMITTLCGVRSRR S PSEASSPTA  NNK +EN SPT T     T  NPS  ++EN+G+     +KELP
Subjt:  MVRSFPTDENPLIHPAITLSLLVAGMAAAIAMITTLCGVRSRRSSPPSEASSPTAAANNKQDENASPTATQEEPVT--NPSAGDSENDGDQTEHPMKELP

Query:  LPPKMQQVANSSPPSQISKSASERKLNHMRSMSIKVPRSLSVVRNHLDEGRQRRKEKAKGEESIWTKTIILGEKCKVSEEEDGVIFEGKGKKISAYHPRG
        LPPKMQQV +SSPP+QI KSASERKLNHMRSMSIKVPRSLSVVRNHLDEGRQRRKEK KGE+SIWTKTIILGEKCKVS+EEDG+I+EGKGKKI+AYHPR 
Subjt:  LPPKMQQVANSSPPSQISKSASERKLNHMRSMSIKVPRSLSVVRNHLDEGRQRRKEKAKGEESIWTKTIILGEKCKVSEEEDGVIFEGKGKKISAYHPRG

Query:  PSSMSI--SRQGSAIDAEALPNPEPK
        PSSMSI  SRQ SA +AEALPN E K
Subjt:  PSSMSI--SRQGSAIDAEALPNPEPK

A0A6A1WQ24 Uncharacterized protein3.3e-3646.05Show/hide
Query:  VMVRSFPTDENPLIHPAITLSLLVAGMAAAIAMITTLCGVRS-RRSSPPSEASSPTAAANNKQDENASPT----ATQEEPVTNPSAGDSENDG--DQTEH
        VMVR  PT+E PL HP ++ SLL  G+AAAIA IT LCG R  +R SPPS +S     A      N + T     +Q  P    +A  +E++G     E 
Subjt:  VMVRSFPTDENPLIHPAITLSLLVAGMAAAIAMITTLCGVRS-RRSSPPSEASSPTAAANNKQDENASPT----ATQEEPVTNPSAGDSENDG--DQTEH

Query:  PMKELPLPPKMQQVANSSPPSQISKSASERKLNHMRSMSIKVPRSLSVVRNHLDEGRQRRKEKAKGEESIWTKTIILGEKCKVSEEEDGVIFEGKGKKIS
          +ELPLPP M  +  S    +I+KS+S+R L   +++++K+PRSLS+ R    E   +RK+    E+SIW KTIILGEKCKV +E+D VI++GKGK+IS
Subjt:  PMKELPLPPKMQQVANSSPPSQISKSASERKLNHMRSMSIKVPRSLSVVRNHLDEGRQRRKEKAKGEESIWTKTIILGEKCKVSEEEDGVIFEGKGKKIS

Query:  AYHPRGPSSMSISRQGSAIDAEALPNPE
         YHP+ PS +S+SR+GS  + EA P PE
Subjt:  AYHPRGPSSMSISRQGSAIDAEALPNPE

A0A6J1KUB9 uncharacterized protein LOC1114964827.8e-7071.04Show/hide
Query:  VMVRSFPTDENPLIHPAITLSLLVAGMAAAIAMITTLCGVRSRRSSPPSEASSPTAAANNKQDENASPTATQEEPVTNPSAGDSENDGDQTEHPMKELPL
        VMV SFPTDEN LIHPAI+L+L+ AGM  AIAMIT LCGVRSRRS  PSEASS T  A+ KQD N S T T       P+AGDS +DG +TE  MKELPL
Subjt:  VMVRSFPTDENPLIHPAITLSLLVAGMAAAIAMITTLCGVRSRRSSPPSEASSPTAAANNKQDENASPTATQEEPVTNPSAGDSENDGDQTEHPMKELPL

Query:  PPKMQQVANSSPPSQISKSASERKLNHMRSMSIKVPRSLSVVRNHLDEGRQRRKEKAKGEESIWTKTIILGEKCKVSEEEDGVIFEGKGKKISAYHPRGP
        PPKMQQVA+ SPPSQI+KSASER+L HM  M+++VPRS SV R  L+EG +RRKEK K EE IW KTIILGEKC+VS+EEDGVI+EGKGK+ISAYHPR P
Subjt:  PPKMQQVANSSPPSQISKSASERKLNHMRSMSIKVPRSLSVVRNHLDEGRQRRKEKAKGEESIWTKTIILGEKCKVSEEEDGVIFEGKGKKISAYHPRGP

Query:  SSM--SISRQGSAIDAEALPN
        SSM  SISRQ SAIDAEALP+
Subjt:  SSM--SISRQGSAIDAEALPN

A0A7N2KRH9 Uncharacterized protein5.8e-3345.78Show/hide
Query:  MVRSFPTDENPLIHPAITLSLLVAGMAAAIAMITTLCGVRSRRSSPPSEASSPTAAANNKQDENA--SPTATQEEPVTNPSAGDS---ENDGDQTEHPMK
        M R  P+DEN LIHP  + SLL  GM A IAMIT LCGVRSR+ S P   SSP     NK++EN+  S +A+    +T+  A +    E+     +  +K
Subjt:  MVRSFPTDENPLIHPAITLSLLVAGMAAAIAMITTLCGVRSRRSSPPSEASSPTAAANNKQDENA--SPTATQEEPVTNPSAGDS---ENDGDQTEHPMK

Query:  ELPLPPKMQQVANSSPPSQISKSASERKLNHMRSMSIKVPRSLSVVR----NHLDEGRQRRKEKAKGEESIWTKTIILGEKCKVSEEED-GVIFEGKGKK
        ELPLPP M+ +  S   + I+KS+SER+L    S+S+K+PRSLS  +       +E  Q++K     E+++W KTIILGEKCKV  E+D  VI++GKG +
Subjt:  ELPLPPKMQQVANSSPPSQISKSASERKLNHMRSMSIKVPRSLSVVR----NHLDEGRQRRKEKAKGEESIWTKTIILGEKCKVSEEED-GVIFEGKGKK

Query:  ISAYHPRGPSSMSISRQGSAIDAEA
        ISAYHP+ PS  S+SRQ S  D ++
Subjt:  ISAYHPRGPSSMSISRQGSAIDAEA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G19460.1 Protein of unknown function (DUF3511)1.1e-1254.67Show/hide
Query:  SASSSSYPQTQISSKKGKSTSGSSSSKSWRFAVADPEFQRKKRVASYKMYSVEGKMKGSFRKSFRWLKDKYTQML
        S S + YP T+I   +      SSSS SW F   DP+ QRKKRV SY+ Y+VEGK+KGSFRKSF+W+KDK  ++L
Subjt:  SASSSSYPQTQISSKKGKSTSGSSSSKSWRFAVADPEFQRKKRVASYKMYSVEGKMKGSFRKSFRWLKDKYTQML

AT3G05725.1 Protein of unknown function (DUF3511)7.1e-0745.76Show/hide
Query:  KSTSGSSSSKSWRFAVADPEFQRKKRVASYKMYSVEGKMKGSFRKSFRWLKDKYTQMLY
        +S+S S+  K W     DPE +RK+RVA YK+YS EGKMK + RKS++W+K + +++++
Subjt:  KSTSGSSSSKSWRFAVADPEFQRKKRVASYKMYSVEGKMKGSFRKSFRWLKDKYTQMLY

AT3G13910.1 Protein of unknown function (DUF3511)6.8e-1057.38Show/hide
Query:  QISSKKGKSTSGS--SSSKSWRFAVADPEFQRKKRVASYKMYSVEGKMKGSFRKSFRWLKD
        Q+  KK KS   +  ++S+SW F  +DPE +RK+RVA YK+YSVE KMKGS RKSF+W KD
Subjt:  QISSKKGKSTSGS--SSSKSWRFAVADPEFQRKKRVASYKMYSVEGKMKGSFRKSFRWLKD

AT3G62640.1 Protein of unknown function (DUF3511)7.1e-0736.76Show/hide
Query:  QTQISSKKGKSTSGSSSSKSWRFAVADPEFQRKKRVASYKMYSVEGKMKGSFRKSFRWLKDKYTQMLY
        Q  I    G ++   +++  WR  + D E +RKKR+A+YK Y++EGK+K + +K F W+KD+Y+ +++
Subjt:  QTQISSKKGKSTSGSSSSKSWRFAVADPEFQRKKRVASYKMYSVEGKMKGSFRKSFRWLKDKYTQMLY

AT5G11970.1 Protein of unknown function (DUF3511)8.3e-1653.33Show/hide
Query:  MQIESYYGPQSAS----SSSYPQTQISSKKGKSTSGSSSSKSWRFAVADPEFQRKKRVASYKMYSVEGKMKGSFRKSFRWLKDKYTQMLY
        MQI+ Y+G Q       S+SY   + +    K     + SKSW   + DPE QRKKRVASYKMY VEGK+KGSFR SFRWLK +YTQ++Y
Subjt:  MQIESYYGPQSAS----SSSYPQTQISSKKGKSTSGSSSSKSWRFAVADPEFQRKKRVASYKMYSVEGKMKGSFRKSFRWLKDKYTQMLY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATTTTTGTATGGGTGAACTTGCCCCAATATGGGGCATTTGGGTTCGGGTATTGTTTGTGGCACTCCCTTCTGAGAGTGGAAAAGAATACAATGTTGCAAGTTTTGT
CACAAGTTTGTTTCTGTATTTCGAGTCCAAGCAGGCAGCATGCATTGCACAATTGGCTCTCTGCTGTTACTTTTCATGTATGCAGATTGAAAGCTATTATGGGCCTCAAA
GTGCTTCTTCCTCCTCCTATCCACAAACCCAGATTTCATCCAAGAAAGGCAAAAGCACTTCTGGGTCTTCTTCATCAAAATCATGGAGATTTGCAGTTGCAGACCCTGAG
TTTCAGAGGAAGAAAAGGGTTGCCAGTTACAAGATGTATTCTGTTGAAGGCAAGATGAAAGGTTCCTTTAGAAAGAGCTTCAGATGGCTCAAGGATAAATACACTCAAAT
GCTTTATGTCATGGTGAGAAGCTTTCCGACGGATGAGAATCCGTTAATTCATCCGGCGATTACCCTATCCCTTTTAGTGGCCGGAATGGCGGCGGCCATCGCCATGATCA
CGACGCTATGCGGTGTCCGGTCCCGACGAAGTTCGCCGCCGTCCGAAGCTTCATCTCCAACCGCCGCCGCCAACAACAAACAAGATGAAAACGCTTCTCCAACGGCAACC
CAAGAAGAACCAGTGACAAACCCATCCGCCGGAGATTCAGAAAACGACGGCGACCAGACAGAACATCCGATGAAAGAACTTCCCCTGCCCCCAAAAATGCAACAAGTGGC
GAATTCGAGCCCGCCGAGCCAAATCTCAAAATCGGCTTCGGAGAGAAAATTGAACCACATGAGAAGCATGAGCATCAAAGTTCCGAGGAGCCTTTCGGTTGTGAGGAACC
ATTTGGACGAAGGGCGGCAGAGGAGGAAAGAGAAGGCCAAAGGGGAAGAATCGATTTGGACGAAGACGATAATTTTAGGGGAAAAATGCAAAGTTTCAGAAGAAGAAGAT
GGGGTTATATTCGAAGGAAAAGGGAAGAAGATATCGGCTTATCATCCGAGAGGTCCTAGTTCGATGTCGATTTCTAGGCAAGGTTCCGCCATTGATGCTGAAGCTCTGCC
AAACCCAGAACCAAAATAG
mRNA sequenceShow/hide mRNA sequence
ATGAATTTTTGTATGGGTGAACTTGCCCCAATATGGGGCATTTGGGTTCGGGTATTGTTTGTGGCACTCCCTTCTGAGAGTGGAAAAGAATACAATGTTGCAAGTTTTGT
CACAAGTTTGTTTCTGTATTTCGAGTCCAAGCAGGCAGCATGCATTGCACAATTGGCTCTCTGCTGTTACTTTTCATGTATGCAGATTGAAAGCTATTATGGGCCTCAAA
GTGCTTCTTCCTCCTCCTATCCACAAACCCAGATTTCATCCAAGAAAGGCAAAAGCACTTCTGGGTCTTCTTCATCAAAATCATGGAGATTTGCAGTTGCAGACCCTGAG
TTTCAGAGGAAGAAAAGGGTTGCCAGTTACAAGATGTATTCTGTTGAAGGCAAGATGAAAGGTTCCTTTAGAAAGAGCTTCAGATGGCTCAAGGATAAATACACTCAAAT
GCTTTATGTCATGGTGAGAAGCTTTCCGACGGATGAGAATCCGTTAATTCATCCGGCGATTACCCTATCCCTTTTAGTGGCCGGAATGGCGGCGGCCATCGCCATGATCA
CGACGCTATGCGGTGTCCGGTCCCGACGAAGTTCGCCGCCGTCCGAAGCTTCATCTCCAACCGCCGCCGCCAACAACAAACAAGATGAAAACGCTTCTCCAACGGCAACC
CAAGAAGAACCAGTGACAAACCCATCCGCCGGAGATTCAGAAAACGACGGCGACCAGACAGAACATCCGATGAAAGAACTTCCCCTGCCCCCAAAAATGCAACAAGTGGC
GAATTCGAGCCCGCCGAGCCAAATCTCAAAATCGGCTTCGGAGAGAAAATTGAACCACATGAGAAGCATGAGCATCAAAGTTCCGAGGAGCCTTTCGGTTGTGAGGAACC
ATTTGGACGAAGGGCGGCAGAGGAGGAAAGAGAAGGCCAAAGGGGAAGAATCGATTTGGACGAAGACGATAATTTTAGGGGAAAAATGCAAAGTTTCAGAAGAAGAAGAT
GGGGTTATATTCGAAGGAAAAGGGAAGAAGATATCGGCTTATCATCCGAGAGGTCCTAGTTCGATGTCGATTTCTAGGCAAGGTTCCGCCATTGATGCTGAAGCTCTGCC
AAACCCAGAACCAAAATAG
Protein sequenceShow/hide protein sequence
MNFCMGELAPIWGIWVRVLFVALPSESGKEYNVASFVTSLFLYFESKQAACIAQLALCCYFSCMQIESYYGPQSASSSSYPQTQISSKKGKSTSGSSSSKSWRFAVADPE
FQRKKRVASYKMYSVEGKMKGSFRKSFRWLKDKYTQMLYVMVRSFPTDENPLIHPAITLSLLVAGMAAAIAMITTLCGVRSRRSSPPSEASSPTAAANNKQDENASPTAT
QEEPVTNPSAGDSENDGDQTEHPMKELPLPPKMQQVANSSPPSQISKSASERKLNHMRSMSIKVPRSLSVVRNHLDEGRQRRKEKAKGEESIWTKTIILGEKCKVSEEED
GVIFEGKGKKISAYHPRGPSSMSISRQGSAIDAEALPNPEPK