; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0021741 (gene) of Snake gourd v1 genome

Gene IDTan0021741
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionDUF1985 domain-containing protein
Genome locationLG05:37126891..37132435
RNA-Seq ExpressionTan0021741
SyntenyTan0021741
Gene Ontology termsGO:0048856 - anatomical structure development (biological process)
GO:0016020 - membrane (cellular component)
InterPro domainsIPR015410 - Domain of unknown function DUF1985


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0060374.1 uncharacterized protein E6C27_scaffold22G001730 [Cucumis melo var. makuwa]6.2e-5448.09Show/hide
Query:  SDEHKKAKGKRKMVEETAKSDKTESEESCRKKEGNKDSRKEKEGRVVKFCMRDFQKITGLNCGELPEINMQKLRGRFLNKYFDIEKPIKRTIVSELFHSM
        SDE      KR++ E   K    E+E      E + +     EG + KF +R F+ ITGLNC  LP ++  K++G+FL+KYF  E PI R+ VS LF+  
Subjt:  SDEHKKAKGKRKMVEETAKSDKTESEESCRKKEGNKDSRKEKEGRVVKFCMRDFQKITGLNCGELPEINMQKLRGRFLNKYFDIEKPIKRTIVSELFHSM

Query:  EGVKRKDKVRLAKLYFLSNFLLGKQVGTGVEIDHINLLDDEDQFEKYPWGRIAYNTTLDSIKKVIKNPDAMIVGISGLPQALVVWAYECIPLLSGPSIYI
        + +K +DK+++AK+YFL NFLLGKQ  TG + +HI LLDDE  F+ YPWGRI YN  +DSIKK IKNP A+ VGISG   +L+VW Y+C+PLL  PSI+ 
Subjt:  EGVKRKDKVRLAKLYFLSNFLLGKQVGTGVEIDHINLLDDEDQFEKYPWGRIAYNTTLDSIKKVIKNPDAMIVGISGLPQALVVWAYECIPLLSGPSIYI

Query:  AQMVSSGNPKINNWIAYVHPEWRDLTIKVFEHKDV
        AQ +      I NWI   HPEW++L  +VF H+ V
Subjt:  AQMVSSGNPKINNWIAYVHPEWRDLTIKVFEHKDV

XP_038883715.1 uncharacterized protein LOC120074618 isoform X1 [Benincasa hispida]4.0e-5354.21Show/hide
Query:  EGRVVKFCMRDFQKITGLNCGELPEINMQKL-RGRFLNKYFDIEKPIKRTIVSELFHSMEGVKRKDKVRLAKLYFLSNFLLGKQVGTGVEIDHINLLDDE
        EGR+ KF M++F  ITGLNCGELPEI+M K+ +G+F  +YF  EK IKRT + E+F  M+  + KD V++AKLY L  F+LGKQ+ TG+  ++  L+DD+
Subjt:  EGRVVKFCMRDFQKITGLNCGELPEINMQKL-RGRFLNKYFDIEKPIKRTIVSELFHSMEGVKRKDKVRLAKLYFLSNFLLGKQVGTGVEIDHINLLDDE

Query:  DQFEKYPWGRIAYNTTLDSIKKVIKNPDAMIVGISGLPQALVVWAYECIPLLSGPSIYIAQMVSSGNPKINNWIAYVHPEWRDLTIKVFE
        +QF++YPWGRI+Y  T+D +KK IK+ DA  +GI G   AL+VWAYE IPLL   S ++A  VS G P++NNW A VHPEW+DL+ KVF+
Subjt:  DQFEKYPWGRIAYNTTLDSIKKVIKNPDAMIVGISGLPQALVVWAYECIPLLSGPSIYIAQMVSSGNPKINNWIAYVHPEWRDLTIKVFE

XP_038883717.1 uncharacterized protein LOC120074618 isoform X3 [Benincasa hispida]4.0e-5354.21Show/hide
Query:  EGRVVKFCMRDFQKITGLNCGELPEINMQKL-RGRFLNKYFDIEKPIKRTIVSELFHSMEGVKRKDKVRLAKLYFLSNFLLGKQVGTGVEIDHINLLDDE
        EGR+ KF M++F  ITGLNCGELPEI+M K+ +G+F  +YF  EK IKRT + E+F  M+  + KD V++AKLY L  F+LGKQ+ TG+  ++  L+DD+
Subjt:  EGRVVKFCMRDFQKITGLNCGELPEINMQKL-RGRFLNKYFDIEKPIKRTIVSELFHSMEGVKRKDKVRLAKLYFLSNFLLGKQVGTGVEIDHINLLDDE

Query:  DQFEKYPWGRIAYNTTLDSIKKVIKNPDAMIVGISGLPQALVVWAYECIPLLSGPSIYIAQMVSSGNPKINNWIAYVHPEWRDLTIKVFE
        +QF++YPWGRI+Y  T+D +KK IK+ DA  +GI G   AL+VWAYE IPLL   S ++A  VS G P++NNW A VHPEW+DL+ KVF+
Subjt:  DQFEKYPWGRIAYNTTLDSIKKVIKNPDAMIVGISGLPQALVVWAYECIPLLSGPSIYIAQMVSSGNPKINNWIAYVHPEWRDLTIKVFE

XP_038883718.1 uncharacterized protein LOC120074618 isoform X4 [Benincasa hispida]4.0e-5354.21Show/hide
Query:  EGRVVKFCMRDFQKITGLNCGELPEINMQKL-RGRFLNKYFDIEKPIKRTIVSELFHSMEGVKRKDKVRLAKLYFLSNFLLGKQVGTGVEIDHINLLDDE
        EGR+ KF M++F  ITGLNCGELPEI+M K+ +G+F  +YF  EK IKRT + E+F  M+  + KD V++AKLY L  F+LGKQ+ TG+  ++  L+DD+
Subjt:  EGRVVKFCMRDFQKITGLNCGELPEINMQKL-RGRFLNKYFDIEKPIKRTIVSELFHSMEGVKRKDKVRLAKLYFLSNFLLGKQVGTGVEIDHINLLDDE

Query:  DQFEKYPWGRIAYNTTLDSIKKVIKNPDAMIVGISGLPQALVVWAYECIPLLSGPSIYIAQMVSSGNPKINNWIAYVHPEWRDLTIKVFE
        +QF++YPWGRI+Y  T+D +KK IK+ DA  +GI G   AL+VWAYE IPLL   S ++A  VS G P++NNW A VHPEW+DL+ KVF+
Subjt:  DQFEKYPWGRIAYNTTLDSIKKVIKNPDAMIVGISGLPQALVVWAYECIPLLSGPSIYIAQMVSSGNPKINNWIAYVHPEWRDLTIKVFE

XP_038883719.1 uncharacterized protein LOC120074618 isoform X5 [Benincasa hispida]4.0e-5354.21Show/hide
Query:  EGRVVKFCMRDFQKITGLNCGELPEINMQKL-RGRFLNKYFDIEKPIKRTIVSELFHSMEGVKRKDKVRLAKLYFLSNFLLGKQVGTGVEIDHINLLDDE
        EGR+ KF M++F  ITGLNCGELPEI+M K+ +G+F  +YF  EK IKRT + E+F  M+  + KD V++AKLY L  F+LGKQ+ TG+  ++  L+DD+
Subjt:  EGRVVKFCMRDFQKITGLNCGELPEINMQKL-RGRFLNKYFDIEKPIKRTIVSELFHSMEGVKRKDKVRLAKLYFLSNFLLGKQVGTGVEIDHINLLDDE

Query:  DQFEKYPWGRIAYNTTLDSIKKVIKNPDAMIVGISGLPQALVVWAYECIPLLSGPSIYIAQMVSSGNPKINNWIAYVHPEWRDLTIKVFE
        +QF++YPWGRI+Y  T+D +KK IK+ DA  +GI G   AL+VWAYE IPLL   S ++A  VS G P++NNW A VHPEW+DL+ KVF+
Subjt:  DQFEKYPWGRIAYNTTLDSIKKVIKNPDAMIVGISGLPQALVVWAYECIPLLSGPSIYIAQMVSSGNPKINNWIAYVHPEWRDLTIKVFE

TrEMBL top hitse value%identityAlignment
A0A1S3B0L9 uncharacterized protein LOC103484737 isoform X57.4e-5353.16Show/hide
Query:  EGRVVKFCMRDFQKITGLNCGELPEINMQKL-RGRFLNKYFDIEKPIKRTIVSELFHSMEGVKRKDKVRLAKLYFLSNFLLGKQVGTGVEIDHINLLDDE
        EGR+ KF M+DF  ITGLNCGELP I+M K+ +G+F  +YF  EK I+RT + E+F  M+  + KD V++AKLY L  F+LGKQ+ TG+  ++  L+DD+
Subjt:  EGRVVKFCMRDFQKITGLNCGELPEINMQKL-RGRFLNKYFDIEKPIKRTIVSELFHSMEGVKRKDKVRLAKLYFLSNFLLGKQVGTGVEIDHINLLDDE

Query:  DQFEKYPWGRIAYNTTLDSIKKVIKNPDAMIVGISGLPQALVVWAYECIPLLSGPSIYIAQMVSSGNPKINNWIAYVHPEWRDLTIKVFE
        +QF+ YPWGRI+Y  T+D +KK IK+ DA  +G+ G P AL VWAYE IPLL+  S + A  +S G P++NNW A VHPEW+DL+ KVF+
Subjt:  DQFEKYPWGRIAYNTTLDSIKKVIKNPDAMIVGISGLPQALVVWAYECIPLLSGPSIYIAQMVSSGNPKINNWIAYVHPEWRDLTIKVFE

A0A1S3B181 uncharacterized protein LOC103484737 isoform X77.4e-5353.16Show/hide
Query:  EGRVVKFCMRDFQKITGLNCGELPEINMQKL-RGRFLNKYFDIEKPIKRTIVSELFHSMEGVKRKDKVRLAKLYFLSNFLLGKQVGTGVEIDHINLLDDE
        EGR+ KF M+DF  ITGLNCGELP I+M K+ +G+F  +YF  EK I+RT + E+F  M+  + KD V++AKLY L  F+LGKQ+ TG+  ++  L+DD+
Subjt:  EGRVVKFCMRDFQKITGLNCGELPEINMQKL-RGRFLNKYFDIEKPIKRTIVSELFHSMEGVKRKDKVRLAKLYFLSNFLLGKQVGTGVEIDHINLLDDE

Query:  DQFEKYPWGRIAYNTTLDSIKKVIKNPDAMIVGISGLPQALVVWAYECIPLLSGPSIYIAQMVSSGNPKINNWIAYVHPEWRDLTIKVFE
        +QF+ YPWGRI+Y  T+D +KK IK+ DA  +G+ G P AL VWAYE IPLL+  S + A  +S G P++NNW A VHPEW+DL+ KVF+
Subjt:  DQFEKYPWGRIAYNTTLDSIKKVIKNPDAMIVGISGLPQALVVWAYECIPLLSGPSIYIAQMVSSGNPKINNWIAYVHPEWRDLTIKVFE

A0A5A7UZA2 DUF1985 domain-containing protein3.0e-5448.09Show/hide
Query:  SDEHKKAKGKRKMVEETAKSDKTESEESCRKKEGNKDSRKEKEGRVVKFCMRDFQKITGLNCGELPEINMQKLRGRFLNKYFDIEKPIKRTIVSELFHSM
        SDE      KR++ E   K    E+E      E + +     EG + KF +R F+ ITGLNC  LP ++  K++G+FL+KYF  E PI R+ VS LF+  
Subjt:  SDEHKKAKGKRKMVEETAKSDKTESEESCRKKEGNKDSRKEKEGRVVKFCMRDFQKITGLNCGELPEINMQKLRGRFLNKYFDIEKPIKRTIVSELFHSM

Query:  EGVKRKDKVRLAKLYFLSNFLLGKQVGTGVEIDHINLLDDEDQFEKYPWGRIAYNTTLDSIKKVIKNPDAMIVGISGLPQALVVWAYECIPLLSGPSIYI
        + +K +DK+++AK+YFL NFLLGKQ  TG + +HI LLDDE  F+ YPWGRI YN  +DSIKK IKNP A+ VGISG   +L+VW Y+C+PLL  PSI+ 
Subjt:  EGVKRKDKVRLAKLYFLSNFLLGKQVGTGVEIDHINLLDDEDQFEKYPWGRIAYNTTLDSIKKVIKNPDAMIVGISGLPQALVVWAYECIPLLSGPSIYI

Query:  AQMVSSGNPKINNWIAYVHPEWRDLTIKVFEHKDV
        AQ +      I NWI   HPEW++L  +VF H+ V
Subjt:  AQMVSSGNPKINNWIAYVHPEWRDLTIKVFEHKDV

A0A5D3CNI7 TF-B3 domain-containing protein7.4e-5353.16Show/hide
Query:  EGRVVKFCMRDFQKITGLNCGELPEINMQKL-RGRFLNKYFDIEKPIKRTIVSELFHSMEGVKRKDKVRLAKLYFLSNFLLGKQVGTGVEIDHINLLDDE
        EGR+ KF M+DF  ITGLNCGELP I+M K+ +G+F  +YF  EK I+RT + E+F  M+  + KD V++AKLY L  F+LGKQ+ TG+  ++  L+DD+
Subjt:  EGRVVKFCMRDFQKITGLNCGELPEINMQKL-RGRFLNKYFDIEKPIKRTIVSELFHSMEGVKRKDKVRLAKLYFLSNFLLGKQVGTGVEIDHINLLDDE

Query:  DQFEKYPWGRIAYNTTLDSIKKVIKNPDAMIVGISGLPQALVVWAYECIPLLSGPSIYIAQMVSSGNPKINNWIAYVHPEWRDLTIKVFE
        +QF+ YPWGRI+Y  T+D +KK IK+ DA  +G+ G P AL VWAYE IPLL+  S + A  +S G P++NNW A VHPEW+DL+ KVF+
Subjt:  DQFEKYPWGRIAYNTTLDSIKKVIKNPDAMIVGISGLPQALVVWAYECIPLLSGPSIYIAQMVSSGNPKINNWIAYVHPEWRDLTIKVFE

A0A6J1BX50 uncharacterized protein LOC1110055243.3e-5355.03Show/hide
Query:  EGRVVKFCMRDFQKITGLNCGELPEINMQKL-RGRFLNKYFDIEKPIKRTIVSELFHSMEGVKRKDKVRLAKLYFLSNFLLGKQVGTGVEIDHINLLDDE
        EGRV KF ++DF  ITG+NCGELP I+M K+ R  F  +YF  E+ IKRT + E+F  M+  + KD V++AKLY L  FLLGKQ+ TG+  ++  L+DD+
Subjt:  EGRVVKFCMRDFQKITGLNCGELPEINMQKL-RGRFLNKYFDIEKPIKRTIVSELFHSMEGVKRKDKVRLAKLYFLSNFLLGKQVGTGVEIDHINLLDDE

Query:  DQFEKYPWGRIAYNTTLDSIKKVIKNPDAMIVGISGLPQALVVWAYECIPLLSGPSIYIAQMVSSGNPKINNWIAYVHPEWRDLTIKVF
        +QFE YPWGR++Y  T+D +KK IK+ DA  +GI G P AL+VWAYE IPLLS  S   A  +SSG P++NNW+A VHPEWRDL+ K+F
Subjt:  DQFEKYPWGRIAYNTTLDSIKKVIKNPDAMIVGISGLPQALVVWAYECIPLLSGPSIYIAQMVSSGNPKINNWIAYVHPEWRDLTIKVF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G31150.1 Domain of unknown function (DUF1985)9.3e-0824.39Show/hide
Query:  GRVVKFCMRDFQKITGLNCGELP-EINMQKLR-GRFLNKYFDIEKPIKRTIVSELFHSMEGVKRKD--KVRLAKLYFLSNFLLGKQVGTGVEIDHINLLD
        G  ++F +R+F  +TGL CG+LP E  ++K +  ++L+ +  +    +   + ++   ++  K     K+ LA +  +   ++     + V +D + +L+
Subjt:  GRVVKFCMRDFQKITGLNCGELP-EINMQKLR-GRFLNKYFDIEKPIKRTIVSELFHSMEGVKRKD--KVRLAKLYFLSNFLLGKQVGTGVEIDHINLLD

Query:  DEDQFEKYPWGRIAYNTTL----------DSIKKVIKNPDAMIVGISGLPQALVVWAYECIPLL
        D D F +YPWGR A+  T+          + + K+ K          G P AL +  +E IP++
Subjt:  DEDQFEKYPWGRIAYNTTL----------DSIKKVIKNPDAMIVGISGLPQALVVWAYECIPLL

AT3G31910.1 Domain of unknown function (DUF1985)1.6e-0731.4Show/hide
Query:  KDKVRLAKLYFLSNFLLGKQVGTGVEIDHINLLDDEDQFEKYPWGRIAYNTTLDSIKKVIKNPDAMIVGISGLPQALVVWAYECIP
        K+++ + +L  LS  + G   G+ + +     + D   FE+YPWGR+A+ + ++S+K V  + D+ +  I     ALV+W YE +P
Subjt:  KDKVRLAKLYFLSNFLLGKQVGTGVEIDHINLLDDEDQFEKYPWGRIAYNTTLDSIKKVIKNPDAMIVGISGLPQALVVWAYECIP

AT4G08430.1 Ulp1 protease family protein1.4e-1133.12Show/hide
Query:  RVVKFCMRDFQKITGLNCGELPEINMQKLRGRFLNKYFDIEKPIKRTIVSELFHSMEGVKRKDKV-RLAKLYFLSNFLL------GKQVGTGVEIDHINL
        R ++F + +F+ ITGLNC    E +     G    K F  E  +  + V  LF  +E V    K   L K   +    L      G   G+ V +     
Subjt:  RVVKFCMRDFQKITGLNCGELPEINMQKLRGRFLNKYFDIEKPIKRTIVSELFHSMEGVKRKDKV-RLAKLYFLSNFLL------GKQVGTGVEIDHINL

Query:  LDDEDQFEKYPWGRIAYNTTLDSIKKVIKNPDAMIVGISGLPQALVVWAYECIP
        + D   FEKYPWGR+A+++ L S+K V  + D+ +  I G  QAL+VW YE +P
Subjt:  LDDEDQFEKYPWGRIAYNTTLDSIKKVIKNPDAMIVGISGLPQALVVWAYECIP

AT5G28810.1 Domain of unknown function (DUF1985)1.6e-1230.34Show/hide
Query:  VKFCMRDFQKITGLNCGELPEINMQKLRGRFLNKYFDIEKPIKRTIVSELFHSMEGVKRKDKVRLAKLYFLSNFLLGKQVGTGVEIDHINLLDDEDQFEK
        ++F + +F+ ITGLNC    E + +  R   ++K + +EK                     ++ + +L  LS  + G   G+ V +     + D   FEK
Subjt:  VKFCMRDFQKITGLNCGELPEINMQKLRGRFLNKYFDIEKPIKRTIVSELFHSMEGVKRKDKVRLAKLYFLSNFLLGKQVGTGVEIDHINLLDDEDQFEK

Query:  YPWGRIAYNTTLDSIKKVIKNPDAMIVGISGLPQALVVWAYECIP
        YPWGR+A+++ L S+K V  + D+ +  I G  QAL+VW YE +P
Subjt:  YPWGRIAYNTTLDSIKKVIKNPDAMIVGISGLPQALVVWAYECIP

AT5G45570.1 Ulp1 protease family protein1.4e-1131.17Show/hide
Query:  RVVKFCMRDFQKITGLNCGELPEINMQKLRGRFLNKYFDIEKPIKRTIVSELFHSMEGV-------KRKDKVRLAKLYFLSNFLLGKQVGTGVEIDHINL
        R ++F + +F+ ITGLNC    E +     G    K F  E  +  + V  LF  +E V         + ++ + +L  LS  + G   G+ V +     
Subjt:  RVVKFCMRDFQKITGLNCGELPEINMQKLRGRFLNKYFDIEKPIKRTIVSELFHSMEGV-------KRKDKVRLAKLYFLSNFLLGKQVGTGVEIDHINL

Query:  LDDEDQFEKYPWGRIAYNTTLDSIKKVIKNPDAMIVGISGLPQALVVWAYECIP
        + D   FEKYPWGR+A+++   S+K V  + D+ +  I G  Q L+VW YE +P
Subjt:  LDDEDQFEKYPWGRIAYNTTLDSIKKVIKNPDAMIVGISGLPQALVVWAYECIP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGAGAAAGCAACCCTCATGCCACTACACGCCGGCATAGAGCAGAGCAGGTACGTATCCCACACGCCCTATGGGGAATCGCCCCTAACCCAGCACGGGATCATGGAAA
GCAACCACTCCCGATCTCTACGGGGACACAGGAGGAGAAGAAGAAAAAAAAAACGGGCGTTACATTCACTGTGCATTTCGAAGAAGAAGAAGTCGTCCAACGAAAGGAAG
AAGAGGACGAACCCTATTGCGAACGACACTGGTATTCACCATTCTTTGATCGGTTGTTTGGCAATATGGATCGAGAAATGCAAGCTCCAGAAACTCAAGTTATACCATCA
CAATTACAAGATCCAGATACAACGGGTTCACAAATAGTTTCTTATGGCTACTGTTTTGAGGACGAAGTTCAACTAGATGATAATGATGATGGAGAAATAAACATATCTAA
TGACCAGGTATATATTATATACATATATGATACTGTTGATGAAGAAATAGACATATCTGATGAACATAAAAAAGCAAAGGGAAAGAGAAAGATGGTAGAAGAGACGGCTA
AGAGCGACAAGACAGAGAGTGAAGAAAGTTGTCGTAAGAAAGAAGGAAACAAAGATTCTAGAAAAGAAAAAGAAGGTAGAGTAGTTAAATTCTGCATGAGGGACTTCCAA
AAAATAACTGGTTTAAACTGTGGTGAACTACCAGAGATTAACATGCAGAAATTGAGGGGGAGATTTTTAAACAAATACTTTGATATTGAGAAACCGATCAAACGGACAAT
AGTGAGTGAATTGTTTCATAGTATGGAAGGAGTAAAAAGGAAGGACAAAGTTAGACTAGCGAAATTGTACTTCCTGTCGAACTTCTTACTAGGCAAACAAGTAGGCACTG
GAGTAGAAATAGATCATATCAATCTATTAGATGATGAGGATCAATTTGAAAAATATCCTTGGGGACGTATCGCTTATAACACCACTTTGGACTCAATAAAGAAGGTCATA
AAAAATCCAGATGCAATGATTGTTGGGATTTCTGGATTACCACAAGCTTTAGTTGTGTGGGCATATGAATGTATACCTCTGTTGTCTGGACCGTCTATATACATTGCACA
AATGGTGTCGTCTGGAAATCCTAAAATTAACAACTGGATAGCATATGTACACCCGGAATGGAGGGATTTGACCATCAAAGTGTTTGAGCATAAGGATGTGAGTATAATTG
CTAATTTATTATTTACATTTCTTATCTATTGCAGTAAATAG
mRNA sequenceShow/hide mRNA sequence
ATGAGAGAAAGCAACCCTCATGCCACTACACGCCGGCATAGAGCAGAGCAGGTACGTATCCCACACGCCCTATGGGGAATCGCCCCTAACCCAGCACGGGATCATGGAAA
GCAACCACTCCCGATCTCTACGGGGACACAGGAGGAGAAGAAGAAAAAAAAAACGGGCGTTACATTCACTGTGCATTTCGAAGAAGAAGAAGTCGTCCAACGAAAGGAAG
AAGAGGACGAACCCTATTGCGAACGACACTGGTATTCACCATTCTTTGATCGGTTGTTTGGCAATATGGATCGAGAAATGCAAGCTCCAGAAACTCAAGTTATACCATCA
CAATTACAAGATCCAGATACAACGGGTTCACAAATAGTTTCTTATGGCTACTGTTTTGAGGACGAAGTTCAACTAGATGATAATGATGATGGAGAAATAAACATATCTAA
TGACCAGGTATATATTATATACATATATGATACTGTTGATGAAGAAATAGACATATCTGATGAACATAAAAAAGCAAAGGGAAAGAGAAAGATGGTAGAAGAGACGGCTA
AGAGCGACAAGACAGAGAGTGAAGAAAGTTGTCGTAAGAAAGAAGGAAACAAAGATTCTAGAAAAGAAAAAGAAGGTAGAGTAGTTAAATTCTGCATGAGGGACTTCCAA
AAAATAACTGGTTTAAACTGTGGTGAACTACCAGAGATTAACATGCAGAAATTGAGGGGGAGATTTTTAAACAAATACTTTGATATTGAGAAACCGATCAAACGGACAAT
AGTGAGTGAATTGTTTCATAGTATGGAAGGAGTAAAAAGGAAGGACAAAGTTAGACTAGCGAAATTGTACTTCCTGTCGAACTTCTTACTAGGCAAACAAGTAGGCACTG
GAGTAGAAATAGATCATATCAATCTATTAGATGATGAGGATCAATTTGAAAAATATCCTTGGGGACGTATCGCTTATAACACCACTTTGGACTCAATAAAGAAGGTCATA
AAAAATCCAGATGCAATGATTGTTGGGATTTCTGGATTACCACAAGCTTTAGTTGTGTGGGCATATGAATGTATACCTCTGTTGTCTGGACCGTCTATATACATTGCACA
AATGGTGTCGTCTGGAAATCCTAAAATTAACAACTGGATAGCATATGTACACCCGGAATGGAGGGATTTGACCATCAAAGTGTTTGAGCATAAGGATGTGAGTATAATTG
CTAATTTATTATTTACATTTCTTATCTATTGCAGTAAATAG
Protein sequenceShow/hide protein sequence
MRESNPHATTRRHRAEQVRIPHALWGIAPNPARDHGKQPLPISTGTQEEKKKKKTGVTFTVHFEEEEVVQRKEEEDEPYCERHWYSPFFDRLFGNMDREMQAPETQVIPS
QLQDPDTTGSQIVSYGYCFEDEVQLDDNDDGEINISNDQVYIIYIYDTVDEEIDISDEHKKAKGKRKMVEETAKSDKTESEESCRKKEGNKDSRKEKEGRVVKFCMRDFQ
KITGLNCGELPEINMQKLRGRFLNKYFDIEKPIKRTIVSELFHSMEGVKRKDKVRLAKLYFLSNFLLGKQVGTGVEIDHINLLDDEDQFEKYPWGRIAYNTTLDSIKKVI
KNPDAMIVGISGLPQALVVWAYECIPLLSGPSIYIAQMVSSGNPKINNWIAYVHPEWRDLTIKVFEHKDVSIIANLLFTFLIYCSK