; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0004718 (gene) of Snake gourd v1 genome

Gene IDTan0004718
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionUnknown protein
Genome locationLG05:74094880..74096345
RNA-Seq ExpressionTan0004718
SyntenyTan0004718
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6572935.1 hypothetical protein SDJN03_26822, partial [Cucurbita argyrosperma subsp. sororia]2.4e-9582.25Show/hide
Query:  MEIYTDNRRRVRDEFDDSLVDSAESKLRRLNSAESRFVKPCTKGNWNVVNSSKSEQVGSDGDDLRIDLAESEEIQDELLNILEDGDAVTERDESIQGLEL
        ME  TDNR+R RDE D SL DSA SKLRRLNS ESRFVKPC          SKSEQVGSDG DLRIDLA+S+EIQD+LLNILED D V ERDESIQG EL
Subjt:  MEIYTDNRRRVRDEFDDSLVDSAESKLRRLNSAESRFVKPCTKGNWNVVNSSKSEQVGSDGDDLRIDLAESEEIQDELLNILEDGDAVTERDESIQGLEL

Query:  DSFIRSFEEEIQALPPAKTTSDRNETPQAELGYLFEASDDELGLPPTGGSSTEGKMEAIDFMPACSSAGVFELDGNLGFEDEIPCYDSFEIGIGIGSGAA
        DSFIRSFEEEI ALPPA+T+S++NETPQAELGYLFEASDDELGLPPT GSS EGK+EAIDF PACS  GVFE+DGN+GFEDEIPCYDSFEIGIGIGSGAA
Subjt:  DSFIRSFEEEIQALPPAKTTSDRNETPQAELGYLFEASDDELGLPPTGGSSTEGKMEAIDFMPACSSAGVFELDGNLGFEDEIPCYDSFEIGIGIGSGAA

Query:  EENCLGGEFVALGGLFDYSDVPFRPESLSAL
        EEN LGGEFVALGGLFDYSDVPFRPESLSAL
Subjt:  EENCLGGEFVALGGLFDYSDVPFRPESLSAL

KAG6584364.1 hypothetical protein SDJN03_20296, partial [Cucurbita argyrosperma subsp. sororia]6.4e-8574.68Show/hide
Query:  MEIYTDNRRRVRDEFDDSLVDSAESKLRRLNSAESRFVKPCTKGNWNVVNSSKSEQVGSDGDDLRIDLAESEEIQDELLNILEDGDAVTERDESIQGLEL
        ME  TDN++R+  EFDDSL DSAESKLRRL+S++   + PCTKGNWNVV   +S        D  I+L+ES +IQD+LLNILED D V ERDESI+GLEL
Subjt:  MEIYTDNRRRVRDEFDDSLVDSAESKLRRLNSAESRFVKPCTKGNWNVVNSSKSEQVGSDGDDLRIDLAESEEIQDELLNILEDGDAVTERDESIQGLEL

Query:  DSFIRSFEEEIQALPPAKTTSDRNETPQAELGYLFEASDDELGLPPTGGSSTEGK-MEAIDFMPACSS-AGVFELDGNLGFEDEIPCYDSFEIGIGIGSG
        DSFIRSFEEEIQALP  KT S +NETPQ ELGYL+ ASDDELGLPPTGG ST+GK MEAIDFMPA SS  GVFELDGN GFEDEIPCYD FEIG+G  SG
Subjt:  DSFIRSFEEEIQALPPAKTTSDRNETPQAELGYLFEASDDELGLPPTGGSSTEGK-MEAIDFMPACSS-AGVFELDGNLGFEDEIPCYDSFEIGIGIGSG

Query:  AAEENCLGGEFVALGGLFDYSDVPFRPESLSAL
        AAEEN LGGEFVALGGLFDYSDVPFRPESLSAL
Subjt:  AAEENCLGGEFVALGGLFDYSDVPFRPESLSAL

XP_022954576.1 uncharacterized protein LOC111456805 [Cucurbita moschata]1.6e-9682.68Show/hide
Query:  MEIYTDNRRRVRDEFDDSLVDSAESKLRRLNSAESRFVKPCTKGNWNVVNSSKSEQVGSDGDDLRIDLAESEEIQDELLNILEDGDAVTERDESIQGLEL
        ME  TDNR+R RDE D SL DSAESKLRRLNS ESRFVKPC          SKSEQVGSDG DLRIDLA+S+EIQD+LLNILED D V ERDESIQG EL
Subjt:  MEIYTDNRRRVRDEFDDSLVDSAESKLRRLNSAESRFVKPCTKGNWNVVNSSKSEQVGSDGDDLRIDLAESEEIQDELLNILEDGDAVTERDESIQGLEL

Query:  DSFIRSFEEEIQALPPAKTTSDRNETPQAELGYLFEASDDELGLPPTGGSSTEGKMEAIDFMPACSSAGVFELDGNLGFEDEIPCYDSFEIGIGIGSGAA
        DSFIRSFEEEI ALPPA+T+S++NETPQAELGYLFEASDDELGLPPT GSS EGK+EAIDF PACS  GVFE+DGN+GFEDEIPCYDSFEIGIGIGSGAA
Subjt:  DSFIRSFEEEIQALPPAKTTSDRNETPQAELGYLFEASDDELGLPPTGGSSTEGKMEAIDFMPACSSAGVFELDGNLGFEDEIPCYDSFEIGIGIGSGAA

Query:  EENCLGGEFVALGGLFDYSDVPFRPESLSAL
        EEN LGGEFVALGGLFDYSDVPFRPESLSAL
Subjt:  EENCLGGEFVALGGLFDYSDVPFRPESLSAL

XP_022994085.1 uncharacterized protein LOC111489916 [Cucurbita maxima]1.5e-9480.95Show/hide
Query:  MEIYTDNRRRVRDEFDDSLVDSAESKLRRLNSAESRFVKPCTKGNWNVVNSSKSEQVGSDGDDLRIDLAESEEIQDELLNILEDGDAVTERDESIQGLEL
        ME  TDNR+R RD+ D SL DSAESKLRRLN  ESRFVKPC          SKSEQVGSDGDDLRIDLA+S+EI D+LLNILED D V ERDE IQG EL
Subjt:  MEIYTDNRRRVRDEFDDSLVDSAESKLRRLNSAESRFVKPCTKGNWNVVNSSKSEQVGSDGDDLRIDLAESEEIQDELLNILEDGDAVTERDESIQGLEL

Query:  DSFIRSFEEEIQALPPAKTTSDRNETPQAELGYLFEASDDELGLPPTGGSSTEGKMEAIDFMPACSSAGVFELDGNLGFEDEIPCYDSFEIGIGIGSGAA
        DSFIRSFEEEI ALPPA+T+S++NETPQAELGYLFEASDDELGLPPT GSS EG +EAIDF PACS  GVFE+DGN+GFEDEIPCYDSFEIGIGIGSGAA
Subjt:  DSFIRSFEEEIQALPPAKTTSDRNETPQAELGYLFEASDDELGLPPTGGSSTEGKMEAIDFMPACSSAGVFELDGNLGFEDEIPCYDSFEIGIGIGSGAA

Query:  EENCLGGEFVALGGLFDYSDVPFRPESLSAL
        EEN LGGEFVALGGLFDYSDVPFRPESLSAL
Subjt:  EENCLGGEFVALGGLFDYSDVPFRPESLSAL

XP_023541956.1 uncharacterized protein LOC111801944 [Cucurbita pepo subsp. pepo]6.2e-9682.25Show/hide
Query:  MEIYTDNRRRVRDEFDDSLVDSAESKLRRLNSAESRFVKPCTKGNWNVVNSSKSEQVGSDGDDLRIDLAESEEIQDELLNILEDGDAVTERDESIQGLEL
        ME  TDNR+R RDE D SL DSAESKLRRL+S ESRFVKPC          SKSE VGSDGDDLRIDLA+S+EIQD+LLNILED D V ERDESIQG EL
Subjt:  MEIYTDNRRRVRDEFDDSLVDSAESKLRRLNSAESRFVKPCTKGNWNVVNSSKSEQVGSDGDDLRIDLAESEEIQDELLNILEDGDAVTERDESIQGLEL

Query:  DSFIRSFEEEIQALPPAKTTSDRNETPQAELGYLFEASDDELGLPPTGGSSTEGKMEAIDFMPACSSAGVFELDGNLGFEDEIPCYDSFEIGIGIGSGAA
        DSFIRSFEEEI ALPPA+T SD+NETPQAELGYLFEASDDELGLPPT GSS EGK+EAIDF PACS  GVFE+DGN+GFEDEIPCYDSFEIGIGIGSG A
Subjt:  DSFIRSFEEEIQALPPAKTTSDRNETPQAELGYLFEASDDELGLPPTGGSSTEGKMEAIDFMPACSSAGVFELDGNLGFEDEIPCYDSFEIGIGIGSGAA

Query:  EENCLGGEFVALGGLFDYSDVPFRPESLSAL
        EEN LGGEFVALGGLFDYSDVPFRPESLSAL
Subjt:  EENCLGGEFVALGGLFDYSDVPFRPESLSAL

TrEMBL top hitse value%identityAlignment
A0A0A0LQD5 Uncharacterized protein1.3e-7068.44Show/hide
Query:  MEIYTDNRRRVRDEF-DDSLVDSAESKLRRLNSAESRFVKPCTKGNWNVVNSSKSEQVGSDGDDLRI-DLAESEEIQDE-LLNILEDGDAVTERDES-IQ
        M+ ++++R+RV DE  DDSL DSAESKLRRLNS++ R  KPCTK ++NVV  S      +   DL I DL ESEEIQDE LLNILED D V ERDES I+
Subjt:  MEIYTDNRRRVRDEF-DDSLVDSAESKLRRLNSAESRFVKPCTKGNWNVVNSSKSEQVGSDGDDLRI-DLAESEEIQDE-LLNILEDGDAVTERDES-IQ

Query:  GLELDSFIRSFEEEIQALPPA-KTTSDRNETPQAELGYLFEASDDELGLPPTGG--SSTEG-KMEAIDFMPACSSAG--VFELDGNLGFEDEIPCYDSFE
        GLELDSFI+SFEEEIQ +P +    ++ NETPQAELGYLF ASDDELGLPP+GG  S+TEG KMEAIDFMP  SS    VFEL+G LGF+D+IPCYDSFE
Subjt:  GLELDSFIRSFEEEIQALPPA-KTTSDRNETPQAELGYLFEASDDELGLPPTGG--SSTEG-KMEAIDFMPACSSAG--VFELDGNLGFEDEIPCYDSFE

Query:  IGIGIGSGA--AEENCL-GGEFVALGGLFDYSDVPFRPESLSAL
        +G+GIGSGA  AE+N L GGEFVALGGLFDYSDV FRPESL AL
Subjt:  IGIGIGSGA--AEENCL-GGEFVALGGLFDYSDVPFRPESLSAL

A0A5E4F0J5 PREDICTED: AT1G133608.9e-4050.41Show/hide
Query:  NRRRVRDEFDDSLVDS----AESKLRRLNSAESRFVKP--------CTKGNWNVVNSSKSEQVGSDGDDLRIDLAESEEIQDELLNILEDGDAVTERDES
        N  R R  +D   +++     ESKL R NS+ S    P         T    N   S   + VG D D+L ++  E + IQD+LLNIL+D D VT+RD +
Subjt:  NRRRVRDEFDDSLVDS----AESKLRRLNSAESRFVKP--------CTKGNWNVVNSSKSEQVGSDGDDLRIDLAESEEIQDELLNILEDGDAVTERDES

Query:  IQGLELDSFIRSFEEEIQ--ALPPAKTTSDRNETPQAELGYLFEASDDELGLPPTGGSSTEGKMEAIDFMPACSSAGVFELDGNLGFE-DEIPCYDSFEI
        IQ  +LDS I+SFEEEIQ  A P ++TTS    + Q ELGYL EASDDELGLPPT G S +GK+EA DF  + S A    LDG LGFE D IP YDSFE+
Subjt:  IQGLELDSFIRSFEEEIQ--ALPPAKTTSDRNETPQAELGYLFEASDDELGLPPTGGSSTEGKMEAIDFMPACSSAGVFELDGNLGFE-DEIPCYDSFEI

Query:  GIGIGSGAAEENCLGG-EFVALGGLFDY-----SDVPFRPESLSAL
        GIG G      N  GG E+VALGGLFDY     SDV +R ESLSAL
Subjt:  GIGIGSGAAEENCLGG-EFVALGGLFDY-----SDVPFRPESLSAL

A0A6J1E811 uncharacterized protein LOC1114315773.4e-8473.82Show/hide
Query:  MEIYTDNRRRVRDEFDDSLVDSAESKLRRLNSAESRFVKPCTKGNWNVVNSSKSEQVGSDGDDLRIDLAESEEIQDELLNILEDGDAVTERDESIQGLEL
        ME  TDN++R+  EFDDSL DSAESKLRRL+S++   + PCT+GNWNVV   +S        D  I+L+ES +IQD+LLNILED D V ERDESI+GLEL
Subjt:  MEIYTDNRRRVRDEFDDSLVDSAESKLRRLNSAESRFVKPCTKGNWNVVNSSKSEQVGSDGDDLRIDLAESEEIQDELLNILEDGDAVTERDESIQGLEL

Query:  DSFIRSFEEEIQALPPAKTTSDRNETPQAELGYLFEASDDELGLPPTGGSSTEGK-MEAIDFMPACSS-AGVFELDGNLGFEDEIPCYDSFEIGIGIGSG
        DSFIRSFEEEIQALP  KT S +NETPQ ELGYL+ ASDDELGLPPTGG ST+GK  EAIDFMPA SS  GVFELDGN GFEDEIPCYD FEIG+G  SG
Subjt:  DSFIRSFEEEIQALPPAKTTSDRNETPQAELGYLFEASDDELGLPPTGGSSTEGK-MEAIDFMPACSS-AGVFELDGNLGFEDEIPCYDSFEIGIGIGSG

Query:  AAEENCLGGEFVALGGLFDYSDVPFRPESLSAL
        AAEEN LGGEFVALGGLFDYSDVPFRPESLSAL
Subjt:  AAEENCLGGEFVALGGLFDYSDVPFRPESLSAL

A0A6J1GR95 uncharacterized protein LOC1114568057.9e-9782.68Show/hide
Query:  MEIYTDNRRRVRDEFDDSLVDSAESKLRRLNSAESRFVKPCTKGNWNVVNSSKSEQVGSDGDDLRIDLAESEEIQDELLNILEDGDAVTERDESIQGLEL
        ME  TDNR+R RDE D SL DSAESKLRRLNS ESRFVKPC          SKSEQVGSDG DLRIDLA+S+EIQD+LLNILED D V ERDESIQG EL
Subjt:  MEIYTDNRRRVRDEFDDSLVDSAESKLRRLNSAESRFVKPCTKGNWNVVNSSKSEQVGSDGDDLRIDLAESEEIQDELLNILEDGDAVTERDESIQGLEL

Query:  DSFIRSFEEEIQALPPAKTTSDRNETPQAELGYLFEASDDELGLPPTGGSSTEGKMEAIDFMPACSSAGVFELDGNLGFEDEIPCYDSFEIGIGIGSGAA
        DSFIRSFEEEI ALPPA+T+S++NETPQAELGYLFEASDDELGLPPT GSS EGK+EAIDF PACS  GVFE+DGN+GFEDEIPCYDSFEIGIGIGSGAA
Subjt:  DSFIRSFEEEIQALPPAKTTSDRNETPQAELGYLFEASDDELGLPPTGGSSTEGKMEAIDFMPACSSAGVFELDGNLGFEDEIPCYDSFEIGIGIGSGAA

Query:  EENCLGGEFVALGGLFDYSDVPFRPESLSAL
        EEN LGGEFVALGGLFDYSDVPFRPESLSAL
Subjt:  EENCLGGEFVALGGLFDYSDVPFRPESLSAL

A0A6J1K097 uncharacterized protein LOC1114899167.4e-9580.95Show/hide
Query:  MEIYTDNRRRVRDEFDDSLVDSAESKLRRLNSAESRFVKPCTKGNWNVVNSSKSEQVGSDGDDLRIDLAESEEIQDELLNILEDGDAVTERDESIQGLEL
        ME  TDNR+R RD+ D SL DSAESKLRRLN  ESRFVKPC          SKSEQVGSDGDDLRIDLA+S+EI D+LLNILED D V ERDE IQG EL
Subjt:  MEIYTDNRRRVRDEFDDSLVDSAESKLRRLNSAESRFVKPCTKGNWNVVNSSKSEQVGSDGDDLRIDLAESEEIQDELLNILEDGDAVTERDESIQGLEL

Query:  DSFIRSFEEEIQALPPAKTTSDRNETPQAELGYLFEASDDELGLPPTGGSSTEGKMEAIDFMPACSSAGVFELDGNLGFEDEIPCYDSFEIGIGIGSGAA
        DSFIRSFEEEI ALPPA+T+S++NETPQAELGYLFEASDDELGLPPT GSS EG +EAIDF PACS  GVFE+DGN+GFEDEIPCYDSFEIGIGIGSGAA
Subjt:  DSFIRSFEEEIQALPPAKTTSDRNETPQAELGYLFEASDDELGLPPTGGSSTEGKMEAIDFMPACSSAGVFELDGNLGFEDEIPCYDSFEIGIGIGSGAA

Query:  EENCLGGEFVALGGLFDYSDVPFRPESLSAL
        EEN LGGEFVALGGLFDYSDVPFRPESLSAL
Subjt:  EENCLGGEFVALGGLFDYSDVPFRPESLSAL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G13360.1 unknown protein2.6e-1535Show/hide
Query:  NSSKSEQVGSDG-DDLRIDLAESEEIQDELLNILEDGDAVTERDESIQGLELDSFIRSFEEEIQAL--PPAKTTSDRNETPQAELGYLFEASDDELGLPP
        +S++ ++V  +  D+  +D  E + ++D+L ++L+D D      E +   +LDS ++SFE+E+  +    A+ +S   ET Q +LGYL EASDDELGLPP
Subjt:  NSSKSEQVGSDG-DDLRIDLAESEEIQDELLNILEDGDAVTERDESIQGLELDSFIRSFEEEIQAL--PPAKTTSDRNETPQAELGYLFEASDDELGLPP

Query:  TGGSS------TEGKMEAI-DFMPACS-SAGVFELDGNLGFEDEIPCYDSFEIGIGIGSGAAEENCLGGEFVALGGLFDYSDVPF--------RPESLSA
            S       E   E + D + A S S+G+ E+    GFED +  Y   + G G+G         GG++VA+ GLF++SD  F        R ESL A
Subjt:  TGGSS------TEGKMEAI-DFMPACS-SAGVFELDGNLGFEDEIPCYDSFEIGIGIGSGAAEENCLGGEFVALGGLFDYSDVPF--------RPESLSA

AT1G13360.2 unknown protein4.1e-1334.25Show/hide
Query:  NSSKSEQVGSDG-DDLRIDLAESEEIQDELLNILEDGDAVTERDESIQGLELDSFIRSFEEEIQAL--PPAKTTSDRNETPQAELGYLFEASDDELGLPP
        +S++ ++V  +  D+  +D  E + ++D+L ++L+D D      E +   +LDS ++SFE+E+  +    A+ +S   ET Q +LGYL EASDDELGLPP
Subjt:  NSSKSEQVGSDG-DDLRIDLAESEEIQDELLNILEDGDAVTERDESIQGLELDSFIRSFEEEIQAL--PPAKTTSDRNETPQAELGYLFEASDDELGLPP

Query:  TGGSS------TEGKMEAI-DFMPACS-SAGVFELDGNLGFEDEIPCYDSFEIGIGIGSGAAEENCLGGEFVALGGLFDYS
            S       E   E + D + A S S+G+ E+    GFED +  Y   + G G+G         GG++VA+ G F Y+
Subjt:  TGGSS------TEGKMEAI-DFMPACS-SAGVFELDGNLGFEDEIPCYDSFEIGIGIGSGAAEENCLGGEFVALGGLFDYS

AT1G13360.3 unknown protein2.7e-1234.46Show/hide
Query:  NSSKSEQVGSDG-DDLRIDLAESEEIQDELLNILEDGDAVTERDESIQGLELDSFIRSFEEEIQAL--PPAKTTSDRNETPQAELGYLFEASDDELGLPP
        +S++ ++V  +  D+  +D  E + ++D+L ++L+D D      E +   +LDS ++SFE+E+  +    A+ +S   ET Q +LGYL EASDDELGLPP
Subjt:  NSSKSEQVGSDG-DDLRIDLAESEEIQDELLNILEDGDAVTERDESIQGLELDSFIRSFEEEIQAL--PPAKTTSDRNETPQAELGYLFEASDDELGLPP

Query:  TGGSS------TEGKMEAI-DFMPACS-SAGVFELDGNLGFEDEIPCYDSFEIGIGIGSGAAEENCLGGEFVALGGL
            S       E   E + D + A S S+G+ E+    GFED +  Y   + G G+G         GG++VA+ GL
Subjt:  TGGSS------TEGKMEAI-DFMPACS-SAGVFELDGNLGFEDEIPCYDSFEIGIGIGSGAAEENCLGGEFVALGGL

AT3G25870.1 unknown protein8.3e-0629.84Show/hide
Query:  SKSEQVGSD--GDDLRIDLAESEEIQDELLNILEDG-DAVTERDESIQGLELDSFIRSFEEEIQALPPAKTTSDRNETPQAELGYLFEASDDELGLPP--
        ++++ VG+    D L +D  + + ++D+L +  + G D V++        +LDS ++SFE E+     A ++ +     Q +LGYLFEASDDELGLPP  
Subjt:  SKSEQVGSD--GDDLRIDLAESEEIQDELLNILEDG-DAVTERDESIQGLELDSFIRSFEEEIQALPPAKTTSDRNETPQAELGYLFEASDDELGLPP--

Query:  ------TGGSSTEGKMEAIDFMPACSSAGVFELDGNLGFEDEIPCYDSFEIGIGIGSGAAEENCLGGEFVALGGLFDYSDV-PFRPESLSA
                 S  E   E +    +  S+ V EL    GFED +  +   ++G              G F    G  D  D+  +RPE L A
Subjt:  ------TGGSSTEGKMEAIDFMPACSSAGVFELDGNLGFEDEIPCYDSFEIGIGIGSGAAEENCLGGEFVALGGLFDYSDV-PFRPESLSA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAATTTACACCGACAACAGAAGGCGAGTTCGCGACGAGTTCGACGACTCGCTTGTCGATTCGGCCGAGTCGAAGCTCAGACGACTCAACTCAGCGGAATCGAGATT
CGTGAAGCCATGCACCAAGGGGAACTGGAATGTTGTTAACTCGTCGAAGTCGGAGCAGGTAGGATCCGACGGAGATGATTTGAGAATCGATTTGGCGGAGTCGGAGGAGA
TTCAAGATGAGTTGCTTAACATCCTCGAAGATGGCGACGCCGTAACGGAGCGAGATGAGAGTATTCAAGGTCTCGAACTCGATTCGTTCATCCGAAGCTTCGAGGAGGAG
ATTCAAGCTCTACCGCCGGCGAAAACGACGTCGGATCGGAATGAGACTCCTCAGGCGGAACTTGGATATCTTTTCGAGGCGTCGGATGATGAACTCGGGCTGCCGCCGAC
GGGAGGTTCGAGTACTGAAGGGAAGATGGAGGCGATTGATTTTATGCCGGCTTGTTCTTCGGCTGGTGTGTTCGAGTTGGATGGGAACTTAGGGTTTGAGGATGAGATAC
CGTGTTATGACTCGTTCGAAATCGGAATCGGCATTGGTTCCGGCGCGGCGGAGGAGAATTGTTTGGGCGGAGAGTTTGTTGCATTGGGCGGTTTGTTTGATTATTCAGAC
GTGCCGTTTCGGCCGGAGTCGTTATCGGCTCTGTAG
mRNA sequenceShow/hide mRNA sequence
CTCACTTCTGCAACAATTCCTTTAATCTTTCTCTCTTTAATGGCGGTTTAAACCCCTATTTCTTCTTCTTCTTCTTCTTCTTCTCTCTATTTCTATTCTTTCTTCTTCCA
TGGAAATTTACACCGACAACAGAAGGCGAGTTCGCGACGAGTTCGACGACTCGCTTGTCGATTCGGCCGAGTCGAAGCTCAGACGACTCAACTCAGCGGAATCGAGATTC
GTGAAGCCATGCACCAAGGGGAACTGGAATGTTGTTAACTCGTCGAAGTCGGAGCAGGTAGGATCCGACGGAGATGATTTGAGAATCGATTTGGCGGAGTCGGAGGAGAT
TCAAGATGAGTTGCTTAACATCCTCGAAGATGGCGACGCCGTAACGGAGCGAGATGAGAGTATTCAAGGTCTCGAACTCGATTCGTTCATCCGAAGCTTCGAGGAGGAGA
TTCAAGCTCTACCGCCGGCGAAAACGACGTCGGATCGGAATGAGACTCCTCAGGCGGAACTTGGATATCTTTTCGAGGCGTCGGATGATGAACTCGGGCTGCCGCCGACG
GGAGGTTCGAGTACTGAAGGGAAGATGGAGGCGATTGATTTTATGCCGGCTTGTTCTTCGGCTGGTGTGTTCGAGTTGGATGGGAACTTAGGGTTTGAGGATGAGATACC
GTGTTATGACTCGTTCGAAATCGGAATCGGCATTGGTTCCGGCGCGGCGGAGGAGAATTGTTTGGGCGGAGAGTTTGTTGCATTGGGCGGTTTGTTTGATTATTCAGACG
TGCCGTTTCGGCCGGAGTCGTTATCGGCTCTGTAGAAACGGGTTTCCAATGGGGGAAGGTTGGTTTTTGGTTTTTAGGGTTGGTGTGGAACCGACGTCGTTTTAAGTCTG
TAGGTTGTAAATCTGAAAGGACAAAAACACAACTCTTTGAATAGCAATGCAGAATTTCCAAAAAAAAAAAAGGAAATGTAGAGGAGTTTTTTTTTAATTTTGTTTTTTTT
AATAGAACGATTCTTAATTTTTACCGTTGTAAAAACCCTGTGTATCCGTCGGTTTCTCTCTTTTGTCGTTTCTCTGATTTTGTTATTTTATTATTGACTGGAGAGGGGTA
GTTTTGGAAAATTAGGAGATAAAAAAAAGGTTGAATCAGTTTCGAGTTGAAAATAGGTGGGGATTGATGGAGTGTCGCTGTGTGTGGGCACCTTTGAGTAAGAGATGATA
TTTTTTCTTTTCTTTTTCTTTTTTGAACCACGCAGGTTTTAATGACTACTTTTTAGTTTTTAAGAGACGTTTAATTCATGAATTTGAAAAGTTTTAGGTTTGTCGTAATT
AATTTTTTTAAAAGAAAATATTGGGTAGAAAATTTATTTTAAATTTAGATTTACGTGACTCGTGGAACTTTTTAAATGGTGCGTCAAGTTTTTAATTCATATAAAATTCC
AACTTACAATTAAAAAAAAATGTAAAATTGTTGAGC
Protein sequenceShow/hide protein sequence
MEIYTDNRRRVRDEFDDSLVDSAESKLRRLNSAESRFVKPCTKGNWNVVNSSKSEQVGSDGDDLRIDLAESEEIQDELLNILEDGDAVTERDESIQGLELDSFIRSFEEE
IQALPPAKTTSDRNETPQAELGYLFEASDDELGLPPTGGSSTEGKMEAIDFMPACSSAGVFELDGNLGFEDEIPCYDSFEIGIGIGSGAAEENCLGGEFVALGGLFDYSD
VPFRPESLSAL