CuGenDBv2

Tan0020073 (gene) of Snake gourd v1 genome

Gene ID: Tan0020073
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Retrotransposon protein
Genome location: LG02:8611242..8612267
RNA-Seq Expression: Tan0020073
Synteny: Tan0020073
Gene Ontology terms: NA
InterPro domains: IPR024752 - Myb/SANT-like domain


Homology
GenBank top hits (e value, % identity, alignment)
XP_008441954.1 PREDICTED: uncharacterized protein LOC103485953 [Cucumis melo] | e value: 2.9e-53 | % identity: 42.48
Query:  MAGTSKHSKHTWTKVEDARLVESLVYLVHN-GWRSDNGTFRPGYLQHLQKMLAEKLPNSSL-ELNTIDCKVRTLEKTIQSSLREMLGNGCSGFGWNEEFK
        MA  S+  KHTWTK E+ + VE LV LV + GWRSDNGTF+PGYL  LQ+M+AEKLP +++ E +TIDC V++L+KT   ++ EM G  CSGFGWNEEF+
Subjt:  MAGTSKHSKHTWTKVEDARLVESLVYLVHN-GWRSDNGTFRPGYLQHLQKMLAEKLPNSSL-ELNTIDCKVRTLEKTIQSSLREMLGNGCSGFGWNEEFK

Query:  CVEAEKEVFDAWVKSHTNAKGMRNKPFSHYDELAVVFGKDRATGIGAETLMEMTSNVAEQIEEEIRLGSQDFIGTKQRTMENPGLGDVGEDDL----PDT
        C+ AE+++FD+W+KSH  AKG+ +K F +YD+L+ VFGKDRATG  +ET   + SNV+    + I LG  D       TM + G+  +  D++       
Subjt:  CVEAEKEVFDAWVKSHTNAKGMRNKPFSHYDELAVVFGKDRATGIGAETLMEMTSNVAEQIEEEIRLGSQDFIGTKQRTMENPGLGDVGEDDL----PDT

Query:  PTSRRNTSDMSSRCTGSKRKRS--------SFQTELIDV-----KEKYELEATRRKEVVDLLYQIEGLTEHDRVSLIDLLVTDIQKTDCFLQVPPQSRKA
         + RRN S +S R  GS+R  +         F  E +       KEK  +E   R +VV  L  I  L   DR  L+ +L   ++  + FL +P + +  
Subjt:  PTSRRNTSDMSSRCTGSKRKRS--------SFQTELIDV-----KEKYELEATRRKEVVDLLYQIEGLTEHDRVSLIDLLVTDIQKTDCFLQVPPQSRKA

Query:  YCMRLL
        YC  LL
Subjt:  YCMRLL

XP_038887234.1 uncharacterized protein LOC120077425 [Benincasa hispida] | e value: 2.8e-72 | % identity: 51.3
Query:  MAGTSKHSKHTWTKVEDARLVESLVYLVHNGWRSDNGTFRPGYLQHLQKMLAEKLPNSSLELNTIDCKVRTLEKTIQSSLREMLGNGCSGFGWNEEFKCV
        M G SK SKH W+KVEDARLVE+L+YLV  GWRSDNGTFRPGYLQHL+++L EK+P  +L  NTI+CKVR+L+K   +++ EML    SGF WNEEFKCV
Subjt:  MAGTSKHSKHTWTKVEDARLVESLVYLVHNGWRSDNGTFRPGYLQHLQKMLAEKLPNSSLELNTIDCKVRTLEKTIQSSLREMLGNGCSGFGWNEEFKCV

Query:  EAEKEVFDAWVKSHTNAKGMRNKPFSHYDELAVVFGKDRATGIGAETLMEMTSNVAEQIEEEIRLGSQDFIGTKQRTMENPGLGDVGEDDLPDTPTSRRN
        + E+E+FD WV+SH NAKGM  KPF HYD+L+ VFGKDRA                            D    + R  E+P   D  +++  +  T R +
Subjt:  EAEKEVFDAWVKSHTNAKGMRNKPFSHYDELAVVFGKDRATGIGAETLMEMTSNVAEQIEEEIRLGSQDFIGTKQRTMENPGLGDVGEDDLPDTPTSRRN

Query:  TSDMSSRCTGSKRKRSSFQTELIDV-------------------KEKYELEATRRKEVVDLLYQIEGLTEHDRVSLIDLLVTDIQKTDCFLQVPPQSRKA
            SSR  GSKRKRSSFQ E+ID+                    EKYELE    KEVV+ +Y I+ L E+D+V+LIDL+VTDIQKTDCFL VP  +RK 
Subjt:  TSDMSSRCTGSKRKRSSFQTELIDV-------------------KEKYELEATRRKEVVDLLYQIEGLTEHDRVSLIDLLVTDIQKTDCFLQVPPQSRKA

Query:  YCMRLLGR
        YC+RLLGR
Subjt:  YCMRLLGR

XP_038895773.1 uncharacterized protein LOC120083935 [Benincasa hispida] | e value: 3.7e-69 | % identity: 52.11
Query:  KHSKHTWTKVEDARLVESLVYLVHNGWRSDNGTFRPGYLQHLQKMLAEKLPNSSLELNTIDCKVRTLEKTIQSSLREMLGNGCSGFGWNEEFKCVEAEKE
        K SKH W+KVEDA+ VE+L+YLV  GWRSDNGTFR  YLQHL+++  EK+   +L  NTI+CKVR+L+K   +++ EML    SGF WNEEFKCV+ E+E
Subjt:  KHSKHTWTKVEDARLVESLVYLVHNGWRSDNGTFRPGYLQHLQKMLAEKLPNSSLELNTIDCKVRTLEKTIQSSLREMLGNGCSGFGWNEEFKCVEAEKE

Query:  VFDAWVKSHTNAKGMRNKPFSHYDELAVVFGKDRATGIGAETLMEMTSNVAEQIEEEIRLGSQDFIGTKQRTMENPGLGDVGEDDLPDTPTSRRNTSDMS
        +FD WV+SH NAKGM NKPF HYD+L+ VFGK +A G  +E    MT+N   + E+EIRLGSQD       T E+  +G                     
Subjt:  VFDAWVKSHTNAKGMRNKPFSHYDELAVVFGKDRATGIGAETLMEMTSNVAEQIEEEIRLGSQDFIGTKQRTMENPGLGDVGEDDLPDTPTSRRNTSDMS

Query:  SRCTGSKRKRSSFQTELIDVKEKYELEATRRKEVVDLLYQIEGLTEHDRVSLIDLLVTDIQKTDCFLQVPPQSRKAYCMRLLGR
                + +S+Q      KEKYELE  RRKEVV+ +Y I+GL E D+V+LIDLLVTDIQKT+CFL VP  +RK YC+RLLGR
Subjt:  SRCTGSKRKRSSFQTELIDVKEKYELEATRRKEVVDLLYQIEGLTEHDRVSLIDLLVTDIQKTDCFLQVPPQSRKAYCMRLLGR

XP_038896380.1 uncharacterized protein LOC120084641 [Benincasa hispida] | e value: 6.8e-71 | % identity: 50.65
Query:  MAGTSKHSKHTWTKVEDARLVESLVYLVHNGWRSDNGTFRPGYLQHLQKMLAEKLPNSSLELNTIDCKVRTLEKTIQSSLREMLGNGCSGFGWNEEFKCV
        MAG+ K SKH W+KVED +LVE+L+YLV  GWRSDNGTFR GYLQ+L+++L EK+P  +L  NTI+CKVR+L+K   +++ EML    SGFGWNEEFKCV
Subjt:  MAGTSKHSKHTWTKVEDARLVESLVYLVHNGWRSDNGTFRPGYLQHLQKMLAEKLPNSSLELNTIDCKVRTLEKTIQSSLREMLGNGCSGFGWNEEFKCV

Query:  EAEKEVFDAWVKSHTNAKGMRNKPFSHYDELAVVFGKDRATGIGAETLMEMTSNVAEQIEEEIRLGSQDFIGTKQRTMENPGLGDVGEDDLPDTPTSRRN
        + EKE+FD WV+SH NAKGM NK F HYD+L+ VFGKDRA     E     +    ++I+EE                              +  T R +
Subjt:  EAEKEVFDAWVKSHTNAKGMRNKPFSHYDELAVVFGKDRATGIGAETLMEMTSNVAEQIEEEIRLGSQDFIGTKQRTMENPGLGDVGEDDLPDTPTSRRN

Query:  TSDMSSRCTGSKRKRSSFQTELIDV-------------------KEKYELEATRRKEVVDLLYQIEGLTEHDRVSLIDLLVTDIQKTDCFLQVPPQSRKA
            SSR  GSKRKR SFQ E+ID+                   KEKYELE  RRKEVV+ +Y I+GL E D+V+ IDLLVTDIQKTDCFL VP  +RK 
Subjt:  TSDMSSRCTGSKRKRSSFQTELIDV-------------------KEKYELEATRRKEVVDLLYQIEGLTEHDRVSLIDLLVTDIQKTDCFLQVPPQSRKA

Query:  YCMRLLGR
        YC+ LL R
Subjt:  YCMRLLGR

XP_038902479.1 uncharacterized protein At2g29880-like [Benincasa hispida] | e value: 2.3e-55 | % identity: 54.13
Query:  MAGTSKHSKHTWTKVEDARLVESLVYLVHNGWRSDNGTFRPGYLQHLQKMLAEKLPNSSLELNTIDCKVRTLEKTIQSSLREMLGNGCSGFGWNEEFKCV
        M    K SKH W+KVEDA+LVE+L+YLV  GWRSDNGTFRPGYLQHL+++L EK+P  +L  NTI+CKVR+L+K   + + EML    SGF WNEEFKCV
Subjt:  MAGTSKHSKHTWTKVEDARLVESLVYLVHNGWRSDNGTFRPGYLQHLQKMLAEKLPNSSLELNTIDCKVRTLEKTIQSSLREMLGNGCSGFGWNEEFKCV

Query:  EAEKEVFDAWVKSHTNAKGMRNKPFSHYDELAVVFGKDRATGIGAETLMEMTSNVAEQIEEEIRLGSQDFIGTKQRTMENPGLGDVGEDDLPDTPTSRRN
        + E+E+FD WV SH NAK M NKPF HYD+ + VFGKDR  G  +E    M +N   + E+EIRLGSQD    + R  E+P   D  +++  +  T R +
Subjt:  EAEKEVFDAWVKSHTNAKGMRNKPFSHYDELAVVFGKDRATGIGAETLMEMTSNVAEQIEEEIRLGSQDFIGTKQRTMENPGLGDVGEDDLPDTPTSRRN

Query:  TSDMSSRCTGSKRKRSSF
            SSR  GSKRKR SF
Subjt:  TSDMSSRCTGSKRKRSSF

TrEMBL top hits (e value, % identity, alignment)
A0A1S3B4L3 uncharacterized protein LOC103485953 | e value: 1.4e-53 | % identity: 42.48
Query:  MAGTSKHSKHTWTKVEDARLVESLVYLVHN-GWRSDNGTFRPGYLQHLQKMLAEKLPNSSL-ELNTIDCKVRTLEKTIQSSLREMLGNGCSGFGWNEEFK
        MA  S+  KHTWTK E+ + VE LV LV + GWRSDNGTF+PGYL  LQ+M+AEKLP +++ E +TIDC V++L+KT   ++ EM G  CSGFGWNEEF+
Subjt:  MAGTSKHSKHTWTKVEDARLVESLVYLVHN-GWRSDNGTFRPGYLQHLQKMLAEKLPNSSL-ELNTIDCKVRTLEKTIQSSLREMLGNGCSGFGWNEEFK

Query:  CVEAEKEVFDAWVKSHTNAKGMRNKPFSHYDELAVVFGKDRATGIGAETLMEMTSNVAEQIEEEIRLGSQDFIGTKQRTMENPGLGDVGEDDL----PDT
        C+ AE+++FD+W+KSH  AKG+ +K F +YD+L+ VFGKDRATG  +ET   + SNV+    + I LG  D       TM + G+  +  D++       
Subjt:  CVEAEKEVFDAWVKSHTNAKGMRNKPFSHYDELAVVFGKDRATGIGAETLMEMTSNVAEQIEEEIRLGSQDFIGTKQRTMENPGLGDVGEDDL----PDT

Query:  PTSRRNTSDMSSRCTGSKRKRS--------SFQTELIDV-----KEKYELEATRRKEVVDLLYQIEGLTEHDRVSLIDLLVTDIQKTDCFLQVPPQSRKA
         + RRN S +S R  GS+R  +         F  E +       KEK  +E   R +VV  L  I  L   DR  L+ +L   ++  + FL +P + +  
Subjt:  PTSRRNTSDMSSRCTGSKRKRS--------SFQTELIDV-----KEKYELEATRRKEVVDLLYQIEGLTEHDRVSLIDLLVTDIQKTDCFLQVPPQSRKA

Query:  YCMRLL
        YC  LL
Subjt:  YCMRLL

A0A5A7U0H7 Retrotransposon protein | e value: 1.4e-53 | % identity: 42.48
Query:  MAGTSKHSKHTWTKVEDARLVESLVYLVHN-GWRSDNGTFRPGYLQHLQKMLAEKLPNSSL-ELNTIDCKVRTLEKTIQSSLREMLGNGCSGFGWNEEFK
        MA  S+  KHTWTK E+ + VE LV LV + GWRSDNGTF+PGYL  LQ+M+AEKLP +++ E +TIDC V++L+KT   ++ EM G  CSGFGWNEEF+
Subjt:  MAGTSKHSKHTWTKVEDARLVESLVYLVHN-GWRSDNGTFRPGYLQHLQKMLAEKLPNSSL-ELNTIDCKVRTLEKTIQSSLREMLGNGCSGFGWNEEFK

Query:  CVEAEKEVFDAWVKSHTNAKGMRNKPFSHYDELAVVFGKDRATGIGAETLMEMTSNVAEQIEEEIRLGSQDFIGTKQRTMENPGLGDVGEDDL----PDT
        C+ AE+++FD+W+KSH  AKG+ +K F +YD+L+ VFGKDRATG  +ET   + SNV+    + I LG  D       TM + G+  +  D++       
Subjt:  CVEAEKEVFDAWVKSHTNAKGMRNKPFSHYDELAVVFGKDRATGIGAETLMEMTSNVAEQIEEEIRLGSQDFIGTKQRTMENPGLGDVGEDDL----PDT

Query:  PTSRRNTSDMSSRCTGSKRKRS--------SFQTELIDV-----KEKYELEATRRKEVVDLLYQIEGLTEHDRVSLIDLLVTDIQKTDCFLQVPPQSRKA
         + RRN S +S R  GS+R  +         F  E +       KEK  +E   R +VV  L  I  L   DR  L+ +L   ++  + FL +P + +  
Subjt:  PTSRRNTSDMSSRCTGSKRKRS--------SFQTELIDV-----KEKYELEATRRKEVVDLLYQIEGLTEHDRVSLIDLLVTDIQKTDCFLQVPPQSRKA

Query:  YCMRLL
        YC  LL
Subjt:  YCMRLL

A0A5A7UME4 Retrotransposon protein | e value: 1.6e-49 | % identity: 43
Query:  MAGTSKHSKHTWTKVEDARLVESLVYLVH-NGWRSDNGTFRPGYLQHLQKMLAEKLPNSSLELNTIDCKVRTLEKTIQSSLREMLGNGCSGFGWNEEFKC
        M  +S+  KHTWTK E+A LVE LV LV+  GWRSDNGTFRPGYL  L +M+A K+P S++  +TID +++ L K +  +L EM G  CSGFGWN+E KC
Subjt:  MAGTSKHSKHTWTKVEDARLVESLVYLVH-NGWRSDNGTFRPGYLQHLQKMLAEKLPNSSLELNTIDCKVRTLEKTIQSSLREMLGNGCSGFGWNEEFKC

Query:  VEAEKEVFDAWVKSHTNAKGMRNKPFSHYDELAVVFGKDRATGIGAETLMEMTSNVAEQIEEEIRLGSQDFIGTKQRTMENPGLGDVGEDDLPDTPTSRR
        + AEKEVFD W  SH  AKG+ NK F HYDEL+ VFGKDRATG  AE+  ++ SN     + E      D   T    M +PGL ++  DDL +T T+R 
Subjt:  VEAEKEVFDAWVKSHTNAKGMRNKPFSHYDELAVVFGKDRATGIGAETLMEMTSNVAEQIEEEIRLGSQDFIGTKQRTMENPGLGDVGEDDLPDTPTSRR

Query:  NTSDMSSRCTGSKRKRSSFQTELIDVKEK------------------YELEATR-RKEVVDLLYQIEGLTEHDRVSLIDLLVTDIQKTDCFLQVPPQSRK
          S+  +  +GSKRKR    T+  D+                        +AT+ R+E+V  L  I  LT  DR  L+ +L+ ++     FL+VP   + 
Subjt:  NTSDMSSRCTGSKRKRSSFQTELIDVKEK------------------YELEATR-RKEVVDLLYQIEGLTEHDRVSLIDLLVTDIQKTDCFLQVPPQSRK

Query:  AYCMRLL
         YC  +L
Subjt:  AYCMRLL

A0A5D3CBF7 Retrotransposon protein | e value: 4.6e-49 | % identity: 42.86
Query:  MAGTSKHSKHTWTKVEDARLVESLVYLVH-NGWRSDNGTFRPGYLQHLQKMLAEKLPNSSLELNTIDCKVRTLEKTIQSSLREMLGNGCSGFGWNEEFKC
        M  +S+  KHTWTK E+A LVE LV LV+  GWRSDNGTFRPGYL  L +M+A K+P S++  +TID +++ L K +  +L EM G  CSGFGWN+E KC
Subjt:  MAGTSKHSKHTWTKVEDARLVESLVYLVH-NGWRSDNGTFRPGYLQHLQKMLAEKLPNSSLELNTIDCKVRTLEKTIQSSLREMLGNGCSGFGWNEEFKC

Query:  VEAEKEVFDAWVKSHTNAKGMRNKPFSHYDELAVVFGKDRATGIGAETLMEMTSNVAEQIEEEIRLGSQDFI-GTKQRTMENPGLGDVGEDDLPDTPTSR
        + AEKEVFD W  SH  AKG+ NK F HYDEL+ VFGKDRATG  AE+  ++ SN     +     G+ D +  T    M +PGL ++  DDL +T T+R
Subjt:  VEAEKEVFDAWVKSHTNAKGMRNKPFSHYDELAVVFGKDRATGIGAETLMEMTSNVAEQIEEEIRLGSQDFI-GTKQRTMENPGLGDVGEDDLPDTPTSR

Query:  RNTSDMSSRCTGSKRKRSSFQTELIDVKEK------------------YELEATR-RKEVVDLLYQIEGLTEHDRVSLIDLLVTDIQKTDCFLQVPPQSR
           S+  +  +GSKRKR    T+  D+                        +AT+ R+E+V  L  I  LT  DR  L+ +L+ ++     FL+VP   +
Subjt:  RNTSDMSSRCTGSKRKRSSFQTELIDVKEK------------------YELEATR-RKEVVDLLYQIEGLTEHDRVSLIDLLVTDIQKTDCFLQVPPQSR

Query:  KAYCMRLL
          YC  +L
Subjt:  KAYCMRLL

E5GCB5 Retrotransposon protein | e value: 1.6e-49 | % identity: 43.04
Query:  IMAGTSKHSKHTWTKVEDARLVESLVYLVH-NGWRSDNGTFRPGYLQHLQKMLAEKLPNSSLELNTIDCKVRTLEKTIQSSLREMLGNGCSGFGWNEEFK
        IM  +S+  KHTWTK E+A LVE LV LV+  GWRSDNGTFRPGYL  L +M+A K+P S++  +TID +++ L K +  +L EM G  CSGFGWN+E K
Subjt:  IMAGTSKHSKHTWTKVEDARLVESLVYLVH-NGWRSDNGTFRPGYLQHLQKMLAEKLPNSSLELNTIDCKVRTLEKTIQSSLREMLGNGCSGFGWNEEFK

Query:  CVEAEKEVFDAWVKSHTNAKGMRNKPFSHYDELAVVFGKDRATGIGAETLMEMTSNVAEQIEEEIRLGSQDFI-GTKQRTMENPGLGDVGEDDLPDTPTS
        C+ AEKEVFD W  SH  AKG+ NK F HYDEL+ VFGKDRATG  AE+  ++ SN     +     G+ D +  T    M +PGL ++  DDL +T T+
Subjt:  CVEAEKEVFDAWVKSHTNAKGMRNKPFSHYDELAVVFGKDRATGIGAETLMEMTSNVAEQIEEEIRLGSQDFI-GTKQRTMENPGLGDVGEDDLPDTPTS

Query:  RRNTSDMSSRCTGSKRKRSSFQTELIDVKEK------------------YELEATR-RKEVVDLLYQIEGLTEHDRVSLIDLLVTDIQKTDCFLQVPPQS
        R   S+  +  +GSKRKR    T+  D+                        +AT+ R+E+V  L  I  LT  DR  L+ +L+ ++     FL+VP   
Subjt:  RRNTSDMSSRCTGSKRKRSSFQTELIDVKEK------------------YELEATR-RKEVVDLLYQIEGLTEHDRVSLIDLLVTDIQKTDCFLQVPPQS

Query:  RKAYCMRLL
        +  YC  +L
Subjt:  RKAYCMRLL

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G30140.1 unknown protein | e value: 3.5e-09 | % identity: 29.61
Query:  GTSKHSKHTWTKVEDARLVESLVYLVHNGWRSDNGTFRPGYLQHLQKML--AEKLPNSSLELNTIDCKVRTLEKTIQSSLREMLGNGCSGFGWNEEFKCV
        G  K   + WT  E   L+E    L+   WR  +G    G L    K+L    K    +        +++ L+   QS L   L    SGFGW+ E K  
Subjt:  GTSKHSKHTWTKVEDARLVESLVYLVHNGWRSDNGTFRPGYLQHLQKML--AEKLPNSSLELNTIDCKVRTLEKTIQSSLREMLGNGCSGFGWNEEFKCV

Query:  EAEKEVFDAWVKSHTNAKGMRNKPFSHYDELAVVFGKDRATGIGAETLMEMT
         A  EV+  ++K+H N K M+ +   H+++L ++FG   ATG  A  + + T
Subjt:  EAEKEVFDAWVKSHTNAKGMRNKPFSHYDELAVVFGKDRATGIGAETLMEMT


Sequences
CDS sequence
ATGCATGCATGTAGCATTATGGCAGGTACTTCAAAACACTCAAAGCATACGTGGACGAAAGTAGAGGATGCGAGGTTGGTAGAGTCACTGGTGTATTTAGTACATAATGG
GTGGCGATCAGACAATGGGACATTCAGGCCTGGGTATCTCCAACATCTTCAGAAGATGCTAGCAGAGAAATTGCCAAATTCATCATTAGAACTAAATACCATCGACTGCA
AAGTGAGAACTCTGGAAAAAACAATACAATCTTCATTGCGAGAGATGCTTGGGAATGGTTGTAGTGGTTTTGGTTGGAATGAAGAATTTAAATGTGTTGAAGCAGAGAAA
GAGGTATTCGATGCGTGGGTTAAGAGCCATACAAATGCCAAAGGGATGAGGAACAAGCCATTTTCGCACTATGATGAGCTGGCAGTTGTCTTCGGAAAAGATAGAGCTAC
AGGAATAGGCGCAGAGACCCTAATGGAAATGACCTCTAATGTTGCGGAACAAATAGAGGAGGAGATTCGTTTGGGATCACAAGACTTCATAGGGACAAAACAACGAACGA
TGGAGAATCCAGGACTTGGTGACGTAGGGGAAGATGACTTGCCAGACACTCCTACTAGTAGGCGTAATACATCTGACATGTCTTCTAGATGTACTGGGAGCAAAAGAAAA
CGATCGTCCTTCCAGACTGAATTAATTGATGTTAAGGAGAAGTATGAATTGGAGGCCACACGAAGGAAGGAAGTAGTCGATCTCTTGTATCAGATAGAAGGATTGACTGA
GCATGATCGTGTCTCCCTGATTGACTTGCTTGTGACTGATATCCAGAAGACTGATTGTTTTCTACAGGTTCCGCCTCAATCGAGAAAGGCGTATTGCATGCGTCTTCTAG
GAAGGACTGAATGA
mRNA sequence
ATGCATGCATGTAGCATTATGGCAGGTACTTCAAAACACTCAAAGCATACGTGGACGAAAGTAGAGGATGCGAGGTTGGTAGAGTCACTGGTGTATTTAGTACATAATGG
GTGGCGATCAGACAATGGGACATTCAGGCCTGGGTATCTCCAACATCTTCAGAAGATGCTAGCAGAGAAATTGCCAAATTCATCATTAGAACTAAATACCATCGACTGCA
AAGTGAGAACTCTGGAAAAAACAATACAATCTTCATTGCGAGAGATGCTTGGGAATGGTTGTAGTGGTTTTGGTTGGAATGAAGAATTTAAATGTGTTGAAGCAGAGAAA
GAGGTATTCGATGCGTGGGTTAAGAGCCATACAAATGCCAAAGGGATGAGGAACAAGCCATTTTCGCACTATGATGAGCTGGCAGTTGTCTTCGGAAAAGATAGAGCTAC
AGGAATAGGCGCAGAGACCCTAATGGAAATGACCTCTAATGTTGCGGAACAAATAGAGGAGGAGATTCGTTTGGGATCACAAGACTTCATAGGGACAAAACAACGAACGA
TGGAGAATCCAGGACTTGGTGACGTAGGGGAAGATGACTTGCCAGACACTCCTACTAGTAGGCGTAATACATCTGACATGTCTTCTAGATGTACTGGGAGCAAAAGAAAA
CGATCGTCCTTCCAGACTGAATTAATTGATGTTAAGGAGAAGTATGAATTGGAGGCCACACGAAGGAAGGAAGTAGTCGATCTCTTGTATCAGATAGAAGGATTGACTGA
GCATGATCGTGTCTCCCTGATTGACTTGCTTGTGACTGATATCCAGAAGACTGATTGTTTTCTACAGGTTCCGCCTCAATCGAGAAAGGCGTATTGCATGCGTCTTCTAG
GAAGGACTGAATGA
Protein sequence
MHACSIMAGTSKHSKHTWTKVEDARLVESLVYLVHNGWRSDNGTFRPGYLQHLQKMLAEKLPNSSLELNTIDCKVRTLEKTIQSSLREMLGNGCSGFGWNEEFKCVEAEK
EVFDAWVKSHTNAKGMRNKPFSHYDELAVVFGKDRATGIGAETLMEMTSNVAEQIEEEIRLGSQDFIGTKQRTMENPGLGDVGEDDLPDTPTSRRNTSDMSSRCTGSKRK
RSSFQTELIDVKEKYELEATRRKEVVDLLYQIEGLTEHDRVSLIDLLVTDIQKTDCFLQVPPQSRKAYCMRLLGRTE