; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0016956 (gene) of Snake gourd v1 genome

Gene IDTan0016956
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionRetrotransposon protein
Genome locationLG09:13650817..13651821
RNA-Seq ExpressionTan0016956
SyntenyTan0016956
Gene Ontology termsNA
InterPro domainsIPR024752 - Myb/SANT-like domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_038875070.1 uncharacterized protein LOC120067596 [Benincasa hispida]2.0e-5155.15Show/hide
Query:  MAGTSRNSKHTWTKVEDARLVESLVSLVHNGWRSDNGTFRPSYLQHLQKMLAEKLPNSCLEQNTIDCKVGTLKKQYNAIAEMLSNACSGFSWNEELKCVE
        MAG  +  KH W+KVED +LVE+L+ LV   WRSDNGTFRP YLQ  +++L EK+P   L QNTI CKV +LKKQYNA++EMLS   S F WNEE KCV+
Subjt:  MAGTSRNSKHTWTKVEDARLVESLVSLVHNGWRSDNGTFRPSYLQHLQKMLAEKLPNSCLEQNTIDCKVGTLKKQYNAIAEMLSNACSGFSWNEELKCVE

Query:  VEKEVFDAWVKSHTNAKGMRNKPFSHYDDLAFVFEKDRARGIGVETPMEMRFSAAEQMEEEIRLGSQDFMGVEQRTMENLGIGDIGEDDLPDTP
        VE+E+F+ WV+SH N KGM NK FSHYDDL+ VF KDRA G   E P  M  +A  + E+EIRLGSQD    E R  E+     + +D++ + P
Subjt:  VEKEVFDAWVKSHTNAKGMRNKPFSHYDDLAFVFEKDRARGIGVETPMEMRFSAAEQMEEEIRLGSQDFMGVEQRTMENLGIGDIGEDDLPDTP

XP_038887234.1 uncharacterized protein LOC120077425 [Benincasa hispida]1.2e-7251.47Show/hide
Query:  MAGTSRNSKHTWTKVEDARLVESLVSLVHNGWRSDNGTFRPSYLQHLQKMLAEKLPNSCLEQNTIDCKVGTLKKQYNAIAEMLSNACSGFSWNEELKCVE
        M G S+ SKH W+KVEDARLVE+L+ LV  GWRSDNGTFRP YLQHL+++L EK+P   L +NTI+CKV +LKKQYNA++EMLS   SGF+WNEE KCV+
Subjt:  MAGTSRNSKHTWTKVEDARLVESLVSLVHNGWRSDNGTFRPSYLQHLQKMLAEKLPNSCLEQNTIDCKVGTLKKQYNAIAEMLSNACSGFSWNEELKCVE

Query:  VEKEVFDAWVKSHTNAKGMRNKPFSHYDDLAFVFEKDRARGIGVETPMEMRFSAAEQMEEEIRLGSQDFMGVEQRTMENLGIGDIGEDDLPDTPTSMRNT
        VE+E+FD WV+SH NAKGM  KPF HYDDL+ VF KDRA      TP E+R + +   ++EI                        +++  +  T   + 
Subjt:  VEKEVFDAWVKSHTNAKGMRNKPFSHYDDLAFVFEKDRARGIGVETPMEMRFSAAEQMEEEIRLGSQDFMGVEQRTMENLGIGDIGEDDLPDTPTSMRNT

Query:  SCMSSRCIGSKRKRSSFQTELIDV-------------------KEKYELEVARRKEVVDLLYQIEGLTEHDHVSLIDLLVTDIQKIDCFLQVPPQSRRTY
           SSR  GSKRKRSSFQ E+ID+                    EKYELE+   KEVV+ +Y I+ L E+D V+LIDL+VTDIQK DCFL VP  +R+ Y
Subjt:  SCMSSRCIGSKRKRSSFQTELIDV-------------------KEKYELEVARRKEVVDLLYQIEGLTEHDHVSLIDLLVTDIQKIDCFLQVPPQSRRTY

Query:  CMRLLGR
        C+RLLGR
Subjt:  CMRLLGR

XP_038895773.1 uncharacterized protein LOC120083935 [Benincasa hispida]1.3e-6952.65Show/hide
Query:  RNSKHTWTKVEDARLVESLVSLVHNGWRSDNGTFRPSYLQHLQKMLAEKLPNSCLEQNTIDCKVGTLKKQYNAIAEMLSNACSGFSWNEELKCVEVEKEV
        + SKH W+KVEDA+ VE+L+ LV  GWRSDNGTFR  YLQHL+++  EK+    L QNTI+CKV +LKKQ NA++EMLS   SGF WNEE KCV+VE+E+
Subjt:  RNSKHTWTKVEDARLVESLVSLVHNGWRSDNGTFRPSYLQHLQKMLAEKLPNSCLEQNTIDCKVGTLKKQYNAIAEMLSNACSGFSWNEELKCVEVEKEV

Query:  FDAWVKSHTNAKGMRNKPFSHYDDLAFVFEKDRARGIGVETPMEMRFSAAEQMEEEIRLGSQDFMGVEQRTMENLGIGDIGEDDLPDTPTSMRNTSCMSS
        FD WV+SH NAKGM NKPF HYDDL+ VF K +A G   E P  M  +A  + E+EIRLGSQD    E   M  L                         
Subjt:  FDAWVKSHTNAKGMRNKPFSHYDDLAFVFEKDRARGIGVETPMEMRFSAAEQMEEEIRLGSQDFMGVEQRTMENLGIGDIGEDDLPDTPTSMRNTSCMSS

Query:  RCIGSKRKRSSFQTELIDVKEKYELEVARRKEVVDLLYQIEGLTEHDHVSLIDLLVTDIQKIDCFLQVPPQSRRTYCMRLLGR
                 +S+Q      KEKYELE  RRKEVV+ +Y I+GL E D V+LIDLLVTDIQK +CFL VP  +R+ YC+RLLGR
Subjt:  RCIGSKRKRSSFQTELIDVKEKYELEVARRKEVVDLLYQIEGLTEHDHVSLIDLLVTDIQKIDCFLQVPPQSRRTYCMRLLGR

XP_038896380.1 uncharacterized protein LOC120084641 [Benincasa hispida]1.7e-7150.16Show/hide
Query:  MAGTSRNSKHTWTKVEDARLVESLVSLVHNGWRSDNGTFRPSYLQHLQKMLAEKLPNSCLEQNTIDCKVGTLKKQYNAIAEMLSNACSGFSWNEELKCVE
        MAG+ + SKH W+KVED +LVE+L+ LV  GWRSDNGTFR  YLQ+L+++L EK+P   L QNTI+CKV +LKKQYNA++EMLS   SGF WNEE KCV+
Subjt:  MAGTSRNSKHTWTKVEDARLVESLVSLVHNGWRSDNGTFRPSYLQHLQKMLAEKLPNSCLEQNTIDCKVGTLKKQYNAIAEMLSNACSGFSWNEELKCVE

Query:  VEKEVFDAWVKSHTNAKGMRNKPFSHYDDLAFVFEKDRARGIGVETPMEMRFSAAEQMEEEIRLGSQDFMGVEQRTMENLGIGDIGEDDLPDTPT--SMR
        VEKE+FD WV+SH NAKGM NK F HYDDL+ VF KDRA      TP        E  + E                       + +D++ + P   S  
Subjt:  VEKEVFDAWVKSHTNAKGMRNKPFSHYDDLAFVFEKDRARGIGVETPMEMRFSAAEQMEEEIRLGSQDFMGVEQRTMENLGIGDIGEDDLPDTPT--SMR

Query:  NTSCMSSRCIGSKRKRSSFQTELIDV-------------------KEKYELEVARRKEVVDLLYQIEGLTEHDHVSLIDLLVTDIQKIDCFLQVPPQSRR
          S ++    GSKRKR SFQ E+ID+                   KEKYELE  RRKEVV+ +Y I+GL E D V+ IDLLVTDIQK DCFL VP  +R+
Subjt:  NTSCMSSRCIGSKRKRSSFQTELIDV-------------------KEKYELEVARRKEVVDLLYQIEGLTEHDHVSLIDLLVTDIQKIDCFLQVPPQSRR

Query:  TYCMRLLGR
         YC+ LL R
Subjt:  TYCMRLLGR

XP_038902479.1 uncharacterized protein At2g29880-like [Benincasa hispida]1.6e-5654.75Show/hide
Query:  MAGTSRNSKHTWTKVEDARLVESLVSLVHNGWRSDNGTFRPSYLQHLQKMLAEKLPNSCLEQNTIDCKVGTLKKQYNAIAEMLSNACSGFSWNEELKCVE
        M    + SKH W+KVEDA+LVE+L+ LV  GWRSDNGTFRP YLQHL+++L EK+P   L QNTI+CKV +LKKQYN ++EMLS   SGF WNEE KCV+
Subjt:  MAGTSRNSKHTWTKVEDARLVESLVSLVHNGWRSDNGTFRPSYLQHLQKMLAEKLPNSCLEQNTIDCKVGTLKKQYNAIAEMLSNACSGFSWNEELKCVE

Query:  VEKEVFDAWVKSHTNAKGMRNKPFSHYDDLAFVFEKDRARGIGVETPMEMRFSAAEQMEEEIRLGSQDFMGVEQRTMENLGIGDIGEDDLPDTP----TS
        VE+E+FD WV SH NAK M NKPF HYDD + VF KDR  G   E P  M  +A  + E+EIRLGSQD    E R  E+     + +D++ + P    T 
Subjt:  VEKEVFDAWVKSHTNAKGMRNKPFSHYDDLAFVFEKDRARGIGVETPMEMRFSAAEQMEEEIRLGSQDFMGVEQRTMENLGIGDIGEDDLPDTP----TS

Query:  MRNTSCMSSRCIGSKRKRSSF
          +    SSR  GSKRKR SF
Subjt:  MRNTSCMSSRCIGSKRKRSSF

TrEMBL top hitse value%identityAlignment
A0A1S3B4L3 uncharacterized protein LOC1034859531.4e-5041.31Show/hide
Query:  MAGTSRNSKHTWTKVEDARLVESLVSLVHN-GWRSDNGTFRPSYLQHLQKMLAEKLPNSCL-EQNTIDCKVGTLKKQYNAIAEMLSNACSGFSWNEELKC
        MA  SR  KHTWTK E+ + VE LV LV + GWRSDNGTF+P YL  LQ+M+AEKLP + + E +TIDC V +LKK Y+AIAEM   +CSGF WNEE +C
Subjt:  MAGTSRNSKHTWTKVEDARLVESLVSLVHN-GWRSDNGTFRPSYLQHLQKMLAEKLPNSCL-EQNTIDCKVGTLKKQYNAIAEMLSNACSGFSWNEELKC

Query:  VEVEKEVFDAWVKSHTNAKGMRNKPFSHYDDLAFVFEKDRARGIGVETPMEMRFSAAEQMEEEIRLGSQDFMGVEQRTMENLGIGDIGEDDL----PDTP
        +  E+++FD+W+KSH  AKG+ +K F +YDDL++VF KDRA G   ET   +  + +    + I LG  D    +  TM + G+  +  D++        
Subjt:  VEVEKEVFDAWVKSHTNAKGMRNKPFSHYDDLAFVFEKDRARGIGVETPMEMRFSAAEQMEEEIRLGSQDFMGVEQRTMENLGIGDIGEDDL----PDTP

Query:  TSMRNTSCMSSRCIGSKRKRS--------SFQTELIDV-----KEKYELEVARRKEVVDLLYQIEGLTEHDHVSLIDLLVTDIQKIDCFLQVPPQSRRTY
        +  RN S +S R  GS+R  +         F  E +       KEK  +EV  R +VV  L  I  L   D   L+ +L   ++ I+ FL +P + +  Y
Subjt:  TSMRNTSCMSSRCIGSKRKRS--------SFQTELIDV-----KEKYELEVARRKEVVDLLYQIEGLTEHDHVSLIDLLVTDIQKIDCFLQVPPQSRRTY

Query:  CMRLL
        C  LL
Subjt:  CMRLL

A0A5A7U0H7 Retrotransposon protein1.4e-5041.31Show/hide
Query:  MAGTSRNSKHTWTKVEDARLVESLVSLVHN-GWRSDNGTFRPSYLQHLQKMLAEKLPNSCL-EQNTIDCKVGTLKKQYNAIAEMLSNACSGFSWNEELKC
        MA  SR  KHTWTK E+ + VE LV LV + GWRSDNGTF+P YL  LQ+M+AEKLP + + E +TIDC V +LKK Y+AIAEM   +CSGF WNEE +C
Subjt:  MAGTSRNSKHTWTKVEDARLVESLVSLVHN-GWRSDNGTFRPSYLQHLQKMLAEKLPNSCL-EQNTIDCKVGTLKKQYNAIAEMLSNACSGFSWNEELKC

Query:  VEVEKEVFDAWVKSHTNAKGMRNKPFSHYDDLAFVFEKDRARGIGVETPMEMRFSAAEQMEEEIRLGSQDFMGVEQRTMENLGIGDIGEDDL----PDTP
        +  E+++FD+W+KSH  AKG+ +K F +YDDL++VF KDRA G   ET   +  + +    + I LG  D    +  TM + G+  +  D++        
Subjt:  VEVEKEVFDAWVKSHTNAKGMRNKPFSHYDDLAFVFEKDRARGIGVETPMEMRFSAAEQMEEEIRLGSQDFMGVEQRTMENLGIGDIGEDDL----PDTP

Query:  TSMRNTSCMSSRCIGSKRKRS--------SFQTELIDV-----KEKYELEVARRKEVVDLLYQIEGLTEHDHVSLIDLLVTDIQKIDCFLQVPPQSRRTY
        +  RN S +S R  GS+R  +         F  E +       KEK  +EV  R +VV  L  I  L   D   L+ +L   ++ I+ FL +P + +  Y
Subjt:  TSMRNTSCMSSRCIGSKRKRS--------SFQTELIDV-----KEKYELEVARRKEVVDLLYQIEGLTEHDHVSLIDLLVTDIQKIDCFLQVPPQSRRTY

Query:  CMRLL
        C  LL
Subjt:  CMRLL

A0A5A7UIE9 Retrotransposon protein6.7e-4540.48Show/hide
Query:  MAGTSRNSKHTWTKVEDARLVESLVSLVH-NGWRSDNGTFRPSYLQHLQKMLAEKLPNSCLEQNTIDCKVGTLKKQYNAIAEMLSNACSGFSWNEELKCV
        MA +SR  KH WTK E+A LVE LV LV+  GWRSDNGTFRP YL  L +M+A K+P S +  +TID ++  LK+ ++AIAEM    CSGF WN+E KC+
Subjt:  MAGTSRNSKHTWTKVEDARLVESLVSLVH-NGWRSDNGTFRPSYLQHLQKMLAEKLPNSCLEQNTIDCKVGTLKKQYNAIAEMLSNACSGFSWNEELKCV

Query:  EVEKEVFDAWVKSHTNAKGMRNKPFSHYDDLAFVFEKDRARGIGVETPMEMRFSAAEQMEEEIRLGSQDFMGVEQRTMENLGIGDIGEDDLPDTPTSMRN
          EKEVFD WVKSH  AKG+  K F HYD+L++VF KDRA G   E+  ++  +     E  I   + D    +    + L +        PD     RN
Subjt:  EVEKEVFDAWVKSHTNAKGMRNKPFSHYDDLAFVFEKDRARGIGVETPMEMRFSAAEQMEEEIRLGSQDFMGVEQRTMENLGIGDIGEDDLPDTPTSMRN

Query:  TSCMSSRCI--GSKRKRSSFQTELIDVKEKYELEVARRKEVVDLLYQIEGLTEHDHVSLIDLLVTDIQKIDCFLQVPPQSRRTYCMRLL
              R +  GSKRKR           +  +     R+EVV  L  I  LT  D   L+ +L+ ++  +  FL+VP   +  YC  +L
Subjt:  TSCMSSRCI--GSKRKRSSFQTELIDVKEKYELEVARRKEVVDLLYQIEGLTEHDHVSLIDLLVTDIQKIDCFLQVPPQSRRTYCMRLL

A0A5A7UME4 Retrotransposon protein6.3e-4339.31Show/hide
Query:  MAGTSRNSKHTWTKVEDARLVESLVSLVH-NGWRSDNGTFRPSYLQHLQKMLAEKLPNSCLEQNTIDCKVGTLKKQYNAIAEMLSNACSGFSWNEELKCV
        M  +SR  KHTWTK E+A LVE LV LV+  GWRSDNGTFRP YL  L +M+A K+P S +  +TID ++  +K+ ++A+AEM    CSGF WN+E KC+
Subjt:  MAGTSRNSKHTWTKVEDARLVESLVSLVH-NGWRSDNGTFRPSYLQHLQKMLAEKLPNSCLEQNTIDCKVGTLKKQYNAIAEMLSNACSGFSWNEELKCV

Query:  EVEKEVFDAWVKSHTNAKGMRNKPFSHYDDLAFVFEKDRARG--------IGVETPMEMRFSAAEQMEEEIRLGSQDFMGVEQRTMENLGIGDIGEDDLP
          EKEVFD W  SH  AKG+ NK F HYD+L++VF KDRA G        IG   P      AA+ M +       DF       M + G+ ++  DDL 
Subjt:  EVEKEVFDAWVKSHTNAKGMRNKPFSHYDDLAFVFEKDRARG--------IGVETPMEMRFSAAEQMEEEIRLGSQDFMGVEQRTMENLGIGDIGEDDLP

Query:  DTPTS----MRNTSCMSSRCIGSKRKRSSFQTELIDVKE---KYELE----------------VARRKEVVDLLYQIEGLTEHDHVSLIDLLVTDIQKID
        +T T+     RN S       GSKRKR    T+  D+     +Y  E                   R+E+V  L  I  LT  D   L+ +L+ ++  + 
Subjt:  DTPTS----MRNTSCMSSRCIGSKRKRSSFQTELIDVKE---KYELE----------------VARRKEVVDLLYQIEGLTEHDHVSLIDLLVTDIQKID

Query:  CFLQVPPQSRRTYCMRLL
         FL+VP   +  YC  +L
Subjt:  CFLQVPPQSRRTYCMRLL

A0A5D3CBF7 Retrotransposon protein6.3e-4339.31Show/hide
Query:  MAGTSRNSKHTWTKVEDARLVESLVSLVH-NGWRSDNGTFRPSYLQHLQKMLAEKLPNSCLEQNTIDCKVGTLKKQYNAIAEMLSNACSGFSWNEELKCV
        M  +SR  KHTWTK E+A LVE LV LV+  GWRSDNGTFRP YL  L +M+A K+P S +  +TID ++  +K+ ++A+AEM    CSGF WN+E KC+
Subjt:  MAGTSRNSKHTWTKVEDARLVESLVSLVH-NGWRSDNGTFRPSYLQHLQKMLAEKLPNSCLEQNTIDCKVGTLKKQYNAIAEMLSNACSGFSWNEELKCV

Query:  EVEKEVFDAWVKSHTNAKGMRNKPFSHYDDLAFVFEKDRARG--------IGVETPMEMRFSAAEQMEEEIRLGSQDFMGVEQRTMENLGIGDIGEDDLP
          EKEVFD W  SH  AKG+ NK F HYD+L++VF KDRA G        IG   P      AA+ M +       DF       M + G+ ++  DDL 
Subjt:  EVEKEVFDAWVKSHTNAKGMRNKPFSHYDDLAFVFEKDRARG--------IGVETPMEMRFSAAEQMEEEIRLGSQDFMGVEQRTMENLGIGDIGEDDLP

Query:  DTPTS----MRNTSCMSSRCIGSKRKRSSFQTELIDVKE---KYELE----------------VARRKEVVDLLYQIEGLTEHDHVSLIDLLVTDIQKID
        +T T+     RN S       GSKRKR    T+  D+     +Y  E                   R+E+V  L  I  LT  D   L+ +L+ ++  + 
Subjt:  DTPTS----MRNTSCMSSRCIGSKRKRSSFQTELIDVKE---KYELE----------------VARRKEVVDLLYQIEGLTEHDHVSLIDLLVTDIQKID

Query:  CFLQVPPQSRRTYCMRLL
         FL+VP   +  YC  +L
Subjt:  CFLQVPPQSRRTYCMRLL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G02210.1 unknown protein1.2e-0623.33Show/hide
Query:  TWTKVEDARLVESLVSLVHNGWRSDNGTFRPSYLQHLQKMLAEKLPNSCLEQNTIDCKVGTLKKQYNAIAEMLSNACSGFSWNEELKCVEVEKEVFDAWV
        TW    D   ++ ++     G     G FR      +  +   K   S  + + +  +  +L++Q+NAI  +L +   GF+W+ E + V  +  V+  ++
Subjt:  TWTKVEDARLVESLVSLVHNGWRSDNGTFRPSYLQHLQKMLAEKLPNSCLEQNTIDCKVGTLKKQYNAIAEMLSNACSGFSWNEELKCVEVEKEVFDAWV

Query:  KSHTNAKGMRNKPFSHYDDL
        K+H +A+    +P  +Y DL
Subjt:  KSHTNAKGMRNKPFSHYDDL

AT4G02210.2 unknown protein1.2e-0623.33Show/hide
Query:  TWTKVEDARLVESLVSLVHNGWRSDNGTFRPSYLQHLQKMLAEKLPNSCLEQNTIDCKVGTLKKQYNAIAEMLSNACSGFSWNEELKCVEVEKEVFDAWV
        TW    D   ++ ++     G     G FR      +  +   K   S  + + +  +  +L++Q+NAI  +L +   GF+W+ E + V  +  V+  ++
Subjt:  TWTKVEDARLVESLVSLVHNGWRSDNGTFRPSYLQHLQKMLAEKLPNSCLEQNTIDCKVGTLKKQYNAIAEMLSNACSGFSWNEELKCVEVEKEVFDAWV

Query:  KSHTNAKGMRNKPFSHYDDL
        K+H +A+    +P  +Y DL
Subjt:  KSHTNAKGMRNKPFSHYDDL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAGGTACTTCGCGAAACTCCAAGCATACGTGGACGAAGGTGGAAGATGCGAGGTTGGTGGAGTCACTTGTGTCTTTAGTACACAATGGGTGGCGATCTGACAACGG
GACCTTCAGGCCTAGCTATTTACAACATCTCCAGAAGATGCTAGCTGAGAAATTACCAAATTCATGCCTAGAACAAAACACAATCGATTGCAAGGTCGGAACTCTCAAAA
AACAATACAATGCTATTGCAGAGATGCTTAGTAATGCATGTAGTGGCTTCAGCTGGAACGAAGAGTTGAAGTGTGTTGAGGTAGAGAAGGAGGTGTTTGATGCATGGGTT
AAGAGCCATACAAACGCAAAGGGGATGAGGAATAAGCCATTTTCGCACTATGATGACCTCGCATTTGTCTTTGAAAAAGATAGAGCTAGAGGAATAGGCGTAGAGACCCC
AATGGAAATGAGATTTAGCGCTGCAGAACAAATGGAGGAGGAGATTCGTTTGGGATCACAAGACTTCATGGGAGTGGAACAACGAACAATGGAGAATCTAGGAATTGGTG
ACATAGGGGAAGATGACTTGCCAGACACTCCTACTAGCATGCGTAATACATCTTGCATGTCTTCTAGATGTATTGGAAGCAAAAGAAAACGATCATCCTTCCAAACTGAA
TTAATTGATGTAAAGGAGAAGTATGAGTTGGAGGTTGCACGAAGGAAAGAAGTAGTCGATCTCTTGTATCAAATAGAAGGATTAACTGAGCATGATCATGTCTCCTTGAT
AGACTTGCTTGTGACTGATATCCAGAAAATTGACTGTTTTCTACAGGTTCCACCTCAATCGAGGAGGACATATTGCATGCGTCTACTGGGAAGGACTGGATGA
mRNA sequenceShow/hide mRNA sequence
ATGGCAGGTACTTCGCGAAACTCCAAGCATACGTGGACGAAGGTGGAAGATGCGAGGTTGGTGGAGTCACTTGTGTCTTTAGTACACAATGGGTGGCGATCTGACAACGG
GACCTTCAGGCCTAGCTATTTACAACATCTCCAGAAGATGCTAGCTGAGAAATTACCAAATTCATGCCTAGAACAAAACACAATCGATTGCAAGGTCGGAACTCTCAAAA
AACAATACAATGCTATTGCAGAGATGCTTAGTAATGCATGTAGTGGCTTCAGCTGGAACGAAGAGTTGAAGTGTGTTGAGGTAGAGAAGGAGGTGTTTGATGCATGGGTT
AAGAGCCATACAAACGCAAAGGGGATGAGGAATAAGCCATTTTCGCACTATGATGACCTCGCATTTGTCTTTGAAAAAGATAGAGCTAGAGGAATAGGCGTAGAGACCCC
AATGGAAATGAGATTTAGCGCTGCAGAACAAATGGAGGAGGAGATTCGTTTGGGATCACAAGACTTCATGGGAGTGGAACAACGAACAATGGAGAATCTAGGAATTGGTG
ACATAGGGGAAGATGACTTGCCAGACACTCCTACTAGCATGCGTAATACATCTTGCATGTCTTCTAGATGTATTGGAAGCAAAAGAAAACGATCATCCTTCCAAACTGAA
TTAATTGATGTAAAGGAGAAGTATGAGTTGGAGGTTGCACGAAGGAAAGAAGTAGTCGATCTCTTGTATCAAATAGAAGGATTAACTGAGCATGATCATGTCTCCTTGAT
AGACTTGCTTGTGACTGATATCCAGAAAATTGACTGTTTTCTACAGGTTCCACCTCAATCGAGGAGGACATATTGCATGCGTCTACTGGGAAGGACTGGATGA
Protein sequenceShow/hide protein sequence
MAGTSRNSKHTWTKVEDARLVESLVSLVHNGWRSDNGTFRPSYLQHLQKMLAEKLPNSCLEQNTIDCKVGTLKKQYNAIAEMLSNACSGFSWNEELKCVEVEKEVFDAWV
KSHTNAKGMRNKPFSHYDDLAFVFEKDRARGIGVETPMEMRFSAAEQMEEEIRLGSQDFMGVEQRTMENLGIGDIGEDDLPDTPTSMRNTSCMSSRCIGSKRKRSSFQTE
LIDVKEKYELEVARRKEVVDLLYQIEGLTEHDHVSLIDLLVTDIQKIDCFLQVPPQSRRTYCMRLLGRTG