; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0002096 (gene) of Chayote v1 genome

Gene IDSed0002096
OrganismSechium edule (Chayote v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationLG13:4127322..4130439
RNA-Seq ExpressionSed0002096
SyntenySed0002096
Gene Ontology termsGO:0110165 - cellular anatomical structure (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ERM96107.1 hypothetical protein AMTR_s02760p00000080, partial [Amborella trichopoda]6.0e-5044.92Show/hide
Query:  TKRKLKDTNYLDWNRTIRNHLLS-NELGHMDDKLPKEEVEKKK-WDRVDARLYNQITNSMNDKIVDLVSHCTTVKQLMTYVEFLYSGKEKVTGTYDVCSS
        T  KL  +NYL+W++TIR +L S ++  H+ D  P+E  + +K W R DARL+ QI NS++++++ L++HC  VK+LM Y+EFLYSGK+ ++  YDVC +
Subjt:  TKRKLKDTNYLDWNRTIRNHLLS-NELGHMDDKLPKEEVEKKK-WDRVDARLYNQITNSMNDKIVDLVSHCTTVKQLMTYVEFLYSGKEKVTGTYDVCSS

Query:  YFHADQNGETLMVYYMKHKRNIAKLNSLMPFSADIAVQQTQRENMGVISFLRGLGPEYENAKSQILYGTELPSLDDVFRRLQRTKVNLIKQSNSALLGRS
        ++ A++  ++LM Y+M  K+   +LN L+PFS D+ VQQ QRE M ++SFL GL  E+++AKSQ+L G+++ SL DVF R+ RT+       NSAL+ R+
Subjt:  YFHADQNGETLMVYYMKHKRNIAKLNSLMPFSADIAVQQTQRENMGVISFLRGLGPEYENAKSQILYGTELPSLDDVFRRLQRTKVNLIKQSNSALLGRS

Query:  DLRGGRNMSTPVLKPSSDRRQGSSS--DNRRPDSRVGVTCYYCQKPGRLKRDCRKL
        +    RN STP     S  + G+S   DNR P+S   + C YC+KPG  K +CRKL
Subjt:  DLRGGRNMSTPVLKPSSDRRQGSSS--DNRRPDSRVGVTCYYCQKPGRLKRDCRKL

KAB5551599.1 hypothetical protein DKX38_008910 [Salix brachista]9.5e-4843.35Show/hide
Query:  TSVISHSMESTKRKLKDTNYLDWNRTIRNHLLSNEL-GHMDDKLPKEEVEKKKWDRVDARLYNQITNSMNDKIVDLVSHCTTVKQLMTYVEFLYSGKEKV
        T ++    + T  KL  TNYL+W++TIR +L S E   H+  + P +E EKK W R DARL+ QI NS++ +IV L++HC  VK+LM Y+EFLYSGK  +
Subjt:  TSVISHSMESTKRKLKDTNYLDWNRTIRNHLLSNEL-GHMDDKLPKEEVEKKKWDRVDARLYNQITNSMNDKIVDLVSHCTTVKQLMTYVEFLYSGKEKV

Query:  TGTYDVCSSYFHADQNGETLMVYYMKHKRNIAKLNSLMPFSADIAVQQTQRENMGVISFLRGLGPEYENAKSQILYGTELPSLDDVFRRLQRTKVNLIKQ
        +  YDVC +++ A++  ++L  Y+M  K+   +LN L+PFS DI VQQ QRE M V+SFL GL  E E  KSQIL   E+ SL +VF R+  T+  L  Q
Subjt:  TGTYDVCSSYFHADQNGETLMVYYMKHKRNIAKLNSLMPFSADIAVQQTQRENMGVISFLRGLGPEYENAKSQILYGTELPSLDDVFRRLQRTKVNLIKQ

Query:  SNSALLGRSDLRGGRNMSTPVLKPSSDRRQGSSSDNRRPDSRVGVTCYYCQKPGRLKRDCRKL
         N  L        GR ++          R GS +D+   DS   + CYYC +PG  K+ C+KL
Subjt:  SNSALLGRSDLRGGRNMSTPVLKPSSDRRQGSSSDNRRPDSRVGVTCYYCQKPGRLKRDCRKL

KAF9681460.1 hypothetical protein SADUNF_Sadunf05G0003800 [Salix dunnii]7.3e-4843.61Show/hide
Query:  TSVISHSMESTKRKLKDTNYLDWNRTIRNHLLSNEL-GHMDDKLPKEEVEKKKWDRVDARLYNQITNSMNDKIVDLVSHCTTVKQLMTYVEFLYSGKEKV
        T ++    + T  KL  TNYL+W++TIR +L S E   H+ ++ P +E +KK W R DARL+ QI NS++ +IV L++HC  VK+LM Y+EFLYSGK  +
Subjt:  TSVISHSMESTKRKLKDTNYLDWNRTIRNHLLSNEL-GHMDDKLPKEEVEKKKWDRVDARLYNQITNSMNDKIVDLVSHCTTVKQLMTYVEFLYSGKEKV

Query:  TGTYDVCSSYFHADQNGETLMVYYMKHKRNIAKLNSLMPFSADIAVQQTQRENMGVISFLRGLGPEYENAKSQILYGTELPSLDDVFRRLQRTKVNL-IK
        +  YDVC +++ AD+  ++L  Y+M  K+   +LN L+PFS DI VQQ QRE M V+SFL GL  E E AKSQIL   E+ SL +VF R+ RT+  L I+
Subjt:  TGTYDVCSSYFHADQNGETLMVYYMKHKRNIAKLNSLMPFSADIAVQQTQRENMGVISFLRGLGPEYENAKSQILYGTELPSLDDVFRRLQRTKVNL-IK

Query:  QSNSALL--GRSDLRGGRNMSTPVLKPSSDRRQGSSSDNRRPDSRVGVTCYYCQKPGRLKRDCRKL
         +N+ L+  GRS+   GR ++          R GS   +   +    + CYYC +PG  K+ C+KL
Subjt:  QSNSALL--GRSDLRGGRNMSTPVLKPSSDRRQGSSSDNRRPDSRVGVTCYYCQKPGRLKRDCRKL

KAG8636760.1 hypothetical protein MANES_15G035050v8 [Manihot esculenta]1.2e-4742.03Show/hide
Query:  MADFKQ--TSVISHSMESTKRKLKDTNYLDWNRTIRNHLLSNELGHMDDKLPKEEVEKKKWDRVDARLYNQITNSMNDKIVDLVSHCTTVKQLMTYVEFL
        MA+ K   T VI    + T+ KL  +N+LDW++TIR +L S  +     K P  +  ++ W R DARL+ QI NS++ +++ L+++C  VK+LM Y++FL
Subjt:  MADFKQ--TSVISHSMESTKRKLKDTNYLDWNRTIRNHLLSNELGHMDDKLPKEEVEKKKWDRVDARLYNQITNSMNDKIVDLVSHCTTVKQLMTYVEFL

Query:  YSGKEKVTGTYDVCSSYFHADQNGETLMVYYMKHKRNIAKLNSLMPFSADIAVQQTQRENMGVISFLRGLGPEYENAKSQILYGTELPSLDDVFRRLQRT
        YS KE ++  YDVC +++   +N +TL  Y+M  KR   +LN LMPFS D+  QQ QRE M V+SFL GL  E+E+AKS IL+ +E+ SL DVF R+ RT
Subjt:  YSGKEKVTGTYDVCSSYFHADQNGETLMVYYMKHKRNIAKLNSLMPFSADIAVQQTQRENMGVISFLRGLGPEYENAKSQILYGTELPSLDDVFRRLQRT

Query:  KVNLIKQSNSALLGRSDL-----RGGRNMSTPVLKPSS-DRRQGSSSDNRRPDSRVGVTCYYCQKPGRLKRDCRKL
        K  +     SAL+ R+D      RGG+       K S    + GS+SD+       G+ CYYC++PG  K+ C+KL
Subjt:  KVNLIKQSNSALLGRSDL-----RGGRNMSTPVLKPSS-DRRQGSSSDNRRPDSRVGVTCYYCQKPGRLKRDCRKL

XP_031744753.1 uncharacterized protein LOC101212255 isoform X1 [Cucumis sativus]8.3e-5244.2Show/hide
Query:  MADFKQ---TSVISHSMESTKRKLKDTNYLDWNRTIRNHLLSNEL-GHMDDKLPKEEVEKKKWDRVDARLYNQITNSMNDKIVDLVSHCTTVKQLMTYVE
        MAD K    ++VI  + + T+ KL  +NY DW RTI  +L S ++  HM +  PK+  +KK W R DARLY QI NS+  +I+ LV HC +VK+L+ +++
Subjt:  MADFKQ---TSVISHSMESTKRKLKDTNYLDWNRTIRNHLLSNEL-GHMDDKLPKEEVEKKKWDRVDARLYNQITNSMNDKIVDLVSHCTTVKQLMTYVE

Query:  FLYSGKEKVTGTYDVCSSYFHADQNGETLMVYYMKHKRNIAKLNSLMPFSADIAVQQTQRENMGVISFLRGLGPEYENAKSQILYGTELPSLDDVFRRLQ
        FLYSGKE+V   ++VC  +F A+Q  E++  Y+M+ K+ IA+L  L+PFS D+ VQQ QRE M V+ FL GL PE+  AK+QIL  +++PSLDD F R+ 
Subjt:  FLYSGKEKVTGTYDVCSSYFHADQNGETLMVYYMKHKRNIAKLNSLMPFSADIAVQQTQRENMGVISFLRGLGPEYENAKSQILYGTELPSLDDVFRRLQ

Query:  RTKVN----LIKQSNSALLGRSDLRGGRNMSTPVLKPSSDRRQGSSSDNRRPDSRVGVTCYYCQKPGRLKRDCRKL
        R + +     I Q +SAL  +++     N   P         Q +S+D+R+P+S V + C YC+KPG +KRDCRKL
Subjt:  RTKVN----LIKQSNSALLGRSDLRGGRNMSTPVLKPSSDRRQGSSSDNRRPDSRVGVTCYYCQKPGRLKRDCRKL

TrEMBL top hitse value%identityAlignment
A0A5N5JC74 Uncharacterized protein1.8e-4742.8Show/hide
Query:  TSVISHSMESTKRKLKDTNYLDWNRTIRNHLLSNEL-GHMDDKLPKEEVEKKKWDRVDARLYNQITNSMNDKIVDLVSHCTTVKQLMTYVEFLYSGKEKV
        T ++    + T  KL  TNYL+W++TIR +L S E   H+ ++ P +E +KK W R DARL+ QI NS++ +IV L++HC  VK+LM Y+EFLYSGK  +
Subjt:  TSVISHSMESTKRKLKDTNYLDWNRTIRNHLLSNEL-GHMDDKLPKEEVEKKKWDRVDARLYNQITNSMNDKIVDLVSHCTTVKQLMTYVEFLYSGKEKV

Query:  TGTYDVCSSYFHADQNGETLMVYYMKHKRNIAKLNSLMPFSADIAVQQTQRENMGVISFLRGLGPEYENAKSQILYGTELPSLDDVFRRLQRTKVNL-IK
        +  YDVC +++ A++  ++L  Y+M  K+   +LN L+PFS DI VQQ QRE M V+SFL GL  E E  KSQIL   E+ SL +VF R+ RT+  L I+
Subjt:  TGTYDVCSSYFHADQNGETLMVYYMKHKRNIAKLNSLMPFSADIAVQQTQRENMGVISFLRGLGPEYENAKSQILYGTELPSLDDVFRRLQRTKVNL-IK

Query:  QSNSALLGRSDLRGGRNMSTPVLKPSSDRRQGSSSDNRRPDSRVGVTCYYCQKPGRLKRDCRKL
         +N+ LL +     GR   T       + R GS + +   +    + CYYC +PG  K+ C+KL
Subjt:  QSNSALLGRSDLRGGRNMSTPVLKPSSDRRQGSSSDNRRPDSRVGVTCYYCQKPGRLKRDCRKL

A0A5N5JJ99 Uncharacterized protein1.8e-4742.8Show/hide
Query:  TSVISHSMESTKRKLKDTNYLDWNRTIRNHLLSNEL-GHMDDKLPKEEVEKKKWDRVDARLYNQITNSMNDKIVDLVSHCTTVKQLMTYVEFLYSGKEKV
        T ++    + T  KL  TNYL+W++TIR +L S E   H+ ++ P +E +KK W R DARL+ QI NS++ +IV L++HC  VK+LM Y+EFLYSGK  +
Subjt:  TSVISHSMESTKRKLKDTNYLDWNRTIRNHLLSNEL-GHMDDKLPKEEVEKKKWDRVDARLYNQITNSMNDKIVDLVSHCTTVKQLMTYVEFLYSGKEKV

Query:  TGTYDVCSSYFHADQNGETLMVYYMKHKRNIAKLNSLMPFSADIAVQQTQRENMGVISFLRGLGPEYENAKSQILYGTELPSLDDVFRRLQRTKVNL-IK
        +  YDVC +++ A++  ++L  Y+M  K+   +LN L+PFS DI VQQ QRE M V+SFL GL  E E  KSQIL   E+ SL +VF R+ RT+  L I+
Subjt:  TGTYDVCSSYFHADQNGETLMVYYMKHKRNIAKLNSLMPFSADIAVQQTQRENMGVISFLRGLGPEYENAKSQILYGTELPSLDDVFRRLQRTKVNL-IK

Query:  QSNSALLGRSDLRGGRNMSTPVLKPSSDRRQGSSSDNRRPDSRVGVTCYYCQKPGRLKRDCRKL
         +N+ LL +     GR   T       + R GS + +   +    + CYYC +PG  K+ C+KL
Subjt:  QSNSALLGRSDLRGGRNMSTPVLKPSSDRRQGSSSDNRRPDSRVGVTCYYCQKPGRLKRDCRKL

A0A5N5KU30 Uncharacterized protein3.9e-4742.42Show/hide
Query:  TSVISHSMESTKRKLKDTNYLDWNRTIRNHLLS-NELGHMDDKLPKEEVEKKKWDRVDARLYNQITNSMNDKIVDLVSHCTTVKQLMTYVEFLYSGKEKV
        T ++    + T  KL  TNYL+W++TIR +L S  +  H+ ++ P +E +KK W R DARL+ QI NS++ +IV L++HC  VK+LM Y+EFLYSGK  +
Subjt:  TSVISHSMESTKRKLKDTNYLDWNRTIRNHLLS-NELGHMDDKLPKEEVEKKKWDRVDARLYNQITNSMNDKIVDLVSHCTTVKQLMTYVEFLYSGKEKV

Query:  TGTYDVCSSYFHADQNGETLMVYYMKHKRNIAKLNSLMPFSADIAVQQTQRENMGVISFLRGLGPEYENAKSQILYGTELPSLDDVFRRLQRTKVNL-IK
        +  YDVC +++ A++  ++L  Y+M  K+   +LN L+PFS DI VQQ QRE M V+SFL GL  E E  KSQIL   E+ SL +VF R+ RT+  L I+
Subjt:  TGTYDVCSSYFHADQNGETLMVYYMKHKRNIAKLNSLMPFSADIAVQQTQRENMGVISFLRGLGPEYENAKSQILYGTELPSLDDVFRRLQRTKVNL-IK

Query:  QSNSALLGRSDLRGGRNMSTPVLKPSSDRRQGSSSDNRRPDSRVGVTCYYCQKPGRLKRDCRKL
         +N+ LL +     GR   T       + R GS + +   +    + CYYC +PG  K+ C+KL
Subjt:  QSNSALLGRSDLRGGRNMSTPVLKPSSDRRQGSSSDNRRPDSRVGVTCYYCQKPGRLKRDCRKL

A0A5N5M9B2 Uncharacterized protein4.6e-4843.35Show/hide
Query:  TSVISHSMESTKRKLKDTNYLDWNRTIRNHLLSNEL-GHMDDKLPKEEVEKKKWDRVDARLYNQITNSMNDKIVDLVSHCTTVKQLMTYVEFLYSGKEKV
        T ++    + T  KL  TNYL+W++TIR +L S E   H+  + P +E EKK W R DARL+ QI NS++ +IV L++HC  VK+LM Y+EFLYSGK  +
Subjt:  TSVISHSMESTKRKLKDTNYLDWNRTIRNHLLSNEL-GHMDDKLPKEEVEKKKWDRVDARLYNQITNSMNDKIVDLVSHCTTVKQLMTYVEFLYSGKEKV

Query:  TGTYDVCSSYFHADQNGETLMVYYMKHKRNIAKLNSLMPFSADIAVQQTQRENMGVISFLRGLGPEYENAKSQILYGTELPSLDDVFRRLQRTKVNLIKQ
        +  YDVC +++ A++  ++L  Y+M  K+   +LN L+PFS DI VQQ QRE M V+SFL GL  E E  KSQIL   E+ SL +VF R+  T+  L  Q
Subjt:  TGTYDVCSSYFHADQNGETLMVYYMKHKRNIAKLNSLMPFSADIAVQQTQRENMGVISFLRGLGPEYENAKSQILYGTELPSLDDVFRRLQRTKVNLIKQ

Query:  SNSALLGRSDLRGGRNMSTPVLKPSSDRRQGSSSDNRRPDSRVGVTCYYCQKPGRLKRDCRKL
         N  L        GR ++          R GS +D+   DS   + CYYC +PG  K+ C+KL
Subjt:  SNSALLGRSDLRGGRNMSTPVLKPSSDRRQGSSSDNRRPDSRVGVTCYYCQKPGRLKRDCRKL

U5CZW1 Uncharacterized protein (Fragment)2.9e-5044.92Show/hide
Query:  TKRKLKDTNYLDWNRTIRNHLLS-NELGHMDDKLPKEEVEKKK-WDRVDARLYNQITNSMNDKIVDLVSHCTTVKQLMTYVEFLYSGKEKVTGTYDVCSS
        T  KL  +NYL+W++TIR +L S ++  H+ D  P+E  + +K W R DARL+ QI NS++++++ L++HC  VK+LM Y+EFLYSGK+ ++  YDVC +
Subjt:  TKRKLKDTNYLDWNRTIRNHLLS-NELGHMDDKLPKEEVEKKK-WDRVDARLYNQITNSMNDKIVDLVSHCTTVKQLMTYVEFLYSGKEKVTGTYDVCSS

Query:  YFHADQNGETLMVYYMKHKRNIAKLNSLMPFSADIAVQQTQRENMGVISFLRGLGPEYENAKSQILYGTELPSLDDVFRRLQRTKVNLIKQSNSALLGRS
        ++ A++  ++LM Y+M  K+   +LN L+PFS D+ VQQ QRE M ++SFL GL  E+++AKSQ+L G+++ SL DVF R+ RT+       NSAL+ R+
Subjt:  YFHADQNGETLMVYYMKHKRNIAKLNSLMPFSADIAVQQTQRENMGVISFLRGLGPEYENAKSQILYGTELPSLDDVFRRLQRTKVNLIKQSNSALLGRS

Query:  DLRGGRNMSTPVLKPSSDRRQGSSS--DNRRPDSRVGVTCYYCQKPGRLKRDCRKL
        +    RN STP     S  + G+S   DNR P+S   + C YC+KPG  K +CRKL
Subjt:  DLRGGRNMSTPVLKPSSDRRQGSSS--DNRRPDSRVGVTCYYCQKPGRLKRDCRKL

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-943.8e-0724.39Show/hide
Query:  DTNYLDWNRTIRNHLLSNELG---HMDDKLPKEEVEKKKWDRVDARLYNQITNSMNDKIVDLVSHCTTVKQLMTYVEFLYSGKEKVTGTYDVCSSYFHAD
        D  +  W R +R+ L+   L     +D K P + ++ + W  +D R  + I   ++D +V+ +    T + + T +E LY  K      Y     Y    
Subjt:  DTNYLDWNRTIRNHLLSNELG---HMDDKLPKEEVEKKKWDRVDARLYNQITNSMNDKIVDLVSHCTTVKQLMTYVEFLYSGKEKVTGTYDVCSSYFHAD

Query:  QNGETLMVYYMKHKRNIAKLNSLMPFSADIAVQQTQRENMGVISFLRGLGPEYENAKSQILYGTELPSLDDVFRRLQRTKVNLIKQSNSALLGRSDLRGG
          G   +        ++   N L+   A++ V+  + +    I  L  L   Y+N  + IL+G     L DV   L   +    K  N    G++ +  G
Subjt:  QNGETLMVYYMKHKRNIAKLNSLMPFSADIAVQQTQRENMGVISFLRGLGPEYENAKSQILYGTELPSLDDVFRRLQRTKVNLIKQSNSALLGRSDLRGG

Query:  RNMSTPVLKPSSDRRQGSSSDNRRPDSRVGVTCYYCQKPGRLKRDC
        R  S      +  R         R  SRV   CY C +PG  KRDC
Subjt:  RNMSTPVLKPSSDRRQGSSSDNRRPDSRVGVTCYYCQKPGRLKRDC

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTGATTTTAAGCAAACAAGCGTGATTTCGCATTCGATGGAAAGCACGAAAAGGAAGTTAAAAGACACTAATTATCTAGATTGGAATCGGACGATTCGCAATCACCT
GTTGAGTAATGAATTAGGACACATGGATGATAAACTGCCTAAAGAAGAAGTTGAGAAGAAAAAGTGGGACCGGGTTGATGCACGTCTGTATAATCAGATTACTAACTCGA
TGAATGACAAAATCGTGGATCTTGTTAGTCATTGCACTACAGTAAAACAACTTATGACGTATGTGGAGTTTTTGTACTCCGGGAAGGAAAAAGTAACGGGGACGTATGAC
GTTTGTTCGTCCTACTTTCATGCTGATCAGAATGGTGAGACATTGATGGTCTACTACATGAAGCACAAGAGAAATATTGCTAAGTTAAATTCCCTTATGCCATTCAGTGC
TGATATTGCGGTTCAACAGACTCAAAGAGAAAATATGGGAGTTATCAGCTTCCTACGTGGTCTGGGTCCGGAATATGAGAATGCCAAGTCTCAAATTCTGTATGGAACTG
AACTACCATCATTGGATGATGTGTTTCGTCGACTACAACGAACAAAGGTGAATTTGATTAAACAGTCCAATAGTGCTTTGCTTGGGAGAAGTGACCTCCGTGGTGGGAGA
AATATGAGTACTCCCGTATTGAAACCTAGTTCTGATCGACGCCAAGGTTCTAGTAGTGACAACCGGCGTCCGGATTCTCGAGTGGGAGTGACCTGTTATTACTGTCAGAA
ACCAGGTCGCTTAAAACGAGATTGCAGAAAACTGTAA
mRNA sequenceShow/hide mRNA sequence
TTTTTCTTTTATTTTCCTCTTTTTAGTTTGTATTCTGCTGAACTTTCTACTCCGGATCGTCTTCTAACCTCTGAGCAGGAAAAAGGGGTTGACTGCCGCCGCTGCCGCCG
TTGACCGTCACCACCACCGGGAAATCGCAGCCCAGCCGCCGATCTACACGGTCGTGGAAGCCGCGACTTCGTTCAGTAAAAAACGAAGCCGCACGGGCCGCCGGAGCTCT
CCGCCAGCTCAAAATCGCCGTCGTCGTCGCTCCGATCGAGGATTTTTTCGGTGGGTTTTGCTCGCGCCGAGGAGTAGTTTAAGTATGTCGAAGTTTCAACTTGATCCAAC
GGTGGGATTGCTCGGAAACCTCTCGGGTTTTTTTGCTTTACTGTTTTAAGGAGATGCTTTTTTTGAAAATTCACTGCTTGAAGGCTGTTTTGAGACTGTTTTTGGGCTGT
TTGTTCTACTTTTTCGTTTAGTATACATGGCTGATTTTAAGCAAACAAGCGTGATTTCGCATTCGATGGAAAGCACGAAAAGGAAGTTAAAAGACACTAATTATCTAGAT
TGGAATCGGACGATTCGCAATCACCTGTTGAGTAATGAATTAGGACACATGGATGATAAACTGCCTAAAGAAGAAGTTGAGAAGAAAAAGTGGGACCGGGTTGATGCACG
TCTGTATAATCAGATTACTAACTCGATGAATGACAAAATCGTGGATCTTGTTAGTCATTGCACTACAGTAAAACAACTTATGACGTATGTGGAGTTTTTGTACTCCGGGA
AGGAAAAAGTAACGGGGACGTATGACGTTTGTTCGTCCTACTTTCATGCTGATCAGAATGGTGAGACATTGATGGTCTACTACATGAAGCACAAGAGAAATATTGCTAAG
TTAAATTCCCTTATGCCATTCAGTGCTGATATTGCGGTTCAACAGACTCAAAGAGAAAATATGGGAGTTATCAGCTTCCTACGTGGTCTGGGTCCGGAATATGAGAATGC
CAAGTCTCAAATTCTGTATGGAACTGAACTACCATCATTGGATGATGTGTTTCGTCGACTACAACGAACAAAGGTGAATTTGATTAAACAGTCCAATAGTGCTTTGCTTG
GGAGAAGTGACCTCCGTGGTGGGAGAAATATGAGTACTCCCGTATTGAAACCTAGTTCTGATCGACGCCAAGGTTCTAGTAGTGACAACCGGCGTCCGGATTCTCGAGTG
GGAGTGACCTGTTATTACTGTCAGAAACCAGGTCGCTTAAAACGAGATTGCAGAAAACTGTAATATGATAGTCAGGAGCATCAGAAAGCCAATGTTGTATCTAATGAGAA
AGCCAATGTTGATGAGCTTGCTAAACTCCGGGTGTCTTCTACTTCATCACAGCCATTGTCCTCCGAATTCC
Protein sequenceShow/hide protein sequence
MADFKQTSVISHSMESTKRKLKDTNYLDWNRTIRNHLLSNELGHMDDKLPKEEVEKKKWDRVDARLYNQITNSMNDKIVDLVSHCTTVKQLMTYVEFLYSGKEKVTGTYD
VCSSYFHADQNGETLMVYYMKHKRNIAKLNSLMPFSADIAVQQTQRENMGVISFLRGLGPEYENAKSQILYGTELPSLDDVFRRLQRTKVNLIKQSNSALLGRSDLRGGR
NMSTPVLKPSSDRRQGSSSDNRRPDSRVGVTCYYCQKPGRLKRDCRKL