CuGenDBv2

Moc07g04670 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc07g04670
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: chr7:3986084..3987661
RNA-Seq Expression: Moc07g04670
Synteny: Moc07g04670
Gene Ontology terms: GO:0003676 - nucleic acid binding (molecular function)
                     GO:0008270 - zinc ion binding (molecular function)
InterPro domains: IPR036875 - Zinc finger, CCHC-type superfamily
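
The genome location above can be split into a chromosome name and a coordinate pair. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state the convention); the function name `parse_location` is hypothetical and not part of any CuGenDB tooling.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end, span).

    Assumes 1-based inclusive coordinates, so span = end - start + 1.
    """
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return chrom, start, end, end - start + 1


print(parse_location("chr7:3986084..3987661"))
# ('chr7', 3986084, 3987661, 1578)
```

Under that assumption the locus spans 1578 bp on chr7.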


Homology
GenBank top hits (e value, % identity, alignment)
RVW14266.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]  e value: 1.2e-26  % identity: 33.57
Query:  TSMNSHINEVTDLMKKLEAMEITFLEDVKAIKLLYSLPDN---------------------ICDAAIAEKIRTKKSG-------------------KHST
        T +  H+NE+  ++ +L AM+ITF ++++A+ LL SLP++                     +  + + E+ R K SG                   K S 
Subjt:  TSMNSHINEVTDLMKKLEAMEITFLEDVKAIKLLYSLPDN---------------------ICDAAIAEKIRTKKSG-------------------KHST

Query:  STFRSEKHFESVS-----CFYCHKKGHVKRFCRKLE-EDQGKE--------DSSNYLIAEVLLASAEISTTSIEQASEKL--------------SFTSFT
        +  +S     S+S     C+YCHKKGH+KR CRKL+ ++Q KE        D++     E+++   ++S   I Q ++ +               FTS++
Subjt:  STFRSEKHFESVS-----CFYCHKKGHVKRFCRKLE-EDQGKE--------DSSNYLIAEVLLASAEISTTSIEQASEKL--------------SFTSFT

Query:  TGSFGMVRMGNNRLFKIRGIGDVNLKTDSRTKLFFSDVTYVLKFKRNLISVGKFDEEGYSSEFSDGSRKLVKGSEVVAVGHKDASM
         G FG VRMGN  + KI G+GD+ L+T++  KL   DV +V   + NLIS GK D+EGY++ FSDG  KL KGS VVA G K  S+
Subjt:  TGSFGMVRMGNNRLFKIRGIGDVNLKTDSRTKLFFSDVTYVLKFKRNLISVGKFDEEGYSSEFSDGSRKLVKGSEVVAVGHKDASM

RVW84195.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]  e value: 1.2e-26  % identity: 33.57
Query:  TSMNSHINEVTDLMKKLEAMEITFLEDVKAIKLLYSLPDN---------------------ICDAAIAEKIRTKKSG-------------------KHST
        T +  H+NE+  ++ +L AM+ITF ++++A+ LL SLP++                     +  + + E+ R K SG                   K S 
Subjt:  TSMNSHINEVTDLMKKLEAMEITFLEDVKAIKLLYSLPDN---------------------ICDAAIAEKIRTKKSG-------------------KHST

Query:  STFRSEKHFESVS-----CFYCHKKGHVKRFCRKLE-EDQGKE--------DSSNYLIAEVLLASAEISTTSIEQASEKL--------------SFTSFT
        +  +S     S+S     C+YCHKKGH+KR CRKL+ ++Q KE        D++     E+++   ++S   I Q ++ +               FTS++
Subjt:  STFRSEKHFESVS-----CFYCHKKGHVKRFCRKLE-EDQGKE--------DSSNYLIAEVLLASAEISTTSIEQASEKL--------------SFTSFT

Query:  TGSFGMVRMGNNRLFKIRGIGDVNLKTDSRTKLFFSDVTYVLKFKRNLISVGKFDEEGYSSEFSDGSRKLVKGSEVVAVGHKDASM
         G FG VRMGN  + KI G+GD+ L+T++  KL   DV +V   + NLIS GK D+EGY++ FSDG  KL KGS VVA G K  S+
Subjt:  TGSFGMVRMGNNRLFKIRGIGDVNLKTDSRTKLFFSDVTYVLKFKRNLISVGKFDEEGYSSEFSDGSRKLVKGSEVVAVGHKDASM

RVW94144.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]  e value: 4.5e-26  % identity: 31.69
Query:  WK---MQVTDFLTYKKIHKTLKERSATKMTDREWTKMDETSMNSHINEVTDLMKKLEAMEITFLEDVKAIKLLYSLPDN---------------------
        WK    +   F+    ++K    R       +E T + E     H+NE+  ++ +L AM+ITF ++++A+ LL SLP++                     
Subjt:  WK---MQVTDFLTYKKIHKTLKERSATKMTDREWTKMDETSMNSHINEVTDLMKKLEAMEITFLEDVKAIKLLYSLPDN---------------------

Query:  ICDAAIAEKIRTKKSG-------------------KHSTSTFRSEKHFESVS-----CFYCHKKGHVKRFCRKLE-EDQGKE--------DSSNYLIAEV
        +  + + E+ R K SG                   K S +  +S     S+S     C+YCHKKGH+KR CRKL+ ++Q KE        D++     E+
Subjt:  ICDAAIAEKIRTKKSG-------------------KHSTSTFRSEKHFESVS-----CFYCHKKGHVKRFCRKLE-EDQGKE--------DSSNYLIAEV

Query:  LLASAEISTTSIEQASEKL--------------SFTSFTTGSFGMVRMGNNRLFKIRGIGDVNLKTDSRTKLFFSDVTYVLKFKRNLISVGKFDEEGYSS
        ++   ++S   I Q ++ +               FTS++ G FG VRMGN  + KI G+GD+ L+T++  KL   DV +V   + NLIS GK D+EGY++
Subjt:  LLASAEISTTSIEQASEKL--------------SFTSFTTGSFGMVRMGNNRLFKIRGIGDVNLKTDSRTKLFFSDVTYVLKFKRNLISVGKFDEEGYSS

Query:  EFSDGSRKLVKGSEVVAVGHKDASM
         FSDG  KL KGS VVA G K  S+
Subjt:  EFSDGSRKLVKGSEVVAVGHKDASM
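
The middle line of each alignment block marks identical residues with the residue letter and conservative substitutions with `+`. As a rough sketch of how a percent identity can be derived from such a line, the hypothetical helper below counts those marks for a single segment. It assumes the midline has already had its 8-character left margin removed, and it is not necessarily how the database computes its reported values, which cover the whole alignment rather than one segment.

```python
def midline_stats(midline):
    """Count identities, positives and columns in a BLAST-style midline.

    Letters mark identical residues, '+' marks conservative substitutions,
    and spaces mark mismatches or gaps. Positives include identities.
    """
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return identities, positives, len(midline)


# Midline of the final segment shown above (query EFSDGSRKLVKGSEVVAVGHKDASM).
segment = " FSDG  KL KGS VVA G K  S+"
identities, positives, columns = midline_stats(segment)
print(identities, positives, columns)
# 15 16 25
print(round(100 * identities / columns, 2))
# 60.0
```

The 60% figure applies only to this 25-column segment; summing the counts over every segment of a hit would be needed to compare against the reported % identity.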

TXG57032.1 hypothetical protein EZV62_018345 [Acer yangbiense]  e value: 4.2e-24  % identity: 32.07
Query:  DETSMNSHINEVTDLMKKLEAMEITFLEDVKAIKLLYSLPD-------NICDAA--------------IAEKIRTKKSGKHSTSTFRSEKH---------
        D T ++ H+N    ++ +L  M I F ++V+ + LL +LPD       ++C++A              + E++R K  G   +    +EK          
Subjt:  DETSMNSHINEVTDLMKKLEAMEITFLEDVKAIKLLYSLPD-------NICDAA--------------IAEKIRTKKSGKHSTSTFRSEKH---------

Query:  -----------FESVSCFYCHKKGHVKRFCRKLEED----QGKE---DSSN-----------YLIA----EVLLASAEI-----STTSIEQASEKLSFTS
                   F +V C++C +KGH+K++CR+L+ D    +GKE   D SN           +L+      V LA  E      S  SI   S++  F S
Subjt:  -----------FESVSCFYCHKKGHVKRFCRKLEED----QGKE---DSSN-----------YLIA----EVLLASAEI-----STTSIEQASEKLSFTS

Query:  FTTGSFGMVRMGNNRLFKIRGIGDVNLKTDSRTKLFFSDVTYVLKFKRNLISVGKFDEEGYSSEFSDGSRKLVKGSEVVAVGHKDASMRF
        +T+G FG V+MGNN + K  G+GDV L+T++   L   +V ++   + NLIS GK D+EG+ + FSDG  KL KGS +VA G K +S+ F
Subjt:  FTTGSFGMVRMGNNRLFKIRGIGDVNLKTDSRTKLFFSDVTYVLKFKRNLISVGKFDEEGYSSEFSDGSRKLVKGSEVVAVGHKDASMRF

TXG65186.1 hypothetical protein EZV62_006461 [Acer yangbiense]  e value: 5.5e-24  % identity: 32.07
Query:  DETSMNSHINEVTDLMKKLEAMEITFLEDVKAIKLLYSLPD-------NICDAA--------------IAEKIRTKKSGKHSTSTFRSEKH---------
        D T ++ H+N    ++ +L  M I F ++V+ + LL +LPD       ++C++A              + E++R K  G   +    +EK          
Subjt:  DETSMNSHINEVTDLMKKLEAMEITFLEDVKAIKLLYSLPD-------NICDAA--------------IAEKIRTKKSGKHSTSTFRSEKH---------

Query:  -----------FESVSCFYCHKKGHVKRFCRKLEED----QGKE---DSSN-----------YLIA----EVLLASAEI-----STTSIEQASEKLSFTS
                   F +V C++C +KGH+K++CR+L+ D    +GKE   D SN           +L+      V LA  E      S  SI   S +  F S
Subjt:  -----------FESVSCFYCHKKGHVKRFCRKLEED----QGKE---DSSN-----------YLIA----EVLLASAEI-----STTSIEQASEKLSFTS

Query:  FTTGSFGMVRMGNNRLFKIRGIGDVNLKTDSRTKLFFSDVTYVLKFKRNLISVGKFDEEGYSSEFSDGSRKLVKGSEVVAVGHKDASMRF
        +T+G+FG V+MGNN + K  G+GDV L+T++   L   +V ++   + NLIS GK D+EG+ + FSDG  KL KGS +VA G K +S+ F
Subjt:  FTTGSFGMVRMGNNRLFKIRGIGDVNLKTDSRTKLFFSDVTYVLKFKRNLISVGKFDEEGYSSEFSDGSRKLVKGSEVVAVGHKDASMRF

TrEMBL top hits (e value, % identity, alignment)
A0A438BTH6 Retrovirus-related Pol polyprotein from transposon TNT 1-94  e value: 5.7e-27  % identity: 33.57
Query:  TSMNSHINEVTDLMKKLEAMEITFLEDVKAIKLLYSLPDN---------------------ICDAAIAEKIRTKKSG-------------------KHST
        T +  H+NE+  ++ +L AM+ITF ++++A+ LL SLP++                     +  + + E+ R K SG                   K S 
Subjt:  TSMNSHINEVTDLMKKLEAMEITFLEDVKAIKLLYSLPDN---------------------ICDAAIAEKIRTKKSG-------------------KHST

Query:  STFRSEKHFESVS-----CFYCHKKGHVKRFCRKLE-EDQGKE--------DSSNYLIAEVLLASAEISTTSIEQASEKL--------------SFTSFT
        +  +S     S+S     C+YCHKKGH+KR CRKL+ ++Q KE        D++     E+++   ++S   I Q ++ +               FTS++
Subjt:  STFRSEKHFESVS-----CFYCHKKGHVKRFCRKLE-EDQGKE--------DSSNYLIAEVLLASAEISTTSIEQASEKL--------------SFTSFT

Query:  TGSFGMVRMGNNRLFKIRGIGDVNLKTDSRTKLFFSDVTYVLKFKRNLISVGKFDEEGYSSEFSDGSRKLVKGSEVVAVGHKDASM
         G FG VRMGN  + KI G+GD+ L+T++  KL   DV +V   + NLIS GK D+EGY++ FSDG  KL KGS VVA G K  S+
Subjt:  TGSFGMVRMGNNRLFKIRGIGDVNLKTDSRTKLFFSDVTYVLKFKRNLISVGKFDEEGYSSEFSDGSRKLVKGSEVVAVGHKDASM

A0A438HI91 Retrovirus-related Pol polyprotein from transposon TNT 1-94  e value: 5.7e-27  % identity: 33.57
Query:  TSMNSHINEVTDLMKKLEAMEITFLEDVKAIKLLYSLPDN---------------------ICDAAIAEKIRTKKSG-------------------KHST
        T +  H+NE+  ++ +L AM+ITF ++++A+ LL SLP++                     +  + + E+ R K SG                   K S 
Subjt:  TSMNSHINEVTDLMKKLEAMEITFLEDVKAIKLLYSLPDN---------------------ICDAAIAEKIRTKKSG-------------------KHST

Query:  STFRSEKHFESVS-----CFYCHKKGHVKRFCRKLE-EDQGKE--------DSSNYLIAEVLLASAEISTTSIEQASEKL--------------SFTSFT
        +  +S     S+S     C+YCHKKGH+KR CRKL+ ++Q KE        D++     E+++   ++S   I Q ++ +               FTS++
Subjt:  STFRSEKHFESVS-----CFYCHKKGHVKRFCRKLE-EDQGKE--------DSSNYLIAEVLLASAEISTTSIEQASEKL--------------SFTSFT

Query:  TGSFGMVRMGNNRLFKIRGIGDVNLKTDSRTKLFFSDVTYVLKFKRNLISVGKFDEEGYSSEFSDGSRKLVKGSEVVAVGHKDASM
         G FG VRMGN  + KI G+GD+ L+T++  KL   DV +V   + NLIS GK D+EGY++ FSDG  KL KGS VVA G K  S+
Subjt:  TGSFGMVRMGNNRLFKIRGIGDVNLKTDSRTKLFFSDVTYVLKFKRNLISVGKFDEEGYSSEFSDGSRKLVKGSEVVAVGHKDASM

A0A438IBT7 Retrovirus-related Pol polyprotein from transposon TNT 1-94  e value: 2.2e-26  % identity: 31.69
Query:  WK---MQVTDFLTYKKIHKTLKERSATKMTDREWTKMDETSMNSHINEVTDLMKKLEAMEITFLEDVKAIKLLYSLPDN---------------------
        WK    +   F+    ++K    R       +E T + E     H+NE+  ++ +L AM+ITF ++++A+ LL SLP++                     
Subjt:  WK---MQVTDFLTYKKIHKTLKERSATKMTDREWTKMDETSMNSHINEVTDLMKKLEAMEITFLEDVKAIKLLYSLPDN---------------------

Query:  ICDAAIAEKIRTKKSG-------------------KHSTSTFRSEKHFESVS-----CFYCHKKGHVKRFCRKLE-EDQGKE--------DSSNYLIAEV
        +  + + E+ R K SG                   K S +  +S     S+S     C+YCHKKGH+KR CRKL+ ++Q KE        D++     E+
Subjt:  ICDAAIAEKIRTKKSG-------------------KHSTSTFRSEKHFESVS-----CFYCHKKGHVKRFCRKLE-EDQGKE--------DSSNYLIAEV

Query:  LLASAEISTTSIEQASEKL--------------SFTSFTTGSFGMVRMGNNRLFKIRGIGDVNLKTDSRTKLFFSDVTYVLKFKRNLISVGKFDEEGYSS
        ++   ++S   I Q ++ +               FTS++ G FG VRMGN  + KI G+GD+ L+T++  KL   DV +V   + NLIS GK D+EGY++
Subjt:  LLASAEISTTSIEQASEKL--------------SFTSFTTGSFGMVRMGNNRLFKIRGIGDVNLKTDSRTKLFFSDVTYVLKFKRNLISVGKFDEEGYSS

Query:  EFSDGSRKLVKGSEVVAVGHKDASM
         FSDG  KL KGS VVA G K  S+
Subjt:  EFSDGSRKLVKGSEVVAVGHKDASM

A0A5C7HL28 CCHC-type domain-containing protein  e value: 2.0e-24  % identity: 32.07
Query:  DETSMNSHINEVTDLMKKLEAMEITFLEDVKAIKLLYSLPD-------NICDAA--------------IAEKIRTKKSGKHSTSTFRSEKH---------
        D T ++ H+N    ++ +L  M I F ++V+ + LL +LPD       ++C++A              + E++R K  G   +    +EK          
Subjt:  DETSMNSHINEVTDLMKKLEAMEITFLEDVKAIKLLYSLPD-------NICDAA--------------IAEKIRTKKSGKHSTSTFRSEKH---------

Query:  -----------FESVSCFYCHKKGHVKRFCRKLEED----QGKE---DSSN-----------YLIA----EVLLASAEI-----STTSIEQASEKLSFTS
                   F +V C++C +KGH+K++CR+L+ D    +GKE   D SN           +L+      V LA  E      S  SI   S++  F S
Subjt:  -----------FESVSCFYCHKKGHVKRFCRKLEED----QGKE---DSSN-----------YLIA----EVLLASAEI-----STTSIEQASEKLSFTS

Query:  FTTGSFGMVRMGNNRLFKIRGIGDVNLKTDSRTKLFFSDVTYVLKFKRNLISVGKFDEEGYSSEFSDGSRKLVKGSEVVAVGHKDASMRF
        +T+G FG V+MGNN + K  G+GDV L+T++   L   +V ++   + NLIS GK D+EG+ + FSDG  KL KGS +VA G K +S+ F
Subjt:  FTTGSFGMVRMGNNRLFKIRGIGDVNLKTDSRTKLFFSDVTYVLKFKRNLISVGKFDEEGYSSEFSDGSRKLVKGSEVVAVGHKDASMRF

A0A5C7I9X1 CCHC-type domain-containing protein  e value: 2.7e-24  % identity: 32.07
Query:  DETSMNSHINEVTDLMKKLEAMEITFLEDVKAIKLLYSLPD-------NICDAA--------------IAEKIRTKKSGKHSTSTFRSEKH---------
        D T ++ H+N    ++ +L  M I F ++V+ + LL +LPD       ++C++A              + E++R K  G   +    +EK          
Subjt:  DETSMNSHINEVTDLMKKLEAMEITFLEDVKAIKLLYSLPD-------NICDAA--------------IAEKIRTKKSGKHSTSTFRSEKH---------

Query:  -----------FESVSCFYCHKKGHVKRFCRKLEED----QGKE---DSSN-----------YLIA----EVLLASAEI-----STTSIEQASEKLSFTS
                   F +V C++C +KGH+K++CR+L+ D    +GKE   D SN           +L+      V LA  E      S  SI   S +  F S
Subjt:  -----------FESVSCFYCHKKGHVKRFCRKLEED----QGKE---DSSN-----------YLIA----EVLLASAEI-----STTSIEQASEKLSFTS

Query:  FTTGSFGMVRMGNNRLFKIRGIGDVNLKTDSRTKLFFSDVTYVLKFKRNLISVGKFDEEGYSSEFSDGSRKLVKGSEVVAVGHKDASMRF
        +T+G+FG V+MGNN + K  G+GDV L+T++   L   +V ++   + NLIS GK D+EG+ + FSDG  KL KGS +VA G K +S+ F
Subjt:  FTTGSFGMVRMGNNRLFKIRGIGDVNLKTDSRTKLFFSDVTYVLKFKRNLISVGKFDEEGYSSEFSDGSRKLVKGSEVVAVGHKDASMRF

SwissProt top hits (e value, % identity, alignment)
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94  e value: 2.7e-10  % identity: 27.63
Query:  KSGKHSTSTFRSEKHFESVSCFYCHKKGHVKRFCRKLEEDQG-----KEDSSNYLIAE-----VLLASAE--------------ISTTSIEQASE-KLSF
        +SG    S  RS+      +C+ C++ GH KR C    + +G     K D +   + +     VL  + E              + T +   A+  +  F
Subjt:  KSGKHSTSTFRSEKHFESVSCFYCHKKGHVKRFCRKLEEDQG-----KEDSSNYLIAE-----VLLASAE--------------ISTTSIEQASE-KLSF

Query:  TSFTTGSFGMVRMGNNRLFKIRGIGDVNLKTDSRTKLFFSDVTYVLKFKRNLISVGKFDEEGYSSEFSDGSRKLVKGSEVVAVG-------HKDASMRFG
          +  G FG V+MGN    KI GIGD+ +KT+    L   DV +V   + NLIS    D +GY S F++   +L KGS V+A G         +A +  G
Subjt:  TSFTTGSFGMVRMGNNRLFKIRGIGDVNLKTDSRTKLFFSDVTYVLKFKRNLISVGKFDEEGYSSEFSDGSRKLVKGSEVVAVG-------HKDASMRFG

Query:  ELMKSRRRKSASM------KKTTVGAEVKGEVSRVATDLGGSVKLSDEKSFFKGHWV
        EL  ++   S  +        +  G ++  + S ++   G +VK  D   F K H V
Subjt:  ELMKSRRRKSASM------KKTTVGAEVKGEVSRVATDLGGSVKLSDEKSFFKGHWV

Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGGGATCCAGTCAGAAGTCACATCACTACTTTAAAGGAGTGTTAAGATTTCATGGGGACAACTTCGTGTTTTGGAAGATGCAAGTGACAGATTTTCTTACATATAAGAA
GATACACAAGACCTTGAAAGAACGATCGGCCACCAAGATGACAGATAGAGAGTGGACAAAGATGGATGAGACCTCAATGAATTCCCATATAAATGAAGTCACCGATTTGA
TGAAGAAGTTGGAGGCTATGGAAATCACTTTCTTGGAGGATGTGAAGGCCATTAAGTTGTTGTATTCTTTGCCTGACAATATTTGTGACGCTGCAATAGCTGAAAAGATT
CGCACGAAGAAAAGTGGAAAGCATTCTACTTCTACTTTTAGGTCTGAAAAACATTTTGAATCGGTCTCGTGCTTTTATTGCCACAAGAAGGGACACGTCAAGAGATTTTG
CCGGAAGCTCGAGGAGGATCAAGGAAAGGAAGATTCTTCAAACTACTTGATAGCTGAGGTGTTGCTAGCTAGTGCTGAGATTAGTACAACATCTATAGAGCAGGCATCTG
AGAAATTGTCGTTCACATCTTTTACTACAGGGAGCTTTGGCATGGTGAGAATGGGAAACAACAGACTCTTCAAGATCAGGGGCATTGGAGATGTTAATCTAAAGACTGAC
AGTAGAACTAAGCTATTTTTTAGCGATGTTACATACGTGCTCAAATTCAAGAGGAATCTGATATCTGTTGGGAAGTTTGATGAAGAAGGTTATAGTAGTGAATTTTCAGA
TGGTAGCCGGAAGTTAGTGAAGGGATCCGAGGTAGTGGCAGTGGGCCACAAAGATGCTTCAATGCGATTTGGCGAGCTGATGAAGTCGCGTAGACGAAAGAGTGCATCAA
TGAAAAAGACTACAGTGGGTGCTGAAGTTAAGGGTGAAGTCTCTAGAGTGGCAACGGACTTGGGTGGTTCTGTCAAGTTATCAGATGAGAAGTCTTTCTTCAAAGGTCAT
TGGGTTCGATCGAGAAATGAAGCATCAAATACCACTTAG
mRNA sequence
ATGGGATCCAGTCAGAAGTCACATCACTACTTTAAAGGAGTGTTAAGATTTCATGGGGACAACTTCGTGTTTTGGAAGATGCAAGTGACAGATTTTCTTACATATAAGAA
GATACACAAGACCTTGAAAGAACGATCGGCCACCAAGATGACAGATAGAGAGTGGACAAAGATGGATGAGACCTCAATGAATTCCCATATAAATGAAGTCACCGATTTGA
TGAAGAAGTTGGAGGCTATGGAAATCACTTTCTTGGAGGATGTGAAGGCCATTAAGTTGTTGTATTCTTTGCCTGACAATATTTGTGACGCTGCAATAGCTGAAAAGATT
CGCACGAAGAAAAGTGGAAAGCATTCTACTTCTACTTTTAGGTCTGAAAAACATTTTGAATCGGTCTCGTGCTTTTATTGCCACAAGAAGGGACACGTCAAGAGATTTTG
CCGGAAGCTCGAGGAGGATCAAGGAAAGGAAGATTCTTCAAACTACTTGATAGCTGAGGTGTTGCTAGCTAGTGCTGAGATTAGTACAACATCTATAGAGCAGGCATCTG
AGAAATTGTCGTTCACATCTTTTACTACAGGGAGCTTTGGCATGGTGAGAATGGGAAACAACAGACTCTTCAAGATCAGGGGCATTGGAGATGTTAATCTAAAGACTGAC
AGTAGAACTAAGCTATTTTTTAGCGATGTTACATACGTGCTCAAATTCAAGAGGAATCTGATATCTGTTGGGAAGTTTGATGAAGAAGGTTATAGTAGTGAATTTTCAGA
TGGTAGCCGGAAGTTAGTGAAGGGATCCGAGGTAGTGGCAGTGGGCCACAAAGATGCTTCAATGCGATTTGGCGAGCTGATGAAGTCGCGTAGACGAAAGAGTGCATCAA
TGAAAAAGACTACAGTGGGTGCTGAAGTTAAGGGTGAAGTCTCTAGAGTGGCAACGGACTTGGGTGGTTCTGTCAAGTTATCAGATGAGAAGTCTTTCTTCAAAGGTCAT
TGGGTTCGATCGAGAAATGAAGCATCAAATACCACTTAG
Protein sequence
MGSSQKSHHYFKGVLRFHGDNFVFWKMQVTDFLTYKKIHKTLKERSATKMTDREWTKMDETSMNSHINEVTDLMKKLEAMEITFLEDVKAIKLLYSLPDNICDAAIAEKI
RTKKSGKHSTSTFRSEKHFESVSCFYCHKKGHVKRFCRKLEEDQGKEDSSNYLIAEVLLASAEISTTSIEQASEKLSFTSFTTGSFGMVRMGNNRLFKIRGIGDVNLKTD
SRTKLFFSDVTYVLKFKRNLISVGKFDEEGYSSEFSDGSRKLVKGSEVVAVGHKDASMRFGELMKSRRRKSASMKKTTVGAEVKGEVSRVATDLGGSVKLSDEKSFFKGH
WVRSRNEASNTT
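
The relationship between the CDS and protein sequences above can be checked by translating codons. The sketch below assumes the standard nuclear genetic code (NCBI translation table 1), which the page does not state explicitly; the `translate` helper is hypothetical and only handles unambiguous A/C/G/T codons.

```python
# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds):
    """Translate a CDS string into one-letter amino acids ('*' marks a stop).

    Whitespace and line breaks are ignored; the length must be a multiple of 3.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First ten codons and last four codons of the CDS listed above.
print(translate("ATGGGATCCAGTCAGAAGTCACATCACTAC"))
# MGSSQKSHHY
print(translate("AATACCACTTAG"))
# NTT*
```

Both fragments agree with the start (MGSSQKSHHY) and end (NTT, followed by a stop codon) of the protein sequence shown above. Passing the full CDS, with its line breaks, works the same way.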