CuGenDBv2

CSPI01G33190 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI01G33190
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: Chr1:28065503..28066330
RNA-Seq Expression: CSPI01G33190
Synteny: CSPI01G33190
Gene Ontology terms: NA
InterPro domains: IPR026960 - Reverse transcriptase zinc-binding domain
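The genome location field uses a `Chr:start..end` notation. The following is a minimal sketch of how such a string could be parsed and its span length computed. It assumes the coordinates are 1-based and inclusive, which this page does not state; the function names `parse_location` and `span_length` are hypothetical and not part of any CuGenDB tool.

```python
import re


def parse_location(location):
    """Split a string such as 'Chr1:28065503..28066330' into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(start, end):
    """Length of a span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("Chr1:28065503..28066330")
length = span_length(start, end)
print(chrom, start, end, length)
```

Under the inclusive-coordinate assumption, the span for this gene works out to 828 bp.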


Homology
GenBank top hits | e value | % identity
CAN79190.1 hypothetical protein VITISV_000232 [Vitis vinifera] | 1.2e-29 | 33.96
Query:  TLECKFPRLVRIAINPNGSVSDHCDSSTN-SWSILFRRLLNVEEIFYFQTLLSQISSSQ-PASFPDQRFWSLKTSGFFSVKSLVKHLSTFSPLESSL-YK
        +L  +FPRL+R+ ++ N  +S    S+   SW+  FRR L+  EI   ++L+  +         PD+R WSL +SG F+VKS    LS  S L S    K
Subjt:  TLECKFPRLVRIAINPNGSVSDHCDSSTN-SWSILFRRLLNVEEIFYFQTLLSQISSSQ-PASFPDQRFWSLKTSGFFSVKSLVKHLSTFSPLESSL-YK

Query:  RLWKSNSPRRINISIWIMLFGNLNCASVLQRKLPSHALSPHACPLCVYNMESIQHLLFDCVFASKSWFRLLQAFNFCWVLDHVFKNNVSQILAGPNLKKK
         +W S  P +I   +W++    +N   +LQ + P  ALSP  C LC+   E++ HL   C      W RL Q     WV      + +S    G    K+
Subjt:  RLWKSNSPRRINISIWIMLFGNLNCASVLQRKLPSHALSPHACPLCVYNMESIQHLLFDCVFASKSWFRLLQAFNFCWVLDHVFKNNVSQILAGPNLKKK

Query:  TVECLWENAVKAFLAEIWFERNQRVFHDKSSTWMERYDSARLNASSWCSLSKFFQDYSSQEIVLNWPA
         +  LW+ A  A L  +W ERN R+F DKS      +D     AS W S SK F+      I L+W A
Subjt:  TVECLWENAVKAFLAEIWFERNQRVFHDKSSTWMERYDSARLNASSWCSLSKFFQDYSSQEIVLNWPA

KAA0035739.1 hypothetical protein E6C27_scaffold403G00100 [Cucumis melo var. makuwa] | 8.0e-50 | 43.68
Query:  KFPRLVRIAINPNGSVSDHCDSSTNSWSILFRRLLNVEEIFYFQTLLSQISSSQPASFPDQRFWSLKTSGFFSVKSLVKHLSTFSPLESSLYKRLWKSNS
        +F RL RIA+ P       C     SW + FRR L  EEI  FQ+LL  +S+ +  +  D R WS+++ G FS KSL  HL T SP++  L+  + +S S
Subjt:  KFPRLVRIAINPNGSVSDHCDSSTNSWSILFRRLLNVEEIFYFQTLLSQISSSQPASFPDQRFWSLKTSGFFSVKSLVKHLSTFSPLESSLYKRLWKSNS

Query:  PRRINISIWIMLFGNLNCASVLQRKLPSHALSPHACPLCVYNMESIQHLLFDCVFASKSWFRLLQAFNFCWVLDHVFKNNVSQILAGPNLKKKTVECLWE
        PRRINI IWIM+F  +  + +LQ+K P + +SP  CPLC+   +++ H+   C  +S  W R+   FN  W  D     +V Q+L+G NL  KT   +WE
Subjt:  PRRINISIWIMLFGNLNCASVLQRKLPSHALSPHACPLCVYNMESIQHLLFDCVFASKSWFRLLQAFNFCWVLDHVFKNNVSQILAGPNLKKKTVECLWE

Query:  NAVKAFLAEIWFERNQRVFHDKSSTWMERYDSARLNASSWCSLSKFFQDYSSQEIVLNWPA
           KA L EIW ERNQR+FHDK+    E   +A LNA++WCSL K F +YS Q+I LNW A
Subjt:  NAVKAFLAEIWFERNQRVFHDKSSTWMERYDSARLNASSWCSLSKFFQDYSSQEIVLNWPA

TYK21876.1 hypothetical protein E5676_scaffold494G00090 [Cucumis melo var. makuwa] | 2.8e-50 | 43.35
Query:  KFPRLVRIAINPNGSVSDHCDSSTNSWSILFRRLLNVEEIFYFQTLLSQISSSQPASFPDQRFWSLKTSGFFSVKSLVKHLSTFSPLESSLYKRLWKSNS
        +F RL  IA+ P       C     SW + FRR L  EEI  FQ+LL  +S+ +  +  D R WS+++ G FS KSL  HL T SP++  L+  + +S S
Subjt:  KFPRLVRIAINPNGSVSDHCDSSTNSWSILFRRLLNVEEIFYFQTLLSQISSSQPASFPDQRFWSLKTSGFFSVKSLVKHLSTFSPLESSLYKRLWKSNS

Query:  PRRINISIWIMLFGNLNCASVLQRKLPSHALSPHACPLCVYNMESIQHLLFDCVFASKSWFRLLQAFNFCWVLDHVFKNNVSQILAGPNLKKKTVECLWE
        PRRINI IWIM+F  +N + +LQ+K P + +SP  CPLC+   +++ H+   C  +S  W R+   FN  W  D     +V Q+L+G NL  KT   +WE
Subjt:  PRRINISIWIMLFGNLNCASVLQRKLPSHALSPHACPLCVYNMESIQHLLFDCVFASKSWFRLLQAFNFCWVLDHVFKNNVSQILAGPNLKKKTVECLWE

Query:  NAVKAFLAEIWFERNQRVFHDKSSTWMERYDSARLNASSWCSLSKFFQDYSSQEIVLNWPAFI
           KA L EIW ERNQR+FHDK+    E   +A LNA++WCSL K F +YS Q+I LNW  F+
Subjt:  NAVKAFLAEIWFERNQRVFHDKSSTWMERYDSARLNASSWCSLSKFFQDYSSQEIVLNWPAFI

XP_022153214.1 uncharacterized protein LOC111020765 [Momordica charantia] | 8.6e-36 | 42.55
Query:  VKHLSTFSPLESSLYKRLWKSNSPRRINISIWIMLFGNLNCASVLQRKLPSHALSPHACPLCVYNMESIQHLLFDCVFASKSWFRLLQAFNFCWVLDHVF
        ++   + + +    +  LWK+ SPRR+N+  WI+  G LN A ++Q+K PS AL P  C LC  + E   HL F C FASK W  L   FN  W  D   
Subjt:  VKHLSTFSPLESSLYKRLWKSNSPRRINISIWIMLFGNLNCASVLQRKLPSHALSPHACPLCVYNMESIQHLLFDCVFASKSWFRLLQAFNFCWVLDHVF

Query:  KNNVSQILAGPNLKKKTVECLWENAVKAFLAEIWFERNQRVFHDKSSTWMERYDSARLNASSWCSLSKFFQDYSSQEIVLNWPAFIAS
         +NV Q+L GP     +V  LW N VKA L+E+WFERN R+F +K   + E + SA+  AS WCSL   F  +S   I  NW AFI S
Subjt:  KNNVSQILAGPNLKKKTVECLWENAVKAFLAEIWFERNQRVFHDKSSTWMERYDSARLNASSWCSLSKFFQDYSSQEIVLNWPAFIAS

XP_038903695.1 uncharacterized protein LOC120090219 [Benincasa hispida] | 6.5e-68 | 50.38
Query:  FPRLVRIAINPNGSVSDHCDSSTNSWSILFRRLLNVEEIFYFQTLLSQISSSQPASFPDQRFWSLKTSGFFSVKSLVKHLSTFSPLESSLYKRLWKSNSP
        F  L RI   PNGSV+DH D++  SWSI FRR L  EE+  FQ LL +I S  P+   D+R WS+  +  ++VKSL  HL+ FSPLE  ++  +WK+ SP
Subjt:  FPRLVRIAINPNGSVSDHCDSSTNSWSILFRRLLNVEEIFYFQTLLSQISSSQPASFPDQRFWSLKTSGFFSVKSLVKHLSTFSPLESSLYKRLWKSNSP

Query:  RRINISIWIMLFGNLNCASVLQRKLPSHALSPHACPLCVYNMESIQHLLFDCVFASKSWFRLLQAFNFCWVLDHVFKNNVSQILAGPNLKKKTVECLWEN
        RR+NI IWIMLFG LN A VLQ+K P+ +LSP  CP C+++ E   HL F C ++S  W +LL  FN    L + FK+NV Q+LA P   K T   LW N
Subjt:  RRINISIWIMLFGNLNCASVLQRKLPSHALSPHACPLCVYNMESIQHLLFDCVFASKSWFRLLQAFNFCWVLDHVFKNNVSQILAGPNLKKKTVECLWEN

Query:  AVKAFLAEIWFERNQRVFHDKSSTWMERYDSARLNASSWCSLSKFFQDYSSQEIVLNWPAFIASPP
        AVKA LA++WFERNQR+F++K+++  +R ++AR  ASSWC LS  F+ YS  +  LNW AFI++PP
Subjt:  AVKAFLAEIWFERNQRVFHDKSSTWMERYDSARLNASSWCSLSKFFQDYSSQEIVLNWPAFIASPP

TrEMBL top hits | e value | % identity
A0A438CWE0 Putative ribonuclease H protein | 4.9e-29 | 34.12
Query:  TLECKFPRLVRIAINPNGSVSDHCDSSTN-SWSILFRRLLNVEEIFYFQTLLSQISSSQ-PASFPDQRFWSLKTSGFFSVKSLVKHLSTFSPLESSL-YK
        +L  +FPRL+R+ ++ N  +S    S+   SW+  FRR L+  EI   ++L+  +       S PD+R WSL +SG F+VKS    LS  S L      K
Subjt:  TLECKFPRLVRIAINPNGSVSDHCDSSTN-SWSILFRRLLNVEEIFYFQTLLSQISSSQ-PASFPDQRFWSLKTSGFFSVKSLVKHLSTFSPLESSL-YK

Query:  RLWKSNSPRRINISIWIMLFGNLNCASVLQRKLPSHALSPHACPLCVYNMESIQHLLFDCVFASKSWFRLLQAFNFCWVLDHVFKNNVSQILAGPNLKKK
         +W S  P +I   +W++    +N + +LQ + P  ALSP  C LC+   E++ HL   C      W RL Q     WV      + +S    G    K+
Subjt:  RLWKSNSPRRINISIWIMLFGNLNCASVLQRKLPSHALSPHACPLCVYNMESIQHLLFDCVFASKSWFRLLQAFNFCWVLDHVFKNNVSQILAGPNLKKK

Query:  TVECLWENAVKAFLAEIWFERNQRVFHDKSSTWMERYDSARLNASSWCSLSKFFQ
         +  LW+ A  A L  +W ERN R+F DKS      +D     AS W S SK F+
Subjt:  TVECLWENAVKAFLAEIWFERNQRVFHDKSSTWMERYDSARLNASSWCSLSKFFQ

A0A5A7T2Y0 zf-RVT domain-containing protein | 3.9e-50 | 43.68
Query:  KFPRLVRIAINPNGSVSDHCDSSTNSWSILFRRLLNVEEIFYFQTLLSQISSSQPASFPDQRFWSLKTSGFFSVKSLVKHLSTFSPLESSLYKRLWKSNS
        +F RL RIA+ P       C     SW + FRR L  EEI  FQ+LL  +S+ +  +  D R WS+++ G FS KSL  HL T SP++  L+  + +S S
Subjt:  KFPRLVRIAINPNGSVSDHCDSSTNSWSILFRRLLNVEEIFYFQTLLSQISSSQPASFPDQRFWSLKTSGFFSVKSLVKHLSTFSPLESSLYKRLWKSNS

Query:  PRRINISIWIMLFGNLNCASVLQRKLPSHALSPHACPLCVYNMESIQHLLFDCVFASKSWFRLLQAFNFCWVLDHVFKNNVSQILAGPNLKKKTVECLWE
        PRRINI IWIM+F  +  + +LQ+K P + +SP  CPLC+   +++ H+   C  +S  W R+   FN  W  D     +V Q+L+G NL  KT   +WE
Subjt:  PRRINISIWIMLFGNLNCASVLQRKLPSHALSPHACPLCVYNMESIQHLLFDCVFASKSWFRLLQAFNFCWVLDHVFKNNVSQILAGPNLKKKTVECLWE

Query:  NAVKAFLAEIWFERNQRVFHDKSSTWMERYDSARLNASSWCSLSKFFQDYSSQEIVLNWPA
           KA L EIW ERNQR+FHDK+    E   +A LNA++WCSL K F +YS Q+I LNW A
Subjt:  NAVKAFLAEIWFERNQRVFHDKSSTWMERYDSARLNASSWCSLSKFFQDYSSQEIVLNWPA

A0A5D3DE60 zf-RVT domain-containing protein | 1.3e-50 | 43.35
Query:  KFPRLVRIAINPNGSVSDHCDSSTNSWSILFRRLLNVEEIFYFQTLLSQISSSQPASFPDQRFWSLKTSGFFSVKSLVKHLSTFSPLESSLYKRLWKSNS
        +F RL  IA+ P       C     SW + FRR L  EEI  FQ+LL  +S+ +  +  D R WS+++ G FS KSL  HL T SP++  L+  + +S S
Subjt:  KFPRLVRIAINPNGSVSDHCDSSTNSWSILFRRLLNVEEIFYFQTLLSQISSSQPASFPDQRFWSLKTSGFFSVKSLVKHLSTFSPLESSLYKRLWKSNS

Query:  PRRINISIWIMLFGNLNCASVLQRKLPSHALSPHACPLCVYNMESIQHLLFDCVFASKSWFRLLQAFNFCWVLDHVFKNNVSQILAGPNLKKKTVECLWE
        PRRINI IWIM+F  +N + +LQ+K P + +SP  CPLC+   +++ H+   C  +S  W R+   FN  W  D     +V Q+L+G NL  KT   +WE
Subjt:  PRRINISIWIMLFGNLNCASVLQRKLPSHALSPHACPLCVYNMESIQHLLFDCVFASKSWFRLLQAFNFCWVLDHVFKNNVSQILAGPNLKKKTVECLWE

Query:  NAVKAFLAEIWFERNQRVFHDKSSTWMERYDSARLNASSWCSLSKFFQDYSSQEIVLNWPAFI
           KA L EIW ERNQR+FHDK+    E   +A LNA++WCSL K F +YS Q+I LNW  F+
Subjt:  NAVKAFLAEIWFERNQRVFHDKSSTWMERYDSARLNASSWCSLSKFFQDYSSQEIVLNWPAFI

A0A6J1DIE2 uncharacterized protein LOC111020765 | 4.2e-36 | 42.55
Query:  VKHLSTFSPLESSLYKRLWKSNSPRRINISIWIMLFGNLNCASVLQRKLPSHALSPHACPLCVYNMESIQHLLFDCVFASKSWFRLLQAFNFCWVLDHVF
        ++   + + +    +  LWK+ SPRR+N+  WI+  G LN A ++Q+K PS AL P  C LC  + E   HL F C FASK W  L   FN  W  D   
Subjt:  VKHLSTFSPLESSLYKRLWKSNSPRRINISIWIMLFGNLNCASVLQRKLPSHALSPHACPLCVYNMESIQHLLFDCVFASKSWFRLLQAFNFCWVLDHVF

Query:  KNNVSQILAGPNLKKKTVECLWENAVKAFLAEIWFERNQRVFHDKSSTWMERYDSARLNASSWCSLSKFFQDYSSQEIVLNWPAFIAS
         +NV Q+L GP     +V  LW N VKA L+E+WFERN R+F +K   + E + SA+  AS WCSL   F  +S   I  NW AFI S
Subjt:  KNNVSQILAGPNLKKKTVECLWENAVKAFLAEIWFERNQRVFHDKSSTWMERYDSARLNASSWCSLSKFFQDYSSQEIVLNWPAFIAS

A5BQD9 Reverse transcriptase domain-containing protein | 5.8e-30 | 33.96
Query:  TLECKFPRLVRIAINPNGSVSDHCDSSTN-SWSILFRRLLNVEEIFYFQTLLSQISSSQ-PASFPDQRFWSLKTSGFFSVKSLVKHLSTFSPLESSL-YK
        +L  +FPRL+R+ ++ N  +S    S+   SW+  FRR L+  EI   ++L+  +         PD+R WSL +SG F+VKS    LS  S L S    K
Subjt:  TLECKFPRLVRIAINPNGSVSDHCDSSTN-SWSILFRRLLNVEEIFYFQTLLSQISSSQ-PASFPDQRFWSLKTSGFFSVKSLVKHLSTFSPLESSL-YK

Query:  RLWKSNSPRRINISIWIMLFGNLNCASVLQRKLPSHALSPHACPLCVYNMESIQHLLFDCVFASKSWFRLLQAFNFCWVLDHVFKNNVSQILAGPNLKKK
         +W S  P +I   +W++    +N   +LQ + P  ALSP  C LC+   E++ HL   C      W RL Q     WV      + +S    G    K+
Subjt:  RLWKSNSPRRINISIWIMLFGNLNCASVLQRKLPSHALSPHACPLCVYNMESIQHLLFDCVFASKSWFRLLQAFNFCWVLDHVFKNNVSQILAGPNLKKK

Query:  TVECLWENAVKAFLAEIWFERNQRVFHDKSSTWMERYDSARLNASSWCSLSKFFQDYSSQEIVLNWPA
         +  LW+ A  A L  +W ERN R+F DKS      +D     AS W S SK F+      I L+W A
Subjt:  TVECLWENAVKAFLAEIWFERNQRVFHDKSSTWMERYDSARLNASSWCSLSKFFQDYSSQEIVLNWPA

SwissProt top hits | e value | % identity
No hits found

Arabidopsis top hits | e value | % identity
AT3G09510.1 Ribonuclease H-like superfamily protein | 2.0e-06 | 24.75
Query:  PDQRFWSLKTSGFFSVKS---LVKH-----LSTFSPLESS--LYKRLWKSNSPRRINISIWIMLFGNLNCASVLQRKLPSHALSPHACPLCVYNMESIQH
        PD+  W+  T+G ++V+S   L+ H     +   +P   S  L  R+W      ++   +W  L   L     L  +     + P +CP C    ESI H
Subjt:  PDQRFWSLKTSGFFSVKS---LVKH-----LSTFSPLESS--LYKRLWKSNSPRRINISIWIMLFGNLNCASVLQRKLPSHALSPHACPLCVYNMESIQH

Query:  LLFDCVFASKSWFRLLQAFNFCWVLDHVFKNNVSQILAGPNLKKKTVECLWENAVKAFLA-EIWFERNQRVFHDKSSTWMERYDSARLNASSWCSLSK
         LF C FA+ +W     +     ++ + F+ N+S IL   N  + T    +   +  +L   IW  RN  VF+    +  +   SA+     W + ++
Subjt:  LLFDCVFASKSWFRLLQAFNFCWVLDHVFKNNVSQILAGPNLKKKTVECLWENAVKAFLA-EIWFERNQRVFHDKSSTWMERYDSARLNASSWCSLSK

AT3G25270.1 Ribonuclease H-like superfamily protein | 5.6e-09 | 25
Query:  PLESSLYKRLWKSNSPRRINISIWIMLFGNLNCASVLQRKLPSHALSPHACPLCVYNMESIQHLLFDCVFASKSW------FRLLQAFNFCWVLDHVFKN
        P ++ +  ++WK  +  +I   +W +L G L     L+R+   H  +   C  C    E+ QHL FDC +A + W       + L+       ++   + 
Subjt:  PLESSLYKRLWKSNSPRRINISIWIMLFGNLNCASVLQRKLPSHALSPHACPLCVYNMESIQHLLFDCVFASKSW------FRLLQAFNFCWVLDHVFKN

Query:  NVSQILAG--PNLKKKTVECLWENAVKAFLAEIWFERNQRVFHDKSSTWMERYDSARLNASSWCSLSKFFQDYSSQ
         +S  LA   P L    +  LW          +W  RNQ VF  KS +W      AR +   W   + + Q  + Q
Subjt:  NVSQILAG--PNLKKKTVECLWENAVKAFLAEIWFERNQRVFHDKSSTWMERYDSARLNASSWCSLSKFFQDYSSQ

AT4G29090.1 Ribonuclease H-like superfamily protein | 6.9e-07 | 28.48
Query:  VSDHCDSSTNSWSILFRRLLNVEEIFYFQTLLSQISSSQPAS--FPDQRFWSLKTSGFFSVKSLVKHLSTF-----SPLESS------LYKRLWKSNSPR
        VSD  D S   W        +V E+ + +     I   +P      D   W   +SG ++VKS    L+       SP E S      +Y+++WKS +  
Subjt:  VSDHCDSSTNSWSILFRRLLNVEEIFYFQTLLSQISSSQPAS--FPDQRFWSLKTSGFFSVKSLVKHLSTF-----SPLESS------LYKRLWKSNSPR

Query:  RINISIWIMLFGNLNCASVLQRKLPSHALSPHACPLCVYNMESIQHLLFDCVFASKSW
        +I   +W  L  +L  A  L  +   H     AC  C    E++ HLLF C FA  +W
Subjt:  RINISIWIMLFGNLNCASVLQRKLPSHALSPHACPLCVYNMESIQHLLFDCVFASKSW


Sequences
CDS sequence
ATGGACATTATCACTTTGGAGTGTAAATTTCCAAGATTGGTTAGGATTGCAATCAATCCAAATGGGTCTGTTTCAGATCATTGTGACTCTTCAACCAATTCTTGGTCCAT
CTTGTTCAGAAGATTGCTAAATGTTGAGGAAATATTTTATTTCCAGACCTTGCTGAGTCAGATTTCTTCTTCACAGCCAGCCTCCTTTCCAGATCAAAGGTTTTGGTCCC
TTAAAACCTCAGGCTTCTTCTCGGTAAAATCATTGGTTAAGCATCTTTCGACATTTTCTCCTTTGGAAAGTTCTTTGTACAAAAGGCTTTGGAAATCTAACAGCCCAAGA
AGGATAAATATTTCCATATGGATTATGCTCTTCGGAAATTTAAACTGTGCCTCCGTACTTCAAAGAAAGCTTCCCTCTCATGCCCTCTCACCTCATGCCTGCCCCCTTTG
TGTTTATAACATGGAAAGCATCCAGCACCTTTTATTTGATTGTGTCTTTGCTTCTAAAAGTTGGTTCCGCCTTCTCCAAGCATTCAATTTTTGTTGGGTTCTTGATCATG
TTTTTAAGAACAATGTGTCTCAAATCCTTGCTGGTCCAAACTTGAAGAAGAAGACTGTTGAATGTTTGTGGGAAAATGCAGTAAAAGCTTTTTTAGCTGAAATCTGGTTT
GAAAGAAACCAAAGAGTCTTCCATGATAAATCATCGACTTGGATGGAACGCTATGATTCTGCTCGTTTGAATGCATCTTCATGGTGCTCATTATCCAAATTTTTTCAGGA
TTATTCTTCGCAGGAGATTGTTTTAAATTGGCCAGCCTTTATAGCCTCCCCACCTTGA
mRNA sequence
ATGGACATTATCACTTTGGAGTGTAAATTTCCAAGATTGGTTAGGATTGCAATCAATCCAAATGGGTCTGTTTCAGATCATTGTGACTCTTCAACCAATTCTTGGTCCAT
CTTGTTCAGAAGATTGCTAAATGTTGAGGAAATATTTTATTTCCAGACCTTGCTGAGTCAGATTTCTTCTTCACAGCCAGCCTCCTTTCCAGATCAAAGGTTTTGGTCCC
TTAAAACCTCAGGCTTCTTCTCGGTAAAATCATTGGTTAAGCATCTTTCGACATTTTCTCCTTTGGAAAGTTCTTTGTACAAAAGGCTTTGGAAATCTAACAGCCCAAGA
AGGATAAATATTTCCATATGGATTATGCTCTTCGGAAATTTAAACTGTGCCTCCGTACTTCAAAGAAAGCTTCCCTCTCATGCCCTCTCACCTCATGCCTGCCCCCTTTG
TGTTTATAACATGGAAAGCATCCAGCACCTTTTATTTGATTGTGTCTTTGCTTCTAAAAGTTGGTTCCGCCTTCTCCAAGCATTCAATTTTTGTTGGGTTCTTGATCATG
TTTTTAAGAACAATGTGTCTCAAATCCTTGCTGGTCCAAACTTGAAGAAGAAGACTGTTGAATGTTTGTGGGAAAATGCAGTAAAAGCTTTTTTAGCTGAAATCTGGTTT
GAAAGAAACCAAAGAGTCTTCCATGATAAATCATCGACTTGGATGGAACGCTATGATTCTGCTCGTTTGAATGCATCTTCATGGTGCTCATTATCCAAATTTTTTCAGGA
TTATTCTTCGCAGGAGATTGTTTTAAATTGGCCAGCCTTTATAGCCTCCCCACCTTGA
Protein sequence
MDIITLECKFPRLVRIAINPNGSVSDHCDSSTNSWSILFRRLLNVEEIFYFQTLLSQISSSQPASFPDQRFWSLKTSGFFSVKSLVKHLSTFSPLESSLYKRLWKSNSPR
RINISIWIMLFGNLNCASVLQRKLPSHALSPHACPLCVYNMESIQHLLFDCVFASKSWFRLLQAFNFCWVLDHVFKNNVSQILAGPNLKKKTVECLWENAVKAFLAEIWF
ERNQRVFHDKSSTWMERYDSARLNASSWCSLSKFFQDYSSQEIVLNWPAFIASPP
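
The relationship between the CDS and protein sequences above can be checked by translating the CDS codon by codon. The sketch below assumes the standard nuclear genetic code (NCBI translation table 1), which this page does not state explicitly; `translate` is a hypothetical helper written for illustration, not a CuGenDB function. Only short fragments copied from the start and end of the CDS are used here.

```python
from itertools import product

# Standard genetic code (NCBI table 1), listed in TCAG order:
# the first base varies slowest and the third base fastest.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Step through complete codons only; trailing 1-2 bases are ignored.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 24 nt and last 15 nt of the CDS shown above.
cds_start = "ATGGACATTATCACTTTGGAGTGT"
cds_end = "GCCTCCCCACCTTGA"

print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to the first eight and last four residues of the protein sequence listed above; running the same function on the full CDS, with the line breaks removed, would be the way to compare the whole sequence.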