; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi01G013200 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi01G013200
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionHTH myb-type domain-containing protein
Genome locationchr01:11452102..11459406
RNA-Seq ExpressionLsi01G013200
SyntenyLsi01G013200
Gene Ontology termsNA
InterPro domainsIPR001005 - SANT/Myb domain
IPR009057 - Homeobox-like domain superfamily
IPR017930 - Myb domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008449224.1 PREDICTED: uncharacterized protein LOC103491166 isoform X1 [Cucumis melo]4.9e-21673.58Show/hide
Query:  MDQEVHFCQKFTNMKSHWVKVEGPFLTAPLNDSNEVGDYLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNSGGLDSNSKQGGEHELKFGDLDQLLDDANEV
        MDQEVHFCQKFTNMKSHWVKVEGPFL APLNDSNEV D LVE KS+HVLGNCLRVQDFSCDFGYGIQTN GGLDSNSKQGGEHELKFGD DQLLDDANEV
Subjt:  MDQEVHFCQKFTNMKSHWVKVEGPFLTAPLNDSNEVGDYLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNSGGLDSNSKQGGEHELKFGDLDQLLDDANEV

Query:  GEFHATNNLPNTYAEVAENSFRQNRGLQLGNLSSESKSQGPSRSGTDAFGISELSATMVMDDEFNNTPVERGLTHELSPGLRTKGRCVTPLEGNICDTIL
        GEFHATNNLPNTYAEVAENSFR+NR  QLGN SSE+KS GPSR  TDAFGISELSATMVM+ EFNNTPVERGLTHELSPGL TKGRCVTPLEGNIC TIL
Subjt:  GEFHATNNLPNTYAEVAENSFRQNRGLQLGNLSSESKSQGPSRSGTDAFGISELSATMVMDDEFNNTPVERGLTHELSPGLRTKGRCVTPLEGNICDTIL

Query:  DNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEFADSKSENNKGRRKPPTKDKYLKVTSTEESNHIRHEVQMLTPRSESH
        DNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEF DSKSE+NKGRRK P KDKYLKV STEES HIRHEVQM+ PRS+S 
Subjt:  DNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEFADSKSENNKGRRKPPTKDKYLKVTSTEESNHIRHEVQMLTPRSESH

Query:  CGTSVPVQSRSQRRHPKNHVPVSVREFSLLISVHPANIIIIEISLSYFAFILCVFLMVHLAKSLFRFKAGTTLVKSCMHPPLLLPLSLSLYTYNFACLHV
        CGTSVPVQ +S+RRHP  HVPVS                                                                             
Subjt:  CGTSVPVQSRSQRRHPKNHVPVSVREFSLLISVHPANIIIIEISLSYFAFILCVFLMVHLAKSLFRFKAGTTLVKSCMHPPLLLPLSLSLYTYNFACLHV

Query:  SFKWKFFVVFSCSKFKVCYSSAHLHGFLSEDESSATECNNVYSSAKRCKKYDRRRHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLR
                                 GFLSEDESSATEC NVYSSA+RCKKYDRRR QKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLR
Subjt:  SFKWKFFVVFSCSKFKVCYSSAHLHGFLSEDESSATECNNVYSSAKRCKKYDRRRHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLR

Query:  DKWRNLLKASCVNIQNRKGIERKQSHASRPLPKSLLQRVYELANI
        DKWRNLL+ASCVNIQN+KG+E KQ+HASRPLPKSLLQRVYELANI
Subjt:  DKWRNLLKASCVNIQNRKGIERKQSHASRPLPKSLLQRVYELANI

XP_008449225.1 PREDICTED: uncharacterized protein LOC103491166 isoform X2 [Cucumis melo]1.2e-21473.58Show/hide
Query:  MDQEVHFCQKFTNMKSHWVKVEGPFLTAPLNDSNEVGDYLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNSGGLDSNSKQGGEHELKFGDLDQLLDDANEV
        MDQEVHFCQKFTNMKSHWVKVEGPFL APLNDSNEV D LVE KS+HVLGNCLRVQDFSCDFGYGIQTN GGLDSNSKQGGEHELKFGD DQLLDDANEV
Subjt:  MDQEVHFCQKFTNMKSHWVKVEGPFLTAPLNDSNEVGDYLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNSGGLDSNSKQGGEHELKFGDLDQLLDDANEV

Query:  GEFHATNNLPNTYAEVAENSFRQNRGLQLGNLSSESKSQGPSRSGTDAFGISELSATMVMDDEFNNTPVERGLTHELSPGLRTKGRCVTPLEGNICDTIL
        GEFHATNNLPNTYAEVAENSFR+NR  QLGN SSE+KS GPSR  TDAFGISELSATMVM+ EFNNTPVERGLTHELSPGL TKGRCVTPLEGNIC TIL
Subjt:  GEFHATNNLPNTYAEVAENSFRQNRGLQLGNLSSESKSQGPSRSGTDAFGISELSATMVMDDEFNNTPVERGLTHELSPGLRTKGRCVTPLEGNICDTIL

Query:  DNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEFADSKSENNKGRRKPPTKDKYLKVTSTEESNHIRHEVQMLTPRSESH
        DNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEF DSKSE+NKGRRK P KDKYLKV STEES HIRHEVQM+ PRS+S 
Subjt:  DNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEFADSKSENNKGRRKPPTKDKYLKVTSTEESNHIRHEVQMLTPRSESH

Query:  CGTSVPVQSRSQRRHPKNHVPVSVREFSLLISVHPANIIIIEISLSYFAFILCVFLMVHLAKSLFRFKAGTTLVKSCMHPPLLLPLSLSLYTYNFACLHV
        CGTSVPVQ +S+RRHP  HVPVS                                                                             
Subjt:  CGTSVPVQSRSQRRHPKNHVPVSVREFSLLISVHPANIIIIEISLSYFAFILCVFLMVHLAKSLFRFKAGTTLVKSCMHPPLLLPLSLSLYTYNFACLHV

Query:  SFKWKFFVVFSCSKFKVCYSSAHLHGFLSEDESSATECNNVYSSAKRCKKYDRRRHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLR
                                 GFLSEDESSATEC NVYSSA+RCKKYDRRR QKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLR
Subjt:  SFKWKFFVVFSCSKFKVCYSSAHLHGFLSEDESSATECNNVYSSAKRCKKYDRRRHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLR

Query:  DKWRNLLKASCVNIQNRKGIERKQSHASRPLPKSLLQRVYELANI
        DKWRNLL+ASCVNIQN+KG+E KQ+HASRPLPKSLLQRVYELANI
Subjt:  DKWRNLLKASCVNIQNRKGIERKQSHASRPLPKSLLQRVYELANI

XP_038881566.1 uncharacterized protein LOC120073047 isoform X1 [Benincasa hispida]1.0e-22977.37Show/hide
Query:  MDQEVHFCQKFTNMKSHWVKVEGPFLTAPLNDSNEVGDYLVEPKSDHVLGNCLRVQDFSCDFGYGIQTN-SGGLDSNSKQGGEHELKFGDLDQLLDDANE
        MDQEVHFCQKFTNMKSHWV+VEGPFL APLNDSNEV D LVEPKSDHVLGNCLRVQDFSCDFGYGIQTN  GGLDSNSKQGGEHELKFGDLDQLLDDANE
Subjt:  MDQEVHFCQKFTNMKSHWVKVEGPFLTAPLNDSNEVGDYLVEPKSDHVLGNCLRVQDFSCDFGYGIQTN-SGGLDSNSKQGGEHELKFGDLDQLLDDANE

Query:  VGEFHATNNLPNTYAEVAENSFRQNRGLQLGNLSSESKSQGPSRSGTDAFGISELSATMVMDDEFNNTPVERGLTHELSPGLRTKGRCV--TPLEGNICD
        VGEFHATNNL +TYAEVAENSFRQNRGLQLGN SS SKSQGPSRS TDAFGISELSATMVM+DEFNNTPVERGLTHELSPGLRTKGRCV  TPLEGNICD
Subjt:  VGEFHATNNLPNTYAEVAENSFRQNRGLQLGNLSSESKSQGPSRSGTDAFGISELSATMVMDDEFNNTPVERGLTHELSPGLRTKGRCV--TPLEGNICD

Query:  TILDNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEFADSKSENNKGRRKPPTKDKYLKVTSTEESNHIRHEVQMLTPRS
        TILDNRNIHKFNTNENYIENGDLSDENVKGDIVAN+LASCSRERRLRKPTRRYIEEFADSKSENNKGRRKPPTKDKYLKVTSTEESNHIRHEVQMLTPRS
Subjt:  TILDNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEFADSKSENNKGRRKPPTKDKYLKVTSTEESNHIRHEVQMLTPRS

Query:  ESHCGTSVPVQSRSQRRHPKNHVPVSVREFSLLISVHPANIIIIEISLSYFAFILCVFLMVHLAKSLFRFKAGTTLVKSCMHPPLLLPLSLSLYTYNFAC
        E HCGTSVPVQSRSQRRHPK HVPVS                                                                          
Subjt:  ESHCGTSVPVQSRSQRRHPKNHVPVSVREFSLLISVHPANIIIIEISLSYFAFILCVFLMVHLAKSLFRFKAGTTLVKSCMHPPLLLPLSLSLYTYNFAC

Query:  LHVSFKWKFFVVFSCSKFKVCYSSAHLHGFLSEDESSATECNNVYSSAKRCKKYDRRRHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPI
                                    GFLSEDESSATEC NVYSS KRCKKYDRRRHQKMW+LTEVMRLVDGIAEYGTGRWT IKKHLFASSPHRTPI
Subjt:  LHVSFKWKFFVVFSCSKFKVCYSSAHLHGFLSEDESSATECNNVYSSAKRCKKYDRRRHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPI

Query:  DLRDKWRNLLKASCVNIQNRKGIERKQSHASRPLPKSLLQRVYELANI
        DLRDKWRNLL+ASCVNIQNRKGIERKQSHASRPLPKSLLQRVYELANI
Subjt:  DLRDKWRNLLKASCVNIQNRKGIERKQSHASRPLPKSLLQRVYELANI

XP_038881567.1 uncharacterized protein LOC120073047 isoform X2 [Benincasa hispida]4.4e-21776.6Show/hide
Query:  VKVEGPFLTAPLNDSNEVGDYLVEPKSDHVLGNCLRVQDFSCDFGYGIQTN-SGGLDSNSKQGGEHELKFGDLDQLLDDANEVGEFHATNNLPNTYAEVA
        V+VEGPFL APLNDSNEV D LVEPKSDHVLGNCLRVQDFSCDFGYGIQTN  GGLDSNSKQGGEHELKFGDLDQLLDDANEVGEFHATNNL +TYAEVA
Subjt:  VKVEGPFLTAPLNDSNEVGDYLVEPKSDHVLGNCLRVQDFSCDFGYGIQTN-SGGLDSNSKQGGEHELKFGDLDQLLDDANEVGEFHATNNLPNTYAEVA

Query:  ENSFRQNRGLQLGNLSSESKSQGPSRSGTDAFGISELSATMVMDDEFNNTPVERGLTHELSPGLRTKGRCV--TPLEGNICDTILDNRNIHKFNTNENYI
        ENSFRQNRGLQLGN SS SKSQGPSRS TDAFGISELSATMVM+DEFNNTPVERGLTHELSPGLRTKGRCV  TPLEGNICDTILDNRNIHKFNTNENYI
Subjt:  ENSFRQNRGLQLGNLSSESKSQGPSRSGTDAFGISELSATMVMDDEFNNTPVERGLTHELSPGLRTKGRCV--TPLEGNICDTILDNRNIHKFNTNENYI

Query:  ENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEFADSKSENNKGRRKPPTKDKYLKVTSTEESNHIRHEVQMLTPRSESHCGTSVPVQSRSQRRH
        ENGDLSDENVKGDIVAN+LASCSRERRLRKPTRRYIEEFADSKSENNKGRRKPPTKDKYLKVTSTEESNHIRHEVQMLTPRSE HCGTSVPVQSRSQRRH
Subjt:  ENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEFADSKSENNKGRRKPPTKDKYLKVTSTEESNHIRHEVQMLTPRSESHCGTSVPVQSRSQRRH

Query:  PKNHVPVSVREFSLLISVHPANIIIIEISLSYFAFILCVFLMVHLAKSLFRFKAGTTLVKSCMHPPLLLPLSLSLYTYNFACLHVSFKWKFFVVFSCSKF
        PK HVPVS                                                                                            
Subjt:  PKNHVPVSVREFSLLISVHPANIIIIEISLSYFAFILCVFLMVHLAKSLFRFKAGTTLVKSCMHPPLLLPLSLSLYTYNFACLHVSFKWKFFVVFSCSKF

Query:  KVCYSSAHLHGFLSEDESSATECNNVYSSAKRCKKYDRRRHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLKASCVNIQ
                  GFLSEDESSATEC NVYSS KRCKKYDRRRHQKMW+LTEVMRLVDGIAEYGTGRWT IKKHLFASSPHRTPIDLRDKWRNLL+ASCVNIQ
Subjt:  KVCYSSAHLHGFLSEDESSATECNNVYSSAKRCKKYDRRRHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLKASCVNIQ

Query:  NRKGIERKQSHASRPLPKSLLQRVYELANI
        NRKGIERKQSHASRPLPKSLLQRVYELANI
Subjt:  NRKGIERKQSHASRPLPKSLLQRVYELANI

XP_038881569.1 uncharacterized protein LOC120073047 isoform X3 [Benincasa hispida]2.1e-22777.19Show/hide
Query:  MDQEVHFCQKFTNMKSHWVKVEGPFLTAPLNDSNEVGDYLVEPKSDHVLGNCLRVQDFSCDFGYGIQTN-SGGLDSNSKQGGEHELKFGDLDQLLDDANE
        MDQEVHFCQKFTNMKSHWV+VEGPFL APLNDSNEV D LVEPKSDHVLGNCLRVQDFSCDFGYGIQTN  GGLDSNSKQGGEHELKFGDLDQLLDDANE
Subjt:  MDQEVHFCQKFTNMKSHWVKVEGPFLTAPLNDSNEVGDYLVEPKSDHVLGNCLRVQDFSCDFGYGIQTN-SGGLDSNSKQGGEHELKFGDLDQLLDDANE

Query:  VGEFHATNNLPNTYAEVAENSFRQNRGLQLGNLSSESKSQGPSRSGTDAFGISELSATMVMDDEFNNTPVERGLTHELSPGLRTKGRCV--TPLEGNICD
        VGEFHATNNL N  AEVAENSFRQNRGLQLGN SS SKSQGPSRS TDAFGISELSATMVM+DEFNNTPVERGLTHELSPGLRTKGRCV  TPLEGNICD
Subjt:  VGEFHATNNLPNTYAEVAENSFRQNRGLQLGNLSSESKSQGPSRSGTDAFGISELSATMVMDDEFNNTPVERGLTHELSPGLRTKGRCV--TPLEGNICD

Query:  TILDNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEFADSKSENNKGRRKPPTKDKYLKVTSTEESNHIRHEVQMLTPRS
        TILDNRNIHKFNTNENYIENGDLSDENVKGDIVAN+LASCSRERRLRKPTRRYIEEFADSKSENNKGRRKPPTKDKYLKVTSTEESNHIRHEVQMLTPRS
Subjt:  TILDNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEFADSKSENNKGRRKPPTKDKYLKVTSTEESNHIRHEVQMLTPRS

Query:  ESHCGTSVPVQSRSQRRHPKNHVPVSVREFSLLISVHPANIIIIEISLSYFAFILCVFLMVHLAKSLFRFKAGTTLVKSCMHPPLLLPLSLSLYTYNFAC
        E HCGTSVPVQSRSQRRHPK HVPVS                                                                          
Subjt:  ESHCGTSVPVQSRSQRRHPKNHVPVSVREFSLLISVHPANIIIIEISLSYFAFILCVFLMVHLAKSLFRFKAGTTLVKSCMHPPLLLPLSLSLYTYNFAC

Query:  LHVSFKWKFFVVFSCSKFKVCYSSAHLHGFLSEDESSATECNNVYSSAKRCKKYDRRRHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPI
                                    GFLSEDESSATEC NVYSS KRCKKYDRRRHQKMW+LTEVMRLVDGIAEYGTGRWT IKKHLFASSPHRTPI
Subjt:  LHVSFKWKFFVVFSCSKFKVCYSSAHLHGFLSEDESSATECNNVYSSAKRCKKYDRRRHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPI

Query:  DLRDKWRNLLKASCVNIQNRKGIERKQSHASRPLPKSLLQRVYELANI
        DLRDKWRNLL+ASCVNIQNRKGIERKQSHASRPLPKSLLQRVYELANI
Subjt:  DLRDKWRNLLKASCVNIQNRKGIERKQSHASRPLPKSLLQRVYELANI

TrEMBL top hitse value%identityAlignment
A0A1S3BKX9 uncharacterized protein LOC103491166 isoform X12.3e-21673.58Show/hide
Query:  MDQEVHFCQKFTNMKSHWVKVEGPFLTAPLNDSNEVGDYLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNSGGLDSNSKQGGEHELKFGDLDQLLDDANEV
        MDQEVHFCQKFTNMKSHWVKVEGPFL APLNDSNEV D LVE KS+HVLGNCLRVQDFSCDFGYGIQTN GGLDSNSKQGGEHELKFGD DQLLDDANEV
Subjt:  MDQEVHFCQKFTNMKSHWVKVEGPFLTAPLNDSNEVGDYLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNSGGLDSNSKQGGEHELKFGDLDQLLDDANEV

Query:  GEFHATNNLPNTYAEVAENSFRQNRGLQLGNLSSESKSQGPSRSGTDAFGISELSATMVMDDEFNNTPVERGLTHELSPGLRTKGRCVTPLEGNICDTIL
        GEFHATNNLPNTYAEVAENSFR+NR  QLGN SSE+KS GPSR  TDAFGISELSATMVM+ EFNNTPVERGLTHELSPGL TKGRCVTPLEGNIC TIL
Subjt:  GEFHATNNLPNTYAEVAENSFRQNRGLQLGNLSSESKSQGPSRSGTDAFGISELSATMVMDDEFNNTPVERGLTHELSPGLRTKGRCVTPLEGNICDTIL

Query:  DNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEFADSKSENNKGRRKPPTKDKYLKVTSTEESNHIRHEVQMLTPRSESH
        DNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEF DSKSE+NKGRRK P KDKYLKV STEES HIRHEVQM+ PRS+S 
Subjt:  DNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEFADSKSENNKGRRKPPTKDKYLKVTSTEESNHIRHEVQMLTPRSESH

Query:  CGTSVPVQSRSQRRHPKNHVPVSVREFSLLISVHPANIIIIEISLSYFAFILCVFLMVHLAKSLFRFKAGTTLVKSCMHPPLLLPLSLSLYTYNFACLHV
        CGTSVPVQ +S+RRHP  HVPVS                                                                             
Subjt:  CGTSVPVQSRSQRRHPKNHVPVSVREFSLLISVHPANIIIIEISLSYFAFILCVFLMVHLAKSLFRFKAGTTLVKSCMHPPLLLPLSLSLYTYNFACLHV

Query:  SFKWKFFVVFSCSKFKVCYSSAHLHGFLSEDESSATECNNVYSSAKRCKKYDRRRHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLR
                                 GFLSEDESSATEC NVYSSA+RCKKYDRRR QKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLR
Subjt:  SFKWKFFVVFSCSKFKVCYSSAHLHGFLSEDESSATECNNVYSSAKRCKKYDRRRHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLR

Query:  DKWRNLLKASCVNIQNRKGIERKQSHASRPLPKSLLQRVYELANI
        DKWRNLL+ASCVNIQN+KG+E KQ+HASRPLPKSLLQRVYELANI
Subjt:  DKWRNLLKASCVNIQNRKGIERKQSHASRPLPKSLLQRVYELANI

A0A1S3BLJ8 uncharacterized protein LOC103491166 isoform X31.5e-19172.2Show/hide
Query:  MDQEVHFCQKFTNMKSHWVKVEGPFLTAPLNDSNEVGDYLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNSGGLDSNSKQGGEHELKFGDLDQLLDDANEV
        MDQEVHFCQKFTNMKSHWVKVEGPFL APLNDSNEV D LVE KS+HVLGNCLRVQDFSCDFGYGIQTN GGLDSNSKQGGEHELKFGD DQLLDDANEV
Subjt:  MDQEVHFCQKFTNMKSHWVKVEGPFLTAPLNDSNEVGDYLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNSGGLDSNSKQGGEHELKFGDLDQLLDDANEV

Query:  GEFHATNNLPNTYAEVAENSFRQNRGLQLGNLSSESKSQGPSRSGTDAFGISELSATMVMDDEFNNTPVERGLTHELSPGLRTKGRCVTPLEGNICDTIL
        GEFHATNNLPNTYAEVAENSFR+NR  QLGN SSE+KS GPSR  TDAFGISELSATMVM+ EFNNTPVERGLTHELSPGL TKGRCVTPLEGNIC TIL
Subjt:  GEFHATNNLPNTYAEVAENSFRQNRGLQLGNLSSESKSQGPSRSGTDAFGISELSATMVMDDEFNNTPVERGLTHELSPGLRTKGRCVTPLEGNICDTIL

Query:  DNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEFADSKSENNKGRRKPPTKDKYLKVTSTEESNHIRHEVQMLTPRSESH
        DNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEF DSKSE+NKGRRK P KDKYLKV STEES HIRHEVQM+ PRS+S 
Subjt:  DNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEFADSKSENNKGRRKPPTKDKYLKVTSTEESNHIRHEVQMLTPRSESH

Query:  CGTSVPVQSRSQRRHPKNHVPVSVREFSLLISVHPANIIIIEISLSYFAFILCVFLMVHLAKSLFRFKAGTTLVKSCMHPPLLLPLSLSLYTYNFACLHV
        CGTSVPVQ +S+RRHP  HVPVS                                                                             
Subjt:  CGTSVPVQSRSQRRHPKNHVPVSVREFSLLISVHPANIIIIEISLSYFAFILCVFLMVHLAKSLFRFKAGTTLVKSCMHPPLLLPLSLSLYTYNFACLHV

Query:  SFKWKFFVVFSCSKFKVCYSSAHLHGFLSEDESSATECNNVYSSAKRCKKYDRRRHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLR
                                 GFLSEDESSATEC NVYSSA+RCKKYDRRR QKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLR
Subjt:  SFKWKFFVVFSCSKFKVCYSSAHLHGFLSEDESSATECNNVYSSAKRCKKYDRRRHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLR

A0A1S3BLK0 uncharacterized protein LOC103491166 isoform X25.8e-21573.58Show/hide
Query:  MDQEVHFCQKFTNMKSHWVKVEGPFLTAPLNDSNEVGDYLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNSGGLDSNSKQGGEHELKFGDLDQLLDDANEV
        MDQEVHFCQKFTNMKSHWVKVEGPFL APLNDSNEV D LVE KS+HVLGNCLRVQDFSCDFGYGIQTN GGLDSNSKQGGEHELKFGD DQLLDDANEV
Subjt:  MDQEVHFCQKFTNMKSHWVKVEGPFLTAPLNDSNEVGDYLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNSGGLDSNSKQGGEHELKFGDLDQLLDDANEV

Query:  GEFHATNNLPNTYAEVAENSFRQNRGLQLGNLSSESKSQGPSRSGTDAFGISELSATMVMDDEFNNTPVERGLTHELSPGLRTKGRCVTPLEGNICDTIL
        GEFHATNNLPNTYAEVAENSFR+NR  QLGN SSE+KS GPSR  TDAFGISELSATMVM+ EFNNTPVERGLTHELSPGL TKGRCVTPLEGNIC TIL
Subjt:  GEFHATNNLPNTYAEVAENSFRQNRGLQLGNLSSESKSQGPSRSGTDAFGISELSATMVMDDEFNNTPVERGLTHELSPGLRTKGRCVTPLEGNICDTIL

Query:  DNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEFADSKSENNKGRRKPPTKDKYLKVTSTEESNHIRHEVQMLTPRSESH
        DNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEF DSKSE+NKGRRK P KDKYLKV STEES HIRHEVQM+ PRS+S 
Subjt:  DNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEFADSKSENNKGRRKPPTKDKYLKVTSTEESNHIRHEVQMLTPRSESH

Query:  CGTSVPVQSRSQRRHPKNHVPVSVREFSLLISVHPANIIIIEISLSYFAFILCVFLMVHLAKSLFRFKAGTTLVKSCMHPPLLLPLSLSLYTYNFACLHV
        CGTSVPVQ +S+RRHP  HVPVS                                                                             
Subjt:  CGTSVPVQSRSQRRHPKNHVPVSVREFSLLISVHPANIIIIEISLSYFAFILCVFLMVHLAKSLFRFKAGTTLVKSCMHPPLLLPLSLSLYTYNFACLHV

Query:  SFKWKFFVVFSCSKFKVCYSSAHLHGFLSEDESSATECNNVYSSAKRCKKYDRRRHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLR
                                 GFLSEDESSATEC NVYSSA+RCKKYDRRR QKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLR
Subjt:  SFKWKFFVVFSCSKFKVCYSSAHLHGFLSEDESSATECNNVYSSAKRCKKYDRRRHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLR

Query:  DKWRNLLKASCVNIQNRKGIERKQSHASRPLPKSLLQRVYELANI
        DKWRNLL+ASCVNIQN+KG+E KQ+HASRPLPKSLLQRVYELANI
Subjt:  DKWRNLLKASCVNIQNRKGIERKQSHASRPLPKSLLQRVYELANI

A0A6J1CRG2 uncharacterized protein LOC111013581 isoform X21.5e-19968.5Show/hide
Query:  MDQEVHFCQKFTNMKSHWVKVEGPFLTAPLNDSNEVGDYLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNSGGLDSNSKQGGEHELKFGDLDQLLDDANEV
        MDQEVHFCQKFTNMKSHWVKV+G FL APLN+ NEV   LVEPKS+HVLG+CLR QDFSCDF YGIQTN GGLDSNSKQ GEHELKF DLDQLL D NEV
Subjt:  MDQEVHFCQKFTNMKSHWVKVEGPFLTAPLNDSNEVGDYLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNSGGLDSNSKQGGEHELKFGDLDQLLDDANEV

Query:  GEFHATNNLPNTYAEVAENSFRQNRGLQLGNLSSESKSQGPSRSGTDAFGISELSATMVMDDEFNN-TPVERGLTHELSPGLRTKGRCVTPLEGNICDTI
         EFHATNNLPNTY EVAENSFR+NRGLQLGNLSSESKSQG SR+ T+AF ISELSA MV + E NN TPV+RGLTHEL  GLRTKGRC TPL+G+IC TI
Subjt:  GEFHATNNLPNTYAEVAENSFRQNRGLQLGNLSSESKSQGPSRSGTDAFGISELSATMVMDDEFNN-TPVERGLTHELSPGLRTKGRCVTPLEGNICDTI

Query:  LDNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEFADSKSENNKGRRKPPTKDKYLKVTSTEESNHIRHEVQMLTPRSES
        LDN NIHKF+TNE  +ENG LSDENVKG+I A++LA CSR+RRLRKPTRRYIEEFADSKSE++KG+RKPPTKDKY+KVTS EESNHIRH+VQMLTP  ES
Subjt:  LDNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEFADSKSENNKGRRKPPTKDKYLKVTSTEESNHIRHEVQMLTPRSES

Query:  HCGTSVPVQSRSQRRHPKNHVPVSVREFSLLISVHPANIIIIEISLSYFAFILCVFLMVHLAKSLFRFKAGTTLVKSCMHPPLLLPLSLSLYTYNFACLH
        HCGTS+PVQSRSQRR PK HVPVS                                                                            
Subjt:  HCGTSVPVQSRSQRRHPKNHVPVSVREFSLLISVHPANIIIIEISLSYFAFILCVFLMVHLAKSLFRFKAGTTLVKSCMHPPLLLPLSLSLYTYNFACLH

Query:  VSFKWKFFVVFSCSKFKVCYSSAHLHGFLSEDESSATECNNVYSSAKRCKKYDRRRHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDL
                                  GFLSE+ESSATEC  VYSSAKRCKK+DRR+HQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFA+SP+RTPIDL
Subjt:  VSFKWKFFVVFSCSKFKVCYSSAHLHGFLSEDESSATECNNVYSSAKRCKKYDRRRHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDL

Query:  RDKWRNLLKASCVNIQNRKGIERKQSHASRPLPKSLLQRVYELANI
        RDKWRNLL+ASCVNIQNR GIERKQSHASRPLPKSLLQRVYELANI
Subjt:  RDKWRNLLKASCVNIQNRKGIERKQSHASRPLPKSLLQRVYELANI

A0A6J1CRQ1 uncharacterized protein LOC111013581 isoform X13.8e-19868.37Show/hide
Query:  MDQEVHFCQKFTNMKSHWVKVEGPFLTAPLNDSNEVGDYLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNSGGLDSNSKQGGEHELKFGDLDQLLDDANEV
        MDQEVHFCQKFTNMKSHWVKV+G FL APLN+ NEV   LVEPKS+HVLG+CLR QDFSCDF YGIQTN GGLDSNSKQ GEHELKF DLDQLL D NEV
Subjt:  MDQEVHFCQKFTNMKSHWVKVEGPFLTAPLNDSNEVGDYLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNSGGLDSNSKQGGEHELKFGDLDQLLDDANEV

Query:  GEFHATNNLPNTYAEVAENSFRQNRGLQLGNLSSESKSQGPSRSGTDAFGISELSATMVMDDEFNN-TPVERGLTHELSPGLRTKGRCVTPLEGNICDTI
         EFHATNNLPNTY EVAENSFR+NRGLQLGNLSSESKSQG SR+ T+AF ISELSA MV + E NN TPV+RGLTHEL  GLRTKGRC TPL+G+IC TI
Subjt:  GEFHATNNLPNTYAEVAENSFRQNRGLQLGNLSSESKSQGPSRSGTDAFGISELSATMVMDDEFNN-TPVERGLTHELSPGLRTKGRCVTPLEGNICDTI

Query:  LDNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEFADSKSENNKGRRKPPTKDKYLKVTSTEESNHIRHEVQMLTPRSES
        LDN NIHKF+TNE  +ENG LSDENVKG+I A++LA CSR+RRLRKPTRRYIEEFADSKSE++KG+RKPPTKDKY+KVTS EESNHIRH+VQMLTP  ES
Subjt:  LDNRNIHKFNTNENYIENGDLSDENVKGDIVANELASCSRERRLRKPTRRYIEEFADSKSENNKGRRKPPTKDKYLKVTSTEESNHIRHEVQMLTPRSES

Query:  HCGTSVPVQSRSQRRHPKNHVPVSVREFSLLISVHPANIIIIEISLSYFAFILCVFLMVHLAKSLFRFKAGTTLVKSCMHPPLLLPLSLSLYTYNFACLH
        HCGTS+PVQSRSQRR PK HVPVS                                                                            
Subjt:  HCGTSVPVQSRSQRRHPKNHVPVSVREFSLLISVHPANIIIIEISLSYFAFILCVFLMVHLAKSLFRFKAGTTLVKSCMHPPLLLPLSLSLYTYNFACLH

Query:  VSFKWKFFVVFSCSKFKVCYSSAHLHGFLSEDESSATECNNVYSSAKRCKKYDRRRHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDL
                                  GFLSE+ESSATEC  VYSSAKRCKK+DRR+HQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFA+SP+RTPIDL
Subjt:  VSFKWKFFVVFSCSKFKVCYSSAHLHGFLSEDESSATECNNVYSSAKRCKKYDRRRHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDL

Query:  R-DKWRNLLKASCVNIQNRKGIERKQSHASRPLPKSLLQRVYELANI
        R DKWRNLL+ASCVNIQNR GIERKQSHASRPLPKSLLQRVYELANI
Subjt:  R-DKWRNLLKASCVNIQNRKGIERKQSHASRPLPKSLLQRVYELANI

SwissProt top hitse value%identityAlignment
Q9C7B1 Telomere repeat-binding protein 36.8e-1137.93Show/hide
Query:  RRRHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLKASCVNIQNRKGIERKQSHASRPLPKSLLQRV
        +RR ++ +++TEV  LV  + E GTGRW  +K   F  + HRT +DL+DKW+ L+  + ++ Q R+G          P+P+ LL RV
Subjt:  RRRHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLKASCVNIQNRKGIERKQSHASRPLPKSLLQRV

Q9FFY9 Telomere repeat-binding protein 46.8e-1136.11Show/hide
Query:  ESSATECNNVYSSAKRCKKYDRRRHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLKASCVNIQNRKGIERKQSHASRPL
        ES A     V    KR  +  +RR ++ +++TEV  LV  + E GTGRW  +K   F ++ HRT +DL+DKW+ L+  + ++ Q R+G          P+
Subjt:  ESSATECNNVYSSAKRCKKYDRRRHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLKASCVNIQNRKGIERKQSHASRPL

Query:  PKSLLQRV
        P+ LL RV
Subjt:  PKSLLQRV

Q9LL45 Telomere-binding protein 17.5e-1034.86Show/hide
Query:  DESSATECNNVYSSAKRCKKYDRRRHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLKASCVNIQNRKGIERKQSHASRP
        D +S    N   S +KR   + +RR ++ +T+ EV  LV+ +   GTGRW  +K   F +  HRT +DL+DKW+ L+  + +  Q R+G          P
Subjt:  DESSATECNNVYSSAKRCKKYDRRRHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLKASCVNIQNRKGIERKQSHASRP

Query:  LPKSLLQRV
        +P+ LL RV
Subjt:  LPKSLLQRV

Q9M347 Telomere repeat-binding protein 63.4e-1036.78Show/hide
Query:  RRRHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLKASCVNIQNRKGIERKQSHASRPLPKSLLQRV
        +RR ++ +T++EV  LV  +   GTGRW  +K H F    HRT +DL+DKW+ L+  + ++ + R+G          P+P+ LL RV
Subjt:  RRRHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLKASCVNIQNRKGIERKQSHASRPLPKSLLQRV

Q9SNB9 Telomere repeat-binding protein 24.4e-1036.78Show/hide
Query:  RRRHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLKASCVNIQNRKGIERKQSHASRPLPKSLLQRV
        +RR ++ +++TEV  LV  + + GTGRW  +K   F  + HRT +DL+DKW+ L+  + ++ Q R+G          P+P+ LL RV
Subjt:  RRRHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLKASCVNIQNRKGIERKQSHASRPLPKSLLQRV

Arabidopsis top hitse value%identityAlignment
AT1G17460.1 TRF-like 34.2e-1638.39Show/hide
Query:  ESSATECNNVYSSAKRCKKYDRRRHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLKASCVNIQNRKGIERKQSHASRPL
        +SS    ++ +  A   +    R+  + WT++EV +LV+G+++YG G+WT IKK  F+   HRT +DL+DKWRNL KAS  N +   G+++   H S  +
Subjt:  ESSATECNNVYSSAKRCKKYDRRRHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLKASCVNIQNRKGIERKQSHASRPL

Query:  PKSLLQRVYELA
        P  ++ +V ELA
Subjt:  PKSLLQRVYELA

AT1G72650.1 TRF-like 62.0e-1827.85Show/hide
Query:  RRLRKPTRRYIEEFADSKSENNKGRRKPPTKDKYLKVTSTEESNHIRHEVQMLTPRSESHCGT--SVPVQSRSQRRHPKNHVPVSVREFSLLISVHPANI
        +R+RKPTRRYIEE +++  +    +   P+KD+ L   S   S  +    ++   R  S  G+   VP  S  +R  P+ ++         L+  H    
Subjt:  RRLRKPTRRYIEEFADSKSENNKGRRKPPTKDKYLKVTSTEESNHIRHEVQMLTPRSESHCGT--SVPVQSRSQRRHPKNHVPVSVREFSLLISVHPANI

Query:  IIIEISLSYFAFILCVFLMVHLAKSLFRFKAGTTLVKSCMHPPLLLPLSLSLYTYNFACLHVSFKWKFFVVFSCSKFKVCYSSAHLHGFLSEDESSATEC
                                S    KA  +  +S ++   L P  LS    N   +  S        F+ S         ++   LSE +    E 
Subjt:  IIIEISLSYFAFILCVFLMVHLAKSLFRFKAGTTLVKSCMHPPLLLPLSLSLYTYNFACLHVSFKWKFFVVFSCSKFKVCYSSAHLHGFLSEDESSATEC

Query:  NNVYSSAKRCKKYD-----------RRRHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLKASCVNIQNRKGIERKQSHA
         ++ SS     + +           RR+H + WTL+E+ +LV+G+++YG G+W+ IKKHLF+S  +RT +DL+DKWRNLLK S     +   +   + H 
Subjt:  NNVYSSAKRCKKYD-----------RRRHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLKASCVNIQNRKGIERKQSHA

Query:  SRPLPKSLLQRVYELA
        S  +P  +L RV ELA
Subjt:  SRPLPKSLLQRVYELA

AT1G72650.2 TRF-like 62.0e-1827.85Show/hide
Query:  RRLRKPTRRYIEEFADSKSENNKGRRKPPTKDKYLKVTSTEESNHIRHEVQMLTPRSESHCGT--SVPVQSRSQRRHPKNHVPVSVREFSLLISVHPANI
        +R+RKPTRRYIEE +++  +    +   P+KD+ L   S   S  +    ++   R  S  G+   VP  S  +R  P+ ++         L+  H    
Subjt:  RRLRKPTRRYIEEFADSKSENNKGRRKPPTKDKYLKVTSTEESNHIRHEVQMLTPRSESHCGT--SVPVQSRSQRRHPKNHVPVSVREFSLLISVHPANI

Query:  IIIEISLSYFAFILCVFLMVHLAKSLFRFKAGTTLVKSCMHPPLLLPLSLSLYTYNFACLHVSFKWKFFVVFSCSKFKVCYSSAHLHGFLSEDESSATEC
                                S    KA  +  +S ++   L P  LS    N   +  S        F+ S         ++   LSE +    E 
Subjt:  IIIEISLSYFAFILCVFLMVHLAKSLFRFKAGTTLVKSCMHPPLLLPLSLSLYTYNFACLHVSFKWKFFVVFSCSKFKVCYSSAHLHGFLSEDESSATEC

Query:  NNVYSSAKRCKKYD-----------RRRHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLKASCVNIQNRKGIERKQSHA
         ++ SS     + +           RR+H + WTL+E+ +LV+G+++YG G+W+ IKKHLF+S  +RT +DL+DKWRNLLK S     +   +   + H 
Subjt:  NNVYSSAKRCKKYD-----------RRRHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLKASCVNIQNRKGIERKQSHA

Query:  SRPLPKSLLQRVYELA
        S  +P  +L RV ELA
Subjt:  SRPLPKSLLQRVYELA

AT2G37025.1 TRF-like 82.9e-2544.53Show/hide
Query:  YSSAHLHGFLSEDESSATECNNVYSSAKRCK-KYDRRRHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLKASCVNIQNR
        Y    +    S+D+ + +E  +  S  K  + K DRR++Q++WTL EVM LVDGI+ +G G+WT IK H F  + HR P+D+RDKWRNLLKAS     N 
Subjt:  YSSAHLHGFLSEDESSATECNNVYSSAKRCK-KYDRRRHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLKASCVNIQNR

Query:  KGIERKQSHASRPLPKSLLQRVYELANI
           E K+   +R +PK +L RV ELA++
Subjt:  KGIERKQSHASRPLPKSLLQRVYELANI

AT2G37025.2 TRF-like 82.9e-2544.53Show/hide
Query:  YSSAHLHGFLSEDESSATECNNVYSSAKRCK-KYDRRRHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLKASCVNIQNR
        Y    +    S+D+ + +E  +  S  K  + K DRR++Q++WTL EVM LVDGI+ +G G+WT IK H F  + HR P+D+RDKWRNLLKAS     N 
Subjt:  YSSAHLHGFLSEDESSATECNNVYSSAKRCK-KYDRRRHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLKASCVNIQNR

Query:  KGIERKQSHASRPLPKSLLQRVYELANI
           E K+   +R +PK +L RV ELA++
Subjt:  KGIERKQSHASRPLPKSLLQRVYELANI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATCAAGAAGTGCATTTCTGCCAGAAGTTCACAAATATGAAATCTCATTGGGTAAAAGTGGAGGGACCCTTTCTTACTGCGCCATTAAATGATTCAAATGAAGTTGG
GGATTATCTTGTGGAGCCTAAAAGCGACCATGTTTTAGGAAATTGCTTGAGGGTTCAAGATTTCTCTTGTGACTTCGGCTATGGAATACAAACAAACAGCGGTGGATTGG
ATTCTAATAGCAAGCAGGGAGGCGAACATGAACTTAAATTTGGAGATCTTGATCAACTGCTGGATGATGCCAATGAAGTAGGGGAATTCCATGCAACAAACAATCTGCCA
AATACATATGCCGAAGTTGCTGAAAATTCTTTCAGACAGAATAGGGGATTACAATTGGGAAACTTAAGTTCAGAGAGTAAATCTCAGGGACCAAGCAGGAGTGGTACTGA
TGCTTTTGGAATATCAGAATTGTCAGCAACAATGGTAATGGACGATGAATTCAATAATACACCTGTTGAAAGGGGTTTAACTCATGAGTTGTCCCCGGGTCTGAGGACCA
AAGGTAGGTGTGTAACACCACTTGAAGGCAACATCTGTGATACGATACTTGATAATAGAAATATCCATAAGTTCAATACTAATGAAAACTATATAGAAAATGGCGATTTA
TCTGATGAAAATGTGAAGGGTGATATTGTGGCAAACGAACTTGCCAGTTGTTCAAGGGAGAGGAGATTGCGTAAGCCTACTCGAAGATACATTGAAGAATTTGCCGATTC
AAAGTCTGAAAATAACAAGGGAAGGAGAAAACCTCCTACAAAAGATAAATACCTGAAAGTGACGTCTACGGAAGAATCCAATCACATTAGACATGAGGTACAAATGTTGA
CTCCTAGAAGTGAATCGCATTGTGGTACGTCTGTTCCAGTGCAGTCTCGATCTCAAAGAAGACATCCAAAGAACCATGTACCAGTTTCAGTACGAGAGTTTTCTCTCTTG
ATATCTGTTCATCCTGCCAATATTATTATCATTGAGATTTCTTTGTCTTATTTTGCCTTTATATTATGCGTTTTTCTAATGGTTCATTTGGCCAAAAGTCTTTTTAGATT
TAAAGCTGGAACCACTTTAGTGAAGTCATGCATGCATCCCCCCCTTCTCCTTCCTCTCTCTCTCTCTCTCTACACATACAATTTTGCCTGTCTACACGTGAGTTTTAAAT
GGAAATTTTTTGTTGTCTTTTCCTGCAGTAAGTTCAAAGTTTGCTACAGCTCCGCTCATCTACATGGATTTCTATCTGAAGATGAATCTTCTGCAACTGAGTGTAACAAT
GTTTATTCATCTGCTAAAAGATGTAAAAAGTATGATAGGAGGAGGCACCAGAAGATGTGGACCCTTACTGAAGTAATGCGATTAGTTGATGGAATCGCTGAATATGGAAC
TGGCCGCTGGACTCATATAAAGAAGCACCTATTTGCATCATCTCCTCATCGCACACCTATAGATCTCAGGGACAAATGGCGAAATCTTCTGAAAGCTAGCTGTGTTAACA
TACAGAACAGAAAAGGGATCGAACGGAAGCAGTCACATGCCTCACGTCCACTGCCAAAGTCCCTGCTCCAACGTGTTTATGAACTGGCCAATATCTAG
mRNA sequenceShow/hide mRNA sequence
GACATGTAGGATTGATTGAGCTTAAAAGTAAACAAATTATGGATCAAGAAGTGCATTTCTGCCAGAAGTTCACAAATATGAAATCTCATTGGGTAAAAGTGGAGGGACCC
TTTCTTACTGCGCCATTAAATGATTCAAATGAAGTTGGGGATTATCTTGTGGAGCCTAAAAGCGACCATGTTTTAGGAAATTGCTTGAGGGTTCAAGATTTCTCTTGTGA
CTTCGGCTATGGAATACAAACAAACAGCGGTGGATTGGATTCTAATAGCAAGCAGGGAGGCGAACATGAACTTAAATTTGGAGATCTTGATCAACTGCTGGATGATGCCA
ATGAAGTAGGGGAATTCCATGCAACAAACAATCTGCCAAATACATATGCCGAAGTTGCTGAAAATTCTTTCAGACAGAATAGGGGATTACAATTGGGAAACTTAAGTTCA
GAGAGTAAATCTCAGGGACCAAGCAGGAGTGGTACTGATGCTTTTGGAATATCAGAATTGTCAGCAACAATGGTAATGGACGATGAATTCAATAATACACCTGTTGAAAG
GGGTTTAACTCATGAGTTGTCCCCGGGTCTGAGGACCAAAGGTAGGTGTGTAACACCACTTGAAGGCAACATCTGTGATACGATACTTGATAATAGAAATATCCATAAGT
TCAATACTAATGAAAACTATATAGAAAATGGCGATTTATCTGATGAAAATGTGAAGGGTGATATTGTGGCAAACGAACTTGCCAGTTGTTCAAGGGAGAGGAGATTGCGT
AAGCCTACTCGAAGATACATTGAAGAATTTGCCGATTCAAAGTCTGAAAATAACAAGGGAAGGAGAAAACCTCCTACAAAAGATAAATACCTGAAAGTGACGTCTACGGA
AGAATCCAATCACATTAGACATGAGGTACAAATGTTGACTCCTAGAAGTGAATCGCATTGTGGTACGTCTGTTCCAGTGCAGTCTCGATCTCAAAGAAGACATCCAAAGA
ACCATGTACCAGTTTCAGTACGAGAGTTTTCTCTCTTGATATCTGTTCATCCTGCCAATATTATTATCATTGAGATTTCTTTGTCTTATTTTGCCTTTATATTATGCGTT
TTTCTAATGGTTCATTTGGCCAAAAGTCTTTTTAGATTTAAAGCTGGAACCACTTTAGTGAAGTCATGCATGCATCCCCCCCTTCTCCTTCCTCTCTCTCTCTCTCTCTA
CACATACAATTTTGCCTGTCTACACGTGAGTTTTAAATGGAAATTTTTTGTTGTCTTTTCCTGCAGTAAGTTCAAAGTTTGCTACAGCTCCGCTCATCTACATGGATTTC
TATCTGAAGATGAATCTTCTGCAACTGAGTGTAACAATGTTTATTCATCTGCTAAAAGATGTAAAAAGTATGATAGGAGGAGGCACCAGAAGATGTGGACCCTTACTGAA
GTAATGCGATTAGTTGATGGAATCGCTGAATATGGAACTGGCCGCTGGACTCATATAAAGAAGCACCTATTTGCATCATCTCCTCATCGCACACCTATAGATCTCAGGGA
CAAATGGCGAAATCTTCTGAAAGCTAGCTGTGTTAACATACAGAACAGAAAAGGGATCGAACGGAAGCAGTCACATGCCTCACGTCCACTGCCAAAGTCCCTGCTCCAAC
GTGTTTATGAACTGGCCAATATCTAGAGGTTATCAGATTCTGTTGCAGCAAGGAAAGGAAGTCATGTTCAAGTTGGAGCTAATTGCTCGGATGGAGGAGAAATCAGCATA
CGCCTTGAGCGAATATCTCCAATCTCCATGGTTACTAATGTTTTACCTCTACTGAAGAACACAACCACTCGCGTATCTTCTACTAATTCTTACTCACTTAGAGGTAAGTG
TATTTTTACTATTCAATTAAATGATTATTACTCTGATTCTTTTCGAAGTATGTATTTTTTCCAGTTTAAATCAACATATTTCTTCTCTATGCCAGGTGACTTCTTATCTA
GATTTTTGTTTACACTATGTATCATTTGGCATCAGTATCTTCTTATTAATAGCTGTCTCTCTCATAGATACATGCATAAACGTAGACCTTTATATACACTAAGGGATGAC
CATCATTAGTCCTTTGGATTTTCTAGACAGAAGGTAATAATTACAGCAGGAAAGCGCACCTCCCTCTACCTTCATACCCTGCCTACTCCATGGAACATAGGAACATGAAG
AGCTAGAACAAAAAGAACCAGGATCTTTCTGGTACACTCTATCAAATATCATATGCTGGTCCTATGGATTTGCAACAGTTACGAGTTACCCACTTCCACCATTGTCATTT
CCTTGCGAGTCGTTGAAGTACTCTCTAGGCAATGAGTGATGGTTGCTAATGCCACTTTCTGGGTGCTTGAAGTTGCTCTTTCTGCTGTTATTTTCTTCAGCTGTTTGTGC
TTGCTTGAAGAGATATTGGTTTCCTTGTCCATGAAGTCCTCCCAAAGCAATCCTTCTTGAAGTCGCCGAGCAAGTGTAGCTCAATACCATAATTGATAAGATAAGTACTA
AGAACTTCATCCTGGGGTTTTGAGTTTCGTACCCTTAGATACATCAGAAATAAGGCTTCAGCATTTATAGAACAATAGCTAAGCTCTTAGATTTGATGATTAATATTAAA
AAAATATCCTAGTGGTTTTATAATTGAAAGTTAGTATTAAAAATGAAGTATGGAGTCTCCACCAGAAGACTAGACAAGATATAAAGTCTTGTATGTATTTTAAGGCCAAT
GCGCTTTATTTAATTAACGAATGATTATGGTTGAAATGAAATTCTTGGTCTACTTTATCAGCCTCAACAGGGAACATCCACCTTCGTAATGGAAATCTTTACGAAGACCT
AAAAGTCAGTTAGGCATAATGATGTTTCCTGTTTGAGTTGCTTAGTGATGTCAATTTATAAATTACAAACACAAACAGTAGTTTCTCAC
Protein sequenceShow/hide protein sequence
MDQEVHFCQKFTNMKSHWVKVEGPFLTAPLNDSNEVGDYLVEPKSDHVLGNCLRVQDFSCDFGYGIQTNSGGLDSNSKQGGEHELKFGDLDQLLDDANEVGEFHATNNLP
NTYAEVAENSFRQNRGLQLGNLSSESKSQGPSRSGTDAFGISELSATMVMDDEFNNTPVERGLTHELSPGLRTKGRCVTPLEGNICDTILDNRNIHKFNTNENYIENGDL
SDENVKGDIVANELASCSRERRLRKPTRRYIEEFADSKSENNKGRRKPPTKDKYLKVTSTEESNHIRHEVQMLTPRSESHCGTSVPVQSRSQRRHPKNHVPVSVREFSLL
ISVHPANIIIIEISLSYFAFILCVFLMVHLAKSLFRFKAGTTLVKSCMHPPLLLPLSLSLYTYNFACLHVSFKWKFFVVFSCSKFKVCYSSAHLHGFLSEDESSATECNN
VYSSAKRCKKYDRRRHQKMWTLTEVMRLVDGIAEYGTGRWTHIKKHLFASSPHRTPIDLRDKWRNLLKASCVNIQNRKGIERKQSHASRPLPKSLLQRVYELANI