; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc11g31410 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc11g31410
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr11:22950938..22956361
RNA-Seq ExpressionMoc11g31410
SyntenyMoc11g31410
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KYP35727.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan]6.3e-17244.58Show/hide
Query:  LNGANFKDWKEDIQIVLGCMDLDLALRVDRPTSTEENPNKVEIKKWDRSNRMCLMIMKRSIPETFRGSIVEGTNAKGFLKEMEQYFTKNDKAEASTLMAK
        LNG NFK WKE ++I+LGCMDLDLALR ++PT   ENP++ +++KW+RSNRMCLMIMKRS+PE F GSI E  NAKGFL  +EQYFT N+K +AS+L+AK
Subjt:  LNGANFKDWKEDIQIVLGCMDLDLALRVDRPTSTEENPNKVEIKKWDRSNRMCLMIMKRSIPETFRGSIVEGTNAKGFLKEMEQYFTKNDKAEASTLMAK

Query:  LTSSRYVGKGNIREYIMQMSNVATKLKALKLEVSEDFLVHLVLNSLPAEYSHFRVSYNTQKDKWYLNELISHCVQEEERMQREKTESVHMASTSKSVKRK
        L S RY GKGNIREYIM+MSN+A+KLKALKLE+S+D LVHLVL SLP  +  F+VSYNTQKDKW LNELISHCVQEEER QREKTES H+AS+S++ KRK
Subjt:  LTSSRYVGKGNIREYIMQMSNVATKLKALKLEVSEDFLVHLVLNSLPAEYSHFRVSYNTQKDKWYLNELISHCVQEEERMQREKTESVHMASTSKSVKRK

Query:  RVNNAAESSKQ-KKEKKQDSGPACFFCKKTGHMKKQCAKYVAWLLKKGMFLSLVCSEINLAFVPMHTWWVDSGATTHISVSMQGCIWSRPPSDAEAFIYV
           +  E + Q KK KK ++ P CFFCKK GHMKK+C KY +W +KKG FLS VCSE+NLAFVP  TWWVDSGATTHISVSMQGC+WSRPPSD E FIYV
Subjt:  RVNNAAESSKQ-KKEKKQDSGPACFFCKKTGHMKKQCAKYVAWLLKKGMFLSLVCSEINLAFVPMHTWWVDSGATTHISVSMQGCIWSRPPSDAEAFIYV

Query:  ADGNRAKVEAIGTFRLSLGTGFHLDLFETFVVPSFRRNLISVSSLDKFGYSCSFGNNKLSLLHNSNIIGTGSLIDNLYMLDNVPFDNESLHVSSR-----
         DGN+  VEAIGTFRL L TGFHLDLFETFVVPSFRRNLIS+SSLDKFG+SCSFGNNK SL  NSN++G+GSLIDNLY+LD V  + E+ H+ SR     
Subjt:  ADGNRAKVEAIGTFRLSLGTGFHLDLFETFVVPSFRRNLISVSSLDKFGYSCSFGNNKLSLLHNSNIIGTGSLIDNLYMLDNVPFDNESLHVSSR-----

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------------------ARPYKPNEKKLDSRTISCYFVGYAERSRGFKFYNPTSRTFFETGNARFLENI
                                                        ARPY+P+E KL+SRT+SCYFVGY ERSRG+KFYNPT+R+FFETGNARFLE++
Subjt:  ------------------------------------------------ARPYKPNEKKLDSRTISCYFVGYAERSRGFKFYNPTSRTFFETGNARFLENI

Query:  EFEGRNKSKDVVFEEENATIMNDVVTLPTIVSKVVLDNDTQDKVPTLDVITSQDNTDDTSPFPIDQPQHTQEVSLRRSTRERR--IRNKSLI-----SDG
        EF      ++VVFEEE     + V+   TI     +  D    +P  D++ +QDN +     PI+Q Q +QEV LRRSTRER+  I +  ++      DG
Subjt:  EFEGRNKSKDVVFEEENATIMNDVVTLPTIVSKVVLDNDTQDKVPTLDVITSQDNTDDTSPFPIDQPQHTQEVSLRRSTRERR--IRNKSLI-----SDG

Query:  FGLT-GSPIHHSTRNQKISGQ
         GLT   PI+     Q  S Q
Subjt:  FGLT-GSPIHHSTRNQKISGQ

KYP39716.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan]3.0e-17444.95Show/hide
Query:  LNGANFKDWKEDIQIVLGCMDLDLALRVDRPTSTEENPNKVEIKKWDRSNRMCLMIMKRSIPETFRGSIVEGTNAKGFLKEMEQYFTKNDKAEASTLMAK
        LNG NFK WKE ++I+LGCMDLDLALR ++PT   ENP++ +++KW+RSNRMCLMIMKRS+PE FRGSI E  NAKGFL  +EQYFT N+KA+AS+L+AK
Subjt:  LNGANFKDWKEDIQIVLGCMDLDLALRVDRPTSTEENPNKVEIKKWDRSNRMCLMIMKRSIPETFRGSIVEGTNAKGFLKEMEQYFTKNDKAEASTLMAK

Query:  LTSSRYVGKGNIREYIMQMSNVATKLKALKLEVSEDFLVHLVLNSLPAEYSHFRVSYNTQKDKWYLNELISHCVQEEERMQREKTESVHMASTSKSVKRK
        L S RY GKGNIREYIM+MSN+A+KLKALKLE+S+D LVHLVL SLP  +  F+VSYNTQKDKW LNELISHCVQEEER QREKTES H+AS+S++ KRK
Subjt:  LTSSRYVGKGNIREYIMQMSNVATKLKALKLEVSEDFLVHLVLNSLPAEYSHFRVSYNTQKDKWYLNELISHCVQEEERMQREKTESVHMASTSKSVKRK

Query:  RVNNAAE-SSKQKKEKKQDSGPACFFCKKTGHMKKQCAKYVAWLLKKGMFLSLVCSEINLAFVPMHTWWVDSGATTHISVSMQGCIWSRPPSDAEAFIYV
           +  E +S+QKK KK ++ P CFFCKK GHMKK+C KY +W +KKG FLS VCSE+NLAFVP  TWWVDSGATTHISVSMQGC+WSRPPSD E FIYV
Subjt:  RVNNAAE-SSKQKKEKKQDSGPACFFCKKTGHMKKQCAKYVAWLLKKGMFLSLVCSEINLAFVPMHTWWVDSGATTHISVSMQGCIWSRPPSDAEAFIYV

Query:  ADGNRAKVEAIGTFRLSLGTGFHLDLFETFVVPSFRRNLISVSSLDKFGYSCSFGNNKLSLLHNSNIIGTGSLIDNLYMLDNVPFDNESLHVSSR-----
         DGN+  VEAIGTFRL L TGFHLDLFETFVVPSFRRNLIS+SSLDKFG+SCSFGNNK SL  NSN++G+GSLIDNLY+LD V  + E+ H+ SR     
Subjt:  ADGNRAKVEAIGTFRLSLGTGFHLDLFETFVVPSFRRNLISVSSLDKFGYSCSFGNNKLSLLHNSNIIGTGSLIDNLYMLDNVPFDNESLHVSSR-----

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------------------ARPYKPNEKKLDSRTISCYFVGYAERSRGFKFYNPTSRTFFETGNARFLENI
                                                        ARPY+P+E KL+SRT+SCYFVGY ERSRG+KFYNPT+R+FFETGNARFLE++
Subjt:  ------------------------------------------------ARPYKPNEKKLDSRTISCYFVGYAERSRGFKFYNPTSRTFFETGNARFLENI

Query:  EFEGRNKSKDVVFEEENATIMNDVVTLPTIVSKVVLDNDTQDKVPTLDVITSQDNTDDTSPFPIDQPQHTQEVSLRRSTRERR--IRNKSLI-----SDG
        EF      ++VVFEEE     + V+   TI     +  D    +P  D++ +QDN +     PI+Q Q +QEV LRRSTRER+  I +  ++      DG
Subjt:  EFEGRNKSKDVVFEEENATIMNDVVTLPTIVSKVVLDNDTQDKVPTLDVITSQDNTDDTSPFPIDQPQHTQEVSLRRSTRERR--IRNKSLI-----SDG

Query:  FGLT-GSPIHHSTRNQKISGQ
         GLT   PI+     Q  S Q
Subjt:  FGLT-GSPIHHSTRNQKISGQ

KYP69815.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan]1.3e-16444.08Show/hide
Query:  LNGANFKDWKEDIQIVLGCMDLDLALRVDRPTSTEENPNKVEIKKWDRSNRMCLMIMKRSIPETFRGSIVEGTNAKGFLKEMEQYFTKNDKAEASTLMAK
        LNG NFK WKE ++I+LGCMDLDLAL  ++PT T ENPN+ +++KW+ SNRMCLMIMKRS+ E  RG I E  NAKGFL  +EQYFT N+K +AS L+AK
Subjt:  LNGANFKDWKEDIQIVLGCMDLDLALRVDRPTSTEENPNKVEIKKWDRSNRMCLMIMKRSIPETFRGSIVEGTNAKGFLKEMEQYFTKNDKAEASTLMAK

Query:  LTSSRYVGKGNIREYIMQMSNVATKLKALKLEVSEDFLVHLVLNSLPAEYSHFRVSYNTQKDKWYLNELISHCVQEEERMQREKTESVHMASTSKSVKRK
        L S RY GKGNIREYIM+MSN+A+KLKALKLE+S+D LVHLVL SLP  +  F+V YNTQKDKW LNELISHCVQEEER  REKTES H+AS+S++ KRK
Subjt:  LTSSRYVGKGNIREYIMQMSNVATKLKALKLEVSEDFLVHLVLNSLPAEYSHFRVSYNTQKDKWYLNELISHCVQEEERMQREKTESVHMASTSKSVKRK

Query:  RVNNAAESSKQ-KKEKKQDSGPACFFCKKTGHMKKQCAKYVAWLLKKGMFLSLVCSEINLAFVPMHTWWVDSGATTHISVSMQGCIWSRPPSDAEAFIYV
           +  E + Q KK KK ++ P CFFCKK GHMKK+C KY +W +KKG FLS VCSE+NLAFVP  TWWVDSGATTHISVSMQGC+WSRPPSD E FIYV
Subjt:  RVNNAAESSKQ-KKEKKQDSGPACFFCKKTGHMKKQCAKYVAWLLKKGMFLSLVCSEINLAFVPMHTWWVDSGATTHISVSMQGCIWSRPPSDAEAFIYV

Query:  ADGNRAKVEAIGTFRLSLGTGFHLDLFETFVVPSFRRNLISVSSLDKFGYSCSFGNNKLSLLHNSNIIGTGSLIDNLYMLDNVPFDNESLHVSSR-----
         DGN+  VEAIGTFRL L TGFHLDLFETFVVPSFRRNLIS+SSLDKFG+SCSFGNNK SL  NSN++G+GSLIDNLY+LD V  + E+ H+ SR     
Subjt:  ADGNRAKVEAIGTFRLSLGTGFHLDLFETFVVPSFRRNLISVSSLDKFGYSCSFGNNKLSLLHNSNIIGTGSLIDNLYMLDNVPFDNESLHVSSR-----

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------------------ARPYKPNEKKLDSRTISCYFVGYAERSRGFKFYNPTSRTFFETGNARFLENI
                                                        ARPY+P+E KL+SRT+SCYFVGY E SRG+KFYN T+++FFETGNARFLE++
Subjt:  ------------------------------------------------ARPYKPNEKKLDSRTISCYFVGYAERSRGFKFYNPTSRTFFETGNARFLENI

Query:  EFEGRNKSKDVVFEEENATIMNDVVTLPTIVSKVVLDNDTQDKVPTLDVITSQDNTDDTSPFPIDQPQHTQEVSLRRSTRERRIRNKSLISDGF
        EF      ++++F+EE     + V+   TI     +       +P  D++  QDN +     PI+Q Q  QEVSLRRSTRER    KS ISD +
Subjt:  EFEGRNKSKDVVFEEENATIMNDVVTLPTIVSKVVLDNDTQDKVPTLDVITSQDNTDDTSPFPIDQPQHTQEVSLRRSTRERRIRNKSLISDGF

RVW13644.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]1.2e-14951Show/hide
Query:  TQVNNIPRLNGANFKDWKEDIQIVLGCMDLDLALRVDRP---TSTEENPNKVEIKKWDRSNRMCLMIMKRSIPETFRGSIVEGTNAKGFLKEMEQYFTKN
        + V NIP LNG NFK WKE + IVLGCMDLD  LR DRP   T       +  I+KW+RSN M LMIMK SIPE  +G+I + T AK FL +    F  N
Subjt:  TQVNNIPRLNGANFKDWKEDIQIVLGCMDLDLALRVDRP---TSTEENPNKVEIKKWDRSNRMCLMIMKRSIPETFRGSIVEGTNAKGFLKEMEQYFTKN

Query:  DKAEASTLMAKLTSSRYVGKGNIREYIMQMSNVATKLKALKLEVSEDFLVHLVLNSLPAEYSHFRVSYNTQKDKWYLNELISHCVQEEERMQREKTESVH
         K E ST+++KL S RY GK NIREYIM+MSN+ T+LK LKLE SED LVHLVL SLP ++S F++SYNTQK+KW LNELI+ CVQEEER+++EK ES H
Subjt:  DKAEASTLMAKLTSSRYVGKGNIREYIMQMSNVATKLKALKLEVSEDFLVHLVLNSLPAEYSHFRVSYNTQKDKWYLNELISHCVQEEERMQREKTESVH

Query:  MASTSKSV---KRKRVNN------AAESSKQKKEKKQDSGPACFFCKKTGHMKKQCAKYVAWLLKKGMFLSLVCSEINLAFVPMHTWWVDSGATTHISVS
        +ASTS+     K+++++N       +E+SKQK +KKQD    CFFCKK GHMKK C KY AW  KKG  L+ VCSEINLA VP  TWW+D GATTHISV+
Subjt:  MASTSKSV---KRKRVNN------AAESSKQKKEKKQDSGPACFFCKKTGHMKKQCAKYVAWLLKKGMFLSLVCSEINLAFVPMHTWWVDSGATTHISVS

Query:  MQGCIWSRPPSDAEAFIYVADGNRAKVEAIGTFRLSLGTGFHLDLFETFVVPSFRRNLISVSSLDKFGYSCSFGNNKLSLLHNSNIIGTGSLID------
        M+GC+ SR P+D E +IYV +GN+  V+AIG FRL L  G  LDL ETFVV SFRRNLISVS LDKFGY CSF N  +  L +  I+      D      
Subjt:  MQGCIWSRPPSDAEAFIYVADGNRAKVEAIGTFRLSLGTGFHLDLFETFVVPSFRRNLISVSSLDKFGYSCSFGNNKLSLLHNSNIIGTGSLID------

Query:  ----------------------NLYMLDNVPFDN----------ESLHVSSRARPYKPNEKKLDSRTISCYFVGYAERSRGFKFYNPTSRTFFETGNARF
                                Y +   P  N          + +   + ARPYKPNEKKLDSRT+SCYFVGY+ERSRGFKFY+P+SR+FFETGNA+F
Subjt:  ----------------------NLYMLDNVPFDN----------ESLHVSSRARPYKPNEKKLDSRTISCYFVGYAERSRGFKFYNPTSRTFFETGNARF

Query:  LENIEFEGRNKSKDVVFEEENATIMNDVVTLPTIVSKVVLDNDTQDKVPTLDVITSQDNTDDTSPFPI--------------DQPQHTQ-EVSLRRSTRE
        +E++E  GR + + VVFE+E+       V++PT     ++ +DT   +  + +IT   +T + SP  +               QPQ  Q +V LRRSTRE
Subjt:  LENIEFEGRNKSKDVVFEEENATIMNDVVTLPTIVSKVVLDNDTQDKVPTLDVITSQDNTDDTSPFPI--------------DQPQHTQ-EVSLRRSTRE

Query:  RR
        +R
Subjt:  RR

RVX10077.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]3.0e-15050.15Show/hide
Query:  VNNIPRLNGANFKDWKEDIQIVLGCMDLDLALRVDRP---TSTEENPNKVEIKKWDRSNRMCLMIMKRSIPETFRGSIVEGTNAKGFLKEMEQYFTKNDK
        VNNIP LNG NFK WKE + IVLGCMDLD ALR DRP   TS      +  ++KW+RSNRM LMIMK SIPE  RG+I E T AK FL ++   FT N+K
Subjt:  VNNIPRLNGANFKDWKEDIQIVLGCMDLDLALRVDRP---TSTEENPNKVEIKKWDRSNRMCLMIMKRSIPETFRGSIVEGTNAKGFLKEMEQYFTKNDK

Query:  AEASTLMAKLTSSRYVGKGNIREYIMQMSNVATKLKALKLEVSEDFLVHLVLNSLPAEYSHFRVSYNTQKDKWYLNELISHCVQEEERMQREKTESVHMA
         E ST+++KL S RY GK NIREYIM+MSN+ T+LKALKLE+SED L+HLVL S+P ++S F++SYNTQK+KW LNELI+ CVQEEER+++EK ES H+A
Subjt:  AEASTLMAKLTSSRYVGKGNIREYIMQMSNVATKLKALKLEVSEDFLVHLVLNSLPAEYSHFRVSYNTQKDKWYLNELISHCVQEEERMQREKTESVHMA

Query:  STSK----SVKRKRVNNAAE-----SSKQKKEKKQDSGPACFFCKKTGHMKKQCAKYVAWLLKKGMFLSLVCSEINLAFVPMHTWWVDSGATTHISVSMQ
        STS+    + KRKR N   +     +SKQK++KKQD    CFFCKK GHMKK C KY AW  KKG  L+ VCSEINLA VP  TWW+D+GATTHISV+MQ
Subjt:  STSK----SVKRKRVNNAAE-----SSKQKKEKKQDSGPACFFCKKTGHMKKQCAKYVAWLLKKGMFLSLVCSEINLAFVPMHTWWVDSGATTHISVSMQ

Query:  GCIWSRPPSDAEAFIYVADGNRAKVEAIGTFRLSLGTGFHLDLFETFVVPSFRRNLISVSSLDKFGYSCSFGN---------NKLS--------------
        GC+ SR P+D E +IYV + N+  V+AIG FRL L +G  LDL ETFVVPSFRRNLISVS LDKFGY CSFGN         N+LS              
Subjt:  GCIWSRPPSDAEAFIYVADGNRAKVEAIGTFRLSLGTGFHLDLFETFVVPSFRRNLISVSSLDKFGYSCSFGN---------NKLS--------------

Query:  -----------------------LLHNSNIIGTGSL-------------IDNLYMLDNVPF----------------DNESLHV---SSRARPYKPNEKK
                               ++    + GT S              +  +Y+L+ VP                     LHV    + ARPYKPNEKK
Subjt:  -----------------------LLHNSNIIGTGSL-------------IDNLYMLDNVPF----------------DNESLHV---SSRARPYKPNEKK

Query:  LDSRTISCYFVGYAERSRGFKFYNPTSRTFFETGNARFLENIEFEGRNKSKDVVFEEENATIMNDVVTLPTIVS--KVVLDNDTQDKVPTLDVITSQDNT
        LDSR +SCYFVGY+ERSRGFKFY+P++R+FFE GNA+F+E++E  GR   + VVFEEE        V +P I +    ++ NDT   +  +  IT   +T
Subjt:  LDSRTISCYFVGYAERSRGFKFYNPTSRTFFETGNARFLENIEFEGRNKSKDVVFEEENATIMNDVVTLPTIVS--KVVLDNDTQDKVPTLDVITSQDNT

Query:  DDTSPFPI-----------DQPQHTQ-EVSLRRSTRERRIRNKSLISDGF
         +  P  +            QPQ  Q +V LRRSTRERR    S ISD +
Subjt:  DDTSPFPI-----------DQPQHTQ-EVSLRRSTRERRIRNKSLISDGF

TrEMBL top hitse value%identityAlignment
A0A151QZJ9 Retrovirus-related Pol polyprotein from transposon TNT 1-943.0e-17244.58Show/hide
Query:  LNGANFKDWKEDIQIVLGCMDLDLALRVDRPTSTEENPNKVEIKKWDRSNRMCLMIMKRSIPETFRGSIVEGTNAKGFLKEMEQYFTKNDKAEASTLMAK
        LNG NFK WKE ++I+LGCMDLDLALR ++PT   ENP++ +++KW+RSNRMCLMIMKRS+PE F GSI E  NAKGFL  +EQYFT N+K +AS+L+AK
Subjt:  LNGANFKDWKEDIQIVLGCMDLDLALRVDRPTSTEENPNKVEIKKWDRSNRMCLMIMKRSIPETFRGSIVEGTNAKGFLKEMEQYFTKNDKAEASTLMAK

Query:  LTSSRYVGKGNIREYIMQMSNVATKLKALKLEVSEDFLVHLVLNSLPAEYSHFRVSYNTQKDKWYLNELISHCVQEEERMQREKTESVHMASTSKSVKRK
        L S RY GKGNIREYIM+MSN+A+KLKALKLE+S+D LVHLVL SLP  +  F+VSYNTQKDKW LNELISHCVQEEER QREKTES H+AS+S++ KRK
Subjt:  LTSSRYVGKGNIREYIMQMSNVATKLKALKLEVSEDFLVHLVLNSLPAEYSHFRVSYNTQKDKWYLNELISHCVQEEERMQREKTESVHMASTSKSVKRK

Query:  RVNNAAESSKQ-KKEKKQDSGPACFFCKKTGHMKKQCAKYVAWLLKKGMFLSLVCSEINLAFVPMHTWWVDSGATTHISVSMQGCIWSRPPSDAEAFIYV
           +  E + Q KK KK ++ P CFFCKK GHMKK+C KY +W +KKG FLS VCSE+NLAFVP  TWWVDSGATTHISVSMQGC+WSRPPSD E FIYV
Subjt:  RVNNAAESSKQ-KKEKKQDSGPACFFCKKTGHMKKQCAKYVAWLLKKGMFLSLVCSEINLAFVPMHTWWVDSGATTHISVSMQGCIWSRPPSDAEAFIYV

Query:  ADGNRAKVEAIGTFRLSLGTGFHLDLFETFVVPSFRRNLISVSSLDKFGYSCSFGNNKLSLLHNSNIIGTGSLIDNLYMLDNVPFDNESLHVSSR-----
         DGN+  VEAIGTFRL L TGFHLDLFETFVVPSFRRNLIS+SSLDKFG+SCSFGNNK SL  NSN++G+GSLIDNLY+LD V  + E+ H+ SR     
Subjt:  ADGNRAKVEAIGTFRLSLGTGFHLDLFETFVVPSFRRNLISVSSLDKFGYSCSFGNNKLSLLHNSNIIGTGSLIDNLYMLDNVPFDNESLHVSSR-----

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------------------ARPYKPNEKKLDSRTISCYFVGYAERSRGFKFYNPTSRTFFETGNARFLENI
                                                        ARPY+P+E KL+SRT+SCYFVGY ERSRG+KFYNPT+R+FFETGNARFLE++
Subjt:  ------------------------------------------------ARPYKPNEKKLDSRTISCYFVGYAERSRGFKFYNPTSRTFFETGNARFLENI

Query:  EFEGRNKSKDVVFEEENATIMNDVVTLPTIVSKVVLDNDTQDKVPTLDVITSQDNTDDTSPFPIDQPQHTQEVSLRRSTRERR--IRNKSLI-----SDG
        EF      ++VVFEEE     + V+   TI     +  D    +P  D++ +QDN +     PI+Q Q +QEV LRRSTRER+  I +  ++      DG
Subjt:  EFEGRNKSKDVVFEEENATIMNDVVTLPTIVSKVVLDNDTQDKVPTLDVITSQDNTDDTSPFPIDQPQHTQEVSLRRSTRERR--IRNKSLI-----SDG

Query:  FGLT-GSPIHHSTRNQKISGQ
         GLT   PI+     Q  S Q
Subjt:  FGLT-GSPIHHSTRNQKISGQ

A0A151RB35 Retrovirus-related Pol polyprotein from transposon TNT 1-941.5e-17444.95Show/hide
Query:  LNGANFKDWKEDIQIVLGCMDLDLALRVDRPTSTEENPNKVEIKKWDRSNRMCLMIMKRSIPETFRGSIVEGTNAKGFLKEMEQYFTKNDKAEASTLMAK
        LNG NFK WKE ++I+LGCMDLDLALR ++PT   ENP++ +++KW+RSNRMCLMIMKRS+PE FRGSI E  NAKGFL  +EQYFT N+KA+AS+L+AK
Subjt:  LNGANFKDWKEDIQIVLGCMDLDLALRVDRPTSTEENPNKVEIKKWDRSNRMCLMIMKRSIPETFRGSIVEGTNAKGFLKEMEQYFTKNDKAEASTLMAK

Query:  LTSSRYVGKGNIREYIMQMSNVATKLKALKLEVSEDFLVHLVLNSLPAEYSHFRVSYNTQKDKWYLNELISHCVQEEERMQREKTESVHMASTSKSVKRK
        L S RY GKGNIREYIM+MSN+A+KLKALKLE+S+D LVHLVL SLP  +  F+VSYNTQKDKW LNELISHCVQEEER QREKTES H+AS+S++ KRK
Subjt:  LTSSRYVGKGNIREYIMQMSNVATKLKALKLEVSEDFLVHLVLNSLPAEYSHFRVSYNTQKDKWYLNELISHCVQEEERMQREKTESVHMASTSKSVKRK

Query:  RVNNAAE-SSKQKKEKKQDSGPACFFCKKTGHMKKQCAKYVAWLLKKGMFLSLVCSEINLAFVPMHTWWVDSGATTHISVSMQGCIWSRPPSDAEAFIYV
           +  E +S+QKK KK ++ P CFFCKK GHMKK+C KY +W +KKG FLS VCSE+NLAFVP  TWWVDSGATTHISVSMQGC+WSRPPSD E FIYV
Subjt:  RVNNAAE-SSKQKKEKKQDSGPACFFCKKTGHMKKQCAKYVAWLLKKGMFLSLVCSEINLAFVPMHTWWVDSGATTHISVSMQGCIWSRPPSDAEAFIYV

Query:  ADGNRAKVEAIGTFRLSLGTGFHLDLFETFVVPSFRRNLISVSSLDKFGYSCSFGNNKLSLLHNSNIIGTGSLIDNLYMLDNVPFDNESLHVSSR-----
         DGN+  VEAIGTFRL L TGFHLDLFETFVVPSFRRNLIS+SSLDKFG+SCSFGNNK SL  NSN++G+GSLIDNLY+LD V  + E+ H+ SR     
Subjt:  ADGNRAKVEAIGTFRLSLGTGFHLDLFETFVVPSFRRNLISVSSLDKFGYSCSFGNNKLSLLHNSNIIGTGSLIDNLYMLDNVPFDNESLHVSSR-----

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------------------ARPYKPNEKKLDSRTISCYFVGYAERSRGFKFYNPTSRTFFETGNARFLENI
                                                        ARPY+P+E KL+SRT+SCYFVGY ERSRG+KFYNPT+R+FFETGNARFLE++
Subjt:  ------------------------------------------------ARPYKPNEKKLDSRTISCYFVGYAERSRGFKFYNPTSRTFFETGNARFLENI

Query:  EFEGRNKSKDVVFEEENATIMNDVVTLPTIVSKVVLDNDTQDKVPTLDVITSQDNTDDTSPFPIDQPQHTQEVSLRRSTRERR--IRNKSLI-----SDG
        EF      ++VVFEEE     + V+   TI     +  D    +P  D++ +QDN +     PI+Q Q +QEV LRRSTRER+  I +  ++      DG
Subjt:  EFEGRNKSKDVVFEEENATIMNDVVTLPTIVSKVVLDNDTQDKVPTLDVITSQDNTDDTSPFPIDQPQHTQEVSLRRSTRERR--IRNKSLI-----SDG

Query:  FGLT-GSPIHHSTRNQKISGQ
         GLT   PI+     Q  S Q
Subjt:  FGLT-GSPIHHSTRNQKISGQ

A0A151TRZ9 Retrovirus-related Pol polyprotein from transposon TNT 1-946.1e-16544.08Show/hide
Query:  LNGANFKDWKEDIQIVLGCMDLDLALRVDRPTSTEENPNKVEIKKWDRSNRMCLMIMKRSIPETFRGSIVEGTNAKGFLKEMEQYFTKNDKAEASTLMAK
        LNG NFK WKE ++I+LGCMDLDLAL  ++PT T ENPN+ +++KW+ SNRMCLMIMKRS+ E  RG I E  NAKGFL  +EQYFT N+K +AS L+AK
Subjt:  LNGANFKDWKEDIQIVLGCMDLDLALRVDRPTSTEENPNKVEIKKWDRSNRMCLMIMKRSIPETFRGSIVEGTNAKGFLKEMEQYFTKNDKAEASTLMAK

Query:  LTSSRYVGKGNIREYIMQMSNVATKLKALKLEVSEDFLVHLVLNSLPAEYSHFRVSYNTQKDKWYLNELISHCVQEEERMQREKTESVHMASTSKSVKRK
        L S RY GKGNIREYIM+MSN+A+KLKALKLE+S+D LVHLVL SLP  +  F+V YNTQKDKW LNELISHCVQEEER  REKTES H+AS+S++ KRK
Subjt:  LTSSRYVGKGNIREYIMQMSNVATKLKALKLEVSEDFLVHLVLNSLPAEYSHFRVSYNTQKDKWYLNELISHCVQEEERMQREKTESVHMASTSKSVKRK

Query:  RVNNAAESSKQ-KKEKKQDSGPACFFCKKTGHMKKQCAKYVAWLLKKGMFLSLVCSEINLAFVPMHTWWVDSGATTHISVSMQGCIWSRPPSDAEAFIYV
           +  E + Q KK KK ++ P CFFCKK GHMKK+C KY +W +KKG FLS VCSE+NLAFVP  TWWVDSGATTHISVSMQGC+WSRPPSD E FIYV
Subjt:  RVNNAAESSKQ-KKEKKQDSGPACFFCKKTGHMKKQCAKYVAWLLKKGMFLSLVCSEINLAFVPMHTWWVDSGATTHISVSMQGCIWSRPPSDAEAFIYV

Query:  ADGNRAKVEAIGTFRLSLGTGFHLDLFETFVVPSFRRNLISVSSLDKFGYSCSFGNNKLSLLHNSNIIGTGSLIDNLYMLDNVPFDNESLHVSSR-----
         DGN+  VEAIGTFRL L TGFHLDLFETFVVPSFRRNLIS+SSLDKFG+SCSFGNNK SL  NSN++G+GSLIDNLY+LD V  + E+ H+ SR     
Subjt:  ADGNRAKVEAIGTFRLSLGTGFHLDLFETFVVPSFRRNLISVSSLDKFGYSCSFGNNKLSLLHNSNIIGTGSLIDNLYMLDNVPFDNESLHVSSR-----

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------------------ARPYKPNEKKLDSRTISCYFVGYAERSRGFKFYNPTSRTFFETGNARFLENI
                                                        ARPY+P+E KL+SRT+SCYFVGY E SRG+KFYN T+++FFETGNARFLE++
Subjt:  ------------------------------------------------ARPYKPNEKKLDSRTISCYFVGYAERSRGFKFYNPTSRTFFETGNARFLENI

Query:  EFEGRNKSKDVVFEEENATIMNDVVTLPTIVSKVVLDNDTQDKVPTLDVITSQDNTDDTSPFPIDQPQHTQEVSLRRSTRERRIRNKSLISDGF
        EF      ++++F+EE     + V+   TI     +       +P  D++  QDN +     PI+Q Q  QEVSLRRSTRER    KS ISD +
Subjt:  EFEGRNKSKDVVFEEENATIMNDVVTLPTIVSKVVLDNDTQDKVPTLDVITSQDNTDDTSPFPIDQPQHTQEVSLRRSTRERRIRNKSLISDGF

A0A438BRQ0 Retrovirus-related Pol polyprotein from transposon TNT 1-945.6e-15051Show/hide
Query:  TQVNNIPRLNGANFKDWKEDIQIVLGCMDLDLALRVDRP---TSTEENPNKVEIKKWDRSNRMCLMIMKRSIPETFRGSIVEGTNAKGFLKEMEQYFTKN
        + V NIP LNG NFK WKE + IVLGCMDLD  LR DRP   T       +  I+KW+RSN M LMIMK SIPE  +G+I + T AK FL +    F  N
Subjt:  TQVNNIPRLNGANFKDWKEDIQIVLGCMDLDLALRVDRP---TSTEENPNKVEIKKWDRSNRMCLMIMKRSIPETFRGSIVEGTNAKGFLKEMEQYFTKN

Query:  DKAEASTLMAKLTSSRYVGKGNIREYIMQMSNVATKLKALKLEVSEDFLVHLVLNSLPAEYSHFRVSYNTQKDKWYLNELISHCVQEEERMQREKTESVH
         K E ST+++KL S RY GK NIREYIM+MSN+ T+LK LKLE SED LVHLVL SLP ++S F++SYNTQK+KW LNELI+ CVQEEER+++EK ES H
Subjt:  DKAEASTLMAKLTSSRYVGKGNIREYIMQMSNVATKLKALKLEVSEDFLVHLVLNSLPAEYSHFRVSYNTQKDKWYLNELISHCVQEEERMQREKTESVH

Query:  MASTSKSV---KRKRVNN------AAESSKQKKEKKQDSGPACFFCKKTGHMKKQCAKYVAWLLKKGMFLSLVCSEINLAFVPMHTWWVDSGATTHISVS
        +ASTS+     K+++++N       +E+SKQK +KKQD    CFFCKK GHMKK C KY AW  KKG  L+ VCSEINLA VP  TWW+D GATTHISV+
Subjt:  MASTSKSV---KRKRVNN------AAESSKQKKEKKQDSGPACFFCKKTGHMKKQCAKYVAWLLKKGMFLSLVCSEINLAFVPMHTWWVDSGATTHISVS

Query:  MQGCIWSRPPSDAEAFIYVADGNRAKVEAIGTFRLSLGTGFHLDLFETFVVPSFRRNLISVSSLDKFGYSCSFGNNKLSLLHNSNIIGTGSLID------
        M+GC+ SR P+D E +IYV +GN+  V+AIG FRL L  G  LDL ETFVV SFRRNLISVS LDKFGY CSF N  +  L +  I+      D      
Subjt:  MQGCIWSRPPSDAEAFIYVADGNRAKVEAIGTFRLSLGTGFHLDLFETFVVPSFRRNLISVSSLDKFGYSCSFGNNKLSLLHNSNIIGTGSLID------

Query:  ----------------------NLYMLDNVPFDN----------ESLHVSSRARPYKPNEKKLDSRTISCYFVGYAERSRGFKFYNPTSRTFFETGNARF
                                Y +   P  N          + +   + ARPYKPNEKKLDSRT+SCYFVGY+ERSRGFKFY+P+SR+FFETGNA+F
Subjt:  ----------------------NLYMLDNVPFDN----------ESLHVSSRARPYKPNEKKLDSRTISCYFVGYAERSRGFKFYNPTSRTFFETGNARF

Query:  LENIEFEGRNKSKDVVFEEENATIMNDVVTLPTIVSKVVLDNDTQDKVPTLDVITSQDNTDDTSPFPI--------------DQPQHTQ-EVSLRRSTRE
        +E++E  GR + + VVFE+E+       V++PT     ++ +DT   +  + +IT   +T + SP  +               QPQ  Q +V LRRSTRE
Subjt:  LENIEFEGRNKSKDVVFEEENATIMNDVVTLPTIVSKVVLDNDTQDKVPTLDVITSQDNTDDTSPFPI--------------DQPQHTQ-EVSLRRSTRE

Query:  RR
        +R
Subjt:  RR

A0A438JMC3 Retrovirus-related Pol polyprotein from transposon TNT 1-941.5e-15050.15Show/hide
Query:  VNNIPRLNGANFKDWKEDIQIVLGCMDLDLALRVDRP---TSTEENPNKVEIKKWDRSNRMCLMIMKRSIPETFRGSIVEGTNAKGFLKEMEQYFTKNDK
        VNNIP LNG NFK WKE + IVLGCMDLD ALR DRP   TS      +  ++KW+RSNRM LMIMK SIPE  RG+I E T AK FL ++   FT N+K
Subjt:  VNNIPRLNGANFKDWKEDIQIVLGCMDLDLALRVDRP---TSTEENPNKVEIKKWDRSNRMCLMIMKRSIPETFRGSIVEGTNAKGFLKEMEQYFTKNDK

Query:  AEASTLMAKLTSSRYVGKGNIREYIMQMSNVATKLKALKLEVSEDFLVHLVLNSLPAEYSHFRVSYNTQKDKWYLNELISHCVQEEERMQREKTESVHMA
         E ST+++KL S RY GK NIREYIM+MSN+ T+LKALKLE+SED L+HLVL S+P ++S F++SYNTQK+KW LNELI+ CVQEEER+++EK ES H+A
Subjt:  AEASTLMAKLTSSRYVGKGNIREYIMQMSNVATKLKALKLEVSEDFLVHLVLNSLPAEYSHFRVSYNTQKDKWYLNELISHCVQEEERMQREKTESVHMA

Query:  STSK----SVKRKRVNNAAE-----SSKQKKEKKQDSGPACFFCKKTGHMKKQCAKYVAWLLKKGMFLSLVCSEINLAFVPMHTWWVDSGATTHISVSMQ
        STS+    + KRKR N   +     +SKQK++KKQD    CFFCKK GHMKK C KY AW  KKG  L+ VCSEINLA VP  TWW+D+GATTHISV+MQ
Subjt:  STSK----SVKRKRVNNAAE-----SSKQKKEKKQDSGPACFFCKKTGHMKKQCAKYVAWLLKKGMFLSLVCSEINLAFVPMHTWWVDSGATTHISVSMQ

Query:  GCIWSRPPSDAEAFIYVADGNRAKVEAIGTFRLSLGTGFHLDLFETFVVPSFRRNLISVSSLDKFGYSCSFGN---------NKLS--------------
        GC+ SR P+D E +IYV + N+  V+AIG FRL L +G  LDL ETFVVPSFRRNLISVS LDKFGY CSFGN         N+LS              
Subjt:  GCIWSRPPSDAEAFIYVADGNRAKVEAIGTFRLSLGTGFHLDLFETFVVPSFRRNLISVSSLDKFGYSCSFGN---------NKLS--------------

Query:  -----------------------LLHNSNIIGTGSL-------------IDNLYMLDNVPF----------------DNESLHV---SSRARPYKPNEKK
                               ++    + GT S              +  +Y+L+ VP                     LHV    + ARPYKPNEKK
Subjt:  -----------------------LLHNSNIIGTGSL-------------IDNLYMLDNVPF----------------DNESLHV---SSRARPYKPNEKK

Query:  LDSRTISCYFVGYAERSRGFKFYNPTSRTFFETGNARFLENIEFEGRNKSKDVVFEEENATIMNDVVTLPTIVS--KVVLDNDTQDKVPTLDVITSQDNT
        LDSR +SCYFVGY+ERSRGFKFY+P++R+FFE GNA+F+E++E  GR   + VVFEEE        V +P I +    ++ NDT   +  +  IT   +T
Subjt:  LDSRTISCYFVGYAERSRGFKFYNPTSRTFFETGNARFLENIEFEGRNKSKDVVFEEENATIMNDVVTLPTIVS--KVVLDNDTQDKVPTLDVITSQDNT

Query:  DDTSPFPI-----------DQPQHTQ-EVSLRRSTRERRIRNKSLISDGF
         +  P  +            QPQ  Q +V LRRSTRERR    S ISD +
Subjt:  DDTSPFPI-----------DQPQHTQ-EVSLRRSTRERRIRNKSLISDGF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G53670.1 unknown protein1.2e-3237.98Show/hide
Query:  TQVNNIPRLNGANFKDWKEDIQIVLGCMDLDLALRVDRPTSTEENPNKVEIKKWDRSNRMCLMIMKRSIPETFRGSIVEG-TNAKGFLKEMEQYFTKNDK
        + V++IP L+G+NF +WKE + +VL  MDLDL+L  +RP+S +      E+K WDRSNR+ +MIMK  IP+ FRG + +  T AK FL  +E +F KN++
Subjt:  TQVNNIPRLNGANFKDWKEDIQIVLGCMDLDLALRVDRPTSTEENPNKVEIKKWDRSNRMCLMIMKRSIPETFRGSIVEG-TNAKGFLKEMEQYFTKNDK

Query:  AEASTLMAKLTSSRYVGKGNIREYIMQMSNVATKLKALKLE---VSEDFLVHLVLNSLPAEYSHFRVSYNTQKDK-------------WYLNELISHCVQ
        AE S + A+ +S  Y+   N+RE IM+M  +  K K L +     ++  L H  +  LP +Y   +  Y+  + K             W   ELIS C  
Subjt:  AEASTLMAKLTSSRYVGKGNIREYIMQMSNVATKLKALKLE---VSEDFLVHLVLNSLPAEYSHFRVSYNTQKDK-------------WYLNELISHCVQ

Query:  EEERMQRE
        EEE ++ E
Subjt:  EEERMQRE

AT5G53690.1 unknown protein3.7e-0542.86Show/hide
Query:  VLNSLPAEYSHFRVSYNTQKDKWYLNELISHCVQEEERMQREKTESVHMASTSKSV---KRKRVNNAAES
        VL+SLP++Y   R +Y+  K +W  ++LISHCVQEEER+  EK E  H     K +   KRK+ +   E+
Subjt:  VLNSLPAEYSHFRVSYNTQKDKWYLNELISHCVQEEERMQREKTESVHMASTSKSV---KRKRVNNAAES


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTCACTCAAGTCAACAACATTCCTAGACTAAATGGGGCTAATTTTAAGGACTGGAAAGAAGACATCCAGATAGTACTTGGGTGTATGGATTTAGACCTTGCATTAAG
GGTAGACCGTCCTACTTCAACTGAGGAAAATCCTAATAAGGTTGAAATTAAGAAGTGGGATAGGTCTAATCGCATGTGTCTAATGATCATGAAGCGCTCAATTCCAGAAA
CATTTAGAGGCTCTATTGTTGAGGGAACGAATGCCAAAGGCTTTCTAAAGGAAATGGAGCAGTACTTTACCAAAAACGATAAGGCAGAAGCGAGTACCCTTATGGCAAAA
CTCACCTCTTCAAGATACGTTGGTAAAGGAAACATAAGGGAATACATAATGCAAATGTCAAATGTTGCAACAAAACTTAAGGCACTGAAGTTGGAAGTTTCTGAAGACTT
TTTAGTGCATTTAGTTTTGAACTCTCTTCCAGCAGAGTATAGCCACTTCAGGGTGAGTTACAACACTCAGAAGGATAAATGGTACCTGAATGAGCTAATCTCTCACTGTG
TTCAAGAGGAAGAGAGGATGCAGCGAGAGAAGACAGAAAGTGTTCACATGGCTTCTACCTCAAAGAGTGTAAAGAGAAAGAGAGTGAATAATGCTGCGGAATCTTCTAAG
CAGAAAAAGGAAAAGAAACAGGATTCGGGACCTGCTTGTTTTTTCTGTAAAAAGACTGGGCACATGAAAAAACAATGTGCCAAATATGTTGCATGGCTATTAAAGAAGGG
TATGTTTCTCTCCCTTGTTTGTTCTGAGATTAATCTAGCTTTTGTACCTATGCATACGTGGTGGGTAGACTCAGGTGCTACTACTCACATAAGTGTATCCATGCAGGGTT
GCATTTGGAGCCGACCGCCAAGTGATGCTGAGGCTTTCATCTATGTGGCTGACGGCAATAGGGCAAAAGTAGAAGCAATAGGAACATTTAGATTATCTTTAGGAACTGGT
TTTCATTTGGATTTGTTTGAGACTTTTGTTGTTCCGTCATTTAGACGGAATTTAATTTCTGTTTCTTCATTGGACAAATTTGGTTATTCTTGTTCATTTGGAAATAACAA
ACTAAGTCTTTTGCATAATTCAAATATTATTGGTACTGGTTCACTGATTGATAATTTATATATGCTTGATAATGTTCCTTTTGATAATGAAAGCTTGCATGTTTCATCAC
GTGCAAGGCCTTATAAGCCAAATGAAAAGAAACTGGACTCAAGAACCATAAGTTGCTACTTTGTTGGGTATGCTGAGCGCTCTCGGGGCTTTAAGTTCTATAATCCCACT
TCAAGAACTTTTTTCGAGACGGGAAATGCTCGATTTCTTGAGAATATTGAGTTTGAGGGGAGAAATAAAAGTAAAGATGTTGTTTTTGAAGAAGAAAATGCTACTATCAT
GAATGATGTGGTTACTTTACCTACCATTGTTAGTAAAGTAGTTTTAGATAATGACACTCAAGATAAAGTTCCAACGCTCGATGTCATTACCTCTCAAGACAACACTGATG
ACACATCACCCTTCCCTATAGATCAACCTCAACATACTCAAGAAGTGTCATTAAGAAGATCCACTAGAGAAAGGAGAATAAGGAATAAATCTTTGATAAGTGATGGATTT
GGATTGACGGGGAGCCCTATTCACCACTCGACGAGGAACCAGAAGATCTCCGGCCAAATGGGTGCGGGTGTTTCCAGCTATTCGGCTTCAGATTCAACCGGAATGCCGAT
TACGAAGGTAGAAATCTTCTACGGCGGGGAGAGGAATCTTGGATGGCTAGGAAATTGGAGAAGGTGA
mRNA sequenceShow/hide mRNA sequence
ATGTTCACTCAAGTCAACAACATTCCTAGACTAAATGGGGCTAATTTTAAGGACTGGAAAGAAGACATCCAGATAGTACTTGGGTGTATGGATTTAGACCTTGCATTAAG
GGTAGACCGTCCTACTTCAACTGAGGAAAATCCTAATAAGGTTGAAATTAAGAAGTGGGATAGGTCTAATCGCATGTGTCTAATGATCATGAAGCGCTCAATTCCAGAAA
CATTTAGAGGCTCTATTGTTGAGGGAACGAATGCCAAAGGCTTTCTAAAGGAAATGGAGCAGTACTTTACCAAAAACGATAAGGCAGAAGCGAGTACCCTTATGGCAAAA
CTCACCTCTTCAAGATACGTTGGTAAAGGAAACATAAGGGAATACATAATGCAAATGTCAAATGTTGCAACAAAACTTAAGGCACTGAAGTTGGAAGTTTCTGAAGACTT
TTTAGTGCATTTAGTTTTGAACTCTCTTCCAGCAGAGTATAGCCACTTCAGGGTGAGTTACAACACTCAGAAGGATAAATGGTACCTGAATGAGCTAATCTCTCACTGTG
TTCAAGAGGAAGAGAGGATGCAGCGAGAGAAGACAGAAAGTGTTCACATGGCTTCTACCTCAAAGAGTGTAAAGAGAAAGAGAGTGAATAATGCTGCGGAATCTTCTAAG
CAGAAAAAGGAAAAGAAACAGGATTCGGGACCTGCTTGTTTTTTCTGTAAAAAGACTGGGCACATGAAAAAACAATGTGCCAAATATGTTGCATGGCTATTAAAGAAGGG
TATGTTTCTCTCCCTTGTTTGTTCTGAGATTAATCTAGCTTTTGTACCTATGCATACGTGGTGGGTAGACTCAGGTGCTACTACTCACATAAGTGTATCCATGCAGGGTT
GCATTTGGAGCCGACCGCCAAGTGATGCTGAGGCTTTCATCTATGTGGCTGACGGCAATAGGGCAAAAGTAGAAGCAATAGGAACATTTAGATTATCTTTAGGAACTGGT
TTTCATTTGGATTTGTTTGAGACTTTTGTTGTTCCGTCATTTAGACGGAATTTAATTTCTGTTTCTTCATTGGACAAATTTGGTTATTCTTGTTCATTTGGAAATAACAA
ACTAAGTCTTTTGCATAATTCAAATATTATTGGTACTGGTTCACTGATTGATAATTTATATATGCTTGATAATGTTCCTTTTGATAATGAAAGCTTGCATGTTTCATCAC
GTGCAAGGCCTTATAAGCCAAATGAAAAGAAACTGGACTCAAGAACCATAAGTTGCTACTTTGTTGGGTATGCTGAGCGCTCTCGGGGCTTTAAGTTCTATAATCCCACT
TCAAGAACTTTTTTCGAGACGGGAAATGCTCGATTTCTTGAGAATATTGAGTTTGAGGGGAGAAATAAAAGTAAAGATGTTGTTTTTGAAGAAGAAAATGCTACTATCAT
GAATGATGTGGTTACTTTACCTACCATTGTTAGTAAAGTAGTTTTAGATAATGACACTCAAGATAAAGTTCCAACGCTCGATGTCATTACCTCTCAAGACAACACTGATG
ACACATCACCCTTCCCTATAGATCAACCTCAACATACTCAAGAAGTGTCATTAAGAAGATCCACTAGAGAAAGGAGAATAAGGAATAAATCTTTGATAAGTGATGGATTT
GGATTGACGGGGAGCCCTATTCACCACTCGACGAGGAACCAGAAGATCTCCGGCCAAATGGGTGCGGGTGTTTCCAGCTATTCGGCTTCAGATTCAACCGGAATGCCGAT
TACGAAGGTAGAAATCTTCTACGGCGGGGAGAGGAATCTTGGATGGCTAGGAAATTGGAGAAGGTGA
Protein sequenceShow/hide protein sequence
MFTQVNNIPRLNGANFKDWKEDIQIVLGCMDLDLALRVDRPTSTEENPNKVEIKKWDRSNRMCLMIMKRSIPETFRGSIVEGTNAKGFLKEMEQYFTKNDKAEASTLMAK
LTSSRYVGKGNIREYIMQMSNVATKLKALKLEVSEDFLVHLVLNSLPAEYSHFRVSYNTQKDKWYLNELISHCVQEEERMQREKTESVHMASTSKSVKRKRVNNAAESSK
QKKEKKQDSGPACFFCKKTGHMKKQCAKYVAWLLKKGMFLSLVCSEINLAFVPMHTWWVDSGATTHISVSMQGCIWSRPPSDAEAFIYVADGNRAKVEAIGTFRLSLGTG
FHLDLFETFVVPSFRRNLISVSSLDKFGYSCSFGNNKLSLLHNSNIIGTGSLIDNLYMLDNVPFDNESLHVSSRARPYKPNEKKLDSRTISCYFVGYAERSRGFKFYNPT
SRTFFETGNARFLENIEFEGRNKSKDVVFEEENATIMNDVVTLPTIVSKVVLDNDTQDKVPTLDVITSQDNTDDTSPFPIDQPQHTQEVSLRRSTRERRIRNKSLISDGF
GLTGSPIHHSTRNQKISGQMGAGVSSYSASDSTGMPITKVEIFYGGERNLGWLGNWRR